Twiceler – Robots.txt – Block Cuil – Hope I Don’t Jinx Myself Here!

Those of you that have been following me know that I have had my share of hosting issues along the way. I thought it was my cron jobs or scripts causing the issues. HostGator told me it was my scripts. Well they are correct to an extent.

I was having issues again about a week ago. Usually this site and my others start to run slow and sluggish or don’t respond at all. When that happens I can expect a call from HostGator saying they disabled something to preserve the health of the server and other users on it. I do not have a dedicated server. I use HostGators reseller package that gives me WHM access. Sure enough I got the call. I emailed them and this time I got a little more information. They said my search script was being abused by Cuil.

Turns out that this is not uncommon. After a little searching I found that they randomly throw stuff into the search script to try and find new pages to index. Well the way my script works, it will always give them a results page no matter what they search for. Sometimes it will say no results found but it is still a page. Cuil is known for this behavior and has brought many servers down by doing this.

I could care less about Cuil. When was the last time you saw a referral to your site from Cuil? I don’t think I ever did. So HostGator said they would block Cuil using Robots.txt for me. They offered to put it in all my sites… 100+! Sure enough they did. This was above and beyond in my book. It saved me a ton of time.

This is what needs to be placed in your robots.txt file to block Cuil:

User-agent: twiceler
Disallow: /

Turns out that the Cuil robot can bring your site to its knees… but for the most part they honor the robots.txt… go figure…

Also you can visit this site to request to be excluded from indexing:
http://www.cuil.com/info/webmaster_info/

I wanted to share this because my sites have been running much faster. It makes it easier for me to setup new sites and manage my existing ones. Every site I put out now has this code in the robots.txt file. I suggest you do the same because once Cuil finds you… look out…

About Joe

An active Internet Marketer that likes hanging out in the Warrior Forum on his spare time. My favorite ways to earn money at the moment are software products, eBay Partner Network and Google Adsense. Got a question for me? Just ask!