# robots.txt for http://www.st-minutiae.com/
# Compiled by Dan Carlson
# Updated on Saturday, December 13, 2008

User-agent: CherryPicker
User-agent: CopyRightCheck
User-agent: Crescent
User-agent: EmailCollector
User-agent: EmailSiphon
User-agent: EmailWolf
User-agent: ExtractorPro
User-agent: FairAd Client
User-agent: Flaming AttackBot
User-agent: Generic
User-agent: GigaBot
User-agent: grub-client
User-agent: Harvest/1.5
User-agent: Indy Library
User-agent: k2spider
User-agent: Microsoft URL Control
User-agent: Microsoft.URL.Control
User-agent: Microsoft-URL-Control
User-agent: NetAnts
User-agent: NPbot
User-agent: pompos
User-agent: SiteSnagger
User-agent: TurnitinBot
User-agent: URL_Spider_Pro
User-agent: WebBandit
User-agent: WebCopier
User-agent: WebSauger
User-agent: WebStripper
User-agent: Wget
User-agent: ZealBot
User-agent: Fasterfox
Disallow: /

User-agent: *
Disallow: /error/
Disallow: /includes/
Disallow: /movable_type/
Disallow: /site/search/
Disallow: /temp/

# Most of these bots are ones that have been found in my access logs. Others are preemptively blocked.
# Some of these (and many others) ignore robots.txt, and are forcibly blocked by my .htaccess script.