# $Id: robots.txt,v magento-specific 2010/28/01 18:24:19 goba Exp $
#
# robots.txt
#
# This file is to prevent the crawling and indexing of certain parts
# of your site by web crawlers and spiders run by sites like Yahoo!
# and Google. By telling these "robots" where not to go on your site,
# you save bandwidth and server resources.
#
# This file will be ignored unless it is at the root of your host:
# Used:    http://example.com/robots.txt
# Ignored: http://example.com/site/robots.txt
#
# For more information about the robots.txt standard, see:
# http://www.robotstxt.org/wc/robots.html
#
# For syntax checking, see:
# http://www.sxw.org.uk/computing/robots/check.html

# Website Sitemap
Sitemap: http://www.itead.cc/sitemap.xml

# Please Don't Crawl Me
#User-agent: Baiduspider
#Disallow: /
#User-agent: baiduspider
#Disallow: /

User-Agent: Sosospider
Disallow: /

User-Agent: Sogou web spider
Disallow: /

User-Agent: sogou spider
Disallow: /

User-Agent: YoudaoBot
Disallow: /

User-agent: EasouSpider
Disallow: /

User-agent: AhrefsBot
Disallow: /

# Crawlers Setup
User-agent: *
Crawl-delay: 10

# Allowable Index
Allow: /index.php/blog/
Allow: /catalog/seo_sitemap/category/

# Directories
Disallow: /404/
Disallow: /app/
Disallow: /cgi-bin/
Disallow: /downloader/
Disallow: /includes/
Disallow: /lib/
# Disallow: /media/ // I would personally allow this folder for google product caching
Disallow: /pkginfo/
Disallow: /report/
Disallow: /skin/
Disallow: /stats/
Disallow: /var/

# Paths (clean URLs)
Disallow: /index.php/
Disallow: /catalog/product_compare/
Disallow: /catalog/category/view/
Disallow: /catalog/product/view/
Disallow: /catalogsearch/
Disallow: /checkout/
Disallow: /control/
Disallow: /contacts/
Disallow: /customer/
Disallow: /customize/
Disallow: /newsletter/
Disallow: /poll/
Disallow: /review/
Disallow: /sendfriend/
Disallow: /tag/
Disallow: /wishlist/

# Files
Disallow: /cron.php
Disallow: /cron.sh
Disallow: /error_log
Disallow: /install.php
Disallow: /LICENSE.html
Disallow: /LICENSE.txt
Disallow: /LICENSE_AFL.txt
Disallow: /RELEASE_NOTES.txt

# Paths (no clean URLs)
Disallow: /*.php$
Disallow: /*?p=*&
Disallow: /*?SID=

#2015-10-08(Search Console URL Parameters)
Disallow: /*?q=
Disallow: /*?dir=
Disallow: /*?order=
Disallow: /*?limit=
Disallow: /*?mode=
Disallow: /*?cat=
Disallow: /*?compatible_mainboard=
Disallow: /*?operation_level=
Disallow: /*?shield_functions=
Disallow: /*?power_supply=
Disallow: /*?ram_capacity=
Disallow: /*?flash_capacity=
Disallow: /*?controller_type=
Disallow: /*?io_operation_level=
Disallow: /*?module_chip=
Disallow: /*?project_case_color=
Disallow: /*?p=
Disallow: /*?price=
Disallow: /*?type=
Disallow: /*?display_resolution=
Disallow: /*?period=
Disallow: /*?external_resources=
Disallow: /*?contact_length=
Disallow: /*?pin_pitch=
Disallow: /*?material=
Disallow: /*?contact_type=
Disallow: /*?pcb_size=
Disallow: /*?board_color=
Disallow: /*?battery_qty=
Disallow: /*?board_thickness=
Disallow: /*?brick_interface=
Disallow: /*?display_interface=
Disallow: /*?character_resolution=
Disallow: /*?display_interface_config=
Disallow: /*?pcb_quantity=

User-agent: AdsBot-Google
Disallow: