# file: robots.txt,v 1.0 2013/02/22
# author: zhimin.liuzm
# ye.1688.com
# Following the robots.txt standard, this file lists the pages and directories that crawlers are not allowed to fetch.
# For robots.txt syntax, see <URL:http://www.robotstxt.org/wc/exclusion.html#robotstxt>
# Format is:
#   User-agent: <name of spider>
#   Disallow: <nothing> | <path>
# -----------------------------------------------------------------------------
User-agent: *
Allow: /*.htm
Disallow: /bin/
Disallow: /monitor/
Disallow: /industrybelt/
Disallow: /admin/
Disallow: /sitemanager/
Disallow: /advantage/*.htm
Disallow: /search/*.htm
Disallow: /chanpin.htm?*
Disallow: /qiye.htm?*
Disallow: /gongsi/*.htm?*
#Sitemap: http://ye.1688.com/aliwiki_sitemaps/sitemap_index.xml