# robots.txt for Magento / v1.3 2015-04-23 / Peeter Marvet # based on http://inchoo.net/ecommerce/ultimate-magento-robots-txt-file-examples/ and http://www.byte.nl/blog/magento-robots-txt/ # comment and clone at https://gist.github.com/petskratt/016c9dbf159a81b9d6aa # Keep in mind that by standard robots.txt should NOT contain empty lines, except between UA blocks! # # Sitemap (uncomment, change and add language/shop specific sitemaps, if running on multiple domains # keep in mind sitemap can only point to own domain so something like sitemapindex.php is needed) # Sitemap: http://example.com/sitemap.xml Sitemap: https://www.houseofireland.com/sitemap/sitemap.xml # # Google Image Crawler Setup - having crawler-specific sections makes it ignore generic e.g * User-agent: Googlebot-Image Disallow: # # Crawlers Setup User-agent: * # # Allow paging (unless has more params, disallowed below) Allow: /*?p= # # Directories Disallow: /404/ Disallow: /app/ Disallow: /cgi-bin/ Disallow: /downloader/ Disallow: /errors/ Disallow: /includes/ Disallow: /magento/ #Disallow: /media/ Disallow: /media/captcha/ #Disallow: /media/catalog/ Disallow: /media/customer/ Disallow: /media/dhl/ Disallow: /media/downloadable/ Disallow: /media/import/ Disallow: /media/pdf/ Disallow: /media/sales/ Disallow: /media/tmp/ #Disallow: /media/wysiwyg/ Disallow: /media/xmlconnect/ Disallow: /pkginfo/ Disallow: /report/ Disallow: /scripts/ Disallow: /shell/ #Disallow: /skin/ Disallow: /stats/ Disallow: /var/ # # Paths (if using shop id in URL must prefix with * or copy for each) Disallow: /index.php/ Disallow: /catalog/product_compare/ Disallow: /catalog/category/view/ Disallow: /catalog/product/view/ Disallow: /catalog/product/gallery/ Disallow: /catalogsearch/ Disallow: /control/ Disallow: /contacts/ Disallow: /customer/ Disallow: /customize/ Disallow: /newsletter/ Disallow: /poll/ Disallow: /review/ Disallow: /sendfriend/ Disallow: /tag/ Disallow: /wishlist/ Disallow: /subscription-success # # Files Disallow: /cron.php Disallow: /cron.sh Disallow: /error_log Disallow: /install.php Disallow: /LICENSE.html Disallow: /LICENSE.txt Disallow: /LICENSE_AFL.txt Disallow: /STATUS.txt # # Paths (no clean URLs) Disallow: /*?p=*& Disallow: /*.php$ Disallow: /*?SID=