<?xml version="1.0" encoding="UTF-8"?><rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" > <channel><title>Comments on: Robots.txt analysis tool</title> <atom:link href="http://www.mattcutts.com/blog/robotstxt-analysis-tool/feed/" rel="self" type="application/rss+xml" /><link>http://www.mattcutts.com/blog/robotstxt-analysis-tool/</link> <description>neat fun stuff</description> <lastBuildDate>Wed, 08 Feb 2012 21:30:01 +0000</lastBuildDate> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.1</generator> <item><title>By: Ali Nasir</title><link>http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-571375</link> <dc:creator>Ali Nasir</dc:creator> <pubDate>Tue, 08 Jun 2010 12:57:25 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-571375</guid> <description>There are also robots.txt tools that allow you to experiment a little, letting you know if there are any problems with your file prior to putting it online.</description> <content:encoded><![CDATA[<p>There are also robots.txt tools that allow you to experiment a little, letting you know if there are any problems with your file prior to putting it online.</p> ]]></content:encoded> </item> <item><title>By: Pharos</title><link>http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-396570</link> <dc:creator>Pharos</dc:creator> <pubDate>Fri, 25 Sep 2009 14:02:02 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-396570</guid> <description>The webmaster tools page is doing something strange. Pages blocked by my robots.txt file are slowly being listed on the &quot;Restricted by robots.txt&quot; page.  The number was slowly going up until it got to 23. 
It should have continued to go up as Google continued to index my site, since there are a lot more blocked pages. However, now the number is going down.  It is at 22. So pages that are blocked are disappearing from the list. Is this normal?  I don&#039;t see how it can be. I have checked the robots.txt file and URLs in Google and everything seems to be working properly. I also wish Google wouldn&#039;t take so long to index my entire site.  It is not that big.  Very frustrating.</description> <content:encoded><![CDATA[<p>The webmaster tools page is doing something strange.</p><p>Pages blocked by my robots.txt file are slowly being listed on the &#8220;Restricted by robots.txt&#8221; page.  The number was slowly going up until it got to 23.<br /> It should have continued to go up as Google continued to index my site, since there are a lot more blocked pages.</p><p>However, now the number is going down.  It is at 22.<br /> So pages that are blocked are disappearing from the list.</p><p>Is this normal?  I don&#8217;t see how it can be.</p><p>I have checked the robots.txt file and URLs in Google and everything seems to be working properly.</p><p>I also wish Google wouldn&#8217;t take so long to index my entire site.  It is not that big.  
Very frustrating.</p> ]]></content:encoded> </item> <item><title>By: Projectconsultant@linuxossolutions.com</title><link>http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-394697</link> <dc:creator>Projectconsultant@linuxossolutions.com</dc:creator> <pubDate>Mon, 21 Sep 2009 23:31:54 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-394697</guid> <description>Hi Matt. Hopefully you can assist me. We are dumbstruck by what has happened with the site we are currently developing. We launched our indexing to Google in July and got a very high first-page ranking. A few weeks in, we suddenly lost our indexing and placement, although in some search engines we are still on the first page. The other strange thing is that the page we were shown on now shows other companies with LINUXOS or Linux Os Solutions in their titles etc... What we have done is look in our webmaster tools, and we can see our rankings were high through Google, but now we are struggling to get our description etc. indexed through Google. I have been looking at our robots.txt etc. but would appreciate it if someone could explain what happened. Many thanks for your time</description> <content:encoded><![CDATA[<p>Hi Matt</p><p>Hopefully you can assist me. We are dumbstruck by what has happened with the site we are currently developing. We launched our indexing to Google in July and got a very high first-page ranking. A few weeks in, we suddenly lost our indexing and placement, although in some search engines we are still on the first page. The other strange thing is that the page we were shown on now shows other companies with LINUXOS or Linux Os Solutions in their titles etc&#8230;</p><p>What we have done is look in our webmaster tools, and we can see our rankings were high through Google, but now we are struggling to get our description etc. indexed through Google. I have been looking 
at our robots.txt etc. but would appreciate it if someone could explain what happened.</p><p>Many thanks for your time</p> ]]></content:encoded> </item> <item><title>By: Felix</title><link>http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-329446</link> <dc:creator>Felix</dc:creator> <pubDate>Sun, 26 Apr 2009 18:48:12 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-329446</guid> <description>Hi Matt, my webmaster tool was indicating the following error with respect to my sitemap.xml.  &quot;URL timeout: robots.txt timeout We encountered an error while trying to access your Sitemap. Please ensure your Sitemap follows our guidelines and can be accessed at the location you provided and then resubmit.&quot; My sitemap was generated by following the guidelines and it was definitely in the right location. Can you please help me to resolve this problem? Many thanks</description> <content:encoded><![CDATA[<p>Hi Matt,</p><p>My webmaster tool was indicating the following error with respect to my sitemap.xml.  &#8220;URL timeout: robots.txt timeout<br /> We encountered an error while trying to access your Sitemap. Please ensure your Sitemap follows our guidelines and can be accessed at the location you provided and then resubmit.&#8221;</p><p>My sitemap was generated by following the guidelines and it was definitely in the right location.</p><p>Can you please help me to resolve this problem?<br /> Many thanks</p> ]]></content:encoded> </item> <item><title>By: Richard</title><link>http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-129532</link> <dc:creator>Richard</dc:creator> <pubDate>Mon, 30 Jun 2008 23:42:15 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-129532</guid> <description>I had a very frustrating experience with my robots.txt file that I wanted to share so nobody makes the same mistake.  
Practicing good SEO, I wanted to create a robots.txt file for improved results and to prevent the bots from searching unnecessary pages. After I created my robots.txt, using the one on WordPress as a template, I noticed none of my new pages were being indexed anymore.  They used to be indexed immediately and then NOTHING. I had used these recommended lines: Disallow: /*?* Disallow: /*? I did this particularly because a survey form I use creates multiple pages with the ? that don&#039;t need to be indexed. However, for some reason (I don&#039;t know why), the googlebot sees many of my pages with this type of permalink:  http://www.thisishowyoudoit.com/blog/?p=57 However, my permalink structure looks like this:  http://www.thisishowyoudoit.com/blog/10-reasons-why-not-to-host-your-wordpress-blog-on-a-windowsiis-platform/ Thus, after implementing my robots.txt, many of my pages were no longer indexed. Just wanted to warn people about these particular lines in the robots.txt file. Thanks, Richard</description> <content:encoded><![CDATA[<p>I had a very frustrating experience with my robots.txt file that I wanted to share so nobody makes the same mistake.  Practicing good SEO, I wanted to create a robots.txt file for improved results and to prevent the bots from searching unnecessary pages.</p><p>After I created my robots.txt, using the one on WordPress as a template, I noticed none of my new pages were being indexed anymore.  They used to be indexed immediately and then NOTHING.</p><p>I had used these recommended lines:</p><p>Disallow: /*?*<br /> Disallow: /*?</p><p>I did this particularly because a survey form I use creates multiple pages with the ? 
that don&#8217;t need to be indexed.</p><p>However, for some reason (I don&#8217;t know why), the googlebot sees many of my pages with this type of permalink: <a href="http://www.thisishowyoudoit.com/blog/?p=57" rel="nofollow">http://www.thisishowyoudoit.com/blog/?p=57</a></p><p>However, my permalink structure looks like this: <a href="http://www.thisishowyoudoit.com/blog/10-reasons-why-not-to-host-your-wordpress-blog-on-a-windowsiis-platform/" rel="nofollow">http://www.thisishowyoudoit.com/blog/10-reasons-why-not-to-host-your-wordpress-blog-on-a-windowsiis-platform/</a></p><p>Thus, after implementing my robots.txt, many of my pages were no longer indexed.</p><p>Just wanted to warn people about these particular lines in the robots.txt file.</p><p>Thanks,<br /> Richard</p> ]]></content:encoded> </item> <item><title>By: Michael Heraghty</title><link>http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-123222</link> <dc:creator>Michael Heraghty</dc:creator> <pubDate>Thu, 28 Feb 2008 10:15:59 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-123222</guid> <description>I am having a similar problem to AndySaid. Worse, Google&#039;s cached version of robots.txt disallows access entirely to one of my sites (I recently implemented a new WordPress design redux but forgot to change WP&#039;s privacy settings to allow search engines to access the site). In the two weeks since the redesign, I had been seeing URLs only with no snippets in the SERPs. Last night, I finally figured out what the problem was. Not taking any chances, I updated the WordPress settings AND manually uploaded a robots.txt file (the WordPress one is somehow invisible!). This morning, still no change. In webmaster tools, Google is still showing the old robots.txt. How long do I have to wait?</description> <content:encoded><![CDATA[<p>I am having a similar problem to AndySaid. 
Worse, Google&#8217;s cached version of robots.txt disallows access entirely to one of my sites (I recently implemented a new WordPress design redux but forgot to change WP&#8217;s privacy settings to allow search engines to access the site).</p><p>In the two weeks since the redesign, I had been seeing URLs only with no snippets in the SERPs.</p><p>Last night, I finally figured out what the problem was. Not taking any chances, I updated the WordPress settings AND manually uploaded a robots.txt file (the WordPress one is somehow invisible!).</p><p>This morning, still no change. In webmaster tools, Google is still showing the old robots.txt. How long do I have to wait?</p> ]]></content:encoded> </item> <item><title>By: Andy</title><link>http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-123130</link> <dc:creator>Andy</dc:creator> <pubDate>Tue, 26 Feb 2008 14:52:57 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-123130</guid> <description>Is anyone else having problems with the Google webmaster tools robots.txt analyzer? I&#039;ve noticed over a few weeks that it&#039;s not working properly. It displays the correct robots.txt and you can check URLs against that. However, if you make any edits in the tool, running a check appears to use the cached version of the file rather than the locally edited version. Kind of defeats the purpose of the tool really. I hope Google fix this soon as when it works, it&#039;s a godsend!</description> <content:encoded><![CDATA[<p>Is anyone else having problems with the Google webmaster tools robots.txt analyzer?</p><p>I&#8217;ve noticed over a few weeks that it&#8217;s not working properly.<br /> It displays the correct robots.txt and you can check URLs against that.<br /> However, if you make any edits in the tool, running a check appears to use the cached version of the file rather than the locally edited version. 
Kind of defeats the purpose of the tool really.</p><p>I hope Google fix this soon as when it works, it&#8217;s a godsend!</p> ]]></content:encoded> </item> <item><title>By: bkkdreamer</title><link>http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-110856</link> <dc:creator>bkkdreamer</dc:creator> <pubDate>Sat, 11 Aug 2007 15:06:59 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-110856</guid> <description>I am on Blogger, and my robots.txt file is incorrect. It is excluding search pages - a problem many other bloggers have noticed since the robots.txt file was introduced. I know what changes to make to the text, but not how to upload them to the root directory. I tried inserting the text in the template, but I doubt that will work. I am not sure Blogger allows us to upload HTML to Blogger templates. Anyone have any advice?</description> <content:encoded><![CDATA[<p>I am on Blogger, and my robots.txt file is incorrect. It is excluding search pages &#8211; a problem many other bloggers have noticed since the robots.txt file was introduced.</p><p>I know what changes to make to the text, but not how to upload them to the root directory. I tried inserting the text in the template, but I doubt that will work.</p><p>I am not sure Blogger allows us to upload HTML to Blogger templates. Anyone have any advice?</p> ]]></content:encoded> </item> <item><title>By: reliable web hosting</title><link>http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-110580</link> <dc:creator>reliable web hosting</dc:creator> <pubDate>Tue, 07 Aug 2007 15:26:44 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-110580</guid> <description>Hmm, interesting. 
I never thought that Google would refuse to crawl a site because of an invalid robots.txt file; common sense says it would be better to just ignore the file.</description> <content:encoded><![CDATA[<p>Hmm, interesting. I never thought that Google would refuse to crawl a site because of an invalid robots.txt file; common sense says it would be better to just ignore the file.</p> ]]></content:encoded> </item> <item><title>By: The Dog Clothing Company</title><link>http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-108427</link> <dc:creator>The Dog Clothing Company</dc:creator> <pubDate>Wed, 11 Jul 2007 16:10:08 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/robotstxt-analysis-tool/#comment-108427</guid> <description>I am new to the webmaster console and so far I think it is a great tool for managing the site, particularly this robots.txt analysis tool. Thanks Matt for this useful post!</description> <content:encoded><![CDATA[<p>I am new to the webmaster console and so far I think it is a great tool for managing the site, particularly this robots.txt analysis tool. Thanks Matt for this useful post!</p> ]]></content:encoded> </item> </channel> </rss>
