<?xml version="1.0" encoding="UTF-8"?><rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" > <channel><title>Comments on: Learn more about robots.txt</title> <atom:link href="http://www.mattcutts.com/blog/robots-txt-remove-url/feed/" rel="self" type="application/rss+xml" /><link>http://www.mattcutts.com/blog/robots-txt-remove-url/</link> <description>neat fun stuff</description> <lastBuildDate>Wed, 08 Feb 2012 21:30:01 +0000</lastBuildDate> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.1</generator> <item><title>By: Derek Jansen</title><link>http://www.mattcutts.com/blog/robots-txt-remove-url/#comment-729128</link> <dc:creator>Derek Jansen</dc:creator> <pubDate>Wed, 02 Mar 2011 09:56:12 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/?p=3204#comment-729128</guid> <description>Thanks Matt - Was a little confused between robots.txt and the .htaccess but this clarifies the two.</description> <content:encoded><![CDATA[<p>Thanks Matt &#8211; Was a little confused between robots.txt and the .htaccess but this clarifies the two.</p> ]]></content:encoded> </item> <item><title>By: Andy H</title><link>http://www.mattcutts.com/blog/robots-txt-remove-url/#comment-492851</link> <dc:creator>Andy H</dc:creator> <pubDate>Wed, 10 Mar 2010 17:19:03 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/?p=3204#comment-492851</guid> <description>The only way to figure out Google is to look backwards. People should forget all they think they&#039;ve learnt and read &lt;a href=&quot;http://infolab.stanford.edu/~backrub/google.html&quot; rel=&quot;nofollow&quot;&gt;The Anatomy of a Search Engine&lt;/a&gt; before they do anything else.In there, you will find the answer to the universe and everything. Matts video explanation above was first explained back in &#039;97. It seems to me that very little has really changed, which is a good thing.</description> <content:encoded><![CDATA[<p>The only way to figure out Google is to look backwards. People should forget all they think they&#8217;ve learnt and read <a href="http://infolab.stanford.edu/~backrub/google.html" rel="nofollow">The Anatomy of a Search Engine</a> before they do anything else.</p><p>In there, you will find the answer to the universe and everything. Matts video explanation above was first explained back in &#8217;97. It seems to me that very little has really changed, which is a good thing.</p> ]]></content:encoded> </item> <item><title>By: Yudh</title><link>http://www.mattcutts.com/blog/robots-txt-remove-url/#comment-483586</link> <dc:creator>Yudh</dc:creator> <pubDate>Tue, 02 Mar 2010 00:27:07 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/?p=3204#comment-483586</guid> <description>Thank you very much for this video, its helpful for newbie like me</description> <content:encoded><![CDATA[<p>Thank you very much for this video, its helpful for newbie like me</p> ]]></content:encoded> </item> <item><title>By: Bobby</title><link>http://www.mattcutts.com/blog/robots-txt-remove-url/#comment-479322</link> <dc:creator>Bobby</dc:creator> <pubDate>Thu, 25 Feb 2010 07:09:44 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/?p=3204#comment-479322</guid> <description>Thanks for posting this. i&#039;ve been wondering about those robots violating this for a while.</description> <content:encoded><![CDATA[<p>Thanks for posting this. i&#8217;ve been wondering about those robots violating this for a while.</p> ]]></content:encoded> </item> <item><title>By: Evans</title><link>http://www.mattcutts.com/blog/robots-txt-remove-url/#comment-467078</link> <dc:creator>Evans</dc:creator> <pubDate>Thu, 11 Feb 2010 21:43:31 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/?p=3204#comment-467078</guid> <description>Well I always write &#039;index, follow&#039; in the robots.txt I&#039;ve never really cared about changing it into anything else.</description> <content:encoded><![CDATA[<p>Well I always write &#8216;index, follow&#8217; in the robots.txt I&#8217;ve never really cared about changing it into anything else.</p> ]]></content:encoded> </item> <item><title>By: Amy</title><link>http://www.mattcutts.com/blog/robots-txt-remove-url/#comment-452750</link> <dc:creator>Amy</dc:creator> <pubDate>Wed, 13 Jan 2010 22:06:58 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/?p=3204#comment-452750</guid> <description>I usually ignore the robots.txt file. Thanks for the clip, I will keep it higher on my priority list now.</description> <content:encoded><![CDATA[<p>I usually ignore the robots.txt file. Thanks for the clip, I will keep it higher on my priority list now.</p> ]]></content:encoded> </item> <item><title>By: Tom</title><link>http://www.mattcutts.com/blog/robots-txt-remove-url/#comment-452210</link> <dc:creator>Tom</dc:creator> <pubDate>Tue, 12 Jan 2010 20:18:24 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/?p=3204#comment-452210</guid> <description>Thanks Matt! Great information as always. I rely heavily on my robots.txt. It is time I double check mine...I did the old &quot;set and forget.&quot; Thanks!</description> <content:encoded><![CDATA[<p>Thanks Matt! Great information as always. I rely heavily on my robots.txt. It is time I double check mine&#8230;I did the old &#8220;set and forget.&#8221; Thanks!</p> ]]></content:encoded> </item> <item><title>By: zoe</title><link>http://www.mattcutts.com/blog/robots-txt-remove-url/#comment-445990</link> <dc:creator>zoe</dc:creator> <pubDate>Wed, 30 Dec 2009 05:09:05 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/?p=3204#comment-445990</guid> <description>Hi, Matt. I&#039;m so disappointed that I cannot watch the video in my country. However, thank you for sharing.</description> <content:encoded><![CDATA[<p>Hi, Matt. I&#8217;m so disappointed that I cannot watch the video in my country. However, thank you for sharing.</p> ]]></content:encoded> </item> <item><title>By: Roni</title><link>http://www.mattcutts.com/blog/robots-txt-remove-url/#comment-425627</link> <dc:creator>Roni</dc:creator> <pubDate>Tue, 24 Nov 2009 14:49:47 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/?p=3204#comment-425627</guid> <description>Great simple &amp; to the point explanation Matt. I always thought blocking in robots.txt = no crawl = noindex. Now I know noindex and no crawl are two completely different things. Besides, you can always remove specific urls you do not want showing on Google in Webmaster Tools...</description> <content:encoded><![CDATA[<p>Great simple &amp; to the point explanation Matt. I always thought blocking in robots.txt = no crawl = noindex. Now I know noindex and no crawl are two completely different things. Besides, you can always remove specific urls you do not want showing on Google in Webmaster Tools&#8230;</p> ]]></content:encoded> </item> <item><title>By: Jeremy Chatfield</title><link>http://www.mattcutts.com/blog/robots-txt-remove-url/#comment-411868</link> <dc:creator>Jeremy Chatfield</dc:creator> <pubDate>Sat, 31 Oct 2009 22:50:13 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/?p=3204#comment-411868</guid> <description>Thanks for the video Matt; consistent with everything you, robotstxt.org and the Google webmaster blog have said. Useful re-confirmation and explanation.@Anthony Von Ducci - The listing that I can see for you is rationally consistent with what Matt said. The page has not been crawled, in accordance with your robots.txt, so it consists of a link and &quot;similar&quot;. If you put &quot;Disallow: /&quot;, that&#039;s exactly what should happen. The robots can&#039;t see the NOINDEX you put in the home page, because they can&#039;t crawl it.  If you *had* an ODP (Open Directory Project aka DMOZ) listing, then that would be shown by Google, even if you used the NOODP robots directive on the home page, because the directive would be on a page that the robots aren&#039;t allowed to see. Shows that the robots.txt file is working exactly as it should!@Aery - read robotstxt.org and use the &quot;User-Agent&quot; lines to identify the bots by name.@Dudibob - you need &quot;Canonical link refs&quot;, I think, to solve your duplicate pages problem; just search for it, it works for all the major search engines, eventually - you need to have the dups crawled, of course, to see the on-page meta tag!</description> <content:encoded><![CDATA[<p>Thanks for the video Matt; consistent with everything you, robotstxt.org and the Google webmaster blog have said. Useful re-confirmation and explanation.</p><p>@Anthony Von Ducci &#8211; The listing that I can see for you is rationally consistent with what Matt said. The page has not been crawled, in accordance with your robots.txt, so it consists of a link and &#8220;similar&#8221;. If you put &#8220;Disallow: /&#8221;, that&#8217;s exactly what should happen. The robots can&#8217;t see the NOINDEX you put in the home page, because they can&#8217;t crawl it.  If you *had* an ODP (Open Directory Project aka DMOZ) listing, then that would be shown by Google, even if you used the NOODP robots directive on the home page, because the directive would be on a page that the robots aren&#8217;t allowed to see. Shows that the robots.txt file is working exactly as it should!</p><p>@Aery &#8211; read robotstxt.org and use the &#8220;User-Agent&#8221; lines to identify the bots by name.</p><p>@Dudibob &#8211; you need &#8220;Canonical link refs&#8221;, I think, to solve your duplicate pages problem; just search for it, it works for all the major search engines, eventually &#8211; you need to have the dups crawled, of course, to see the on-page meta tag!</p> ]]></content:encoded> </item> </channel> </rss>
<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk
Page Caching using disk (enhanced)
Database Caching 3/11 queries in 0.005 seconds using disk

Served from: www.mattcutts.com @ 2012-02-08 23:01:49 -->
