<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	
	>
<channel>
	<title>Comments on: Bot Obedience: Herding Googlebot</title>
	<atom:link href="https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/</link>
	<description></description>
	<lastBuildDate>Sat, 06 Dec 2014 05:30:21 +0000</lastBuildDate>
		<sy:updatePeriod>hourly</sy:updatePeriod>
		<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=4.0.1</generator>
	<item>
		<title>By: brandi belle</title>
		<link>https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/#comment-28246</link>
		<dc:creator><![CDATA[brandi belle]]></dc:creator>
		<pubDate>Fri, 16 Apr 2010 02:14:41 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=329#comment-28246</guid>
		<description><![CDATA[This comment is directed to the commenter above, &quot;dave&quot;. I agree, I would also be interested in reading a blog post from you on the “site:” command issue. I haven&#039;t read too much into this but am always looking for more. Dave, from what I heard, the quality of the backlinks is good.]]></description>
		<content:encoded><![CDATA[<p>This comment is directed to the commenter above, &#8220;dave&#8221;. I agree, I would also be interested in reading a blog post from you on the “site:” command issue. I haven&#8217;t read too much into this but am always looking for more. Dave, from what I heard, the quality of the backlinks is good.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kristi Wachter</title>
		<link>https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/#comment-28245</link>
		<dc:creator><![CDATA[Kristi Wachter]]></dc:creator>
		<pubDate>Wed, 26 Nov 2008 19:20:42 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=329#comment-28245</guid>
		<description><![CDATA[Did something change with this during the past week or so? I have a test site that has this in the robots file:

User-agent: *
Disallow: /*?

and also has a noindex robots meta tag on every page.

Up until last week, pages were steadily disappearing from Google; last week, a search of 

commonword site:test.example.com

returned only 5 results (down from more than 1000).

Suddenly, today, it&#039;s returning 16,000 pages!

I&#039;ve double-checked that the robots file and the noindex meta tag on every page are still in place.

Any idea why this would have changed suddenly? Lots of folks have test sites, and of course we don&#039;t want Google to index all that duplicate content. I can put an htaccess password on the site, but it&#039;d be nice not to have to.

Thanks for the great info!]]></description>
		<content:encoded><![CDATA[<p>Did something change with this during the past week or so? I have a test site that has this in the robots file:</p>
<p>User-agent: *<br />
Disallow: /*?</p>
<p>and also has a noindex robots meta tag on every page.</p>
<p>Up until last week, pages were steadily disappearing from Google; last week, a search of </p>
<p>commonword site:test.example.com</p>
<p>returned only 5 results (down from more than 1000).</p>
<p>Suddenly, today, it&#8217;s returning 16,000 pages!</p>
<p>I&#8217;ve double-checked that the robots file and the noindex meta tag on every page are still in place.</p>
<p>Any idea why this would have changed suddenly? Lots of folks have test sites, and of course we don&#8217;t want Google to index all that duplicate content. I can put an htaccess password on the site, but it&#8217;d be nice not to have to.</p>
<p>Thanks for the great info!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark</title>
		<link>https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/#comment-28244</link>
		<dc:creator><![CDATA[Mark]]></dc:creator>
		<pubDate>Mon, 24 Sep 2007 07:55:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=329#comment-28244</guid>
		<description><![CDATA[Matt, your robots.txt file is pretty open - does that result in the bots picking up duplicate content from the various locations where your posts are stored on your site? Great articles btw.]]></description>
		<content:encoded><![CDATA[<p>Matt, your robots.txt file is pretty open &#8211; does that result in the bots picking up duplicate content from the various locations where your posts are stored on your site? Great articles btw.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Danielle Fox</title>
		<link>https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/#comment-28243</link>
		<dc:creator><![CDATA[Danielle Fox]]></dc:creator>
		<pubDate>Fri, 27 Jul 2007 16:53:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=329#comment-28243</guid>
		<description><![CDATA[IncrediBILL, maybe it’s better to use the User-Agent header. I’m not sure you should stick to IPs…]]></description>
		<content:encoded><![CDATA[<p>IncrediBILL, maybe it’s better to use the User-Agent header. I’m not sure you should stick to IPs…</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ZZPrices</title>
		<link>https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/#comment-28242</link>
		<dc:creator><![CDATA[ZZPrices]]></dc:creator>
		<pubDate>Tue, 19 Dec 2006 18:27:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=329#comment-28242</guid>
		<description><![CDATA[I have a similar case where I&#039;ve tried to control Google by denying access to /cgi-bin/ and it still indexes all of those pages.]]></description>
		<content:encoded><![CDATA[<p>I have a similar case where I&#8217;ve tried to control Google by denying access to /cgi-bin/ and it still indexes all of those pages.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: TheWheeler</title>
		<link>https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/#comment-28241</link>
		<dc:creator><![CDATA[TheWheeler]]></dc:creator>
		<pubDate>Fri, 03 Nov 2006 11:58:22 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=329#comment-28241</guid>
		<description><![CDATA[Looks like Google is spidering all of the pages, no matter what&#039;s written in robots.txt. At least that happened in my case.]]></description>
		<content:encoded><![CDATA[<p>Looks like Google is spidering all of the pages, no matter what&#8217;s written in robots.txt. At least that happened in my case.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Data Recovery</title>
		<link>https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/#comment-28240</link>
		<dc:creator><![CDATA[Data Recovery]]></dc:creator>
		<pubDate>Sat, 23 Sep 2006 19:27:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=329#comment-28240</guid>
		<description><![CDATA[How safe is Google URL removal?

I mistakenly uploaded two pages, index.htm and index.php.

Google cached index.php, and Yahoo and MSN have cached index.htm.

Which URL should I remove?]]></description>
		<content:encoded><![CDATA[<p>How safe is Google URL removal?</p>
<p>I mistakenly uploaded two pages, index.htm and index.php.</p>
<p>Google cached index.php, and Yahoo and MSN have cached index.htm.</p>
<p>Which URL should I remove?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joe</title>
		<link>https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/#comment-28239</link>
		<dc:creator><![CDATA[Joe]]></dc:creator>
		<pubDate>Fri, 11 Aug 2006 12:42:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=329#comment-28239</guid>
		<description><![CDATA[Hi Matt,

This is a late reply, but a valid &quot;feature request&quot;.

My site has a high incidence of traffic from site rippers, spambots, image farming, and competitors stealing images and content. I find sites using my content and URLs in Google searches on a regular basis and report them as SPAM. I use traps, but a spider authentication process would be an easily implemented method of alleviating the problem of bad bots and rippers. A credit card processing company uses a simple method. I receive a payment notification via the company’s server posting a query to a designated URL on my site. I post (reflect if you will) that query, appended with a validation request back to their server, and they display a string of either “verified” or “invalid”. A method similar to this would simplify the process of setting trusted “users” and granting access to spider my site.

Best of luck and keep those videos coming!!!

Joe]]></description>
		<content:encoded><![CDATA[<p>Hi Matt,</p>
<p>This is a late reply, but a valid &#8220;feature request&#8221;.</p>
<p>My site has a high incidence of traffic from site rippers, spambots, image farming, and competitors stealing images and content. I find sites using my content and URLs in Google searches on a regular basis and report them as SPAM. I use traps, but a spider authentication process would be an easily implemented method of alleviating the problem of bad bots and rippers. A credit card processing company uses a simple method. I receive a payment notification via the company’s server posting a query to a designated URL on my site. I post (reflect if you will) that query, appended with a validation request back to their server, and they display a string of either “verified” or “invalid”. A method similar to this would simplify the process of setting trusted “users” and granting access to spider my site.</p>
<p>Best of luck and keep those videos coming!!!</p>
<p>Joe</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: In-House SEO</title>
		<link>https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/#comment-28238</link>
		<dc:creator><![CDATA[In-House SEO]]></dc:creator>
		<pubDate>Mon, 07 Aug 2006 22:30:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=329#comment-28238</guid>
		<description><![CDATA[Matt said: &quot;At a page level, use meta tags at the top of your html page. The noindex meta tag will keep a page from showing up in Google’s index at all. This tag is great on any page that’s confidential.&quot;

This doesn&#039;t seem to be working as designed. I&#039;m seeing pages tagged &quot;noindex&quot; all over Google&#039;s search results -- and ranking well. Not isolated instances either, but dozens of pages that clearly are not supposed to be indexed showing up at the top of the SERPs.]]></description>
		<content:encoded><![CDATA[<p>Matt said: &#8220;At a page level, use meta tags at the top of your html page. The noindex meta tag will keep a page from showing up in Google’s index at all. This tag is great on any page that’s confidential.&#8221;</p>
<p>This doesn&#8217;t seem to be working as designed. I&#8217;m seeing pages tagged &#8220;noindex&#8221; all over Google&#8217;s search results &#8212; and ranking well. Not isolated instances either, but dozens of pages that clearly are not supposed to be indexed showing up at the top of the SERPs.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: DIO</title>
		<link>https://www.mattcutts.com/blog/bot-obedience-herding-googlebot/#comment-28237</link>
		<dc:creator><![CDATA[DIO]]></dc:creator>
		<pubDate>Sun, 06 Aug 2006 16:44:37 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=329#comment-28237</guid>
		<description><![CDATA[SEO is more of an art than a science since most people are not aware of what the exact search engine algorithm is like. Nonetheless, the tips listed on this page are useful.]]></description>
		<content:encoded><![CDATA[<p>SEO is more of an art than a science since most people are not aware of what the exact search engine algorithm is like. Nonetheless, the tips listed on this page are useful.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
