<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: What should NOINDEX do?</title>
	<atom:link href="http://www.mattcutts.com/blog/google-noindex-behavior/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mattcutts.com/blog/google-noindex-behavior/</link>
	<description>neat fun stuff</description>
	<lastBuildDate>Fri, 06 Nov 2009 18:35:07 -0800</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.5</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Umang</title>
		<link>http://www.mattcutts.com/blog/google-noindex-behavior/#comment-352398</link>
		<dc:creator>Umang</dc:creator>
		<pubDate>Tue, 23 Jun 2009 16:11:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/google-noindex-behavior/#comment-352398</guid>
		<description>My understanding is that the no follow tag is is used to google indexing and crawling your pages. Someone please help elaborate on my understanding.

Thanks</description>
		<content:encoded><![CDATA[<p>My understanding is that the no follow tag is is used to google indexing and crawling your pages. Someone please help elaborate on my understanding.</p>
<p>Thanks</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Scanto</title>
		<link>http://www.mattcutts.com/blog/google-noindex-behavior/#comment-239567</link>
		<dc:creator>Scanto</dc:creator>
		<pubDate>Sun, 08 Feb 2009 00:53:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/google-noindex-behavior/#comment-239567</guid>
		<description>Ok guys, so I just mistakenly moved a site live that had that noindex tag from the test site. It was there for about a week. So how long before my rankings return?  Is there anything I can do to encourage the spiders to come back and crawl in the meantime?</description>
		<content:encoded><![CDATA[<p>Ok guys, so I just mistakenly moved a site live that had that noindex tag from the test site. It was there for about a week. So how long before my rankings return?  Is there anything I can do to encourage the spiders to come back and crawl in the meantime?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robert</title>
		<link>http://www.mattcutts.com/blog/google-noindex-behavior/#comment-134285</link>
		<dc:creator>Robert</dc:creator>
		<pubDate>Fri, 03 Oct 2008 11:34:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/google-noindex-behavior/#comment-134285</guid>
		<description>@ Alan Perkins

&quot;If somebody links to your site, and Google indexes that link, then your gripe is with the person who linked to your site … not with Google.&quot;

This is the precise reason that the robots.txt file is flawed and the Meta attribute of &quot;NoFollow&quot; is required.</description>
		<content:encoded><![CDATA[<p>@ Alan Perkins</p>
<p>&#8220;If somebody links to your site, and Google indexes that link, then your gripe is with the person who linked to your site … not with Google.&#8221;</p>
<p>This is the precise reason that the robots.txt file is flawed and the Meta attribute of &#8220;NoFollow&#8221; is required.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robert</title>
		<link>http://www.mattcutts.com/blog/google-noindex-behavior/#comment-134283</link>
		<dc:creator>Robert</dc:creator>
		<pubDate>Fri, 03 Oct 2008 10:54:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/google-noindex-behavior/#comment-134283</guid>
		<description>Well I see it as a pretty simple open and shut case myself.

If I&#039;ve specified NoIndex for any reason then don&#039;t index.  Yes, I might have shot myself in the foot but isn&#039;t that every business (read website) owners choice?  We can&#039;t tell you how to run your search engine, but we can ask you not to tell us how to run our websites.

Let me put it this way Matt, if I come into your home and take photos of you sleeping, can I post those anywhere?  No?  But you might be an exhibitionist wanting me to do so, right?  I think you know where I&#039;m going with this.

If the door is closed, if the sign says do not enter, then it should be respected by everyone, everybot, everything.  No matter how good or bad the reason.</description>
		<content:encoded><![CDATA[<p>Well I see it as a pretty simple open and shut case myself.</p>
<p>If I&#8217;ve specified NoIndex for any reason then don&#8217;t index.  Yes, I might have shot myself in the foot but isn&#8217;t that every business (read website) owners choice?  We can&#8217;t tell you how to run your search engine, but we can ask you not to tell us how to run our websites.</p>
<p>Let me put it this way Matt, if I come into your home and take photos of you sleeping, can I post those anywhere?  No?  But you might be an exhibitionist wanting me to do so, right?  I think you know where I&#8217;m going with this.</p>
<p>If the door is closed, if the sign says do not enter, then it should be respected by everyone, everybot, everything.  No matter how good or bad the reason.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: SEO</title>
		<link>http://www.mattcutts.com/blog/google-noindex-behavior/#comment-130005</link>
		<dc:creator>SEO</dc:creator>
		<pubDate>Thu, 10 Jul 2008 13:04:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/google-noindex-behavior/#comment-130005</guid>
		<description>I’m surprised this debate is going on but hats off to you for giving us a say. 
noindex should do what it says. Don’t index. I feel MSN and Yahoo should follow suit. Google already provides the best user experience and I don’t think this issue affects that. It would be great to see a across the board approach to this on all search engines. 
Most people reading your blog will have an interest in what happening with Google and other search engines but there are many web developers who don’t keep up with what’s happening. How could you inform all those people of the change? What about wee Johnny who has a hidden page dedicated to his childhood sweetheart? What if Google changed the rules on the noindex tag?  Imagine the embarrassment at school when he’s back from his summer break! So for wee Johnny please keep noindex. 
I feel it’s common sense – sure some people will make mistakes with it but if a listing on Google is important to them they’ll work it out.</description>
		<content:encoded><![CDATA[<p>I’m surprised this debate is going on but hats off to you for giving us a say.<br />
noindex should do what it says. Don’t index. I feel MSN and Yahoo should follow suit. Google already provides the best user experience and I don’t think this issue affects that. It would be great to see a across the board approach to this on all search engines.<br />
Most people reading your blog will have an interest in what happening with Google and other search engines but there are many web developers who don’t keep up with what’s happening. How could you inform all those people of the change? What about wee Johnny who has a hidden page dedicated to his childhood sweetheart? What if Google changed the rules on the noindex tag?  Imagine the embarrassment at school when he’s back from his summer break! So for wee Johnny please keep noindex.<br />
I feel it’s common sense – sure some people will make mistakes with it but if a listing on Google is important to them they’ll work it out.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ray Paseur</title>
		<link>http://www.mattcutts.com/blog/google-noindex-behavior/#comment-129069</link>
		<dc:creator>Ray Paseur</dc:creator>
		<pubDate>Sat, 21 Jun 2008 13:02:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/google-noindex-behavior/#comment-129069</guid>
		<description>The Atomz web site search engine uses [noindex] and [/noindex] and the corresponding [nofollow] tags embedded in HTML to control its spider, so that only parts of a web page will be spidered.  It also respects the robots.txt and meta tags. It would be great if Google could also support an inline [noindex] tag of some sort.  My $0.02. ~Ray</description>
		<content:encoded><![CDATA[<p>The Atomz web site search engine uses [noindex] and [/noindex] and the corresponding [nofollow] tags embedded in HTML to control its spider, so that only parts of a web page will be spidered.  It also respects the robots.txt and meta tags. It would be great if Google could also support an inline [noindex] tag of some sort.  My $0.02. ~Ray</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Xel</title>
		<link>http://www.mattcutts.com/blog/google-noindex-behavior/#comment-126777</link>
		<dc:creator>Xel</dc:creator>
		<pubDate>Sun, 04 May 2008 15:16:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/google-noindex-behavior/#comment-126777</guid>
		<description>SAM:
As allready mentioned Google will not stop indexing your files when you use robot.txt. As this poll tells us (Google never *really* cared for webmasters whishes, they will not start to do so now!) this will be the same with the meta. You can simply leave your pages as they are. But it might be an genious idea to hide categories which are unfilled (at least for *not logged in* users) - since your visitors will dislike empty pages anyway.</description>
		<content:encoded><![CDATA[<p>SAM:<br />
As allready mentioned Google will not stop indexing your files when you use robot.txt. As this poll tells us (Google never *really* cared for webmasters whishes, they will not start to do so now!) this will be the same with the meta. You can simply leave your pages as they are. But it might be an genious idea to hide categories which are unfilled (at least for *not logged in* users) &#8211; since your visitors will dislike empty pages anyway.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: SamK</title>
		<link>http://www.mattcutts.com/blog/google-noindex-behavior/#comment-126618</link>
		<dc:creator>SamK</dc:creator>
		<pubDate>Tue, 29 Apr 2008 20:30:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/google-noindex-behavior/#comment-126618</guid>
		<description>With all of the discussions about NOINDEX, I&#039;ve been unable to find an answer to a question I have.  I&#039;m currently programming a community-based web site that has several thousand pages (basically categories), but at the beginning, will not have any meaningful content for a good chunk of those pages.

If I put a NOINDEX robots tag on the pages without content, will the search engines come back and try to index the page again at a later date?  If they will not come back, what is the best way stop bots from spidering the pages until there is some meaningful content?

Thanks,
Sam</description>
		<content:encoded><![CDATA[<p>With all of the discussions about NOINDEX, I&#8217;ve been unable to find an answer to a question I have.  I&#8217;m currently programming a community-based web site that has several thousand pages (basically categories), but at the beginning, will not have any meaningful content for a good chunk of those pages.</p>
<p>If I put a NOINDEX robots tag on the pages without content, will the search engines come back and try to index the page again at a later date?  If they will not come back, what is the best way stop bots from spidering the pages until there is some meaningful content?</p>
<p>Thanks,<br />
Sam</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: herbi</title>
		<link>http://www.mattcutts.com/blog/google-noindex-behavior/#comment-125483</link>
		<dc:creator>herbi</dc:creator>
		<pubDate>Sun, 06 Apr 2008 10:44:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/google-noindex-behavior/#comment-125483</guid>
		<description>&gt;&gt;&gt; I’d also be interested in (constructive) 
&gt;&gt;&gt; suggestions in the comments about how 
&gt;&gt;&gt; Google should treat the NOINDEX meta tag. 

matt, this is a funny question ...
read my lips: NOINDEX ^^

i don&#039;t really see why this should be discussed in the first place. we had a similar discussion at the german google-webmaster-help-group about robots.txt, the google-webmaster trying to tell us something about how hard google&#039;s task was in &quot;interpreting&quot; what publishers meant by excluding files from crawling and i didn&#039;t understand the point there either ... so i wonder what you&#039;re at? take over total control letting publishers no choices at all anymore?</description>
		<content:encoded><![CDATA[<p>&gt;&gt;&gt; I’d also be interested in (constructive)<br />
&gt;&gt;&gt; suggestions in the comments about how<br />
&gt;&gt;&gt; Google should treat the NOINDEX meta tag. </p>
<p>matt, this is a funny question &#8230;<br />
read my lips: NOINDEX ^^</p>
<p>i don&#8217;t really see why this should be discussed in the first place. we had a similar discussion at the german google-webmaster-help-group about robots.txt, the google-webmaster trying to tell us something about how hard google&#8217;s task was in &#8220;interpreting&#8221; what publishers meant by excluding files from crawling and i didn&#8217;t understand the point there either &#8230; so i wonder what you&#8217;re at? take over total control letting publishers no choices at all anymore?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: R. Richard Hobbs</title>
		<link>http://www.mattcutts.com/blog/google-noindex-behavior/#comment-124848</link>
		<dc:creator>R. Richard Hobbs</dc:creator>
		<pubDate>Tue, 25 Mar 2008 18:35:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/google-noindex-behavior/#comment-124848</guid>
		<description>I havent finished reading thru all the comments but for myself I will say what I am seeking is a way to keep all the flotsam and jetsam (archives of several species and other secondary and likely duplicated content directories and files neccesary for my site to run itself) out of sight, but I want (and expect Google would want same) to take advantage of any opportunities to discover new links  as might be available on the site as well as spider and index any unique content. 

So here&#039;s my 2 cents / suggestion:

 - you dont want your site / content seen /examined at all = disallow
- you dont want the content appearing in the results = noindex BUT either by default or by including i.e follow (?) the spider will follow any links
- noindex / nofollow could be tantamount to disallow

...anyway this the way my mind works about this - 

So my next question is how to put this work? the syntaxes are necessarily strict and somewhat arcane to most hobby coders (like moi) and certainly any newbies - and mistakes can be costly... what needs to go in the robots.txt do what I want to do - without a computer science degree ?

noindex - will this *just* keep the specified content from appearing in the index?

I tried adding &quot;noindex,follow&quot; and &quot;noindex, follow&quot; to robots.txt they both threw syntax errors

Hmm this is interesting - noindex: follow: DIDNT throw an error but whats the Googlebot going to do with that?

All I really want to completely disallow is administrative / code  / backend / internal stuff that nobody with good intentions is likely to be interested in - read: potentially security  sensitive areas....

Hope any of this makes sense is there anybody else out there with similar objectives?

thx</description>
		<content:encoded><![CDATA[<p>I havent finished reading thru all the comments but for myself I will say what I am seeking is a way to keep all the flotsam and jetsam (archives of several species and other secondary and likely duplicated content directories and files neccesary for my site to run itself) out of sight, but I want (and expect Google would want same) to take advantage of any opportunities to discover new links  as might be available on the site as well as spider and index any unique content. </p>
<p>So here&#8217;s my 2 cents / suggestion:</p>
<p> &#8211; you dont want your site / content seen /examined at all = disallow<br />
- you dont want the content appearing in the results = noindex BUT either by default or by including i.e follow (?) the spider will follow any links<br />
- noindex / nofollow could be tantamount to disallow</p>
<p>&#8230;anyway this the way my mind works about this &#8211; </p>
<p>So my next question is how to put this work? the syntaxes are necessarily strict and somewhat arcane to most hobby coders (like moi) and certainly any newbies &#8211; and mistakes can be costly&#8230; what needs to go in the robots.txt do what I want to do &#8211; without a computer science degree ?</p>
<p>noindex &#8211; will this *just* keep the specified content from appearing in the index?</p>
<p>I tried adding &#8220;noindex,follow&#8221; and &#8220;noindex, follow&#8221; to robots.txt they both threw syntax errors</p>
<p>Hmm this is interesting &#8211; noindex: follow: DIDNT throw an error but whats the Googlebot going to do with that?</p>
<p>All I really want to completely disallow is administrative / code  / backend / internal stuff that nobody with good intentions is likely to be interested in &#8211; read: potentially security  sensitive areas&#8230;.</p>
<p>Hope any of this makes sense is there anybody else out there with similar objectives?</p>
<p>thx</p>
]]></content:encoded>
	</item>
</channel>
</rss>
