<?xml version="1.0" encoding="UTF-8"?><rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" > <channel><title>Comments on: Duplicate content question</title> <atom:link href="http://www.mattcutts.com/blog/duplicate-content-question/feed/" rel="self" type="application/rss+xml" /><link>http://www.mattcutts.com/blog/duplicate-content-question/</link> <description>neat fun stuff</description> <lastBuildDate>Wed, 08 Feb 2012 21:30:01 +0000</lastBuildDate> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.1</generator> <item><title>By: David</title><link>http://www.mattcutts.com/blog/duplicate-content-question/#comment-896276</link> <dc:creator>David</dc:creator> <pubDate>Sat, 03 Sep 2011 22:39:41 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/duplicate-content-question/#comment-896276</guid> <description>Does anyone know how many page can have a duplicate content. For instance if there is a server issue where pages that do not exist shwing up as duplicate content - do the site get penalised? I would really appreciate if someone could answer. Thank you.  David</description> <content:encoded><![CDATA[<p>Does anyone know how many page can have a duplicate content. For instance if there is a server issue where pages that do not exist shwing up as duplicate content &#8211; do the site get penalised? I would really appreciate if someone could answer. Thank you.  David</p> ]]></content:encoded> </item> <item><title>By: Sahil Kotak</title><link>http://www.mattcutts.com/blog/duplicate-content-question/#comment-844086</link> <dc:creator>Sahil Kotak</dc:creator> <pubDate>Sun, 17 Jul 2011 05:52:01 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/duplicate-content-question/#comment-844086</guid> <description>It&#039;s really problematic especially the UTM tags which are making my articles index twice and thrice in Google. I have currently blocked UTM tags with the help of .htaccess but I would like to know that does it make any problems?</description> <content:encoded><![CDATA[<p>It&#8217;s really problematic especially the UTM tags which are making my articles index twice and thrice in Google. I have currently blocked UTM tags with the help of .htaccess but I would like to know that does it make any problems?</p> ]]></content:encoded> </item> <item><title>By: ugyen</title><link>http://www.mattcutts.com/blog/duplicate-content-question/#comment-787499</link> <dc:creator>ugyen</dc:creator> <pubDate>Fri, 06 May 2011 07:15:35 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/duplicate-content-question/#comment-787499</guid> <description>Hello Matt and everyone,My question is what if a publisher has 100+ sites and they plan to produce a standard &quot;about us&quot;, &quot;terms of use&quot; and &quot;privacy policy page&quot; for all the 100+ sites. Is this a case of duplicate content or will the google algorithm be kind enough to turn a blind eye to it? Please answer, it&#039;s very important when trying to create 100+ sites</description> <content:encoded><![CDATA[<p>Hello Matt and everyone,</p><p>My question is what if a publisher has 100+ sites and they plan to produce a standard &#8220;about us&#8221;, &#8220;terms of use&#8221; and &#8220;privacy policy page&#8221; for all the 100+ sites. Is this a case of duplicate content or will the google algorithm be kind enough to turn a blind eye to it? Please answer, it&#8217;s very important when trying to create 100+ sites</p> ]]></content:encoded> </item> <item><title>By: Hanoi Hotel</title><link>http://www.mattcutts.com/blog/duplicate-content-question/#comment-756588</link> <dc:creator>Hanoi Hotel</dc:creator> <pubDate>Thu, 31 Mar 2011 13:54:57 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/duplicate-content-question/#comment-756588</guid> <description>Hi Matt!In these times were social sites, twitter, facebook and stuff are taking over the net, it’s hard to tell what’s duplicate content. If someone bookmarks my post on digg, mixx, delicious and other how can i prevent that my ‘duplicate content’ is being distributed over the internet and harm my true content and web page?</description> <content:encoded><![CDATA[<p>Hi Matt!</p><p>In these times were social sites, twitter, facebook and stuff are taking over the net, it’s hard to tell what’s duplicate content. If someone bookmarks my post on digg, mixx, delicious and other how can i prevent that my ‘duplicate content’ is being distributed over the internet and harm my true content and web page?</p> ]]></content:encoded> </item> <item><title>By: Andrew</title><link>http://www.mattcutts.com/blog/duplicate-content-question/#comment-740437</link> <dc:creator>Andrew</dc:creator> <pubDate>Mon, 14 Mar 2011 22:07:56 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/duplicate-content-question/#comment-740437</guid> <description>Can someone who is sure of a correct answer please respond to these questions? If I can get answers for these it will help me to save a lot of time:If I am a part of one of those membership clubs where every member of the club (from 50 up to 150 members) gets the same sites and pages, then these sites are made up of syndicated content and not duplicate content, right? And Google treats them differently in the search results? Would Google filter out these pages and possibly site on which the pages reside?If I change the keywords on the pages so that my pages are called up for different keywords (than the pages of the sites of the other members who leave the pages the way they got them) then that should work in giving me the uniqueness I need to get different traffic from different keywords? Most of the other members do not change anything on their pages. Would changing all the keywords work even though most of the content is the same as other members?What if I only want to rank for the home page and I completely rewrite the home page and the first sentence of each paragraph in the other 19 pages of the site? Will I be able to rank for the home page since Google ranks pages and not sites? Or, will my whole site be sand boxed since there is so much syndicated PLR content on the other pages of my sites that are not rewritten (only the first sentence of each paragraph is)?The new google farmer penalty does not affect my membership site home page and I can theoretically get #1 in Google because the home page is completely rewritten and the domain key word is unique compared to the other 50 to 150 membership sites? This ranking can happen even though most of the content on the other pages is not rewritten?</description> <content:encoded><![CDATA[<p>Can someone who is sure of a correct answer please respond to these questions? If I can get answers for these it will help me to save a lot of time:</p><p>If I am a part of one of those membership clubs where every member of the club (from 50 up to 150 members) gets the same sites and pages, then these sites are made up of syndicated content and not duplicate content, right? And Google treats them differently in the search results? Would Google filter out these pages and possibly site on which the pages reside?</p><p>If I change the keywords on the pages so that my pages are called up for different keywords (than the pages of the sites of the other members who leave the pages the way they got them) then that should work in giving me the uniqueness I need to get different traffic from different keywords? Most of the other members do not change anything on their pages. Would changing all the keywords work even though most of the content is the same as other members?</p><p>What if I only want to rank for the home page and I completely rewrite the home page and the first sentence of each paragraph in the other 19 pages of the site? Will I be able to rank for the home page since Google ranks pages and not sites? Or, will my whole site be sand boxed since there is so much syndicated PLR content on the other pages of my sites that are not rewritten (only the first sentence of each paragraph is)?</p><p>The new google farmer penalty does not affect my membership site home page and I can theoretically get #1 in Google because the home page is completely rewritten and the domain key word is unique compared to the other 50 to 150 membership sites? This ranking can happen even though most of the content on the other pages is not rewritten?</p> ]]></content:encoded> </item> <item><title>By: Nick D</title><link>http://www.mattcutts.com/blog/duplicate-content-question/#comment-666994</link> <dc:creator>Nick D</dc:creator> <pubDate>Sun, 21 Nov 2010 01:13:52 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/duplicate-content-question/#comment-666994</guid> <description>@ Andrew KeirHi Andrew, you could try submitting a DMCA infringement notice. http://www.google.co.uk/dmca.htmlBut I would first try contacting the hosting company that host the offending site and requesting that they forcibly remove the content. (use WHOIS to find the host) They will either remove it themselves of ask their customer to remove it.There is a good template for a removal request letter on the link below: http://labnol.blogspot.com/2007/09/dmca-notice-of-copyright-infringement.htmlHope this helps.</description> <content:encoded><![CDATA[<p>@ Andrew Keir</p><p>Hi Andrew, you could try submitting a DMCA infringement notice.<br /> <a href="http://www.google.co.uk/dmca.html" rel="nofollow">http://www.google.co.uk/dmca.html</a></p><p>But I would first try contacting the hosting company that host the offending site and requesting that they forcibly remove the content. (use WHOIS to find the host)<br /> They will either remove it themselves of ask their customer to remove it.</p><p>There is a good template for a removal request letter on the link below:<br /> <a href="http://labnol.blogspot.com/2007/09/dmca-notice-of-copyright-infringement.html" rel="nofollow">http://labnol.blogspot.com/2007/09/dmca-notice-of-copyright-infringement.html</a></p><p>Hope this helps.</p> ]]></content:encoded> </item> <item><title>By: Andrew Keir</title><link>http://www.mattcutts.com/blog/duplicate-content-question/#comment-615940</link> <dc:creator>Andrew Keir</dc:creator> <pubDate>Wed, 25 Aug 2010 03:58:21 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/duplicate-content-question/#comment-615940</guid> <description>Hi Matt,Not sure you&#039;ll notice this comment on a post so old, but here we go ;)Regarding the syndication of content, there is a syndication website that is publishing my content in full, which I don&#039;t want obviously. There are no contact details or any way to request a removal. I&#039;ve sent emails to generic email address&#039; like admin, administrator, postmaster etc. to try and have it removed with no luck.The content is clearly mine, links back to my site etc. Is there anything I can do with Google to have these pages de-indexed or something along those lines?Thanks for any advice in advance.</description> <content:encoded><![CDATA[<p>Hi Matt,</p><p>Not sure you&#8217;ll notice this comment on a post so old, but here we go <img src='http://www.mattcutts.com/blog/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /></p><p>Regarding the syndication of content, there is a syndication website that is publishing my content in full, which I don&#8217;t want obviously. There are no contact details or any way to request a removal. I&#8217;ve sent emails to generic email address&#8217; like admin, administrator, postmaster etc. to try and have it removed with no luck.</p><p>The content is clearly mine, links back to my site etc. Is there anything I can do with Google to have these pages de-indexed or something along those lines?</p><p>Thanks for any advice in advance.</p> ]]></content:encoded> </item> <item><title>By: Nick D</title><link>http://www.mattcutts.com/blog/duplicate-content-question/#comment-612768</link> <dc:creator>Nick D</dc:creator> <pubDate>Wed, 18 Aug 2010 11:45:29 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/duplicate-content-question/#comment-612768</guid> <description>&lt;strike&gt;it infact benefit Google&lt;/strike&gt; it would infact benefit Google</description> <content:encoded><![CDATA[<p><strike>it infact benefit Google</strike><br /> it would infact benefit Google</p> ]]></content:encoded> </item> <item><title>By: Nick D</title><link>http://www.mattcutts.com/blog/duplicate-content-question/#comment-612766</link> <dc:creator>Nick D</dc:creator> <pubDate>Wed, 18 Aug 2010 11:40:46 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/duplicate-content-question/#comment-612766</guid> <description>Hi Matt,I have a question about the way Google deals with duplicate content that comes in from a totally different angle to the above. My question regards the Copyscape service. As you are no doubt aware, copyscape&#039;s free search detects duplicate content in Googles index, which in turn allows webmasters to remove or otherwise correct the duplicate content issues. This also presumably has a very positive effect on Googles index itself, because as the duplicate pages are removed the index is in effect cleaned and results therefore improved. However, when a user needs to check a larger site with copyscape, Google API restrictions come into play which means that the user has to check against Yahoo&#039;s index (sorry for the foul language hehe :) I cant help but think that Google is missing out on a trick here, and that in this instance the restrictions which are in place to preserve Google&#039;s server resources are doing more harm than good, and resulting in Yahoo&#039;s index being improved instead of your own. I would like to point out that I have nothing whatsoever to do with copyscape, but I am sure that if Google were to adopt a special API stance for plagiarism detection services such as this, it infact benefit Google and it&#039;s users as much as it would the webmasters using copyscape. I would be interested to hear your take on this.Kind Regards, Nick D</description> <content:encoded><![CDATA[<p>Hi Matt,</p><p>I have a question about the way Google deals with duplicate content that comes in from a totally different angle to the above. My question regards the Copyscape service. As you are no doubt aware, copyscape&#8217;s free search detects duplicate content in Googles index, which in turn allows webmasters to remove or otherwise correct the duplicate content issues. This also presumably has a very positive effect on Googles index itself, because as the duplicate pages are removed the index is in effect cleaned and results therefore improved. However, when a user needs to check a larger site with copyscape, Google API restrictions come into play which means that the user has to check against Yahoo&#8217;s index (sorry for the foul language hehe <img src='http://www.mattcutts.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> I cant help but think that Google is missing out on a trick here, and that in this instance the restrictions which are in place to preserve Google&#8217;s server resources are doing more harm than good, and resulting in Yahoo&#8217;s index being improved instead of your own. I would like to point out that I have nothing whatsoever to do with copyscape, but I am sure that if Google were to adopt a special API stance for plagiarism detection services such as this, it infact benefit Google and it&#8217;s users as much as it would the webmasters using copyscape. I would be interested to hear your take on this.</p><p>Kind Regards,<br /> Nick D</p> ]]></content:encoded> </item> <item><title>By: Rob</title><link>http://www.mattcutts.com/blog/duplicate-content-question/#comment-585437</link> <dc:creator>Rob</dc:creator> <pubDate>Wed, 30 Jun 2010 15:56:06 +0000</pubDate> <guid isPermaLink="false">http://www.mattcutts.com/blog/duplicate-content-question/#comment-585437</guid> <description>Yesiree, Jason.  Doesn&#039;t that just suck?  All someone has to do to f*@! up your site in Google is to swipe a lot of your content.  Even the pages that aren&#039;t lifted from your site can suffer.  Not your fault, not much you can do about it, and the only way to recover from it is to spend a ton of time either pursuing legal action to get the content removed from the offending sites or rewriting your content.  Neither of these options is particularly fair.</description> <content:encoded><![CDATA[<p>Yesiree, Jason.  Doesn&#8217;t that just suck?  All someone has to do to f*@! up your site in Google is to swipe a lot of your content.  Even the pages that aren&#8217;t lifted from your site can suffer.  Not your fault, not much you can do about it, and the only way to recover from it is to spend a ton of time either pursuing legal action to get the content removed from the offending sites or rewriting your content.  Neither of these options is particularly fair.</p> ]]></content:encoded> </item> </channel> </rss>
<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk
Page Caching using disk (enhanced)
Database Caching 4/11 queries in 0.005 seconds using disk

Served from: www.mattcutts.com @ 2012-02-09 02:23:12 -->
