<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Don&#8217;t end your urls with .exe</title>
	<atom:link href="http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/</link>
	<description>neat fun stuff</description>
	<pubDate>Thu, 20 Nov 2008 16:12:59 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.3</generator>
		<item>
		<title>By: Kris</title>
		<link>http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/#comment-128970</link>
		<dc:creator>Kris</dc:creator>
		<pubDate>Thu, 19 Jun 2008 17:11:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=962#comment-128970</guid>
		<description>Hello Matt, 

I have still a question about urls. I know now that .html or .php are the same, but what about pages without an extension?

example: 
www.domain.com/product1.html
www.domain.com/product1

Which do you suggest we use for our websites?

Thanks!</description>
		<content:encoded><![CDATA[<p>Hello Matt, </p>
<p>I have still a question about urls. I know now that .html or .php are the same, but what about pages without an extension?</p>
<p>example:<br />
<a href="http://www.domain.com/product1.html" rel="nofollow">http://www.domain.com/product1.html</a><br />
<a href="http://www.domain.com/product1" rel="nofollow">http://www.domain.com/product1</a></p>
<p>Which do you suggest we use for our websites?</p>
<p>Thanks!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Multi-Worded Adam</title>
		<link>http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/#comment-128870</link>
		<dc:creator>Multi-Worded Adam</dc:creator>
		<pubDate>Wed, 18 Jun 2008 15:09:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=962#comment-128870</guid>
		<description>&lt;blockquote&gt;
M.W.A., that’s because the url is “cgi.ebay.ca/ws/eBayISAPI.dll?ViewItem&#38;item=270224738405″. If the url were “cgi.ebay.ca/ws/eBayISAPI. dll” then we wouldn’t have crawled it, but since there was something else in the url that followed the .dll, Google was willing to crawl the url. Note that I said Google won’t crawl the url if ends directly in .exe or .dll or whatever. So http://www.example.com/page.exe we wouldn’t crawl. But http://www.example.com/page.exe?whatever we probably would be willing to crawl.
&lt;/blockquote&gt;

This explanation makes some sense.  I have always used the "definition" of a URL as provided by ASP's ServerVariables collection, which doesn't include the querystring (which is a separate animal).

But this leads to another concern, not so much for myself personally but for the average user who may not be so tech-savvy (i.e. the one Sint asked).  By this explanation, theoretically http://www.some-domain.com/some-virus.exe wouldn't be indexed, but http://www.some-domain.com/some-virus.exe?this=a_random_querystring would.  I know a lot of people who don't bother to look at the extension of a link before they click on it (in fact, I would say the vast majority wouldn't) and may well stumble upon something they're not supposed to.

It's also theoretically possible, through cloaking, to present "valid" HTML content to a bot and something completely different to what is perceived to be a human.  

I think Sint has a good question...obviously the querystring as a standalone cannot be considered.  So what else is?</description>
		<content:encoded><![CDATA[<blockquote><p>
M.W.A., that’s because the url is “cgi.ebay.ca/ws/eBayISAPI.dll?ViewItem&amp;item=270224738405″. If the url were “cgi.ebay.ca/ws/eBayISAPI. dll” then we wouldn’t have crawled it, but since there was something else in the url that followed the .dll, Google was willing to crawl the url. Note that I said Google won’t crawl the url if ends directly in .exe or .dll or whatever. So <a href="http://www.example.com/page.exe" rel="nofollow">http://www.example.com/page.exe</a> we wouldn’t crawl. But <a href="http://www.example.com/page.exe?whatever" rel="nofollow">http://www.example.com/page.exe?whatever</a> we probably would be willing to crawl.
</p></blockquote>
<p>This explanation makes some sense.  I have always used the &#8220;definition&#8221; of a URL as provided by ASP&#8217;s ServerVariables collection, which doesn&#8217;t include the querystring (which is a separate animal).</p>
<p>But this leads to another concern, not so much for myself personally but for the average user who may not be so tech-savvy (i.e. the one Sint asked).  By this explanation, theoretically <a href="http://www.some-domain.com/some-virus.exe" rel="nofollow">http://www.some-domain.com/some-virus.exe</a> wouldn&#8217;t be indexed, but <a href="http://www.some-domain.com/some-virus.exe?this=a_random_querystring" rel="nofollow">http://www.some-domain.com/some-virus.exe?this=a_random_querystring</a> would.  I know a lot of people who don&#8217;t bother to look at the extension of a link before they click on it (in fact, I would say the vast majority wouldn&#8217;t) and may well stumble upon something they&#8217;re not supposed to.</p>
<p>It&#8217;s also theoretically possible, through cloaking, to present &#8220;valid&#8221; HTML content to a bot and something completely different to what is perceived to be a human.  </p>
<p>I think Sint has a good question&#8230;obviously the querystring as a standalone cannot be considered.  So what else is?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tim Rimmer</title>
		<link>http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/#comment-128839</link>
		<dc:creator>Tim Rimmer</dc:creator>
		<pubDate>Wed, 18 Jun 2008 03:57:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=962#comment-128839</guid>
		<description>on the topic of statics versus dynamic pages. 

I am surprised to learn that Google does not use the page extension as an indicator of quality. Even if a small one. 

Obviously anyone can hide a dynamic page through mod rewrite etc but still I would have thought that overall on the web there is more webspam in php, asp, aspx, cgi etc pages. 

Google, having the worlds best data set to test these things could do a correlation between extensions and quality and I am suprised that there is not a significant lean towards /html or no extension having better quality pages. 

Is there really no difference in the quality of pages with php vs htm or is it moreso that the colateral damage to good quality dynamic sites is too high?</description>
		<content:encoded><![CDATA[<p>on the topic of statics versus dynamic pages. </p>
<p>I am surprised to learn that Google does not use the page extension as an indicator of quality. Even if a small one. </p>
<p>Obviously anyone can hide a dynamic page through mod rewrite etc but still I would have thought that overall on the web there is more webspam in php, asp, aspx, cgi etc pages. </p>
<p>Google, having the worlds best data set to test these things could do a correlation between extensions and quality and I am suprised that there is not a significant lean towards /html or no extension having better quality pages. </p>
<p>Is there really no difference in the quality of pages with php vs htm or is it moreso that the colateral damage to good quality dynamic sites is too high?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: g1smd</title>
		<link>http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/#comment-128819</link>
		<dc:creator>g1smd</dc:creator>
		<pubDate>Tue, 17 Jun 2008 23:25:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=962#comment-128819</guid>
		<description>***  use of the trailing slash in filenames  ***  


No.   The trailing slash is NOT used for a filename.

Trailing slash is the correct canonical form for a FOLDER.

Redirect to that form, from the one without it.</description>
		<content:encoded><![CDATA[<p>***  use of the trailing slash in filenames  ***  </p>
<p>No.   The trailing slash is NOT used for a filename.</p>
<p>Trailing slash is the correct canonical form for a FOLDER.</p>
<p>Redirect to that form, from the one without it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: chuckallied</title>
		<link>http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/#comment-128740</link>
		<dc:creator>chuckallied</dc:creator>
		<pubDate>Mon, 16 Jun 2008 22:10:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=962#comment-128740</guid>
		<description>&#62;&#62;Jarlskov&#62;&#62; I could, and do, but in the same way Google image search displays image results differently, it'd be nice to see a Google File search that likewise includes sorting and display features specifically designed for files.</description>
		<content:encoded><![CDATA[<p>&gt;&gt;Jarlskov&gt;&gt; I could, and do, but in the same way Google image search displays image results differently, it&#8217;d be nice to see a Google File search that likewise includes sorting and display features specifically designed for files.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Matt Cutts</title>
		<link>http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/#comment-128732</link>
		<dc:creator>Matt Cutts</dc:creator>
		<pubDate>Mon, 16 Jun 2008 19:40:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=962#comment-128732</guid>
		<description>Ankit, I think that's a result of Google responding to feedback and trying to crawl urls that end in ".0"; we'll see how many binary packages come in as well before making a final decision.</description>
		<content:encoded><![CDATA[<p>Ankit, I think that&#8217;s a result of Google responding to feedback and trying to crawl urls that end in &#8220;.0&#8243;; we&#8217;ll see how many binary packages come in as well before making a final decision.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: BradleyT</title>
		<link>http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/#comment-128728</link>
		<dc:creator>BradleyT</dc:creator>
		<pubDate>Mon, 16 Jun 2008 17:16:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=962#comment-128728</guid>
		<description>What about no extension and no trailing slash?  This is the way Drupal creates pages when you turn on friendly URLs.

I guess since a site:drupal.org query returns 1.4 million results with many of them having no trailing slash and no extension it's fair to assume Google is OK with this?</description>
		<content:encoded><![CDATA[<p>What about no extension and no trailing slash?  This is the way Drupal creates pages when you turn on friendly URLs.</p>
<p>I guess since a site:drupal.org query returns 1.4 million results with many of them having no trailing slash and no extension it&#8217;s fair to assume Google is OK with this?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike Irving</title>
		<link>http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/#comment-128725</link>
		<dc:creator>Mike Irving</dc:creator>
		<pubDate>Mon, 16 Jun 2008 16:21:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=962#comment-128725</guid>
		<description>Hi Matt,

Interesting Article.

You briefly mention use of the trailing slash in filenames, i.e. www.somesite.com/some-content/

I have used various URL Re-Write methods in the past, some using the trailing slash and some not.

Also, sites with no URL Re-Write in place but with, for example, default.aspx or default.php placed in various folders are quite common so as to achieve "re-write effect" URLs without any actual rewriting.

I am just wondering whether "trailing slash" filenames are prefereble over non slash URLs, or vice-versa.

I suppose "trailing slash" URLs will be one character longer, accounting for the slash, but would that have any advantage/disadvantage when it comes to search ranking?

Would a "trailing slash" page be given less relevance as it could be interpreted as looking as if it belongs a section below the parent document?

Interesting things URLs, they vary so much from site to site.

Mike.</description>
		<content:encoded><![CDATA[<p>Hi Matt,</p>
<p>Interesting Article.</p>
<p>You briefly mention use of the trailing slash in filenames, i.e. <a href="http://www.somesite.com/some-content/" rel="nofollow">http://www.somesite.com/some-content/</a></p>
<p>I have used various URL Re-Write methods in the past, some using the trailing slash and some not.</p>
<p>Also, sites with no URL Re-Write in place but with, for example, default.aspx or default.php placed in various folders are quite common so as to achieve &#8220;re-write effect&#8221; URLs without any actual rewriting.</p>
<p>I am just wondering whether &#8220;trailing slash&#8221; filenames are prefereble over non slash URLs, or vice-versa.</p>
<p>I suppose &#8220;trailing slash&#8221; URLs will be one character longer, accounting for the slash, but would that have any advantage/disadvantage when it comes to search ranking?</p>
<p>Would a &#8220;trailing slash&#8221; page be given less relevance as it could be interpreted as looking as if it belongs a section below the parent document?</p>
<p>Interesting things URLs, they vary so much from site to site.</p>
<p>Mike.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ankit</title>
		<link>http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/#comment-128722</link>
		<dc:creator>Ankit</dc:creator>
		<pubDate>Mon, 16 Jun 2008 15:03:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=962#comment-128722</guid>
		<description>matt seems like mozzers are back with there web2.0 page in Google index !!
http://ankitrawat.com/blog/seomozz-got-its-web20-page-back-in-google/
what u have to say abt this ?</description>
		<content:encoded><![CDATA[<p>matt seems like mozzers are back with there web2.0 page in Google index !!<br />
<a href="http://ankitrawat.com/blog/seomozz-got-its-web20-page-back-in-google/" rel="nofollow">http://ankitrawat.com/blog/seomozz-got-its-web20-page-back-in-google/</a><br />
what u have to say abt this ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sint</title>
		<link>http://www.mattcutts.com/blog/dont-end-your-urls-with-exe/#comment-128713</link>
		<dc:creator>Sint</dc:creator>
		<pubDate>Mon, 16 Jun 2008 10:05:36 +0000</pubDate>
		<guid isPermaLink="false">http://www.mattcutts.com/blog/?p=962#comment-128713</guid>
		<description>Hi Matt,

Thanks for clearing some things up again.

I assume Google not only keeps these specific kinds of files (executables, binaries) from the SERP's because it is possible they aren't ordinary web pages, but also because downloading these files could be a security risk to users, when an executable contains hazardous code.

Does Google only look at the last characters of an URL, or do you also look at content types etc? Because else one could still get a file into Google by linking to it like www.domain.com/somevirus.exe?hello=world (adding a random parameter).

It seems logic to block some extensions of files that are, according to the web standards, no part of a web page, but ignoring the extension if a GET parameter is provided sounds a bit inconsistent to me. Especially when there are better ways to determine of which type a file is.

Maybe you could tell a bit more about why Google is handling the extensions this way?</description>
		<content:encoded><![CDATA[<p>Hi Matt,</p>
<p>Thanks for clearing some things up again.</p>
<p>I assume Google not only keeps these specific kinds of files (executables, binaries) from the SERP&#8217;s because it is possible they aren&#8217;t ordinary web pages, but also because downloading these files could be a security risk to users, when an executable contains hazardous code.</p>
<p>Does Google only look at the last characters of an URL, or do you also look at content types etc? Because else one could still get a file into Google by linking to it like <a href="http://www.domain.com/somevirus.exe?hello=world" rel="nofollow">http://www.domain.com/somevirus.exe?hello=world</a> (adding a random parameter).</p>
<p>It seems logic to block some extensions of files that are, according to the web standards, no part of a web page, but ignoring the extension if a GET parameter is provided sounds a bit inconsistent to me. Especially when there are better ways to determine of which type a file is.</p>
<p>Maybe you could tell a bit more about why Google is handling the extensions this way?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
