Archive for December, 2006

My Firefox vs. IE stats

At Pubcon in Vegas, the speakers did a bit of 10-second market research on 1000+ site owners and webmasters. The questions were: “How many of you use IE as your primary browsers? Now, how many of you use Firefox as your primary browsers?” By my count, among the savvy webmasters who go to Pubcon, Firefox users outnumbered IE users 2 to 1 or 3 to 1.

So I was interested to learn (via The Inquirer) that one metrics company recently claimed Firefox use is growing in Europe. In Finland, over 39% of people reportedly used Firefox! Finland! Land of beautiful koivu! (That’s birch to us English speakers.)

My stats run closer to 50/50. Here’s a snapshot from last month, for example:
Browser share in November 2006 for mattcutts.com

Frankly, I’m a little surprised IE has that much share for my visitors. This past week my “How Google deals with hacked sites” post showed up on digg/Slashdot/reddit. How does that crowd stack up? IE was only 22.5% of browsers that day, with more Safari users and many more Firefox users:
Browser share on December 4th 2006 for mattcutts.com

Now I have a couple questions. First, do any other metrics companies break down browser market share? Bill Tancer or Compete, do you have any Firefox vs. IE data? OneStat, StatCounter, or eXTReMe Tracking, whaddaya got for us? Second question: how does Firefox vs. IE break down for your websites?

Added: I use Google Analytics for my stats, just because it is easy and I am lazy. And now I really am going to try to get outside or exercise and not just blog all weekend. :)

Comments (95)

By the way…

In case you couldn’t tell, my wife has been out of town this week. :) She went back to Omaha to help her sister take care of our new niece. Soon my wife will return and then I’m sure I’ll return to my normal, slower blogging pace. That’ll be okay though, because I’ll sleep more.

Update: I made it a good 8-10 hours without posting! I’ll never know how Barry Schwartz does it..

Comments (15)

2006 Search Blogs Awards

Okay, that was quite a few heavy posts there. Let’s do something fun now. Loren Baker has opened nominations for the 2006 Search Blogs Awards. Here’s the votes I threw in:

* Best Search Engine Corporate Blog (owned by the search engines)
Tough one. Google Reader Blog for me.

* Best Contextual Advertising Blog
Gotta be Jensense.

* Best Search Engine Community Blog
I’m going to go with SEOmoz for this year. The signal is high and lively compared to the noise.

* Best Web 2.0 Blog
TechCrunch

But I found myself wishing for some other categories.

* Most interesting Whitehat blog:
Spamhuntress, narrowly edging out Eric Enge. Keep an eye on Eric, though. He’s got a fun idea for a series called This Old Web Site. Just like This Old House shows you how to remodel your home, Eric goes through a real-world example of a site that they remodeled and shows you everything they changed. The site was Videomaker.com, which is a real-world magazine, and Eric increased visitors by 67%. Good stuff.

* Most interesting Blackhat blog:
SEO Black hat. What the hell, I already linked to him once tonight. Fun to read, but following his advice could get all your sites burned to the ground.

* Most tireless chronicler of the search industry:
Barry over at Search Engine Roundtable. Heck, Barry could halve his blog posts and stay ahead of pretty much everyone except Gary Price.

* Thinks most like a Googler:
Philipp over at Google Blogoscoped. I wish I’d had the ideas for this click survey or this Internet chess moves visualization. Philipp, I’m tired of being mad about this post. I’m ready to talk about the bet now whenever you want.

* Best off-the-beaten-track Google news:
Google Operating System

* Best up-and-comer blog of the year:
Bruce Clay Blog. I know I know, it didn’t really start in 2006, but 2006 is the year it became a must-read. Lisa and Susan, get them to turn on comments over there, will ya? :)

* Best blog covering the legal side of search:
Eric Goldman

* Blog most likely to provoke deep thoughts:
Greg Linden. How come we can’t get another 5-10 blogs like Greg’s?

* Biggest search story of the year:
China. Runner-up: Danny Sullivan leaves Search Engine Watch after 10 years.

* Blogger who gave me the most work combating hoax marketing:
Easily Graywolf. However, I create a special lifetime achievement award in this category for Threadwatch. :)

* Best search patent blog:
SEO by the SEA. This category will be summarily given to Bill Slawski for a five year term, barring a major shake-up.

* Best search blog for people who have real lives and don’t live and breathe search 24 hours a day:
John Battelle

* Best search radio show/podcast of the year:
The Daily SearchCast

* Best search video of the year:
(Okay, I did some. Did anyone else do search videos? If so, comment. Maybe more people will hop in this niche next year.)

* Dead blogs I miss the most:
Tie: Xooglers and Spam Kings. I’ll pour out a 40oz for you, homies.

* Favorite video games this year:
Tie: LEGO Star Wars II: The Original Trilogy. Games made for 10-year-olds are just about the right difficulty level for me. :) The other was online-game Kingdom of Loathing. If you wondering why I didn’t post as much in say, April or May, blame it on KoL. :)

Comments (22)

Communication in other languages

Presumably you saw my post about talkorigins.org, a site that was recently hacked so that the front page had spammy porn text and links. Google temporarily removed talkorigins.org from our index, but we emailed talkorigins.org to alert them that they had been hacked. We also made it possible for talkorigins.org to confirm the penalty in our Webmaster console tools. Once the spammy porn links/text were gone, Google reincluded the site in our index within days.

So Google tries to alert hacked sites of problems; that’s good. But we also email many sites for violations of our quality guidelines, such as hidden text. Take for example the case of trouw.nl, a leading Dutch newspaper. They wrote an article criticizing the fact that Google temporarily removed trouw.nl from our index for hidden text, and emphasized their belief that Google should have alerted them to the removal. In fact, Google did email trouw.nl.

What exactly was trouw.nl doing? By using Cascading Style Sheet (CSS), dozens of words were hidden on hundreds or thousands of pages on the site. Here’s the code that was on the front page of trouw.nl in October, for example:


<div class=”indexKeywords”>
dagblad trouw, podium, nieuws, achtergronden, kranten, verdieping, opvoeding, onderwijs, religie, filosofie, natuurtochten, gezondheid(s)zorg, cultuur, natuur, milieu, stijlboek, recensies, boeken, chat, polderpeil, maandaggids, dinsdaggids, woensdaggids, donderdaggids, vrijdaggids, weekendgids, letter, geest, letter&geest, boekrecensies, novum, laatstenieuws, rss, handheld, dossiers, trouwkabinet, illegaletrouw, ephimenco, schouten, spotprenten, spotprent, len, tom, modernemanieren, cryptogram, zusje, kritieken, nieuwskoppen, horizonreizen, relatie, parship, schrijfboek, webshop, trouwcompact, compact, animatie(s), Flash, video, radio, strip(s).</div>

See the indexKeywords div? If you examine http://www.trouw.nl/trouw.nl/styles/basic.css, you’ll see that the properties of that div are


.indexKeywords,
.indexLinks,
.copyright,
.nostylesheetText {
display: none;
visibility: hidden;
width: 776px;
}

The net effect of that CSS div is to hide those 60+ keywords in a way that is completely invisible to users. In case you’re wondering, trouw.nl also used the indexLinks div style to hide multiple links as well. It’s interesting that the definition of indexKeywords remains in the CSS of trouw.nl, even though they’ve removed the actual hidden text.

In general, I do not feel that Google is obligated to notify every site that we remove from Google’s index for violating our quality guidelines. Our webspam team does not have infinite resources, and our primary goal has to be to protect Google users by keeping our index clean. However, in this case Google did email trouw.nl (in Dutch) to alert them about their hidden text. I’ll include an excerpt of the email that we sent to multiple email addresses, including webmaster at trouw.nl and support at trouw.nl:

Geachte eigenaar of webmaster van trouw.nl/,

Tijdens het indexeren van uw webpagina’s is geconstateerd dat enkele van uw pagina’s technieken gebruiken die in strijd zijn met onze kwaliteitsrichtlijnen. Deze richtlijnen kunt u vinden op: http://www.google.nl/webmasters/guidelines.html
Om de kwaliteit van onze zoekmachine te waarborgen zullen enkele van uw pagina’s tijdelijk uit onze zoekresultaten verwijderd worden. Momenteel staan de pagina’s van trouw.nl/ op het punt om verwijderd te worden voor een periode van ten minste 30 dagen.

In het bijzonder zijn de volgende technieken geconstateerd op uw pagina’s:

* De onderstaande verborgen tekst op trouw.nl/:

dagblad trouw, podium, nieuws, achtergronden, kranten, verdieping, opvoeding, onderwijs, religie, filosofie, natuurtochten, gezondheid(s)zorg, cultuur, natuur, milieu, stijlboek, recensies, boeken, chat, polderpeil, maandaggids, dinsdaggids, woensdaggids, donderdaggids, vrijdaggids, weekendgids, letter, geest, letter&geest, boekrecensies, novum, laatstenieuws, rss, handheld, dossiers, trouwkabinet, illegaletrouw, ephimenco, schouten, spotprenten, spotprent, len, tom, modernemanieren, cryptogram, zusje, kritieken, nieuwskoppen, horizonreizen, relatie, parship, schrijfboek, webshop, trouwcompact, compact, animatie(s), Flash, video, radio, strip(s).

As you can see, we tried to alert trouw.nl that we were taking action on their hidden text and hidden links. We mentioned the page with the issue (in this case, the root page), and we included the actual hidden text. The rest of the email goes on to describe how to request that Google reconsider the site for reinclusion in our index. After trouw.nl removed the hidden text and hidden links, Google reincluded the site.

I understand that trouw.nl was frustrated to be removed from Google’s index, but our users have told Google repeatedly that they hate webspam and don’t like seeing pages with hidden text secretly buried on the page. Hidden text is also not fair to other sites that try to compete for similar queries without hiding words from users.

In this case, I believe that Google did more than any other search engine does:
- We provided our webmaster guidelines in Dutch at http://www.google.nl/webmasters/guidelines.html (”Avoid hidden text or hidden links”? “Vermijd verborgen teksten en verborgen links.”)
- We scheduled the site to be removed for 30+ days so that users wouldn’t get hidden-text, hidden-link pages back in response to searches.
- We made it possible for trouw.nl to confirm that they had a penalty via our Webmaster console.
- We emailed trouw.nl in Dutch with the exact page to check and the exact text to look for.
- Once the site removed the hidden text and hidden links, we reincluded trouw.nl.

In reviewing this situation, I believe that the webspam team handled this issue in a way that protected our users but also tried to alert the site to issues. We will continue to work to improve our communication so that legitimate sites receive even more information to help them with webspam-related issues.

Q: So you’re not just working on webspam in English?
A: No! We are continuing our anti-spam efforts in many different languages, as you can see from this situation. In fact, I expect Google to focus even more effort on other languages in coming months. I’m extremely proud of our webspam team members who are located in Europe (and other places around the world). I’m also sending one of the best people on our Mountain View-based webspam team, Brian White, to Europe for six months in 2007 to provide the webspam team in Europe with more even more visibility and more support.

Q: Matt, you still love Dutch sites and the Netherlands, right?
A: Yes. :) One of my favorite authors is Dutch. Janwillem van de Wetering is the best existential mystery writer in the world, without question. Because of him, I can’t wait to drink jenever in Amsterdam someday. But we also have to protect Google’s users and the quality of our index. :)

Update: I’ve been in contact with someone at Trouw.nl, and as always, there are two sides to the story. The email addresses we tried to use to contact Trouw didn’t exist, so Trouw couldn’t have received our message. This situation shows that the idea of contacting site owners is solid, but we can still find ways to improve our communication and webmaster outreach. Trouw has also added an update at the end of their article saying much the same thing.

Comments (89)

New Google UI feature: Plus Box

This will interest webmasters and local businesses. If someone searches for your business and we have good confidence that we know your business address, we’ll include an expandable “Plus Box” in your search result listing. If a user clicks to expand the Plus Box, they’ll see something like this:

Plus box example

I really like this feature because it’s useful but you don’t have to remember to go to a completely different place to look up local info. Instead, this feature integrates with our main web search to help the majority of our users without them doing anything extra.

What’s that? You have a local business and would like to get a Plus Box too? You can tell us your business address at Google’s Local Business Center .

Q: How much does it cost?
A: It’s free.

Q: How do you know my address/info is right?
A: I believe we can verify your info by phone or by postcard.

Q: What if I’ve got multiple store locations?
A: No problem. You can upload multiple listings, thanks to Google Base.

Q: What sort of info can I tell you about my business?
A: I’m so glad you asked! It looks like this:

Google Local Business Center

Q: What Google help documentation exists for the Plus Box?
A: I know of http://www.google.com/support/webmasters/bin/answer.py?answer=52181
and http://www.google.com/support/webmasters/bin/answer.py?answer=55093
and http://www.google.com/support/webmasters/bin/answer.py?answer=52171 .

I like this feature. You can tell us store hours, what sort of payment you accept, phone number, etc. I think every local business would benefit from using this service. It’s free, and it makes it easier for searchers to find your business. If you’ve got a local business, feel free to sign up now.

Hat tips to Michael, Ionut, Philipp, and Barry.

P.S. Here’s a completely unrelated tip for small businesses. If your small business doesn’t have a website, you can still advertise on AdWords and Google will host a business page for you.

Comments (49)

Next entries » · « Previous entries