So online metrics are hard, there’s no question about it. Some are harder than others (oh, say.. time spent on site as an example), but they are generally difficult. Even the most basic metric, hits/page views, is difficult and building on top of that the notion of counting visits is more difficult and unique visitors the most perilous of the three.

The problem with counting hits is that there’re all sorts of critters now visiting your site that aren’t really people. All sorts of spiders and crawlers and robots that shouldn’t be counted as part of your legitimate ad loving traffic. Logfile analysis is one way to pull this data - it has all the information of everything your webserver has ever served up. The rub is that it is tough to determine real from automated - looking at User Agents works somewhat, but it is nearly impossible to maintain a list of every known robot out there and then to keep up with them. Getting most of them right, especially, the big ones, might be close enough for some. On the other hand, web beacons (like Google Analytics and Omniture) use javascript to count your pageviews live. This has the advantage of never counting robots (since robots, to my knowledge, never interpret javascript) but may miss pageviews if the browser doesn’t run javascript, has javascript turned off or the user simply clicks to another page before the beacon executes.

Counting unique visitors is even more difficult. Assuming that you were counting your pageviews correctly, it’s remarkably difficult to accurately determine who one visitor is. The most enduring example of this is the user who surfs from home and at work - there is really no way to generically determine that they are and I’d guess every analytics package, online or off, double counts them.

So, as you can see, pulling those numbers reliably is a feat. However, in general, I think that most web beacons are very close to the truth on this. I won’t say the same for most log analysis, but I don’t have a breadth of experience with these so there very well may be many that get it right. But check out this piece from the nytimes that talks about how ComScore and Nielsen’s numbers are radically different (and lower) than the publishers’ own numbers.

At first blush you figure that the independent auditors numbers must be more right, since it is obviously in the publishers’ favour to inflate their traffic numbers (thus making them more attractive to advertisers). But here’s the thing, ComScore and Nielsen use neither of the above methods of pulling this data, they’re using the old method of selecting a panel of users, installing some software on their systems to monitor where they go and then extrapolate the world of internet users from that base. This is literally the dumbest thing I’ve heard all day (sure the day is young, but I’m confident I’ll hear nothing dumber).

Selecting this panel gives all sorts of bias to the data. For an extreme example of this sample bias, look no further than the oft and justifiably maligned Alexa. These numbers are so demonstrably wrong it’s amazing. I can imagine that Nielsen has some better distribution, but there’s always going to be selection bias. Do you or anyone you know or anyone you know know anyone who is part of the Nielsen net ratings panel? Does that give you any confidence that you are being represented in their scores?

In this online world where all the data is there to pull real numbers I don’t understand why they would be doing panel sampling to estimate the true numbers, they are all there. Far from understanding their own fallibility or at least their weakness/strength when compared to these other numbers, they are attacking them assuming theirs to be the one true metric. Nielsen isn’t used to a world with metrics that compete with their own (and I suspect that if there were other tv metrics, you would find a similar difference). If they employed a combination of panel and beacon/log analysis and did some kind of super fancy PhD voodoo reconciling the two, that might provide the most interesting insight into traffic. If they partnered with Omniture, as an example, that could be a huge partnership in the metrics world. Unfortunately, they don’t seem to be interested in this.

Beyond even all that are the numbers that 3rd party advertising systems pull, like DoubleClick. They tell their users how many impressions each ad receives. This often diverges from how many impressions the publisher believes they’ve delivered because of the way impressions are counted. For example, DoubleClick stops counting impressions after 3 or 5 deliveries of an ad to the same user. A publisher who wasn’t doing that would continue counting and thus be over delivering by DoubleClick’s measures. The problem is that there is no standard.

As you can see there’s a lot of trickiness to pulling metrics. It is a little bit of the wild west out there with many competing standards. It would be good to get some kind of organizational body that had the clout to force some sort of uniformity on all these different services, but I doubt that that will happen anytime soon. Still, if I had to put my money on one of these horses, I’d bet that the big web beacons are the most accurate of the lot - handily beating log and panel analysis.

UPDATE: Well, as Kirkunit mentions below… Quantcast is one pioneer of the hybrid model. And it’s great. I’d sure like to see more people enter that game, though.

← newer WiMax, we barely knew ye  ↑  Breakfast Links: He-Man, Fraudster & Fraud older →

TwitterCounter for @nybble73