site statistics Archives

Individuals-Moving Range Analysis

This table shows the individual values and moving ranges for the three main browsers. The means and control limits are computed below the table of values, and values in the table are colored red if they lie outside the control limits. The values show browser usage each month by percent of visits to my site.

There is a lot of red (i.e., out of statistical control) in the IE and Chrome individual values, notably at the beginning and end, indicating a trend from start to finish. Firefox shows only one red point, and there’s no obvious trend. The only red value in the moving range data is a single point for IE.

The data is plotted in the following I-MR charts. The Y axis ranges are the same for all browsers for easy comparison. The trends for Internet Explorer and Chrome are rather obvious when the new control limits are plotted.

For IE (above) the upper and lower control limits calculated using mean and standard deviation were 67.9% and 53.5%, much further apart than those in the I-MR chart; in fact, those limits fall outside the Y axis scale of the I-MR chart. For Chrome (below), the Mean-SD upper control limit is 10.4%, which also falls outside the corresponding I-MR chart. Both calculations for Chrome’s LCL are below zero; since this makes no physical sense, zero is used.

The Firefox control limits based on mean and SD are further apart than the I-MR limits, by one percentage point (32.4% and 27.0%), but would still be visible in this chart.

I-MR Analysis for Sparse Data

The conclusion from my earlier post was that three points over 18 months is insufficient data to judge whether there was a trend in the browser usage percentages. This conclusion holds when the more rigorous I-MR evaluation is carried out. If we perform the above analysis on four points, one point every six months, the I-MR calculations and charts show the processes are in control, and cannot be attributed to changing patterns of usage.

The moving range values are much larger than for monthly data points, since six months of changes are lumped into one point. As a result, the control limits are pushed far enough away from the means that there are no out-of-control points.

The data is plotted in the following I-MR charts. The Y axis ranges are the same for all browsers for easy comparison. Although we “see” trends for Internet Explorer and Chrome, since there are no points outside the control limits and not enough points to invoke the special Western Electric rules, we cannot conclude there is any variation not attributable to random fluctuations.

Yesterday we were treated to discussions about the visitor stats of several Excel bloggers:

Chandoo: June was Pointy Haired Dilbert Blog’s Best month ever
J-Walk’s Spreadsheet Page Blog: June Visitor Stats
Daily Dose of Excel’s June Stats
Peltier Tech Blog’s Web Stats – June 2009
Debra’s Contextures 200906 Site Stats

I thought it would be interesting to compare some of my favorite blogs and web sites. Unless you have access to the data for each site, it’s not so easy. One service that lets you make comparisons is Alexa. Using Alexa you can compare up to five sites in a number of categories. I usually look at Reach, Pageviews, and Traffic Rank, which are defined by Alexa as follows:

Reach measures the number of users. Reach is typically expressed as the percentage of all Internet users who visit a given site.

Pageviews measure the number of pages viewed by site visitors. Multiple page views of the same page made by the same user on the same day are counted only once.

Traffic rank is based on three months of aggregated historical traffic data from millions of Alexa Toolbar users and data obtained from other, diverse traffic data sources, and is a combined measure of page views and users (reach).

I plotted a few groups of data, since Alexa only allows five curves on a chart. Since this is my blog, I plotted the Peltier Tech data on all charts, as a benchmark. In the first group I plotted Daily Dose and Spreadsheet Page. Then I wondered whether John Walkenbach’s old site, J-Walk.com, is still getting traffic, taking away from Spreadsheet Page. Finally I added old pal Tushar Mehta’s site, since he’s been around a long time (after j-walk.com but before peltiertech.com).

I’m pulling a slightly larger audience than the other sites, according to the Reach figures. I don’t know what the spike is in early May: it doesn’t show up in any other website analytics service I follow.

Daily Dose and Spreadsheet Page are close to each other, and it looks like j-walk.com still has a significant presence. John could do some .htaccess magic to transfer the link juice to his newer site.

It would be nice to be able to rescale the Y axis, maybe 0 to 0.007%, to provide a little resolution at the low end. Or maybe use a log scale.

Reach: PTS Blog, J-Walk, Spreadsheet Page, Daily Dose of Excel, and Tushar-Mehta

Pageviews is a hard chart to analyze. Again, stretching the Y axis to scale from 0 to 0.0002% or using a log scale would spread out the low-end values.

Pageviews: PTS Blog, J-Walk, Spreadsheet Page, Daily Dose of Excel, and Tushar-Mehta

Traffic Rank is shown on a logarithmic scale. I don’t know what kind of algorithm Alexa uses to get Rank from Pageviews and Reach, but it must be somehow multiplicative. A slight advantage in both measures of Peltier Tech over the others means Peltier Tech is visible near the bottom (actually, alternating above and below the X axis), while the others are apparently below the axis.

Traffic Rank: PTS Blog, J-Walk, Spreadsheet Page, Daily Dose of Excel, and Tushar-Mehta

In my next group, I included The Spreadsheet Page (again), and I added Chandoo’s Pointy Hairde Dilbert, since he started this whole discussion. I’ve also included Debra Dalgleish’s popular Contextures web site, and also Chip Pearson’s encyclopedic site.

Chip pretty much owns the rest of us in terms of Reach, except for my early May spike, and Chandoo’s Lifehacker spike in mid-June. Debra’s site is consistently close to Chip’s and a bit higher than mine. PHD and Spreadsheet Page track each other fairly closely, except for Chandoo’s spike in June.

Reach: PTS Blog, Pointy Haired Dilbert, Spreadsheet Page, Chip Pearson, and Contextures

As before, the inflexible Y axis scale makes Pageviews simply a reminder to mow the lawn.

Pageviews: PTS Blog, Pointy Haired Dilbert, Spreadsheet Page, Chip Pearson, and Contextures

Traffic Rank shows all of us venturing into visible territory, with the relative placements similar to those shown in the Reach chart.

Traffic Rank: PTS Blog, Pointy Haired Dilbert, Spreadsheet Page, Chip Pearson, and Contexturesa

To put this all into perspective, I decided to compare my site to a few larger ones, namely Amazon, Microsoft, and Google. These sites are a little more popular, so Alexa’s Y axes expand upwards to accommodate them (and push my data ever lower in the charts).

If you look closely at the Reach chart you can see Peltier Tech: it’s the blue line obscuring the X axis. Compared to Google, Microsoft and Amazon are near the bottom. On a log scale, they’d be much closer to Google than to Peltier Tech.

Reach: PTS Blog, Amazon, Microsoft, and Google

Again, Peltier Tech is coloring the X axis blue in the Pageviews chart. Again, Google’s numbers dwarf Amazon’s and Microsoft’s.

Pageviews: PTS Blog, Amazon, Microsoft, and Google

At least the Peltier Tech site shows a little texture in Traffic Rank. Good thing that’s a logarithmic scale! Google’s rank serves as the gridline for Y=1. The log scale also helps MS and AMZ stay near the top of the chart, with ranks around 15 and 30.

Traffic Rank: PTS Blog, Amazon, Microsoft, and Google

So what does all of this mean? I don’t know, but it’s a fun way to kill a little time. I guess I can rest assured that the Peltier Tech web site is in the middle of the pack of the popular Excel sites. But if all of you readers tell your friends, and twitter all about my site, and your friends all post on FaceBook about Peltier Tech, and I don’t know, we throw in a little MySpace and Digg and Technorati and other funny sounding words, and the momentum grows, I’ll probably still be firmly lodged in the middle of the pack.

site statistics

SPC Approach to Browser Stats

Individuals-Moving Range Analysis

I-MR Analysis for Sparse Data

Further Reading about Statistical Process Control

Statistical Process Control Articles in this Blog

More Web Stats Madness