Peltier Tech Blog

Excel Chart Add-Ins | Training | Charts and Tutorials | Peltier Tech Blog

 

Main menu:

Peltier Tech Chart Utilities for Excel Peltier Tech Panel Chart Utility Peltier Tech Waterfall Chart Utility Peltier Tech Cluster-Stack Chart Utility Peltier Tech Box and Whisker Chart Utility Peltier Tech Marimekko Chart Utility Peltier Tech Dot Plot Utility Peltier Tech Cascade Chart Utility

 
Excel Dashboards
 

 
Amazon Books
 

Subscribe

Site search

Subscribe

Site search


Recent Posts

Popular Posts

Categories


 

Privacy Policy

Creative Commons License
Licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported License.

Stacked vs. Clustered

 
by Jon Peltier
Wednesday, August 27th, 2008
Peltier Technical Services, Inc., Copyright © 2012.
Licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 Unported License.

Stacked charts are commonly used to compare proportions within a whole. In this post I will concentrate on stacked and clustered vertical column charts, but the arguments hold true for horizontal bar charts, and to some extent for area charts. This table shows some make-believe sales figures for an imaginary company, by quarter and by sales region.

Stacked Column Chart Data

Stacked Charts

In the chart below, we see quarter by quarter sales by North, South, and West sales regions. The total height of a stacked bar shows total sales, consisting of the sum of the heights (sales) for the individual regions.

Stacked Column Chart

We get a similar stacked chart if we plot the data by row instead of by column. Each stacked bar shows the total sales for all four quarters for a given region.

Stacked Column Chart

What are stacked charts good for? In the two examples above, we can easily see trends in the total sales by category (by quarter or by region). We can also judge the trends in the lowest series of the stack.

What are the problems with the stacked charts? In these two examples, we cannot follow trends in the upper series in each stack, because only the bottom series has a flat baseline. In the top chart, I cannot tell whether the South and West regions vary at all from quarter to quarter. We can see the decrease by region in the bottom chart, but only because it is almost too large an effect to miss. We cannot easily compare the different blocks in the South region or in the West region, because they are stacked upon each other, rather than laid out side by side.

Clustered Charts

How can we preserve the ability to judge trends in all series in the stacked chart? Easy: unstack the series. This clustered column chart makes it easy to compare the values of any of the regions for any of the quarters. There are in fact noticeable trends in the South and West regions, in addition to the more obvious North region.

Clustered Column Chart

I’ve seen people fancy up their clustered charts by playing with the overlap. This works okay if the taller bars are behind the shorter ones, but the width of the bars in back is partially obscured, which may lead to unintentional misjudgment of the bars’ values. It’s probably best not to risk distorting the data.

Clustered Column Chart

Here’s the clustered chart with series plotted by rows. The trends in the data are much more evident than in the stacked chart.

Clustered Column Chart

In fact, each cluster shows the trend by quarter, highlighted by the lines and markers below.

Clustered Column Chart with Lines

Line Charts

This leads to an alternative to stacked and to clustered columns: line charts. In general, time series (data plotted with time along the X axis) are better plotted with lines than columns. The markers show the discrete data values, and the connecting lines help to show the trends.

Line Chart

Excel offers a stacked line chart option. These should be avoided because the stacked nature of the data is not evident; in stacked column charts, the points have visible heights that can be added by stacking. In the chart below, a casual glance makes it seem that West is the region with highest sales.

Stacked Line Chart

A weakness of the clustered column chart, and of the line chart above, is that the chart no longer indicates the total sales by category. This could be addressed by adding another column series in the clustered chart, but I think this is somewhat confusing. The total should not be in a color that may make it seem like a regular series, but if too dark a shade of gray is used, the total series will overwhelm the chart.

Column Chart

If the total values are not too large relative to the individual values, adding a line series to the line chart is a natural way to show the totals.

Line Chart

A line chart plotted by rows instead of columns makes less sense. It seems okay at first, but there is no natural X axis flow from region to region as there is from one quarter to the next.

Line Chart

The chart above makes it look like things are getting worse, but simply rearranging the categories makes it look like there was a peak in sales.

Line Chart

The columns in a column chart emphasize the values, not the trends between values. The lines in a line chart emphasize the trends, and lead to interpolation between points. Since one can’t interpolate between non-numerical categories, a line chart should only be used for data with date/time or numerical X values. This is especially true if the categories can be arbitrarily ordered.

In summary

Although stacked charts display parts of a whole, only trends in the bottom series and in the total of all series can be accurately assessed.

Clustered charts allow all data points to be compared because there is a single baseline. Clustered charts show data at greater resolution because axis is tied to largest single point, not largest total. Totals are not usually included, so proportions are not as easy to judge.

Point-to-point trends are displayed more clearly in line charts, with markers showing values and connecting lines showing the transitions between points. If the horizontal axis is based on dates or on numerical values, line charts should be used rather than column charts. If the horizontal axis is categorical, the lines may give an erroneous impression that there is a somewhat continuous transition between categories.

For most purposes, clustered charts are preferred over stacked charts. Clustered charts should be used for non-numerical categories. Line charts should be used if the category (X) data is time based or numerical. Stacked line charts should be avoided, because the stacked nature of the data is not evident.

Related Posts:

Learn how to create Excel dashboards.

Comments


Comment from Tim Gerlitz
Time: Wednesday, December 30, 2009, 5:12 pm

Just wanted to drop you a line and thank you for your tutorials. Hard to find comprehensive information like this anywhere else. I use the tutorials all the time and they have helped me tremendously in my work. thanks again.


Comment from Jon Peltier
Time: Thursday, December 31, 2009, 2:18 am

Tim – Thanks for the note. Glad to hear it’s helping.

Write a comment

I welcome comments from my readers. If you have an opinion on this post, if you have a question or if there is anything to add, I want to hear from you. Whether you agree or disagree, please join the discussion.

If you want to include an image in your comment, post it on your own site or on one of the many free image sharing sites, and include a link in your comment. I'll download your image and insert the necessary html to display the image inline.

Read the PTS Blog Comment Policy.





Subscribe without commenting

Peltier Tech Chart Utilities for Excel Peltier Tech Waterfall Chart Utility Peltier Tech Box and Whisker Chart Utility Peltier Tech Cluster-Stack Chart Utility Peltier Tech Panel Chart Utility Peltier Tech Marimekko Chart Utility Peltier Tech Dot Plot Utility Peltier Tech Cascade Chart Utility

Create Excel dashboards quickly with Plug-N-Play reports.