survey results Archives

Cole Nussbaumer wrote about displaying survey results Logic in Order in her Storytelling with Data blog last week. You have to love the goal of her blog, which is “to help rid the world of ineffective graphs, one exploding, 3D pie chart at a time.” She writes extensively about communicating effectively with data.

The point of her article was that you should plot your data in some kind of logical order. If the values are plotted by month, sort the months chronologically, not alphabetically. It’s a good article, with a point I know I’ve often tried to make; go read it.

Stacked Bar Charts

To illustrate her points, she used sanitized results of a survey that asked users whether they were completely satisfied, very satisfied, somewhat satisfied, not very satisfied, or not satisfied at all with a number of features, or whether they used them at all. I’ve recreated Cole’s first chart below. .

Stacked Bar Chart - Unused plotted low

I’m going to let Cole go off on her tangent about presenting such data, while I go off on my own.

I thought, first of all, survey responses should be symmetric, meaning equal numbers of favorable and unfavorable responses plus a neutral response. (I’ve abandoned surveys that forced me to choose sides when I really was neutral.) I’ve changed the vaguely satisfied response to “Neutral”, and shortened the others to “Love”, “Like”, “Dislike”, and “Hate”.

Second, “Did you use a feature?” is a different question from “How did you like a feature?” The first has follow-ups to find out why a feature wasn’t used (didn’t need it, couldn’t find the button, didn’t understand the instructions); the second to find out why a feature was liked or disliked (whether it worked as expected, whether the output was acceptable).

I think we should split out the “Did not use” responses, then recompute the other percentages based on the fraction of users who used each features.

The following chart sorts the features in decreasing order of “Not used”.

Stacked Bar Chart - With Unused in Panel

The stacked bar panel on the right of the chart shows results from negative on the left to positive on the right, like the number line you learned in elementary school, or maybe high school. The only thing is, the central value of zero, between the negative and the positive responses, isn’t centered on the number line.

Diverging Stacked Bar Charts

Why not stagger the bars left or right until the neutral bar is centered on zero, as shown below:

Diverging Stacked Bar Chart

This is called a Diverging Stacked Bar Chart, which I first read about in Plotting Likert and Other Rating Scales by Naomi Robbins and Richard Heiberger. My commercial add-in Peltier Tech Charts for Excel can build these charts. I’ll also post a tutorial soon.

The following combines the diverging stacked chart with a panel showing the “Not used” data.

Diverging Stacked Bar Chart

There doesn’t seem a strong correlation between “Not used” and the ratings, other than the two most widely used features being the most liked, so it was probably wise to separate the data in this way.

Here is the same chart, plotted by the sum of positive responses.

Diverging Stacked Bar Chart

On the other hand, if I’m concerned with making improvements in a product, It might be more informative to sort by sum of negative responses…

Diverging Stacked Bar Chart

… or even by the percentage of strongly negative responses (“Hate”).

Diverging Stacked Bar Chart

The diverging stacked chart shares a drawback with most stacked charts, including the plain stacked chart at the opening of this article. Since the bars are irregularly stacked, they do not begin at a common baseline, so it is not easy to compare individual values.

“Converging” Stacked Bar Charts

Let’s take the diverging stacked bar chart, and turn it inside out. Instead of the neutral responses straddling the vertical axis, let’s place the strongest responses on either side of the axis, then the weaker responses further out, and finally split the neutral responses on the outside of the stacks. Since it inverts the Diverging Stacked Bar Chart, I’m calling this the “Converging” Stacked Bar Chart.

Diverging Stacked Bar Chart

When comparing these results, we want to see the strong responses (Love and Hate), then more than the weak responses, we’d most likely want to see the cumulative responses (Love + Like as well as Hate + Dislike). The chart above uses the vertical axis as a baseline for the strong responses as well as for the cumulative responses. Without much effort, we see that the chart above is sorted by the sum of positive responses (Love + Like), the dark blue and medium blue bars.

Since the neutral bars are merely padding the ends of the stacks, let’s just leave them off.

Diverging Stacked Bar Chart

The following chart combines the “Not used” responses with the feature ratings, sorted by increasing usage.

Diverging Stacked Bar Chart

Here the chart is sorted by total positive responses (dark plus light blue).

Diverging Stacked Bar Chart

Now we’ve sorted by total negative responses (red plus orange bars) to help focus on the features that people don’t like so much.

Diverging Stacked Bar Chart

Finally, the chart is sorted by strong negative responses, which is substantially the same as above.

Diverging Stacked Bar Chart

These “converging” charts make it easier to rank the responses by eye. However, if we want to directly compare positive and negative responses, we shouldn’t draw the bars in opposite directions. I’ve written about this weakness of tornado diagrams, and suggested ways to improve such charts.

Clustered-Stacked Bar Charts

Now let’s start with the “converging” chart above, and fold it on its vertical axis so the positive and negative bars both extend to the right. Of course we offset the bars so they are stacked next to each other without one set obscuring the other. I wrote a tutorial about these Clustered and Stacked Column and Bar Charts, and I already have a Chart Utility that cranks them out.

Diverging Stacked Bar Chart

The chart above still has the neutral responses padding both positive and negative stacks. If we leave off the neutral responses, we can reduce the area needed for the chart while reducing clutter. Obviously the chart is sorted by total positive responses. This data set was easy: the positives far outweigh the negatives; if the responses were closer, the side-by-side stacking here would make it easier to tell whether more people liked or hated certain features.

Diverging Stacked Bar Chart

Here I’ve sorted by total negative responses, and to make the negative values stand out, I’ve used a lighter tint for the fill of the positive bars.

Diverging Stacked Bar Chart

Here I’ve sorted by strong negative responses, and lightened the weak negative responses.

Diverging Stacked Bar Chart

For completeness, here I’ve combined the “Not used” panel with the clustered-stacked bars, sorted by feature usage.

Diverging Stacked Bar Chart

The same sorted by total positive responses…

Diverging Stacked Bar Chart

… by total negative responses…

Diverging Stacked Bar Chart

… and by strong negative responses.

Diverging Stacked Bar Chart

Dot Plots

There is a problem with legibility in clustered-stacked charts when there are a lot of categories and the bars become thin. If you look back at the first couple of clustered-stacked charts in the preceding section, you may feel that the charts are dense, heavy, dark. This is simply because there are many dark bars in close proximity, and such saturation of ink is overwhelming. Not only does the chart feel heavy, it is difficult to make out all of the series of data, unless some of the colors are lightened to emphasize certain other colors (as in the later charts).

Dot plots solve this issue by only placing a marker at the data point itself, rather than a bar spanning the entire distance from the baseline to the data point. This leaves plenty of refreshing white space, while making all of the data clearly visible, particularly the smaller values. You can read about Dot Plots in Dot Plots: A Useful Alternative to Bar Charts, by Naomi Robbins.

The dot plot below shows Strong Negative (e.g., Hate), Total Negative (Hate + Dislike), Strong Positive (Love), and Total Positive (Love + Like). We really don’t need to see the weak positives and negatives, since the totals are more interesting.

Dot Plot Sorted by Positive Responses

These dot plots are sorted by Total Negative (left) and Strong Negative (right).

Dot Plot Sorted by Total Negative and by Strong Negative Responses

Dot plots can be combined in a panel chart with the “Not Used” bar chart seen in the other panel charts. This chart is sorted by “Not Used”.

Dot Plot Panel Sorted by 'Not Used'

This dot plot panel chart is sorted by Total Positive. It’s much easier in this chart than in the clustered-stacked chart to see how the negatives are increasing while the positives decrease.

Dot Plot Panel Sorted by Positive Responses

For consistency and because I’m paid by the column-inch, here is a dot plot panel chart sorted by Total Negative…

Dot Plot Panel Sorted by Negative Responses

… and here’s one sorted by Strong Negative.

Dot Plot Panel Sorted by Strong Negative Responses

Summary

There are numerous ways to evaluate survey responses. I took a sample survey and separated out two types of responses: whether users used features, and how much users liked the features they did use. Since these responses asked different questions and lead to different follow-up questions, it is beneficial to consider them separately. In many cases I plotted the “Not used” responses in a separate panel from the user ratings of the features.

Simple stacked bar charts allow us to show survey responses ranked from worst on the left to best on the right.

Diverging stacked bar charts keep the stacking order the same, and align the bars so that positive responses all lie to the right of a vertical axis, and negative responses lie to the left.

“Converging” stacked bar charts keep the alignment of the diverging bars (negative to the left and positive to the right of a reference line), but locate the strongest responses next to the reference line so that both strong responses and total cumulative responses use the line as a baseline.

Clustered-stacked bar charts stack up positive and negative responses to the right of a common reference line, so that positive and negative responses can be directly compared.

Dot plots show the data in the same way as clustered-stacked bar charts, except there are only markers and not entire bars in the chart. This makes the chart more open and allows easier reading of smaller values.

Panel charts can be used to show different types of data, such as the “Features Not Used” and “How Users Liked Features They Used” panels in these charts.

As always, your choice of chart type depends on your immediate requirements. For example, if I wanted to use these results to drive development of a software package, I might simplify the plots, and only plot “Not used” and either strong negative or total negative. This would show the development team which features need better visibility, so more people would use them, and which features need improvement, so fewer people hate them.

Diverging Stacked Bar Charts in Peltier Tech Charts for Excel

This article shows how to use Diverging Stacked Bar Charts to display survey results. This manual process takes time, is prone to error, and becomes tedious.

I have created Peltier Tech Charts for Excel to create Diverging Stacked Bar Charts (and many other custom charts) automatically from raw data. This utility, a standard Excel add-in, lays out data in the required layout, then constructs a chart with the right combination of formatting and chart types.

Diverging Stacked Bar Chart

This is a commercial product, tested on thousands of machines in a wide variety of configurations, Windows and Mac, which saves time and aggravation.

Please visit the Peltier Tech Charts for Excel page for more information.

In the wake of the Mark Sanford scandal, in which the South Carolina governor was accused of hiking in the Andes with his mistress or something, political blog FiveThirtyEight has published results of surveys about Sanford in Should Sanford Resign?. I was pointed to the FiveThirtyEight survey results by Andrew Gelman in Doing graphics the George Orwell way, and he got it from Should Mark Sanford Resign?. Andrew’s take is that the pie charts used by FiveThirtyEight to present the survey results weren’t as bad as they first looked.

Polling and Election Results on FiveThirtyEight

I’ve stated in this blog that the FiveThirtyEight web site uses pie charts in a very effective way. It helps that there are only two or three segments in their pies, and the third, if present, is very small. Here is a partial view of the FiveThirtyEight analysis of the November 2008 US elections.

FiveThirtyEight Pie Charts of 2008 US Election

This isn’t a full-blown Chart Busters article, but I thought I’d discuss some alternatives.

In an unfinished post from last fall, I undertook to replace FiveThirtyEight’s pies with bars, using roughly the same amount of space. The bars may have slightly better resolution, but I didn’t include a third bar for the third tiny segment. Take your pick.

Jon's Bar Charts of 2008 US Election

Survey Results

FiveThirtyEight used pie charts to illustrate the survey results. As every second grade student will tell you, pies are the chart of choice when showing parts of a whole. The educational system has trained our children to behave like sheep, and follow the herd.

Editorializing aside, these charts of the poll results are what motivated this post.

Is Mark Sanford more or less ethical than other politicians?

FiveThirtyEight didn’t even change the Excel defaults for their pie. Black chart border, ugly fill colors, and way more space everywhere than needed, especially on either side of the chart. It’s nice that the data points are labeled, but they are only partially labeled with the percentages, and the reader still must journey back and forth to the legend to see which segment means what. I also think it’s useful to put the results of a poll not in numerical order (55%-18%-18%-9%) but in some value order of the categories (more-same-less).

FiveThirtyEight Pie Chart Results of Poll on Mark Sanford's Ethics

I have redesigned the pie chart, shrinking the white space and using arguably better colors. I also reordered the points, and centered the chart differently to try to make comparisons easier.

Jon's Redone Pie Chart Results of Poll on Mark Sanford's Ethics

All well and good, but without the percentages, it’s still a guess whether More or Less had a greater response.

So I pulled a bar chart out of my magic hat. We can see that most of the cynical poll responders think Sanford’s ethics are no different than the other politicians, and equal numbers thought he was more ethical and less ethical. 9% have been busy watching Real Housewives and don’t know who Mark Sanford is.

Jon's Bar Chart Results of Poll on Mark Sanford's Ethics

Should Mark Sanford Resign?

FiveThirtyEight used another pie to show whether respondents want Sanford to resign. The colors have been changed from the gruesome defaults, but the large wasted spaces remain, as does the legend and border. Also, FiveThirtyEight didn’t use its usual style where competing results A and B meet at the vertical line at 12 o’clock.

FiveThirtyEight Pie Chart Results of Poll on Whether Mark Sanford should Resign

I’ve addressed these stylistic issues in my own version of the pie chart, and it doesn’t totally suck.

Jon's Redone Pie Chart Results of Poll on Whether Mark Sanford should Resign

Here’s the obligatory bar chart. Apparently 15% of the respondents think he should stay at the junkyard with his son.

Jon's Bar Chart Results of Poll on Whether Mark Sanford should Resign

survey results

Charting Survey Results

Stacked Bar Charts

Diverging Stacked Bar Charts

“Converging” Stacked Bar Charts

Clustered-Stacked Bar Charts

Dot Plots

Summary

Diverging Stacked Bar Charts in Peltier Tech Charts for Excel

Political Pie Charts