<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Pareto Charts</title>
	<atom:link href="http://peltiertech.com/WordPress/pareto-charts/feed/" rel="self" type="application/rss+xml" />
	<link>http://peltiertech.com/WordPress/pareto-charts/</link>
	<description>Peltier Tech Excel Charts and Programming Blog</description>
	<lastBuildDate>Thu, 09 Sep 2010 18:18:29 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
	<item>
		<title>By: DaleW</title>
		<link>http://peltiertech.com/WordPress/pareto-charts/comment-page-1/#comment-26871</link>
		<dc:creator>DaleW</dc:creator>
		<pubDate>Fri, 12 Feb 2010 03:15:21 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=3066#comment-26871</guid>
		<description>Derek,

Pareto charts often include a P-P (Percent-Percent) plot such as Jon&#039;s red curves.  This represents a cumulative distribution of effect (# of apps) against the cumulative ordered distribution of categories (for Pareto).

It&#039;s not at all ridiculous to show that such a curve passes through the (0,0) origin as well as the (100%,100%) point.  In fact, at the risk of showing the obvious, that&#039;s how I&#039;d prefer to draw it.   For example, QIMacros draws its Pareto chart in Excel that way, although it doesn&#039;t explicitly put cumulative % on the category count as Jon finally did under duress.

If you check Wikipedia, you&#039;ll find a distinction between P-P and Q-Q plots, and it is P-P plots that go through (0,0) and (1,1) at their edges, while Quantile-Quantile plots may go to infinity (for continuous variables) at their hypothetical vanishing edges (not generally plotted!).</description>
		<content:encoded><![CDATA[<p>Derek,</p>
<p>Pareto charts often include a P-P (Percent-Percent) plot such as Jon&#8217;s red curves.  This represents a cumulative distribution of effect (# of apps) against the cumulative ordered distribution of categories (for Pareto).</p>
<p>It&#8217;s not at all ridiculous to show that such a curve passes through the (0,0) origin as well as the (100%,100%) point.  In fact, at the risk of showing the obvious, that&#8217;s how I&#8217;d prefer to draw it.   For example, QIMacros draws its Pareto chart in Excel that way, although it doesn&#8217;t explicitly put cumulative % on the category count as Jon finally did under duress.</p>
<p>If you check Wikipedia, you&#8217;ll find a distinction between P-P and Q-Q plots, and it is P-P plots that go through (0,0) and (1,1) at their edges, while Quantile-Quantile plots may go to infinity (for continuous variables) at their hypothetical vanishing edges (not generally plotted!).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Peltier</title>
		<link>http://peltiertech.com/WordPress/pareto-charts/comment-page-1/#comment-26856</link>
		<dc:creator>Jon Peltier</dc:creator>
		<pubDate>Thu, 11 Feb 2010 18:42:02 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=3066#comment-26856</guid>
		<description>Jeff - Yeah, fixed it. Sorry. I&#039;d just written a response to you and forgot whose comment the next one was addressing.</description>
		<content:encoded><![CDATA[<p>Jeff &#8211; Yeah, fixed it. Sorry. I&#8217;d just written a response to you and forgot whose comment the next one was addressing.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Weir</title>
		<link>http://peltiertech.com/WordPress/pareto-charts/comment-page-1/#comment-26853</link>
		<dc:creator>Jeff Weir</dc:creator>
		<pubDate>Thu, 11 Feb 2010 18:14:25 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=3066#comment-26853</guid>
		<description>Jon - did you mean to say &lt;i&gt;Derek&lt;/i&gt;...I&#039;ve always had issues...?</description>
		<content:encoded><![CDATA[<p>Jon &#8211; did you mean to say <i>Derek</i>&#8230;I&#8217;ve always had issues&#8230;?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Colin Banfield</title>
		<link>http://peltiertech.com/WordPress/pareto-charts/comment-page-1/#comment-26850</link>
		<dc:creator>Colin Banfield</dc:creator>
		<pubDate>Thu, 11 Feb 2010 16:32:17 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=3066#comment-26850</guid>
		<description>Excellent! I should have guessed that you were already on the case :)</description>
		<content:encoded><![CDATA[<p>Excellent! I should have guessed that you were already on the case :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Peltier</title>
		<link>http://peltiertech.com/WordPress/pareto-charts/comment-page-1/#comment-26844</link>
		<dc:creator>Jon Peltier</dc:creator>
		<pubDate>Thu, 11 Feb 2010 13:54:07 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=3066#comment-26844</guid>
		<description>Colin -

I&#039;ve downloaded Wilkinson&#039;s article and applied its techniques to this data set. Once I understood the algorithms and ideas, making the chart was straightforward. I wish I had more time to keep up with the discussion, but I expect to post a follow-up article next week.</description>
		<content:encoded><![CDATA[<p>Colin -</p>
<p>I&#8217;ve downloaded Wilkinson&#8217;s article and applied its techniques to this data set. Once I understood the algorithms and ideas, making the chart was straightforward. I wish I had more time to keep up with the discussion, but I expect to post a follow-up article next week.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Colin Banfield</title>
		<link>http://peltiertech.com/WordPress/pareto-charts/comment-page-1/#comment-26842</link>
		<dc:creator>Colin Banfield</dc:creator>
		<pubDate>Thu, 11 Feb 2010 13:04:52 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=3066#comment-26842</guid>
		<description>Interesting discussion. So are you going to follow Naomi&#039;s advice? Apparently, Lee Wilkinson proposed something called a Pareto dot plot. I&#039;m sure that you must be curious to know what this is and to be the first to demonstrate publicly how to build one in Excel :)</description>
		<content:encoded><![CDATA[<p>Interesting discussion. So are you going to follow Naomi&#8217;s advice? Apparently, Lee Wilkinson proposed something called a Pareto dot plot. I&#8217;m sure that you must be curious to know what this is and to be the first to demonstrate publicly how to build one in Excel :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Peltier</title>
		<link>http://peltiertech.com/WordPress/pareto-charts/comment-page-1/#comment-26839</link>
		<dc:creator>Jon Peltier</dc:creator>
		<pubDate>Thu, 11 Feb 2010 12:09:53 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=3066#comment-26839</guid>
		<description>Derek -

I&#039;ve always had issues with how the first and last points are considered on this kind of scale. In an Excel histogram (cough!) the first bin always has one point, unless multiple points have the minimum of the data set. In fact, some of the different definitions of percentile (or in general, N-tile) came about because of different ways to do the math. (i-1/2)/N for example, which is like measuring to the middle of each bar in a bar chart, where the bars are one unit apart along X. But other methodologies use different algorithms; I remember using something like (i-j)/(N-k) in a Weibull package we used for reliability determination, which gave very similar positioning along X.

Using cumulatives allows me to skip out on the whole percentile/bin definition thing ;-)</description>
		<content:encoded><![CDATA[<p>Derek -</p>
<p>I&#8217;ve always had issues with how the first and last points are considered on this kind of scale. In an Excel histogram (cough!) the first bin always has one point, unless multiple points have the minimum of the data set. In fact, some of the different definitions of percentile (or in general, N-tile) came about because of different ways to do the math. (i-1/2)/N for example, which is like measuring to the middle of each bar in a bar chart, where the bars are one unit apart along X. But other methodologies use different algorithms; I remember using something like (i-j)/(N-k) in a Weibull package we used for reliability determination, which gave very similar positioning along X.</p>
<p>Using cumulatives allows me to skip out on the whole percentile/bin definition thing ;-)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: derek</title>
		<link>http://peltiertech.com/WordPress/pareto-charts/comment-page-1/#comment-26831</link>
		<dc:creator>derek</dc:creator>
		<pubDate>Thu, 11 Feb 2010 09:31:20 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=3066#comment-26831</guid>
		<description>Sorry, of course it&#039;s not quantile-quantile, or else the first point would have to be at the origin (0%, 0%), which would be ridiculous. It&#039;s a cumulative against quantile graph, and I was distracted by the fact that they&#039;re both scaled as percentages. As a cumulative, it is right that the first value should have a position 18% or so up the vertical scale right from the start; it&#039;s only on the horizontal scale that the first data point needs to start from 0%.</description>
		<content:encoded><![CDATA[<p>Sorry, of course it&#8217;s not quantile-quantile, or else the first point would have to be at the origin (0%, 0%), which would be ridiculous. It&#8217;s a cumulative against quantile graph, and I was distracted by the fact that they&#8217;re both scaled as percentages. As a cumulative, it is right that the first value should have a position 18% or so up the vertical scale right from the start; it&#8217;s only on the horizontal scale that the first data point needs to start from 0%.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: derek</title>
		<link>http://peltiertech.com/WordPress/pareto-charts/comment-page-1/#comment-26830</link>
		<dc:creator>derek</dc:creator>
		<pubDate>Thu, 11 Feb 2010 09:25:43 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=3066#comment-26830</guid>
		<description>Jon, I like your labeled &lt;a href=&quot;http://google.com/search?q=quantile-quantile&quot; rel=&quot;nofollow&quot;&gt;quantile-quantile plot&lt;/a&gt;, but I think the first, MAX, item should be at 0% just as the last, MIN, item is at 100%.  By being at 0% it will still be a part of the first, 0%-20%, quintile, but on the very boundary, as appropriate for its statuis as the highest value in the distribution.</description>
		<content:encoded><![CDATA[<p>Jon, I like your labeled <a href="http://google.com/search?q=quantile-quantile" rel="nofollow">quantile-quantile plot</a>, but I think the first, MAX, item should be at 0% just as the last, MIN, item is at 100%.  By being at 0% it will still be a part of the first, 0%-20%, quintile, but on the very boundary, as appropriate for its statuis as the highest value in the distribution.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Peltier</title>
		<link>http://peltiertech.com/WordPress/pareto-charts/comment-page-1/#comment-26772</link>
		<dc:creator>Jon Peltier</dc:creator>
		<pubDate>Tue, 09 Feb 2010 20:41:58 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=3066#comment-26772</guid>
		<description>Rob has always called it the &quot;XY&quot; Chart Labeler, so I guess he&#039;ll never change. It works on all chart types that accept data labels (i.e., all except surface/contour plots), so I just call it the Chart Labeler.</description>
		<content:encoded><![CDATA[<p>Rob has always called it the &#8220;XY&#8221; Chart Labeler, so I guess he&#8217;ll never change. It works on all chart types that accept data labels (i.e., all except surface/contour plots), so I just call it the Chart Labeler.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
