<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Deming Regression Utility</title>
	<atom:link href="http://peltiertech.com/WordPress/deming-regression-utility/feed/" rel="self" type="application/rss+xml" />
	<link>http://peltiertech.com/WordPress/deming-regression-utility/</link>
	<description>Peltier Tech Excel Charts and Programming Blog</description>
	<lastBuildDate>Tue, 16 Mar 2010 21:53:50 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
	<item>
		<title>By: Mike</title>
		<link>http://peltiertech.com/WordPress/deming-regression-utility/comment-page-1/#comment-21712</link>
		<dc:creator>Mike</dc:creator>
		<pubDate>Fri, 06 Nov 2009 03:04:35 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=2472#comment-21712</guid>
		<description>Hi

When I use  PTS Deming Regression Utility in Excel 2007 I get the following message:

 &quot;Run-time error &#039;13&#039;: Type mismatch&quot;

Can you help me sort this out to get the utility working for me?

Thanks</description>
		<content:encoded><![CDATA[<p>Hi</p>
<p>When I use  PTS Deming Regression Utility in Excel 2007 I get the following message:</p>
<p> &#8220;Run-time error &#8216;13&#8242;: Type mismatch&#8221;</p>
<p>Can you help me sort this out to get the utility working for me?</p>
<p>Thanks</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Peltier</title>
		<link>http://peltiertech.com/WordPress/deming-regression-utility/comment-page-1/#comment-20341</link>
		<dc:creator>Jon Peltier</dc:creator>
		<pubDate>Fri, 09 Oct 2009 15:22:05 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=2472#comment-20341</guid>
		<description>Dale -  You&#039;ve been working on me for several days now, and my resistance is crumbling. I&#039;ll start planning it soon.</description>
		<content:encoded><![CDATA[<p>Dale &#8211;  You&#8217;ve been working on me for several days now, and my resistance is crumbling. I&#8217;ll start planning it soon.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: DaleW</title>
		<link>http://peltiertech.com/WordPress/deming-regression-utility/comment-page-1/#comment-20337</link>
		<dc:creator>DaleW</dc:creator>
		<pubDate>Fri, 09 Oct 2009 14:31:39 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=2472#comment-20337</guid>
		<description>To follow up on Matt&#039;s comments:  as an extension of OLS, Deming regression fits an XY slope m through the center of one&#039;s data somewhere between m1 by OLS and 1/m2 by reverse OLS, depending on one&#039;s estimate of the ratio of residual variances in the two variables.

Jon&#039;s utility currently covers a very special case where the experimenter replicated every measurement to allow a good estimate of this Y:X variance ratio.

At the other extreme, in the absence of information, a data miner might guess that  the error in each variable is proportional to its scatter (standard deviation), and Deming regression splits the difference, bisecting our data ellipse along its primary axis, with a slope that simplifies to sqrt(m1/m2).  

Jon, the VBA code that you cited by E. Joseph Billo for that special case could be very easily modified to use a user selected estimate of the variance ratio &quot;lambda&quot; for the general XY regression problem, wouldn&#039;t you say?  Add radio buttons to the dialog for three or four limiting special cases, and you&#039;d have a very cool Excel utility for general Deming regression (linear orthogonal regression for two variables).</description>
		<content:encoded><![CDATA[<p>To follow up on Matt&#8217;s comments:  as an extension of OLS, Deming regression fits an XY slope m through the center of one&#8217;s data somewhere between m1 by OLS and 1/m2 by reverse OLS, depending on one&#8217;s estimate of the ratio of residual variances in the two variables.</p>
<p>Jon&#8217;s utility currently covers a very special case where the experimenter replicated every measurement to allow a good estimate of this Y:X variance ratio.</p>
<p>At the other extreme, in the absence of information, a data miner might guess that  the error in each variable is proportional to its scatter (standard deviation), and Deming regression splits the difference, bisecting our data ellipse along its primary axis, with a slope that simplifies to sqrt(m1/m2).  </p>
<p>Jon, the VBA code that you cited by E. Joseph Billo for that special case could be very easily modified to use a user selected estimate of the variance ratio &#8220;lambda&#8221; for the general XY regression problem, wouldn&#8217;t you say?  Add radio buttons to the dialog for three or four limiting special cases, and you&#8217;d have a very cool Excel utility for general Deming regression (linear orthogonal regression for two variables).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Matt Healy</title>
		<link>http://peltiertech.com/WordPress/deming-regression-utility/comment-page-1/#comment-20315</link>
		<dc:creator>Matt Healy</dc:creator>
		<pubDate>Fri, 09 Oct 2009 02:30:00 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=2472#comment-20315</guid>
		<description>To follow up on previous comments, most variants of linear regression have the line pass through the point (mean_x, mean_y), so the difference is in the slope.  If we call m1 the slope from regression of Y against X, and m2 the slope from regression of X against Y (swapping variables), then m1 times m2 will equal r-squared (try it yourself!).  Of course r-squared will be the same either way.  If one assumes the error in each variable is proportional to its value, then the best estimate for the slope will be simply the ratio of the standard deviations of X and Y.

Of course robust regression techniques use shrinkage estimators, order statistics, and the like to reduce the impact of outliers on the slope, which is another whole ball game.  OLS and its variants assume an underlying Gaussian distribution but if the data come from a heavy-tailed distribution then OLS is too sensitive to outliers.</description>
		<content:encoded><![CDATA[<p>To follow up on previous comments, most variants of linear regression have the line pass through the point (mean_x, mean_y), so the difference is in the slope.  If we call m1 the slope from regression of Y against X, and m2 the slope from regression of X against Y (swapping variables), then m1 times m2 will equal r-squared (try it yourself!).  Of course r-squared will be the same either way.  If one assumes the error in each variable is proportional to its value, then the best estimate for the slope will be simply the ratio of the standard deviations of X and Y.</p>
<p>Of course robust regression techniques use shrinkage estimators, order statistics, and the like to reduce the impact of outliers on the slope, which is another whole ball game.  OLS and its variants assume an underlying Gaussian distribution but if the data come from a heavy-tailed distribution then OLS is too sensitive to outliers.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Peltier</title>
		<link>http://peltiertech.com/WordPress/deming-regression-utility/comment-page-1/#comment-20235</link>
		<dc:creator>Jon Peltier</dc:creator>
		<pubDate>Wed, 07 Oct 2009 03:41:57 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=2472#comment-20235</guid>
		<description>Dale -

Admittedly, my machine example was a hack. Obviously when the two machines agree anywhere near as close as this, there is no practical difference between them. Your suggestion for an unpaired analysis is a good one. I&#039;ll check about altering the algorithm.</description>
		<content:encoded><![CDATA[<p>Dale -</p>
<p>Admittedly, my machine example was a hack. Obviously when the two machines agree anywhere near as close as this, there is no practical difference between them. Your suggestion for an unpaired analysis is a good one. I&#8217;ll check about altering the algorithm.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: DaleW</title>
		<link>http://peltiertech.com/WordPress/deming-regression-utility/comment-page-1/#comment-20232</link>
		<dc:creator>DaleW</dc:creator>
		<pubDate>Wed, 07 Oct 2009 01:54:55 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=2472#comment-20232</guid>
		<description>Jon -

Unfortunately, the example from your prior post on Deming regression doesn&#039;t show us that Deming regression will give a noticeably different result than ordinary least squares (OLS) regression.

The slope (Y to X) of a best fit line is about 0.993 by OLS and 0.994 by reverse OLS, with a standard error of 0.014 for either, so we can&#039;t say whether the true slope is slightly more or less than one.  Orthogonal regression can try to more precisely place the line between those limiting cases (no error in X or no error in Y), but it is such a small tweak here that it makes no practical difference.

To make a real difference, r-squared, which also equals the ratio between those two OLS slopes(!), generally has to be a lot less than 100%.  It was 99.84% for your example.  On the other hand, for a good measurement system, the vast majority of the total variance is explained by the sample differences, not the machine differences, so r-squared between paired machine measurements across the same  samples tends to be very high.

Although I would prefer not to locate an example in my files of a bad enough measurement system with replicated measurements so that Deming regression could have been helpful, I&#039;ll have to check . . .</description>
		<content:encoded><![CDATA[<p>Jon -</p>
<p>Unfortunately, the example from your prior post on Deming regression doesn&#8217;t show us that Deming regression will give a noticeably different result than ordinary least squares (OLS) regression.</p>
<p>The slope (Y to X) of a best fit line is about 0.993 by OLS and 0.994 by reverse OLS, with a standard error of 0.014 for either, so we can&#8217;t say whether the true slope is slightly more or less than one.  Orthogonal regression can try to more precisely place the line between those limiting cases (no error in X or no error in Y), but it is such a small tweak here that it makes no practical difference.</p>
<p>To make a real difference, r-squared, which also equals the ratio between those two OLS slopes(!), generally has to be a lot less than 100%.  It was 99.84% for your example.  On the other hand, for a good measurement system, the vast majority of the total variance is explained by the sample differences, not the machine differences, so r-squared between paired machine measurements across the same  samples tends to be very high.</p>
<p>Although I would prefer not to locate an example in my files of a bad enough measurement system with replicated measurements so that Deming regression could have been helpful, I&#8217;ll have to check . . .</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Peltier</title>
		<link>http://peltiertech.com/WordPress/deming-regression-utility/comment-page-1/#comment-20220</link>
		<dc:creator>Jon Peltier</dc:creator>
		<pubDate>Tue, 06 Oct 2009 18:01:22 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=2472#comment-20220</guid>
		<description>Dale -

When I &quot;made up&quot; the data, I used Excel&#039;s NORMINV to compute the errors, and I think I used sigma values that differed by about 3:1, with higher variance in X. I was playing around to find points that looked good on screen.

Do you have any real world data that would satisfy your last sentence? It would be interesting. I presume you want more deviation than in my last chart in my first post?</description>
		<content:encoded><![CDATA[<p>Dale -</p>
<p>When I &#8220;made up&#8221; the data, I used Excel&#8217;s NORMINV to compute the errors, and I think I used sigma values that differed by about 3:1, with higher variance in X. I was playing around to find points that looked good on screen.</p>
<p>Do you have any real world data that would satisfy your last sentence? It would be interesting. I presume you want more deviation than in my last chart in my first post?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: DaleW</title>
		<link>http://peltiertech.com/WordPress/deming-regression-utility/comment-page-1/#comment-20219</link>
		<dc:creator>DaleW</dc:creator>
		<pubDate>Tue, 06 Oct 2009 16:49:07 +0000</pubDate>
		<guid isPermaLink="false">http://peltiertech.com/WordPress/?p=2472#comment-20219</guid>
		<description>Hi Jon,

If I follow correctly, your Deming regression utility is designed for analyzing pairs of data where each point is immediately replicated, hence the requirement that both X and Y ranges have an even number of points?

Those replicates allow the algorithm to have a good estimate of the variance ratio for the errors (simple measurement repeatability, ignoring any bias to the nominal value).  This Y to X variance ratio -- that is assumed infinite for ordinary least squares where all the assumed error is Y -- seems to be about 0.306 in your example dataset.  (This might be another useful output for your utility?)

If I round to four decimal places, your Deming regression slope was given as 0.9940, your Demining utility gave 0.9942 for the rounded data on your screenshot, and I got an estimate of 0.9939 inputting my quick estimate of the variance ratio and your screenshot data into a commercial statistics package that supports Orthogonal Regression.  The statistical uncertainty at 95% confidence level for the orthogonal slope is a couple orders of magnitude greater than these order 0.0001 differences, so I&#039;m going to say our results agree.  

Probably a better test case would have lower correlation between the two variables, so some meaningful differences might emerge by comparison with Ordinary Least Squares (both Y v. X and X v. Y).</description>
		<content:encoded><![CDATA[<p>Hi Jon,</p>
<p>If I follow correctly, your Deming regression utility is designed for analyzing pairs of data where each point is immediately replicated, hence the requirement that both X and Y ranges have an even number of points?</p>
<p>Those replicates allow the algorithm to have a good estimate of the variance ratio for the errors (simple measurement repeatability, ignoring any bias to the nominal value).  This Y to X variance ratio &#8212; that is assumed infinite for ordinary least squares where all the assumed error is Y &#8212; seems to be about 0.306 in your example dataset.  (This might be another useful output for your utility?)</p>
<p>If I round to four decimal places, your Deming regression slope was given as 0.9940, your Demining utility gave 0.9942 for the rounded data on your screenshot, and I got an estimate of 0.9939 inputting my quick estimate of the variance ratio and your screenshot data into a commercial statistics package that supports Orthogonal Regression.  The statistical uncertainty at 95% confidence level for the orthogonal slope is a couple orders of magnitude greater than these order 0.0001 differences, so I&#8217;m going to say our results agree.  </p>
<p>Probably a better test case would have lower correlation between the two variables, so some meaningful differences might emerge by comparison with Ordinary Least Squares (both Y v. X and X v. Y).</p>
]]></content:encoded>
	</item>
</channel>
</rss>
