<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Siegel-Tukey: a Non-parametric test for equality in variability (R code)</title>
	<atom:link href="http://www.r-statistics.com/2010/02/siegel-tukey-a-non-parametric-test-for-equality-in-variability-r-code/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.r-statistics.com/2010/02/siegel-tukey-a-non-parametric-test-for-equality-in-variability-r-code/</link>
	<description>Writing about statistics with R, and open source stuff (software, data, community)</description>
	<lastBuildDate>Fri, 10 Feb 2012 03:07:42 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Andrea</title>
		<link>http://www.r-statistics.com/2010/02/siegel-tukey-a-non-parametric-test-for-equality-in-variability-r-code/comment-page-1/#comment-13557</link>
		<dc:creator>Andrea</dc:creator>
		<pubDate>Fri, 25 Nov 2011 13:17:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.r-statistics.com/?p=161#comment-13557</guid>
		<description>Hello Chris, 

maybe the problems of Melissa could be arise from the asymptotic distribution of the statistic when n1+n2&gt;20. n1 =size sample1; n2= size sample2; In this example the sum of the sizes of the samples is exactly 20. 

I have a question about which rank sum has to be choice when samples are the same size. When n1==n2 which rank sum we have to choice the rank sum of the sample1 or the sample2?</description>
		<content:encoded><![CDATA[<p>Hello Chris, </p>
<p>maybe the problems of Melissa could be arise from the asymptotic distribution of the statistic when n1+n2&gt;20. n1 =size sample1; n2= size sample2; In this example the sum of the sizes of the samples is exactly 20. </p>
<p>I have a question about which rank sum has to be choice when samples are the same size. When n1==n2 which rank sum we have to choice the rank sum of the sample1 or the sample2?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chris</title>
		<link>http://www.r-statistics.com/2010/02/siegel-tukey-a-non-parametric-test-for-equality-in-variability-r-code/comment-page-1/#comment-12376</link>
		<dc:creator>Chris</dc:creator>
		<pubDate>Thu, 13 Oct 2011 10:09:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.r-statistics.com/?p=161#comment-12376</guid>
		<description>Hello Mélissa

Do you have any news on this issue?

Thanks

Chris</description>
		<content:encoded><![CDATA[<p>Hello Mélissa</p>
<p>Do you have any news on this issue?</p>
<p>Thanks</p>
<p>Chris</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: melissa</title>
		<link>http://www.r-statistics.com/2010/02/siegel-tukey-a-non-parametric-test-for-equality-in-variability-r-code/comment-page-1/#comment-4547</link>
		<dc:creator>melissa</dc:creator>
		<pubDate>Thu, 10 Feb 2011 11:29:33 +0000</pubDate>
		<guid isPermaLink="false">http://www.r-statistics.com/?p=161#comment-4547</guid>
		<description>Hello,

First of all, thank you for sharing your code.
I have got some questions/notices concerning it:
- It seems that the line:
&quot;print(wilcox.test(data$x[data$y==1],data$x[data$y==y]))&quot; provides an error while looking with data with decimals.

(I just remove it and no more errors when there are decimals)

- The adjustment of the medians does not seem to work.
Below is an example of the fact that it does not work:
### adjust.median=F
x&lt;-c(177,200,227,230,232,268,272,297)
y&lt;-c(47,105,126,142,158,172,197,220,225,230,262,270)
siegel.tukey(x,y)
## pval : 0.9385

### adjust.median=T
x&lt;-c(177,200,227,230,232,268,272,297)
y&lt;-c(47,105,126,142,158,172,197,220,225,230,262,270)
siegel.tukey(x,y,adjust.median=T)
## pval : 0.9079

### by adjusting before the  medians

x&lt;-c(177,200,227,230,232,268,272,297)
y&lt;-c(47,105,126,142,158,172,197,220,225,230,262,270)
medx&lt;-median(x)
medymedy){
    x &lt;-x -(medx-medy) 
  }
  if (medx&lt;medy){
    y &lt;- y -(medy-medx) 
  }
siegel.tukey(x,y)
### pval: 0.09716
We do not have the same pvalues (at all as you can notice).

Just for your information, these two vectors come from a book (i can give you the reference) and the pvalue they found is 0.0976, by doing the median adjustment.

For the moment, I do not know how to correct it (sorry for that, I will do this adjustment manually before), but I can in the future propose something if you are interested of course.
By the way I’m just wondering whether this test is meaningful if we don’t adjust the medians… for me no but I may be wrong.

Best regards,

Mélissa</description>
		<content:encoded><![CDATA[<p>Hello,</p>
<p>First of all, thank you for sharing your code.<br />
I have got some questions/notices concerning it:<br />
- It seems that the line:<br />
&#8220;print(wilcox.test(data$x[data$y==1],data$x[data$y==y]))&#8221; provides an error while looking with data with decimals.</p>
<p>(I just remove it and no more errors when there are decimals)</p>
<p>- The adjustment of the medians does not seem to work.<br />
Below is an example of the fact that it does not work:<br />
### adjust.median=F<br />
x&lt;-c(177,200,227,230,232,268,272,297)<br />
y&lt;-c(47,105,126,142,158,172,197,220,225,230,262,270)<br />
siegel.tukey(x,y)<br />
## pval : 0.9385</p>
<p>### adjust.median=T<br />
x&lt;-c(177,200,227,230,232,268,272,297)<br />
y&lt;-c(47,105,126,142,158,172,197,220,225,230,262,270)<br />
siegel.tukey(x,y,adjust.median=T)<br />
## pval : 0.9079</p>
<p>### by adjusting before the  medians</p>
<p>x&lt;-c(177,200,227,230,232,268,272,297)<br />
y&lt;-c(47,105,126,142,158,172,197,220,225,230,262,270)<br />
medx&lt;-median(x)<br />
medymedy){<br />
    x &lt;-x -(medx-medy)<br />
  }<br />
  if (medx&lt;medy){<br />
    y &lt;- y -(medy-medx)<br />
  }<br />
siegel.tukey(x,y)<br />
### pval: 0.09716<br />
We do not have the same pvalues (at all as you can notice).</p>
<p>Just for your information, these two vectors come from a book (i can give you the reference) and the pvalue they found is 0.0976, by doing the median adjustment.</p>
<p>For the moment, I do not know how to correct it (sorry for that, I will do this adjustment manually before), but I can in the future propose something if you are interested of course.<br />
By the way I’m just wondering whether this test is meaningful if we don’t adjust the medians… for me no but I may be wrong.</p>
<p>Best regards,</p>
<p>Mélissa</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tal Galili</title>
		<link>http://www.r-statistics.com/2010/02/siegel-tukey-a-non-parametric-test-for-equality-in-variability-r-code/comment-page-1/#comment-3806</link>
		<dc:creator>Tal Galili</dc:creator>
		<pubDate>Fri, 17 Dec 2010 19:42:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.r-statistics.com/?p=161#comment-3806</guid>
		<description>Thank you for catching (and reporting!) this opossum.

I&#039;ve updated the code with your corrections.</description>
		<content:encoded><![CDATA[<p>Thank you for catching (and reporting!) this opossum.</p>
<p>I&#8217;ve updated the code with your corrections.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: opossum</title>
		<link>http://www.r-statistics.com/2010/02/siegel-tukey-a-non-parametric-test-for-equality-in-variability-r-code/comment-page-1/#comment-3805</link>
		<dc:creator>opossum</dc:creator>
		<pubDate>Fri, 17 Dec 2010 19:17:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.r-statistics.com/?p=161#comment-3805</guid>
		<description>The code for determining ranks is buggy. Check, e.g.,

x1 &lt;- c(85, 106)
x2 &lt;- c(96, 105, 104, 108, 86)
iv &lt;- rep(1:2, c(length(x1), length(x2)))
siegel.tukey(c(x1, x2), iv, id.col=TRUE)

The rank for the middle element (104) should be 7, but it&#039;s calculated as 8. If N is the length of the combined data, this works:

TF &lt;- rep(c(TRUE, FALSE, FALSE, TRUE), ceiling(N/4))
up &lt;- TF[1:min(N, length(TF))]
Rup &lt;- rank(X)[up]
Rdown &lt;- rev(rank(X)[!up])
Rx &lt;- c(Rup, Rdown)</description>
		<content:encoded><![CDATA[<p>The code for determining ranks is buggy. Check, e.g.,</p>
<p>x1 &lt;- c(85, 106)<br />
x2 &lt;- c(96, 105, 104, 108, 86)<br />
iv &lt;- rep(1:2, c(length(x1), length(x2)))<br />
siegel.tukey(c(x1, x2), iv, id.col=TRUE)</p>
<p>The rank for the middle element (104) should be 7, but it&#039;s calculated as 8. If N is the length of the combined data, this works:</p>
<p>TF &lt;- rep(c(TRUE, FALSE, FALSE, TRUE), ceiling(N/4))<br />
up &lt;- TF[1:min(N, length(TF))]<br />
Rup &lt;- rank(X)[up]<br />
Rdown &lt;- rev(rank(X)[!up])<br />
Rx &lt;- c(Rup, Rdown)</p>
]]></content:encoded>
	</item>
</channel>
</rss>

