Comments on: Siegel-Tukey: a Non-parametric test for equality in variability (R code)

By: boris

boris — Thu, 02 Mar 2017 03:04:30 +0000

> # Loading the function
> source(“https://www.r-statistics.com/wp-content/uploads/2012/01/source_https.r.txt”) # Making sure we can source code from github
Error in file(filename, “r”, encoding = encoding) :
cannot open connection
In addition: Warning message:
In file(filename, “r”, encoding = encoding) :
URL ‘https://www.r-statistics.com/wp-content/uploads/2012/01/source_https.r.txt’: status was ‘404 Not Found’
> source_https(“https://raw.github.com/talgalili/R-code-snippets/master/siegel.tukey.r”)
Error: could not find function “source_https”

By: Hurstrd198

Hurstrd198 — Sat, 03 Mar 2012 18:25:00 +0000

I found that by changing line 18 the code can handle x and y of different lengths.
print(wilcox.test(data$x[data$y==1],data$x[data$y==y])) should beprint(wilcox.test(data$x[data$y==1],data$x[data$y==2]))

By: Daniel Malter

Daniel Malter — Wed, 29 Feb 2012 20:30:00 +0000

Original post of the corrected code: https://stat.ethz.ch/pipermail/r-help/2012-February/304958.html

By: Daniel Malter

Daniel Malter — Wed, 29 Feb 2012 05:29:00 +0000

The issues have been fixed. Tal will certainly upload the code I sent him soon.

By: Daniel Malter

Daniel Malter — Tue, 28 Feb 2012 16:45:00 +0000

My original function was suitable only if x and y were of the same length. Thanks for fixing this. However, the function as shown above returns incorrect ranks, as well. Take the example in Sheskin’s book:

x=c(4,4,5,5,6,6)
y=c(0,0,1,9,10,10)The ranks for

c(0,0,1,4,4,5,5,6,6,9,10,10)

should be

1, 4, 5, 8, 9, 12, 11, 10, 7, 6, 3, 2

so that the adjusted ranks would be

2.5, 2.5, 5, 8.5, 8.5, 11.5, 11.5, 8.5, 8.5, 6, 2.5, 2.5

However, currently the function returns:

unique values of x tie-adjusted Siegel-Tukey rank 0 3.00 1 4.50 4 8.50 5 11.50 6 8.25 9 6.50 10 2.25

By: melissa

melissa — Thu, 10 Feb 2011 11:29:33 +0000

Hello,

First of all, thank you for sharing your code.
I have got some questions/notices concerning it:
– It seems that the line:
“print(wilcox.test(data$x[data$y==1],data$x[data$y==y]))” provides an error while looking with data with decimals.

(I just remove it and no more errors when there are decimals)

– The adjustment of the medians does not seem to work.
Below is an example of the fact that it does not work:
### adjust.median=F
x<-c(177,200,227,230,232,268,272,297)
y<-c(47,105,126,142,158,172,197,220,225,230,262,270)
siegel.tukey(x,y)
## pval : 0.9385

### adjust.median=T
x<-c(177,200,227,230,232,268,272,297)
y<-c(47,105,126,142,158,172,197,220,225,230,262,270)
siegel.tukey(x,y,adjust.median=T)
## pval : 0.9079

### by adjusting before the medians

x<-c(177,200,227,230,232,268,272,297)
y<-c(47,105,126,142,158,172,197,220,225,230,262,270)
medx<-median(x)
medymedy){
x <-x -(medx-medy)
}
if (medx y <- y -(medy-medx)
}
siegel.tukey(x,y)
### pval: 0.09716
We do not have the same pvalues (at all as you can notice).

Just for your information, these two vectors come from a book (i can give you the reference) and the pvalue they found is 0.0976, by doing the median adjustment.

For the moment, I do not know how to correct it (sorry for that, I will do this adjustment manually before), but I can in the future propose something if you are interested of course.
By the way I’m just wondering whether this test is meaningful if we don’t adjust the medians… for me no but I may be wrong.

Best regards,

Mélissa

By: Tal Galili

Tal Galili — Fri, 17 Dec 2010 19:42:31 +0000

In reply to opossum. Thank you for catching (and reporting!) this opossum. I've updated the code with your corrections.

By: opossum

opossum — Fri, 17 Dec 2010 19:17:28 +0000

The code for determining ranks is buggy. Check, e.g.,

x1 <- c(85, 106)
x2 <- c(96, 105, 104, 108, 86)
iv <- rep(1:2, c(length(x1), length(x2)))
siegel.tukey(c(x1, x2), iv, id.col=TRUE)

The rank for the middle element (104) should be 7, but it's calculated as 8. If N is the length of the combined data, this works:

TF <- rep(c(TRUE, FALSE, FALSE, TRUE), ceiling(N/4))
up <- TF[1:min(N, length(TF))]
Rup <- rank(X)[up]
Rdown <- rev(rank(X)[!up])
Rx <- c(Rup, Rdown)