NAG CL Interface
g08agc (test_wilcoxon)
1
Purpose
g08agc performs the Wilcoxon signed rank test on a single sample of size .
2
Specification
void |
g08agc (Integer n,
const double x[],
double median,
Nag_TailProbability tail,
Nag_IncSignZeros zeros,
double *w,
double *z,
double *p,
Integer *non_zero,
NagError *fail) |
|
The function may be called by the names: g08agc, nag_nonpar_test_wilcoxon or nag_wilcoxon_test.
3
Description
The Wilcoxon one sample signed rank test may be used to test whether a particular sample came from a population with a specified median. It is assumed that the population distribution is symmetric. The data consist of a single sample of observations denoted by . This sample may arise from the difference between pairs of observations from two matched samples of equal size taken from two populations, in which case the test may be used to test whether the median of the first population is the same as that of the second population.
The hypothesis under test,
, often called the null hypothesis, is that the median is equal to some given value
, and this is to be tested against an alternative hypothesis
which is
- : population median ; or
- : population median ; or
- : population median ,
using a two tailed, upper tailed or lower tailed probability respectively. You select the alternative hypothesis by choosing the appropriate tail probability to be computed (see the description of argument
tail in
Section 5).
The Wilcoxon test differs from the Sign test (see
g08aac) in that the magnitude of the scores is taken into account, rather than simply the direction of such scores.
The test procedure is as follows:
-
(a)For each , for , the signed difference is found, where is a given test value for the median of the sample.
-
(b)The absolute differences are ranked with rank and any tied values of are assigned the average of the tied ranks. You may choose whether or not to ignore any cases where by removing them before or after ranking (see the description of the argument zeros in Section 5).
-
(c)The number of nonzero 's is found.
-
(d)To each rank is affixed the sign of the to which it corresponds. Let .
-
(e)The sum of the positive-signed ranks, , is calculated.
g08agc returns:
-
(a)The test statistic ;
-
(b)The number of nonzero 's;
-
(c)The approximate Normal test statistic , where
-
(d)The tail probability, , corresponding to , depending on the choice of the alternative hypothesis, .
If , is computed exactly; otherwise, an approximation to is returned based on an approximate Normal statistic corrected for continuity according to the tail specified.
The value of can be used to perform a significance test on the median against the alternative hypothesis. Let be the size of the significance test (that is, is the probability of rejecting when is true). If then the null hypothesis is rejected. Typically might be or 0.01.
4
References
Conover W J (1980) Practical Nonparametric Statistics Wiley
Neumann N (1988) Some procedures for calculating the distributions of elementary nonparametric teststatistics Statistical Software Newsletter 14(3) 120–126
Siegel S (1956) Non-parametric Statistics for the Behavioral Sciences McGraw–Hill
5
Arguments
-
1:
– Integer
Input
-
On entry: the size of the sample, .
Constraint:
.
-
2:
– const double
Input
-
On entry: the sample observations, .
-
3:
– double
Input
-
On entry: the median test value, .
-
4:
– Nag_TailProbability
Input
-
On entry: indicates the choice of tail probability, and hence the alternative hypothesis.
- A two tailed probability is calculated and the alternative hypothesis is : population median .
- An upper tailed probability is calculated and the alternative hypothesis is : population median .
- A lower tailed probability is calculated and the alternative hypothesis is : population median .
Constraint:
, or .
-
5:
– Nag_IncSignZeros
Input
-
On entry: indicates whether or not to include the cases where
in the ranking of the
's.
- All are included when ranking.
- All , are ignored, that is all cases where are removed before ranking.
Constraint:
or .
-
6:
– double *
Output
-
On exit: the Wilcoxon rank sum statistic, , being the sum of the positive ranks.
-
7:
– double *
Output
-
On exit: the approximate Normal test statistic,
, as described in
Section 3.
-
8:
– double *
Output
-
On exit: the tail probability,
, as specified by the argument
tail.
-
9:
– Integer *
Output
-
On exit: the number of nonzero 's, .
-
10:
– NagError *
Input/Output
-
The NAG error argument (see
Section 7 in the Introduction to the NAG Library CL Interface).
6
Error Indicators and Warnings
- NE_ALLOC_FAIL
-
Dynamic memory allocation failed.
- NE_BAD_PARAM
-
On entry, argument
tail had an illegal value.
On entry, argument
zeros had an illegal value.
- NE_G08AG_SAMP_IDEN
-
The whole sample is identical to the given median test value.
- NE_INT_ARG_LT
-
On entry, .
Constraint: .
- NE_INTERNAL_ERROR
-
An internal error has occurred in this function. Check the function call and any array sizes. If the call is correct then please contact
NAG for assistance.
7
Accuracy
The approximation used to calculate when will return a value with a relative error of less than 10 percent for most cases. The error may increase for cases where there are a large number of ties in the sample.
8
Parallelism and Performance
g08agc is not threaded in any implementation.
The time taken by g08agc increases with , until , from which point on the approximation is used. The time decreases significantly at this point and increases again modestly with for .
10
Example
The example program performs the Wilcoxon signed rank test on two matched samples of size , taken from two populations. The distribution of the differences between pairs of observations from the two populations is assumed to be symmetric. The test is used to test whether the medians of the two distributions of the populations are equal or not. The test statistic, the approximate Normal statistic and the two tailed probability are computed and printed.
10.1
Program Text
10.2
Program Data
10.3
Program Results