PDF version (NAG web site
, 64bit version, 64bit version)
NAG Toolbox: nag_nonpar_test_cochranq (g08al)
Purpose
nag_nonpar_test_cochranq (g08al) performs the Cochran $Q$test on crossclassified binary data.
Syntax
Note: the interface to this routine has changed since earlier releases of the toolbox:
At Mark 22: 
n was made optional 
Description
Cochran's $Q$test may be used to test for differences between $k$ treatments applied independently to $n$ individuals or blocks ($k$ related samples of equal size $n$), where the observed response can take only one of two possible values; for example a treatment may result in a ‘success’ or ‘failure’. The data is recorded as either $1$ or $0$ to represent this dichotomization.
The use of this ‘randomized block design’ allows the effect of differences between the blocks to be separated from the differences between the treatments. The test assumes that the blocks were randomly selected from all possible blocks and that the result may be one of two possible outcomes common to all treatments within blocks.
The null and alternative hypotheses to be tested may be stated as follows.
${H}_{0}$ :  the treatments are equally effective, that is the probability of obtaining a $1$ within a block is the same for each treatment. 
${H}_{1}$ :  there is a difference between the treatments, that is the probability of obtaining a $1$ is not the same for different treatments within blocks. 
The data is often represented in the form of a table with the
$n$ rows representing the blocks and the
$k$ columns the treatments. Let
${R}_{\mathit{i}}$ represent the row totals, for
$\mathit{i}=1,2,\dots ,n$, and
${C}_{\mathit{j}}$ represent the column totals, for
$\mathit{j}=1,2,\dots ,k$. Let
${x}_{ij}$ represent the response or result where
${x}_{ij}=0\text{ or}1$.

Treatments 

Blocks 
1 
2 

$k$ 
Row Totals 
1 
${x}_{11}$ 
${x}_{12}$ 
$\cdots $ 
${x}_{1k}$ 
${R}_{1}$ 
2 
${x}_{21}$ 
${x}_{22}$ 
$\cdots $ 
${x}_{2k}$ 
${R}_{2}$ 
$\vdots $ 


$\vdots $ 

$\vdots $ 
$n$ 
${x}_{n1}$ 
${x}_{n2}$ 
$\cdots $ 
${x}_{nk}$ 
${R}_{n}$ 
Column Totals 
${C}_{1}$ 
${C}_{2}$ 

${C}_{k}$ 
$N=\text{Grand Total}$ 
If
${p}_{ij}=\mathrm{Pr}\left({x}_{ij}=1\right)$, for
$i=1,2,\dots ,n$ and
$j=1,2,\dots ,k$, then the hypotheses may be restated as follows
${H}_{0}$ :  ${p}_{i1}={p}_{i2}=\dots ={p}_{ik}$, for each $i=1,2,\dots ,n$. 
${H}_{1}$:  ${p}_{ij}\ne {p}_{ik}$, for some $j$ and $k$, and for some $i$. 
The test statistic is defined as
When the number of blocks,
$n$, is large relative to the number of treatments,
$k$,
$Q$ has an approximate
${\chi}^{2}$distribution with
$k1$ degrees of freedom. This is used to find the probability,
$p$, of obtaining a statistic greater than or equal to the computed value of
$Q$. Thus
$p$ is the upper tail probability associated with the computed value of
$Q$, where the
${\chi}^{2}$distribution is used to approximate the true distribution of
$Q$.
References
Conover W J (1980) Practical Nonparametric Statistics Wiley
Siegel S (1956) Nonparametric Statistics for the Behavioral Sciences McGraw–Hill
Parameters
Compulsory Input Parameters
 1:
$\mathrm{x}\left(\mathit{ldx},{\mathbf{k}}\right)$ – double array

ldx, the first dimension of the array, must satisfy the constraint
$\mathit{ldx}\ge {\mathbf{n}}$.
The matrix of observed zeroone data.
${\mathbf{x}}\left(\mathit{i},\mathit{j}\right)$ must contain the value ${x}_{\mathit{i}\mathit{j}}$, for $\mathit{i}=1,2,\dots ,n$ and $\mathit{j}=1,2,\dots ,k$.
Constraint:
${\mathbf{x}}\left(\mathit{i},\mathit{j}\right)=0.0$ or $1.0$, for $\mathit{i}=1,2,\dots ,n$ and $\mathit{j}=1,2,\dots ,k$.
Optional Input Parameters
 1:
$\mathrm{n}$ – int64int32nag_int scalar

Default:
the first dimension of the array
x.
$n$, the number of blocks.
Constraint:
${\mathbf{n}}\ge 2$.
 2:
$\mathrm{k}$ – int64int32nag_int scalar

Default:
the second dimension of the array
x.
$k$, the number of treatments.
Constraint:
${\mathbf{k}}\ge 2$.
Output Parameters
 1:
$\mathrm{q}$ – double scalar

The value of the Cochran $Q$test statistic.
 2:
$\mathrm{prob}$ – double scalar

The upper tail probability,
$p$, associated with the Cochran
$Q$test statistic, that is the probability of obtaining a value greater than or equal to the observed value (the output value of
q).
 3:
$\mathrm{ifail}$ – int64int32nag_int scalar
${\mathbf{ifail}}={\mathbf{0}}$ unless the function detects an error (see
Error Indicators and Warnings).
Error Indicators and Warnings
Errors or warnings detected by the function:
Cases prefixed with W are classified as warnings and
do not generate an error of type NAG:error_n. See nag_issue_warnings.
 ${\mathbf{ifail}}=1$

On entry,  ${\mathbf{n}}<2$, 
or  ${\mathbf{k}}<2$, 
or  $\mathit{ldx}<{\mathbf{n}}$. 
 ${\mathbf{ifail}}=2$

On entry,  ${\mathbf{x}}\left(i,j\right)\ne 0.0$ or $1.0$ for some $i$ and $j$, $i=1,2,\dots ,n$ and $j=1,2,\dots ,k$. 
 W ${\mathbf{ifail}}=3$

The approximation process used to calculate the tail probability has failed to converge. The result returned in
prob may still be a reasonable approximation.
 ${\mathbf{ifail}}=99$
An unexpected error has been triggered by this routine. Please
contact
NAG.
 ${\mathbf{ifail}}=399$
Your licence key may have expired or may not have been installed correctly.
 ${\mathbf{ifail}}=999$
Dynamic memory allocation failed.
Accuracy
The use of the ${\chi}^{2}$distribution as an approximation to the true distribution of the Cochran $Q$test statistic improves as $k$ increases and as $n$ increases relative to $k$. This approximation should be a reasonable one when the total number of observations left, after omitting those rows containing all $0$ or $1$, is greater than about $25$ and the number of rows left is larger than $5$.
Further Comments
None.
Example
The following example is taken from page 201 of
Conover (1980). The data represents the success of three basketball enthusiasts in predicting the outcome of
$12$ collegiate basketball games, selected at random, using
$1$ for successful prediction of the outcome and
$0$ for unsuccessful prediction. This data is read in and the Cochran
$Q$test statistic and its corresponding upper tail probability are computed and printed.
Open in the MATLAB editor:
g08al_example
function g08al_example
fprintf('g08al example results\n\n');
x = [1, 1, 1;
1, 1, 1;
0, 1, 0;
1, 1, 0;
0, 0, 0;
1, 1, 1;
1, 1, 1;
1, 1, 0;
0, 0, 1;
0, 1, 0;
1, 1, 1;
1, 1, 1];
[ifail] = x04ca( ...
'General', 'Nonunit', x, 'Data matrix');
fprintf('\n');
[q, prob, ifail] = g08al(x);
fprintf('Cochrans Q test statistic = %10.4f\n', q);
fprintf('Degrees of freedom = %5d\n', size(x,2)1);
fprintf('Uppertail probability = %10.4f\n', prob);
g08al example results
Data matrix
1 2 3
1 1.0000 1.0000 1.0000
2 1.0000 1.0000 1.0000
3 0.0000 1.0000 0.0000
4 1.0000 1.0000 0.0000
5 0.0000 0.0000 0.0000
6 1.0000 1.0000 1.0000
7 1.0000 1.0000 1.0000
8 1.0000 1.0000 0.0000
9 0.0000 0.0000 1.0000
10 0.0000 1.0000 0.0000
11 1.0000 1.0000 1.0000
12 1.0000 1.0000 1.0000
Cochrans Q test statistic = 2.8000
Degrees of freedom = 2
Uppertail probability = 0.2466
PDF version (NAG web site
, 64bit version, 64bit version)
© The Numerical Algorithms Group Ltd, Oxford, UK. 2009–2015