g08cg:: Nonparametric Statistics (NAG Toolbox)

The

χ^{2}

goodness-of-fit test performed by nag_nonpar_test_chisq (g08cg) is used to test the null hypothesis that a random sample arises from a specified distribution against the alternative hypothesis that the sample does not arise from the specified distribution.

Given a sample of size

n

, denoted by

x_{1}, x_{2}, \dots, x_{n}

, drawn from a random variable

X

, and that the data has been grouped into

k

classes,

\begin{matrix} x \leq c_{1}, \\ c_{i - 1} < x \leq c_{i}, & i = 2, 3, \dots, k - 1, \\ x > c_{k - 1}, \end{matrix}

then the

χ^{2}

goodness-of-fit test statistic is defined by

X^{2} = \sum_{i = 1}^{k} \frac{{(O_{i} - E_{i})}^{2}}{E_{i}},

where

O_{i}

is the observed frequency of the

i

th class, and

E_{i}

is the expected frequency of the

i

th class.

The expected frequencies are computed as

E_{i} = p_{i} \times n,

where

p_{i}

is the probability that

X

lies in the

i

th class, that is

\begin{matrix} p_{1} = P (X \leq c_{1}), \\ p_{i} = P (c_{i - 1} < X \leq c_{i}), & i = 2, 3, \dots, k - 1, \\ p_{k} = P (X > c_{k - 1}) . \end{matrix}

These probabilities are either taken from a common probability distribution or are supplied by you. The available probability distributions within this function are:

Normal distribution with mean $μ$ , variance $σ^{2}$ ;
uniform distribution on the interval $[a, b]$ ;
exponential distribution with probability density function $(pdf) = λ e^{- λ x}$ ;
$χ^{2}$ -distribution with $f$ degrees of freedom; and
gamma distribution with $pdf = \frac{x^{α - 1} e^{- x / β}}{Γ (α) β^{α}}$ .

nag_nonpar_test_chisq (g08cg) returns the

χ^{2}

test statistic,

X^{2}

, together with its degrees of freedom and the upper tail probability from the

χ^{2}

-distribution associated with the test statistic. Note that the use of the

χ^{2}

-distribution as an approximation to the distribution of the test statistic improves as the expected values in each class increase.

References

Parameters

Compulsory Input Parameters

Optional Input Parameters

Output Parameters

Error Indicators and Warnings

Cases prefixed with W are classified as warnings and do not generate an error of type NAG:error_n. See nag_issue_warnings.

Accuracy

Further Comments

Example

This example applies the

χ^{2}

goodness-of-fit test to test whether there is evidence to suggest that a sample of

100

randomly generated observations do not arise from a uniform distribution

U (0, 1)

. The class intervals are calculated such that the interval

(0, 1)

is divided into five equal classes. The frequencies for each class are calculated using nag_stat_frequency_table (g01ae).

function g08cg_example


fprintf('g08cg example results\n\n');

x = [ 0.59 0.23 0.76 0.96 0.20 0.91 0.29 0.22 0.36 0.81 ...
      0.91 0.80 0.17 0.82 0.07 0.74 0.15 0.91 0.26 0.98 ...
      0.59 0.34 0.28 0.95 0.33 0.42 0.72 0.35 0.86 0.22 ...
      0.15 0.39 0.32 0.82 0.13 0.48 0.46 0.74 0.99 0.26 ...
      0.04 0.21 0.04 0.24 0.56 0.36 0.48 0.53 1.00 0.58 ...
      0.50 0.41 0.03 0.38 0.89 0.40 0.66 0.79 0.34 0.94 ...
      0.49 0.12 0.24 0.05 1.00 0.29 0.67 0.29 0.75 0.81 ...
      0.45 0.21 0.51 0.68 0.78 0.20 0.23 0.57 0.25 0.48 ...
      0.96 0.33 0.48 0.55 0.04 0.48 0.42 0.11 0.38 0.73 ...
      0.91 0.45 0.59 0.97 0.27 0.27 0.25 0.99 0.99 0.80];

cb     = [0.2;     0.4;     0.6;     0.8;    1.0 ];
nclass = int64(5);

% Produce frequency table
[~, ifreq, ~, ~, ifail] = ...
  g01ae( ...
         nclass, x, 'cb', cb);

% Test parameters
dist   = 'Uniform';
npest  = int64(0);
par    = [0;  1];
prob   = zeros(nclass,1);

% Perform Chi^2 test
[chisq, p, ndf, eval, chisqi, ifail] = ...
  g08cg( ...
         ifreq, cb, dist, par, npest, prob, 'nclass', nclass);

fprintf('Chi-squared test statistic   = %10.4f\n', chisq);
fprintf('Degrees of freedom.          = %5d\n', ndf);
fprintf('Significance level           = %10.4f\n\n', p);
fprintf('The contributions to the test statistic are :-\n');
disp(chisqi');

g08cg example results

Chi-squared test statistic   =    14.2000
Degrees of freedom.          =     4
Significance level           =     0.0067

The contributions to the test statistic are :-
    3.2000    6.0500    0.4500    4.0500    0.4500

On entry,	$npest < 0$ ,
or	$npest \geq nclass - 1$ .

On entry,	with $dist ='A'$ , $prob (i) \leq 0.0$ for some $i$ , for $i = 1, 2, \dots, k$ ,
or	$\sum_{i = 1}^{k} prob (i) \neq 1.0$ .

NAG Toolbox: nag_nonpar_test_chisq (g08cg)

▸▿ Contents

Purpose

Syntax

Description