G04AGF (PDF version)
G04 Chapter Contents
G04 Chapter Introduction
NAG Library Manual

NAG Library Routine Document

G04AGF

Note:  before using this routine, please read the Users' Note for your implementation to check the interpretation of bold italicised terms and other implementation-dependent details.

1  Purpose

G04AGF performs an analysis of variance for a two-way hierarchical classification with subgroups of possibly unequal size, and also computes the treatment group and subgroup means. A fixed effects model is assumed.

2  Specification

SUBROUTINE G04AGF ( Y, N, K, LSUB, NOBS, L, NGP, GBAR, SGBAR, GM, SS, IDF, F, FP, IFAIL)
INTEGER  N, K, LSUB(K), NOBS(L), L, NGP(K), IDF(4), IFAIL
REAL (KIND=nag_wp)  Y(N), GBAR(K), SGBAR(L), GM, SS(4), F(2), FP(2)

3  Description

In a two-way hierarchical classification, there are $k$ ($\geq 2$) treatment groups, the $i$th of which is subdivided into $l_i$ treatment subgroups. The $j$th subgroup of group $i$ contains $n_{ij}$ observations, which may be denoted by
$$y_{1ij}, y_{2ij}, \ldots, y_{n_{ij}ij}.$$
The general observation is denoted by $y_{mij}$, being the $m$th observation in subgroup $j$ of group $i$, for $1 \leq i \leq k$, $1 \leq j \leq l_i$, $1 \leq m \leq n_{ij}$.
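The following quantities are computed:
(i) The subgroup means
$$\bar{y}_{.ij} = \frac{\sum_{m=1}^{n_{ij}} y_{mij}}{n_{ij}}$$
(ii) The group means
$$\bar{y}_{.i.} = \frac{\sum_{j=1}^{l_i} \sum_{m=1}^{n_{ij}} y_{mij}}{\sum_{j=1}^{l_i} n_{ij}}$$
(iii) The grand mean
$$\bar{y} = \frac{\sum_{i=1}^{k} \sum_{j=1}^{l_i} \sum_{m=1}^{n_{ij}} y_{mij}}{\sum_{i=1}^{k} \sum_{j=1}^{l_i} n_{ij}}$$
(iv) The number of observations in each group
$$n_{i.} = \sum_{j=1}^{l_i} n_{ij}$$
(v) Sums of squares
Between groups:
$$SS_g = \sum_{i=1}^{k} n_{i.} \left( \bar{y}_{.i.} - \bar{y} \right)^2$$
Between subgroups within groups:
$$SS_{sg} = \sum_{i=1}^{k} \sum_{j=1}^{l_i} n_{ij} \left( \bar{y}_{.ij} - \bar{y}_{.i.} \right)^2$$
Residual (within subgroups):
$$SS_{res} = \sum_{i=1}^{k} \sum_{j=1}^{l_i} \sum_{m=1}^{n_{ij}} \left( y_{mij} - \bar{y}_{.ij} \right)^2 = SS_{tot} - SS_g - SS_{sg}$$
Corrected total:
$$SS_{tot} = \sum_{i=1}^{k} \sum_{j=1}^{l_i} \sum_{m=1}^{n_{ij}} \left( y_{mij} - \bar{y} \right)^2$$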
(vi) Degrees of freedom of variance components
Between groups: $k - 1$
Subgroups within groups: $l - k$
Residual: $n - l$
Total: $n - 1$
where
  • $l = \sum_{i=1}^{k} l_i$,
  • $n = \sum_{i=1}^{k} n_{i.}$.
(vii) F ratios. These are the ratios of the group and subgroup mean squares to the residual mean square.
Groups:
$$F_1 = \frac{\text{Between groups sum of squares} / (k-1)}{\text{Residual sum of squares} / (n-l)} = \frac{SS_g / (k-1)}{SS_{res} / (n-l)}$$
Subgroups:
$$F_2 = \frac{\text{Between subgroups (within groups) sum of squares} / (l-k)}{\text{Residual sum of squares} / (n-l)} = \frac{SS_{sg} / (l-k)}{SS_{res} / (n-l)}$$
If either F ratio exceeds 9999.0, the value 9999.0 is assigned instead.
(viii) F significances. These are the probabilities of obtaining a value from the appropriate F-distribution which exceeds the computed mean square ratio.
Groups: $p_1 = \mathrm{Prob}\left( F_{k-1,\,n-l} > F_1 \right)$
Subgroups: $p_2 = \mathrm{Prob}\left( F_{l-k,\,n-l} > F_2 \right)$
where $F_{\nu_1,\nu_2}$ denotes the central F-distribution with degrees of freedom $\nu_1$ and $\nu_2$.
If any $F_i = 9999.0$, then $p_i$ is set to zero, $i = 1, 2$.
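
The quantities (i) to (vii) can be illustrated with a minimal Python sketch that follows the definitions above directly. It is not the routine's implementation: the function name nested_anova, the nested-list input layout and the small data set are all hypothetical, and the sketch assumes $l > k$ and a non-zero residual sum of squares so that both F ratios are defined. The significances in (viii) are omitted because they need the upper tail of an F-distribution, which the Python standard library does not provide.

```python
def nested_anova(groups):
    """groups[i][j] is the list of observations in subgroup j of group i.

    Hypothetical helper: evaluates quantities (i)-(vii) from their definitions.
    Assumes l > k and a non-zero residual sum of squares.
    """
    k = len(groups)                              # number of groups
    l = sum(len(g) for g in groups)              # total number of subgroups
    n = sum(len(s) for g in groups for s in g)   # total number of observations

    # (i) subgroup means, (iv) group sizes, (ii) group means, (iii) grand mean
    sub_means = [[sum(s) / len(s) for s in g] for g in groups]
    group_n = [sum(len(s) for s in g) for g in groups]
    group_means = [sum(sum(s) for s in g) / ni for g, ni in zip(groups, group_n)]
    grand_mean = sum(sum(s) for g in groups for s in g) / n

    # (v) sums of squares, each computed directly from its definition
    ss_g = sum(ni * (gm - grand_mean) ** 2
               for ni, gm in zip(group_n, group_means))
    ss_sg = sum(len(s) * (sm - gm) ** 2
                for g, sms, gm in zip(groups, sub_means, group_means)
                for s, sm in zip(g, sms))
    ss_res = sum((y - sm) ** 2
                 for g, sms in zip(groups, sub_means)
                 for s, sm in zip(g, sms)
                 for y in s)
    ss_tot = sum((y - grand_mean) ** 2 for g in groups for s in g for y in s)

    # (vi) degrees of freedom: groups, subgroups within groups, residual, total
    df = (k - 1, l - k, n - l, n - 1)

    # (vii) F ratios against the residual mean square, capped at 9999.0
    ms_res = ss_res / (n - l)
    f1 = min((ss_g / (k - 1)) / ms_res, 9999.0)
    f2 = min((ss_sg / (l - k)) / ms_res, 9999.0)

    return {"sub_means": sub_means, "group_n": group_n,
            "group_means": group_means, "grand_mean": grand_mean,
            "ss_g": ss_g, "ss_sg": ss_sg, "ss_res": ss_res, "ss_tot": ss_tot,
            "df": df, "f": (f1, f2)}


# Made-up data: 2 groups with 2 and 3 subgroups of unequal size.
data = [[[1.0, 2.0, 3.0], [4.0, 6.0]],
        [[7.0, 9.0], [10.0, 11.0, 12.0], [8.0]]]
result = nested_anova(data)
```

Computing the residual and total sums of squares independently, rather than obtaining one by subtraction, gives a simple check: the three components should add up to the corrected total to within rounding error.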

4  References

Kendall M G and Stuart A (1976) The Advanced Theory of Statistics (Volume 3) (3rd Edition) Griffin
Moore P G, Shirley E A and Edwards D E (1972) Standard Statistical Calculations Pitman

5  Parameters

1:     Y(N) – REAL (KIND=nag_wp) array – Input
On entry: the elements of Y must contain the observations $y_{mij}$ in the following order:
$$y_{111}, y_{211}, \ldots, y_{n_{11}11}, y_{112}, y_{212}, \ldots, y_{n_{12}12}, \ldots, y_{11l_1}, \ldots, y_{n_{1l_1}1l_1}, \ldots, y_{1ij}, \ldots, y_{n_{ij}ij}, \ldots, y_{1kl_k}, \ldots, y_{n_{kl_k}kl_k}.$$
In words, the ordering is by group, and within each group is by subgroup, the members of each subgroup being in consecutive locations in Y.
2:     N – INTEGER – Input
On entry: $n$, the total number of observations.
3:     K – INTEGER – Input
On entry: $k$, the number of groups.
Constraint: $\mathrm{K} \geq 2$.
4:     LSUB(K) – INTEGER array – Input
On entry: the number of subgroups within group $i$, $l_i$, for $i = 1, 2, \ldots, k$.
Constraint: $\mathrm{LSUB}(i) > 0$, for $i = 1, 2, \ldots, k$.
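5:     NOBS(L) – INTEGER array – Input
On entry: the numbers of observations in each subgroup, $n_{ij}$, in the following order:
$$n_{11}, n_{12}, \ldots, n_{1l_1}, n_{21}, \ldots, n_{2l_2}, \ldots, n_{k1}, \ldots, n_{kl_k}.$$
Constraint: $n = \sum_{i=1}^{k} \sum_{j=1}^{l_i} n_{ij}$, that is $\mathrm{N} = \sum_{i=1}^{l} \mathrm{NOBS}(i)$, and $\mathrm{NOBS}(i) > 0$, for $i = 1, 2, \ldots, l$.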
6:     L – INTEGER – Input
On entry: $l$, the total number of subgroups.
Constraint: $\mathrm{L} = \sum_{i=1}^{k} \mathrm{LSUB}(i)$.
7:     NGP(K) – INTEGER array – Output
On exit: the total number of observations in group $i$, $n_{i.}$, for $i = 1, 2, \ldots, k$.
8:     GBAR(K) – REAL (KIND=nag_wp) array – Output
On exit: the mean for group $i$, $\bar{y}_{.i.}$, for $i = 1, 2, \ldots, k$.
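9:     SGBAR(L) – REAL (KIND=nag_wp) array – Output
On exit: the subgroup means, $\bar{y}_{.ij}$, in the following order:
$$\bar{y}_{.11}, \bar{y}_{.12}, \ldots, \bar{y}_{.1l_1}, \bar{y}_{.21}, \bar{y}_{.22}, \ldots, \bar{y}_{.2l_2}, \ldots, \bar{y}_{.k1}, \bar{y}_{.k2}, \ldots, \bar{y}_{.kl_k}.$$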
10:   GM – REAL (KIND=nag_wp) – Output
On exit: the grand mean, $\bar{y}$.
11:   SS(4) – REAL (KIND=nag_wp) array – Output
On exit: contains the sums of squares for the analysis of variance, as follows:
  • SS(1) = Between groups sum of squares, $SS_g$,
  • SS(2) = Between subgroups within groups sum of squares, $SS_{sg}$,
  • SS(3) = Residual sum of squares, $SS_{res}$,
  • SS(4) = Corrected total sum of squares, $SS_{tot}$.
12:   IDF(4) – INTEGER array – Output
On exit: contains the degrees of freedom attributable to each sum of squares in the analysis of variance, as follows:
  • IDF(1) = Degrees of freedom for between groups sum of squares,
  • IDF(2) = Degrees of freedom for between subgroups within groups sum of squares,
  • IDF(3) = Degrees of freedom for residual sum of squares,
  • IDF(4) = Degrees of freedom for corrected total sum of squares.
13:   F(2) – REAL (KIND=nag_wp) array – Output
On exit: contains the mean square ratios, $F_1$ and $F_2$, for the between groups variation and the between subgroups within groups variation respectively, each taken with respect to the residual.
14:   FP(2) – REAL (KIND=nag_wp) array – Output
On exit: contains the significances of the mean square ratios, $p_1$ and $p_2$ respectively.
15:   IFAIL – INTEGER – Input/Output
On entry: IFAIL must be set to 0, -1 or 1. If you are unfamiliar with this parameter you should refer to Section 3.3 in the Essential Introduction for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value -1 or 1 is recommended. If the output of error messages is undesirable, then the value 1 is recommended. Otherwise, if you are not familiar with this parameter, the recommended value is 0. When the value -1 or 1 is used it is essential to test the value of IFAIL on exit.
On exit: IFAIL=0 unless the routine detects an error or a warning has been flagged (see Section 6).
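
The storage order required for Y, LSUB and NOBS, and the order in which SGBAR is returned, can be illustrated with a small Python sketch. The helper names flatten_design and split_by_group and the data are hypothetical and are not part of the NAG Library; note also that Python lists are indexed from 0, whereas the Fortran arrays described above are indexed from 1.

```python
from itertools import accumulate


def flatten_design(groups):
    """Build the input arguments from nested data.

    groups[i][j] is the list of observations in subgroup j of group i.
    Returns (y, n, k, lsub, nobs, l) in the order described for Y, N, K,
    LSUB, NOBS and L: by group, then by subgroup within each group.
    """
    y = [obs for g in groups for s in g for obs in s]
    lsub = [len(g) for g in groups]              # subgroups per group
    nobs = [len(s) for g in groups for s in g]   # observations per subgroup
    return y, len(y), len(groups), lsub, nobs, len(nobs)


def split_by_group(flat, lsub):
    """Regroup a per-subgroup array (laid out like NOBS or SGBAR) by group."""
    ends = list(accumulate(lsub))
    starts = [0] + ends[:-1]
    return [flat[a:b] for a, b in zip(starts, ends)]


# Made-up data: 2 groups with 2 and 3 subgroups of unequal size.
groups = [[[1.0, 2.0, 3.0], [4.0, 6.0]],
          [[7.0, 9.0], [10.0, 11.0, 12.0], [8.0]]]
y, n, k, lsub, nobs, l = flatten_design(groups)
nobs_by_group = split_by_group(nobs, lsub)
```

Because SGBAR shares the layout of NOBS, the same split_by_group call could be used to recover the subgroup means of each group from the flat output array.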

6  Error Indicators and Warnings

If on entry IFAIL=0 or -1, explanatory error messages are output on the current error message unit (as defined by X04AAF).
Errors or warnings detected by the routine:
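IFAIL=1
On entry, $\mathrm{K} \leq 1$.
IFAIL=2
On entry, $\mathrm{LSUB}(i) \leq 0$, for some $i = 1, 2, \ldots, k$.
IFAIL=3
On entry, $\mathrm{L} \neq \sum_{i=1}^{k} \mathrm{LSUB}(i)$.
IFAIL=4
On entry, $\mathrm{NOBS}(i) \leq 0$, for some $i = 1, 2, \ldots, l$.
IFAIL=5
On entry, $\mathrm{N} \neq \sum_{i=1}^{l} \mathrm{NOBS}(i)$.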
IFAIL=6
The total corrected sum of squares is zero, indicating that all the data values are equal. The means returned are therefore all equal, and the sums of squares are zero. No assignments are made to IDF, F, and FP.
IFAIL=7
The residual sum of squares is zero. This arises when either each subgroup contains exactly one observation, or the observations within each subgroup are equal. The means, sums of squares, and degrees of freedom are computed, but no assignments are made to F and FP.
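
The input conditions behind IFAIL=1 to 5 can be checked before calling the routine. The following Python sketch mirrors those conditions; the function check_inputs is hypothetical, and testing the conditions in ascending order of error code is an assumption, since this document does not state the order in which the routine itself tests them. IFAIL=6 and 7 depend on the data values and are not covered.

```python
def check_inputs(n, k, lsub, nobs, l):
    """Return the first IFAIL-style code (1-5) whose condition holds, else 0.

    Hypothetical pre-check: lsub is assumed to have k entries and nobs
    to have l entries, matching the declared array sizes.
    """
    if k <= 1:                          # IFAIL=1: K <= 1
        return 1
    if any(li <= 0 for li in lsub):     # IFAIL=2: some LSUB(i) <= 0
        return 2
    if l != sum(lsub):                  # IFAIL=3: L /= sum of LSUB
        return 3
    if any(ni <= 0 for ni in nobs):     # IFAIL=4: some NOBS(i) <= 0
        return 4
    if n != sum(nobs):                  # IFAIL=5: N /= sum of NOBS
        return 5
    return 0


# Made-up design: 2 groups with 2 and 3 subgroups, 11 observations in total.
ok = check_inputs(11, 2, [2, 3], [3, 2, 2, 3, 1], 5)
bad_total = check_inputs(12, 2, [2, 3], [3, 2, 2, 3, 1], 5)
```

Here ok is 0 because every constraint is satisfied, while bad_total is 5 because N does not equal the sum of the subgroup sizes.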

7  Accuracy

The computations are believed to be stable.

8  Further Comments

The time taken by G04AGF increases approximately linearly with the total number of observations, n.

9  Example

This example has two groups, the first of which consists of five subgroups, and the second of three subgroups. The numbers of observations in each subgroup are not equal. The data represent the percentage stretch in the length of samples of sack kraft drawn from consignments (subgroups) received over two years (groups). For details see Moore et al. (1972).

9.1  Program Text

Program Text (g04agfe.f90)

9.2  Program Data

Program Data (g04agfe.d)

9.3  Program Results

Program Results (g04agfe.r)



© The Numerical Algorithms Group Ltd, Oxford, UK. 2012