NAG Library Routine Document

g02cef (linregm_service_select)

1
Purpose

g02cef takes selected elements from two vectors (typically vectors of means and standard deviations) to form two smaller vectors, and selected rows and columns from two matrices (typically either matrices of sums of squares and cross-products of deviations from means and Pearson product-moment correlation coefficients, or matrices of sums of squares and cross-products about zero and correlation-like coefficients) to form two smaller matrices, allowing reordering of elements in the process.

2
Specification

Fortran Interface
Subroutine g02cef ( n, xbar, std, ssp, ldssp, r, ldr, m, korder, xbar2, std2, ssp2, ldssp2, r2, ldr2, ifail)
Integer, Intent (In):: n, ldssp, ldr, m, korder(m), ldssp2, ldr2
Integer, Intent (Inout):: ifail
Real (Kind=nag_wp), Intent (In):: xbar(n), std(n), ssp(ldssp,n), r(ldr,n)
Real (Kind=nag_wp), Intent (Inout):: ssp2(ldssp2,m), r2(ldr2,m)
Real (Kind=nag_wp), Intent (Out):: xbar2(m), std2(m)
C Header Interface
#include <nagmk26.h>
void  g02cef_ (const Integer *n, const double xbar[], const double std[], const double ssp[], const Integer *ldssp, const double r[], const Integer *ldr, const Integer *m, const Integer korder[], double xbar2[], double std2[], double ssp2[], const Integer *ldssp2, double r2[], const Integer *ldr2, Integer *ifail)

3
Description

Input to the routine consists of:
(a) A vector of means:
x-1,x-2,x-3,,x-n,  
where n is the number of input variables.
(b) A vector of standard deviations:
s1,s2,s3,,sn.  
(c) A matrix of sums of squares and cross-products of deviations from means:
S11 S12 S13 . . . S1n S21 S22 S2n S31 . . . . . . . Sn1 Sn2 . . . . Snn .  
(d) A matrix of correlation coefficients:
R11 R12 R13 . . . R1n R21 R22 R2n R31 . . . . . . . Rn1 Rn2 . . . . Rnn .  
(e) The number of variables, m, in the required subset, and their row/column numbers in the input data, i1,i2,i3,,im,
iikn  for k=1,2,,mn2,m1  and  mn.  
New vectors and matrices are output containing the following information:
(i) A vector of means:
x-i1,x-i2,x-i3,,x-im.  
(ii) A vector of standard deviations:
si1,si2,si3,,sim.  
(iii) A matrix of sums of squares and cross-products of deviations from means:
Si1i1 Si1i2 Si1i3 . . . Si1im Si2i1 Si2i2 . Si3i1 . . . . . . . Simi1 Simi2 . . . . Simim .  
(iv) A matrix of correlation coefficients:
Ri1i1 Ri1i2 Ri1i3 . . . Ri1im Ri2i1 Ri2i2 . Ri3i1 . . . . . . . Rimi1 Rimi2 . . . . Rimim .  
Note:  for sums of squares of cross-products of deviations about zero and correlation-like coefficients Sij and Rij should be replaced by S~ij and R~ij in the description of the input and output above.

4
References

None.

5
Arguments

1:     n – IntegerInput
On entry: n, the number of variables in the input data.
Constraint: n2.
2:     xbarn – Real (Kind=nag_wp) arrayInput
On entry: xbari must be set to x-i, the mean of variable i, for i=1,2,,n.
3:     stdn – Real (Kind=nag_wp) arrayInput
On entry: stdi must be set to si, the standard deviation of variable i, for i=1,2,,n.
4:     sspldsspn – Real (Kind=nag_wp) arrayInput
On entry: sspij must be set to the sum of cross-products of deviations from means Sij (or about zero, S~ij) for variables i and j, for i=1,2,,n and j=1,2,,n.
5:     ldssp – IntegerInput
On entry: the first dimension of the array ssp as declared in the (sub)program from which g02cef is called.
Constraint: ldsspn.
6:     rldrn – Real (Kind=nag_wp) arrayInput
On entry: rij must be set to the Pearson product-moment correlation coefficient Rij (or the correlation-like coefficient, R~ij) for variables i and j, for i=1,2,,n and j=1,2,,n.
7:     ldr – IntegerInput
On entry: the first dimension of the array r as declared in the (sub)program from which g02cef is called.
Constraint: ldrn.
8:     m – IntegerInput
On entry: the number of variables m, required in the reduced vectors and matrices.
Constraint: 1mn.
9:     korderm – Integer arrayInput
On entry: korderi must be set to the number of the original variable which is to be the ith variable in the output vectors and matrices, for i=1,2,,m.
Constraint: 1korderin, for i=1,2,,m.
10:   xbar2m – Real (Kind=nag_wp) arrayOutput
On exit: the mean of variable i, xbari, where i=korderk, for k=1,2,,m. (The array xbar2 must differ from xbar and std.)
11:   std2m – Real (Kind=nag_wp) arrayOutput
On exit: the standard deviation of variable i, stdi, where i=korderk, for k=1,2,,m. (The array std2 must differ from both xbar and std.)
12:   ssp2ldssp2m – Real (Kind=nag_wp) arrayOutput
On exit: ssp2kl contains the value of sspij, where i=korderk and j=korderl, for k=1,2,,m and l=1,2,,m. (The array ssp2 must differ from both ssp and r.)
That is to say: on exit, ssp2kl contains the sum of cross-products of deviations from means Sij (or about zero, S~ij).
13:   ldssp2 – IntegerInput
On entry: the first dimension of the array ssp2 as declared in the (sub)program from which g02cef is called.
Constraint: ldssp2m.
14:   r2ldr2m – Real (Kind=nag_wp) arrayOutput
On exit: r2kl contains the value of rij, where i=korderk and j=korderl, for k=1,2,,m and l=1,2,,m. (The array r2 must differ from both ssp and r.)
That is to say: on exit, r2kl contains the Pearson product-moment coefficient Rij (or the correlation-like coefficient, R~ij).
15:   ldr2 – IntegerInput
On entry: the first dimension of the array r2 as declared in the (sub)program from which g02cef is called.
Constraint: ldr2m.
16:   ifail – IntegerInput/Output
On entry: ifail must be set to 0, -1 or 1. If you are unfamiliar with this argument you should refer to Section 3.4 in How to Use the NAG Library and its Documentation for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value -1 or 1 is recommended. If the output of error messages is undesirable, then the value 1 is recommended. Otherwise, if you are not familiar with this argument, the recommended value is 0. When the value -1 or 1 is used it is essential to test the value of ifail on exit.
On exit: ifail=0 unless the routine detects an error or a warning has been flagged (see Section 6).

6
Error Indicators and Warnings

If on entry ifail=0 or -1, explanatory error messages are output on the current error message unit (as defined by x04aaf).
Errors or warnings detected by the routine:
ifail=1
On entry, m=value.
Constraint: m1.
On entry, n=value.
Constraint: n2.
ifail=2
On entry, n=value and m=value.
Constratint: nm.
ifail=3
On entry, ldr2=value and m=value.
Constraint: ldr2m.
On entry, ldr=value and n=value.
Constraint: ldrn.
On entry, ldssp2=value and m=value.
Constraint: ldssp2m.
On entry, ldssp=value and n=value.
Constraint: ldsspn.
ifail=4
On entry, kordervalue=value and n=value.
Constraint: 1korderin, for i=1,2,,m.
ifail=-99
An unexpected error has been triggered by this routine. Please contact NAG.
See Section 3.9 in How to Use the NAG Library and its Documentation for further information.
ifail=-399
Your licence key may have expired or may not have been installed correctly.
See Section 3.8 in How to Use the NAG Library and its Documentation for further information.
ifail=-999
Dynamic memory allocation failed.
See Section 3.7 in How to Use the NAG Library and its Documentation for further information.

7
Accuracy

Not applicable.

8
Parallelism and Performance

g02cef is not threaded in any implementation.

9
Further Comments

The time taken by g02cef depends on n and m.
The routine is intended primarily for use when a subset of variables from a larger set of variables is to be used in a regression, and is described accordingly. There is however no reason why the routine should not also be used to select specific rows and columns from vectors and arrays which contain any other non-statistical information; the matrices need not be symmetric.
The routine may be used either with sums of squares and cross-products of deviations from means and Pearson product-moment correlation coefficients in connection with a regression involving a constant, or with sums of squares and cross-products about zero and correlation-like coefficients in connection with a regression with no constant.

10
Example

This example reads in the means, standard deviations, sums of squares and cross-products, and correlation coefficients for four variables. New vectors and matrices are created containing the means, standard deviations, sums of squares and cross-products, and correlation coefficients for the fourth, first and second variables (in that order). Finally these new vectors and matrices are printed.

10.1
Program Text

Program Text (g02cefe.f90)

10.2
Program Data

Program Data (g02cefe.d)

10.3
Program Results

Program Results (g02cefe.r)