g02dg:: Correlation and Regression Analysis (NAG Toolbox)

nag_correg_linregm_fit (g02da) computes a $QR$ decomposition of the matrix of $p$ independent variables and also, if the model is not of full rank, a singular value decomposition (SVD). These results can be used to compute estimates of the parameters for a general linear model with a new dependent variable. The $QR$ decomposition leads to the formation of an upper triangular $p$ by $p$ matrix $R$ and an $n$ by $n$ orthogonal matrix $Q$. In addition, the vector $c = Q^{T} y$ (or $Q^{T} W^{1/2} y$) is computed. For a new dependent variable, $y_{new}$, nag_correg_linregm_fit_newvar (g02dg) computes a new value of $c = Q^{T} y_{new}$ or $Q^{T} W^{1/2} y_{new}$.

If $R$ is of full rank, then the least squares parameter estimates, $\hat{\beta}$, are the solution to $R \hat{\beta} = c_{1}$, where $c_{1}$ is the first $p$ elements of $c$.

If $R$ is not of full rank, then nag_correg_linregm_fit (g02da) will have computed an SVD of $R$,

$$R = Q_{*} \begin{pmatrix} D & 0 \\ 0 & 0 \end{pmatrix} P^{T},$$

where $D$ is a $k$ by $k$ diagonal matrix with nonzero diagonal elements, $k$ being the rank of $R$, and $Q_{*}$ and $P$ are $p$ by $p$ orthogonal matrices. This gives the solution

$$\hat{\beta} = P_{1} D^{-1} Q_{*_{1}}^{T} c_{1},$$

$P_{1}$ being the first $k$ columns of $P$, i.e., $P = (P_{1} \; P_{0})$, and $Q_{*_{1}}$ being the first $k$ columns of $Q_{*}$. Details of the SVD are made available by nag_correg_linregm_fit (g02da) in the form of the matrix $P^{*}$:

$$P^{*} = \begin{pmatrix} D^{-1} P_{1}^{T} \\ P_{0}^{T} \end{pmatrix}.$$

The matrix $Q_{*}$ is made available through the workspace of nag_correg_linregm_fit (g02da).
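The link between the solution formula and the stored matrix $P^{*}$ can be made explicit. The short derivation below is a sketch based only on the definitions above. The notation $P^{*}_{(1:k)}$ for the first $k$ rows of $P^{*}$ is introduced here for illustration and is not NAG notation.

```latex
\begin{aligned}
P^{*}_{(1:k)} &= D^{-1} P_{1}^{T}
  && \text{(first $k$ rows of $P^{*}$)} \\
\left(P^{*}_{(1:k)}\right)^{T} &= P_{1} D^{-T} = P_{1} D^{-1}
  && \text{($D$ is diagonal, so $D^{-T} = D^{-1}$)} \\
\hat{\beta} &= P_{1} D^{-1} Q_{*_{1}}^{T} c_{1}
  = \left(P^{*}_{(1:k)}\right)^{T} Q_{*_{1}}^{T} c_{1}
\end{aligned}
```

Under these definitions, the estimate for a new dependent variable needs only the new $c_{1}$ together with the first $k$ rows of $P^{*}$ and the first $k$ columns of $Q_{*}$; no new decomposition is required.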


Example

A dataset consisting of 12 observations with four independent variables and two dependent variables is read in. A model with all four independent variables is fitted to the first dependent variable by nag_correg_linregm_fit (g02da) and the results printed. The model is then fitted to the second dependent variable by nag_correg_linregm_fit_newvar (g02dg) and those results printed.

function g02dg_example


fprintf('g02dg example results\n\n');

x = [1, 0, 0, 0;
     0, 0, 0, 1;
     0, 1, 0, 0;
     0, 0, 1, 0;
     0, 0, 0, 1;
     0, 1, 0, 0;
     0, 0, 0, 1;
     1, 0, 0, 0;
     0, 0, 1, 0;
     1, 0, 0, 0;
     0, 0, 1, 0;
     0, 1, 0, 0];
y = [33.63;     39.62;     38.18;     41.46;     38.02;     35.83;
     35.99;     36.58;     42.92;     37.80;     40.43;     37.89];
ynew = [63; 69; 68; 71; 68; 65; 65; 66; 72; 67; 70; 67];

[n,m]  = size(x);
isx    = ones(m,1,'int64');
mean_p = 'M';
ip     = int64(m+1);

% Fit general linear regression model to y
[rss, idf, b, se, covar, res, h, q, svd, irank, p, wk, ifail] = ...
  g02da(mean_p, x, isx, ip, y);

% Display results for y
fprintf('Results for original y-variable using g02da\n\n');
if svd
  fprintf('Model not of full rank\n\n');
end
fprintf('Residual sum of squares = %12.4e\n', rss);
fprintf('Degrees of freedom      = %4d\n', idf);
fprintf('\nVariable   Parameter estimate   Standard error\n\n');
ivar = double([1:ip]');
fprintf('%6d%20.4e%20.4e\n',[ivar b se]');

% Fit same model to ynew
[rss, covar, q, b, se, res, ifail] = ...
    g02dg( ...
           rss, ip, irank, covar, q, svd, p, ynew, wk);

% Display results for ynew
fprintf('\nResults for second y-variable using g02dg\n\n');
fprintf('Residual sum of squares = %12.4e\n', rss);
fprintf('Degrees of freedom      = %4d\n', idf);
fprintf('\nVariable   Parameter estimate   Standard error\n\n');
ivar = double([1:ip]');
fprintf('%6d%20.4e%20.4e\n',[ivar b se]');

g02dg example results

Results for original y-variable using g02da

Model not of full rank

Residual sum of squares =   2.2227e+01
Degrees of freedom      =    8

Variable   Parameter estimate   Standard error

     1          3.0557e+01          3.8494e-01
     2          5.4467e+00          8.3896e-01
     3          6.7433e+00          8.3896e-01
     4          1.1047e+01          8.3896e-01
     5          7.3200e+00          8.3896e-01

Results for second y-variable using g02dg

Residual sum of squares =   2.4000e+01
Degrees of freedom      =    8

Variable   Parameter estimate   Standard error

     1          5.4067e+01          4.0000e-01
     2          1.1267e+01          8.7178e-01
     3          1.2600e+01          8.7178e-01
     4          1.6933e+01          8.7178e-01
     5          1.3267e+01          8.7178e-01
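The second fit can be cross-checked without the NAG Toolbox. The following is a minimal sketch in base MATLAB, assuming the model has a mean term plus the four indicator columns (so $p = 5$) and no weights. It follows the steps in the Description (a $QR$ decomposition, $c = Q^{T} y_{new}$, then an SVD of $R$ because the model is not of full rank), but it is not the NAG implementation. The rank tolerance and all variable names are assumptions made for illustration.

```matlab
% Sketch only: base MATLAB re-derivation, not the NAG routines.
x = [1,0,0,0; 0,0,0,1; 0,1,0,0; 0,0,1,0; 0,0,0,1; 0,1,0,0;
     0,0,0,1; 1,0,0,0; 0,0,1,0; 1,0,0,0; 0,0,1,0; 0,1,0,0];
ynew = [63; 69; 68; 71; 68; 65; 65; 66; 72; 67; 70; 67];

n = size(x, 1);
X = [ones(n, 1), x];           % mean term plus four indicators
p = size(X, 2);

% Done once per design matrix: QR decomposition.
[Q, R] = qr(X);                % Q is n-by-n, R is n-by-p
R = R(1:p, :);                 % upper triangular p-by-p block

% Done per dependent variable: c = Q' * y, keep the first p elements.
c  = Q' * ynew;
c1 = c(1:p);

% Rank-deficient case: SVD of R, R = Qs * S * P'.
[Qs, S, P] = svd(R);
d   = diag(S);
tol = max(size(R)) * eps(max(d));   % assumed tolerance (as used by rank())
k   = sum(d > tol);                 % numerical rank of R
P1  = P(:, 1:k);
Qs1 = Qs(:, 1:k);
D   = d(1:k);

% beta_hat = P1 * inv(D) * Qs1' * c1
b = P1 * ((Qs1' * c1) ./ D);

% Residual sum of squares, degrees of freedom, standard errors.
rss  = sum((ynew - X * b).^2);
idf  = n - k;
covb = (rss / idf) * (P1 * diag(1 ./ D.^2) * P1');
se   = sqrt(diag(covb));

fprintf('Residual sum of squares = %12.4e\n', rss);
fprintf('Degrees of freedom      = %4d\n', idf);
fprintf('%6d%20.4e%20.4e\n', [(1:p)', b, se]');
```

Because the SVD-based formula returns the minimum-norm least squares solution, the estimates and standard errors printed by this sketch should agree with the g02dg results listed above to the displayed precision.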

Error Indicators and Warnings

On entry, $ip < 1$,
or $n < ip$,
or $irank \leq 0$,
or $svd = false$ and $irank \neq ip$,
or $svd = true$ and $irank > ip$,
or $ldq < n$,
or $rss \leq 0.0$,
or $weight \neq$ 'U' or 'W'.

