g02hd:: Correlation and Regression Analysis (NAG Toolbox)

For the linear regression model

y = X θ + ε,

where	$y$ is a vector of length $n$ of the dependent variable,
	$X$ is a $n$ by $m$ matrix of independent variables of column rank $k$ ,
	$θ$ is a vector of length $m$ of unknown arguments,
and	$ε$ is a vector of length $n$ of unknown errors with var $(ε_{i}) = σ^{2}$ ,

nag_correg_robustm_user (g02hd) calculates the M-estimates given by the solution,

\hat{θ}

, to the equation

\sum_{i = 1}^{n} ψ (r_{i} / (σ w_{i})) w_{i} x_{i j} = 0, j = 1, 2, \dots, m,

(1)

where	$r_{i}$ is the $i$ th residual, i.e., the $i$ th element of the vector $r = y - X \hat{θ}$ ,
	$ψ$ is a suitable weight function,
	$w_{i}$ are suitable weights such as those that can be calculated by using output from nag_correg_robustm_wts (g02hb),
and	$σ$ may be estimated at each iteration by the median absolute deviation of the residuals $\hat{σ} = {med}_{i} [\|r_{i}\|] / β_{1}$

or as the solution to

\sum_{i = 1}^{n} χ (r_{i} / (\hat{σ} w_{i})) w_{i}^{2} = (n - k) β_{2}

for a suitable weight function

χ

, where

β_{1}

and

β_{2}

are constants, chosen so that the estimator of

σ

is asymptotically unbiased if the errors,

ε_{i}

, have a Normal distribution. Alternatively

σ

may be held at a constant value.

The above describes the Schweppe type regression. If the

w_{i}

are assumed to equal

1

for all

i

, then Huber type regression is obtained. A third type, due to Mallows, replaces (1) by

\sum_{i = 1}^{n} ψ (r_{i} / σ) w_{i} x_{i j} = 0, j = 1, 2, \dots, m .

This may be obtained by use of the transformations

\begin{array}{l} w_{i}^{*} & \leftarrow \sqrt{w_{i}} \\ y_{i}^{*} & \leftarrow y_{i} \sqrt{w_{i}} \\ x_{i j}^{*} & \leftarrow x_{i j} \sqrt{w_{i}}, j = 1, 2, \dots, m \end{array}

(see Marazzi (1987)).

The calculation of the estimates of

θ

can be formulated as an iterative weighted least squares problem with a diagonal weight matrix

G

given by

G_{i i} = \{\begin{array}{cl} \frac{ψ (r_{i} / (σ w_{i}))}{(r_{i} / (σ w_{i}))}, & r_{i} \neq 0 \\ ψ^{'} (0), & r_{i} = 0 . \end{array} .

The value of

θ

at each iteration is given by the weighted least squares regression of

y

X

. This is carried out by first transforming the

y

and

X

\begin{array}{l} {\tilde{y}}_{i} & = y_{i} \sqrt{G_{i i}} \\ {\tilde{x}}_{i j} & = x_{i j} \sqrt{G_{i i}}, j = 1, 2, \dots, m \end{array}

and then using nag_linsys_real_gen_solve (f04jg) . If

X

is of full column rank then an orthogonal-triangular (

Q R

) decomposition is used; if not, a singular value decomposition is used.

References

Parameters

Compulsory Input Parameters

Optional Input Parameters

Output Parameters

Error Indicators and Warnings

Cases prefixed with W are classified as warnings and do not generate an error of type NAG:error_n. See nag_issue_warnings.

Accuracy

Further Comments

Example

function g02hd_example


fprintf('g02hd example results\n\n');

global dchi
dchi = 1.5;

x = [1, -1, -1;
     1, -1,  1;
     1,  1, -1;
     1,  1,  1;
     1,  0,  3];
y   = [10.5;    11.3;     12.6;     13.4;     17.1   ];
wgt = [0.4039;   0.5012;   0.4039;   0.5012;   0.3862];

[n,m] = size(x);

% Calculate beta
[beta] = betcal(wgt);

% Control Parameters
psip0 = 1;
indw = int64(1);
isigma = int64(1);

% Intial values
sigma = 1;
theta = zeros(m,1);

% Perform bounded influence regression
[x, y, wgt, theta, k, sigma, rs, nit, ifail] = ...
  g02hd( ...
         @chi, @psi, psip0, beta, indw, isigma, x, y, wgt, theta, sigma);

fprintf(' iterations to convergence = %4d\n', nit);
fprintf('                         k = %4d\n',k);
fprintf('                     sigma = %9.4f\n',sigma);
fprintf('Theta:\n');
disp(theta');
fprintf('\n  Weights  Residuals\n');
fprintf('%9.4f%9.4f\n', [wgt rs]');



function [result] = chi(t)
  global dchi

  if (abs(t) < dchi)
    ps=t;
  else
    ps=dchi;
  end
  result = ps*ps/2;


function [result] = psi(t)
  global dchi

  if t < -dchi
    result = -dchi;
  elseif abs(t) < dchi
    result = t;
  else
    result = dchi;
  end;

function [beta] = betcal(wgt)
  %  Calculate beta for Schweppe type regression
  global dchi

  n = numel(wgt);
  amaxex = -log(x02ak);
  anormc = sqrt(2*pi);
  d2 = dchi*dchi;
  beta = 0;
  for i = 1:n
    w2 = wgt(i)*wgt(i);
    dw = wgt(i)*dchi;
    [pc, ifail] = s15ab(dw);
    dw2 = dw*dw;
    if dw2<amaxex
      dc = exp(-dw2/2)/anormc;
    else
      dc = 0;
    end
    b = (-dw*dc+pc-0.5)/w2 + (1-pc)*d2;
    beta = b*w2/n + beta;
  end

On entry,	$n \leq 1$ ,
or	$m < 1$ ,
or	$n \leq m$ ,
or	$ldx < n$ .

On entry,	$beta \leq 0.0$ , and $isigma \neq 0$ ,
or	$sigma \leq 0.0$ .

On entry,	$tol \leq 0.0$ ,
or	$maxit \leq 0$ .

NAG Toolbox: nag_correg_robustm_user (g02hd)

▸▿ Contents

Purpose

Syntax

Description