naginterfaces.library.correg.robustm_user¶

naginterfaces.library.correg.robustm_user(psi, psip0, beta, indw, isigma, x, y, wgt, theta, sigma, chi=None, tol=5e-05, eps=5e-06, maxit=50, nitmon=0, data=None, io_manager=None)[source]¶

robustm_user performs bounded influence regression ( $M$ -estimates) using an iterative weighted least squares algorithm.

For full information please refer to the NAG Library document for g02hd

https://support.nag.com/numeric/nl/nagdoc_30.3/flhtml/g02/g02hdf.html

Parameters

psicallable retval = psi(t, data=None)

$p s i$ must return the value of the weight function $ψ$ for a given value of its argument.

Parameters

tfloat: The argument for which $p s i$ must be evaluated.
dataarbitrary, optional, modifiable in place: User-communication data for callback functions.

Returns

retvalfloat: The value of the weight function $ψ$ evaluated at $t$ .

psip0float

The value of $ψ^{'} (0)$ .

betafloat

If $i s i g m a < 0$ , $b e t a$ must specify the value of $β_{1}$ .

For Huber and Schweppe type regressions, $β_{1}$ is the $75$ th percentile of the standard Normal distribution (see stat.inv_cdf_normal).

For Mallows type regression $β_{1}$ is the solution to

\frac{1}{n} n \sum i = 1 Φ (β_{1} / \sqrt{w_{i}}) = 0.75,

where $Φ$ is the standard Normal cumulative distribution function (see specfun.cdf_normal).

If $i s i g m a > 0$ , $b e t a$ must specify the value of $β_{2}$ .

\begin{matrix} \begin{matrix} β_{2} = & \int_{- \infty}^{\infty} χ (z) ϕ (z) d z, & in the Huber case; β_{2} = & \frac{1}{n} \sum_{i = 1}^{n} w_{i} \int_{- \infty}^{\infty} χ (z) ϕ (z) d z, & in the Mallows case; β_{2} = & \frac{1}{n} \sum_{i = 1}^{n} w_{i}^{2} \int_{- \infty}^{\infty} χ (z / w_{i}) ϕ (z) d z, & in the Schweppe case; \end{matrix} \end{matrix}

where $ϕ$ is the standard normal density, i.e., $\frac{1}{\sqrt{2 π}} e x p (- \frac{1}{2} x^{2})$ .

If $i s i g m a = 0$ , $b e t a$ is not referenced.

indwint

Determines the type of regression to be performed.

$i n d w = 0$

Huber type regression.

$i n d w < 0$

Mallows type regression.

$i n d w > 0$

Schweppe type regression.

isigmaint

Determines how $σ$ is to be estimated.

$i s i g m a = 0$

$σ$ is held constant at its initial value.

$i s i g m a < 0$

$σ$ is estimated by median absolute deviation of residuals.

$i s i g m a > 0$

$σ$ is estimated using the $χ$ function.

xfloat, array-like, shape $(n, m)$

The values of the $X$ matrix, i.e., the independent variables. $x [i - 1, j - 1]$ must contain the $i j$ th element of $x$ , for $j = 1, 2, \dots, m$ , for $i = 1, 2, \dots, n$ .

If $i n d w < 0$ , during calculations the elements of $x$ will be transformed as described in Notes.

Before exit the inverse transformation will be applied.

As a result there may be slight differences between the input $x$ and the output $x$ .

yfloat, array-like, shape $(n)$

The data values of the dependent variable.

$y [i - 1]$ must contain the value of $y$ for the $i$ th observation, for $i = 1, 2, \dots, n$ .

If $i n d w < 0$ , during calculations the elements of $y$ will be transformed as described in Notes.

Before exit the inverse transformation will be applied.

As a result there may be slight differences between the input $y$ and the output $y$ .

wgtfloat, array-like, shape $(n)$

The weight for the $i$ th observation, for $i = 1, 2, \dots, n$ .

If $i n d w < 0$ , during calculations elements of $w g t$ will be transformed as described in Notes.

Before exit the inverse transformation will be applied.

As a result there may be slight differences between the input $w g t$ and the output $w g t$ .

If $w g t [i - 1] \leq 0$ , the $i$ th observation is not included in the analysis.

If $i n d w = 0$ , $w g t$ is not referenced.

thetafloat, array-like, shape $(m)$

Starting values of the parameter vector $θ$ . These may be obtained from least squares regression. Alternatively if $i s i g m a < 0$ and $s i g m a = 1$ or if $i s i g m a > 0$ and $s i g m a$ approximately equals the standard deviation of the dependent variable, $y$ , then $t h e t a [i - 1] = 0.0$ , for $i = 1, 2, \dots, m$ may provide reasonable starting values.

sigmafloat

A starting value for the estimation of $σ$ . $s i g m a$ should be approximately the standard deviation of the residuals from the model evaluated at the value of $θ$ given by $t h e t a$ on entry.

chiNone or callable retval = chi(t, data=None), optional

Note: if this argument is None then a NAG-supplied facility will be used.

If $i s i g m a > 0$ , $c h i$ must return the value of the weight function $χ$ for a given value of its argument.

The value of $χ$ must be non-negative.

Parameters

tfloat: The argument for which $c h i$ must be evaluated.
dataarbitrary, optional, modifiable in place: User-communication data for callback functions.

Returns

retvalfloat: The value of the weight function $χ$ evaluated at $t$ .

tolfloat, optional

The relative precision for the final estimates. Convergence is assumed when both the relative change in the value of $s i g m a$ and the relative change in the value of each element of $t h e t a$ are less than $t o l$ .

It is advisable for $t o l$ to be greater than $100 \times machine precision$ .

epsfloat, optional

A relative tolerance to be used to determine the rank of $X$ . See linsys.real_gen_solve for further details.

If $e p s < machine precision$ or $e p s > 1.0$ , machine precision will be used in place of $t o l$ .

A reasonable value for $e p s$ is $5.0 \times 10^{- 6}$ where this value is possible.

maxitint, optional

The maximum number of iterations that should be used during the estimation.

A value of $m a x i t = 50$ should be adequate for most uses.

nitmonint, optional

Determines the amount of information that is printed on each iteration.

$n i t m o n \leq 0$

No information is printed.

$n i t m o n > 0$

On the first and every $n i t m o n$ iterations the values of $s i g m a$ , $t h e t a$ and the change in $t h e t a$ during the iteration are printed.

When printing occurs the output is directed to the file object associated with the advisory I/O unit (see FileObjManager).

dataarbitrary, optional

User-communication data for callback functions.

io_managerFileObjManager, optional

Manager for I/O in this routine.

Returns

xfloat, ndarray, shape $(n, m)$: Unchanged, except as described above.
yfloat, ndarray, shape $(n)$: Unchanged, except as described above.
wgtfloat, ndarray, shape $(n)$: Unchanged, except as described above.
thetafloat, ndarray, shape $(m)$: The M-estimate of $θ_{i}$ , for $i = 1, 2, \dots, m$ .
kint: The column rank of the matrix $X$ .
sigmafloat: The final estimate of $σ$ if $i s i g m a \neq 0$ or the value assigned on entry if $i s i g m a = 0$ .
rsfloat, ndarray, shape $(n)$: The residuals from the model evaluated at final value of $t h e t a$ , i.e., $r s$ contains the vector $(y - X^θ)$ .
nitint: The number of iterations that were used during the estimation.

Raises

NagValueError

(errno $1$ )

On entry, $n = ⟨ v a l u e ⟩$ and $ldx = ⟨ v a l u e ⟩$ .

Constraint: $ldx \geq n$ .

(errno $1$ )

On entry, $n = ⟨ v a l u e ⟩$ and $m = ⟨ v a l u e ⟩$ .

Constraint: $n > m$ .

(errno $1$ )

On entry, $m = ⟨ v a l u e ⟩$ .

Constraint: $m \geq 1$ .

(errno $1$ )

On entry, $n = ⟨ v a l u e ⟩$ .

Constraint: $n \geq 2$ .

(errno $2$ )

On entry, $b e t a = ⟨ v a l u e ⟩$ .

Constraint: $b e t a > 0.0$ .

(errno $2$ )

On entry, $s i g m a = ⟨ v a l u e ⟩$ .

Constraint: $s i g m a > 0.0$ .

(errno $3$ )

On entry, $m a x i t = ⟨ v a l u e ⟩$ .

Constraint: $m a x i t > 0$ .

(errno $3$ )

On entry, $t o l = ⟨ v a l u e ⟩$ .

Constraint: $t o l > 0.0$ .

(errno $4$ )

Value given by $c h i$ function $< 0$ : $c h i (⟨ v a l u e ⟩) = ⟨ v a l u e ⟩$ .

The value of $c h i$ must be non-negative.

(errno $5$ )

Estimated value of $s i g m a$ is zero.

(errno $6$ )

Iterations to solve the weighted least squares equations failed to converge.

(errno $8$ )

The function has failed to converge in $m a x i t$ iterations.

(errno $9$ )

Having removed cases with zero weight, the value of $n - k \leq 0$ , i.e., no degree of freedom for error. This error will only occur if $i s i g m a > 0$ .

Warns

NagAlgorithmicWarning

(errno $7$ ): The weighted least squares equations are not of full rank. This may be due to the $X$ matrix not being of full rank, in which case the results will be valid. It may also occur if some of the $G_{i i}$ values become very small or zero, see Further Comments. The rank of the equations is given by $k$ . If the matrix just fails the test for nonsingularity then the result $e r r n o$ = 7 and $k = m$ is possible (see linsys.real_gen_solve).

Notes

For the linear regression model

y = X θ + ϵ,

where	$y$ is a vector of length $n$ of the dependent variable,
	$X$ is an $n \times m$ matrix of independent variables of column rank $k$ ,
	$θ$ is a vector of length $m$ of unknown parameters,
and	$ϵ$ is a vector of length $n$ of unknown errors with var $(ϵ_{i}) = σ^{2}$ ,

robustm_user calculates the M-estimates given by the solution, $^θ$ , to the equation

n \sum i = 1 ψ (r_{i} / (σ w_{i})) w_{i} x_{i j} = 0, j = 1, 2, \dots, m,

where	$r_{i}$ is the $i$ th residual, i.e., the $i$ th element of the vector $r = y - X^θ$ ,
	$ψ$ is a suitable weight function,
	$w_{i}$ are suitable weights such as those that can be calculated by using output from `robustm_wts()`,
and	$σ$ may be estimated at each iteration by the median absolute deviation of the residuals $^σ = {m e d}_{i} ([\| r_{i} \|] / β_{1})$

or as the solution to

n \sum i = 1 χ (r_{i} / (^σ w_{i})) w_{i}^{2} = (n - k) β_{2}

for a suitable weight function $χ$ , where $β_{1}$ and $β_{2}$ are constants, chosen so that the estimator of $σ$ is asymptotically unbiased if the errors, $ϵ_{i}$ , have a Normal distribution. Alternatively $σ$ may be held at a constant value.

The above describes the Schweppe type regression. If the $w_{i}$ are assumed to equal $1$ for all $i$ , then Huber type regression is obtained. A third type, due to Mallows, replaces (1) by

n \sum i = 1 ψ (r_{i} / σ) w_{i} x_{i j} = 0, j = 1, 2, \dots, m .

This may be obtained by use of the transformations

\begin{matrix} \begin{matrix} w_{i}^{*} & \leftarrow \sqrt{w_{i}} y_{i}^{*} & \leftarrow y_{i} \sqrt{w_{i}} x_{i j}^{*} & \leftarrow x_{i j} \sqrt{w_{i}}, j = 1, 2, \dots, m \end{matrix} \end{matrix}

(see Marazzi (1987)).

The calculation of the estimates of $θ$ can be formulated as an iterative weighted least squares problem with a diagonal weight matrix $G$ given by

\begin{matrix} G_{i i} = ⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩ \begin{matrix} \frac{ψ (r_{i} / (σ w_{i}))}{(r_{i} / (σ w_{i}))}, & r_{i} \neq 0 ψ^{'} (0), & r_{i} = 0 . \end{matrix} . \end{matrix}

The value of $θ$ at each iteration is given by the weighted least squares regression of $y$ on $X$ . This is carried out by first transforming the $y$ and $X$ by

\begin{matrix} \begin{matrix} {~ y}_{i} & = y_{i} \sqrt{G_{i i}} {~ x}_{i j} & = x_{i j} \sqrt{G_{i i}}, j = 1, 2, \dots, m \end{matrix} \end{matrix}

and then using linsys.real_gen_solve. If $X$ is of full column rank then an orthogonal-triangular ( $Q R$ ) decomposition is used; if not, a singular value decomposition is used.

Observations with zero or negative weights are not included in the solution.

Note: there is no explicit provision in the function for a constant term in the regression model. However, the addition of a dummy variable whose value is $1.0$ for all observations will produce a value of $^θ$ corresponding to the usual constant term.

robustm_user is based on routines in ROBETH, see Marazzi (1987).

References

Hampel, F R, Ronchetti, E M, Rousseeuw, P J and Stahel, W A, 1986, Robust Statistics. The Approach Based on Influence Functions, Wiley

Huber, P J, 1981, Robust Statistics, Wiley

Marazzi, A, 1987, Subroutines for robust and bounded influence regression in ROBETH, Cah. Rech. Doc. IUMSP, No. 3 ROB 2, Institut Universitaire de Médecine Sociale et Préventive, Lausanne

NAG and Python

Return to Front

naginterfaces.library.correg.robustm_user¶

naginterfaces.library.correg.robustm_​user¶

naginterfaces.library.correg.robustm_user¶