naginterfaces.library.correg.robustm_wts¶

naginterfaces.library.correg.robustm_wts(ucv, x, a, bl=0.9, bd=0.9, tol=5e-05, maxit=50, nitmon=0, data=None, io_manager=None)[source]¶

robustm_wts finds, for a real matrix $X$ of full column rank, a lower triangular matrix $A$ such that ${(A^{T} A)}^{- 1}$ is proportional to a robust estimate of the covariance of the variables. robustm_wts is intended for the calculation of weights of bounded influence regression using robustm_user().

For full information please refer to the NAG Library document for g02hb

https://support.nag.com/numeric/nl/nagdoc_31.1/flhtml/g02/g02hbf.html

Parameters

ucvcallable retval = ucv(t, data=None)

$u c v$ must return the value of the function $u$ for a given value of its argument.

The value of $u$ must be non-negative.

Parameters

tfloat: The argument for which $u c v$ must be evaluated.
dataarbitrary, optional, modifiable in place: User-communication data for callback functions.

Returns

retvalfloat: The value of $u (t)$ evaluated at $t$ .

xfloat, array-like, shape $(n, m)$

The real matrix $X$ , i.e., the independent variables. $x [i - 1, j - 1]$ must contain the $i j$ th element of $x$ , for $j = 1, 2, \dots, m$ , for $i = 1, 2, \dots, n$ .

afloat, array-like, shape $(m \times (m + 1) / 2)$

An initial estimate of the lower triangular real matrix $A$ . Only the lower triangular elements must be given and these should be stored row-wise in the array.

The diagonal elements must be $\neq 0$ , although in practice will usually be $> 0$ .

If the magnitudes of the columns of $X$ are of the same order the identity matrix will often provide a suitable initial value for $A$ .

If the columns of $X$ are of different magnitudes, the diagonal elements of the initial value of $A$ should be approximately inversely proportional to the magnitude of the columns of $X$ .

blfloat, optional

The magnitude of the bound for the off-diagonal elements of $S_{k}$ .

bdfloat, optional

The magnitude of the bound for the diagonal elements of $S_{k}$ .

tolfloat, optional

The relative precision for the final value of $A$ . Iteration will stop when the maximum value of $∣ ∣ s_{j l} ∣ ∣$ is less than $t o l$ .

maxitint, optional

The maximum number of iterations that will be used during the calculation of $A$ .

A value of $m a x i t = 50$ will often be adequate.

nitmonint, optional

Determines the amount of information that is printed on each iteration.

$n i t m o n > 0$

The value of $A$ and the maximum value of $∣ ∣ s_{j l} ∣ ∣$ will be printed at the first and every $n i t m o n$ iterations.

$n i t m o n \leq 0$

No iteration monitoring is printed.

When printing occurs the output is directed to the file object associated with the advisory I/O unit (see FileObjManager).

dataarbitrary, optional

User-communication data for callback functions.

io_managerFileObjManager, optional

Manager for I/O in this routine.

Returns

afloat, ndarray, shape $(m \times (m + 1) / 2)$: The lower triangular elements of the matrix $A$ , stored row-wise.
zfloat, ndarray, shape $(n)$: The value ${∥ z_{i} ∥}_{2}$ , for $i = 1, 2, \dots, n$ .
nitint: The number of iterations performed.

Raises

NagValueError

(errno $1$ )

On entry, $n = ⟨ v a l u e ⟩$ and $ldx = ⟨ v a l u e ⟩$ .

Constraint: $ldx \geq n$ .

(errno $1$ )

On entry, $n = ⟨ v a l u e ⟩$ and $m = ⟨ v a l u e ⟩$ .

Constraint: $n \geq m$ .

(errno $1$ )

On entry, $n = ⟨ v a l u e ⟩$ .

Constraint: $n \geq 2$ .

(errno $1$ )

On entry, $m = ⟨ v a l u e ⟩$ .

Constraint: $m \geq 1$ .

(errno $2$ )

On entry, $b d = ⟨ v a l u e ⟩$ .

Constraint: $b d > 0.0$ .

(errno $2$ )

On entry, $b l = ⟨ v a l u e ⟩$ .

Constraint: $b l > 0.0$ .

(errno $2$ )

On entry, $m a x i t = ⟨ v a l u e ⟩$ .

Constraint: $m a x i t > 0$ .

(errno $2$ )

On entry, $t o l = ⟨ v a l u e ⟩$ .

Constraint: $t o l > 0.0$ .

(errno $2$ )

On entry, $i = ⟨ v a l u e ⟩$ and the $i$ th diagonal element of $A$ is $0$ .

Constraint: all diagonal elements of $A$ must be non-zero.

(errno $3$ )

Value returned by $u c v$ function $< 0$ : $u (⟨ v a l u e ⟩) = ⟨ v a l u e ⟩$ .

The value of $u$ must be non-negative.

(errno $4$ )

Iterations to calculate weights failed to converge in $m a x i t$ iterations: $m a x i t = ⟨ v a l u e ⟩$ .

Notes

In fitting the linear regression model

y = X θ + ϵ,

where	$y$ is a vector of length $n$ of the dependent variable,
	$X$ is an $n \times m$ matrix of independent variables,
	$θ$ is a vector of length $m$ of unknown parameters,
and	$ϵ$ is a vector of length $n$ of unknown errors,

it may be desirable to bound the influence of rows of the $X$ matrix. This can be achieved by calculating a weight for each observation. Several schemes for calculating weights have been proposed (see Hampel et al. (1986) and Marazzi (1987)). As the different independent variables may be measured on different scales one group of proposed weights aims to bound a standardized measure of influence. To obtain such weights the matrix $A$ has to be found such that

\frac{1}{n} n \sum i = 1 u ({∥ z_{i} ∥}_{2}) z_{i} z_{i}^{T} = I (I is the identity matrix)

and

z_{i} = A x_{i},

where	$x_{i}$ is a vector of length $m$ containing the elements of the $i$ th row of $X$ ,
	$A$ is an $m \times m$ lower triangular matrix,
	$z_{i}$ is a vector of length $m$ ,
and	$u$ is a suitable function.

The weights for use with robustm_user() may then be computed using

w_{i} = f ({∥ z_{i} ∥}_{2})

for a suitable user-supplied function $f$ .

robustm_wts finds $A$ using the iterative procedure

A_{k} = (S_{k} + I) A_{k - 1},

where $S_{k} = (s_{j l})$ , for $l = 1, 2, \dots, m$ , for $j = 1, 2, \dots, m$ , is a lower triangular matrix such that

$s_{j l} = ⎧ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎩ \begin{matrix} - m i n [m a x (h_{j l} / n, - BL), BL], & j > l - m i n [m a x (\frac{1}{2} (h_{j j} / n - 1), - BD), BD], & j = l \end{matrix}$

$h_{j l} = \sum_{i = 1}^{n} u ({∥ z_{i} ∥}_{2}) z_{i j} z_{i l}$

and $BD$ and $BL$ are suitable bounds.

In addition the values of ${∥ z_{i} ∥}_{2}$ , for $i = 1, 2, \dots, n$ , are calculated.

robustm_wts is based on routines in ROBETH; see Marazzi (1987).

References

Hampel, F R, Ronchetti, E M, Rousseeuw, P J and Stahel, W A, 1986, Robust Statistics. The Approach Based on Influence Functions, Wiley

Huber, P J, 1981, Robust Statistics, Wiley

Marazzi, A, 1987, Weights for bounded influence regression in ROBETH, Cah. Rech. Doc. IUMSP, No. 3 ROB 3, Institut Universitaire de Médecine Sociale et Préventive, Lausanne

NAG and Python

Return to Front

naginterfaces.library.correg.robustm_wts¶

naginterfaces.library.correg.robustm_​wts¶

naginterfaces.library.correg.robustm_wts¶