naginterfaces.library.univar.robust_1var_ci¶

naginterfaces.library.univar.robust_1var_ci(method, x, clevel)[source]¶

robust_1var_ci computes a rank based (nonparametric) estimate and confidence interval for the location parameter of a single population.

For full information please refer to the NAG Library document for g07ea

https://support.nag.com/numeric/nl/nagdoc_31.1/flhtml/g07/g07eaf.html

Parameters

methodstr, length 1

Specifies the method to be used.

$m e t h o d ='E'$

The exact algorithm is used.

$m e t h o d ='A'$

The iterative algorithm is used.

xfloat, array-like, shape $(n)$

The sample observations, $x_{i}$ , for $i = 1, 2, \dots, n$ .

clevelfloat

The confidence interval desired.

For example, for a $95 %$ confidence interval set $c l e v e l = 0.95$ .

Returns

thetafloat: The estimate of the location, $^θ$ .
thetalfloat: The estimate of the lower limit of the confidence interval, $θ_{l}$ .
thetaufloat: The estimate of the upper limit of the confidence interval, $θ_{u}$ .
estclfloat: An estimate of the actual percentage confidence of the interval found, as a proportion between $(0.0, 1.0)$ .
wlowerfloat: The upper value of the Wilcoxon test statistic, $W_{u}$ , corresponding to the lower limit of the confidence interval.
wupperfloat: The lower value of the Wilcoxon test statistic, $W_{l}$ , corresponding to the upper limit of the confidence interval.

Raises

NagValueError

(errno $1$ )

On entry, $c l e v e l = ⟨ v a l u e ⟩$ .

Constraint: $0.0 < c l e v e l < 1.0$ .

(errno $1$ )

On entry, $n = ⟨ v a l u e ⟩$ .

Constraint: $n \geq 2$ .

(errno $1$ )

On entry, $m e t h o d = ⟨ v a l u e ⟩$ .

Constraint: $m e t h o d ='E'$ or $'A'$ .

(errno $2$ )

Not enough information to compute an interval estimate since the whole sample is identical. The common value is returned in $t h e t a$ , $t h e t a l$ and $t h e t a u$ .

(errno $3$ )

The iterative procedure used to estimate $θ$ has not converged.

(errno $3$ )

The iterative procedure used to estimate, $θ_{u}$ , the upper confidence limit has not converged.

(errno $3$ )

The iterative procedure used to estimate, $θ_{l}$ , the lower confidence limit has not converged.

Notes

Consider a vector of independent observations, $x = {(x_{1}, x_{2}, \dots, x_{n})}_{1}^{T}$ with unknown common symmetric density $f (x_{i} - θ)$ . robust_1var_ci computes the Hodges–Lehmann location estimator (see Lehmann (1975)) of the centre of symmetry $θ$ , together with an associated confidence interval. The Hodges–Lehmann estimate is defined as

^θ=median{xi+xj2,1≤i≤j≤n}.

Let $m = (n (n + 1)) / 2$ and let $a_{k}$ , for $k = 1, 2, \dots, m$ denote the $m$ ordered averages $(x_{i} + x_{j}) / 2$ for $1 \leq i \leq j \leq n$ . Then

if $m$ is odd, $^θ = a_{k}$ where $k = (m + 1) / 2$ ;

if $m$ is even, $^θ = (a_{k} + a_{k + 1}) / 2$ where $k = m / 2$ .

This estimator arises from inverting the one-sample Wilcoxon signed-rank test statistic, $W (x - θ_{0})$ , for testing the hypothesis that $θ = θ_{0}$ . Effectively $W (x - θ_{0})$ is a monotonically decreasing step function of $θ_{0}$ with

\begin{matrix} \begin{matrix} mean (W) = μ = \frac{n (n + 1)}{4}, v a r (W) = σ^{2} = \frac{n (n + 1) (2 n + 1)}{24} . \end{matrix} \end{matrix}

The estimate $^θ$ is the solution to the equation $W (x -^θ) = μ$ ; two methods are available for solving this equation. These methods avoid the computation of all the ordered averages $a_{k}$ ; this is because for large $n$ both the storage requirements and the computation time would be excessive.

The first is an exact method based on a set partitioning procedure on the set of all ordered averages $(x_{i} + x_{j}) / 2$ for $i \leq j$ . This is based on the algorithm proposed by Monahan (1984).

The second is an iterative algorithm, based on the Illinois method which is a modification of the regula falsi method, see McKean and Ryan (1977). This algorithm has proved suitable for the function $W (x - θ_{0})$ which is asymptotically linear as a function of $θ_{0}$ .

The confidence interval limits are also based on the inversion of the Wilcoxon test statistic.

Given a desired percentage for the confidence interval, $1 - α$ , expressed as a proportion between $0$ and $1$ , initial estimates for the lower and upper confidence limits of the Wilcoxon statistic are found from

W_{l} = μ - 0.5 + (σ Φ^{- 1} (α / 2))

and

W_{u} = μ + 0.5 + (σ Φ^{- 1} (1 - α / 2)),

where $Φ^{- 1}$ is the inverse cumulative Normal distribution function.

$W_{l}$ and $W_{u}$ are rounded to the nearest integer values. These estimates are then refined using an exact method if $n \leq 80$ , and a Normal approximation otherwise, to find $W_{l}$ and $W_{u}$ satisfying

\begin{matrix} \begin{matrix} P (W \leq W_{l}) \leq α / 2 P (W \leq W_{l} + 1) > α / 2 \end{matrix} \end{matrix}

and

\begin{matrix} \begin{matrix} P (W \geq W_{u}) \leq α / 2 P (W \geq W_{u} - 1) > α / 2 . \end{matrix} \end{matrix}

Let $W_{u} = m - k$ ; then $θ_{l} = a_{k + 1}$ . This is the largest value $θ_{l}$ such that $W (x - θ_{l}) = W_{u}$ .

Let $W_{l} = k$ ; then $θ_{u} = a_{m - k}$ . This is the smallest value $θ_{u}$ such that $W (x - θ_{u}) = W_{l}$ .

As in the case of $^θ$ , these equations may be solved using either the exact or the iterative methods to find the values $θ_{l}$ and $θ_{u}$ .

Then $(θ_{l}, θ_{u})$ is the confidence interval for $θ$ . The confidence interval is thus defined by those values of $θ_{0}$ such that the null hypothesis, $θ = θ_{0}$ , is not rejected by the Wilcoxon signed-rank test at the $(100 \times α) %$ level.

References

Lehmann, E L, 1975, Nonparametrics: Statistical Methods Based on Ranks, Holden–Day

Marazzi, A, 1987, Subroutines for robust estimation of location and scale in ROBETH, Cah. Rech. Doc. IUMSP, No. 3 ROB 1, Institut Universitaire de Médecine Sociale et Préventive, Lausanne

McKean, J W and Ryan, T A, 1977, Algorithm 516: An algorithm for obtaining confidence intervals and point estimates based on ranks in the two-sample location problem, ACM Trans. Math. Software (10), 183–185

Monahan, J F, 1984, Algorithm 616: Fast computation of the Hodges–Lehman location estimator, ACM Trans. Math. Software (10), 265–270

NAG and Python

Return to Front

naginterfaces.library.univar.robust_1var_ci¶

naginterfaces.library.univar.robust_​1var_​ci¶

naginterfaces.library.univar.robust_1var_ci¶