naginterfaces.library.univar.estim_normal

naginterfaces.library.univar.estim_normal(method, x, xc, ic, xmu, xsig, tol, maxit)

estim_normal computes maximum likelihood estimates and their standard errors for parameters of the Normal distribution from grouped and/or censored data.

For full information please refer to the NAG Library document for g07bb:

https://support.nag.com/numeric/nl/nagdoc_30.3/flhtml/g07/g07bbf.html

Parameters

method : str, length 1

Indicates whether the Newton–Raphson or EM algorithm should be used.

If $\mathrm{method} = \text{'N'}$, the Newton–Raphson algorithm is used.

If $\mathrm{method} = \text{'E'}$, the EM algorithm is used.

x : float, array-like, shape $(n)$

The observations $x_{i}$, $L_{i}$ or $U_{i}$, for $i = 1, 2, \dots, n$.

If the observation is exactly specified – the exact value, $x_{i}$.

If the observation is right-censored – the lower value, $L_{i}$.

If the observation is left-censored – the upper value, $U_{i}$.

If the observation is interval-censored – the lower or upper value, $L_{i}$ or $U_{i}$ (see $\mathrm{xc}$).

xc : float, array-like, shape $(n)$

If the $j$th observation, for $j = 1, 2, \dots, n$, is interval-censored then $\mathrm{xc}[j - 1]$ should contain the complementary value to $\mathrm{x}[j - 1]$: if $\mathrm{x}[j - 1] < \mathrm{xc}[j - 1]$, then $\mathrm{xc}[j - 1]$ contains the upper value, $U_{j}$, and if $\mathrm{x}[j - 1] > \mathrm{xc}[j - 1]$, then $\mathrm{xc}[j - 1]$ contains the lower value, $L_{j}$. Otherwise, if the $j$th observation is exact or right- or left-censored, $\mathrm{xc}[j - 1]$ need not be set.

Note: if $\mathrm{x}[j - 1] = \mathrm{xc}[j - 1]$ then the observation is ignored.

ic : int, array-like, shape $(n)$

$\mathrm{ic}[i - 1]$ contains the censoring code for the $i$th observation, for $i = 1, 2, \dots, n$.

If $\mathrm{ic}[i - 1] = 0$, the observation is exactly specified.

If $\mathrm{ic}[i - 1] = 1$, the observation is right-censored.

If $\mathrm{ic}[i - 1] = 2$, the observation is left-censored.

If $\mathrm{ic}[i - 1] = 3$, the observation is interval-censored.
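The encoding of `x`, `xc` and `ic` described above can be sketched as a small helper. This is only an illustration: the function name `encode_observations` and the use of infinite bounds to mark censoring are assumptions made here, not part of the NAG interface.

```python
import math


def encode_observations(bounds):
    """Build the x, xc and ic lists from (lower, upper) pairs.

    Hypothetical helper: a missing bound is given as -math.inf or math.inf.
    """
    x, xc, ic = [], [], []
    for lower, upper in bounds:
        if lower > upper:
            raise ValueError("lower bound exceeds upper bound")
        lower_known = math.isfinite(lower)
        upper_known = math.isfinite(upper)
        if lower_known and upper_known:
            if lower == upper:
                # Exactly specified: x holds the value, xc is not used.
                x.append(lower); xc.append(0.0); ic.append(0)
            else:
                # Interval-censored: x holds one end, xc the other.
                x.append(lower); xc.append(upper); ic.append(3)
        elif lower_known:
            # Right-censored: only the lower value is known.
            x.append(lower); xc.append(0.0); ic.append(1)
        elif upper_known:
            # Left-censored: only the upper value is known.
            x.append(upper); xc.append(0.0); ic.append(2)
        else:
            raise ValueError("observation carries no information")
    return x, xc, ic


x, xc, ic = encode_observations(
    [(4.5, 4.5), (5.0, math.inf), (-math.inf, 3.0), (2.0, 6.0)]
)
```

Entries of `xc` that need not be set are filled with `0.0` here purely as a placeholder.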

xmu : float

If $\mathrm{xsig} > 0.0$, the initial estimate of the mean, $μ$; otherwise $\mathrm{xmu}$ need not be set.

xsig : float

Specifies whether initial estimates of $μ$ and $σ$ are to be supplied.

$\mathrm{xsig} > 0.0$

$\mathrm{xsig}$ is the initial estimate of $σ$ and $\mathrm{xmu}$ must contain an initial estimate of $μ$.

$\mathrm{xsig} \leq 0.0$

Initial estimates of $\mathrm{xmu}$ and $\mathrm{xsig}$ are calculated internally from:

the exact observations, if the number of exactly specified observations is $\geq 2$; or

the interval-censored observations, if the number of interval-censored observations is $\geq 1$; or

failing both, they are set to $0.0$ and $1.0$ respectively.

tol : float

The relative precision required for the final estimates of $μ$ and $σ$. Convergence is assumed when the absolute relative changes in the estimates of both $μ$ and $σ$ are less than $\mathrm{tol}$.

If $\mathrm{tol} = 0.0$, a relative precision of $0.000005$ is used.

maxit : int

The maximum number of iterations.

If $\mathrm{maxit} \leq 0$, a value of $25$ is used.

Returns

xmu : float

The maximum likelihood estimate, $\hat{μ}$, of $μ$.

xsig : float

The maximum likelihood estimate, $\hat{σ}$, of $σ$.

sexmu : float

The estimate of the standard error of $\hat{μ}$.

sexsig : float

The estimate of the standard error of $\hat{σ}$.

corr : float

The estimate of the correlation between $\hat{μ}$ and $\hat{σ}$.

dev : float

The maximized log-likelihood, $L(\hat{μ}, \hat{σ})$.

nobs : int, ndarray, shape $(4)$

The number of observations of each type:

$\mathrm{nobs}[0]$ contains the number of right-censored observations.

$\mathrm{nobs}[1]$ contains the number of left-censored observations.

$\mathrm{nobs}[2]$ contains the number of interval-censored observations.

$\mathrm{nobs}[3]$ contains the number of exactly specified observations.

nit : int

The number of iterations performed.
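A minimal calling sketch, using only the argument order shown in the signature above. The wrapper name `fit_censored_normal` and the data values are illustrative assumptions; the import is done inside the function so the snippet can be loaded without `naginterfaces` installed.

```python
def fit_censored_normal(x, xc, ic, method='N'):
    """Call estim_normal with default starting values and tolerances."""
    # Lazy import: requires an installed and licensed naginterfaces.
    from naginterfaces.library import univar
    return univar.estim_normal(
        method, x, xc, ic,
        0.0,  # xmu: need not be set because xsig <= 0.0
        0.0,  # xsig <= 0.0: initial estimates are calculated internally
        0.0,  # tol = 0.0: a relative precision of 0.000005 is used
        0,    # maxit <= 0: a value of 25 is used
    )


# Illustrative data (made up for this sketch): four exact values,
# one right-censored, one left-censored and one interval-censored.
x = [4.5, 5.4, 3.9, 5.1, 6.0, 3.0, 2.0]
xc = [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 4.0]
ic = [0, 0, 0, 0, 1, 2, 3]
```

Calling `fit_censored_normal(x, xc, ic)` should then yield the values listed under Returns (`xmu`, `xsig`, `sexmu`, `sexsig`, `corr`, `dev`, `nobs`, `nit`); how they are packaged in the returned object is not stated here, so check it against the installed version.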

Raises

NagValueError

(errno $1$)

On entry, $n = \langle\mathit{value}\rangle$.

Constraint: $n \geq 2$.

(errno $1$)

On entry, $\mathrm{method} = \langle\mathit{value}\rangle$.

Constraint: $\mathrm{method} = \text{'N'}$ or $\text{'E'}$.

(errno $1$)

On entry, effective number of observations $< 2$.

(errno $1$)

On entry, $i = \langle\mathit{value}\rangle$ and $\mathrm{ic}[i - 1] = \langle\mathit{value}\rangle$.

Constraint: $\mathrm{ic}[i - 1] = 0$, $1$, $2$ or $3$.

(errno $1$)

On entry, $\mathrm{tol} = \langle\mathit{value}\rangle$.

Constraint: $\text{machine precision} < \mathrm{tol} \leq 1.0$ or $\mathrm{tol} = 0.0$.

(errno $2$)

The chosen method has not converged in $\langle\mathit{value}\rangle$ iterations.

(errno $3$)

The process has diverged.

(errno $3$)

The EM process has failed.

(errno $4$)

Standard errors cannot be computed.

Notes

A sample of size $n$ is taken from a Normal distribution with mean $μ$ and variance $σ^{2}$ and consists of grouped and/or censored data. Each of the $n$ observations is known by a pair of values $(L_{i}, U_{i})$ such that:

$$L_{i} \leq x_{i} \leq U_{i}.$$

The data is represented as particular cases of this form:

exactly specified observations occur when $L_{i} = U_{i} = x_{i}$,

right-censored observations, known only by a lower bound, occur when $U_{i} \to \infty$,

left-censored observations, known only by an upper bound, occur when $L_{i} \to -\infty$,

and interval-censored observations occur when $L_{i} < x_{i} < U_{i}$.

Let the set $A$ identify the exactly specified observations, sets $B$ and $C$ identify the observations censored on the right and left respectively, and set $D$ identify the observations confined between two finite limits. Also let there be $r$ exactly specified observations, i.e., the number in $A$ . The probability density function for the standard Normal distribution is

$$Z(x) = \frac{1}{\sqrt{2π}} \exp\left(-\frac{1}{2} x^{2}\right), \quad -\infty < x < \infty$$

and the cumulative distribution function is

$$P(X) = 1 - Q(X) = \int_{-\infty}^{X} Z(x)\,dx.$$

The log-likelihood of the sample can be written as:

$$L(μ, σ) = -r \log(σ) - \frac{1}{2} \sum_{A} \left\{(x_{i} - μ)/σ\right\}^{2} + \sum_{B} \log(Q(l_{i})) + \sum_{C} \log(P(u_{i})) + \sum_{D} \log(p_{i})$$

where $p_{i} = P(u_{i}) - P(l_{i})$, $u_{i} = (U_{i} - μ)/σ$ and $l_{i} = (L_{i} - μ)/σ$.
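As a rough illustration, the log-likelihood above can be evaluated directly with the standard library. The function name `log_likelihood` is an assumption made for this sketch, and it is not the routine's internal implementation. The tail probabilities are computed with `math.erfc`.

```python
import math

SQRT2 = math.sqrt(2.0)


def P(t):
    """Standard Normal cumulative distribution function."""
    return 0.5 * math.erfc(-t / SQRT2)


def Q(t):
    """Upper tail probability, Q(t) = 1 - P(t)."""
    return 0.5 * math.erfc(t / SQRT2)


def log_likelihood(mu, sigma, x, xc, ic):
    """Evaluate L(mu, sigma) term by term, using the ic censoring codes."""
    total = 0.0
    for xi, xci, code in zip(x, xc, ic):
        if code == 0:      # set A: exactly specified
            total += -math.log(sigma) - 0.5 * ((xi - mu) / sigma) ** 2
        elif code == 1:    # set B: right-censored, xi is L_i
            total += math.log(Q((xi - mu) / sigma))
        elif code == 2:    # set C: left-censored, xi is U_i
            total += math.log(P((xi - mu) / sigma))
        else:              # set D: interval-censored
            if xi == xci:
                continue   # such observations are ignored
            lo, hi = min(xi, xci), max(xi, xci)
            total += math.log(P((hi - mu) / sigma) - P((lo - mu) / sigma))
    return total


value = log_likelihood(2.0, 1.0, [1.0, 3.0, 2.0], [0.0, 0.0, 0.0], [0, 0, 1])
```

For widely separated or extreme bounds the difference of two `P` values can lose accuracy; a production implementation would treat the tails more carefully.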

Let

$$S(x_{i}) = \frac{Z(x_{i})}{Q(x_{i})}, \quad S_{1}(l_{i}, u_{i}) = \frac{Z(l_{i}) - Z(u_{i})}{p_{i}}$$

and

$$S_{2}(l_{i}, u_{i}) = \frac{u_{i} Z(u_{i}) - l_{i} Z(l_{i})}{p_{i}},$$

then the first derivatives of the log-likelihood can be written as:

$$\frac{\partial L(μ, σ)}{\partial μ} = L_{1}(μ, σ) = σ^{-2} \sum_{A} (x_{i} - μ) + σ^{-1} \sum_{B} S(l_{i}) - σ^{-1} \sum_{C} S(-u_{i}) + σ^{-1} \sum_{D} S_{1}(l_{i}, u_{i})$$

and

$$\frac{\partial L(μ, σ)}{\partial σ} = L_{2}(μ, σ) = -r σ^{-1} + σ^{-3} \sum_{A} (x_{i} - μ)^{2} + σ^{-1} \sum_{B} l_{i} S(l_{i}) - σ^{-1} \sum_{C} u_{i} S(-u_{i}) - σ^{-1} \sum_{D} S_{2}(l_{i}, u_{i}).$$

The maximum likelihood estimates, $\hat{μ}$ and $\hat{σ}$, are the solution to the equations:

$$L_{1}(\hat{μ}, \hat{σ}) = 0 \qquad (1)$$

and

$$L_{2}(\hat{μ}, \hat{σ}) = 0 \qquad (2)$$

and if the second derivatives $\frac{\partial^{2} L}{\partial μ^{2}}$, $\frac{\partial^{2} L}{\partial μ \partial σ}$ and $\frac{\partial^{2} L}{\partial σ^{2}}$ are denoted by $L_{11}$, $L_{12}$ and $L_{22}$ respectively, then estimates of the standard errors of $\hat{μ}$ and $\hat{σ}$ are given by:

$$\mathrm{se}(\hat{μ}) = \sqrt{\frac{-L_{22}}{L_{11} L_{22} - L_{12}^{2}}}, \quad \mathrm{se}(\hat{σ}) = \sqrt{\frac{-L_{11}}{L_{11} L_{22} - L_{12}^{2}}}$$

and an estimate of the correlation of $\hat{μ}$ and $\hat{σ}$ is given by:

$$\frac{L_{12}}{\sqrt{L_{11} L_{22}}}.$$
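The standard errors and the correlation both come from inverting the negated matrix of second derivatives. The sketch below makes that step explicit; the function name `standard_errors` is hypothetical and the inputs are assumed to be the second derivatives evaluated at the maximum.

```python
import math


def standard_errors(L11, L12, L22):
    """Return (se_mu, se_sigma, corr) from the second derivatives.

    The covariance matrix is taken as the inverse of
    [[-L11, -L12], [-L12, -L22]].
    """
    det = L11 * L22 - L12 * L12
    if det <= 0.0 or L11 >= 0.0 or L22 >= 0.0:
        # Not a proper maximum: standard errors cannot be computed.
        raise ValueError("second derivatives do not describe a maximum")
    var_mu = -L22 / det
    var_sigma = -L11 / det
    cov = L12 / det
    corr = cov / math.sqrt(var_mu * var_sigma)
    return math.sqrt(var_mu), math.sqrt(var_sigma), corr


se_mu, se_sigma, corr = standard_errors(-2.0, 1.0, -2.0)
```

The guard mirrors the situation in which the routine reports that standard errors cannot be computed, although the exact conditions the library checks are not stated here.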

To obtain the maximum likelihood estimates, equations (1) and (2) can be solved using either the Newton–Raphson method or the Expectation-maximization (EM) algorithm of Dempster et al. (1977).

Newton–Raphson Method

This consists of using approximate estimates $\tilde{μ}$ and $\tilde{σ}$ to obtain improved estimates $\tilde{μ} + δ\tilde{μ}$ and $\tilde{σ} + δ\tilde{σ}$ by solving

$$\begin{aligned} δ\tilde{μ} L_{11} + δ\tilde{σ} L_{12} + L_{1} &= 0, \\ δ\tilde{μ} L_{12} + δ\tilde{σ} L_{22} + L_{2} &= 0, \end{aligned}$$

for the corrections $δ\tilde{μ}$ and $δ\tilde{σ}$.
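The correction step is a 2×2 linear solve. The sketch below solves it by Cramer's rule and, as a check, iterates it on exact (uncensored) data only, where the derivatives have simple closed forms. The names `newton_step` and `newton_exact_only` are assumptions for illustration; the censored terms are left out to keep the example short.

```python
import math


def newton_step(L1, L2, L11, L12, L22):
    """Solve the two linear equations for the corrections (d_mu, d_sigma)."""
    det = L11 * L22 - L12 * L12
    if det == 0.0:
        raise ZeroDivisionError("singular second-derivative matrix")
    d_mu = (L2 * L12 - L1 * L22) / det
    d_sigma = (L1 * L12 - L2 * L11) / det
    return d_mu, d_sigma


def newton_exact_only(data, mu, sigma, iterations=25):
    """Newton-Raphson iteration when every observation is exactly specified."""
    r = len(data)
    for _ in range(iterations):
        s1 = sum(xi - mu for xi in data)
        s2 = sum((xi - mu) ** 2 for xi in data)
        L1 = s1 / sigma ** 2
        L2 = -r / sigma + s2 / sigma ** 3
        L11 = -r / sigma ** 2
        L12 = -2.0 * s1 / sigma ** 3
        L22 = r / sigma ** 2 - 3.0 * s2 / sigma ** 4
        d_mu, d_sigma = newton_step(L1, L2, L11, L12, L22)
        mu += d_mu
        sigma += d_sigma
    return mu, sigma


mu_hat, sigma_hat = newton_exact_only([1.0, 2.0, 3.0, 4.0, 5.0], 2.5, 1.5)
```

With exact data the iteration should settle on the sample mean and the root mean squared deviation. A poor starting point can make the iteration diverge, which is why good initial estimates matter for this method.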

EM Algorithm

The expectation step consists of constructing the variable $w_{i}$ as follows:

$$\text{if } i \in A, \quad w_{i} = x_{i} \qquad (3)$$

$$\text{if } i \in B, \quad w_{i} = E(x_{i} \mid x_{i} > L_{i}) = μ + σ S(l_{i}) \qquad (4)$$

$$\text{if } i \in C, \quad w_{i} = E(x_{i} \mid x_{i} < U_{i}) = μ - σ S(-u_{i}) \qquad (5)$$

$$\text{if } i \in D, \quad w_{i} = E(x_{i} \mid L_{i} < x_{i} < U_{i}) = μ + σ S_{1}(l_{i}, u_{i}) \qquad (6)$$

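The maximization step consists of substituting (3), (4), (5) and (6) into (1) and (2), giving:

$$\hat{μ} = \sum_{i=1}^{n} \hat{w}_{i} / n \qquad (7)$$

and

$$\hat{σ}^{2} = \sum_{i=1}^{n} (\hat{w}_{i} - \hat{μ})^{2} \Big/ \left\{ r + \sum_{B} T(\hat{l}_{i}) + \sum_{C} T(-\hat{u}_{i}) + \sum_{D} T_{1}(\hat{l}_{i}, \hat{u}_{i}) \right\} \qquad (8)$$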

where

$$T(x) = S(x) \{S(x) - x\}, \quad T_{1}(l, u) = S_{1}^{2}(l, u) + S_{2}(l, u)$$

and where $\hat{w}_{i}$, $\hat{l}_{i}$ and $\hat{u}_{i}$ are $w_{i}$, $l_{i}$ and $u_{i}$ evaluated at $\hat{μ}$ and $\hat{σ}$. Equations (3) to (8) are the basis of the EM iterative procedure for finding $\hat{μ}$ and $\hat{σ}^{2}$. The procedure consists of alternately estimating $\hat{μ}$ and $\hat{σ}^{2}$ using (7) and (8) and estimating $\{\hat{w}_{i}\}$ using (3) to (6).
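A rough sketch of the alternating procedure follows, using the `ic` censoring codes to pick the case for each observation. The names `em_step` and `em_fit` and the stopping rule are assumptions made for this illustration, and it is not the library's implementation. In particular, the new mean is used inside the sum of squares for the variance update, which is one reasonable reading of the update formulas.

```python
import math

SQRT2 = math.sqrt(2.0)


def Z(t):
    """Standard Normal density."""
    return math.exp(-0.5 * t * t) / math.sqrt(2.0 * math.pi)


def Q(t):
    """Upper tail probability of the standard Normal distribution."""
    return 0.5 * math.erfc(t / SQRT2)


def S(t):
    return Z(t) / Q(t)


def em_step(mu, sigma, x, xc, ic):
    """One expectation step followed by one maximization step."""
    w, denom = [], 0.0
    for xi, xci, code in zip(x, xc, ic):
        if code == 0:                       # exact: w = x, contributes to r
            w.append(xi)
            denom += 1.0
        elif code == 1:                     # right-censored at L = xi
            l = (xi - mu) / sigma
            w.append(mu + sigma * S(l))
            denom += S(l) * (S(l) - l)      # T(l)
        elif code == 2:                     # left-censored at U = xi
            u = (xi - mu) / sigma
            w.append(mu - sigma * S(-u))
            denom += S(-u) * (S(-u) + u)    # T(-u)
        elif xi != xci:                     # interval-censored
            l = (min(xi, xci) - mu) / sigma
            u = (max(xi, xci) - mu) / sigma
            p = Q(l) - Q(u)                 # equals P(u) - P(l)
            s1 = (Z(l) - Z(u)) / p
            s2 = (u * Z(u) - l * Z(l)) / p
            w.append(mu + sigma * s1)
            denom += s1 * s1 + s2           # T1(l, u)
    new_mu = sum(w) / len(w)
    new_sigma = math.sqrt(sum((wi - new_mu) ** 2 for wi in w) / denom)
    return new_mu, new_sigma


def em_fit(x, xc, ic, mu, sigma, tol=5e-6, maxit=200):
    """Iterate em_step until the relative changes fall below tol."""
    for iteration in range(1, maxit + 1):
        new_mu, new_sigma = em_step(mu, sigma, x, xc, ic)
        converged = (abs(new_mu - mu) <= tol * abs(new_mu)
                     and abs(new_sigma - sigma) <= tol * abs(new_sigma))
        mu, sigma = new_mu, new_sigma
        if converged:
            return mu, sigma, iteration
    raise RuntimeError("EM sketch did not converge")


mu_hat, sigma_hat, n_iter = em_fit([1.0, 2.0, 3.0, 4.0, 5.0],
                                   [0.0] * 5, [0] * 5, 0.0, 1.0)
```

When every observation is exact, a single step already gives the sample mean and root mean squared deviation, which is a convenient sanity check.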

In choosing between the two methods a general rule is that the Newton–Raphson method converges more quickly but requires good initial estimates, whereas the EM algorithm converges slowly but is robust to the initial values. In the case of the censored Normal distribution, if only a small proportion of the observations are censored then estimates based on the exact observations should give good enough initial estimates for the Newton–Raphson method to be used. If there is a high proportion of censored observations then the EM algorithm should be used, and if high accuracy is required the subsequent use of the Newton–Raphson method to refine the estimates obtained from the EM algorithm should be considered.
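The two-stage strategy can be sketched as below. The estimator is passed in as an argument so that the snippet does not depend on `naginterfaces` being installed; in practice it would be `naginterfaces.library.univar.estim_normal`. The helper name `em_then_newton` is hypothetical, and the code assumes the first two returned values are `xmu` and `xsig`, in the order listed under Returns.

```python
def em_then_newton(estimator, x, xc, ic, tol=0.0, maxit=0):
    """Run the EM algorithm first, then refine with Newton-Raphson.

    estimator is any callable with the estim_normal argument order:
    (method, x, xc, ic, xmu, xsig, tol, maxit).
    """
    # Stage 1: EM with internally calculated starting values (xsig <= 0.0).
    em_result = estimator('E', x, xc, ic, 0.0, 0.0, tol, maxit)
    em_mu, em_sigma = em_result[0], em_result[1]
    # Stage 2: Newton-Raphson started from the EM estimates (xsig > 0.0).
    return estimator('N', x, xc, ic, em_mu, em_sigma, tol, maxit)
```

Because the EM algorithm converges slowly, the default iteration limit may be too small for the first stage; a larger `maxit` can be passed if the first call reports non-convergence.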

References

Dempster, A P, Laird, N M and Rubin, D B, 1977, Maximum likelihood from incomplete data via the EM algorithm (with discussion), J. Roy. Statist. Soc. Ser. B (39), 1–38

Swan, A V, 1969, Algorithm AS 16. Maximum likelihood estimation from grouped and censored normal data, Appl. Statist. (18), 110–114

Wolynetz, M S, 1979, Maximum likelihood estimation from confined and censored normal data, Appl. Statist. (28), 185–195
