NAG Library Routine Document

g07bef  (estim_weibull)


1
Purpose

g07bef computes maximum likelihood estimates for the parameters of the Weibull distribution from data which may be right-censored.

2
Specification

Fortran Interface
Subroutine g07bef ( cens, n, x, ic, beta, gamma, tol, maxit, sebeta, segam, corr, dev, nit, wk, ifail)
Integer, Intent (In):: n, ic(*), maxit
Integer, Intent (Inout):: ifail
Integer, Intent (Out):: nit
Real (Kind=nag_wp), Intent (In):: x(n), tol
Real (Kind=nag_wp), Intent (Inout):: gamma
Real (Kind=nag_wp), Intent (Out):: beta, sebeta, segam, corr, dev, wk(n)
Character (1), Intent (In):: cens
C Header Interface
#include <nagmk26.h>
void  g07bef_ ( const char *cens, const Integer *n, const double x[], const Integer ic[], double *beta, double *gamma, const double *tol, const Integer *maxit, double *sebeta, double *segam, double *corr, double *dev, Integer *nit, double wk[], Integer *ifail, const Charlen length_cens)

3
Description

g07bef computes maximum likelihood estimates of the parameters of the Weibull distribution from exact or right-censored data.
For $n$ realizations, $y_i$, from a Weibull distribution a value $x_i$ is observed such that
$$x_i \le y_i.$$
There are two situations:
(a) exactly specified observations, when $x_i = y_i$
(b) right-censored observations, known by a lower bound, when $x_i < y_i$.
The probability density function of the Weibull distribution, and hence the contribution of an exactly specified observation to the likelihood, is given by:
$$f(x;\lambda,\gamma) = \lambda\gamma x^{\gamma-1}\exp\left(-\lambda x^{\gamma}\right), \quad x>0, \quad \text{for } \lambda,\gamma>0;$$
while the survival function of the Weibull distribution, and hence the contribution of a right-censored observation to the likelihood, is given by:
$$S(x;\lambda,\gamma) = \exp\left(-\lambda x^{\gamma}\right), \quad x>0, \quad \text{for } \lambda,\gamma>0.$$
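As a minimal illustration of the two likelihood contributions above, the following plain-Python sketch evaluates the density and the survival function directly. The function names `weibull_pdf` and `weibull_survival` are hypothetical and are not part of the NAG Library.

```python
import math

def weibull_pdf(x, lam, gam):
    """Density f(x; lambda, gamma) for x > 0 and lambda, gamma > 0."""
    return lam * gam * x ** (gam - 1.0) * math.exp(-lam * x ** gam)

def weibull_survival(x, lam, gam):
    """Survival function S(x; lambda, gamma) = exp(-lambda * x**gamma)."""
    return math.exp(-lam * x ** gam)
```

An exactly specified observation contributes `weibull_pdf(x, lam, gam)` to the likelihood, and a right-censored one contributes `weibull_survival(x, lam, gam)`.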
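If $d$ of the $n$ observations are exactly specified and indicated by $i \in D$ and the remaining $(n-d)$ are right-censored, then the likelihood function, $\mathrm{Like}(\lambda,\gamma)$, is given by
$$\mathrm{Like}(\lambda,\gamma) \propto (\lambda\gamma)^{d} \left( \prod_{i \in D} x_i^{\gamma-1} \right) \exp\left( -\lambda \sum_{i=1}^{n} x_i^{\gamma} \right).$$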
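To avoid possible numerical instability a different parameterisation $\beta,\gamma$ is used, with $\beta = \log(\lambda)$. The kernel log-likelihood function, $L(\beta,\gamma)$, is then:
$$L(\beta,\gamma) = d\log(\gamma) + d\beta + (\gamma-1)\sum_{i \in D}\log(x_i) - e^{\beta}\sum_{i=1}^{n} x_i^{\gamma}.$$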
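The kernel log-likelihood can be evaluated directly from its definition. The sketch below is an illustration only, not the routine's implementation; the name `kernel_loglik` is hypothetical, and the censoring codes follow the convention of the argument ic (0 for exact, 1 for right-censored).

```python
import math

def kernel_loglik(beta, gam, x, ic=None):
    """Kernel log-likelihood L(beta, gamma) for observations x with codes ic."""
    if ic is None:
        ic = [0] * len(x)                  # no censoring: every observation is exact
    d = sum(1 for c in ic if c == 0)       # number of exactly specified observations
    sum_log_exact = sum(math.log(xi) for xi, c in zip(x, ic) if c == 0)
    sum_pow = sum(xi ** gam for xi in x)   # runs over all n observations
    return (d * math.log(gam) + d * beta
            + (gam - 1.0) * sum_log_exact
            - math.exp(beta) * sum_pow)
```

For example, `kernel_loglik(0.0, 1.0, [1.0, 2.0], [0, 1])` reduces to minus the sum of the observations, since every other term vanishes when beta is 0 and gamma is 1.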
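If the derivatives $\frac{\partial L}{\partial \beta}$, $\frac{\partial L}{\partial \gamma}$, $\frac{\partial^2 L}{\partial \beta^2}$, $\frac{\partial^2 L}{\partial \beta \, \partial \gamma}$ and $\frac{\partial^2 L}{\partial \gamma^2}$ are denoted by $L_1$, $L_2$, $L_{11}$, $L_{12}$ and $L_{22}$, respectively, then the maximum likelihood estimates, $\hat{\beta}$ and $\hat{\gamma}$, are the solution to the equations:
$$L_1(\hat{\beta},\hat{\gamma}) = 0 \tag{1}$$
and
$$L_2(\hat{\beta},\hat{\gamma}) = 0 \tag{2}$$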
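The derivatives are not written out in this document. Differentiating the kernel log-likelihood $L(\beta,\gamma)$ term by term gives the following explicit forms, shown here as a worked derivation rather than as a statement of how the routine evaluates them:
$$L_1 = d - e^{\beta}\sum_{i=1}^{n} x_i^{\gamma}$$
$$L_2 = \frac{d}{\gamma} + \sum_{i \in D}\log(x_i) - e^{\beta}\sum_{i=1}^{n} x_i^{\gamma}\log(x_i)$$
$$L_{11} = -e^{\beta}\sum_{i=1}^{n} x_i^{\gamma}$$
$$L_{12} = -e^{\beta}\sum_{i=1}^{n} x_i^{\gamma}\log(x_i)$$
$$L_{22} = -\frac{d}{\gamma^{2}} - e^{\beta}\sum_{i=1}^{n} x_i^{\gamma}\left(\log(x_i)\right)^{2}$$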
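Estimates of the asymptotic standard errors of $\hat{\beta}$ and $\hat{\gamma}$ are given by:
$$\mathrm{se}(\hat{\beta}) = \sqrt{\frac{-L_{22}}{L_{11}L_{22} - L_{12}^{2}}}, \quad \mathrm{se}(\hat{\gamma}) = \sqrt{\frac{-L_{11}}{L_{11}L_{22} - L_{12}^{2}}}.$$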
An estimate of the correlation coefficient of $\hat{\beta}$ and $\hat{\gamma}$ is given by:
$$\frac{L_{12}}{\sqrt{L_{11}L_{22}}}.$$
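The standard errors and the correlation follow from inverting the negated matrix of second derivatives. The sketch below assumes the three second derivatives have already been evaluated at the estimates and takes the correlation to be the one implied by that inverse, $L_{12}/\sqrt{L_{11}L_{22}}$; the function name `asymptotic_summary` is hypothetical.

```python
import math

def asymptotic_summary(l11, l12, l22):
    """Return (se_beta, se_gamma, corr) from second derivatives at the MLE."""
    det = l11 * l22 - l12 * l12          # determinant of the Hessian
    se_beta = math.sqrt(-l22 / det)      # square root of the (1,1) entry of (-H)^(-1)
    se_gamma = math.sqrt(-l11 / det)     # square root of the (2,2) entry of (-H)^(-1)
    corr = l12 / math.sqrt(l11 * l22)    # covariance divided by both standard errors
    return se_beta, se_gamma, corr
```

At a maximum both $L_{11}$ and $L_{22}$ are negative and the determinant is positive, so every square root has a positive argument.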
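Note: if an estimate of the original parameter $\lambda$ is required, then
$$\hat{\lambda} = \exp(\hat{\beta}) \quad \text{and} \quad \mathrm{se}(\hat{\lambda}) = \hat{\lambda}\,\mathrm{se}(\hat{\beta}).$$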
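The back-transformation in the note is a first-order (delta method) approximation and needs only the two outputs beta and sebeta. A minimal sketch, with the hypothetical helper name `lambda_estimate`:

```python
import math

def lambda_estimate(beta_hat, se_beta):
    """Return (lambda_hat, se_lambda) from beta_hat and its standard error."""
    lam_hat = math.exp(beta_hat)
    return lam_hat, lam_hat * se_beta
```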
The equations (1) and (2) are solved by the Newton–Raphson iterative method with adjustments made to ensure that $\hat{\gamma} > 0.0$.
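The iteration can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the routine's implementation: the function name `fit_weibull` is hypothetical, the derivatives are obtained by differentiating the kernel log-likelihood, the safeguard that keeps gamma positive is a simple step halving (the document does not say which adjustment the routine uses), and the starting value gamma = 1.0 is arbitrary. The data are the relief times from Section 10.

```python
import math

def fit_weibull(x, ic=None, gam=1.0, tol=5e-6, maxit=25):
    """Newton-Raphson for (beta, gamma); returns (beta, gamma, iterations used)."""
    if ic is None:
        ic = [0] * len(x)                 # no censoring: every observation is exact
    logs = [math.log(xi) for xi in x]
    d = sum(1 for c in ic if c == 0)      # number of exact observations
    s_log = sum(lg for lg, c in zip(logs, ic) if c == 0)
    # Starting beta solves L1 = 0 for the starting gamma.
    beta = math.log(d / sum(xi ** gam for xi in x))
    for nit in range(1, maxit + 1):
        e = math.exp(beta)
        pw = [xi ** gam for xi in x]
        s0 = sum(pw)
        s1 = sum(p * lg for p, lg in zip(pw, logs))
        s2 = sum(p * lg * lg for p, lg in zip(pw, logs))
        l1 = d - e * s0                   # first derivatives
        l2 = d / gam + s_log - e * s1
        l11, l12, l22 = -e * s0, -e * s1, -d / gam ** 2 - e * s2
        det = l11 * l22 - l12 * l12
        if det == 0.0:
            raise ArithmeticError("singular Hessian")
        # Newton step: solve H * (db, dg) = -(l1, l2) for the 2x2 Hessian H.
        db = -(l22 * l1 - l12 * l2) / det
        dg = -(l11 * l2 - l12 * l1) / det
        # Assumed safeguard: halve the step until gamma stays positive.
        while gam + dg <= 0.0:
            db, dg = 0.5 * db, 0.5 * dg
        beta, gam = beta + db, gam + dg
        # Converged when both relative changes are within tol.
        if abs(db) <= tol * abs(beta) and abs(dg) <= tol * abs(gam):
            return beta, gam, nit
    raise RuntimeError("no convergence within maxit iterations")

relief = [1.1, 1.4, 1.3, 1.7, 1.9, 1.8, 1.6, 2.2, 1.7, 2.7,
          4.1, 1.8, 1.5, 1.2, 1.4, 3.0, 1.7, 2.3, 1.6, 2.0]
beta_hat, gamma_hat, nit = fit_weibull(relief)
```

The defaults for `tol` and `maxit` mirror the values the routine substitutes when tol=0.0 or maxit is not positive.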

4
References

Gross A J and Clark V A (1975) Survival Distributions: Reliability Applications in the Biomedical Sciences Wiley
Kalbfleisch J D and Prentice R L (1980) The Statistical Analysis of Failure Time Data Wiley

5
Arguments

1:     cens – Character(1)     Input
On entry: indicates whether the data is censored or non-censored.
cens='N'
Each observation is assumed to be exactly specified. ic is not referenced.
cens='C'
Each observation is censored according to the value contained in ic(i), for $i = 1, 2, \ldots, n$.
Constraint: cens='N' or 'C'.
2:     n – Integer     Input
On entry: $n$, the number of observations.
Constraint: $n \ge 1$.
3:     x(n) – Real (Kind=nag_wp) array     Input
On entry: x(i) contains the $i$th observation, $x_i$, for $i = 1, 2, \ldots, n$.
Constraint: x(i) > 0.0, for $i = 1, 2, \ldots, n$.
4:     ic(*) – Integer array     Input
Note: the dimension of the array ic must be at least n if cens='C', and at least 1 otherwise.
On entry: if cens='C', ic(i) contains the censoring codes for the $i$th observation, for $i = 1, 2, \ldots, n$.
If ic(i)=0, the $i$th observation is exactly specified.
If ic(i)=1, the $i$th observation is right-censored.
If cens='N', ic is not referenced.
Constraint: if cens='C', then ic(i)=0 or 1, for $i = 1, 2, \ldots, n$.
5:     beta – Real (Kind=nag_wp)     Output
On exit: the maximum likelihood estimate, $\hat{\beta}$, of $\beta$.
6:     gamma – Real (Kind=nag_wp)     Input/Output
On entry: indicates whether an initial estimate of $\gamma$ is provided.
If gamma > 0.0, it is taken as the initial estimate of $\gamma$ and an initial estimate of $\beta$ is calculated from this value of $\gamma$.
If gamma $\le$ 0.0, initial estimates of $\gamma$ and $\beta$ are calculated, internally, providing the data contains at least two distinct exact observations. (If there are only two distinct exact observations, the largest observation must not be exactly specified.) See Section 9 for further details.
On exit: contains the maximum likelihood estimate, $\hat{\gamma}$, of $\gamma$.
7:     tol – Real (Kind=nag_wp)     Input
On entry: the relative precision required for the final estimates of $\beta$ and $\gamma$. Convergence is assumed when the absolute relative changes in the estimates of both $\beta$ and $\gamma$ are less than tol.
If tol=0.0, a relative precision of 0.000005 is used.
Constraint: machine precision $\le$ tol $\le$ 1.0 or tol=0.0.
8:     maxit – Integer     Input
On entry: the maximum number of iterations allowed.
If maxit $\le$ 0, a value of 25 is used.
9:     sebeta – Real (Kind=nag_wp)     Output
On exit: an estimate of the standard error of $\hat{\beta}$.
10:   segam – Real (Kind=nag_wp)     Output
On exit: an estimate of the standard error of $\hat{\gamma}$.
11:   corr – Real (Kind=nag_wp)     Output
On exit: an estimate of the correlation between $\hat{\beta}$ and $\hat{\gamma}$.
12:   dev – Real (Kind=nag_wp)     Output
On exit: the maximized kernel log-likelihood, $L(\hat{\beta},\hat{\gamma})$.
13:   nit – Integer     Output
On exit: the number of iterations performed.
14:   wk(n) – Real (Kind=nag_wp) array     Workspace
15:   ifail – Integer     Input/Output
On entry: ifail must be set to 0, -1 or 1. If you are unfamiliar with this argument you should refer to Section 3.4 in How to Use the NAG Library and its Documentation for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value -1 or 1 is recommended. If the output of error messages is undesirable, then the value 1 is recommended. Otherwise, if you are not familiar with this argument, the recommended value is 0. When the value -1 or 1 is used it is essential to test the value of ifail on exit.
On exit: ifail=0 unless the routine detects an error or a warning has been flagged (see Section 6).

6
Error Indicators and Warnings

If on entry ifail=0 or -1, explanatory error messages are output on the current error message unit (as defined by x04aaf).
Errors or warnings detected by the routine:
ifail=1
On entry, cens $\ne$ 'N' or 'C',
or n < 1,
or tol < 0.0,
or 0.0 < tol < machine precision,
or tol > 1.0.
ifail=2
On entry, the $i$th observation, x(i) $\le$ 0.0, for some $i = 1, 2, \ldots, n$,
or the $i$th censoring code, ic(i) $\ne$ 0 or 1, for some $i = 1, 2, \ldots, n$ and cens='C'.
ifail=3
On entry, there are no exactly specified observations, or the routine was requested to calculate initial values and there are either fewer than two distinct exactly specified observations or there are exactly two and the largest observation is one of the exact observations.
ifail=4
The method has failed to converge in maxit iterations. You should increase tol or maxit.
ifail=5
Process has diverged. The process is deemed divergent if three successive increments of β or γ increase or if the Hessian matrix of the Newton–Raphson process is singular. Either different initial estimates should be provided or the data should be checked to see if the Weibull distribution is appropriate.
ifail=6
A potential overflow has been detected. This is an unlikely exit usually caused by a large input estimate of γ.
ifail=-99
An unexpected error has been triggered by this routine. Please contact NAG.
See Section 3.9 in How to Use the NAG Library and its Documentation for further information.
ifail=-399
Your licence key may have expired or may not have been installed correctly.
See Section 3.8 in How to Use the NAG Library and its Documentation for further information.
ifail=-999
Dynamic memory allocation failed.
See Section 3.7 in How to Use the NAG Library and its Documentation for further information.
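The entry checks behind ifail=1 and ifail=2 can be mirrored in calling code before the routine is invoked. The sketch below is an assumption-laden illustration: `check_inputs` is a hypothetical helper, and `sys.float_info.epsilon` is used only as a stand-in for the library's machine precision, which may be defined differently.

```python
import sys

def check_inputs(cens, n, x, ic, tol):
    """Return 0 if the inputs pass, otherwise the ifail value (1 or 2) listed above."""
    eps = sys.float_info.epsilon  # assumption: stand-in for NAG "machine precision"
    if cens not in ("N", "C") or n < 1:
        return 1
    if tol < 0.0 or 0.0 < tol < eps or tol > 1.0:
        return 1
    if any(xi <= 0.0 for xi in x[:n]):
        return 2
    # Censoring codes are only inspected when cens='C'.
    if cens == "C" and any(c not in (0, 1) for c in ic[:n]):
        return 2
    return 0
```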

7
Accuracy

Provided that the Weibull distribution is a suitable model for the data and that the initial values are reasonable, convergence to the required accuracy, indicated by tol, should be achieved.

8
Parallelism and Performance

g07bef is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the Users' Note for your implementation for any additional implementation-specific information.

9
Further Comments

The initial estimate of $\gamma$ is found by calculating a Kaplan–Meier estimate of the survival function, $\hat{S}(x)$, and estimating the gradient of the plot of $\log(-\log(\hat{S}(x)))$ against $\log(x)$. This requires the Kaplan–Meier estimate to have at least two distinct points.
The initial estimate of $\hat{\beta}$, given a value of $\hat{\gamma}$, is calculated as
$$\hat{\beta} = \log\left( \frac{d}{\sum_{i=1}^{n} x_i^{\hat{\gamma}}} \right).$$
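Both starting values can be sketched in plain Python. The function names are hypothetical, and two details are assumptions rather than documented behaviour: the gradient is estimated by ordinary least squares, and it is taken against log x because the Weibull survival function satisfies log(-log S(x)) = β + γ log x. Points where the Kaplan–Meier estimate reaches zero are dropped, since log(-log 0) is undefined.

```python
import math

def initial_beta(x, ic, gam):
    """Starting beta for a given gamma: log(d / sum(x_i ** gamma))."""
    d = sum(1 for c in ic if c == 0)
    return math.log(d / sum(xi ** gam for xi in x))

def km_initial_gamma(x, ic):
    """Least-squares slope of log(-log(S_hat)) against log(x) over Kaplan-Meier points."""
    n_at_risk = len(x)
    surv = 1.0
    pts = []
    for t in sorted(set(x)):             # distinct observed values, increasing
        deaths = sum(1 for xi, c in zip(x, ic) if xi == t and c == 0)
        total = sum(1 for xi in x if xi == t)
        if deaths > 0:
            surv *= 1.0 - deaths / n_at_risk
            if 0.0 < surv < 1.0:         # keep only points where the transform exists
                pts.append((math.log(t), math.log(-math.log(surv))))
        n_at_risk -= total
    if len(pts) < 2:
        raise ValueError("need at least two usable Kaplan-Meier points")
    mx = sum(p[0] for p in pts) / len(pts)
    my = sum(p[1] for p in pts) / len(pts)
    sxy = sum((p[0] - mx) * (p[1] - my) for p in pts)
    sxx = sum((p[0] - mx) ** 2 for p in pts)
    return sxy / sxx
```

With only two distinct exact observations of which the larger is the largest observation overall, the second Kaplan–Meier point falls at zero and is dropped, which is consistent with the restriction stated for the argument gamma.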

10
Example

In a study, 20 patients receiving an analgesic to relieve headache pain had the following recorded relief times (in hours):
1.1 1.4 1.3 1.7 1.9 1.8 1.6 2.2 1.7 2.7 4.1 1.8 1.5 1.2 1.4 3.0 1.7 2.3 1.6 2.0  
(See Gross and Clark (1975).) This data is read in and a Weibull distribution fitted assuming no censoring; the parameter estimates and their standard errors are printed.

10.1
Program Text

Program Text (g07befe.f90)

10.2
Program Data

Program Data (g07befe.d)

10.3
Program Results

Program Results (g07befe.r)

© The Numerical Algorithms Group Ltd, Oxford, UK. 2017