naginterfaces.library.correg.mixeff_reml

naginterfaces.library.correg.mixeff_reml(dat, levels, yvid, cwid, fvid, fint, rvid, nvpr, vpr, rint, svid, gamma, lb, maxit=-1, tol=0.0, io_manager=None)

mixeff_reml fits a linear mixed effects regression model using restricted maximum likelihood (REML).

Deprecated since version 27.0.0.0: mixeff_reml is deprecated. Please use lmm_init() followed by lmm_fit() instead. See also the Replacement Calls document.

For full information please refer to the NAG Library document for g02ja:

https://support.nag.com/numeric/nl/nagdoc_30.3/flhtml/g02/g02jaf.html

Parameters

dat : float, array-like, shape $(n, ncol)$

Array containing all of the data. For the $i$th observation:

$dat[i-1, yvid-1]$ holds the dependent variable, $y$;

if $cwid \neq 0$, $dat[i-1, cwid-1]$ holds the case weights;

if $svid \neq 0$, $dat[i-1, svid-1]$ holds the subject variable.

The remaining columns hold the values of the independent variables.

levels : int, array-like, shape $(ncol)$

$levels[i-1]$ contains the number of levels associated with the $i$th variable of the data matrix $dat$. If this variable is continuous or binary (i.e., only takes the values zero or one), $levels[i-1]$ should be $1$; if the variable is discrete, $levels[i-1]$ is the number of levels associated with it and $dat[j-1, i-1]$ is assumed to take the values $1$ to $levels[i-1]$, for $j = 1, 2, \dots, n$.
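The coding rule for discrete columns can be checked before calling the routine. The following is a minimal sketch, assuming NumPy is available; the helper name `check_levels` and the small data set are hypothetical and are not part of the library.

```python
import numpy as np

def check_levels(dat, levels):
    """Return the 1-based columns that break the 1..levels[i-1] coding rule.

    Only columns declared as discrete (levels > 1) are checked.
    """
    dat = np.asarray(dat, dtype=float)
    bad = []
    for i, nlev in enumerate(levels, start=1):
        if nlev > 1:
            col = dat[:, i - 1]
            is_integer = np.all(col == np.round(col))
            in_range = col.min() >= 1 and col.max() <= nlev
            if not (is_integer and in_range):
                bad.append(i)
    return bad

# Hypothetical data: column 1 = response, column 2 = 3-level factor,
# column 3 = continuous covariate.
dat = np.array([
    [10.2, 1.0, 0.5],
    [11.1, 2.0, 1.5],
    [9.8, 3.0, 2.5],
    [10.7, 2.0, 3.5],
])
levels = [1, 3, 1]
bad_columns = check_levels(dat, levels)
```

An empty list means every discrete column is coded consistently with `levels`.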

yvid : int

The column of $dat$ holding the dependent, $y$, variable.

cwid : int

The column of $dat$ holding the case weights.

If $cwid = 0$, no weights are used.

fvid : int, array-like, shape $(nfv)$

The columns of the data matrix $dat$ holding the fixed independent variables, with $fvid[i-1]$ holding the column number corresponding to the $i$th fixed variable.

fint : int

Flag indicating whether a fixed intercept is included: $fint = 1$ includes it and $fint = 0$ omits it.

rvid : int, array-like, shape $(nrv)$

The columns of the data matrix $dat$ holding the random independent variables, with $rvid[i-1]$ holding the column number corresponding to the $i$th random variable.

nvpr : int

If $rint = 1$ and $svid \neq 0$, $nvpr$ is the number of variance components being estimated minus $2$, that is $g - 1$; otherwise $nvpr = g$.

If $nrv = 0$, $nvpr$ is not referenced.

vpr : int, array-like, shape $(nrv)$

$vpr[i-1]$ holds a flag indicating the variance of the $i$th random variable. The variance of the $i$th random variable is $σ_{j}^{2}$, where $j = vpr[i-1] + 1$ if $rint = 1$ and $svid \neq 0$, and $j = vpr[i-1]$ otherwise. Random variables with the same value of $j$ are assumed to be taken from the same distribution.
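The mapping from the flags in `vpr` to the variance index $j$ can be written out directly. This is a small sketch of the rule stated above; the function name `variance_index` is hypothetical.

```python
def variance_index(vpr, rint, svid):
    """Map each random variable to the index j of its variance sigma_j^2.

    j = vpr[i-1] + 1 when rint = 1 and svid != 0, and j = vpr[i-1] otherwise.
    """
    offset = 1 if (rint == 1 and svid != 0) else 0
    return [v + offset for v in vpr]

vpr = [1, 1, 2]
with_subject_intercept = variance_index(vpr, rint=1, svid=4)
without_subject = variance_index(vpr, rint=0, svid=0)
```

In this example the first two random variables share a variance in both cases, because they carry the same flag.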

rint : int

Flag indicating whether a random intercept is included: $rint = 1$ includes it and $rint = 0$ omits it.

If $svid = 0$, $rint$ is not referenced.

svid : int

The column of $dat$ holding the subject variable.

If $svid = 0$, no subject variable is used.

Specifying a subject variable is equivalent to specifying the interaction between that variable and all of the random effects.

Letting the notation $Z_{1} \times Z_{S}$ denote the interaction between variables $Z_{1}$ and $Z_{S}$, fitting a model with $rint = 0$, random effects $Z_{1} + Z_{2}$ and subject variable $Z_{S}$ is equivalent to fitting a model with random effects $Z_{1} \times Z_{S} + Z_{2} \times Z_{S}$ and no subject variable.

If $rint = 1$, the model is equivalent to fitting $Z_{S} + Z_{1} \times Z_{S} + Z_{2} \times Z_{S}$ and no subject variable.
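To make the interaction notation concrete, the sketch below builds indicator columns for a random variable and a subject variable and forms their interaction as all pairwise column products. It assumes NumPy; the helper names and the column ordering are illustrative assumptions and need not match the ordering used internally by the routine.

```python
import numpy as np

def indicator(codes, nlevels):
    """n x nlevels 0/1 matrix built from 1-based level codes."""
    codes = np.asarray(codes, dtype=int)
    out = np.zeros((codes.size, nlevels))
    out[np.arange(codes.size), codes - 1] = 1.0
    return out

def interaction(a, b):
    """Row-wise products of every column of a with every column of b."""
    return np.einsum("ij,ik->ijk", a, b).reshape(a.shape[0], -1)

# Hypothetical codes: a 2-level random variable Z1 and a 2-level subject ZS.
z1 = indicator([1, 2, 1, 2], 2)
zs = indicator([1, 1, 2, 2], 2)
z1_x_zs = interaction(z1, zs)  # one column per (Z1 level, ZS level) pair
```

Each observation falls in exactly one (level, subject) cell, so every row of `z1_x_zs` contains a single one.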

gamma : float, array-like, shape $(nvpr + 2)$

Holds the initial values of the variance components, $γ_{0}$, with $gamma[i-1]$ the initial value for $σ_{i}^{2} / σ_{R}^{2}$, for $i = 1, 2, \dots, g$. If $rint = 1$ and $svid \neq 0$, $g = nvpr + 1$, else $g = nvpr$.

If $gamma[0] = -1.0$, the remaining elements of $gamma$ are ignored and the initial values for the variance components are estimated from the data using MIVQUE0.

lb : int

The size of the array $b$.

maxit : int, optional

The maximum number of iterations.

If $maxit < 0$, the default value of $100$ is used.

If $maxit = 0$, the parameter estimates $(β, ν)$ and corresponding standard errors are calculated based on the value of $γ_{0}$ supplied in $gamma$.

tol : float, optional

The tolerance used to assess convergence.

If $tol \leq 0.0$, the default value of $ϵ^{0.7}$ is used, where $ϵ$ is the machine precision.
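The defaulting rules for `maxit` and `tol` can be sketched as follows. This assumes that $ϵ$ is NumPy's double precision epsilon; the library's own machine-precision constant may be defined differently, so the number below is indicative only. The function name `resolve_controls` is hypothetical.

```python
import numpy as np

def resolve_controls(maxit=-1, tol=0.0):
    """Apply the documented defaults: 100 iterations and eps**0.7."""
    eps = np.finfo(np.float64).eps  # assumption: stands in for machine precision
    effective_maxit = 100 if maxit < 0 else maxit
    effective_tol = eps ** 0.7 if tol <= 0.0 else tol
    return effective_maxit, effective_tol

default_maxit, default_tol = resolve_controls()
user_maxit, user_tol = resolve_controls(maxit=0, tol=1e-6)
```

Under this assumption the default tolerance is of the order of $10^{-11}$.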

io_manager : FileObjManager, optional

Manager for I/O in this routine.
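As an illustration of how the arguments fit together, the sketch below assembles them for a hypothetical data set with one fixed factor, one random slope, a random intercept and a subject variable. The data values are made up, `lb` is set to a deliberately generous value because its minimum size is not stated in this section, and the call itself is left as a comment since it needs an installed and licensed NAG Library for Python.

```python
import numpy as np

# Hypothetical data, 6 observations:
# column 1 = response y, column 2 = 2-level treatment,
# column 3 = continuous covariate, column 4 = 3-level subject.
dat = np.array([
    [5.1, 1.0, 0.2, 1.0],
    [5.9, 2.0, 0.4, 1.0],
    [4.8, 1.0, 0.1, 2.0],
    [6.3, 2.0, 0.9, 2.0],
    [5.0, 1.0, 0.5, 3.0],
    [6.1, 2.0, 0.7, 3.0],
])
levels = [1, 2, 1, 3]

args = dict(
    dat=dat,
    levels=levels,
    yvid=1,                  # response in column 1
    cwid=0,                  # no case weights
    fvid=[2],                # treatment as a fixed effect
    fint=1,                  # include a fixed intercept
    rvid=[3],                # covariate as a random effect
    nvpr=1,                  # g - 1, with g = 2 (rint = 1 and svid != 0)
    vpr=[1],                 # variance flag for the single random variable
    rint=1,                  # include a random intercept
    svid=4,                  # subject variable in column 4
    gamma=[-1.0, 0.0, 0.0],  # gamma[0] = -1.0 requests MIVQUE0 starting values
    lb=50,                   # assumption: generous size for b
)

# With the library available, the call would look like:
# from naginterfaces.library import correg
# fit = correg.mixeff_reml(**args)
```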

Returns

gamma : float, ndarray, shape $(nvpr + 2)$

$gamma[i-1]$, for $i = 1, 2, \dots, g$, holds the final estimate of $σ_{i}^{2}$ and $gamma[g]$ holds the final estimate for $σ_{R}^{2}$.
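Note that the input `gamma` holds ratios $σ_{i}^{2} / σ_{R}^{2}$ while the returned `gamma` holds the variances themselves. A minimal sketch of converting returned values back to ratios, for example to reuse them as starting values, is given below; the function name and the numbers are hypothetical.

```python
import numpy as np

def ratios_from_fit(gamma_out, g):
    """Convert returned variances into the ratios sigma_i^2 / sigma_R^2."""
    gamma_out = np.asarray(gamma_out, dtype=float)
    sigma2 = gamma_out[:g]     # sigma_1^2, ..., sigma_g^2
    sigma2_r = gamma_out[g]    # residual variance sigma_R^2
    return sigma2 / sigma2_r

gamma_out = [0.8, 0.2, 0.4]    # hypothetical fitted values with g = 2
ratios = ratios_from_fit(gamma_out, g=2)
```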

nff : int

The number of fixed effects estimated (i.e., the number of columns, $p$, in the design matrix $X$).

nrf : int

The number of random effects estimated (i.e., the number of columns, $q$, in the design matrix $Z$).

df : int

The degrees of freedom.

reml : float

$-2 l_{R}(\hat{γ})$, where $l_{R}$ is the log of the restricted maximum likelihood calculated at $\hat{γ}$, the estimated variance components returned in $gamma$.

b : float, ndarray, shape $(lb)$

The parameter estimates, $(β, ν)$, with the first $nff$ elements of $b$ containing the fixed effect parameter estimates, $β$, and the next $nrf$ elements of $b$ containing the random effect parameter estimates, $ν$.

Fixed effects

If $fint = 1$, $b[0]$ contains the estimate of the fixed intercept.

Let $L_{i}$ denote the number of levels associated with the $i$th fixed variable, that is $L_{i} = levels[fvid[i-1]-1]$.

Define

if $fint = 1$, $F_{1} = 2$, else if $fint = 0$, $F_{1} = 1$;

$F_{i+1} = F_{i} + \max(L_{i} - 1, 1)$, $i \geq 1$.

Then for $i = 1, 2, \dots, nfv$:

if $L_{i} > 1$, $b[F_{i} + j - 2 - 1]$ contains the parameter estimate for the $j$th level of the $i$th fixed variable, for $j = 2, 3, \dots, L_{i}$;

if $L_{i} \leq 1$, $b[F_{i} - 1]$ contains the parameter estimate for the $i$th fixed variable.
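The fixed-effect layout above can be turned into zero-based positions in `b` with a short loop. This is a sketch of the recursion as stated, not library code; the function name is hypothetical, and taking the number of fixed effects to be $F_{nfv+1} - 1$ is an inference from the layout rather than something stated explicitly here.

```python
def fixed_effect_layout(levels, fvid, fint):
    """Return (nff, slots), where slots[i] lists the 0-based positions in b
    that hold the estimates for the (i+1)th fixed variable."""
    start = 2 if fint == 1 else 1          # F_1, 1-based
    slots = []
    for col in fvid:
        n_levels = levels[col - 1]         # L_i
        width = max(n_levels - 1, 1)       # F_{i+1} - F_i
        first = start - 1                  # 0-based position of the first estimate
        slots.append(list(range(first, first + width)))
        start += width
    nff = start - 1                        # assumption: inferred from the layout
    return nff, slots

# Hypothetical set-up: column 2 is a 3-level factor, column 3 is continuous.
levels = [1, 3, 1, 4]
nff, slots = fixed_effect_layout(levels, fvid=[2, 3], fint=1)
```

Here `b[0]` is the intercept, `b[1]` and `b[2]` hold levels 2 and 3 of the factor, and `b[3]` holds the continuous covariate.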

Random effects

Redefine $L_{i}$ to denote the number of levels associated with the $i$th random variable, that is $L_{i} = levels[rvid[i-1]-1]$.

Define

if $rint = 1$, $R_{1} = 2$, else if $rint = 0$, $R_{1} = 1$;

$R_{i+1} = R_{i} + L_{i}$, $i \geq 1$.

Then for $i = 1, 2, \dots, nrv$:

if $svid = 0$,

if $L_{i} > 1$, $b[nff + R_{i} + j - 1 - 1]$ contains the parameter estimate for the $j$th level of the $i$th random variable, for $j = 1, 2, \dots, L_{i}$;

if $L_{i} \leq 1$, $b[nff + R_{i} - 1]$ contains the parameter estimate for the $i$th random variable;

if $svid \neq 0$,

let $L_{S}$ denote the number of levels associated with the subject variable, that is $L_{S} = levels[svid-1]$;

if $L_{i} > 1$, $b[nff + (s - 1) L_{S} + R_{i} + j - 1 - 1]$ contains the parameter estimate for the interaction between the $s$th level of the subject variable and the $j$th level of the $i$th random variable, for $j = 1, 2, \dots, L_{i}$, for $s = 1, 2, \dots, L_{S}$;

if $L_{i} \leq 1$, $b[nff + (s - 1) L_{S} + R_{i} - 1]$ contains the parameter estimate for the interaction between the $s$th level of the subject variable and the $i$th random variable, for $s = 1, 2, \dots, L_{S}$;

if $rint = 1$, $b[nff]$ contains the estimate of the random intercept.
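For the case without a subject variable, the random-effect positions in `b` follow from the $R_{i}$ recursion. The sketch below covers only $svid = 0$ and follows the definitions literally; the function name is hypothetical. Because $rint$ is documented as not referenced when $svid = 0$, the example uses $rint = 0$.

```python
def random_effect_layout_no_subject(levels, rvid, rint, nff):
    """Return slots, where slots[i] lists the 0-based positions in b that
    hold the estimates for the (i+1)th random variable (svid = 0 only)."""
    start = 2 if rint == 1 else 1          # R_1, 1-based
    slots = []
    for col in rvid:
        n_levels = levels[col - 1]         # L_i
        width = n_levels if n_levels > 1 else 1
        first = nff + start - 1            # b[nff + R_i - 1] is the first slot
        slots.append(list(range(first, first + width)))
        start += n_levels                  # R_{i+1} = R_i + L_i
    return slots

# Hypothetical set-up: column 4 is a 4-level random factor, nff = 4.
levels = [1, 3, 1, 4]
random_slots = random_effect_layout_no_subject(levels, rvid=[4], rint=0, nff=4)
```

Unlike the fixed effects, every level of a discrete random variable gets its own slot, so a 4-level factor occupies four consecutive elements.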

se : float, ndarray, shape $(lb)$

The standard errors of the parameter estimates given in $b$.

warn : int

Is set to $1$ if a variance component was estimated to be a negative value during the fitting process. Otherwise $warn$ is set to $0$.

If $warn = 1$, the negative estimate is set to zero and the estimation process is allowed to continue.

Raises

NagValueError

(errno $1$)

On entry, $n = ⟨value⟩$.

Constraint: $n \geq 1$.

(errno $1$)

On entry, $n = ⟨value⟩$.

Constraint: number of observations with nonzero weights must be greater than one.

(errno $1$)

On entry, $ncol = ⟨value⟩$.

Constraint: $ncol \geq 1$.

(errno $1$)

On entry, $nfv = ⟨value⟩$ and $ncol = ⟨value⟩$.

Constraint: $0 \leq nfv < ncol$.

(errno $1$)

On entry, $nrv = ⟨value⟩$ and $ncol = ⟨value⟩$.

Constraint: $0 \leq nrv < ncol$ and $nrv + rint > 0$.

(errno $1$)

On entry, $nvpr = ⟨value⟩$ and $nrv = ⟨value⟩$.

Constraint: $0 \leq nvpr \leq nrv$ and ($nrv \neq 0$ or $nvpr \geq 1$).

(errno $1$)

On entry, $yvid = ⟨value⟩$ and $ncol = ⟨value⟩$.

Constraint: $1 \leq yvid \leq ncol$.

(errno $1$)

On entry, $svid = ⟨value⟩$ and $ncol = ⟨value⟩$.

Constraint: $0 \leq svid \leq ncol$.

(errno $1$)

On entry, $cwid = ⟨value⟩$ and $ncol = ⟨value⟩$.

Constraint: $0 \leq cwid \leq ncol$ and any supplied weights must be $\geq 0.0$.

(errno $1$)

On entry, $fint = ⟨value⟩$.

Constraint: $fint = 0$ or $1$.

(errno $1$)

On entry, $rint = ⟨value⟩$.

Constraint: $rint = 0$ or $1$.

(errno $1$)

On entry, $lb$ too small: $lb = ⟨value⟩$.

(errno $2$)

On entry, $levels[i] < 1$, for at least one $i$.

(errno $2$)

On entry, $ncol = ⟨value⟩$.

Constraint: $1 \leq fvid[i] \leq ncol$, for all $i$.

(errno $2$)

On entry, $ncol = ⟨value⟩$.

Constraint: $1 \leq rvid[i] \leq ncol$, for all $i$.

(errno $2$)

On entry, $nvpr = ⟨value⟩$.

Constraint: $1 \leq vpr[i] \leq nvpr$, for all $i$.

(errno $2$)

On entry, invalid data: categorical variable with value greater than that specified in $levels$.

(errno $2$)

On entry, $gamma[i] < 0.0$, for at least one $i$.

(errno $3$)

Degrees of freedom $< 1$: $df = ⟨value⟩$.

(errno $4$)

Routine failed to converge to specified tolerance: $tol = ⟨value⟩$.

(errno $4$)

Routine failed to converge in $maxit$ iterations: $maxit = ⟨value⟩$.

Notes

mixeff_reml fits a model of the form:

$$y = X β + Z ν + ϵ$$

where

$y$ is a vector of $n$ observations on the dependent variable,

$X$ is a known $n \times p$ design matrix for the fixed independent variables,

$β$ is a vector of length $p$ of unknown fixed effects,

$Z$ is a known $n \times q$ design matrix for the random independent variables,

$ν$ is a vector of length $q$ of unknown random effects,

and

$ϵ$ is a vector of length $n$ of unknown random errors.

Both $ν$ and $ϵ$ are assumed to have a Gaussian distribution with expectation zero and

$$\mathrm{Var} \begin{bmatrix} ν \\ ϵ \end{bmatrix} = \begin{bmatrix} G & 0 \\ 0 & R \end{bmatrix}$$

where $R = σ_{R}^{2} I$, $I$ is the $n \times n$ identity matrix and $G$ is a diagonal matrix. It is assumed that the random variables, $Z$, can be subdivided into $g \leq q$ groups with each group being identically distributed with expectation zero and variance $σ_{i}^{2}$. The diagonal elements of matrix $G$, therefore, take one of the values $\{σ_{i}^{2} : i = 1, 2, \dots, g\}$, depending on which group the associated random variable belongs to.
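To make the model structure concrete, the sketch below simulates one data set from it with a single variance group ($g = 1$). It assumes NumPy; all sizes, parameter values and the seed are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p, q = 12, 2, 3

# Fixed-effects design: intercept plus one continuous covariate (n x p).
X = np.column_stack([np.ones(n), rng.normal(size=n)])

# Random-effects design: indicator columns for a q-level factor (n x q).
Z = np.zeros((n, q))
Z[np.arange(n), np.arange(n) % q] = 1.0

beta = np.array([1.0, 0.5])
sigma2_1, sigma2_r = 2.0, 1.0          # one random-effect variance, one residual variance

G = sigma2_1 * np.eye(q)               # diagonal, all entries from the same group
R = sigma2_r * np.eye(n)               # R = sigma_R^2 * I

nu = rng.normal(scale=np.sqrt(sigma2_1), size=q)    # random effects ~ N(0, G)
eps = rng.normal(scale=np.sqrt(sigma2_r), size=n)   # errors ~ N(0, R)
y = X @ beta + Z @ nu + eps
```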

The model, therefore, contains three sets of unknowns: the fixed effects, $β$, the random effects, $ν$, and a vector of $g + 1$ variance components, $γ$, where $γ = \{σ_{1}^{2}, σ_{2}^{2}, \dots, σ_{g-1}^{2}, σ_{g}^{2}, σ_{R}^{2}\}$. Rather than working directly with $γ$, mixeff_reml uses an iterative process to estimate $γ^{*} = \{σ_{1}^{2} / σ_{R}^{2}, σ_{2}^{2} / σ_{R}^{2}, \dots, σ_{g-1}^{2} / σ_{R}^{2}, σ_{g}^{2} / σ_{R}^{2}, 1\}$. Due to the iterative nature of the estimation, a set of initial values, $γ_{0}$, for $γ^{*}$ is required. mixeff_reml allows these initial values either to be supplied by you or calculated from the data using the minimum variance quadratic unbiased estimators (MIVQUE0) suggested by Rao (1972).

mixeff_reml fits the model using a quasi-Newton algorithm to maximize the restricted log-likelihood function:

$$-2 l_{R} = \log(|V|) + (n - p) \log(r^{\prime} V^{-1} r) + \log(|X^{\prime} V^{-1} X|) + (n - p) \left(1 + \log\left(\frac{2π}{n - p}\right)\right)$$

where

$$V = Z G Z^{\prime} + R, \quad r = y - X b \quad \text{and} \quad b = (X^{\prime} V^{-1} X)^{-1} X^{\prime} V^{-1} y.$$

Once the final estimates for $γ^{*}$ have been obtained, the value of $σ_{R}^{2}$ is given by:

$$σ_{R}^{2} = \frac{r^{\prime} V^{-1} r}{n - p}.$$

Case weights, $W_{c}$, can be incorporated into the model by replacing $X^{\prime} X$ and $Z^{\prime} Z$ with $X^{\prime} W_{c} X$ and $Z^{\prime} W_{c} Z$ respectively, for a diagonal weight matrix $W_{c}$.

The log-likelihood, $l_{R}$ , is calculated using the sweep algorithm detailed in Wolfinger et al. (1994).
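The expression for $-2 l_{R}$ given in these notes can be evaluated directly for small problems. The sketch below uses dense NumPy linear algebra rather than the sweep algorithm, so it illustrates the formula and not the routine's implementation; the function name and test data are hypothetical.

```python
import numpy as np

def neg2_restricted_loglik(y, X, V):
    """Evaluate -2 l_R, the implied sigma_R^2 and b for a given V."""
    n, p = X.shape
    vinv_x = np.linalg.solve(V, X)
    vinv_y = np.linalg.solve(V, y)
    xt_vinv_x = X.T @ vinv_x
    b = np.linalg.solve(xt_vinv_x, X.T @ vinv_y)   # generalized least squares estimate
    r = y - X @ b
    r_vinv_r = r @ np.linalg.solve(V, r)
    _, logdet_v = np.linalg.slogdet(V)
    _, logdet_xvx = np.linalg.slogdet(xt_vinv_x)
    value = (logdet_v
             + (n - p) * np.log(r_vinv_r)
             + logdet_xvx
             + (n - p) * (1.0 + np.log(2.0 * np.pi / (n - p))))
    sigma2_r = r_vinv_r / (n - p)
    return value, sigma2_r, b

# Hypothetical small problem: intercept plus trend, 2-level random factor.
rng = np.random.default_rng(1)
n, q = 10, 2
X = np.column_stack([np.ones(n), np.arange(n, dtype=float)])
Z = np.zeros((n, q))
Z[np.arange(n), np.arange(n) % q] = 1.0
V = Z @ (2.0 * np.eye(q)) @ Z.T + np.eye(n)
y = rng.normal(size=n)

value, sigma2_r, b = neg2_restricted_loglik(y, X, V)
value_scaled, sigma2_r_scaled, b_scaled = neg2_restricted_loglik(y, X, 3.0 * V)
```

Multiplying $V$ by a constant leaves the value of $-2 l_{R}$ and $b$ unchanged and only rescales the implied $σ_{R}^{2}$, which is consistent with the routine working with the ratios $γ^{*}$.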

References

Goodnight, J H, 1979, A tutorial on the SWEEP operator, The American Statistician (33(3)), 149–158

Harville, D A, 1977, Maximum likelihood approaches to variance component estimation and to related problems, J. Am. Stat. Assoc. (72), 320–340

Rao, C R, 1972, Estimation of variance and covariance components in a linear model, J. Am. Stat. Assoc. (67), 112–115

Stroup, W W, 1989, Predictable functions and prediction space in the mixed model procedure, Applications of Mixed Models in Agriculture and Related Disciplines (Southern Cooperative Series Bulletin No. 343), 39–48

Wolfinger, R, Tobias, R and Sall, J, 1994, Computing Gaussian likelihoods and their derivatives for general linear mixed models, SIAM Sci. Statist. Comput. (15), 1294–1310
