naginterfaces.library.mv.factor¶

naginterfaces.library.mv.factor(matrix, n, x, nvar, isx, nfac, iop, wt=None, io_manager=None)[source]¶

factor computes the maximum likelihood estimates of the parameters of a factor analysis model. Either the data matrix or a correlation/covariance matrix may be input. Factor loadings, communalities and residual correlations are returned.

For full information please refer to the NAG Library document for g03ca

https://support.nag.com/numeric/nl/nagdoc_30.3/flhtml/g03/g03caf.html

Parameters

matrixstr, length 1

Selects the type of matrix on which factor analysis is to be performed.

$m a t r i x ='D'$

The data matrix will be input in $x$ and factor analysis will be computed for the correlation matrix.

$m a t r i x ='S'$

The data matrix will be input in $x$ and factor analysis will be computed for the covariance matrix, i.e., the results are scaled as described in Further Comments.

$m a t r i x ='C'$

The correlation/variance-covariance matrix will be input in $x$ and factor analysis computed for this matrix.

See Further Comments.

nint

If $m a t r i x ='D'$ or $'S'$ the number of observations in the data array $x$ .

If $m a t r i x ='C'$ the (effective) number of observations used in computing the (possibly weighted) correlation/variance-covariance matrix input in $x$ .

xfloat, array-like, shape $(:, m)$

Note: the required extent for this argument in dimension 1 is determined as follows: if $m a t r i x in ('D','S')$ : $n$ ; if $m a t r i x ='C'$ : $m$ ; otherwise: $0$ .

The input matrix.

If $m a t r i x ='D'$ or $'S'$ , $x$ must contain the data matrix, i.e., $x [i - 1, j - 1]$ must contain the $i$ th observation for the $j$ th variable, for $j = 1, 2, \dots, m$ , for $i = 1, 2, \dots, n$ .

If $m a t r i x ='C'$ , $x$ must contain the correlation or variance-covariance matrix.

Only the upper triangular part is required.

nvarint

$p$ , the number of variables in the factor analysis.

isxint, array-like, shape $(m)$

$i s x [j - 1]$ indicates whether or not the $j$ th variable is included in the factor analysis. If $i s x [j - 1] \geq 1$ , the variable represented by the $j$ th column of $x$ is included in the analysis; otherwise it is excluded, for $j = 1, 2, \dots, m$ .

nfacint

$k$ , the number of factors.

iopint, array-like, shape $(5)$

Options for the optimization. There are four options to be set:

$iprint$	controls iteration monitoring;
	if $iprint \leq 0$ , there is no printing of information else if $iprint > 0$ , information is printed at every iprint iterations. The information printed consists of the value of $F (Ψ)$ at that iteration, the number of evaluations of $F (Ψ)$ , the current estimates of the communalities and an indication of whether or not they are at the boundary.
$maxfun$	the maximum number of function evaluations.
$acc$	the required accuracy for the estimates of $ψ_{i}$ .
$eps$	a lower bound for the values of $ψ$ , see Notes.

Let $ϵ = machine precision$ then if $i o p [0] = 0$ , the following default values are used:

$iprint = - 1$

$maxfun = 100 p$

$acc = 10 \sqrt{ϵ}$

$eps = ϵ$

If $i o p [0] \neq 0$ , then

$iprint = i o p [1]$

$maxfun = i o p [2]$

$acc = 10^{- l}$ where $l = i o p [3]$

$eps = 10^{- l}$ where $l = i o p [4]$

wtNone or float, array-like, shape $(:)$ , optional

Note: the required length for this argument is determined as follows: if $weight ='W' and m a t r i x in ('D','S')$ : $n$ ; otherwise: $1$ .

If $w t is not N o n e$ and $m a t r i x ='D'$ or $'S'$ , $w t$ must contain the weights to be used in the factor analysis. The effective number of observations in the analysis will then be the sum of weights. If $w t [i - 1] = 0.0$ , the $i$ th observation is not included in the analysis.

If $weight ='U'$ or $m a t r i x ='C'$ , $w t$ is not referenced and the effective number of observations is $n$ .

io_managerFileObjManager, optional

Manager for I/O in this routine.

Returns

efloat, ndarray, shape $(n v a r)$

The eigenvalues $θ_{i}$ , for $i = 1, 2, \dots, p$ .

statfloat, ndarray, shape $(4)$

The test statistics.

$s t a t [0]$

Contains the value $F (^Ψ)$ .

$s t a t [1]$

Contains the test statistic, $χ^{2}$ .

$s t a t [2]$

Contains the degrees of freedom associated with the test statistic.

$s t a t [3]$

Contains the significance level.

comfloat, ndarray, shape $(n v a r)$

The communalities.

psifloat, ndarray, shape $(n v a r)$

The estimates of $ψ_{i}$ , for $i = 1, 2, \dots, p$ .

resfloat, ndarray, shape $(n v a r \times (n v a r - 1) / 2)$

The residual correlations. The residual correlation for the $i$ th and $j$ th variables is stored in $r e s [(j - 1) (j - 2) / 2 + i - 1]$ , $i < j$ .

flfloat, ndarray, shape $(n v a r, n f a c)$

The factor loadings. $f l [i - 1, j - 1]$ contains $λ_{i j}$ , for $j = 1, 2, \dots, k$ , for $i = 1, 2, \dots, p$ .

Raises

NagValueError

(errno $1$ )

On entry, $i o p [0] \neq 1$ and $i o p [4] = ⟨ v a l u e ⟩$ .

Constraint: $1 \leq eps \leq machine precision$ .

(errno $1$ )

On entry, $i o p [0] \neq 1$ and $i o p [3] = ⟨ v a l u e ⟩$ .

Constraint: $1 \leq acc \leq machine precision$ .

(errno $1$ )

On entry, $i o p [0] \neq 1$ and $i o p [2] = ⟨ v a l u e ⟩$ .

Constraint: $maxfun \geq 1$ .

(errno $1$ )

On entry, $n f a c = ⟨ v a l u e ⟩$ and $n v a r = ⟨ v a l u e ⟩$ .

Constraint: $n f a c \leq n v a r$ .

(errno $1$ )

On entry, $n f a c = ⟨ v a l u e ⟩$ .

Constraint: $n f a c \geq 1$ .

(errno $1$ )

On entry, $m = ⟨ v a l u e ⟩$ and $n v a r = ⟨ v a l u e ⟩$ .

Constraint: $m \geq n v a r$ .

(errno $1$ )

On entry, $m a t r i x = ⟨ v a l u e ⟩$ .

Constraint: $m a t r i x ='D'$ , $'S'$ or $'C'$ .

(errno $1$ )

On entry, $weight = ⟨ v a l u e ⟩$ .

Constraint: when $m a t r i x ='D'$ or $'S'$ , $weight ='U'$ or $'W'$ .

(errno $1$ )

On entry, $n = ⟨ v a l u e ⟩$ and $n v a r = ⟨ v a l u e ⟩$ .

Constraint: $n > n v a r$ .

(errno $1$ )

On entry, $n v a r = ⟨ v a l u e ⟩$ .

Constraint: $n v a r > 1$ .

(errno $2$ )

On entry, $i = ⟨ v a l u e ⟩$ and $w t [i - 1] < 0.0$ .

Constraint: $w t [i - 1] \geq 0.0$ .

(errno $3$ )

The effective number of observations $\leq 1$ .

(errno $3$ )

The number of variables $\geq$ number of included observations.

(errno $3$ )

On entry, $n v a r = ⟨ v a l u e ⟩$ and $⟨ v a l u e ⟩$ values of $i s x > 0$

Constraint: exactly $n v a r$ elements of $i s x > 0$ .

(errno $4$ )

Two eigenvalues of $S^{*}$ are equal.

(errno $4$ )

On entry, the data matrix is not of full column rank or the input correlation/covariance matrix is not positive definite.

(errno $5$ )

The singular value decomposition has failed to converge.

(errno $6$ )

The estimation procedure has failed to converge in $⟨ v a l u e ⟩$ iterations.

Warns

NagAlgorithmicWarning

(errno $7$ ): The convergence is not certain but a lower point could not be found.

Notes

In the NAG Library the traditional C interface for this routine uses a different algorithmic base. Please contact NAG if you have any questions about compatibility.

Let $p$ variables, $x_{1}, x_{2}, \dots, x_{p}$ , with variance-covariance matrix $Σ$ be observed. The aim of factor analysis is to account for the covariances in these $p$ variables in terms of a smaller number, $k$ , of hypothetical variables, or factors, $f_{1}, f_{2}, \dots, f_{k}$ . These are assumed to be independent and to have unit variance. The relationship between the observed variables and the factors is given by the model:

x_{i} = k \sum j = 1 λ_{i j} f_{j} + e_{i}, i = 1, 2, \dots, p

where $λ_{i j}$ , for $j = 1, 2, \dots, k$ , for $i = 1, 2, \dots, p$ , are the factor loadings and $e_{i}$ , for $i = 1, 2, \dots, p$ , are independent random variables with variances $ψ_{i}$ , for $i = 1, 2, \dots, p$ . The $ψ_{i}$ represent the unique component of the variation of each observed variable. The proportion of variation for each variable accounted for by the factors is known as the communality. For this function it is assumed that both the $k$ factors and the $e_{i}$ ’s follow independent Normal distributions.

The model for the variance-covariance matrix, $Σ$ , can be written as:

Σ = Λ Λ^{T} + Ψ

where $Λ$ is the matrix of the factor loadings, $λ_{i j}$ , and $Ψ$ is a diagonal matrix of unique variances, $ψ_{i}$ , for $i = 1, 2, \dots, p$ .

The estimation of the parameters of the model, $Λ$ and $Ψ$ , by maximum likelihood is described by Lawley and Maxwell (1971). The log-likelihood is:

- \frac{1}{2} (n - 1) log (| Σ |) - \frac{1}{2} (n - 1) t r a c e (S, Σ^{- 1}) + constant,

where $n$ is the number of observations, $S$ is the sample variance-covariance matrix or, if weights are used, $S$ is the weighted sample variance-covariance matrix and $n$ is the effective number of observations, that is, the sum of the weights. The constant is independent of the parameters of the model. A two stage maximization is employed. It makes use of the function $F (Ψ)$ , which is, up to a constant, $- 2 / (n - 1)$ times the log-likelihood maximized over $Λ$ . This is then minimized with respect to $Ψ$ to give the estimates, $^Ψ$ , of $Ψ$ . The function $F (Ψ)$ can be written as:

F (Ψ) = p \sum j = k + 1 (θ_{j} - log (θ_{j})) - (p - k)

where values $θ_{j}$ , for $j = 1, 2, \dots, p$ are the eigenvalues of the matrix:

S^{*} = Ψ^{- 1 / 2} S Ψ^{- 1 / 2} .

The estimates $^Λ$ , of $Λ$ , are then given by scaling the eigenvectors of $S^{*}$ , which are denoted by $V$ :

^Λ = Ψ^{1 / 2} V {(Θ - I)}^{1 / 2} .

where $Θ$ is the diagonal matrix with elements $θ_{i}$ , and $I$ is the identity matrix.

The minimization of $F (Ψ)$ is performed using opt.bounds_mod_deriv2_comp which uses a modified Newton algorithm. The computation of the Hessian matrix is described by Clark (1970). However, instead of using the eigenvalue decomposition of the matrix $S^{*}$ as described above, the singular value decomposition of the matrix $R Ψ^{- 1 / 2}$ is used, where $R$ is obtained either from the $Q R$ decomposition of the (scaled) mean centred data matrix or from the Cholesky decomposition of the correlation/covariance matrix. The function opt.bounds_mod_deriv2_comp ensures that the values of $ψ_{i}$ are greater than a given small positive quantity, $δ$ , so that the communality is always less than $1$ . This avoids the so called Heywood cases.

In addition to the values of $Λ$ , $Ψ$ and the communalities, factor returns the residual correlations, i.e., the off-diagonal elements of $C - (Λ Λ^{T} + Ψ)$ where $C$ is the sample correlation matrix. factor also returns the test statistic:

χ^{2} = [n - 1 - (2 p + 5) / 6 - 2 k / 3] F (^Ψ)

which can be used to test the goodness-of-fit of the model (1), see Lawley and Maxwell (1971) and Morrison (1967).

References

Clark, M R B, 1970, A rapidly convergent method for maximum likelihood factor analysis, British J. Math. Statist. Psych.

Hammarling, S, 1985, The singular value decomposition in multivariate statistics, SIGNUM Newsl. (20(3)), 2–25

Lawley, D N and Maxwell, A E, 1971, Factor Analysis as a Statistical Method, (2nd Edition), Butterworths

Morrison, D F, 1967, Multivariate Statistical Methods, McGraw–Hill

NAG and Python

Return to Front

naginterfaces.library.mv.factor¶