naginterfaces.library.correg.ssqmat¶

naginterfaces.library.correg.ssqmat(x, mean='M', wt=None)[source]¶

ssqmat calculates the sample means and sums of squares and cross-products, or sums of squares and cross-products of deviations from the mean, in a single pass for a set of data. The data may be weighted.

For full information please refer to the NAG Library document for g02bu

https://support.nag.com/numeric/nl/nagdoc_30.3/flhtml/g02/g02buf.html

Parameters

xfloat, array-like, shape $(n, m)$

$x [i - 1, j - 1]$ must contain the $i$ th observation on the $j$ th variable, for $j = 1, 2, \dots, m$ , for $i = 1, 2, \dots, n$ .

meanstr, length 1, optional

Indicates whether ssqmat is to calculate sums of squares and cross-products, or sums of squares and cross-products of deviations about the mean.

$m e a n ='M'$

The sums of squares and cross-products of deviations about the mean are calculated.

$m e a n ='Z'$

The sums of squares and cross-products are calculated.

wtNone or float, array-like, shape $(n)$ , optional

The optional weights of each observation. If weights are not provided then $w t$ must be set to None, otherwise $w t [i - 1]$ must contain the weight for the $i$ th observation.

Returns

swfloat

The sum of weights.

If $w t is N o n e$ , $s w$ contains the number of observations, $n$ .

wmeanfloat, ndarray, shape $(m)$

The sample means. $w m e a n [j - 1]$ contains the mean for the $j$ th variable.

cfloat, ndarray, shape $((m \times m + m) / 2)$

The cross-products.

If $m e a n ='M'$ , $c$ contains the upper triangular part of the matrix of (weighted) sums of squares and cross-products of deviations about the mean.

If $m e a n ='Z'$ , $c$ contains the upper triangular part of the matrix of (weighted) sums of squares and cross-products.

These are stored packed by columns, i.e., the cross-product between the $j$ th and $k$ th variable, $k \geq j$ , is stored in $c [k \times (k - 1) / 2 + j - 1]$ .

Raises

NagValueError

(errno $1$ )

On entry, $m = ⟨ v a l u e ⟩$ .

Constraint: $m \geq 1$ .

(errno $1$ )

On entry, $n = ⟨ v a l u e ⟩$ .

Constraint: $n \geq 1$ .

(errno $2$ )

On entry, $m e a n = ⟨ v a l u e ⟩$ .

Constraint: $m e a n ='M'$ or $'Z'$ .

(errno $3$ )

On entry, $weight = ⟨ v a l u e ⟩$ .

Constraint: $weight ='W'$ or $'U'$ .

(errno $4$ )

On entry, $w t [⟨ v a l u e ⟩] < 0.0$ .

Constraint: $w t [i - 1] \geq 0.0$ , for $i = 1, 2, \dots, n$ .

Notes

ssqmat is an adaptation of West’s WV2 algorithm; see West (1979). This function calculates the (optionally weighted) sample means and (optionally weighted) sums of squares and cross-products or sums of squares and cross-products of deviations from the (weighted) mean for a sample of $n$ observations on $m$ variables $X_{j}$ , for $j = 1, 2, \dots, m$ . The algorithm makes a single pass through the data.

For the first $i - 1$ observations let the mean of the $j$ th variable be ${¯ x}_{j} (i - 1)$ , the cross-product about the mean for the $j$ th and $k$ th variables be $c_{j k} (i - 1)$ and the sum of weights be $W_{i - 1}$ . These are updated by the $i$ th observation, $x_{i j}$ , for $j = 1, 2, \dots, m$ , with weight $w_{i}$ as follows:

\begin{matrix} \begin{matrix} W_{i} = W_{i - 1} + w_{i} {¯ x}_{j} (i) = {¯ x}_{j} (i - 1) + \frac{w_{i}}{W_{i}} (x_{j} - {¯ x}_{j} (i - 1)), j = 1, 2, \dots, m \end{matrix} \end{matrix}

and

c_{j k} (i) = c_{j k} (i - 1) + \frac{w_{i}}{W_{i}} (x_{j} - {¯ x}_{j} (i - 1)) (x_{k} - {¯ x}_{k} (i - 1)) W_{i - 1}, j = 1, 2, \dots, m and k = j, j + 1, \dots, m .

The algorithm is initialized by taking ${¯ x}_{j} (1) = x_{1 j}$ , the first observation, and $c_{i j} (1) = 0.0$ .

For the unweighted case $w_{i} = 1$ and $W_{i} = i$ for all $i$ .

Note that only the upper triangle of the matrix is calculated and returned packed by column.

References

Chan, T F, Golub, G H and Leveque, R J, 1982, Updating Formulae and a Pairwise Algorithm for Computing Sample Variances, Compstat, Physica-Verlag

West, D H D, 1979, Updating mean and variance estimates: An improved method, Comm. ACM (22), 532–555

NAG and Python

Return to Front

naginterfaces.library.correg.ssqmat¶