g13ae:: Time Series Analysis (NAG Toolbox)

The time series

x_{1}, x_{2}, \dots, x_{n}

supplied to nag_tsa_uni_arima_estim (g13ae) is assumed to follow a seasonal autoregressive integrated moving average (ARIMA) model defined as follows:

\nabla^{d} \nabla_{s}^{D} x_{t} - c = w_{t},

where

\nabla^{d} \nabla_{s}^{D} x_{t}

is the result of applying non-seasonal differencing of order

d

and seasonal differencing of seasonality

s

and order

D

to the series

x_{t}

, as outlined in the description of nag_tsa_uni_diff (g13aa). The differenced series is then of length

N = n - d^{'}

, where

d^{'} = d + (D \times s)

is the generalized order of differencing. The scalar

c

is the expected value of the differenced series, and the series

w_{1}, w_{2}, \dots, w_{N}

follows a zero-mean stationary autoregressive moving average (ARMA) model defined by a pair of recurrence equations. These express

w_{t}

in terms of an uncorrelated series

a_{t}

, via an intermediate series

e_{t}

. The first equation describes the seasonal structure:

w_{t} = Φ_{1} w_{t - s} + Φ_{2} w_{t - 2 \times s} + \dots + Φ_{P} w_{t - P \times s} + e_{t} - Θ_{1} e_{t - s} - Θ_{2} e_{t - 2 \times s} - \dots - Θ_{Q} e_{t - Q \times s} .

The second equation describes the non-seasonal structure. If the model is purely non-seasonal the first equation is redundant and

e_{t}

above is equated with

w_{t}

e_{t} = ϕ_{1} e_{t - 1} + ϕ_{2} e_{t - 2} + \dots + ϕ_{p} e_{t - p} + a_{t} - θ_{1} a_{t - 1} - θ_{2} a_{t - 2} - \dots - θ_{q} a_{t - q} .

Estimates of the model parameters defined by

\begin{array}{l} ϕ_{1}, ϕ_{2}, \dots, ϕ_{p}, θ_{1}, θ_{2}, \dots, θ_{q}, \\ Φ_{1}, Φ_{2}, \dots, Φ_{P}, Θ_{1}, Θ_{2}, \dots, Θ_{Q} \end{array}

and (optionally)

c

are obtained by minimizing a quadratic form in the vector

w = {(w_{1}, w_{2}, \dots, w_{N})}^{'}

This is

Q F = w^{'} V^{- 1} w

, where

V

is the covariance matrix of

w

, and is a function of the model parameters. This matrix is not explicitly evaluated, since

Q F

may be expressed as a ‘sum of squares’ function. When moving average parameters

θ_{i}

Θ_{i}

are present, so that the generalized moving average order

q^{'} = q + s \times Q

is positive, backforecasts

w_{1 - q^{'}}, w_{2 - q^{'}}, \dots, w_{0}

are introduced as nuisance parameters. The ‘sum of squares’ function may then be written as

S (p m) = \sum_{t = 1 - q^{'}}^{N} a_{t}^{2} - \sum_{t = 1 - q^{'} - p^{'}}^{- q^{'}} b_{t}^{2},

where

p m

is a combined vector of parameters, consisting of the backforecasts followed by the ARMA model parameters.

The equations defining

a_{t}

and

b_{t}

are precisely:

$e_{t} = w_{t} - Φ_{1} w_{t - s} - Φ_{2} w_{t - 2 \times s} - \dots - Φ_{P} w_{t - P \times s} + Θ_{1} e_{t - s} + Θ_{2} e_{t - 2 \times s} + \dots + Θ_{Q} e_{t - Q \times s}$ ,
for $t = 1 - q^{'}, 2 - q^{'}, \dots, n$ .
$a_{t} = e_{t} - ϕ_{1} e_{t - 1} - ϕ_{2} e_{t - 2} - \dots - ϕ_{p} e_{t - p} + θ_{1} a_{t - 1} + θ_{2} a_{t - 2} + \dots + θ_{q} a_{t - q}$ ,
for $t = 1 - q^{'}, 2 - q^{'}, \dots, n$ .
$f_{t} = w_{t} - Φ_{1} w_{t + s} - Φ_{2} w_{t + 2 \times s} - \dots - Φ_{P} w_{t + P \times s} + Θ_{1} f_{t - s} + Θ_{2} f_{t - 2 \times s} + \dots + Θ_{Q} f_{t - Q \times s}$ ,
for $t = (1 - q^{'} - s \times P), (2 - q^{'} - s \times P), \dots, (- q^{'} + P)$
$b_{t} = f_{t} - ϕ_{1} f_{t + 1} - ϕ_{2} f_{t + 2} - \dots - ϕ_{p} f_{t + p} + θ_{1} b_{t - 1} + θ_{2} b_{t - 2} + \dots + θ_{q} b_{t - q}$ ,
for $t = (1 - q^{'} - p^{'}), (2 - q^{'} - p^{'}), \dots, (- q^{'})$ .

For convenient application in forecasting, the following quantities constitute the ‘state set’, which contains the minimum amount of time series information needed to construct forecasts:

(i)	the differenced series $w_{t}$ , for $(N - s \times P) < t \leq N$ ,
(ii)	the $d^{'}$ values required to reconstitute the original series $x_{t}$ from the differenced series $w_{t}$ ,
(iii)	the intermediate series $e_{t}$ , for $(N - \max (p, Q \times s)) < t \leq N$ ,
(iv)	the residual series $a_{t}$ , for $(N - q) < t \leq N$ .

References

Parameters

Compulsory Input Parameters

Optional Input Parameters

Output Parameters

Error Indicators and Warnings

Cases prefixed with W are classified as warnings and do not generate an error of type NAG:error_n. See nag_issue_warnings.

Accuracy

Further Comments

Example

function g13ae_example


fprintf('g13ae example results\n\n');

% Data
x = [-217; -177; -166; -136; -110;  -95;  -64;  -37;  -14;  -25;
      -51;  -62;  -73;  -88; -113; -120;  -83;  -33;  -19;   21;
       17;   44;   44;   78;   88;  122;  126;  114;   85;   64];
nx = numel(x);

% Orders
mr = [int64(1);1;2;0;0;0;0];

% parameters and sizes
npar = mr(1) + mr(3) + mr(4) + mr(6);
par = zeros(npar,1);
iex = mr(3) + (mr(6)*mr(7)) + nx;
igh = mr(3) + (mr(6)*mr(7)) + npar + 1;
ist = (mr(4)+mr(5))*mr(7) + mr(2) + mr(3) + max(mr(1),(mr(6)*mr(7)));

% Control parameters
c = 0;
kpiv = int64(0);
zsp = [0.001; 10; 1000; 0.0001];
kzsp = int64(1);

% Fit ARIMA model
[par, c, icount, ex, exr, al, s, g, sd, h, st, nst, itc, zsp, isf, ifail] = ...
  g13ae( ...
         mr, par, c, x, iex, igh, ist, @piv, kpiv, zsp, kzsp);

% Display results
nex = icount(4);
ndf = icount(5);
ngh = icount(6);
fprintf('Convergence was achieved after %4d cycles\n\n',itc);
fprintf('Final values of par array and the constant c are as follows\n');
fprintf('%10.4f', par, c);
fprintf('\n\nResidual sum of squares is %10.3f with %4d %s\n\n', ...
        s, ndf, 'degrees of freedom');
fprintf('The final values of ZSP were\n');
fprintf('%15.4e', zsp);
fprintf('\n\nThe number of parameters estimated was %4d\n', ngh);
fprintf('( backward forecasts, par and c, in that order )\n\n');
fprintf('The corresponding G array holds\n');
fprintf('%9.4f', g);
if  itc>0
  fprintf('\n\nThe corresponding SD array holds\n');
  fprintf('%9.4f', sd);
  fprintf('\n\n')
  [ifail] = x04ca( ...
                   'General', ' ', h(1:ngh,:), 'Corresponding H matrix');

  fprintf('\nHolds second derivatives in the upper half ');
  fprintf('(including the main diagonal)\n');
  fprintf('and correlation coefficients in the lower triangle\n');
end
fprintf('\n%s%5d%s\n','EX, EXR, and AL each hold', nex, ...
        ' values made up of');
fprintf('%5d%s\n', icount(1), ' back forecast(s),');
fprintf('%5d%s\n', icount(2), ' differenced values, and');
fprintf('%5d%s\n\n', icount(3), ' element(s) of reconstituted information');
fprintf('  Ex\n');
for j = 1:5:nex
  fprintf('%11.5f', ex(j:min(j+4,nex)));
  fprintf('\n');
end
fprintf('\n  Exr\n');
for j = 1:5:nex
  fprintf('%11.5f', exr(j:min(j+4,nex)));
  fprintf('\n');
end
fprintf('\n  Al\n');
for j = 1:5:nex
  fprintf('%11.5f', al(j:min(j+4,nex)));
  fprintf('\n');
end
fprintf('\nThe state set consists of %4d values\n',nst);
fprintf('%11.5f', st);
fprintf('\n');



function [] = piv(mr, par, npar, c, kfc, icount, s, g, h, ih, igh, itc, zsp)

  fprintf('Iteration %d  residual sum f squares = %16.4', itc, s);

g13ae example results

Convergence was achieved after   16 cycles

Final values of par array and the constant c are as follows
   -0.0547   -0.5568   -0.6636    9.9807

Residual sum of squares is   9397.924 with   25 degrees of freedom

The final values of ZSP were
     1.0000e-15     1.0000e+01     1.0000e+03     1.0000e-04

The number of parameters estimated was    6
( backward forecasts, par and c, in that order )

The corresponding G array holds
  -0.1512  -0.2343  -6.4097  13.5617 -72.6232  -0.1642

The corresponding SD array holds
  14.8379  15.1887   0.3507   0.2709   0.1695   7.3893

 Corresponding H matrix
               1            2            3            4            5
 1    1.9416E+00  -6.1794E-01   2.4409E-01   1.7942E+00  -8.3579E-01
 2    3.4176E-01   1.9446E+00  -1.6544E-01  -2.5084E-01   1.7952E+00
 3   -1.0544E-02   5.5643E-03   9.0416E+03  -9.6825E+03   5.4626E+02
 4   -1.2113E-02   5.6011E-03   8.1322E-01   1.7031E+04  -5.6761E+03
 5   -2.3216E-03  -1.1495E-03   3.6741E-01   4.7942E-01   1.7028E+04
 6   -1.4580E-01  -2.6004E-01  -4.0877E-02  -4.8389E-02  -3.7442E-02

               6
 1    2.4106E-01
 2    8.5926E-01
 3    8.1847E-01
 4    6.9417E+00
 5    6.3308E+00
 6    7.4339E+00

Holds second derivatives in the upper half (including the main diagonal)
and correlation coefficients in the lower triangle

EX, EXR, and AL each hold   32 values made up of
    2 back forecast(s),
   29 differenced values, and
    1 element(s) of reconstituted information

  Ex
   19.52500    5.87533   40.00000   11.00000   30.00000
   26.00000   15.00000   31.00000   27.00000   23.00000
  -11.00000  -26.00000  -11.00000  -11.00000  -15.00000
  -25.00000   -7.00000   37.00000   50.00000   14.00000
   40.00000   -4.00000   27.00000    0.00000   34.00000
   10.00000   34.00000    4.00000  -12.00000  -29.00000
  -21.00000   64.00000

  Exr
   19.52500   -3.92787   19.57110   -5.62907   10.22209
   15.15821   -9.32757   16.42850   15.21154   -5.42106
  -27.34437  -18.30612    5.38901  -12.98124  -22.47672
  -15.21833    4.49436   33.68668   19.75860  -27.14696
   32.24262  -12.27651    1.69412   -1.84650   23.37721
  -10.45763   14.33018   -5.70614  -28.64010  -20.45020
   -2.72147    0.00000

  Al
   19.52500    5.87533   30.01926    1.01926   20.01926
   16.01926    5.01926   21.01926   17.01926   13.01926
  -20.98074  -35.98074  -20.98074  -20.98074  -24.98074
  -34.98074  -16.98074   27.01926   40.01926    4.01926
   30.01926  -13.98074   17.01926   -9.98074   24.01926
    0.01926   24.01926   -5.98074  -21.98074  -38.98074
  -30.98074    0.00000

The state set consists of    4 values
   64.00000  -30.98074  -20.45020   -2.72147

(a)	the latest values of an approximation to the second derivative of $S$ with respect to each of the $(q + Q \times s + npar + kfc)$ parameters being estimated (backforecasts, par parameters, and where relevant the constant – in that order), and
(b)	the correlation coefficients relating to each pair of these parameters.

$- 2$	On entry parameters of this type have initial estimates which do not satisfy the stationarity or invertibility test conditions.
$- 1$	The search procedure has failed to converge because the latest set of parameter estimates of this type is invalid.
$0$	No parameter of this type is in the model.
$1$	Valid final estimates for parameters of this type have been obtained.

On entry,	$npar \neq p + q + P + Q$ ,
or	the orders vector mr is invalid (check it against the constraints in Arguments),
or	$kfc \neq 0$ or $1$ .

On entry,	one or more of the user-supplied criteria for controlling the iterative process are invalid,
or	$nit < 0$ ,
or	if $kzsp = 1$ , $zsp (1) \leq 0.0$ ;
or	if $kzsp = 1$ , $zsp (2) \leq 1.0$ ;
or	if $kzsp = 1$ , $zsp (3) < 1.0$ ;
or	if $kzsp = 1$ , $zsp (4) < 0.0$ ;
or	if $kzsp = 1$ , $zsp (4) \geq 1.0$ .

On entry,	$iex < q + (Q \times s) + nx$ ,
or	$igh < q + (Q \times s) + npar + kfc$ ,
or	$ldh \leq q + (Q \times s) + npar + kfc$ .

NAG Toolbox: nag_tsa_uni_arima_estim (g13ae)

▸▿ Contents

Purpose

Syntax

Description