PDF version (NAG web site
, 64-bit version, 64-bit version)
NAG Toolbox: nag_tsa_multi_inputmod_estim (g13be)
Purpose
nag_tsa_multi_inputmod_estim (g13be) fits a multi-input model relating one output series to the input series with a choice of three different estimation criteria: nonlinear least squares, exact likelihood and marginal likelihood. When no input series are present, nag_tsa_multi_inputmod_estim (g13be) fits a univariate ARIMA model.
Syntax
[
para,
xxy,
zsp,
itc,
sd,
cm,
s,
d,
ndf,
res,
sttf,
nsttf,
ifail] = g13be(
mr,
mt,
para,
xxy, 'nser',
nser, 'npara',
npara, 'kfc',
kfc, 'nxxy',
nxxy, 'kef',
kef, 'nit',
nit, 'zsp',
zsp, 'iwa',
iwa, 'imwa',
imwa, 'kpriv',
kpriv)
[
para,
xxy,
zsp,
itc,
sd,
cm,
s,
d,
ndf,
res,
sttf,
nsttf,
ifail] = nag_tsa_multi_inputmod_estim(
mr,
mt,
para,
xxy, 'nser',
nser, 'npara',
npara, 'kfc',
kfc, 'nxxy',
nxxy, 'kef',
kef, 'nit',
nit, 'zsp',
zsp, 'iwa',
iwa, 'imwa',
imwa, 'kpriv',
kpriv)
Note: the interface to this routine has changed since earlier releases of the toolbox:
At Mark 23: |
isttf, kzef and kzsp were removed from the interface; iwa, imwa, zsp, kfc, nit, kef and kpriv were made optional |
At Mark 22: |
nxxy was made optional |
Description
The Multi-input Model
The output series , for , is assumed to be the sum of (unobserved) components which are due respectively to the inputs , for .
Thus where is the error, or output noise component.
A typical component
may be either
(a) |
a simple regression component, (here is called a simple input), or |
(b) |
a transfer function model component which allows for the effect of lagged values of the variable, related to by
|
The noise
is assumed to follow a (possibly seasonal) ARIMA model, i.e., may be represented in terms of an uncorrelated series,
, by the hierarchy of equations
(i) |
|
(ii) |
|
(iii) |
|
as outlined in
Description in
nag_tsa_uni_arima_estim (g13ae).
Note: the orders appearing in each of the transfer function models and the ARIMA model are not necessarily the same; is the result of applying non-seasonal differencing of order and seasonal differencing of seasonality and order to the series : the differenced series is then of length ; the constant term parameter may optionally be held fixed at its initial value (usually, but not necessarily zero) rather than being estimated.
For the purpose of defining an estimation criterion it is assumed that the series is a sequence of independent Normal variates having mean and variance . An allowance has to be made for the effects of unobserved data prior to the observation period. For the noise component an allowance is always made using a form of backforecasting.
For each transfer function input, you have to decide what values are to be assumed for the pre-period terms and which are in theory necessary to re-create the component series , during the estimation procedure.
The first choice is to assume that all these values are zero. In this case, in order to avoid undesirable transient distortion of the early values , you are advised first to correct the input series by subtracting from all the terms a suitable constant to make the early values , close to zero. The series mean is one possibility, but for a series with strong trend the constant might be simply .
The second choice is to treat the unknown pre-period terms as nuisance parameters and estimate them along with the other parameters. This choice should be used with caution. For example, if and , it is equivalent to fitting to the data a decaying geometric curve of the form , for , along with the other inputs, this being the form of the transient. If the output contains a strong trend of this form, which is not otherwise represented in the model, it will have a tendency to influence the estimate of away from the value appropriate to the transfer function model.
In most applications the first choice should be adequate, with the option possibly being used as a refinement at the end of the modelling process. The number of nuisance parameters is then , with a corresponding loss of degrees of freedom in the residuals. If you align the input with the output by using in its place the shifted series , then setting in the transfer function model, there is some improvement in efficiency. On some occasions when the model contains two or more inputs, each with estimation of pre-period nuisance parameters, these parameters may be co-linear and lead to failure of the function. The option must then be ‘switched off’ for one or more inputs.
The Estimation Criterion
This is a measure of how well a proposed set of parameters in the transfer function and noise ARIMA models matches the data. The estimation function searches for parameter values which minimize this criterion. For a proposed set of parameter values it is derived by calculating
(i) |
the components as the responses to the input series using the equations (a) or (b) above, |
(ii) |
the discrepancy between the output and the sum of these components, as the noise
|
(iii) |
the residual series from by reversing the recursive equations (i), (ii) and (iii) above. |
This last step again requires treatment of the effect of unknown pre-period values of
and other terms in the equations regenerating
.
This is identical to the treatment given in
Description in
nag_tsa_uni_arima_estim (g13ae), and leads to a criterion which is a sum of squares function
, of the residuals
.
It may be shown that the finite algorithm presented there is equivalent to taking the infinite set of past values
, as (linear) nuisance parameters.
There is no loss of degrees of freedom however, because the sum of squares function
may be expressed as including the corresponding set of past residuals; see page 273 of
Box and Jenkins (1976), who prove that
The function is the first of the three possible criteria, and is quite adequate for moderate to long series with no seasonal parameters. The second is the exact likelihood criterion which considers the past set not as simple nuisance parameters, but as unobserved random variables with known distribution. Calculation of the likelihood of the observed set requires theoretical integration over the range of the past set. Fortunately this yields a criterion of the form (whose minimization is equivalent to maximizing the exact likelihood of the data), where is exactly as before, and the multiplier is a function calculated from the ARIMA model parameters. The value of is always , and tends to for any fixed parameter set as the sample size tends to . There is a moderate computational overhead in using this option, but its use avoids appreciable bias in the ARIMA model parameters and yields a better conditioned estimation problem.
The third criterion of marginal likelihood treats the coefficients of the simple inputs in a manner analogous to that given to the past set . These coefficients, together with the constant term used to represent the mean of , are in effect treated as random variables with highly dispersed distributions. This leads to the criterion again, but with a different value of which now depends on the simple input series values . In the presence of a moderate to large number of simple inputs, the marginal likelihood criterion can counteract bias in the ARIMA model parameters which is caused by estimation of the simple inputs. This is particularly important in relatively short series.
nag_tsa_multi_inputmod_estim (g13be) can be used with no input series present, to estimate a univariate ARIMA model for the output alone. The marginal likelihood criterion is then distinct from exact likelihood only if a constant term is being estimated in the model, because this is treated as an implicit simple input.
The Estimation Procedure
This is the minimization of the estimation criterion or objective function
(for deviance). The function uses an extension of the algorithm of
Marquardt (1963). The step size in the minimization is inversely related to a parameter
, which is increased or decreased by a factor
at successive iterations, depending on the progress of the minimization. Convergence is deemed to have occurred if the fractional reduction of
in successive iterations is less than a value
, while
.
Certain model parameters (in fact all excluding the s) are subject to stability constraints which are checked throughout to within a specified tolerance multiple of machine accuracy. Using the least squares criterion, the minimization may halt prematurely when some parameters ‘stick’ at a constraint boundary. This can happen particularly with short seasonal series (with a small number of whole seasons). It will not happen using the exact likelihood criterion, although convergence to a point on the boundary may sometimes be rather slow, because the criterion function may be very flat in such a region. There is also a smaller risk of a premature halt at a constraint boundary when marginal likelihood is used.
A positive, or zero number of iterations can be specified. In either case, the value of the objective function at iteration zero is presented at the initial parameter values, except for estimation of any pre-period terms for the input series, backforecasts for the noise series, and the coefficients of any simple inputs, and the constant term (unless this is held fixed).
At any later iteration, the value of is computed after re-estimation of the backforecasts to their optimal values, corresponding to the model parameters presented at that iteration. This is not true for any pre-period terms for the input series which, although they are updated from the previous iteration, may not be precisely optimal for the parameter values presented, unless convergence of those parameters has occurred. However, in the case of marginal likelihood being specified, the coefficients of the simple inputs and the constant term are also re-estimated together with the backforecasts at each iteration, to values which are optimal for the other parameter values presented.
Further Results
The residual variance is taken as , where (total number of parameters estimated), is the residual degrees of freedom.
The pre-period nuisance parameters for the input series are included in the reduction of , as is the constant if it is estimated.
The covariance matrix of the vector of model parameter estimates is given by
where
is the linearized least squares matrix taken from the final iteration of the algorithm of Marquardt. From this expression are derived the vector of standard deviations, and the correlation matrix of parameter estimates. These are approximations which are only valid asymptotically, and must be treated with great caution when the parameter estimates are close to their constraint boundaries.
The residual series is available upon completion of the iterations over the range corresponding to the differenced noise series .
Because of the algorithm used for backforecasting, these are only true residuals for , provided this is positive. Estimation of pre-period terms for the inputs will also tend to reduce the magnitude of the early residuals, sometimes severely.
The model component series and may optionally be returned in place of the supplied series values, in order to assess the effects of the various inputs on the output.
Forecasting Information
For the purpose of constructing forecasts of the output series at future time points
using
nag_tsa_multi_inputmod_forecast_state (g13bh), it is not necessary to use the whole set of observations
and
, for
. It is sufficient to retain a limited set of quantities constituting the ‘state set’ as follows: for each series which appears with lagged subscripts in equations
(a),
(b),
(i),
(ii) and
(iii) above, include the values at times
for
up to the maximum lag associated with that series in the equations. Note that
(i) implicitly includes past values of
and intermediate differences of
such as
.
If later observations of the series become available, it is possible to update the state set (without re-estimating the model) using
nag_tsa_multi_inputmod_update (g13bg). If time series data is supplied with a previously estimated model, it is possible to construct the state set (and forecasts) using
nag_tsa_multi_inputmod_forecast (g13bj).
References
Box G E P and Jenkins G M (1976) Time Series Analysis: Forecasting and Control (Revised Edition) Holden–Day
Marquardt D W (1963) An algorithm for least squares estimation of nonlinear parameters J. Soc. Indust. Appl. Math. 11 431
Parameters
Compulsory Input Parameters
- 1:
– int64int32nag_int array
-
The orders vector
of the ARIMA model for the output noise component.
, , and refer respectively to the number of autoregressive , moving average , seasonal autoregressive and seasonal moving average parameters.
, and refer respectively to the order of non-seasonal differencing, the order of seasonal differencing and the seasonal period.
Constraints:
- , , , , , , ;
- ;
- ;
- if , ;
- if , ;
- ;
- .
- 2:
– int64int32nag_int array
-
The transfer function model orders
,
and
of each of the input series. The order arguments for input series
are held in column
. Row
holds the value
, row 2 holds the value
and row 3 holds the value
. For a simple input,
.
Row 4 holds the value , where for a simple input, for a transfer function input for which no allowance is to be made for pre-observation period effects, and for a transfer function input for which pre-observation period effects will be treated by estimation of appropriate nuisance parameters.
When , any nonzero contents of rows 1, 2, and 3 of column are ignored.
Constraint:
, for .
- 3:
– double array
-
Initial values of the multi-input model parameters. These are in order, firstly the ARIMA model parameters:
values of
parameters,
values of
parameters,
values of
parameters and
values of
parameters. These are followed by initial values of the transfer function model parameters
,
for the first of any input series and similarly for each subsequent input series. The final component of
para is the initial value of the constant
, whether it is fixed or is to be estimated.
- 4:
– double array
-
ldxxy, the first dimension of the array, must satisfy the constraint
.
The columns of
xxy must contain the
nxxy original, undifferenced values of each of the input series and the output series
in that order.
Optional Input Parameters
- 1:
– int64int32nag_int scalar
-
Default:
the second dimension of the arrays
mt,
xxy. (An error is raised if these dimensions are not equal.)
The total number of input and output series. There may be any number of input series (including none), but always one output series.
Constraints:
- ;
- if there are no parameters in the model (that is, and ), .
- 2:
– int64int32nag_int scalar
-
Default:
the dimension of the array
para.
The exact number of , and parameters.
Constraint:
, the summation being over all the
supplied in
mt.
must be included, whether fixed or estimated.
- 3:
– int64int32nag_int scalar
Default:
Must be set to if the constant is to remain fixed at its initial value, and if it is to be estimated.
Constraint:
or .
- 4:
– int64int32nag_int scalar
-
Default:
the first dimension of the array
xxy.
The (common) length of the original, undifferenced input and output time series.
- 5:
– int64int32nag_int scalar
Default:
Indicates the likelihood option.
- Gives least squares.
- Gives exact likelihood.
- Gives marginal likelihood.
Constraint:
, or .
- 6:
– int64int32nag_int scalar
Default:
The maximum required number of iterations.
- No change is made to any of the model parameters in array para except that the constant (if ) and any relating to simple input series are estimated. (Apart from these, estimates are always derived for the nuisance parameters relating to any backforecasts and any pre-observation period effects for transfer function inputs.)
Constraint:
.
- 7:
– double array
-
If
, then
zsp must contain the four values used to control the strategy of the search procedure.
- Contains , the value used to constrain the magnitude of the search procedure steps.
- Contains , the multiplier which regulates the value of .
- Contains , the value of the stationarity and invertibility test tolerance factor.
- Contains , the value of the convergence criterion.
If
before entry, default values of
zsp are supplied by the function. These are
,
,
and
, respectively.
Constraint:
if , , , , .
- 8:
– int64int32nag_int scalar
Default:
The dimension of the array
wa.
It is not practical to outline a method for deriving the exact minimum permissible value of
iwa, but the following gives a reasonably good conservative approximation. (It should be noted that if
iwa is too small (but not grossly so) then the exact minimum is printed if
.)
Let and where the orders of the output noise model are , , , , , , .
Let there be input series, where .
Let
where the transfer function model orders for input
are given by
,
,
,
.
Let .
Let and .
Finally, let , and then increment by every time any of the following conditions is satisfied. (The last six conditions should be applied separately to each input series, so that, for example, if we have two input series and if and then is incremented by .)
Then .
- 9:
– int64int32nag_int scalar
Default:
The dimension of the array
mwa.
Constraint:
, where the derivation of
is shown under
iwa.
If
imwa is too small then if
it is printed.
- 10:
– int64int32nag_int scalar
Default:
Must not be set to
, if it is required to monitor the course of the optimization or to print out the requisite minimum values of
iwa or
imwa in the event of an error of the type
or
. The course of the optimization is monitored by printing out at each iteration the iteration count (
itc), the residual sum of squares (
s), the objective function (
d) and a description and value for each of the parameters in the
para array. The descriptions are PHI for
, THETA for
, SPHI for
, STHETA for
, OMEGA/SI for
in a simple input, OMEGA for
in a transfer function input, DELTA for
and CONSTANT for
. In addition SERIES 1, SERIES 2, etc. indicate the input series relevant to the OMEGA and DELTA parameters.
kpriv must be set to
if the print-out of the above information is not required.
Output Parameters
- 1:
– double array
-
The latest values of the estimates of these parameters.
- 2:
– double array
-
the columns of
xxy hold the corresponding values of the input component series
in place of
and the output noise component
in place of
, in that order.
- 3:
– double array
-
Contains the values, default or otherwise, used by the function.
- 4:
– int64int32nag_int scalar
-
The number of iterations carried out.
- Indicates that the only estimates obtained up to this point have been for the nuisance parameters relating to backforecasts, unless the marginal likelihood option is used, in which case estimates have also been obtained for simple input coefficients and for the constant (if ). This value of itc usually indicates a failure in a consequent step of estimating transfer function input pre-observation period nuisance parameters.
- Indicates that estimates have been obtained up to this point for the constant (if ), for simple input coefficients and for the nuisance parameters relating to the backforecasts and to transfer function input pre-observation period effects.
- 5:
– double array
-
The
npara values of the standard deviations corresponding to each of the parameters in
para. When the constant is fixed its standard deviation is returned as zero. When the values of
para are valid, the values of
sd are usually also valid. However, if an exit value of
,
or
, then the contents of
sd will be indeterminate.
- 6:
– double array
-
The first
npara rows and columns of
cm contain the correlation coefficients relating to each pair of parameters in
para. All coefficients relating to the constant will be zero if the constant is fixed. The contents of
cm will be indeterminate under the same conditions as
sd.
- 7:
– double scalar
-
The residual sum of squares, , at the latest set of valid parameter estimates.
- 8:
– double scalar
-
The objective function, , at the latest set of valid parameter estimates.
- 9:
– int64int32nag_int scalar
-
The number of degrees of freedom associated with .
- 10:
– double array
-
The values of the residuals relating to the differenced values of the output series. The remainder of the first
nxxy terms in the array will be zero.
- 11:
– double array
-
The
nsttf values of the state set array.
- 12:
– int64int32nag_int scalar
-
The number of values in the state set array
sttf.
- 13:
– int64int32nag_int scalar
unless the function detects an error (see
Error Indicators and Warnings).
Error Indicators and Warnings
Note: nag_tsa_multi_inputmod_estim (g13be) may return useful information for one or more of the following detected errors or warnings.
Errors or warnings detected by the function:
-
-
On entry, | , |
or | , |
or | , |
or | , |
or | , |
or | , |
or | , |
or | , |
or | and there are no parameters in the model ( and ). |
-
-
On entry, there is inconsistency between
npara and
kfc on the one hand and the orders in arrays
mr and
mt on the other, or one of the
, stored in
,
or
.
-
-
On entry or during execution, one or more sets of parameters do not satisfy the stationarity or invertibility test conditions.
-
-
On entry, | when , , |
or | , |
or | , |
or | , |
or | . |
-
-
On entry,
iwa is too small by a considerable margin. No information is supplied about the requisite minimum size.
-
-
On entry,
iwa is too small, but the requisite minimum size is returned in
, which is printed if
.
-
-
On entry,
imwa is too small, but the requisite minimum size is returned in
, which is printed if
.
-
-
This indicates a failure in
nag_linsys_real_posdef_solve_1rhs (f04as) which is used to solve the equations giving the latest estimates of the parameters.
-
-
This indicates a failure in the inversion of the second derivative matrix. This is needed in the calculation of the correlation matrix and the standard deviations of the parameter estimates.
-
-
On entry or during execution, one or more sets of the ARIMA (, , or ) parameters do not satisfy the stationarity or invertibility test conditions.
-
-
On entry,
isttf is too small. The state set information will not be produced and if
array
xxy will remain unchanged. All other arguments will be produced correctly.
-
-
The function has failed to converge after
nit iterations. If steady decreases in the objective function,
, were monitored up to the point where this exit occurred, then the exit probably occurred because
nit was set too small, so the calculations should be restarted from the final point held in
para.
-
-
On entry,
isttf is too small (see
) and
nit iterations were carried out without the convergence conditions being satisfied (see
).
-
An unexpected error has been triggered by this routine. Please
contact
NAG.
-
Your licence key may have expired or may not have been installed correctly.
-
Dynamic memory allocation failed.
Accuracy
The computation used is believed to be stable.
Further Comments
The time taken by nag_tsa_multi_inputmod_estim (g13be) is approximately proportional to .
Example
After the full
iterations, the following are computed and printed out: the final values of the
para parameters and their standard errors, the correlation matrix, the residuals for the
differenced values, the values of
and
, the values of the state set and the number of degrees of freedom.
Open in the MATLAB editor:
g13be_example
function g13be_example
fprintf('g13be example results\n\n');
mr = [int64(1); 0; 0; 0; 0; 1; 4];
mt = zeros(4, 2, 'int64');
mt(1:4,1) = [1; 0; 1; 3];
para = [0; 0; 2; 0.5; 0];
xxy = [8.075, 105; 7.819, 119; 7.366, 119; 8.113, 109;
7.380, 117; 7.134, 135; 7.222, 126; 7.768, 112;
7.386, 116; 6.965, 122; 6.478, 115; 8.105, 115;
8.060, 122; 7.684, 138; 7.580, 135; 7.093, 125;
6.129, 115; 6.026, 108; 6.679, 100; 7.414, 96;
7.112, 107; 7.762, 115; 7.645, 123; 8.639, 122;
7.667, 128; 8.080, 136; 6.678, 140; 6.739, 122;
5.569, 102; 5.049, 103; 5.642, 89; 6.808, 77;
6.636, 89; 8.241, 94; 7.968, 104; 8.044, 108;
7.791, 119; 7.024, 126; 6.102, 119; 6.053, 103];
nxxy = size(xxy,1);
kef = int64(3);
[para, xxy, zsp, itc, sd, cm, s, d, ndf, res, sttf, nsttf, ifail] = ...
g13be( ...
mr, mt, para, xxy, 'kef', kef);
fprintf('The number of iterations carried out is %4d\n\n', itc);
fprintf('Final values of the parameters and their standard deviations\n\n');
fprintf(' i parameter sd\n\n');
ivar = [1:numel(para)]';
fprintf('%4d%20.6f%20.6f\n', [ivar para sd]');
fprintf('\n');
[ifail] = x04ca( ...
'General', ' ', cm, 'The correlation matrix is');
fprintf('\nThe residuals and the z and n values are\n\n');
fprintf(' i res z n\n\n');
ndv = nxxy - mr(2) - mr(5)*mr(7);
ival = double([1:ndv]');
fprintf('%4d%15.3f%15.3f%15.3f\n', [ival res xxy(1:ndv,:)]');
ival = double([ndv+1:nxxy]');
fprintf('%4d%30.3f%15.3f\n', [ival xxy(ndv+1:nxxy,:)]');
fprintf('\nThe state set consists of %4d values\n\n', nsttf);
disp(sttf');
fprintf('\nThe number of degrees of freedom is %4d\n', ndf);
g13be example results
The number of iterations carried out is 11
Final values of the parameters and their standard deviations
i parameter sd
1 0.380924 0.166379
2 -0.257786 0.178178
3 8.956084 0.948061
4 0.659641 0.060239
5 -75.435521 33.505341
The correlation matrix is
1 2 3 4 5
1 1.0000 -0.1839 -0.1775 -0.0340 0.1394
2 -0.1839 1.0000 0.0518 0.2547 -0.2860
3 -0.1775 0.0518 1.0000 -0.3070 -0.2926
4 -0.0340 0.2547 -0.3070 1.0000 -0.8185
5 0.1394 -0.2860 -0.2926 -0.8185 1.0000
The residuals and the z and n values are
i res z n
1 0.397 180.567 -75.567
2 3.086 191.430 -72.430
3 -2.818 196.302 -77.302
4 -9.941 195.460 -86.460
5 -5.061 201.594 -84.594
6 14.053 199.076 -64.076
7 2.624 195.211 -69.211
8 -5.823 193.450 -81.450
9 -2.147 197.179 -81.179
10 -0.216 196.217 -74.217
11 -2.517 191.812 -76.812
12 7.916 184.544 -69.544
13 1.423 194.322 -72.322
14 11.936 200.369 -62.369
15 5.117 200.990 -65.990
16 -5.672 200.468 -75.468
17 -5.681 195.763 -80.763
18 -1.637 184.025 -76.025
19 -1.019 175.360 -75.360
20 -2.623 175.492 -79.492
21 3.283 182.162 -75.162
22 6.896 183.857 -68.857
23 5.395 190.797 -67.797
24 0.875 194.327 -72.327
25 -4.153 205.558 -77.558
26 6.206 204.261 -68.261
27 4.208 207.104 -67.104
28 -2.387 196.423 -74.423
29 -11.803 189.924 -87.924
30 6.435 175.158 -72.158
31 1.342 160.761 -71.761
32 -4.924 156.575 -79.575
33 4.799 164.256 -75.256
34 -0.074 167.783 -73.783
35 -6.023 184.483 -80.483
36 -6.427 193.055 -85.055
37 -2.527 199.390 -80.390
38 2.039 201.302 -75.302
39 0.243 195.695 -76.695
40 -3.166 183.738 -80.738
The state set consists of 6 values
6.0530 183.7384 -5.7855 -0.1645 0.1800 -3.0977
The number of degrees of freedom is 34
PDF version (NAG web site
, 64-bit version, 64-bit version)
© The Numerical Algorithms Group Ltd, Oxford, UK. 2009–2015