NAG FL Interface
g13ddf (multi_varma_estimate)
1
Purpose
g13ddf fits a vector autoregressive moving average (VARMA) model to an observed vector of time series using the method of Maximum Likelihood (ML). Standard errors of parameter estimates are computed along with their appropriate correlation matrix. The routine also calculates estimates of the residual series.
2
Specification
Fortran Interface
Subroutine g13ddf ( |
k, n, ip, iq, mean, par, npar, qq, kmax, w, parhld, exact, iprint, cgetol, maxcal, ishow, niter, rlogl, v, g, cm, ldcm, ifail) |
Integer, Intent (In) |
:: |
k, n, ip, iq, npar, kmax, iprint, maxcal, ishow, ldcm |
Integer, Intent (Inout) |
:: |
ifail |
Integer, Intent (Out) |
:: |
niter |
Real (Kind=nag_wp), Intent (In) |
:: |
w(kmax,n), cgetol |
Real (Kind=nag_wp), Intent (Inout) |
:: |
par(npar), qq(kmax,k), v(kmax,n), cm(ldcm,npar) |
Real (Kind=nag_wp), Intent (Out) |
:: |
rlogl, g(npar) |
Logical, Intent (In) |
:: |
mean, parhld(npar), exact |
|
C Header Interface
#include <nag.h>
void |
g13ddf_ (const Integer *k, const Integer *n, const Integer *ip, const Integer *iq, const logical *mean, double par[], const Integer *npar, double qq[], const Integer *kmax, const double w[], const logical parhld[], const logical *exact, const Integer *iprint, const double *cgetol, const Integer *maxcal, const Integer *ishow, Integer *niter, double *rlogl, double v[], double g[], double cm[], const Integer *ldcm, Integer *ifail) |
|
C++ Header Interface
#include <nag.h> extern "C" {
void |
g13ddf_ (const Integer &k, const Integer &n, const Integer &ip, const Integer &iq, const logical &mean, double par[], const Integer &npar, double qq[], const Integer &kmax, const double w[], const logical parhld[], const logical &exact, const Integer &iprint, const double &cgetol, const Integer &maxcal, const Integer &ishow, Integer &niter, double &rlogl, double v[], double g[], double cm[], const Integer &ldcm, Integer &ifail) |
}
|
The routine may be called by the names g13ddf or nagf_tsa_multi_varma_estimate.
3
Description
Let
, for
, denote a vector of
time series which is assumed to follow a multivariate ARMA model of the form
where
, for
, is a vector of
residual series assumed to be Normally distributed with zero mean and positive definite covariance matrix
. The components of
are assumed to be uncorrelated at non-simultaneous lags. The
and
are
by
matrices of parameters.
, for
, are called the autoregressive (AR) parameter matrices, and
, for
, the moving average (MA) parameter matrices. The parameters in the model are thus the
(
by
)
-matrices, the
(
by
)
-matrices, the mean vector,
, and the residual error covariance matrix
. Let
where
denotes the
by
identity matrix.
The ARMA model
(1) is said to be stationary if the eigenvalues of
lie inside the unit circle. Similarly, the ARMA model
(1) is said to be invertible if the eigenvalues of
lie inside the unit circle.
The method of computing the exact likelihood function (using a Kalman filter algorithm) is discussed in
Shea (1987). A quasi-Newton algorithm (see
Gill and Murray (1972)) is then used to search for the maximum of the log-likelihood function. Stationarity and invertibility are enforced on the model using the reparameterisation discussed in
Ansley and Kohn (1986). Conditional on the maximum likelihood estimates being equal to their true values the estimates of the residual series are uncorrelated with zero mean and constant variance
.
You have the option of setting an argument (
exact to .FALSE.) so that
g13ddf calculates conditional maximum likelihood estimates (conditional on
). This may be useful if the exact maximum likelihood estimates are close to the boundary of the invertibility region.
You also have the option (see
Section 5) of requesting
g13ddf to constrain elements of the
and
matrices and
vector to have pre-specified values.
4
References
Ansley C F and Kohn R (1986) A note on reparameterising a vector autoregressive moving average model to enforce stationarity J. Statist. Comput. Simulation 24 99–106
Gill P E and Murray W (1972) Quasi-Newton methods for unconstrained optimization J. Inst. Math. Appl. 9 91–108
Shea B L (1987) Estimation of multivariate time series J. Time Ser. Anal. 8 95–110
5
Arguments
-
1:
– Integer
Input
-
On entry: , the number of observed time series.
Constraint:
.
-
2:
– Integer
Input
-
On entry: , the number of observations in each time series.
-
3:
– Integer
Input
-
On entry: , the number of AR parameter matrices.
Constraint:
.
-
4:
– Integer
Input
-
On entry: , the number of MA parameter matrices.
Constraint:
.
is not permitted.
-
5:
– Logical
Input
-
On entry: , if components of have been estimated and , if all elements of are to be taken as zero.
Constraint:
or .
-
6:
– Real (Kind=nag_wp) array
Input/Output
-
On entry: initial parameter estimates read in row by row in the order
,
.
Thus,
- if ,
must be set equal to an initial estimate of the th element of , for , and ;
- if , must be set equal to an initial estimate of the th element of , and ;
- if , should be set equal to an initial estimate of the th component of (). (If you set to then g13ddf will calculate the mean of the th series and use this as an initial estimate of .)
The first
elements of
par must satisfy the stationarity condition and the next
elements of
par must satisfy the invertibility condition.
If in doubt set all elements of
par to
.
On exit: if
or
then all the elements of
par will be overwritten by the latest estimates of the corresponding ARMA parameters.
-
7:
– Integer
Input
-
On entry: the dimension of the arrays
par,
parhld and
g and the second dimension of the array
cm as declared in the (sub)program from which
g13ddf is called.
npar is the number of initial parameter estimates.
Constraints:
- if , npar must be set equal to ;
- if , npar must be set equal to .
The total number of observations must exceed the total number of parameters in the model ().
-
8:
– Real (Kind=nag_wp) array
Input/Output
-
On entry:
must be set equal to an initial estimate of the
th element of
. The lower triangle only is needed.
qq must be positive definite. It is strongly recommended that on entry the elements of
qq are of the same order of magnitude as at the solution point. If you set
, for
and
, then
g13ddf will calculate the covariance matrix between the
time series and use this as an initial estimate of
.
On exit: if or then will contain the latest estimate of the th element of . The lower triangle only is returned.
-
9:
– Integer
Input
-
On entry: the first dimension of the arrays
qq,
w and
v as declared in the (sub)program from which
g13ddf is called.
Constraint:
.
-
10:
– Real (Kind=nag_wp) array
Input
-
On entry: must be set equal to the th component of , for and .
-
11:
– Logical array
Input
-
On entry:
must be set to .TRUE. if
is to be held constant at its input value and .FALSE. if
is a free parameter, for
.
If in doubt try setting all elements of
parhld to .FALSE..
-
12:
– Logical
Input
-
On entry: must be set equal to .TRUE. if you wish
g13ddf to compute exact maximum likelihood estimates.
exact must be set equal to .FALSE. if only conditional likelihood estimates are required.
-
13:
– Integer
Input
-
On entry: the frequency with which the automatic monitoring routine is to be called.
- The ML search procedure is monitored once every iprint iterations and just before exit from the search routine.
- The search routine is monitored once at the final point.
- The search routine is not monitored at all.
-
14:
– Real (Kind=nag_wp)
Input
-
On entry: the accuracy to which the solution in
par and
qq is required.
If
cgetol is set to
and on exit
or
, then all the elements in
par and
qq should be accurate to approximately
decimal places. For most practical purposes the value
should suffice. You should be wary of setting
cgetol too small since the convergence criteria may then have become too strict for the machine to handle.
If
cgetol has been set to a value which is less than the
machine precision,
, then
g13ddf will use the value
instead.
-
15:
– Integer
Input
-
On entry: the maximum number of likelihood evaluations to be permitted by the search procedure.
Suggested value:
.
Constraint:
.
-
16:
– Integer
Input
-
On entry: specifies which of the following two quantities are to be printed.
-
(i)table of maximum likelihood estimates and their standard errors (as returned in the output arrays par, qq and cm);
-
(ii)table of residual series (as returned in the output array v).
- None of the above are printed.
- (i) only is printed.
- (i) and (ii) are printed.
Constraint:
.
-
17:
– Integer
Output
-
On exit: if
or
then
niter contains the number of iterations performed by the search routine.
-
18:
– Real (Kind=nag_wp)
Output
-
On exit: if
or
then
rlogl contains the value of the log-likelihood function corresponding to the final point held in
par and
qq.
-
19:
– Real (Kind=nag_wp) array
Output
-
On exit: if
or
then
will contain an estimate of the
th component of
, for
and
, corresponding to the final point held in
par and
qq.
-
20:
– Real (Kind=nag_wp) array
Output
-
On exit: if
or
then
will contain the estimated first derivative of the log-likelihood function with respect to the
th element in the array
par. If the gradient cannot be computed then all the elements of
g are returned as zero.
-
21:
– Real (Kind=nag_wp) array
Output
-
On exit: if
or
then
will contain an estimate of the correlation coefficient between the
th and
th elements in the
par array for
,
. If
, then
will contain the estimated standard error of
. If the
th component of
par has been held constant, i.e.,
was set to .TRUE., then the
th row and column of
cm will be set to zero. If the second derivative matrix cannot be computed then all the elements of
cm are returned as zero.
-
22:
– Integer
Input
-
On entry: the first dimension of the array
cm as declared in the (sub)program from which
g13ddf is called.
Constraint:
.
-
23:
– Integer
Input/Output
-
On entry:
ifail must be set to
,
. If you are unfamiliar with this argument you should refer to
Section 4 in the Introduction to the NAG Library FL Interface for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value
is recommended. If the output of error messages is undesirable, then the value
is recommended. Otherwise, because for this routine the values of the output arguments may be useful even if
on exit, the recommended value is
.
When the value is used it is essential to test the value of ifail on exit.
On exit:
unless the routine detects an error or a warning has been flagged (see
Section 6).
6
Error Indicators and Warnings
If on entry
or
, explanatory error messages are output on the current error message unit (as defined by
x04aaf).
Errors or warnings detected by the routine:
Note: in some cases g13ddf may return useful information.
-
On entry, .
Constraint: .
On entry, and .
On entry, .
Constraint: .
On entry, .
Constraint: .
On entry, .
Constraint: .
On entry, and .
Constraint: .
On entry, and .
Constraint: .
On entry, .
Constraint: .
On entry, , and .
Constraint: .
On entry, .
Constraint: .
On entry, .
Constraint: .
-
The initial AR parameter estimates are outside the stationarity region. To proceed you must try a different starting point.
The initial estimate of is not positive definite. To proceed you must try a different starting point.
The initial MA parameter estimates are outside the invertibility region. To proceed you must try a different starting point.
The starting point is too close to the boundary of the admissibility region. To proceed you must try a different starting point.
-
The routine cannot compute a sufficiently accurate estimate of the gradient vector at the user-supplied starting point. This usually occurs if either the initial parameter estimates are very close to the ML parameter estimates, or you have supplied a very poor estimate of , or the starting point is very close to the boundary of the stationarity or invertibility region. To proceed you must try a different starting point.
-
There have been
maxcal log-likelihood evaluations made in the routine.
If steady increases in the log-likelihood function were monitored up to the point where this exit occurred, then the exit probably simply occurred because
maxcal was set too small, so the calculations should be restarted from the final point held in
par and
qq. This type of exit may also indicate that there is no maximum to the likelihood surface. Output quantities were computed at the final point held in
par and
qq, except that if
g or
cm could not be computed, in which case they are set to zero.
-
The conditions for a solution have not all been met, but a point at which the log-likelihood took a larger value could not be found.
Provided that the estimated first derivatives are sufficiently small, and that the estimated condition number of the second derivative (Hessian) matrix, as printed when , is not too large, this error exit may simply mean that, although it has not been possible to satisfy the specified requirements, the algorithm has in fact found the solution as far as the accuracy of the machine permits.
Such a condition can arise, for instance, if
cgetol has been set so small that rounding error in evaluating the likelihood function makes attainment of the convergence conditions impossible.
If the estimated condition number at the final point is large, it could be that the final point is a solution but that the smallest eigenvalue of the Hessian matrix is so close to zero at the solution that it is not possible to recognize it as a solution. Output quantities were computed at the final point held in
par and
qq, except that if
g or
cm could not be computed, in which case they are set to zero.
-
The ML solution is so close to the boundary of either the stationarity region or the invertibility region that
g13ddf cannot evaluate the Hessian matrix. The elements of
cm are set to zero, as are the elements of
g. All other output quantities are correct.
-
An estimate of the second derivative matrix and the gradient vector at the solution point was computed. Either the Hessian matrix was found to be too ill-conditioned to be evaluated accurately or the gradient vector could not be computed to an acceptable degree of accuracy. The elements of
cm are set to zero, as are the elements of
g. All other output quantities are correct.
-
The second-derivative matrix at the solution point is not positive definite. The elements of
cm are set to zero. All other output quantities are correct.
An unexpected error has been triggered by this routine. Please
contact
NAG.
See
Section 7 in the Introduction to the NAG Library FL Interface for further information.
Your licence key may have expired or may not have been installed correctly.
See
Section 8 in the Introduction to the NAG Library FL Interface for further information.
Dynamic memory allocation failed.
See
Section 9 in the Introduction to the NAG Library FL Interface for further information.
7
Accuracy
On exit from
g13ddf, if
or
and
cgetol has been set to
, then all the parameters should be accurate to approximately
decimal places. If
cgetol was set equal to a value less than the
machine precision,
, then all the parameters should be accurate to approximately
.
If
on exit (i.e.,
maxcal likelihood evaluations have been made but the convergence conditions of the search routine have not been satisfied), then the elements in
par and
qq may still be good approximations to the ML estimates. Inspection of the elements of
g may help you determine whether this is likely.
8
Parallelism and Performance
g13ddf is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
g13ddf makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the
X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the
Users' Note for your implementation for any additional implementation-specific information.
Let and . Local workspace arrays of fixed lengths are allocated internally by g13ddf. The total size of these arrays amounts to integer elements and real elements.
The number of iterations required depends upon the number of parameters in the model and the distance of the user-supplied starting point from the solution.
If the solution lies on the boundary of the admissibility region (stationarity and invertibility region) then
g13ddf may get into difficulty and exit with
. If this exit occurs you are advised to either try a different starting point or a different setting for
exact. If this still continues to occur then you are urged to try fitting a more parsimonious model.
You are advised to try and avoid fitting models with an excessive number of parameters since over-parameterisation can cause the maximization problem to become ill-conditioned.
The standardized estimates of the residual series
(denoted by
) can easily be calculated by forming the Cholesky decomposition of
, e.g.,
and setting
.
f07fdf may be used to calculate the array
g. The components of
which are now uncorrelated at
all lags can sometimes be more easily interpreted.
If your time series model provides a good fit to the data then the residual series should be approximately white noise, i.e., exhibit no serial cross-correlation. An examination of the residual cross-correlation matrices should confirm whether this is likely to be so. You are advised to call
g13dsf to provide information for diagnostic checking.
g13dsf returns the residual cross-correlation matrices along with their asymptotic standard errors.
g13dsf also computes a portmanteau statistic and its asymptotic significance level for testing model adequacy. If
or
on exit from
g13ddf then the quantities output
k,
n,
v,
kmax,
ip,
iq,
par,
parhld, and
qq will be suitable for input to
g13dsf.
10
Example
This example shows how to fit a bivariate AR(1) model to two series each of length . will be estimated and will be constrained to be zero.
10.1
Program Text
10.2
Program Data
10.3
Program Results