The routine may be called by the names f01gaf or nagf_matop_real_gen_matrix_actexp.
3Description
is computed using the algorithm described in Al–Mohy and Higham (2011) which uses a truncated Taylor series to compute the product without explicitly forming .
4References
Al–Mohy A H and Higham N J (2011) Computing the action of the matrix exponential, with an application to exponential integrators SIAM J. Sci. Statist. Comput.33(2) 488-511
Higham N J (2008) Functions of Matrices: Theory and Computation SIAM, Philadelphia, PA, USA
5Arguments
1: – IntegerInput
On entry: , the order of the matrix .
Constraint:
.
2: – IntegerInput
On entry: , the number of columns of the matrix .
Constraint:
.
3: – Real (Kind=nag_wp) arrayInput/Output
Note: the second dimension of the array a
must be at least
.
On entry: the matrix .
On exit: is overwritten during the computation.
4: – IntegerInput
On entry: the first dimension of the array a as declared in the (sub)program from which f01gaf is called.
Constraint:
.
5: – Real (Kind=nag_wp) arrayInput/Output
Note: the second dimension of the array b
must be at least
.
On entry: the matrix .
On exit: the matrix .
6: – IntegerInput
On entry: the first dimension of the array b as declared in the (sub)program from which f01gaf is called.
Constraint:
.
7: – Real (Kind=nag_wp)Input
On entry: the scalar .
8: – IntegerInput/Output
On entry: ifail must be set to , or to set behaviour on detection of an error; these values have no effect when no error is detected.
A value of causes the printing of an error message and program execution will be halted; otherwise program execution continues. A value of means that an error message is printed while a value of means that it is not.
If halting is not appropriate, the value or is recommended. If message printing is undesirable, then the value is recommended. Otherwise, the value is recommended. When the value or is used it is essential to test the value of ifail on exit.
On exit: unless the routine detects an error or a warning has been flagged (see Section 6).
6Error Indicators and Warnings
If on entry or , explanatory error messages are output on the current error message unit (as defined by x04aaf).
Errors or warnings detected by the routine:
has been computed using an IEEE double precision Taylor series, although the arithmetic precision is higher than IEEE double precision.
On entry, . Constraint: .
On entry, . Constraint: .
On entry, and . Constraint: .
On entry, and . Constraint: .
An unexpected error has been triggered by this routine. Please
contact NAG.
See Section 7 in the Introduction to the NAG Library FL Interface for further information.
Your licence key may have expired or may not have been installed correctly.
See Section 8 in the Introduction to the NAG Library FL Interface for further information.
Dynamic memory allocation failed.
See Section 9 in the Introduction to the NAG Library FL Interface for further information.
7Accuracy
For a symmetric matrix (for which ) the computed matrix is guaranteed to be close to the exact matrix, that is, the method is forward stable. No such guarantee can be given for non-symmetric matrices. See Section 4 of Al–Mohy and Higham (2011) for details and further discussion.
8Parallelism and Performance
f01gaf is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
f01gaf makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the Users' Note for your implementation for any additional implementation-specific information.
9Further Comments
The matrix could be computed by explicitly forming using f01ecf and multiplying by the result. However, experiments show that it is usually both more accurate and quicker to use f01gaf.
The cost of the algorithm is . The precise cost depends on since a combination of balancing, shifting and scaling is used prior to the Taylor series evaluation.
Approximately of real allocatable memory is required by f01gaf.
f01haf can be used to compute for complex , , and . f01gbf provides an implementation of the algorithm with a reverse communication interface, which returns control to the user when matrix multiplications are required. This should be used if is large and sparse.