NAG Library Routine Document
f01hbf (complex_gen_matrix_actexp_rcomm)
1
Purpose
f01hbf computes the action of the matrix exponential , on the matrix , where is a complex by matrix, is a complex by matrix and is a complex scalar. It uses reverse communication for evaluating matrix products, so that the matrix is not accessed explicitly.
2
Specification
Fortran Interface
Subroutine f01hbf ( |
irevcm, n, m, b, ldb, t, tr, b2, ldb2, x, ldx, y, ldy, p, r, z, ccomm, comm, icomm, ifail) |
Integer, Intent (In) | :: | n, m, ldb, ldb2, ldx, ldy | Integer, Intent (Inout) | :: | irevcm, icomm(2*n+40), ifail | Real (Kind=nag_wp), Intent (Inout) | :: | comm(3*n+14) | Complex (Kind=nag_wp), Intent (In) | :: | t, tr | Complex (Kind=nag_wp), Intent (Inout) | :: | b(ldb,*), b2(ldb2,*), x(ldx,*), y(ldy,*), p(n), r(n), z(n), ccomm(n*(m+2)) |
|
C Header Interface
#include <nagmk26.h>
void |
f01hbf_ (Integer *irevcm, const Integer *n, const Integer *m, Complex b[], const Integer *ldb, const Complex *t, const Complex *tr, Complex b2[], const Integer *ldb2, Complex x[], const Integer *ldx, Complex y[], const Integer *ldy, Complex p[], Complex r[], Complex z[], Complex ccomm[], double comm[], Integer icomm[], Integer *ifail) |
|
3
Description
is computed using the algorithm described in
Al–Mohy and Higham (2011) which uses a truncated Taylor series to compute the
without explicitly forming
.
The algorithm does not explicity need to access the elements of ; it only requires the result of matrix multiplications of the form or . A reverse communication interface is used, in which control is returned to the calling program whenever a matrix product is required.
4
References
Al–Mohy A H and Higham N J (2011) Computing the action of the matrix exponential, with an application to exponential integrators SIAM J. Sci. Statist. Comput. 33(2) 488-511
Higham N J (2008) Functions of Matrices: Theory and Computation SIAM, Philadelphia, PA, USA
5
Arguments
Note: this routine uses
reverse communication. Its use involves an initial entry, intermediate exits and re-entries, and a final exit, as indicated by the
argument irevcm. Between intermediate exits and re-entries,
all arguments other than b2, x, y, p and r must remain unchanged.
- 1: – IntegerInput/Output
-
On initial entry: must be set to .
On intermediate exit:
,
,
,
or
. The calling program must:
(a) |
if : evaluate , where is an by matrix, and store the result in b2;
if : evaluate , where and are by matrices, and store the result in y;
if : evaluate and store the result in x;
if : evaluate and store the result in p;
if : evaluate and store the result in r. |
(b) |
call f01hbf again with all other parameters unchanged. |
On final exit: .
Note: any values you return to f01hbf as part of the reverse communication procedure should not include floating-point NaN (Not a Number) or infinity values, since these are not handled by f01hbf. If your code does inadvertently return any NaNs or infinities, f01hbf is likely to produce unexpected results.
- 2: – IntegerInput
-
On entry: , the order of the matrix .
Constraint:
.
- 3: – IntegerInput
-
On entry: the number of columns of the matrix .
Constraint:
.
- 4: – Complex (Kind=nag_wp) arrayInput/Output
-
Note: the second dimension of the array
b
must be at least
.
On initial entry: the by matrix .
On intermediate exit:
if , contains the by matrix .
On intermediate re-entry: must not be changed.
On final exit: the by matrix .
- 5: – IntegerInput
-
On entry: the first dimension of the array
b as declared in the (sub)program from which
f01hbf is called.
Constraint:
.
- 6: – Complex (Kind=nag_wp)Input
-
On entry: the scalar .
- 7: – Complex (Kind=nag_wp)Input
-
On entry: the trace of
. If this is not available then any number can be supplied (
is a reasonable default); however, in the trivial case,
, the result
is immediately returned in the first row of
. See
Section 9.
- 8: – Complex (Kind=nag_wp) arrayInput/Output
-
Note: the second dimension of the array
b2
must be at least
.
On initial entry: need not be set.
On intermediate re-entry: if , must contain .
On final exit: the array is undefined.
- 9: – IntegerInput
-
On initial entry: the first dimension of the array
b2 as declared in the (sub)program from which
f01hbf is called.
Constraint:
.
- 10: – Complex (Kind=nag_wp) arrayInput/Output
-
Note: the second dimension of the array
x
must be at least
.
On initial entry: need not be set.
On intermediate exit:
if , contains the current by matrix .
On intermediate re-entry: if , must contain .
On final exit: the array is undefined.
- 11: – IntegerInput
-
On entry: the first dimension of the array
x as declared in the (sub)program from which
f01hbf is called.
Constraint:
.
- 12: – Complex (Kind=nag_wp) arrayInput/Output
-
Note: the second dimension of the array
y
must be at least
.
On initial entry: need not be set.
On intermediate exit:
if , contains the current by matrix .
On intermediate re-entry: if , must contain .
On final exit: the array is undefined.
- 13: – IntegerInput
-
On entry: the first dimension of the array
y as declared in the (sub)program from which
f01hbf is called.
Constraint:
.
- 14: – Complex (Kind=nag_wp) arrayInput/Output
-
On initial entry: need not be set.
On intermediate re-entry: if , must contain .
On final exit: the array is undefined.
- 15: – Complex (Kind=nag_wp) arrayInput/Output
-
On initial entry: need not be set.
On intermediate re-entry: if , must contain .
On final exit: the array is undefined.
- 16: – Complex (Kind=nag_wp) arrayInput/Output
-
On initial entry: need not be set.
On intermediate exit:
if or , contains the vector .
On intermediate re-entry: must not be changed.
On final exit: the array is undefined.
- 17: – Complex (Kind=nag_wp) arrayCommunication Array
-
- 18: – Real (Kind=nag_wp) arrayCommunication Array
-
- 19: – Integer arrayCommunication Array
-
- 20: – IntegerInput/Output
-
On entry:
ifail must be set to
,
. If you are unfamiliar with this argument you should refer to
Section 3.4 in How to Use the NAG Library and its Documentation for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value
is recommended. If the output of error messages is undesirable, then the value
is recommended. Otherwise, if you are not familiar with this argument, the recommended value is
.
When the value is used it is essential to test the value of ifail on exit.
On exit:
unless the routine detects an error or a warning has been flagged (see
Section 6).
6
Error Indicators and Warnings
If on entry
or
, explanatory error messages are output on the current error message unit (as defined by
x04aaf).
Errors or warnings detected by the routine:
-
has been computed using an IEEE double precision Taylor series, although the arithmetic precision is higher than IEEE double precision.
-
On initial entry, .
Constraint: .
On intermediate re-entry, .
Constraint: , , , or .
-
On initial entry, .
Constraint: .
-
On initial entry, .
Constraint: .
-
On initial entry, and .
Constraint: .
-
On initial entry, and .
Constraint: .
-
On initial entry, and .
Constraint: .
-
On initial entry, and .
Constraint: .
An unexpected error has been triggered by this routine. Please
contact
NAG.
See
Section 3.9 in How to Use the NAG Library and its Documentation for further information.
Your licence key may have expired or may not have been installed correctly.
See
Section 3.8 in How to Use the NAG Library and its Documentation for further information.
Dynamic memory allocation failed.
See
Section 3.7 in How to Use the NAG Library and its Documentation for further information.
7
Accuracy
For an Hermitian matrix
(for which
) the computed matrix
is guaranteed to be close to the exact matrix, that is, the method is forward stable. No such guarantee can be given for non-Hermitian matrices. See Section 4 of
Al–Mohy and Higham (2011) for details and further discussion.
8
Parallelism and Performance
f01hbf is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
Please consult the
X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the
Users' Note for your implementation for any additional implementation-specific information.
The elements of are not explicitly required by f01hbf. However, the trace of is used in the preprocessing phase of the algorithm. If is not available to the calling subroutine then any number can be supplied ( is recommended). This will not affect the stability of the algorithm, but it may reduce its efficiency.
f01hbf is designed to be used when is large and sparse. Whenever a matrix multiplication is required, the routine will return control to the calling program so that the multiplication can be done in the most efficient way possible. Note that will not, in general, be sparse even if is sparse.
If
is small and dense then
f01haf can be used to compute
without the use of a reverse communication interface.
The real analog of
f01hbf is
f01gbf.
To compute
, the following skeleton code can normally be used:
revcm: Do
Call f01hbf(irevcm,n,m,b,ldb,t,tr,b2,ldb2,x,ldx,y,ldx,p,r,z, &
ccomm,comm,icomm,ifail)
If (irevcm == 0) Then
Exit revcm
Else If (irevcm == 1) Then
.. Code to compute b2=ab ..
Else If (irevcm == 2) Then
.. Code to compute y=ax ..
Else If (irevcm == 3) Then
.. Code to compute x=a^h y ..
Else If (irevcm == 4) Then
.. Code to compute p=az ..
Else If (irevcm == 5) Then
.. Code to compute r=a^h z ..
End If
End Do revcm
The code used to compute the matrix products will vary depending on the way
is stored. If all the elements of
are stored explicitly, then
f06zaf (zgemm) can be used. If
is triangular then
f06zff (ztrmm) should be used. If
is Hermitian, then
f06zcf (zhemm) should be used. If
is symmetric, then
f06ztf (zsymm) should be used. For sparse
stored in coordinate storage format
f11xnf and
f11xsf can be used. For sparse
stored in compressed column storage format (CCS) the program text of
Section 10 contains the routine matmul to perform matrix products.
10
Example
This example computes
where
and
is stored in compressed column storage format (CCS) and matrix multiplications are performed using the routine matmul.
10.1
Program Text
Program Text (f01hbfe.f90)
10.2
Program Data
Program Data (f01hbfe.d)
10.3
Program Results
Program Results (f01hbfe.r)