NAG Library Routine Document

f07cpf  (zgtsvx)

 Contents

    1  Purpose
    7  Accuracy

1
Purpose

f07cpf (zgtsvx) uses the LU factorization to compute the solution to a complex system of linear equations
AX=B ,  ATX=B   or   AHX=B ,  
where A is a tridiagonal matrix of order n and X and B are n by r matrices. Error bounds on the solution and a condition estimate are also provided.

2
Specification

Fortran Interface
Subroutine f07cpf ( fact, trans, n, nrhs, dl, d, du, dlf, df, duf, du2, ipiv, b, ldb, x, ldx, rcond, ferr, berr, work, rwork, info)
Integer, Intent (In):: n, nrhs, ldb, ldx
Integer, Intent (Inout):: ipiv(*)
Integer, Intent (Out):: info
Real (Kind=nag_wp), Intent (Out):: rcond, ferr(nrhs), berr(nrhs), rwork(n)
Complex (Kind=nag_wp), Intent (In):: dl(*), d(*), du(*), b(ldb,*)
Complex (Kind=nag_wp), Intent (Inout):: dlf(*), df(*), duf(*), du2(*), x(ldx,*)
Complex (Kind=nag_wp), Intent (Out):: work(2*n)
Character (1), Intent (In):: fact, trans
C Header Interface
#include nagmk26.h
void  f07cpf_ ( const char *fact, const char *trans, const Integer *n, const Integer *nrhs, const Complex dl[], const Complex d[], const Complex du[], Complex dlf[], Complex df[], Complex duf[], Complex du2[], Integer ipiv[], const Complex b[], const Integer *ldb, Complex x[], const Integer *ldx, double *rcond, double ferr[], double berr[], Complex work[], double rwork[], Integer *info, const Charlen length_fact, const Charlen length_trans)
The routine may be called by its LAPACK name zgtsvx.

3
Description

f07cpf (zgtsvx) performs the following steps:
1. If fact='N', the LU decomposition is used to factor the matrix A as A=LU, where L is a product of permutation and unit lower bidiagonal matrices and U is upper triangular with nonzeros in only the main diagonal and first two superdiagonals.
2. If some uii=0, so that U is exactly singular, then the routine returns with info=i. Otherwise, the factored form of A is used to estimate the condition number of the matrix A. If the reciprocal of the condition number is less than machine precision, info=n+1 is returned as a warning, but the routine still goes on to solve for X and compute error bounds as described below.
3. The system of equations is solved for X using the factored form of A.
4. Iterative refinement is applied to improve the computed solution matrix and to calculate error bounds and backward error estimates for it.

4
References

Anderson E, Bai Z, Bischof C, Blackford S, Demmel J, Dongarra J J, Du Croz J J, Greenbaum A, Hammarling S, McKenney A and Sorensen D (1999) LAPACK Users' Guide (3rd Edition) SIAM, Philadelphia http://www.netlib.org/lapack/lug
Golub G H and Van Loan C F (1996) Matrix Computations (3rd Edition) Johns Hopkins University Press, Baltimore
Higham N J (2002) Accuracy and Stability of Numerical Algorithms (2nd Edition) SIAM, Philadelphia

5
Arguments

1:     fact – Character(1)Input
On entry: specifies whether or not the factorized form of the matrix A has been supplied.
fact='F'
dlf, df, duf, du2 and ipiv contain the factorized form of the matrix A. dlf, df, duf, du2 and ipiv will not be modified.
fact='N'
The matrix A will be copied to dlf, df and duf and factorized.
Constraint: fact='F' or 'N'.
2:     trans – Character(1)Input
On entry: specifies the form of the system of equations.
trans='N'
AX=B (No transpose).
trans='T'
ATX=B (Transpose).
trans='C'
AHX=B (Conjugate transpose).
Constraint: trans='N', 'T' or 'C'.
3:     n – IntegerInput
On entry: n, the order of the matrix A.
Constraint: n0.
4:     nrhs – IntegerInput
On entry: r, the number of right-hand sides, i.e., the number of columns of the matrix B.
Constraint: nrhs0.
5:     dl* – Complex (Kind=nag_wp) arrayInput
Note: the dimension of the array dl must be at least max1,n-1.
On entry: the n-1 subdiagonal elements of A.
6:     d* – Complex (Kind=nag_wp) arrayInput
Note: the dimension of the array d must be at least max1,n.
On entry: the n diagonal elements of A.
7:     du* – Complex (Kind=nag_wp) arrayInput
Note: the dimension of the array du must be at least max1,n-1.
On entry: the n-1 superdiagonal elements of A.
8:     dlf* – Complex (Kind=nag_wp) arrayInput/Output
Note: the dimension of the array dlf must be at least max1,n-1.
On entry: if fact='F', dlf contains the n-1 multipliers that define the matrix L from the LU factorization of A.
On exit: if fact='N', dlf contains the n-1 multipliers that define the matrix L from the LU factorization of A.
9:     df* – Complex (Kind=nag_wp) arrayInput/Output
Note: the dimension of the array df must be at least max1,n.
On entry: if fact='F', df contains the n diagonal elements of the upper triangular matrix U from the LU factorization of A.
On exit: if fact='N', df contains the n diagonal elements of the upper triangular matrix U from the LU factorization of A.
10:   duf* – Complex (Kind=nag_wp) arrayInput/Output
Note: the dimension of the array duf must be at least max1,n-1.
On entry: if fact='F', duf contains the n-1 elements of the first superdiagonal of U.
On exit: if fact='N', duf contains the n-1 elements of the first superdiagonal of U.
11:   du2* – Complex (Kind=nag_wp) arrayInput/Output
Note: the dimension of the array du2 must be at least max1,n-2.
On entry: if fact='F', du2 contains the (n-2) elements of the second superdiagonal of U.
On exit: if fact='N', du2 contains the (n-2) elements of the second superdiagonal of U.
12:   ipiv* – Integer arrayInput/Output
Note: the dimension of the array ipiv must be at least max1,n.
On entry: if fact='F', ipiv contains the pivot indices from the LU factorization of A.
On exit: if fact='N', ipiv contains the pivot indices from the LU factorization of A; row i of the matrix was interchanged with row ipivi. ipivi will always be either i or i+1; ipivi=i indicates a row interchange was not required.
13:   bldb* – Complex (Kind=nag_wp) arrayInput
Note: the second dimension of the array b must be at least max1,nrhs.
On entry: the n by r right-hand side matrix B.
14:   ldb – IntegerInput
On entry: the first dimension of the array b as declared in the (sub)program from which f07cpf (zgtsvx) is called.
Constraint: ldbmax1,n.
15:   xldx* – Complex (Kind=nag_wp) arrayOutput
Note: the second dimension of the array x must be at least max1,nrhs.
On exit: if info=0 or n+1, the n by r solution matrix X.
16:   ldx – IntegerInput
On entry: the first dimension of the array x as declared in the (sub)program from which f07cpf (zgtsvx) is called.
Constraint: ldxmax1,n.
17:   rcond – Real (Kind=nag_wp)Output
On exit: the estimate of the reciprocal condition number of the matrix A. If rcond=0.0, the matrix may be exactly singular. This condition is indicated by info>0andinfon. Otherwise, if rcond is less than the machine precision, the matrix is singular to working precision. This condition is indicated by info=n+1.
18:   ferrnrhs – Real (Kind=nag_wp) arrayOutput
On exit: if info=0 or n+1, an estimate of the forward error bound for each computed solution vector, such that x^j-xj/xjferrj where x^j is the jth column of the computed solution returned in the array x and xj is the corresponding column of the exact solution X. The estimate is as reliable as the estimate for rcond, and is almost always a slight overestimate of the true error.
19:   berrnrhs – Real (Kind=nag_wp) arrayOutput
On exit: if info=0 or n+1, an estimate of the component-wise relative backward error of each computed solution vector x^j (i.e., the smallest relative change in any element of A or B that makes x^j an exact solution).
20:   work2×n – Complex (Kind=nag_wp) arrayWorkspace
21:   rworkn – Real (Kind=nag_wp) arrayWorkspace
22:   info – IntegerOutput
On exit: info=0 unless the routine detects an error (see Section 6).

6
Error Indicators and Warnings

info<0
If info=-i, argument i had an illegal value. An explanatory message is output, and execution of the program is terminated.
info>0andinfo<n
Element value of the diagonal is exactly zero. The factorization has not been completed, but the factor U is exactly singular, so the solution and error bounds could not be computed. rcond=0.0 is returned.
info>0andinfo=n
Element value of the diagonal is exactly zero. The factorization has been completed, but the factor U is exactly singular, so the solution and error bounds could not be computed. rcond=0.0 is returned.
info=n+1
U is nonsingular, but rcond is less than machine precision, meaning that the matrix is singular to working precision. Nevertheless, the solution and error bounds are computed because there are a number of situations where the computed solution can be more accurate than the value of rcond would suggest.

7
Accuracy

For each right-hand side vector b, the computed solution x^ is the exact solution of a perturbed system of equations A+Ex^=b, where
E c n ε L U ,  
cn is a modest linear function of n, and ε is the machine precision. See Section 9.3 of Higham (2002) for further details.
If x is the true solution, then the computed solution x^ satisfies a forward error bound of the form
x-x^ x^ wc condA,x^,b  
where condA,x^,b = A-1 A x^ + b / x^ condA = A-1 A κ A. If x^  is the j th column of X , then wc  is returned in berrj  and a bound on x - x^ / x^  is returned in ferrj . See Section 4.4 of Anderson et al. (1999) for further details.

8
Parallelism and Performance

f07cpf (zgtsvx) is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
f07cpf (zgtsvx) makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the Users' Note for your implementation for any additional implementation-specific information.

9
Further Comments

The total number of floating-point operations required to solve the equations AX=B  is proportional to nr .
The condition number estimation typically requires between four and five solves and never more than eleven solves, following the factorization. The solution is then refined, and the errors estimated, using iterative refinement.
In practice the condition number estimator is very reliable, but it can underestimate the true condition number; see Section 15.3 of Higham (2002) for further details.
The real analogue of this routine is f07cbf (dgtsvx).

10
Example

This example solves the equations
AX=B ,  
where A  is the tridiagonal matrix
A = -1.3+1.3i 2.0-1.0i 0.0i+0.0 0.0i+0.0 0.0i+0.0 1.0-2.0i -1.3+1.3i 2.0+1.0i 0.0i+0.0 0.0i+0.0 0.0i+0.0 1.0+1.0i -1.3+3.3i -1.0+1.0i 0.0i+0.0 0.0i+0.0 0.0i+0.0 2.0-3.0i -0.3+4.3i 1.0-1.0i 0.0i+0.0 0.0i+0.0 0.0i+0.0 1.0+1.0i -3.3+1.3i  
and
B = 2.4-05.0i 2.7+06.9i 3.4+18.2i -6.9-05.3i -14.7+09.7i -6.0-00.6i 31.9-07.7i -3.9+09.3i -1.0+01.6i -3.0+12.2i .  
Estimates for the backward errors, forward errors and condition number are also output.

10.1
Program Text

Program Text (f07cpfe.f90)

10.2
Program Data

Program Data (f07cpfe.d)

10.3
Program Results

Program Results (f07cpfe.r)

© The Numerical Algorithms Group Ltd, Oxford, UK. 2017