NAG Library Routine Document

f07fqf (zcposv)

1  Purpose

f07fqf (zcposv) uses the Cholesky factorization
$A = U^{H}U$   or   $A = LL^{H}$
to compute the solution to a complex system of linear equations
$AX = B$,
where $A$ is an $n$ by $n$ Hermitian positive definite matrix and $X$ and $B$ are $n$ by $r$ matrices.

2  Specification

Fortran Interface
Subroutine f07fqf ( uplo, n, nrhs, a, lda, b, ldb, x, ldx, work, swork, rwork, iter, info)
Integer, Intent (In):: n, nrhs, lda, ldb, ldx
Integer, Intent (Out):: iter, info
Real (Kind=nag_wp), Intent (Out):: rwork(n)
Complex (Kind=nag_wp), Intent (In):: b(ldb,*)
Complex (Kind=nag_wp), Intent (Inout):: a(lda,*), x(ldx,*)
Complex (Kind=nag_wp), Intent (Out):: work(n,nrhs)
Complex (Kind=nag_rp), Intent (Out):: swork(n*(n+nrhs))
Character (1), Intent (In):: uplo
C Header Interface
#include <nagmk26.h>
void  f07fqf_ (const char *uplo, const Integer *n, const Integer *nrhs, Complex a[], const Integer *lda, const Complex b[], const Integer *ldb, Complex x[], const Integer *ldx, Complex work[], Complexf swork[], double rwork[], Integer *iter, Integer *info, const Charlen length_uplo)
The routine may be called by its LAPACK name zcposv.

3  Description

f07fqf (zcposv) first attempts to factorize the matrix in reduced precision and to use this factorization within an iterative refinement procedure to produce a solution with full precision normwise backward error quality (see below). If this approach fails, the method switches to a full precision factorization and solve.
The iterative refinement can be more efficient than the corresponding direct full precision algorithm. Since the strategy implemented by f07fqf (zcposv) must perform iterative refinement on each right-hand side, any efficiency gains will reduce as the number of right-hand sides increases. Conversely, as the matrix size increases, the cost of these iterative refinements becomes less significant relative to the cost of factorization. Thus, any efficiency gains will be greatest for a very small number of right-hand sides and for large matrix sizes. The cut-off values for the number of right-hand sides and matrix size, for which the iterative refinement strategy performs better, depend on the relative performance of the reduced and full precision factorization and back-substitution. f07fqf (zcposv) always attempts the iterative refinement strategy first; you are advised to compare the performance of f07fqf (zcposv) with that of its full precision counterpart f07fnf (zposv) to determine whether this strategy is worthwhile for your particular problem dimensions.
The iterative refinement process is stopped if iter > 30, where iter is the number of iterations carried out thus far. The process is also stopped if for all right-hand sides we have
$\|\mathit{resid}\| < \sqrt{n}\,\|x\|\,\|A\|\,\epsilon$,
where $\|\mathit{resid}\|$ is the $\infty$-norm of the residual, $\|x\|$ is the $\infty$-norm of the solution, $\|A\|$ is the $\infty$-norm of the matrix $A$ and $\epsilon$ is the machine precision returned by x02ajf.
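The strategy described above can be sketched in a few lines of standard-library Python. This is an illustration only, not the NAG or LAPACK implementation, and it rests on several assumptions: single precision is emulated by rounding stored values through `struct` (accumulations inside each entry still run in double precision), only one right-hand side is handled, the stopping test is taken to be the ∞-norm criterion with a factor of sqrt(n), `sys.float_info.epsilon` stands in for the value returned by x02ajf, and all function names (`to_single`, `cholesky_lower`, `solve_factored`, `mixed_precision_solve`) are hypothetical. The negative return codes mirror the values documented for the argument iter.

```python
import struct
import sys

def to_single(z):
    """Round a number to IEEE single precision (stand-in for kind nag_rp)."""
    re, im = struct.unpack("2f", struct.pack("2f", z.real, z.imag))
    return complex(re, im)

def identity(z):
    return z

def cholesky_lower(a, rnd=identity):
    """Return L with A = L L^H, using the lower triangle of a.
    Each stored entry is passed through rnd to emulate a working precision."""
    n = len(a)
    l = [[0j] * n for _ in range(n)]
    for j in range(n):
        d = a[j][j].real - sum(abs(l[j][k]) ** 2 for k in range(j))
        if d <= 0.0:
            raise ValueError("matrix is not positive definite")
        l[j][j] = rnd(complex(d ** 0.5))
        for i in range(j + 1, n):
            s = a[i][j] - sum(l[i][k] * l[j][k].conjugate() for k in range(j))
            l[i][j] = rnd(s / l[j][j])
    return l

def solve_factored(l, b, rnd=identity):
    """Solve L L^H x = b by forward and backward substitution."""
    n = len(l)
    y = [0j] * n
    for i in range(n):                      # forward: L y = b
        y[i] = rnd((b[i] - sum(l[i][k] * y[k] for k in range(i))) / l[i][i])
    x = [0j] * n
    for i in reversed(range(n)):            # backward: L^H x = y
        s = sum(l[k][i].conjugate() * x[k] for k in range(i + 1, n))
        x[i] = rnd((y[i] - s) / l[i][i])
    return x

def residual(a, x, b):
    """b - A x, evaluated in full (double) precision."""
    return [b[i] - sum(a[i][j] * x[j] for j in range(len(x)))
            for i in range(len(b))]

def mixed_precision_solve(a, b, max_iter=30):
    """Return (x, iter_code) for a full Hermitian positive definite matrix a."""
    n = len(a)
    eps = sys.float_info.epsilon            # assumption: stand-in for x02ajf
    a_norm = max(sum(abs(z) for z in row) for row in a)   # infinity-norm
    code = -31                              # used if max_iter is exhausted
    try:
        low = cholesky_lower([[to_single(z) for z in row] for row in a],
                             to_single)
        x = solve_factored(low, [to_single(z) for z in b], to_single)
        r = residual(a, x, b)
        for it in range(1, max_iter + 1):
            # Correction solved with the reduced-precision factor.
            d = solve_factored(low, [to_single(z) for z in r], to_single)
            x = [xi + di for xi, di in zip(x, d)]         # update in double
            r = residual(a, x, b)
            x_norm = max(abs(z) for z in x)
            if max(abs(z) for z in r) < n ** 0.5 * x_norm * a_norm * eps:
                return x, it
    except OverflowError:
        code = -2                           # narrowing overflowed
    except ValueError:
        code = -3                           # reduced-precision factorization failed
    # Fallback: full precision factorization and solve.
    return solve_factored(cholesky_lower(a), b), code

# Small demonstration; b was formed as A times (1, i).
a_demo = [[4.0 + 0j, 1.0 + 1.0j], [1.0 - 1.0j, 3.0 + 0j]]
b_demo = [3.0 + 1.0j, 1.0 + 2.0j]
print(mixed_precision_solve(a_demo, b_demo))
```

Each refinement step costs one residual and one pair of triangular solves per right-hand side, whereas the factorization is done once. That is why the benefit described above shrinks as the number of right-hand sides grows and improves as n grows.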

4  References

Anderson E, Bai Z, Bischof C, Blackford S, Demmel J, Dongarra J J, Du Croz J J, Greenbaum A, Hammarling S, McKenney A and Sorensen D (1999) LAPACK Users' Guide (3rd Edition) SIAM, Philadelphia http://www.netlib.org/lapack/lug
Golub G H and Van Loan C F (1996) Matrix Computations (3rd Edition) Johns Hopkins University Press, Baltimore
Higham N J (2002) Accuracy and Stability of Numerical Algorithms (2nd Edition) SIAM, Philadelphia

5  Arguments

1:     uplo – Character(1) Input
On entry: specifies whether the upper or lower triangular part of A is stored.
uplo='U'
The upper triangular part of A is stored.
uplo='L'
The lower triangular part of A is stored.
Constraint: uplo='U' or 'L'.
2:     n – Integer Input
On entry: n, the number of linear equations, i.e., the order of the matrix A.
Constraint: n ≥ 0.
3:     nrhs – Integer Input
On entry: r, the number of right-hand sides, i.e., the number of columns of the matrix B.
Constraint: nrhs ≥ 0.
4:     a(lda,*) – Complex (Kind=nag_wp) array Input/Output
Note: the second dimension of the array a must be at least max(1,n).
On entry: the n by n Hermitian positive definite matrix A.
  • If uplo='U', the upper triangular part of A must be stored and the elements of the array below the diagonal are not referenced.
  • If uplo='L', the lower triangular part of A must be stored and the elements of the array above the diagonal are not referenced.
On exit: if iterative refinement has been successfully used (info=0 and iter ≥ 0, see the description of iter below), then a is unchanged. If full precision factorization has been used (info=0 and iter<0, see the description of iter below), then the array a contains the factor U or L from the Cholesky factorization $A = U^{H}U$ or $A = LL^{H}$.
5:     lda – Integer Input
On entry: the first dimension of the array a as declared in the (sub)program from which f07fqf (zcposv) is called.
Constraint: lda ≥ max(1,n).
6:     b(ldb,*) – Complex (Kind=nag_wp) array Input
Note: the second dimension of the array b must be at least max(1,nrhs).
On entry: the right-hand side matrix B.
7:     ldb – Integer Input
On entry: the first dimension of the array b as declared in the (sub)program from which f07fqf (zcposv) is called.
Constraint: ldb ≥ max(1,n).
8:     x(ldx,*) – Complex (Kind=nag_wp) array Output
Note: the second dimension of the array x must be at least max(1,nrhs).
On exit: if info=0, the n by r solution matrix X.
9:     ldx – Integer Input
On entry: the first dimension of the array x as declared in the (sub)program from which f07fqf (zcposv) is called.
Constraint: ldx ≥ max(1,n).
10:   work(n,nrhs) – Complex (Kind=nag_wp) array Workspace
11:   swork(n×(n+nrhs)) – Complex (Kind=nag_rp) array Workspace
Note: this array is used in the reduced precision computation; its type nag_rp reflects this usage.
12:   rwork(n) – Real (Kind=nag_wp) array Workspace
13:   iter – Integer Output
On exit: information on the progress of the iterative refinement process.
iter<0
Iterative refinement has failed for one of the reasons given below; full precision factorization has been performed instead.
-1: The routine fell back to full precision for implementation- or machine-specific reasons.
-2: Narrowing the precision induced an overflow; the routine fell back to full precision.
-3: An intermediate reduced precision factorization failed.
-31: The maximum permitted number of iterations was exceeded.
iter>0
Iterative refinement has been successfully used. iter returns the number of iterations.
14:   info – Integer Output
On exit: info=0 unless the routine detects an error (see Section 6).
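As a reading aid for the two output arguments, the following minimal Python sketch maps a pair (iter, info) to the outcome documented here. The helper name `describe_outcome` is hypothetical and not part of any NAG interface; it only restates the argument descriptions and Section 6.

```python
def describe_outcome(iter_code, info):
    """Summarise the (iter, info) pair as documented for f07fqf (zcposv)."""
    if info < 0:
        return f"argument {-info} had an illegal value"
    if info > 0:
        return (f"leading minor of order {info} is not positive definite; "
                "no solution computed")
    reasons = {
        -1: "implementation- or machine-specific reasons",
        -2: "overflow when narrowing the precision",
        -3: "failure of an intermediate reduced precision factorization",
        -31: "maximum permitted number of iterations exceeded",
    }
    if iter_code < 0:
        # Any other negative value is not documented; report it as such.
        why = reasons.get(iter_code, "an undocumented reason")
        return (f"full precision factorization used ({why}); "
                "a holds the Cholesky factor")
    return (f"iterative refinement succeeded after {iter_code} iteration(s); "
            "a is unchanged")

print(describe_outcome(3, 0))
print(describe_outcome(-31, 0))
```

Checking info before iter matches the documented meaning: iter only describes which solution path was taken when info=0.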

6  Error Indicators and Warnings

info<0
If info=-i, argument i had an illegal value. An explanatory message is output, and execution of the program is terminated.
info > 0 and info ≤ n
If info = i, the leading minor of order i of A is not positive definite, so the factorization could not be completed, and the solution has not been computed.

7  Accuracy

For each right-hand side vector $b$, the computed solution $x$ is the exact solution of a perturbed system of equations $(A+E)x = b$, where
  • if uplo='U', $|E| \le c(n)\epsilon |U^{H}||U|$;
  • if uplo='L', $|E| \le c(n)\epsilon |L||L^{H}|$,
$c(n)$ is a modest linear function of $n$, and $\epsilon$ is the machine precision. See Section 10.1 of Higham (2002) for further details.
An approximate error bound for the computed solution is given by
$\dfrac{\|\hat{x} - x\|_1}{\|x\|_1} \le \kappa(A)\,\dfrac{\|E\|_1}{\|A\|_1}$
where $\kappa(A) = \|A^{-1}\|_1 \|A\|_1$, the condition number of $A$ with respect to the solution of the linear equations. See Section 4.4 of Anderson et al. (1999) for further details.
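To make the bound concrete, the sketch below evaluates the 1-norm condition number of a small Hermitian positive definite matrix and multiplies it by an assumed relative perturbation. The 2 by 2 matrix, the helper names `one_norm` and `inverse_2x2`, and the value 1e-15 for $\|E\|_1/\|A\|_1$ are illustrative assumptions, not data from this document.

```python
def one_norm(m):
    """Matrix 1-norm: the largest absolute column sum."""
    return max(sum(abs(row[j]) for row in m) for j in range(len(m[0])))

def inverse_2x2(m):
    """Explicit inverse of a 2 by 2 matrix (adequate for this tiny example)."""
    (p, q), (r, s) = m
    det = p * s - q * r
    return [[s / det, -q / det], [-r / det, p / det]]

a = [[4, 1 + 1j], [1 - 1j, 3]]          # Hermitian, positive definite
kappa = one_norm(a) * one_norm(inverse_2x2(a))

rel_perturbation = 1e-15                # assumed ||E||_1 / ||A||_1
bound = kappa * rel_perturbation        # approximate bound on the relative error
print(kappa, bound)
```

For this matrix the condition number works out to $(4+\sqrt{2})^2/10$, so the relative forward error is bounded by roughly three times the assumed relative perturbation.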

8  Parallelism and Performance

f07fqf (zcposv) is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
f07fqf (zcposv) makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the Users' Note for your implementation for any additional implementation-specific information.

9  Further Comments

The real analogue of this routine is f07fcf (dsposv).

10  Example

This example solves the equations
$AX = B$,
where $A$ is the Hermitian positive definite matrix
$A = \begin{pmatrix} 3.23 & 1.51-1.92i & 1.90+0.84i & 0.42+2.50i \\ 1.51+1.92i & 3.58 & -0.23+1.11i & -1.18+1.37i \\ 1.90-0.84i & -0.23-1.11i & 4.09 & 2.33-0.14i \\ 0.42-2.50i & -1.18-1.37i & 2.33+0.14i & 4.29 \end{pmatrix}$
and
$B = \begin{pmatrix} 3.93-6.14i \\ 6.17+9.42i \\ -7.17-21.83i \\ 1.99-14.38i \end{pmatrix}$.
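The example data can be sanity-checked without calling the routine. The short Python script below confirms that the matrix is Hermitian and that a candidate vector satisfies the system to rounding error. The candidate $x = (1-i,\ 3i,\ -4-5i,\ 2+i)^T$ is not quoted from the program results, which are not reproduced here; it is verified by the residual computed in the script.

```python
A = [
    [3.23, 1.51 - 1.92j, 1.90 + 0.84j, 0.42 + 2.50j],
    [1.51 + 1.92j, 3.58, -0.23 + 1.11j, -1.18 + 1.37j],
    [1.90 - 0.84j, -0.23 - 1.11j, 4.09, 2.33 - 0.14j],
    [0.42 - 2.50j, -1.18 - 1.37j, 2.33 + 0.14j, 4.29],
]
B = [3.93 - 6.14j, 6.17 + 9.42j, -7.17 - 21.83j, 1.99 - 14.38j]

# Candidate solution, checked below by its residual.
x = [1 - 1j, 3j, -4 - 5j, 2 + 1j]

n = len(A)
# Hermitian check: A[i][j] must equal the conjugate of A[j][i].
is_hermitian = all(A[i][j] == A[j][i].conjugate()
                   for i in range(n) for j in range(n))

# Residual B - A x; it should vanish up to rounding error.
resid = [B[i] - sum(A[i][j] * x[j] for j in range(n)) for i in range(n)]
max_resid = max(abs(r) for r in resid)

print(is_hermitian, max_resid)
```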

10.1  Program Text

Program Text (f07fqfe.f90)

10.2  Program Data

Program Data (f07fqfe.d)

10.3  Program Results

Program Results (f07fqfe.r)