NAG CL Interface
f04cfc (complex_​posdef_​band_​solve)

Settings help

CL Name Style:


1 Purpose

f04cfc computes the solution to a complex system of linear equations AX=B, where A is an n×n Hermitian positive definite band matrix of band width 2k+1, and X and B are n×r matrices. An estimate of the condition number of A and an error bound for the computed solution are also returned.

2 Specification

#include <nag.h>
void  f04cfc (Nag_OrderType order, Nag_UploType uplo, Integer n, Integer kd, Integer nrhs, Complex ab[], Integer pdab, Complex b[], Integer pdb, double *rcond, double *errbnd, NagError *fail)
The function may be called by the names: f04cfc, nag_linsys_complex_posdef_band_solve or nag_herm_posdef_band_lin_solve.

3 Description

The Cholesky factorization is used to factor A as A=UHU, if uplo=Nag_Upper, or A=LLH, if uplo=Nag_Lower, where U is an upper triangular band matrix with k superdiagonals, and L is a lower triangular band matrix with k subdiagonals. The factored form of A is then used to solve the system of equations AX=B.

4 References

Anderson E, Bai Z, Bischof C, Blackford S, Demmel J, Dongarra J J, Du Croz J J, Greenbaum A, Hammarling S, McKenney A and Sorensen D (1999) LAPACK Users' Guide (3rd Edition) SIAM, Philadelphia https://www.netlib.org/lapack/lug
Higham N J (2002) Accuracy and Stability of Numerical Algorithms (2nd Edition) SIAM, Philadelphia

5 Arguments

1: order Nag_OrderType Input
On entry: the order argument specifies the two-dimensional storage scheme being used, i.e., row-major ordering or column-major ordering. C language defined storage is specified by order=Nag_RowMajor. See Section 3.1.3 in the Introduction to the NAG Library CL Interface for a more detailed explanation of the use of this argument.
Constraint: order=Nag_RowMajor or Nag_ColMajor.
2: uplo Nag_UploType Input
On entry: if uplo=Nag_Upper, the upper triangle of the matrix A is stored.
If uplo=Nag_Lower, the lower triangle of the matrix A is stored.
Constraint: uplo=Nag_Upper or Nag_Lower.
3: n Integer Input
On entry: the number of linear equations n, i.e., the order of the matrix A.
Constraint: n0.
4: kd Integer Input
On entry: the number of superdiagonals k (and the number of subdiagonals) of the band matrix A.
Constraint: kd0.
5: nrhs Integer Input
On entry: the number of right-hand sides r, i.e., the number of columns of the matrix B.
Constraint: nrhs0.
6: ab[dim] Complex Input/Output
Note: the dimension, dim, of the array ab must be at least max(1,pdab×n).
On entry:
  • if uplo=Nag_Upper then
    • if order=Nag_ColMajor, aij is stored in ab[(j-1)×pdab+kd+i-j];
    • if order=Nag_RowMajor, aij is stored in ab[(i-1)×pdab+j-i],
    for max(1,j-kd)ij;
  • if uplo=Nag_Lower then
    • if order=Nag_ColMajor, aij is stored in ab[(j-1)×pdab+i-j];
    • if order=Nag_RowMajor, aij is stored in ab[(i-1)×pdab+kd+j-i],
    for jimin(n,j+kd),
where pdabkd+1 is the stride separating diagonal matrix elements in the array ab.
See Section 9 below for further details.
On exit: if fail.code= NE_NOERROR or NE_RCOND, the factor U or L from the Cholesky factorization A=UHU or A=LLH, in the same storage format as A.
7: pdab Integer Input
On entry: the stride separating row or column elements (depending on the value of order) of the matrix A in the array ab.
Constraint: pdabkd+1.
8: b[dim] Complex Input/Output
Note: the dimension, dim, of the array b must be at least
  • max(1,pdb×nrhs) when order=Nag_ColMajor;
  • max(1,n×pdb) when order=Nag_RowMajor.
The (i,j)th element of the matrix B is stored in
  • b[(j-1)×pdb+i-1] when order=Nag_ColMajor;
  • b[(i-1)×pdb+j-1] when order=Nag_RowMajor.
On entry: the n×r matrix of right-hand sides B.
On exit: if fail.code= NE_NOERROR or NE_RCOND, the n×r solution matrix X.
9: pdb Integer Input
On entry: the stride separating row or column elements (depending on the value of order) in the array b.
Constraints:
  • if order=Nag_ColMajor, pdbmax(1,n);
  • if order=Nag_RowMajor, pdbmax(1,nrhs).
10: rcond double * Output
On exit: if fail.code= NE_NOERROR or NE_RCOND, an estimate of the reciprocal of the condition number of the matrix A, computed as rcond=1/(A1A-11).
11: errbnd double * Output
On exit: if fail.code= NE_NOERROR or NE_RCOND, an estimate of the forward error bound for a computed solution x^, such that x^-x1/x1errbnd, where x^ is a column of the computed solution returned in the array b and x is the corresponding column of the exact solution X. If rcond is less than machine precision, errbnd is returned as unity.
12: fail NagError * Input/Output
The NAG error argument (see Section 7 in the Introduction to the NAG Library CL Interface).

6 Error Indicators and Warnings

NE_ALLOC_FAIL
Dynamic memory allocation failed.
See Section 3.1.2 in the Introduction to the NAG Library CL Interface for further information.
NE_BAD_PARAM
On entry, argument value had an illegal value.
NE_INT
On entry, kd=value.
Constraint: kd0.
On entry, n=value.
Constraint: n0.
On entry, nrhs=value.
Constraint: nrhs0.
On entry, pdab=value.
Constraint: pdab>0.
On entry, pdb=value.
Constraint: pdb>0.
NE_INT_2
On entry, pdab =value and kd =value.
Constraint: pdabkd+1.
On entry, pdb=value and n=value.
Constraint: pdbmax(1,n).
On entry, pdb=value and nrhs=value.
Constraint: pdbmax(1,nrhs).
NE_INTERNAL_ERROR
An internal error has occurred in this function. Check the function call and any array sizes. If the call is correct then please contact NAG for assistance.
See Section 7.5 in the Introduction to the NAG Library CL Interface for further information.
NE_NO_LICENCE
Your licence key may have expired or may not have been installed correctly.
See Section 8 in the Introduction to the NAG Library CL Interface for further information.
NE_POS_DEF
The principal minor of order value of the matrix A is not positive definite. The factorization has not been completed and the solution could not be computed.
NE_RCOND
A solution has been computed, but rcond is less than machine precision so that the matrix A is numerically singular.

7 Accuracy

The computed solution for a single right-hand side, x^, satisfies an equation of the form
(A+E) x^=b,  
where
E1=O(ε) A1  
and ε is the machine precision. An approximate error bound for the computed solution is given by
x^-x1 x1 κ(A) E1 A1 ,  
where κ(A)=A-11A1, the condition number of A with respect to the solution of the linear equations. f04cfc uses the approximation E1=εA1 to estimate errbnd. See Section 4.4 of Anderson et al. (1999) for further details.

8 Parallelism and Performance

Background information to multithreading can be found in the Multithreading documentation.
f04cfc is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
f04cfc makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this function. Please also consult the Users' Note for your implementation for any additional implementation-specific information.

9 Further Comments

The band storage schemes for the array ab are identical to the storage schemes for symmetric and Hermitian band matrices in Chapter F07. See Section 3.4.4 in the F07 Chapter Introduction for details of the storage schemes and an illustrated example.
If uplo=Nag_Upper then the elements of the stored upper triangular part of A are overwritten by the corresponding elements of the upper triangular matrix U. Similarly, if uplo=Nag_Lower then the elements of the stored lower triangular part of A are overwritten by the corresponding elements of the lower triangular matrix L.
Assuming that nk, the total number of floating-point operations required to solve the equations AX=B is approximately n(k+1)2 for the factorization and 4nkr for the solution following the factorization. The condition number estimation typically requires between four and five solves and never more than eleven solves, following the factorization.
In practice the condition number estimator is very reliable, but it can underestimate the true condition number; see Section 15.3 of Higham (2002) for further details.
The real analogue of f04cfc is f04bfc.

10 Example

This example solves the equations
AX=B,  
where A is the Hermitian positive definite band matrix
A= ( 9.39i+0.00 1.08-1.73i 0.00i+0.00 0.00i+0.00 1.08+1.73i 1.69i+0.00 -0.04+0.29i 0.00i+0.00 0.00i+0.00 -0.04-0.29i 2.65i+0.00 -0.33+2.24i 0.00i+0.00 0.00i+0.00 -0.33-2.24i 2.17i+0.00 )  
and
B= ( -12.42+68.42i 54.30-56.56i -9.93+00.88i 18.32+04.76i -27.30-00.01i -4.40+09.97i 5.31+23.63i 9.43+01.41i ) .  
An estimate of the condition number of A and an approximate error bound for the computed solutions are also printed.

10.1 Program Text

Program Text (f04cfce.c)

10.2 Program Data

Program Data (f04cfce.d)

10.3 Program Results

Program Results (f04cfce.r)