NAG CL Interface
f04bbc (real_band_solve)
1
Purpose
f04bbc computes the solution to a real system of linear equations , where is an by band matrix, with subdiagonals and superdiagonals, and and are by matrices. An estimate of the condition number of and an error bound for the computed solution are also returned.
2
Specification
void |
f04bbc (Nag_OrderType order,
Integer n,
Integer kl,
Integer ku,
Integer nrhs,
double ab[],
Integer pdab,
Integer ipiv[],
double b[],
Integer pdb,
double *rcond,
double *errbnd,
NagError *fail) |
|
The function may be called by the names: f04bbc, nag_linsys_real_band_solve or nag_real_band_lin_solve.
3
Description
The decomposition with partial pivoting and row interchanges is used to factor as , where is a permutation matrix, is the product of permutation matrices and unit lower triangular matrices with subdiagonals, and is upper triangular with superdiagonals. The factored form of is then used to solve the system of equations .
4
References
Anderson E, Bai Z, Bischof C, Blackford S, Demmel J, Dongarra J J, Du Croz J J, Greenbaum A, Hammarling S, McKenney A and Sorensen D (1999)
LAPACK Users' Guide (3rd Edition) SIAM, Philadelphia
https://www.netlib.org/lapack/lug
Higham N J (2002) Accuracy and Stability of Numerical Algorithms (2nd Edition) SIAM, Philadelphia
5
Arguments
-
1:
– Nag_OrderType
Input
-
On entry: the
order argument specifies the two-dimensional storage scheme being used, i.e., row-major ordering or column-major ordering. C language defined storage is specified by
. See
Section 3.1.3 in the Introduction to the NAG Library CL Interface for a more detailed explanation of the use of this argument.
Constraint:
or .
-
2:
– Integer
Input
-
On entry: the number of linear equations , i.e., the order of the matrix .
Constraint:
.
-
3:
– Integer
Input
-
On entry: the number of subdiagonals , within the band of .
Constraint:
.
-
4:
– Integer
Input
-
On entry: the number of superdiagonals , within the band of .
Constraint:
.
-
5:
– Integer
Input
-
On entry: the number of right-hand sides , i.e., the number of columns of the matrix .
Constraint:
.
-
6:
– double
Input/Output
-
Note: the dimension,
dim, of the array
ab
must be at least
.
On entry: the
by
matrix
.
This is stored as a notional two-dimensional array with row elements or column elements stored contiguously. The storage of elements
, for row
and column
, depends on the
order argument as follows:
- if , is stored as ;
- if , is stored as .
See
Section 9 for further details.
On exit:
ab is overwritten by details of the factorization.
The elements, , of the upper triangular band factor with super-diagonals, and the multipliers, , used to form the lower triangular factor are stored. The elements , for and , and , for and , are stored where is stored on entry.
-
7:
– Integer
Input
On entry: the stride separating row or column elements (depending on the value of
order) of the matrix
in the array
ab.
Constraint:
.
-
8:
– Integer
Output
-
On exit: if NE_NOERROR, the pivot indices that define the permutation matrix ; at the th step row of the matrix was interchanged with row . indicates a row interchange was not required.
-
9:
– double
Input/Output
-
Note: the dimension,
dim, of the array
b
must be at least
- when
;
- when
.
The
th element of the matrix
is stored in
- when ;
- when .
On entry: the by matrix of right-hand sides .
On exit: if
NE_NOERROR or
NE_RCOND, the
by
solution matrix
.
-
10:
– Integer
Input
-
On entry: the stride separating row or column elements (depending on the value of
order) in the array
b.
Constraints:
- if ,
;
- if , .
-
11:
– double *
Output
-
On exit: if no constraints are violated, an estimate of the reciprocal of the condition number of the matrix , computed as .
-
12:
– double *
Output
-
On exit: if
NE_NOERROR or
NE_RCOND, an estimate of the forward error bound for a computed solution
, such that
, where
is a column of the computed solution returned in the array
b and
is the corresponding column of the exact solution
. If
rcond is less than
machine precision,
errbnd is returned as unity.
-
13:
– NagError *
Input/Output
-
The NAG error argument (see
Section 7 in the Introduction to the NAG Library CL Interface).
6
Error Indicators and Warnings
- NE_ALLOC_FAIL
-
Dynamic memory allocation failed.
The Integer allocatable memory required is
n, and the double allocatable memory required is
. In this case the factorization and the solution
have been computed, but
rcond and
errbnd have not been computed.
See
Section 3.1.2 in the Introduction to the NAG Library CL Interface for further information.
- NE_BAD_PARAM
-
On entry, argument had an illegal value.
- NE_INT
-
On entry, .
Constraint: .
On entry, .
Constraint: .
On entry, .
Constraint: .
On entry, .
Constraint: .
On entry, .
Constraint: .
On entry, .
Constraint: .
- NE_INT_2
-
On entry, and .
Constraint: .
On entry, and .
Constraint: .
- NE_INT_3
-
On entry, , and .
Constraint: .
- NE_INTERNAL_ERROR
-
An internal error has occurred in this function. Check the function call and any array sizes. If the call is correct then please contact
NAG for assistance.
See
Section 7.5 in the Introduction to the NAG Library CL Interface for further information.
- NE_NO_LICENCE
-
Your licence key may have expired or may not have been installed correctly.
See
Section 8 in the Introduction to the NAG Library CL Interface for further information.
- NE_RCOND
-
A solution has been computed, but
rcond is less than
machine precision so that the matrix
is numerically singular.
- NE_SINGULAR
-
Diagonal element of the upper triangular factor is zero. The factorization has been completed, but the solution could not be computed.
7
Accuracy
The computed solution for a single right-hand side,
, satisfies an equation of the form
where
and
is the
machine precision. An approximate error bound for the computed solution is given by
where
, the condition number of
with respect to the solution of the linear equations.
f04bbc uses the approximation
to estimate
errbnd. See Section 4.4 of
Anderson et al. (1999)
for further details.
8
Parallelism and Performance
f04bbc is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
f04bbc makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the
X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this function. Please also consult the
Users' Note for your implementation for any additional implementation-specific information.
The band storage scheme for the array
ab stored in Nag_ColMajor is illustrated by the following example, when
,
, and
.
Storage of the band matrix
in the array
ab:
Band matrix |
Band storage in array ab |
|
|
|
|
|
Array elements marked
need not be set and are not referenced by the function. Array elements marked + need not be set, but are defined on exit from the function and contain the elements
,
,
,
and
. In this example when
the first referenced element of
ab is
; while for
the first referenced element is
.
In general, elements
are stored as follows:
- if , are stored in
- if , are stored in
where
.
The total number of floating-point operations required to solve the equations depends upon the pivoting required, but if then it is approximately bounded by for the factorization and for the solution following the factorization. The condition number estimation typically requires between four and five solves and never more than eleven solves, following the factorization.
In practice the condition number estimator is very reliable, but it can underestimate the true condition number; see Section 15.3 of
Higham (2002) for further details.
The complex analogue of
f04bbc is
f04cbc.
10
Example
This example solves the equations
where
is the band matrix
An estimate of the condition number of and an approximate error bound for the computed solutions are also printed.
10.1
Program Text
10.2
Program Data
10.3
Program Results