matrices, using the modified Cholesky factorization returned by f07jrc and an initial solution returned by f07jsc. Iterative refinement is used to reduce the backward error as much as possible.

2 Specification

#include <nag.h>

void	f07jvc (Nag_OrderType order, Nag_UploType uplo, Integer n, Integer nrhs, const double d[], const Complex e[], const double df[], const Complex ef[], const Complex b[], Integer pdb, Complex x[], Integer pdx, double ferr[], double berr[], NagError *fail)

The function may be called by the names: f07jvc, nag_lapacklin_zptrfs or nag_zptrfs.

3 Description

f07jvc should normally be preceded by calls to f07jrc and f07jsc. f07jrc computes a modified Cholesky factorization of the matrix $A$ as

$A = L D L^{H}$,

where $L$ is a unit lower bidiagonal matrix and $D$ is a diagonal matrix with positive diagonal elements. f07jsc then utilizes the factorization to compute a solution, $\hat{X}$, to the required equations. Letting $\hat{x}$ denote a column of $\hat{X}$, f07jvc computes a component-wise backward error, $β$, the smallest relative perturbation in each element of $A$ and $b$ such that $\hat{x}$ is the exact solution of a perturbed system

$(A + E) \hat{x} = b + f$, with $| e_{i j} | \leq β | a_{i j} |$ and $| f_{j} | \leq β | b_{j} |$.
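The backward error defined above has a standard closed form, $β = \max_{i} | r_{i} | / (| A | | \hat{x} | + | b |)_{i}$ with $r = b - A \hat{x}$. The following is a minimal sketch of that formula for one right-hand side, not the library's implementation. It assumes the $uplo = Nag_Lower$ layout (subdiagonal in e), uses C99 `double complex` in place of the NAG Complex type, and replaces the modulus by $| Re z | + | Im z |$ as a cheap surrogate. The names `abs1` and `backward_error` are hypothetical.

```c
#include <assert.h>
#include <complex.h>
#include <stddef.h>

/* |Re z| + |Im z|: cheap stand-in for the modulus |z| (assumption of this sketch). */
static double abs1(double complex z)
{
    double re = creal(z), im = cimag(z);
    return (re < 0 ? -re : re) + (im < 0 ? -im : im);
}

/* Component-wise backward error of xhat for A x = b, where A is Hermitian
 * tridiagonal with real diagonal d[0..n-1] and subdiagonal e[0..n-2].
 * Hypothetical helper, not a NAG Library function. Returns 0 when n == 0. */
double backward_error(int n, const double d[], const double complex e[],
                      const double complex b[], const double complex xhat[])
{
    assert(n >= 0);
    double beta = 0.0;
    for (int i = 0; i < n; ++i) {
        /* Row i of the residual r = b - A*xhat and of |A||xhat| + |b|. */
        double complex r = b[i] - d[i] * xhat[i];
        double denom = abs1(b[i]) + abs1(d[i]) * abs1(xhat[i]);
        if (i > 0) {                 /* a(i, i-1) = e[i-1] */
            r -= e[i - 1] * xhat[i - 1];
            denom += abs1(e[i - 1]) * abs1(xhat[i - 1]);
        }
        if (i + 1 < n) {             /* a(i, i+1) = conj(e[i]) since A is Hermitian */
            r -= conj(e[i]) * xhat[i + 1];
            denom += abs1(e[i]) * abs1(xhat[i + 1]);
        }
        if (denom > 0.0) {
            double ratio = abs1(r) / denom;
            if (ratio > beta) beta = ratio;
        }
    }
    return beta;
}
```

For the first column of the system in Section 10, the vector $(2 + i, 1 + i, 1 - 2 i, 1 - i)^{T}$ reproduces $b$ exactly, so this function returns zero for it. Any perturbation of that vector gives a positive value.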

The function also estimates a bound for the component-wise forward error in the computed solution, defined by $\max_{i} | x_{i} - \hat{x}_{i} | / \max_{i} | \hat{x}_{i} |$, where $x$ is the corresponding column of the exact solution, $X$.

Note that the modified Cholesky factorization of $A$ can also be expressed as

$A = U^{H} D U$,

where $U$ is unit upper bidiagonal.
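The two forms are related by $U = L^{H}$, so the superdiagonal of $U$ is the complex conjugate of the subdiagonal of $L$. Because $A$ is Hermitian, the same holds for its own off-diagonals. As a small illustration, the sketch below converts an off-diagonal array from one form to the other. The function name `offdiag_swap_uplo` is hypothetical, and C99 `double complex` stands in for the NAG Complex type.

```c
#include <assert.h>
#include <complex.h>

/* Convert the n-1 off-diagonal entries between the lower and upper forms.
 * Applies equally to e (A is Hermitian) and to ef (U = L^H).
 * Hypothetical helper, not part of the NAG Library. */
void offdiag_swap_uplo(int n, const double complex in[], double complex out[])
{
    assert(n >= 0);
    for (int i = 0; i + 1 < n; ++i)
        out[i] = conj(in[i]);   /* u(i, i+1) = conj(l(i+1, i)) */
}
```

Applying the function twice returns the original array, which is a quick sanity check when switching between $uplo = Nag_Lower$ and $uplo = Nag_Upper$ data.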

4 References

Anderson E, Bai Z, Bischof C, Blackford S, Demmel J, Dongarra J J, Du Croz J J, Greenbaum A, Hammarling S, McKenney A and Sorensen D (1999) LAPACK Users' Guide (3rd Edition) SIAM, Philadelphia https://www.netlib.org/lapack/lug

5 Arguments

1: $order$ – Nag_OrderType Input

On entry: the order argument specifies the two-dimensional storage scheme being used, i.e., row-major ordering or column-major ordering. C language defined storage is specified by $order = Nag_RowMajor$. See Section 3.1.3 in the Introduction to the NAG Library CL Interface for a more detailed explanation of the use of this argument.

Constraint: $order = Nag_RowMajor$ or $Nag_ColMajor$.

2: $uplo$ – Nag_UploType Input

On entry: specifies the form of the factorization as follows:

$uplo = Nag_Upper$: $A = U^{H} D U$ .
$uplo = Nag_Lower$: $A = L D L^{H}$ .

Constraint: $uplo = Nag_Upper$ or $Nag_Lower$.

3: $n$ – Integer Input

On entry: $n$, the order of the matrix $A$.

Constraint: $n \geq 0$.

4: $nrhs$ – Integer Input

On entry: $r$, the number of right-hand sides, i.e., the number of columns of the matrix $B$.

Constraint: $nrhs \geq 0$.

5: $d [\dim]$ – const double Input

Note: the dimension, dim, of the array d must be at least $\max (1, n)$.

On entry: must contain the $n$ diagonal elements of the matrix $A$.

6: $e [\dim]$ – const Complex Input

Note: the dimension, dim, of the array e must be at least $\max (1, n - 1)$.

On entry: if $uplo = Nag_Upper$, e must contain the $(n - 1)$ superdiagonal elements of the matrix $A$.

If $uplo = Nag_Lower$, e must contain the $(n - 1)$ subdiagonal elements of the matrix $A$.

7: $df [\dim]$ – const double Input

Note: the dimension, dim, of the array df must be at least $\max (1, n)$.

On entry: must contain the $n$ diagonal elements of the diagonal matrix $D$ from the $L D L^{H}$ factorization of $A$.

8: $ef [\dim]$ – const Complex Input

Note: the dimension, dim, of the array ef must be at least $\max (1, n - 1)$.

On entry: if $uplo = Nag_Upper$, ef must contain the $(n - 1)$ superdiagonal elements of the unit upper bidiagonal matrix $U$ from the $U^{H} D U$ factorization of $A$.

If $uplo = Nag_Lower$, ef must contain the $(n - 1)$ subdiagonal elements of the unit lower bidiagonal matrix $L$ from the $L D L^{H}$ factorization of $A$.

9: $b [\dim]$ – const Complex Input

Note: the dimension, dim, of the array b must be at least

$\max (1, pdb \times nrhs)$ when $order = Nag_ColMajor$ ;
$\max (1, n \times pdb)$ when $order = Nag_RowMajor$ .

The $(i, j)$th element of the matrix $B$ is stored in

$b [(j - 1) \times pdb + i - 1]$ when $order = Nag_ColMajor$ ;
$b [(i - 1) \times pdb + j - 1]$ when $order = Nag_RowMajor$ .

On entry: the $n \times r$ matrix of right-hand sides $B$.
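The storage rules for b (which x shares, with pdx in place of pdb) can be written as a pair of small helpers. This is a sketch only: `order_t`, `elem_offset` and `min_dim` are hypothetical names, and the enum stands in for Nag_OrderType so that the code compiles without nag.h.

```c
#include <assert.h>
#include <stddef.h>

/* Stand-in for Nag_OrderType (assumption: lets the sketch compile without nag.h). */
typedef enum { ORDER_ROW_MAJOR, ORDER_COL_MAJOR } order_t;

/* Offset of the (i, j)th element (1-based, as in the text) for stride pd. */
size_t elem_offset(order_t order, int i, int j, int pd)
{
    assert(i >= 1 && j >= 1 && pd >= 1);
    return order == ORDER_COL_MAJOR
        ? (size_t)(j - 1) * (size_t)pd + (size_t)(i - 1)
        : (size_t)(i - 1) * (size_t)pd + (size_t)(j - 1);
}

/* Minimum array length documented above: max(1, pd*nrhs) or max(1, n*pd). */
size_t min_dim(order_t order, int n, int nrhs, int pd)
{
    assert(n >= 0 && nrhs >= 0 && pd >= 1);
    size_t len = order == ORDER_COL_MAJOR
        ? (size_t)pd * (size_t)nrhs
        : (size_t)n * (size_t)pd;
    return len > 1 ? len : 1;
}
```

With $n = 4$ and $nrhs = 2$, the smallest valid strides are 4 in column-major order and 2 in row-major order. Both give a minimum array length of 8.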

10: $pdb$ – Integer Input

On entry: the stride separating row or column elements (depending on the value of order) in the array b.

Constraints:

if $order = Nag_ColMajor$ , $pdb \geq \max (1, n)$ ;
if $order = Nag_RowMajor$ , $pdb \geq \max (1, nrhs)$ .

11: $x [\dim]$ – Complex Input/Output

Note: the dimension, dim, of the array x must be at least

$\max (1, pdx \times nrhs)$ when $order = Nag_ColMajor$ ;
$\max (1, n \times pdx)$ when $order = Nag_RowMajor$ .

The $(i, j)$th element of the matrix $X$ is stored in

$x [(j - 1) \times pdx + i - 1]$ when $order = Nag_ColMajor$ ;
$x [(i - 1) \times pdx + j - 1]$ when $order = Nag_RowMajor$ .

On entry: the $n \times r$ initial solution matrix $X$.

On exit: the $n \times r$ refined solution matrix $X$.

12: $pdx$ – Integer Input

On entry: the stride separating row or column elements (depending on the value of order) in the array x.

Constraints:

if $order = Nag_ColMajor$ , $pdx \geq \max (1, n)$ ;
if $order = Nag_RowMajor$ , $pdx \geq \max (1, nrhs)$ .

13: $ferr [nrhs]$ – double Output

On exit: estimate of the forward error bound for each computed solution vector, such that ${‖ {\hat{x}}_{j} - x_{j} ‖}_{\infty} / {‖ {\hat{x}}_{j} ‖}_{\infty} \leq ferr [j - 1]$, where ${\hat{x}}_{j}$ is the $j$th column of the computed solution returned in the array x and $x_{j}$ is the corresponding column of the exact solution $X$. The estimate is almost always a slight overestimate of the true error.

14: $berr [nrhs]$ – double Output

On exit: estimate of the component-wise relative backward error of each computed solution vector ${\hat{x}}_{j}$ (i.e., the smallest relative change in any element of $A$ or $B$ that makes ${\hat{x}}_{j}$ an exact solution).

15: $fail$ – NagError * Input/Output

The NAG error argument (see Section 7 in the Introduction to the NAG Library CL Interface).

6 Error Indicators and Warnings

NE_ALLOC_FAIL: Dynamic memory allocation failed.
See Section 3.1.2 in the Introduction to the NAG Library CL Interface for further information.
NE_BAD_PARAM: On entry, argument $⟨ value ⟩$ had an illegal value.
NE_INT: On entry, $n = ⟨ value ⟩$ .
Constraint: $n \geq 0$ .

On entry, $nrhs = ⟨ value ⟩$ .
Constraint: $nrhs \geq 0$ .

On entry, $pdb = ⟨ value ⟩$ .
Constraint: $pdb > 0$ .

On entry, $pdx = ⟨ value ⟩$ .
Constraint: $pdx > 0$ .
NE_INT_2: On entry, $pdb = ⟨ value ⟩$ and $n = ⟨ value ⟩$ .
Constraint: $pdb \geq \max (1, n)$ .

On entry, $pdb = ⟨ value ⟩$ and $nrhs = ⟨ value ⟩$ .
Constraint: $pdb \geq \max (1, nrhs)$ .

On entry, $pdx = ⟨ value ⟩$ and $n = ⟨ value ⟩$ .
Constraint: $pdx \geq \max (1, n)$ .

On entry, $pdx = ⟨ value ⟩$ and $nrhs = ⟨ value ⟩$ .
Constraint: $pdx \geq \max (1, nrhs)$ .
NE_INTERNAL_ERROR: An internal error has occurred in this function. Check the function call and any array sizes. If the call is correct then please contact NAG for assistance.
See Section 7.5 in the Introduction to the NAG Library CL Interface for further information.
NE_NO_LICENCE: Your licence key may have expired or may not have been installed correctly.
See Section 8 in the Introduction to the NAG Library CL Interface for further information.
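The integer constraints behind NE_INT and NE_INT_2 can be checked before the call. The sketch below mirrors the constraints listed above. The type and function names (`order_t`, `check_t`, `check_dims`) are hypothetical, and the order in which the checks are made is an assumption; the library may report errors in a different order.

```c
#include <assert.h>

/* Stand-ins for Nag_OrderType and the error codes (hypothetical names). */
typedef enum { ORDER_ROW_MAJOR, ORDER_COL_MAJOR } order_t;
typedef enum { ARGS_OK, ERR_INT, ERR_INT_2 } check_t;

static int max_int(int a, int b) { return a > b ? a : b; }

/* Pre-check of the documented constraints on n, nrhs, pdb and pdx. */
check_t check_dims(order_t order, int n, int nrhs, int pdb, int pdx)
{
    assert(order == ORDER_ROW_MAJOR || order == ORDER_COL_MAJOR);

    /* Single-argument constraints: n >= 0, nrhs >= 0, pdb > 0, pdx > 0. */
    if (n < 0 || nrhs < 0 || pdb <= 0 || pdx <= 0)
        return ERR_INT;

    /* Two-argument constraints: the stride must cover n (column-major)
     * or nrhs (row-major), and be at least 1. */
    int need = max_int(1, order == ORDER_COL_MAJOR ? n : nrhs);
    if (pdb < need || pdx < need)
        return ERR_INT_2;

    return ARGS_OK;
}
```

A check of this kind catches a stride that is too small for the chosen storage order before any array is accessed.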

7 Accuracy

The computed solution for a single right-hand side, $\hat{x}$, satisfies an equation of the form

$(A + E) \hat{x} = b$,

where

${‖ E ‖}_{\infty} = O (ε) {‖ A ‖}_{\infty}$

and $ε$ is the machine precision. An approximate error bound for the computed solution is given by

$\frac{{‖ \hat{x} - x ‖}_{\infty}}{{‖ x ‖}_{\infty}} \leq κ (A) \frac{{‖ E ‖}_{\infty}}{{‖ A ‖}_{\infty}}$,

where $κ (A) = {‖ A^{- 1} ‖}_{\infty} {‖ A ‖}_{\infty}$, the condition number of $A$ with respect to the solution of the linear equations. See Section 4.4 of Anderson et al. (1999) for further details.

Function f07juc can be used to compute the condition number of $A$.

8 Parallelism and Performance

Background information to multithreading can be found in the Multithreading documentation.

f07jvc is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.

f07jvc makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.

Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this function. Please also consult the Users' Note for your implementation for any additional implementation-specific information.

9 Further Comments

The total number of floating-point operations required to solve the equations $A X = B$ is proportional to $n r$. At most five steps of iterative refinement are performed, but usually only one or two steps are required.
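To illustrate why the cost is linear in $n$ for each right-hand side, the sketch below factorizes a Hermitian positive definite tridiagonal matrix as $L D L^{H}$, solves with the factors in three passes of length $n$, and applies a capped number of refinement steps. It is a simplified illustration, not the algorithm used by f07jrc, f07jsc or f07jvc. All function names are hypothetical, C99 `double complex` stands in for the NAG Complex type, the subdiagonal ($uplo = Nag_Lower$) layout is assumed, and the stopping rule is a simplification.

```c
#include <assert.h>
#include <complex.h>

/* |Re z| + |Im z|: cheap stand-in for the modulus (assumption of this sketch). */
static double abs1(double complex z)
{
    double re = creal(z), im = cimag(z);
    return (re < 0 ? -re : re) + (im < 0 ? -im : im);
}

/* Factor A = L D L^H in place: on exit d holds D and e the subdiagonal of L.
 * Returns 0 on success, or i (1-based) if pivot i is not positive. */
int ldl_factor(int n, double d[], double complex e[])
{
    assert(n >= 0);
    for (int i = 0; i < n; ++i) {
        if (d[i] <= 0.0) return i + 1;
        if (i + 1 < n) {
            double complex a = e[i];            /* a(i+1, i) of the original A */
            e[i] = a / d[i];                    /* l(i+1, i) */
            d[i + 1] -= creal(e[i] * conj(a));  /* subtract |a|^2 / d(i) */
        }
    }
    return 0;
}

/* Solve L D L^H x = b in place (x holds b on entry): three O(n) passes. */
void ldl_solve(int n, const double df[], const double complex ef[],
               double complex x[])
{
    for (int i = 1; i < n; ++i)          /* forward: L y = b */
        x[i] -= ef[i - 1] * x[i - 1];
    for (int i = 0; i < n; ++i)          /* diagonal: D z = y */
        x[i] /= df[i];
    for (int i = n - 2; i >= 0; --i)     /* backward: L^H x = z */
        x[i] -= conj(ef[i]) * x[i + 1];
}

/* Up to max_steps refinement steps for one right-hand side; work has length n.
 * Simplified stopping rule (assumption): stop when the residual is zero or
 * fails to halve. Returns the number of corrections applied. */
int refine(int n, const double d[], const double complex e[],
           const double df[], const double complex ef[],
           const double complex b[], double complex x[],
           double complex work[], int max_steps)
{
    double prev = 0.0;
    int steps = 0;
    while (steps < max_steps) {
        double rnorm = 0.0;
        for (int i = 0; i < n; ++i) {    /* r = b - A x, A Hermitian tridiagonal */
            double complex r = b[i] - d[i] * x[i];
            if (i > 0) r -= e[i - 1] * x[i - 1];
            if (i + 1 < n) r -= conj(e[i]) * x[i + 1];
            work[i] = r;
            if (abs1(r) > rnorm) rnorm = abs1(r);
        }
        if (rnorm == 0.0 || (steps > 0 && rnorm > 0.5 * prev)) break;
        ldl_solve(n, df, ef, work);      /* correction: A dx = r */
        for (int i = 0; i < n; ++i) x[i] += work[i];
        prev = rnorm;
        ++steps;
    }
    return steps;
}
```

Each call to `ldl_solve` and each residual evaluation touches every element a fixed number of times, so one refinement step costs $O (n)$ per right-hand side. Repeating it for $r$ columns gives the $n r$ scaling quoted above.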

The real analogue of this function is f07jhc.

10 Example

This example solves the equations

$A X = B$,

where $A$ is the Hermitian positive definite tridiagonal matrix

$A = \left( \begin{array}{rrrr} 16.0 & 16.0 - 16.0 i & 0 & 0 \\ 16.0 + 16.0 i & 41.0 & 18.0 + 9.0 i & 0 \\ 0 & 18.0 - 9.0 i & 46.0 & 1.0 + 4.0 i \\ 0 & 0 & 1.0 - 4.0 i & 21.0 \end{array} \right)$

and

$B = \left( \begin{array}{rr} 64.0 + 16.0 i & - 16.0 - 32.0 i \\ 93.0 + 62.0 i & 61.0 - 66.0 i \\ 78.0 - 80.0 i & 71.0 - 74.0 i \\ 14.0 - 27.0 i & 35.0 + 15.0 i \end{array} \right)$.

Estimates for the backward errors and forward errors are also output.

NAG CL Interface f07jvc (zptrfs)
