NAG Library Manual, Mark 26

NAG AD Library Manual, Mark 26

NAG C Library Manual, Mark 26

F07 (lapacklin) Chapter Contents

NAG Library Chapter Introduction

F07 (lapacklin)
Linear Equations (LAPACK)

Keyword Search:

NAG Library Manual, Mark 26

NAG AD Library Manual, Mark 26

NAG C Library Manual, Mark 26

F07 (lapacklin) Chapter Contents

▸▿ Contents

1 Scope of the Chapter

▸▿ 2 Background to the Problems

2.1 Notation

2.2 Matrix Factorizations

2.3 Solution of Systems of Equations

▸▿ 2.4 Sensitivity and Error Analysis

2.4.1 Normwise error bounds

2.4.2 Estimating condition numbers

2.4.3 Scaling and Equilibration

2.4.4 Componentwise error bounds

2.4.5 Iterative refinement of the solution

2.5 Matrix Inversion

2.6 Packed Storage Formats

2.7 Band and Tridiagonal Matrices

2.8 Block Partitioned Algorithms

2.9 Mixed Precision LAPACK Routines

▸▿ 3 Recommendations on Choice and Use of Available Routines

3.1 Available Routines

3.2 NAG Names and LAPACK Names

▸▿ 3.3 Matrix Storage Schemes

3.3.1 Conventional storage

3.3.2 Packed storage

3.3.3 Rectangular Full Packed (RFP) Storage

3.3.4 Band storage

3.3.5 Unit triangular matrices

3.3.6 Real diagonal elements of complex matrices

▸▿ 3.4 Parameter Conventions

3.4.1 Option arguments

3.4.2 Problem dimensions

3.4.3 Length of work arrays

3.4.4 Error-handling and the diagnostic argument INFO

▸▿ 3.5 Tables of Driver and Computational Routines

3.5.1 Real matrices

3.5.2 Complex matrices

4 Functionality Index

5 Auxiliary Routines Associated with Library Routine Arguments

6 Routines Withdrawn or Scheduled for Withdrawal

7 References

© The Numerical Algorithms Group Ltd. 2018

1

Scope of the Chapter

This chapter provides routines for the solution of systems of simultaneous linear equations, and associated computations. It provides routines for

matrix factorizations;
solution of linear equations;
estimating matrix condition numbers;
computing error bounds for the solution of linear equations;
matrix inversion;
computing scaling factors to equilibrate a matrix.

Routines are provided for both real and complex data.

For a general introduction to the solution of systems of linear equations, you should turn first to the F04 Chapter Introduction. The decision trees, in Section 4 in the F04 Chapter Introduction, direct you to the most appropriate routines in Chapters F04 or F07 for solving your particular problem. In particular, Chapters F04 and F07 contain Black Box (or driver) routines which enable some standard types of problem to be solved by a call to a single routine. Where possible, routines in Chapter F04 call Chapter F07 routines to perform the necessary computational tasks.

There are two types of driver routines in this chapter: simple drivers which just return the solution to the linear equations; and expert drivers which also return condition and error estimates and, in many cases, also allow equilibration. The simple drivers for real matrices have names of the form F07_AF (D__SV) and for complex matrices have names of the form F07_NF (Z__SV). The expert drivers for real matrices have names of the form F07_BF (D__SVX) and for complex matrices have names of the form F07_PF (Z__SVX).

The routines in this chapter (Chapter F07) handle only dense and band matrices (not matrices with more specialised structures, or general sparse matrices).

The routines in this chapter have all been derived from the LAPACK project (see Anderson et al. (1999)). They have been designed to be efficient on a wide range of high-performance computers, without compromising efficiency on conventional serial machines.

2

Background to the Problems

This section is only a brief introduction to the numerical solution of systems of linear equations. Consult a standard textbook, for example Golub and Van Loan (1996) for a more thorough discussion.

2.1

Notation

We use the standard notation for a system of simultaneous linear equations:

A x = b

(1)

where

A

is the coefficient matrix,

b

is the right-hand side, and

x

is the solution.

A

is assumed to be a square matrix of order

n

.

If there are several right-hand sides, we write

A X = B

(2)

where the columns of

B

are the individual right-hand sides, and the columns of

X

are the corresponding solutions.

We also use the following notation, both here and in the routine documents:

$\hat{x}$	a computed solution to $A x = b$ , (which usually differs from the exact solution $x$ because of round-off error)
$r = b - A \hat{x}$	the residual corresponding to the computed solution $\hat{x}$
${‖x‖}_{\infty} = \max_{i} \|x_{i}\|$	the $\infty$ -norm of the vector $x$
${‖x‖}_{1} = \sum_{j = 1}^{n} \|x_{j}\|$	the $1$ -norm of the vector $x$
${‖A‖}_{\infty} = \max_{i} \sum_{j} \|a_{i j}\|$	the $\infty$ -norm of the matrix $A$
${‖A‖}_{1} = \max_{j} \sum_{i = 1}^{n} \|a_{i j}\|$	the $1$ -norm of the matrix $A$
$\|x\|$	the vector with elements $\|x_{i}\|$
$\|A\|$	the matrix with elements $\|a_{i j}\|$

Inequalities of the form

|A| \leq |B|

are interpreted component-wise, that is

|a_{i j}| \leq |b_{i j}|

for all

i, j

.

2.2

Matrix Factorizations

If

A

is upper or lower triangular,

A x = b

can be solved by a straightforward process of backward or forward substitution.

Otherwise, the solution is obtained after first factorizing

A

, as follows.

General matrices (LU factorization with partial pivoting)

A = P L U

where

P

is a permutation matrix,

L

is lower-triangular with diagonal elements equal to

1

, and

U

is upper-triangular; the permutation matrix

P

(which represents row interchanges) is needed to ensure numerical stability.

Symmetric positive definite matrices (Cholesky factorization)

A = U^{T} U or A = L L^{T}

where

U

is upper triangular and

L

is lower triangular.

Symmetric positive semidefinite matrices (pivoted Cholesky factorization)

A = P U^{T} U P^{T} or A = P L L^{T} P^{T}

where

P

is a permutation matrix,

U

is upper triangular and

L

is lower triangular. The permutation matrix

P

(which represents row-and-column interchanges) is needed to ensure numerical stability and to reveal the numerical rank of

A

.

Symmetric indefinite matrices (Bunch–Kaufman factorization)

A = P U D U^{T} P^{T} or A = P L D L^{T} P^{T}

where

P

is a permutation matrix,

U

is upper triangular,

L

is lower triangular, and

D

is a block diagonal matrix with diagonal blocks of order

1

or

2

;

U

and

L

have diagonal elements equal to

1

, and have

2

by

2

unit matrices on the diagonal corresponding to the

2

by

2

blocks of

D

. The permutation matrix

P

(which represents symmetric row-and-column interchanges) and the

2

by

2

blocks in

D

are needed to ensure numerical stability. If

A

is in fact positive definite, no interchanges are needed and the factorization reduces to

A = U D U^{T}

or

A = L D L^{T}

with diagonal

D

, which is simply a variant form of the Cholesky factorization.

2.3

Solution of Systems of Equations

Given one of the above matrix factorizations, it is straightforward to compute a solution to

A x = b

by solving two subproblems, as shown below, first for

y

and then for

x

. Each subproblem consists essentially of solving a triangular system of equations by forward or backward substitution; the permutation matrix

P

and the block diagonal matrix

D

introduce only a little extra complication:

General matrices ( LU factorization)

\begin{array}{l} L y = P^{T} b \\ U x = y \end{array}

Symmetric positive definite matrices (Cholesky factorization)

\begin{array}{l} U^{T} y = b \\ U x = y \end{array} or \begin{array}{l} L y = b \\ L^{T} x = y \end{array}

Symmetric indefinite matrices (Bunch–Kaufman factorization)

\begin{array}{l} P U D y = b \\ U^{T} P^{T} x = y \end{array} or \begin{array}{l} P L D y = b \\ L^{T} P^{T} x = y \end{array}

2.4

Sensitivity and Error Analysis

2.4.1

Normwise error bounds

Frequently, in practical problems the data

A

and

b

are not known exactly, and it is then important to understand how uncertainties or perturbations in the data can affect the solution.

If

x

is the exact solution to

A x = b

, and

x + δ x

is the exact solution to a perturbed problem

(A + δ A) (x + δ x) = (b + δ b)

, then

\frac{‖δ x‖}{‖x‖} \leq κ (A) (\frac{‖δ A‖}{‖A‖} + \frac{‖δ b‖}{‖b‖}) + \dots (second-order terms)

where

κ (A)

is the condition number of

A

defined by

κ (A) = ‖A‖ . ‖A^{- 1}‖ .

(3)

In other words, relative errors in

A

or

b

may be amplified in

x

by a factor

κ (A)

. Section 2.4.2 discusses how to compute or estimate

κ (A)

.

Similar considerations apply when we study the effects of rounding errors introduced by computation in finite precision. The effects of rounding errors can be shown to be equivalent to perturbations in the original data, such that

\frac{‖δ A‖}{‖A‖}

and

\frac{‖δ b‖}{‖b‖}

are usually at most

p (n) ε

, where

ε

is the machine precision and

p (n)

is an increasing function of

n

which is seldom larger than

10 n

(although in theory it can be as large as

2^{n - 1}

).

In other words, the computed solution

\hat{x}

is the exact solution of a linear system

(A + δ A) \hat{x} = b + δ b

which is close to the original system in a normwise sense.

2.4.2

Estimating condition numbers

The previous section has emphasized the usefulness of the quantity

κ (A)

in understanding the sensitivity of the solution of

A x = b

. To compute the value of

κ (A)

from equation (3) is more expensive than solving

A x = b

in the first place. Hence it is standard practice to estimate

κ (A)

, in either the

1

-norm or the

\infty

-norm, by a method which only requires

O (n^{2})

additional operations, assuming that a suitable factorization of

A

is available.

The method used in this chapter is Higham's modification of Hager's method (see Higham (1988)). It yields an estimate which is never larger than the true value, but which seldom falls short by more than a factor of

3

(although artificial examples can be constructed where it is much smaller). This is acceptable since it is the order of magnitude of

κ (A)

which is important rather than its precise value.

Because

κ (A)

is infinite if

A

is singular, the routines in this chapter actually return the reciprocal of

κ (A)

.

2.4.3

Scaling and Equilibration

The condition of a matrix and hence the accuracy of the computed solution, may be improved by scaling; thus if

D_{1}

and

D_{2}

are diagonal matrices with positive diagonal elements, then

B = D_{1} A D_{2}

is the scaled matrix. A general matrix is said to be equilibrated if it is scaled so that the lengths of its rows and columns have approximately equal magnitude. Similarly a general matrix is said to be row-equilibrated (column-equilibrated) if it is scaled so that the lengths of its rows (columns) have approximately equal magnitude. Note that row scaling can affect the choice of pivot when partial pivoting is used in the factorization of

A

.

A symmetric or Hermitian positive definite matrix is said to be equilibrated if the diagonal elements are all approximately equal to unity.

For further information on scaling and equilibration see Section 3.5.2 of Golub and Van Loan (1996), Section 7.2, 7.3 and 9.8 of Higham (1988) and Section 5 of Chapter 4 of Wilkinson (1965).

Routines are provided to return the scaling factors that equilibrate a matrix for general, general band, symmetric and Hermitian positive definite and symmetric and Hermitian positive definite band matrices.

2.4.4

Componentwise error bounds

A disadvantage of normwise error bounds is that they do not reflect any special structure in the data

A

and

b

– that is, a pattern of elements which are known to be zero – and the bounds are dominated by the largest elements in the data.

Componentwise error bounds overcome these limitations. Instead of the normwise relative error, we can bound the relative error in each component of

A

and

b

:

\max_{i j k} (\frac{|δ a_{i j}|}{|a_{i j}|}, \frac{|δ b_{k}|}{|b_{k}|}) \leq ω

where the component-wise backward error bound

ω

is given by

ω = \max_{i} \frac{|r_{i}|}{{(|A| . |\hat{x}| + |b|)}_{i}} .

Routines are provided in this chapter which compute

ω

, and also compute a forward error bound which is sometimes much sharper than the normwise bound given earlier:

\frac{{‖x - \hat{x}‖}_{\infty}}{{‖x‖}_{\infty}} \leq \frac{{‖|A^{- 1}| . |r|‖}_{\infty}}{{‖x‖}_{\infty}} .

Care is taken when computing this bound to allow for rounding errors in computing

r

. The norm

{‖|A^{- 1}| . |r|‖}_{\infty}

is estimated cheaply (without computing

A^{- 1}

) by a modification of the method used to estimate

κ (A)

.

2.4.5

Iterative refinement of the solution

If

\hat{x}

is an approximate computed solution to

A x = b

, and

r

is the corresponding residual, then a procedure for iterative refinement of

\hat{x}

can be defined as follows, starting with

x_{0} = \hat{x}

:

for $i = 0, 1, \dots$ , until convergence

compute $r_{i} = b - A x_{i}$
solve $A d_{i} = r_{i}$
compute $x_{i + 1} = x_{i} + d_{i}$

In Chapter F04, routines are provided which perform this procedure using additional precision to compute

r

, and are thus able to reduce the forward error to the level of machine precision.

The routines in this chapter do not use additional precision to compute

r

, and cannot guarantee a small forward error, but can guarantee a small backward error (except in rare cases when

A

is very ill-conditioned, or when

A

and

x

are sparse in such a way that

|A| . |x|

has a zero or very small component). The iterations continue until the backward error has been reduced as much as possible; usually only one iteration is needed.

2.5

Matrix Inversion

It is seldom necessary to compute an explicit inverse of a matrix. In particular, do not attempt to solve

A x = b

by first computing

A^{- 1}

and then forming the matrix-vector product

x = A^{- 1} b

; the procedure described in Section 2.3 is more efficient and more accurate.

However, routines are provided for the rare occasions when an inverse is needed, using one of the factorizations described in Section 2.2.

2.6

Packed Storage Formats

Routines which handle symmetric matrices are usually designed so that they use either the upper or lower triangle of the matrix; it is not necessary to store the whole matrix. If the upper or lower triangle is stored conventionally in the upper or lower triangle of a two-dimensional array, the remaining elements of the array can be used to store other useful data.

However, that is not always convenient, and if it is important to economize on storage, the upper or lower triangle can be stored in a one-dimensional array of length

n (n + 1) / 2

or a two-dimensional array with

n (n + 1) / 2

elements; in other words, the storage is almost halved.

The one-dimensional array storage format is referred to as packed storage; it is described in Section 3.3.2. The two-dimensional array storage format is referred to as Rectangular Full Packed (RFP) format; it is described in Section 3.3.3. They may also be used for triangular matrices.

Routines designed for these packed storage formats perform the same number of arithmetic operations as routines which use conventional storage. Those using a packed one-dimensional array are usually less efficient, especially on high-performance computers, so there is then a trade-off between storage and efficiency. The RFP routines are as efficient as for conventional storage, although only a small subset of routines use this format.

2.7

Band and Tridiagonal Matrices

A band matrix is one whose nonzero elements are confined to a relatively small number of subdiagonals or superdiagonals on either side of the main diagonal. A tridiagonal matrix is a special case of a band matrix with just one subdiagonal and one superdiagonal. Algorithms can take advantage of bandedness to reduce the amount of work and storage required. The storage scheme used for band matrices is described in Section 3.3.4.

The

L U

factorization for general matrices, and the Cholesky factorization for symmetric and Hermitian positive definite matrices both preserve bandedness. Hence routines are provided which take advantage of the band structure when solving systems of linear equations.

The Cholesky factorization preserves bandedness in a very precise sense: the factor

U

or

L

has the same number of superdiagonals or subdiagonals as the original matrix. In the

L U

factorization, the row-interchanges modify the band structure: if

A

has

k_{l}

subdiagonals and

k_{u}

superdiagonals, then

L

is not a band matrix but still has at most

k_{l}

nonzero elements below the diagonal in each column; and

U

has at most

k_{l} + k_{u}

superdiagonals.

The Bunch–Kaufman factorization does not preserve bandedness, because of the need for symmetric row-and-column permutations; hence no routines are provided for symmetric indefinite band matrices.

The inverse of a band matrix does not in general have a band structure, so no routines are provided for computing inverses of band matrices.

2.8

Block Partitioned Algorithms

Many of the routines in this chapter use what is termed a block partitioned algorithm. This means that at each major step of the algorithm a block of rows or columns is updated, and most of the computation is performed by matrix-matrix operations on these blocks. The matrix-matrix operations are performed by calls to the Level 3 BLAS (see Chapter F06), which are the key to achieving high performance on many modern computers. See Golub and Van Loan (1996) or Anderson et al. (1999) for more about block partitioned algorithms.

The performance of a block partitioned algorithm varies to some extent with the block size – that is, the number of rows or columns per block. This is a machine-dependent argument, which is set to a suitable value when the library is implemented on each range of machines. You do not normally need to be aware of what value is being used. Different block sizes may be used for different routines. Values in the range

16

to

64

are typical.

On some machines there may be no advantage from using a block partitioned algorithm, and then the routines use an unblocked algorithm (effectively a block size of

1

), relying solely on calls to the Level 2 BLAS (see Chapter F06 again).

The only situation in which you need some awareness of the block size is when it affects the amount of workspace to be supplied to a particular routine. This is discussed in Section 3.4.3.

2.9

Mixed Precision LAPACK Routines

Some LAPACK routines use mixed precision arithmetic in an effort to solve problems more efficiently on modern hardware. They work by converting a double precision problem into an equivalent single precision problem, solving it and then using iterative refinement in double precision to find a full precision solution to the original problem. The method may fail if the problem is too ill-conditioned to allow the initial single precision solution, in which case the routines fall back to solve the original problem entirely in double precision. The vast majority of problems are not so ill-conditioned, and in those cases the technique can lead to significant gains in speed without loss of accuracy. This is particularly true on machines where double precision arithmetic is significantly slower than single precision.

3

Recommendations on Choice and Use of Available Routines

3.1

Available Routines

Tables 1 to 8 in Section 3.5 show the routines which are provided for performing different computations on different types of matrices. Tables 1 to 4 show routines for real matrices; Tables 5 to 8 show routines for complex matrices. Each entry in the table gives the NAG routine name and the LAPACK double precision name (see Section 3.2).

Routines are provided for the following types of matrix:

general
general band
general tridiagonal
symmetric or Hermitian positive definite
symmetric or Hermitian positive definite (packed storage)
symmetric or Hermitian positive definite (RFP storage)
symmetric or Hermitian positive definite band
symmetric or Hermitian positive definite tridiagonal
symmetric or Hermitian indefinite
symmetric or Hermitian indefinite (packed storage)
triangular
triangular (packed storage)
triangular (RFP storage)
triangular band

For each of the above types of matrix (except where indicated), routines are provided to perform the following computations:

(a)	(except for RFP matrices) solve a system of linear equations (driver routines);
(b)	(except for RFP matrices) solve a system of linear equations with condition and error estimation (expert drivers);
(c)	(except for triangular matrices) factorize the matrix (see Section 2.2);
(d)	solve a system of linear equations, using the factorization (see Section 2.3);
(e)	(except for RFP matrices) estimate the condition number of the matrix, using the factorization (see Section 2.4.2); these routines also require the norm of the original matrix (except when the matrix is triangular) which may be computed by a routine in Chapter F06;
(f)	(except for RFP matrices) refine the solution and compute forward and backward error bounds (see Sections 2.4.4 and 2.4.5); these routines require the original matrix and right-hand side, as well as the factorization returned from (a) and the solution returned from (b);
(g)	(except for band and tridiagonal matrices) invert the matrix, using the factorization (see Section 2.5);
(h)	(except for tridiagonal, symmetric indefinite, triangular and RFP matrices) compute scale factors to equilibrate the matrix (see Section 2.4.3).

Thus, to solve a particular problem, it is usually only necessary to call a single driver routine, but alternatively two or more routines may be called in succession. This is illustrated in the example programs in the routine documents.

3.2

NAG Names and LAPACK Names

As well as the NAG routine name (beginning F07), Tables 1 to 8 show the LAPACK routine names in double precision.

The routines may be called either by their NAG names or by their LAPACK names. When using the NAG Library, the double precision form of the LAPACK name must be used (beginning with D- or Z-).

References to Chapter F07 routines in the manual normally include the LAPACK double precision names, for example, f07adf (dgetrf).

The LAPACK routine names follow a simple scheme (which is similar to that used for the BLAS in Chapter F06). Most names have the structure XYYZZZ, where the components have the following meanings:

–

the initial letter X indicates the data type (real or complex) and precision:

S	– real, single precision (in Fortran 77, REAL)
D	– real, double precision (in Fortran 77, DOUBLE PRECISION)
C	– complex, single precision (in Fortran 77, COMPLEX)
Z	– complex, double precision (in Fortran 77, COMPLEX*16 or DOUBLE COMPLEX)

–

exceptionally, the mixed precision LAPACK routines described in Section 2.9 replace the initial first letter by a pair of letters, as:

DS	– double precision routine using single precision internally
ZC	– double complex routine using single precision complex internally

–

the letters YY indicate the type of the matrix

A

(and in some cases its storage scheme):

GE	– general
GB	– general band
PO	– symmetric or Hermitian positive definite
PF	– symmetric or Hermitian positive definite (RFP storage)
PP	– symmetric or Hermitian positive definite (packed storage)
PB	– symmetric or Hermitian positive definite band
SY	– symmetric indefinite
SF	– symmetric indefinite (RFP storage)
SP	– symmetric indefinite (packed storage)
HE	– (complex) Hermitian indefinite
HF	– (complex) Hermitian indefinite (RFP storage)
HP	– (complex) Hermitian indefinite (packed storage)
GT	– general tridiagonal
PT	– symmetric or Hermitian positive definite tridiagonal
TR	– triangular
TF	– triangular (RFP storage)
TP	– triangular (packed storage)
TB	– triangular band

–

the last two or three letters ZZ or ZZZ indicate the computation performed. Examples are:

TRF	– triangular factorization
TRS	– solution of linear equations, using the factorization
CON	– estimate condition number
RFS	– refine solution and compute error bounds
TRI	– compute inverse, using the factorization

Thus the routine dgetrf performs a triangular factorization of a real general matrix in double precision; the corresponding routine for a complex general matrix is zgetrf.

3.3

Matrix Storage Schemes

In this chapter the following different storage schemes are used for matrices:

– conventional storage in a two-dimensional array;
– packed storage for symmetric, Hermitian or triangular matrices;
– rectangular full packed (RFP) storage for symmetric, Hermitian or triangular matrices;
– band storage for band matrices.

These storage schemes are compatible with those used in Chapter F06 (especially in the BLAS) and Chapter F08, but different schemes for packed or band storage are used in a few older routines in Chapters F01, F02, F03 and F04.

In the examples below,

*

indicates an array element which need not be set and is not referenced by the routines. The examples illustrate only the relevant part of the arrays; array arguments may of course have additional rows or columns, according to the usual rules for passing array arguments in Fortran 77.

3.3.1

Conventional storage

The default scheme for storing matrices is the obvious one: a matrix

A

is stored in a two-dimensional array a, with matrix element

a_{i j}

stored in array element

a (i, j)

.

If a matrix is triangular (upper or lower, as specified by the argument uplo), only the elements of the relevant triangle are stored; the remaining elements of the array need not be set. Such elements are indicated by * or

⌴

in the examples below.

For example, when

n = 4

:

uplo	Triangular matrix $A$	Storage in array a
'U'	$(\begin{array}{l} a_{11} & a_{12} & a_{13} & a_{14} \\ a_{22} & a_{23} & a_{24} \\ a_{33} & a_{34} \\ a_{44} \end{array})$	$\begin{matrix} a_{11} & a_{12} & a_{13} & a_{14} \\ ⌴ & a_{22} & a_{23} & a_{24} \\ ⌴ & ⌴ & a_{33} & a_{34} \\ ⌴ & ⌴ & ⌴ & a_{44} \end{matrix}$
'L'	$(\begin{array}{l} a_{11} \\ a_{21} & a_{22} \\ a_{31} & a_{32} & a_{33} \\ a_{41} & a_{42} & a_{43} & a_{44} \end{array})$	$\begin{matrix} a_{11} & ⌴ & ⌴ & ⌴ \\ a_{21} & a_{22} & ⌴ & ⌴ \\ a_{31} & a_{32} & a_{33} & ⌴ \\ a_{41} & a_{42} & a_{43} & a_{44} \end{matrix}$

Routines which handle symmetric or Hermitian matrices allow for either the upper or lower triangle of the matrix (as specified by uplo) to be stored in the corresponding elements of the array; the remaining elements of the array need not be set.

For example, when

n = 4

:

uplo	Hermitian matrix $A$	Storage in array a
'U'	$(\begin{array}{l} a_{11} & a_{12} & a_{13} & a_{14} \\ {\bar{a}}_{12} & a_{22} & a_{23} & a_{24} \\ {\bar{a}}_{13} & {\bar{a}}_{23} & a_{33} & a_{34} \\ {\bar{a}}_{14} & {\bar{a}}_{24} & {\bar{a}}_{34} & a_{44} \end{array})$	$\begin{matrix} a_{11} & a_{12} & a_{13} & a_{14} \\ ⌴ & a_{22} & a_{23} & a_{24} \\ ⌴ & ⌴ & a_{33} & a_{34} \\ ⌴ & ⌴ & ⌴ & a_{44} \end{matrix}$
'L'	$(\begin{array}{l} a_{11} & {\bar{a}}_{21} & {\bar{a}}_{31} & {\bar{a}}_{41} \\ a_{21} & a_{22} & {\bar{a}}_{32} & {\bar{a}}_{42} \\ a_{31} & a_{32} & a_{33} & {\bar{a}}_{43} \\ a_{41} & a_{42} & a_{43} & a_{44} \end{array})$	$\begin{matrix} a_{11} & ⌴ & ⌴ & ⌴ \\ a_{21} & a_{22} & ⌴ & ⌴ \\ a_{31} & a_{32} & a_{33} & ⌴ \\ a_{41} & a_{42} & a_{43} & a_{44} \end{matrix}$

3.3.2

Packed storage

Symmetric, Hermitian or triangular matrices may be stored more compactly, if the relevant triangle (again as specified by uplo) is packed by columns in a one-dimensional array. In this chapter, as in Chapters F06 and F08, arrays which hold matrices in packed storage, have names ending in P. For a matrix of order

n

, the array must have at least

n (n + 1) / 2

elements. So:

if $uplo ='U'$ , $a_{i j}$ is stored in $ap (i + j (j - 1) / 2)$ for $i \leq j$ ;
if $uplo ='L'$ , $a_{i j}$ is stored in $ap (i + (2 n - j) (j - 1) / 2)$ for $j \leq i$ .

For example:

	Triangle of matrix $A$	Packed storage in array ap
$uplo ='U'$	$(\begin{array}{l} a_{11} & a_{12} & a_{13} & a_{14} \\ a_{22} & a_{23} & a_{24} \\ a_{33} & a_{34} \\ a_{44} \end{array})$	$a_{11} \underset{︸}{a_{12} a_{22}} \underset{︸}{a_{13} a_{23} a_{33}} \underset{︸}{a_{14} a_{24} a_{34} a_{44}}$
$uplo ='L'$	$(\begin{array}{l} a_{11} \\ a_{21} & a_{22} \\ a_{31} & a_{32} & a_{33} \\ a_{41} & a_{42} & a_{43} & a_{44} \end{array})$	$\underset{︸}{a_{11} a_{21} a_{31} a_{41}} \underset{︸}{a_{22} a_{32} a_{42}} \underset{︸}{a_{33} a_{43}} a_{44}$

Note that for real symmetric matrices, packing the upper triangle by columns is equivalent to packing the lower triangle by rows; packing the lower triangle by columns is equivalent to packing the upper triangle by rows. (For complex Hermitian matrices, the only difference is that the off-diagonal elements are conjugated.)

3.3.3

Rectangular Full Packed (RFP) Storage

The rectangular full packed (RFP) storage format offers the same savings in storage as the packed storage format (described in Section 3.3.2), but is likely to be much more efficient in general since the block structure of the matrix is maintained. This structure can be exploited using block partition algorithms (see Section 2.8) in a similar way to matrices that use conventional storage.

Figure f07intro-rfp

Figure 1

Figure 1 gives a graphical representation of the key idea of RFP for the particular case of a lower triangular matrix of even dimensions. In all cases the original triangular matrix of stored elements is separated into a trapezoidal part and a triangular part. The number of columns in these two parts is equal when the dimension of the matrix is even,

n = 2 k

, while the trapezoidal part has

k + 1

columns when

n = 2 k + 1

. The smaller part is then transposed and fitted onto the trapezoidal part forming a rectangle. The rectangle has dimensions

2 k + 1

and

q

, where

q = k

when

n

is even and

q = k + 1

when

n

is odd.

For routines using RFP there is the option of storing the rectangle as described above (

transr ='N'

) or its transpose (

transr ='T'

, for real a) or its conjugate transpose (

transr ='C'

, for complex a).

As an example, we first consider RFP for the case

n = 2 k

with

k = 3

.

If

transr ='N'

, then ar holds a as follows:

For $uplo ='U'$ the upper trapezoid $ar (1 : 6, 1 : 3)$ consists of the last three columns of a upper. The lower triangle $ar (5 : 7, 1 : 3)$ consists of the transpose of the first three columns of a upper.
For $uplo ='L'$ the lower trapezoid $ar (2 : 7, 1 : 3)$ consists of the first three columns of a lower. The upper triangle $ar (1 : 3, 1 : 3)$ consists of the transpose of the last three columns of a lower.

If

transr ='T'

, then ar in both uplo cases is just the transpose of ar as defined when

transr ='N'

.

uplo	Triangle of matrix $A$	Rectangular Full Packed matrix $AR$
uplo	Triangle of matrix $A$	$transr ='N'$	$transr ='T'$
'U'	$(\begin{array}{l} 00 & 01 & 02 & 03 & 04 & 05 \\ 11 & 12 & 13 & 14 & 15 \\ 22 & 23 & 24 & 25 \\ 33 & 34 & 35 \\ 44 & 45 \\ 55 \end{array})$	$\begin{matrix} 03 & 04 & 05 \\ 13 & 14 & 15 \\ 23 & 24 & 25 \\ 33 & 34 & 35 \\ 00 & 44 & 45 \\ 01 & 11 & 55 \\ 02 & 12 & 22 \end{matrix}$	$\begin{matrix} 03 & 13 & 23 & 33 & 00 & 01 & 02 \\ 04 & 14 & 24 & 34 & 44 & 11 & 12 \\ 05 & 15 & 25 & 35 & 45 & 55 & 22 \end{matrix}$
'L'	$(\begin{array}{l} 00 \\ 10 & 11 \\ 20 & 21 & 22 \\ 30 & 31 & 32 & 33 \\ 40 & 41 & 42 & 43 & 44 \\ 50 & 51 & 52 & 53 & 54 & 55 \end{array})$	$\begin{matrix} 33 & 43 & 53 \\ 00 & 44 & 54 \\ 10 & 11 & 55 \\ 20 & 21 & 22 \\ 30 & 31 & 32 \\ 40 & 41 & 42 \\ 50 & 51 & 52 \end{matrix}$	$\begin{matrix} 33 & 00 & 10 & 20 & 30 & 40 & 50 \\ 43 & 44 & 11 & 21 & 31 & 41 & 51 \\ 53 & 54 & 55 & 22 & 32 & 42 & 52 \end{matrix}$

Now we consider RFP for the case

n = 2 k + 1

and

k = 2

.

If

transr ='N'

. ar holds a as follows:

if $uplo ='U'$ the upper trapezoid $ar (1 : 5, 1 : 3)$ consists of the last three columns of a upper. The lower triangle $ar (4 : 5, 1 : 2)$ consists of the transpose of the first two columns of a upper;
if $uplo ='L'$ the lower trapezoid $ar (1 : 5, 1 : 3)$ consists of the first three columns of a lower. The upper triangle $ar (1 : 2, 2 : 3)$ consists of the transpose of the last two columns of a lower.

If

transr ='T'

. ar in both uplo cases is just the transpose of ar as defined when

transr ='N'

.

uplo	Triangle of matrix $A$	Rectangular Full Packed matrix $AR$
uplo	Triangle of matrix $A$	$transr ='N'$	$transr ='T'$
'U'	$(\begin{array}{l} 00 & 01 & 02 & 03 & 04 \\ 11 & 12 & 13 & 14 \\ 22 & 23 & 24 \\ 33 & 34 \\ 44 \end{array})$	$\begin{matrix} 02 & 03 & 04 \\ 12 & 13 & 14 \\ 22 & 23 & 24 \\ 00 & 33 & 34 \\ 01 & 11 & 44 \end{matrix}$	$\begin{matrix} 02 & 12 & 22 & 00 & 01 \\ 03 & 13 & 23 & 33 & 11 \\ 04 & 14 & 24 & 34 & 44 \end{matrix}$
'L'	$(\begin{array}{l} 00 \\ 10 & 11 \\ 20 & 21 & 22 \\ 30 & 31 & 32 & 33 \\ 40 & 41 & 42 & 43 & 44 \end{array})$	$\begin{matrix} 00 & 33 & 43 \\ 10 & 11 & 44 \\ 20 & 21 & 22 \\ 30 & 31 & 32 \\ 40 & 41 & 42 \end{matrix}$	$\begin{matrix} 00 & 10 & 20 & 30 & 40 & 50 \\ 33 & 11 & 21 & 31 & 41 & 51 \\ 43 & 44 & 22 & 32 & 42 & 52 \end{matrix}$

Explicitly, in the real matrix case, ar is a one-dimensional array of length

n (n + 1) / 2

and contains the elements of a as follows:

for $uplo ='U'$ and $transr ='N'$ ,: $a_{i j}$ is stored in $ar ((2 k + 1) (i - 1) + j + k + 1)$ , for $1 \leq j \leq k$ and $1 \leq i \leq j$ , and
$a_{i j}$ is stored in $ar ((2 k + 1) (j - k - 1) + i)$ , for $k < j \leq n$ and $1 \leq i \leq j$ ;
for $uplo ='U'$ and $transr ='T'$ ,: $a_{i j}$ is stored in $ar (q (j + k) + i)$ , for $1 \leq j \leq k$ and $1 \leq i \leq j$ , and
$a_{i j}$ is stored in $ar (q (i - 1) + j - k)$ , for $k < j \leq n$ and $1 \leq i \leq j$ ;
for $uplo ='L'$ and $transr ='N'$ ,: $a_{i j}$ is stored in $ar ((2 k + 1) (j - 1) + i + k - q + 1)$ , for $1 \leq j \leq q$ and $j \leq i \leq n$ , and
$a_{i j}$ is stored in $ar ((2 k + 1) (i - k - 1) + j - q)$ , for $q < j \leq n$ and $j \leq i \leq n$ ;
for $uplo ='L'$ and $transr ='T'$ ,: $a_{i j}$ is stored in $ar (q (i + k - q) + j)$ , for $1 \leq j \leq q$ and $1 \leq i \leq n$ , and
$a_{i j}$ is stored in $ar (q (j - 1 - q) + i - k)$ , for $q < j \leq n$ and $1 \leq i \leq n$ .

In the case of complex matrices, the assumption is that the full matrix, if it existed, would be Hermitian. Thus, when

transr ='N'

, the triangular portion of a that is, in the real case, transposed into the notional

(2 k + 1)

by

q

RFP matrix is also conjugated. When

transr ='C'

the notional

q

by

(2 k + 1)

RFP matrix is the conjugate transpose of the corresponding

transr ='N'

RFP matrix. Explicitly, for complex a, the array ar contains the elements (or conjugated elements) of a as follows:

for $uplo ='U'$ and $transr ='N'$ ,: ${\bar{a}}_{i j}$ is stored in $ar ((2 k + 1) (i - 1) + j + k + 1)$ , for $1 \leq j \leq k$ and $1 \leq i \leq j$ , and
$a_{i j}$ is stored in $ar ((2 k + 1) (j - k - 1) + i)$ , for $k < j \leq n$ and $1 \leq i \leq j$ ;
for $uplo ='U'$ and $transr ='C'$ ,: $a_{i j}$ is stored in $ar (q (j + k) + i)$ , for $1 \leq j \leq k$ and $1 \leq i \leq j$ , and
${\bar{a}}_{i j}$ is stored in $ar (q (i - 1) + j - k)$ , for $k < j \leq n$ and $1 \leq i \leq j$ ;
for $uplo ='L'$ and $transr ='N'$ ,: $a_{i j}$ is stored in $ar ((2 k + 1) (j - 1) + i + k - q + 1)$ , for $1 \leq j \leq q$ and $j \leq i \leq n$ , and
${\bar{a}}_{i j}$ is stored in $ar ((2 k + 1) (i - k - 1) + j - q)$ , for $q < j \leq n$ and $j \leq i \leq n$ ;
for $uplo ='L'$ and $transr ='C'$ ,: ${\bar{a}}_{i j}$ is stored in $ar (q (i + k - q) + j)$ , for $1 \leq j \leq q$ and $1 \leq i \leq n$ , and
$a_{i j}$ is stored in $ar (q (j - 1 - q) + i - k)$ , for $q < j \leq n$ and $1 \leq i \leq n$ .

3.3.4

Band storage

A band matrix with

k_{l}

subdiagonals and

k_{u}

superdiagonals may be stored compactly in a two-dimensional array with

k_{l} + k_{u} + 1

rows and

n

columns. Columns of the matrix are stored in corresponding columns of the array, and diagonals of the matrix are stored in rows of the array. This storage scheme should be used in practice only if

k_{l}

,

k_{u} ≪ n

, although the routines in Chapters F07 and F08 work correctly for all values of

k_{l}

and

k_{u}

. In Chapters F07 and F08 arrays which hold matrices in band storage have names ending in

B

.

To be precise, elements of matrix elements

a_{i j}

are stored as follows:

$a_{i j}$ is stored in $ab (k_{u} + 1 + i - j, j)$ for $\max (1, j - k_{u}) \leq i \leq \min (n, j + k_{l})$ .

For example, when

n = 5

,

k_{l} = 2

and

k_{u} = 1

:

Band matrix $A$	Band storage in array ab
$(\begin{array}{l} a_{11} & a_{12} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} & a_{34} \\ a_{42} & a_{43} & a_{44} & a_{45} \\ a_{53} & a_{54} & a_{55} \end{array})$	$\begin{matrix} * & a_{12} & a_{23} & a_{34} & a_{45} \\ a_{11} & a_{22} & a_{33} & a_{44} & a_{55} \\ a_{21} & a_{32} & a_{43} & a_{54} & * \\ a_{31} & a_{42} & a_{53} & * & * \end{matrix}$

The elements marked

*

in the upper left and lower right corners of the array ab need not be set, and are not referenced by the routines.

Note: when a general band matrix is supplied for

L U

factorization, space must be allowed to store an additional

k_{l}

superdiagonals, generated by fill-in as a result of row interchanges. This means that the matrix is stored according to the above scheme, but with

k_{l} + k_{u}

superdiagonals.

Triangular band matrices are stored in the same format, with either

k_{l} = 0

if upper triangular, or

k_{u} = 0

if lower triangular.

For symmetric or Hermitian band matrices with

k

subdiagonals or superdiagonals, only the upper or lower triangle (as specified by uplo) need be stored:

if $uplo ='U'$ , $a_{i j}$ is stored in $ab (k + 1 + i - j, j)$ for $\max (1, j - k) \leq i \leq j$ ;
if $uplo ='L'$ , $a_{i j}$ is stored in $ab (1 + i - j, j)$ for $j \leq i \leq \min (n, j + k)$ .

For example, when

n = 5

and

k = 2

:

uplo	Hermitian band matrix $A$	Band storage in array ab
'U'	$(\begin{array}{l} a_{11} & a_{12} & a_{13} \\ {\bar{a}}_{12} & a_{22} & a_{23} & a_{24} \\ {\bar{a}}_{13} & {\bar{a}}_{23} & a_{33} & a_{34} & a_{35} \\ {\bar{a}}_{24} & {\bar{a}}_{34} & a_{44} & a_{45} \\ {\bar{a}}_{35} & {\bar{a}}_{45} & a_{55} \end{array})$	$\begin{array}{l} * & * & a_{13} & a_{24} & a_{35} \\ * & a_{12} & a_{23} & a_{34} & a_{45} \\ a_{11} & a_{22} & a_{33} & a_{44} & a_{55} \end{array}$
'L'	$(\begin{array}{l} a_{11} & {\bar{a}}_{21} & {\bar{a}}_{31} \\ a_{21} & a_{22} & {\bar{a}}_{32} & {\bar{a}}_{42} \\ a_{31} & a_{32} & a_{33} & {\bar{a}}_{43} & {\bar{a}}_{53} \\ a_{42} & a_{43} & a_{44} & {\bar{a}}_{54} \\ a_{53} & a_{54} & a_{55} \end{array})$	$\begin{array}{l} a_{11} & a_{22} & a_{33} & a_{44} & a_{55} \\ a_{21} & a_{32} & a_{43} & a_{54} & * \\ a_{31} & a_{42} & a_{53} & * & * \end{array}$

Note that different storage schemes for band matrices are used by some routines in Chapters F01, F02, F03 and F04.

3.3.5

Unit triangular matrices

Some routines in this chapter have an option to handle unit triangular matrices (that is, triangular matrices with diagonal elements

= 1

). This option is specified by an argument diag. If

diag ='U'

(Unit triangular), the diagonal elements of the matrix need not be stored, and the corresponding array elements are not referenced by the routines. The storage scheme for the rest of the matrix (whether conventional, packed or band) remains unchanged.

3.3.6

Real diagonal elements of complex matrices

Complex Hermitian matrices have diagonal elements that are by definition purely real. In addition, complex triangular matrices which arise in Cholesky factorization are defined by the algorithm to have real diagonal elements.

If such matrices are supplied as input to routines in Chapters F07 and F08, the imaginary parts of the diagonal elements are not referenced, but are assumed to be zero. If such matrices are returned as output by the routines, the computed imaginary parts are explicitly set to zero.

3.4

Parameter Conventions

3.4.1

Option arguments

Most routines in this chapter have one or more option arguments, of type CHARACTER. The descriptions in Section 5 of the routine documents refer only to upper-case values (for example

uplo ='U'

or

'L'

); however, in every case, the corresponding lower-case characters may be supplied (with the same meaning). Any other value is illegal.

A longer character string can be passed as the actual argument, making the calling program more readable, but only the first character is significant. (This is a feature of Fortran 77.) For example:

Call dgetrs('Transpose',...)

3.4.2

Problem dimensions

It is permissible for the problem dimensions (for example, m in f07adf (dgetrf), n or nrhs in f07aef (dgetrs)) to be passed as zero, in which case the computation (or part of it) is skipped. Negative dimensions are regarded as an error.

3.4.3

Length of work arrays

A few routines implementing block partitioned algorithms require workspace sufficient to hold one block of rows or columns of the matrix if they are to achieve optimum levels of performance — for example, workspace of size

n \times n b

, where

n b

is the optimum block size. In such cases, the actual declared length of the work array must be passed as a separate argument lwork, which immediately follows work in the argument-list.

The routine will still perform correctly when less workspace is provided: it uses the largest block size allowed by the amount of workspace supplied, as long as this is likely to give better performance than the unblocked algorithm. On exit,

work (1)

contains the minimum value of lwork which would allow the routine to use the optimum block size; this value of lwork may be used for subsequent runs.

If lwork indicates that there is insufficient workspace to perform the unblocked algorithm, this is regarded as an illegal value of lwork, and is treated like any other illegal argument value (see Section 3.4.4), though

work (1)

will still be set as described above.

If you are in doubt how much workspace to supply and are concerned to achieve optimum performance, supply a generous amount (assume a block size of

64

, say), and then examine the value of

work (1)

on exit.

3.4.4

Error-handling and the diagnostic argument INFO

Routines in this chapter do not use the usual NAG Library error-handling mechanism, involving the argument IFAIL. Instead they have a diagnostic argument info. (Thus they preserve complete compatibility with the LAPACK specification.)

Whereas IFAIL is an Input/Output argument and must be set before calling a routine, info is purely an Output argument and need not be set before entry.

info indicates the success or failure of the computation, as follows:

$info = 0$ : successful termination
$info > 0$ : failure in the course of computation, control returned to the calling program

If the routine document specifies that the routine may terminate with

info > 0

, then it is essential to test info on exit from the routine. (This corresponds to a soft failure in terms of the usual NAG error-handling terminology.) No error message is output.

All routines check that input arguments such as n or lda or option arguments of type CHARACTER have permitted values. If an illegal value of the

i

th argument is detected, info is set to

- i

, a message is output, and execution of the program is terminated. (This corresponds to a hard failure in the usual NAG terminology.)

3.5

Tables of Driver and Computational Routines

3.5.1

Real matrices

Each entry in the following tables, listing real matrices, gives:

the NAG routine name and
the double precision LAPACK routine name.

	Type of matrix and storage scheme
Operation	general	general band	general tridiagonal
driver	f07aaf (dgesv)	f07baf (dgbsv)	f07caf (dgtsv)
expert driver	f07abf (dgesvx)	f07bbf (dgbsvx)	f07cbf (dgtsvx)
mixed precision driver	f07acf (dsgesv)
factorize	f07adf (dgetrf)	f07bdf (dgbtrf)	f07cdf (dgttrf)
solve	f07aef (dgetrs)	f07bef (dgbtrs)	f07cef (dgttrs)
scaling factors	f07aff (dgeequ)	f07bff (dgbequ)
condition number	f07agf (dgecon)	f07bgf (dgbcon)	f07cgf (dgtcon)
error estimate	f07ahf (dgerfs)	f07bhf (dgbrfs)	f07chf (dgtrfs)
invert	f07ajf (dgetri)

Table 1
Routines for real general matrices

	Type of matrix and storage scheme
Operation	symmetric positive definite	symmetric positive definite (packed storage)	symmetric positive definite (RFP storage)	symmetric positive definite band	symmetric positive definite tridiagonal	symmetric positive semidefinite
driver	f07faf (dposv)	f07gaf (dppsv)		f07haf (dpbsv)	f07jaf (dptsv)
expert driver	f07fbf (dposvx)	f07gbf (dppsvx)		f07hbf (dpbsvx)	f07jbf (dptsvx)
mixed precision	f07fcf (dsposv)
factorize	f07fdf (dpotrf)	f07gdf (dpptrf)	f07wdf (dpftrf)	f07hdf (dpbtrf)	f07jdf (dpttrf)	f07kdf (dpstrf)
solve	f07fef (dpotrs)	f07gef (dpptrs)	f07wef (dpftrs)	f07hef (dpbtrs)	f07jef (dpttrs)
scaling factors	f07fff (dpoequ)	f07gff (dppequ)		f07hff (dpbequ)
condition number	f07fgf (dpocon)	f07ggf (dppcon)		f07hgf (dpbcon)	f07jgf (dptcon)
error estimate	f07fhf (dporfs)	f07ghf (dpprfs)		f07hhf (dpbrfs)	f07jhf (dptrfs)
invert	f07fjf (dpotri)	f07gjf (dpptri)	f07wjf (dpftri)

Table 2
Routines for real symmetric positive definite and positive semidefinite matrices

	Type of matrix and storage scheme
Operation	symmetric indefinite	symmetric indefinite (packed storage)
driver	f07maf (dsysv)	f07paf (dspsv)
expert driver	f07mbf (dsysvx)	f07pbf (dspsvx)
factorize	f07mdf (dsytrf)	f07pdf (dsptrf)
solve	f07mef (dsytrs)	f07pef (dsptrs)
condition number	f07mgf (dsycon)	f07pgf (dspcon)
error estimate	f07mhf (dsyrfs)	f07phf (dsprfs)
invert	f07mjf (dsytri)	f07pjf (dsptri)

Table 3
Routines for real symmetric indefinite matrices

	Type of matrix and storage scheme
Operation	triangular	triangular (packed storage)	triangular (RFP storage)	triangular band
solve	f07tef (dtrtrs)	f07uef (dtptrs)		f07vef (dtbtrs)
condition number	f07tgf (dtrcon)	f07ugf (dtpcon)		f07vgf (dtbcon)
error estimate	f07thf (dtrrfs)	f07uhf (dtprfs)		f07vhf (dtbrfs)
invert	f07tjf (dtrtri)	f07ujf (dtptri)	f07wkf (dtftri)

Table 4
Routines for real triangular matrices

3.5.2

Complex matrices

Each entry in the following tables, listing complex matrices, gives:

the NAG routine name and
the double precision LAPACK routine name.

	Type of matrix and storage scheme
Operation	general	general band	general tridiagonal
driver	f07anf (zgesv)	f07bnf (zgbsv)	f07cnf (zgtsv)
expert driver	f07apf (zgesvx)	f07bpf (zgbsvx)	f07cpf (zgtsvx)
mixed precision driver	f07aqf (zcgesv)
factorize	f07arf (zgetrf)	f07brf (zgbtrf)	f07crf (zgttrf)
solve	f07asf (zgetrs)	f07bsf (zgbtrs)	f07csf (zgttrs)
scaling factors	f07atf (zgeequ)	f07btf (zgbequ)
condition number	f07auf (zgecon)	f07buf (zgbcon)	f07cuf (zgtcon)
error estimate	f07avf (zgerfs)	f07bvf (zgbrfs)	f07cvf (zgtrfs)
invert	f07awf (zgetri)

Table 5
Routines for complex general matrices

	Type of matrix and storage scheme
Operation	Hermitian positive definite	Hermitian positive definite (packed storage)	Hermitian positive definite (RFP storage)	Hermitian positive definite band	Hermitian positive definite tridiagonal	Hermitian positive semidefinite
driver	f07fnf (zposv)	f07gnf (zppsv)		f07hnf (zpbsv)	f07jnf (zptsv)
expert driver	f07fpf (zposvx)	f07gpf (zppsvx)		f07hpf (zpbsvx)	f07jpf (zptsvx)
mixed precision driver	f07fqf (zcposv)
factorize	f07frf (zpotrf)	f07grf (zpptrf)	f07wrf (zpftrf)	f07hrf (zpbtrf)	f07jrf (zpttrf)	f07krf (zpstrf)
solve	f07fsf (zpotrs)	f07gsf (zpptrs)	f07wsf (zpftrs)	f07hsf (zpbtrs)	f07jsf (zpttrs)
scaling factors	f07ftf (zpoequ)	f07gtf (zppequ)
condition number	f07fuf (zpocon)	f07guf (zppcon)		f07huf (zpbcon)	f07juf (zptcon)
error estimate	f07fvf (zporfs)	f07gvf (zpprfs)		f07hvf (zpbrfs)	f07jvf (zptrfs)
invert	f07fwf (zpotri)	f07gwf (zpptri)	f07wwf (zpftri)

Table 6
Routines for complex Hermitian positive definite and positive semidefinite matrices

	Type of matrix and storage scheme
Operation	Hermitian indefinite	symmetric indefinite (packed storage)	Hermitian indefinite band	symmetric indefinite tridiagonal
driver	f07mnf (zhesv)	f07nnf (zsysv)	f07pnf (zhpsv)	f07qnf (zspsv)
expert driver	f07mpf (zhesvx)	f07npf (zsysvx)	f07ppf (zhpsvx)	f07qpf (zspsvx)
factorize	f07mrf (zhetrf)	f07nrf (zsytrf)	f07prf (zhptrf)	f07qrf (zsptrf)
solve	f07msf (zhetrs)	f07nsf (zsytrs)	f07psf (zhptrs)	f07qsf (zsptrs)
condition number	f07muf (zhecon)	f07nuf (zsycon)	f07puf (zhpcon)	f07quf (zspcon)
error estimate	f07mvf (zherfs)	f07nvf (zsyrfs)	f07pvf (zhprfs)	f07qvf (zsprfs)
invert	f07mwf (zhetri)	f07nwf (zsytri)	f07pwf (zhptri)	f07qwf (zsptri)

Table 7
Routines for complex Hermitian and symmetric indefinite matrices

	Type of matrix and storage scheme
Operation	triangular	triangular (packed storage)	triangular (RFP storage)	triangular band
solve	f07tsf (ztrtrs)	f07usf (ztptrs)		f07vsf (ztbtrs)
condition number	f07tuf (ztrcon)	f07uuf (ztpcon)		f07vuf (ztbcon)
error estimate	f07tvf (ztrrfs)	f07uvf (ztprfs)		f07vvf (ztbrfs)
invert	f07twf (ztrtri)	f07uwf (ztptri)	f07wxf (ztftri)

Table 8
Routines for complex triangular matrices

4

Functionality Index

Apply iterative refinement to the solution and compute error estimates,

after factorizing the matrix of coefficients,

complex band matrix

f07bvf (zgbrfs)

complex Hermitian indefinite matrix

f07mvf (zherfs)

complex Hermitian indefinite matrix, packed storage

f07pvf (zhprfs)

complex Hermitian positive definite band matrix

f07hvf (zpbrfs)

complex Hermitian positive definite matrix

f07fvf (zporfs)

complex Hermitian positive definite matrix, packed storage

f07gvf (zpprfs)

complex Hermitian positive definite tridiagonal matrix

f07jvf (zptrfs)

complex matrix

f07avf (zgerfs)

complex symmetric indefinite matrix

f07nvf (zsyrfs)

complex symmetric indefinite matrix, packed storage

f07qvf (zsprfs)

complex tridiagonal matrix

f07cvf (zgtrfs)

real band matrix

f07bhf (dgbrfs)

real matrix

f07ahf (dgerfs)

real symmetric indefinite matrix

f07mhf (dsyrfs)

real symmetric indefinite matrix, packed storage

f07phf (dsprfs)

real symmetric positive definite band matrix

f07hhf (dpbrfs)

real symmetric positive definite matrix

f07fhf (dporfs)

real symmetric positive definite matrix, packed storage

f07ghf (dpprfs)

real symmetric positive definite tridiagonal matrix

f07jhf (dptrfs)

real tridiagonal matrix

f07chf (dgtrfs)

Compute error estimates,

complex triangular band matrix

f07vvf (ztbrfs)

complex triangular matrix

f07tvf (ztrrfs)

complex triangular matrix, packed storage

f07uvf (ztprfs)

real triangular band matrix

f07vhf (dtbrfs)

real triangular matrix

f07thf (dtrrfs)

real triangular matrix, packed storage

f07uhf (dtprfs)

Compute row and column scalings,

complex band matrix

f07btf (zgbequ)

complex Hermitian positive definite band matrix

f07htf (zpbequ)

complex Hermitian positive definite matrix

f07ftf (zpoequ)

complex Hermitian positive definite matrix, packed storage

f07gtf (zppequ)

complex matrix

f07atf (zgeequ)

real band matrix

f07bff (dgbequ)

real matrix

f07aff (dgeequ)

real symmetric positive definite band matrix

f07hff (dpbequ)

real symmetric positive definite matrix

f07fff (dpoequ)

real symmetric positive definite matrix, packed storage

f07gff (dppequ)

Condition number estimation,

after factorizing the matrix of coefficients,

complex band matrix

f07buf (zgbcon)

complex Hermitian indefinite matrix

f07muf (zhecon)

complex Hermitian indefinite matrix, packed storage

f07puf (zhpcon)

complex Hermitian positive definite band matrix

f07huf (zpbcon)

complex Hermitian positive definite matrix

f07fuf (zpocon)

complex Hermitian positive definite matrix, packed storage

f07guf (zppcon)

complex Hermitian positive definite tridiagonal matrix

f07juf (zptcon)

complex matrix

f07auf (zgecon)

complex symmetric indefinite matrix

f07nuf (zsycon)

complex symmetric indefinite matrix, packed storage

f07quf (zspcon)

complex tridiagonal matrix

f07cuf (zgtcon)

real band matrix

f07bgf (dgbcon)

real matrix

f07agf (dgecon)

real symmetric indefinite matrix

f07mgf (dsycon)

real symmetric indefinite matrix, packed storage

f07pgf (dspcon)

real symmetric positive definite band matrix

f07hgf (dpbcon)

real symmetric positive definite matrix

f07fgf (dpocon)

real symmetric positive definite matrix, packed storage

f07ggf (dppcon)

real symmetric positive definite tridiagonal matrix

f07jgf (dptcon)

real tridiagonal matrix

f07cgf (dgtcon)

complex triangular band matrix

f07vuf (ztbcon)

complex triangular matrix

f07tuf (ztrcon)

complex triangular matrix, packed storage

f07uuf (ztpcon)

real triangular band matrix

f07vgf (dtbcon)

real triangular matrix

f07tgf (dtrcon)

real triangular matrix, packed storage

f07ugf (dtpcon)

LDL^T factorization,

complex Hermitian positive definite tridiagonal matrix

f07jrf (zpttrf)

real symmetric positive definite tridiagonal matrix

f07jdf (dpttrf)

LL^T or U^TU factorization,

complex Hermitian positive definite band matrix

f07hrf (zpbtrf)

complex Hermitian positive definite matrix

f07frf (zpotrf)

complex Hermitian positive definite matrix, packed storage

f07grf (zpptrf)

complex Hermitian positive definite matrix, RFP storage

f07wrf (zpftrf)

complex Hermitian positive semidefinite matrix

f07krf (zpstrf)

real symmetric positive definite band matrix

f07hdf (dpbtrf)

real symmetric positive definite matrix

f07fdf (dpotrf)

real symmetric positive definite matrix, packed storage

f07gdf (dpptrf)

real symmetric positive definite matrix, RFP storage

f07wdf (dpftrf)

real symmetric positive semidefinite matrix

f07kdf (dpstrf)

LU factorization,

complex band matrix

f07brf (zgbtrf)

complex matrix

f07arf (zgetrf)

complex tridiagonal matrix

f07crf (zgttrf)

real band matrix

f07bdf (dgbtrf)

real matrix

f07adf (dgetrf)

real tridiagonal matrix

f07cdf (dgttrf)

Matrix inversion,

after factorizing the matrix of coefficients,

complex Hermitian indefinite matrix

f07mwf (zhetri)

complex Hermitian indefinite matrix, packed storage

f07pwf (zhptri)

complex Hermitian positive definite matrix

f07fwf (zpotri)

complex Hermitian positive definite matrix, packed storage

f07gwf (zpptri)

complex Hermitian positive definite matrix, RFP storage

f07wwf (zpftri)

complex matrix

f07awf (zgetri)

complex symmetric indefinite matrix

f07nwf (zsytri)

complex symmetric indefinite matrix, packed storage

f07qwf (zsptri)

real matrix

f07ajf (dgetri)

real symmetric indefinite matrix

f07mjf (dsytri)

real symmetric indefinite matrix, packed storage

f07pjf (dsptri)

real symmetric positive definite matrix

f07fjf (dpotri)

real symmetric positive definite matrix, packed storage

f07gjf (dpptri)

real symmetric positive definite matrix, RFP storage

f07wjf (dpftri)

complex triangular matrix

f07twf (ztrtri)

complex triangular matrix, packed storage

f07uwf (ztptri)

complex triangular matrix, RFP storage,

expert driver

f07wxf (ztftri)

real triangular matrix

f07tjf (dtrtri)

real triangular matrix, packed storage

f07ujf (dtptri)

real triangular matrix, RFP storage,

expert driver

f07wkf (dtftri)

PLDL^TP^T or PUDU^TP^T factorization,

complex Hermitian indefinite matrix

f07mrf (zhetrf)

complex Hermitian indefinite matrix, packed storage

f07prf (zhptrf)

complex symmetric indefinite matrix

f07nrf (zsytrf)

complex symmetric indefinite matrix, packed storage

f07qrf (zsptrf)

real symmetric indefinite matrix

f07mdf (dsytrf)

real symmetric indefinite matrix, packed storage

f07pdf (dsptrf)

Solution of simultaneous linear equations,

after factorizing the matrix of coefficients,

complex band matrix

f07bsf (zgbtrs)

complex Hermitian indefinite matrix

f07msf (zhetrs)

complex Hermitian indefinite matrix, packed storage

f07psf (zhptrs)

complex Hermitian positive definite band matrix

f07hsf (zpbtrs)

complex Hermitian positive definite matrix

f07fsf (zpotrs)

complex Hermitian positive definite matrix, packed storage

f07gsf (zpptrs)

complex Hermitian positive definite matrix, RFP storage

f07wsf (zpftrs)

complex Hermitian positive definite tridiagonal matrix

f07jsf (zpttrs)

complex matrix

f07asf (zgetrs)

complex symmetric indefinite matrix

f07nsf (zsytrs)

complex symmetric indefinite matrix, packed storage

f07qsf (zsptrs)

complex tridiagonal matrix

f07csf (zgttrs)

real band matrix

f07bef (dgbtrs)

real matrix

f07aef (dgetrs)

real symmetric indefinite matrix

f07mef (dsytrs)

real symmetric indefinite matrix, packed storage

f07pef (dsptrs)

real symmetric positive definite band matrix

f07hef (dpbtrs)

real symmetric positive definite matrix

f07fef (dpotrs)

real symmetric positive definite matrix, packed storage

f07gef (dpptrs)

real symmetric positive definite matrix, RFP storage

f07wef (dpftrs)

real symmetric positive definite tridiagonal matrix

f07jef (dpttrs)

real tridiagonal matrix

f07cef (dgttrs)

expert drivers (with condition and error estimation):

complex band matrix

f07bpf (zgbsvx)

complex Hermitian indefinite matrix

f07mpf (zhesvx)

complex Hermitian indefinite matrix, packed storage

f07ppf (zhpsvx)

complex Hermitian positive definite band matrix

f07hpf (zpbsvx)

complex Hermitian positive definite matrix

f07fpf (zposvx)

complex Hermitian positive definite matrix, packed storage

f07gpf (zppsvx)

complex Hermitian positive definite tridiagonal matrix

f07jpf (zptsvx)

complex matrix

f07apf (zgesvx)

complex symmetric indefinite matrix

f07npf (zsysvx)

complex symmetric indefinite matrix, packed storage

f07qpf (zspsvx)

complex tridiagonal matrix

f07cpf (zgtsvx)

real band matrix

f07bbf (dgbsvx)

real matrix

f07abf (dgesvx)

real symmetric indefinite matrix

f07mbf (dsysvx)

real symmetric indefinite matrix, packed storage

f07pbf (dspsvx)

real symmetric positive definite band matrix

f07hbf (dpbsvx)

real symmetric positive definite matrix

f07fbf (dposvx)

real symmetric positive definite matrix, packed storage

f07gbf (dppsvx)

real symmetric positive definite tridiagonal matrix

f07jbf (dptsvx)

real tridiagonal matrix

f07cbf (dgtsvx)

simple drivers,

complex band matrix

complex Hermitian indefinite matrix

complex Hermitian indefinite matrix, packed storage

complex Hermitian positive definite band matrix

complex Hermitian positive definite matrix

complex Hermitian positive definite matrix, packed storage

complex Hermitian positive definite matrix, using mixed precision

f07fqf (zcposv)

complex Hermitian positive definite tridiagonal matrix

complex matrix

complex matrix, using mixed precision

f07aqf (zcgesv)

complex symmetric indefinite matrix

complex symmetric indefinite matrix, packed storage

complex triangular band matrix

f07vsf (ztbtrs)

complex triangular matrix

f07tsf (ztrtrs)

complex triangular matrix, packed storage

f07usf (ztptrs)

complex tridiagonal matrix

real band matrix

real matrix

real matrix, using mixed precision

f07acf (dsgesv)

real symmetric indefinite matrix

real symmetric indefinite matrix, packed storage

real symmetric positive definite band matrix

real symmetric positive definite matrix

real symmetric positive definite matrix, packed storage

real symmetric positive definite matrix, using mixed precision

f07fcf (dsposv)

real symmetric positive definite tridiagonal matrix

real triangular band matrix

f07vef (dtbtrs)

real triangular matrix

f07tef (dtrtrs)

real triangular matrix, packed storage

f07uef (dtptrs)

real tridiagonal matrix

5

Auxiliary Routines Associated with Library Routine Arguments

None.

6

Routines Withdrawn or Scheduled for Withdrawal

None.

7

References

Anderson E, Bai Z, Bischof C, Blackford S, Demmel J, Dongarra J J, Du Croz J J, Greenbaum A, Hammarling S, McKenney A and Sorensen D (1999) LAPACK Users' Guide (3rd Edition) SIAM, Philadelphia

Golub G H and Van Loan C F (1996) Matrix Computations (3rd Edition) Johns Hopkins University Press, Baltimore

Higham N J (1988) Algorithm 674: Fortran codes for estimating the one-norm of a real or complex matrix, with applications to condition estimation ACM Trans. Math. Software 14 381–396

Wilkinson J H (1965) The Algebraic Eigenvalue Problem Oxford University Press, Oxford

NAG Library Manual, Mark 26

NAG AD Library Manual, Mark 26

NAG C Library Manual, Mark 26

F07 (lapacklin) Chapter Contents

© The Numerical Algorithms Group Ltd. 2018