Integer, Intent (In)	::	m, n, nrhs, lda, ldb, lwork
Integer, Intent (Inout)	::	iwork(*)
Integer, Intent (Out)	::	rank, info
Real (Kind=nag_wp), Intent (In)	::	rcond
Real (Kind=nag_wp), Intent (Inout)	::	a(lda,), b(ldb,), s(*)
Real (Kind=nag_wp), Intent (Out)	::	work(max(1,lwork))

C Header Interface

#include <nag.h>

void	f08kcf_ (const Integer m, const Integer n, const Integer nrhs, double a[], const Integer lda, double b[], const Integer ldb, double s[], const double rcond, Integer rank, double work[], const Integer lwork, Integer iwork[], Integer *info)

The routine may be called by the names f08kcf, nagf_lapackeig_dgelsd or its LAPACK name dgelsd.

3 Description

f08kcf uses the singular value decomposition (SVD) of

A

, where

A

is a real

m

n

matrix which may be rank-deficient.

Several right-hand side vectors

b

and solution vectors

x

can be handled in a single call; they are stored as the columns of the

m

r

right-hand side matrix

B

and the

n

r

solution matrix

X

The problem is solved in three steps:

1.reduce the coefficient matrix $A$ to bidiagonal form with Householder transformations, reducing the original problem into a ‘bidiagonal least squares problem’ (BLS);
2.solve the BLS using a divide-and-conquer approach;
3.apply back all the Householder transformations to solve the original least squares problem.

The effective rank of

A

is determined by treating as zero those singular values which are less than rcond times the largest singular value.

4 References

Anderson E, Bai Z, Bischof C, Blackford S, Demmel J, Dongarra J J, Du Croz J J, Greenbaum A, Hammarling S, McKenney A and Sorensen D (1999) LAPACK Users' Guide (3rd Edition) SIAM, Philadelphia https://www.netlib.org/lapack/lug

Golub G H and Van Loan C F (1996) Matrix Computations (3rd Edition) Johns Hopkins University Press, Baltimore

5 Arguments

1: $m$ – Integer Input

On entry:

m

, the number of rows of the matrix

A

Constraint:

m \geq 0

2: $n$ – Integer Input

On entry:

n

, the number of columns of the matrix

A

Constraint:

n \geq 0

3: $nrhs$ – Integer Input

On entry:

r

, the number of right-hand sides, i.e., the number of columns of the matrices

B

and

X

Constraint:

nrhs \geq 0

4: $a (lda *)$ – Real (Kind=nag_wp) array Input/Output

Note: the second dimension of the array a must be at least

\max (1, n)

On entry: the

m

n

coefficient matrix

A

On exit: the contents of a are destroyed.

5: $lda$ – Integer Input

On entry: the first dimension of the array a as declared in the (sub)program from which f08kcf is called.

Constraint:

lda \geq \max (1, m)

6: $b (ldb *)$ – Real (Kind=nag_wp) array Input/Output

Note: the second dimension of the array b must be at least

\max (1, nrhs)

On entry: the

m

r

right-hand side matrix

B

On exit: b is overwritten by the

n

r

solution matrix

X

. If

m \geq n

and

rank = n

, the residual sum of squares for the solution in the

i

th column is given by the sum of squares of elements

n + 1, \dots, m

in that column.

7: $ldb$ – Integer Input

On entry: the first dimension of the array b as declared in the (sub)program from which f08kcf is called.

Constraint:

ldb \geq \max (1, m, n)

8: $s (*)$ – Real (Kind=nag_wp) array Output

Note: the dimension of the array s must be at least

\max (1, \min (m, n))

On exit: the singular values of

A

in decreasing order.

9: $rcond$ – Real (Kind=nag_wp) Input

On entry: used to determine the effective rank of

A

. Singular values

s (i) \leq rcond \times s (1)

are treated as zero. If

rcond < 0

, machine precision is used instead.

10: $rank$ – Integer Output

On exit: the effective rank of

A

, i.e., the number of singular values which are greater than

rcond \times s (1)

11: $work (\max (1, lwork))$ – Real (Kind=nag_wp) array Workspace

On exit: if

info = 0

work (1)

contains the minimum value of lwork required for optimal performance.

12: $lwork$ – Integer Input

On entry: the dimension of the array work as declared in the (sub)program from which f08kcf is called.

The exact minimum amount of workspace needed depends on m, n and nrhs. As long as lwork is at least

12 r + 2 r \times smlsiz + 8 r \times nlvl + r \times nrhs + {(smlsiz + 1)}^{2},

where

smlsiz

is equal to the maximum size of the subproblems at the bottom of the computation tree (usually about

25

nlvl = \max (0, int (\log_{2} (\min (m, n) / (smlsiz + 1))) + 1)

and

r = \min (m, n)

, the code will execute correctly.

lwork = - 1

, a workspace query is assumed; the routine only calculates the optimal size of the work array and the minimum size of the iwork array, and returns these values as the first entries of the work and iwork arrays, and no error message related to lwork is issued.

Suggested value: for optimal performance, lwork should generally be larger than the minimum required as set out above. Consider increasing lwork by at least

nb \times \min (m, n)

, where

nb

is the optimal block size.

Constraint:

lwork \geq 12 r + 2 r \times smlsiz + 8 r \times nlvl + r \times nrhs + {(smlsiz + 1)}^{2}

lwork = - 1

13: $iwork (*)$ – Integer array Workspace

Note: the dimension of the array iwork must be at least

\max (1, liwork)

, where

liwork

is at least

\max (1, 3 \times \min (m, n) \times nlvl + 11 \times \min (m, n))

On exit: if

info = 0

iwork (1)

returns the minimum

liwork

14: $info$ – Integer Output

On exit:

info = 0

unless the routine detects an error (see Section 6).

6 Error Indicators and Warnings

$info < 0$: If $info = - i$ , argument $i$ had an illegal value. An explanatory message is output, and execution of the program is terminated.

$info > 0$: The algorithm for computing the SVD failed to converge; $〈value〉$ off-diagonal elements of an intermediate bidiagonal form did not converge to zero.

7 Accuracy

See Section 4.5 of Anderson et al. (1999) for details.

8 Parallelism and Performance

f08kcf is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.

f08kcf makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.

Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the Users' Note for your implementation for any additional implementation-specific information.

9 Further Comments

The complex analogue of this routine is f08kqf.

10 Example

This example solves the linear least squares problem

\min_{x} {‖b - A x‖}_{2}

for the solution,

x

, of minimum norm, where

A = (\begin{array}{r} - 0.09 & - 1.56 & - 1.48 & - 1.09 & 0.08 & - 1.59 \\ 0.14 & 0.20 & - 0.43 & 0.84 & 0.55 & - 0.72 \\ - 0.46 & 0.29 & 0.89 & 0.77 & - 1.13 & 1.06 \\ 0.68 & 1.09 & - 0.71 & 2.11 & 0.14 & 1.24 \\ 1.29 & 0.51 & - 0.96 & - 1.27 & 1.74 & 0.34 \end{array}) and b = (\begin{array}{r} 7.4 \\ 4.3 \\ - 8.1 \\ 1.8 \\ 8.7 \end{array}) .

A tolerance of

0.01

is used to determine the effective rank of

A

Note that the block size (NB) of

64

assumed in this example is not realistic for such a small problem, but should be suitable for large problems.

f08kc: FL CL CPP AD

NAG FL Interfacef08kcf (dgelsd)

▸▿ Contents

1 Purpose

2 Specification

3 Description

4 References

5 Arguments

6 Error Indicators and Warnings

7 Accuracy

8 Parallelism and Performance

9 Further Comments

10 Example

10.1 Program Text

10.2 Program Data

10.3 Program Results

NAG FL Interface
f08kcf (dgelsd)