NAG Library Routine Document

f02wdf  (withdraw_real_gen_qu_svd)

 Contents

    1  Purpose
    7  Accuracy

1
Purpose

f02wdf returns the Householder QU factorization of a real rectangular m by n mn matrix A. Further, on request or if A is not of full rank, part or all of the singular value decomposition of A is returned.

2
Specification

Fortran Interface
Subroutine f02wdf ( m, n, a, lda, wantb, b, tol, svd, irank, z, sv, wantr, r, ldr, wantpt, pt, ldpt, work, lwork, ifail)
Integer, Intent (In):: m, n, lda, ldr, ldpt, lwork
Integer, Intent (Inout):: ifail
Integer, Intent (Out):: irank
Real (Kind=nag_wp), Intent (In):: tol
Real (Kind=nag_wp), Intent (Inout):: a(lda,n), b(m), r(ldr,n), pt(ldpt,n)
Real (Kind=nag_wp), Intent (Out):: z(n), sv(n), work(lwork)
Logical, Intent (In):: wantb, wantr, wantpt
Logical, Intent (Inout):: svd
C Header Interface
#include nagmk26.h
void  f02wdf_ ( const Integer *m, const Integer *n, double a[], const Integer *lda, const logical *wantb, double b[], const double *tol, logical *svd, Integer *irank, double z[], double sv[], const logical *wantr, double r[], const Integer *ldr, const logical *wantpt, double pt[], const Integer *ldpt, double work[], const Integer *lwork, Integer *ifail)

3
Description

The real m by n mn matrix A is first factorized as
A=Q U 0 ,  
where Q is an m by m orthogonal matrix and U is an n by n upper triangular matrix.
If either U is singular or svd is supplied as .TRUE., then the singular value decomposition (SVD) of U is obtained so that U is factorized as
U=RDPT,  
where R and P are n by n orthogonal matrices and D is the n by n diagonal matrix
D=diagsv1,sv2,,svn,  
with sv1sv2svn0.
Note that the SVD of A is then given by
A=Q1 D 0 PT  where  Q1=Q R 0 0 I ,  
the diagonal elements of D being the singular values of A.
The option to form a vector QTb, or if appropriate Q1T b, is also provided.
The rank of the matrix A, based upon a user-supplied argument tol, is also returned.
The QU factorization of A is obtained by Householder transformations. To obtain the SVD of U the matrix is first reduced to bidiagonal form by means of plane rotations and then the QR algorithm is used to obtain the SVD of the bidiagonal form.

4
References

Wilkinson J H (1978) Singular Value Decomposition – Basic Aspects Numerical Software – Needs and Availability (ed D A H Jacobs) Academic Press

5
Arguments

1:     m – IntegerInput
On entry: m, the number of rows of the matrix A.
Constraint: mn.
2:     n – IntegerInput
On entry: n, the number of columns of the matrix A.
Constraint: 1nm.
3:     aldan – Real (Kind=nag_wp) arrayInput/Output
On entry: the leading m by n part of a must contain the matrix to be factorized.
On exit: the leading m by n part of a, together with the n-element vector z, contains details of the Householder QU factorization.
Details of the storage of the QU factorization are given in Section 9.4.
4:     lda – IntegerInput
On entry: the first dimension of the array a as declared in the (sub)program from which f02wdf is called.
Constraint: ldam.
5:     wantb – LogicalInput
On entry: must be .TRUE. if QTb or Q1Tb is required.
If on entry wantb=.FALSE., b is not referenced.
6:     bm – Real (Kind=nag_wp) arrayInput/Output
On entry: if wantb is supplied as .TRUE., b must contain the m element vector b. Otherwise, b is not referenced.
On exit: contains Q1Tb if svd is returned as .TRUE. and QTb if svd is returned as .FALSE..
7:     tol – Real (Kind=nag_wp)Input
On entry: must specify a relative tolerance to be used to determine the rank of A. tol should be chosen as approximately the largest relative error in the elements of A. For example, if the elements of A are correct to about 4 significant figures, tol should be set to about 5×10-4. See Section 9.3 for a description of how tol is used to determine rank.
If tol is outside the range ε,1.0, where ε is the machine precision, the value ε is used in place of tol. For most problems this is unreasonably small.
8:     svd – LogicalInput/Output
On entry: must be .TRUE. if the singular values are to be found even if A is of full rank.
If before entry, svd=.FALSE. and A is determined to be of full rank, only the QU factorization of A is computed.
On exit: is returned as .FALSE. if only the QU factorization of A has been obtained and is returned as .TRUE. if the singular values of A have been obtained.
9:     irank – IntegerOutput
On exit: returns the rank of the matrix A. (It should be noted that it is possible for irank to be returned as n and svd to be returned as .TRUE., even if svd was supplied as .FALSE.. This means that the matrix U only just failed the test for nonsingularity.)
10:   zn – Real (Kind=nag_wp) arrayOutput
On exit: the n-element vector z contains some details of the Householder transformations. See Section 9.4 for further information.
11:   svn – Real (Kind=nag_wp) arrayOutput
On exit: if svd is returned as .TRUE., sv contains the n singular values of A arranged in descending order.
12:   wantr – LogicalInput
On entry: must be .TRUE. if the orthogonal matrix R is required when the singular values are computed.
If on entry wantr=.FALSE., r is not referenced.
13:   rldrn – Real (Kind=nag_wp) arrayOutput
Note: the second dimension of the array r must be at least n if wantr=.TRUE., and at least 1 otherwise.
On exit: if svd is returned as .TRUE. and wantr was supplied as .TRUE., the leading n by n part of r will contain the left-hand orthogonal matrix of the svd of U.
14:   ldr – IntegerInput
On entry: the first dimension of the array r as declared in the (sub)program from which f02wdf is called.
Constraints:
  • if wantr=.TRUE., ldrn;
  • otherwise ldr1.
15:   wantpt – LogicalInput
On entry: must be .TRUE. if the orthogonal matrix PT is required when the singular values are computed.
Note that if svd is returned as .TRUE., pt is referenced even if wantpt is supplied as .FALSE., but see argument pt.
16:   ptldptn – Real (Kind=nag_wp) arrayOutput
On exit: if svd is returned as .TRUE. and wantpt was supplied as .TRUE., the leading n by n part of pt contains the orthogonal matrix PT.
If svd is returned as .TRUE., but wantpt was supplied as .FALSE., the leading n by n part of pt is used for internal workspace.
17:   ldpt – IntegerInput
On entry: the first dimension of the array pt as declared in the (sub)program from which f02wdf is called.
Constraint: ldptn.
18:   worklwork – Real (Kind=nag_wp) arrayOutput
On exit: if svd is returned as .FALSE., work1 contains the condition number UEU-1E of the upper triangular matrix U.
If svd is returned as .TRUE., work1 will contain the total number of iterations taken by the QR algorithm.
The rest of the array is used as workspace and so contains no meaningful information.
19:   lwork – IntegerInput
On entry: the dimension of the array work as declared in the (sub)program from which f02wdf is called.
Constraint: lwork3×n.
20:   ifail – IntegerInput/Output
On entry: ifail must be set to 0, -1​ or ​1. If you are unfamiliar with this argument you should refer to Section 3.4 in How to Use the NAG Library and its Documentation for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value -1​ or ​1 is recommended. If the output of error messages is undesirable, then the value 1 is recommended. Otherwise, if you are not familiar with this argument, the recommended value is 0. When the value -1​ or ​1 is used it is essential to test the value of ifail on exit.
On exit: ifail=0 unless the routine detects an error or a warning has been flagged (see Section 6).

6
Error Indicators and Warnings

If on entry ifail=0 or -1, explanatory error messages are output on the current error message unit (as defined by x04aaf).
Errors or warnings detected by the routine:
ifail=1
On entry,n<1,
orm<n,
orlda<m,
orldr<n when wantr=.TRUE.,
orldpt<n 
orlwork<3×n.
(The routine only checks ldr if wantr is supplied as .TRUE..)
ifail>1
The QR algorithm has failed to converge to the singular values in 50×n iterations. In this case sv1,sv2,,svifail-1 may not have been correctly found and the remaining singular values may not be the smallest singular values. The matrix A has nevertheless been factorized as A=Q1CPT, where C is an upper bidiagonal matrix with sv1,sv2,,svn as its diagonal elements and work2,work3,,workn as its superdiagonal elements.
This failure cannot occur if svd is returned as .FALSE. and in any case is extremely rare.
ifail=-99
An unexpected error has been triggered by this routine. Please contact NAG.
See Section 3.9 in How to Use the NAG Library and its Documentation for further information.
ifail=-399
Your licence key may have expired or may not have been installed correctly.
See Section 3.8 in How to Use the NAG Library and its Documentation for further information.
ifail=-999
Dynamic memory allocation failed.
See Section 3.7 in How to Use the NAG Library and its Documentation for further information.

7
Accuracy

The computed factors Q, U, R, D and PT satisfy the relations
Q U 0 =A+E,  
Q R 0 0 I D 0 PT=A+F  
where E2c1ε A2, F2c2ε A2,
ε being the machine precision and c1 and c2 are modest functions of m and n. Note that A2=sv1.

8
Parallelism and Performance

f02wdf is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
f02wdf makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the Users' Note for your implementation for any additional implementation-specific information.

9
Further Comments

9.1
Timing

The time taken by f02wdf to obtain the Householder QU factorization is approximately proportional to n23m-n.
The additional time taken to obtain the singular value decomposition is approximately proportional to n3, where the constant of proportionality depends upon whether or not the orthogonal matrices R and PT are required.

9.2
General Remarks

Singular vectors associated with a zero or multiple singular value, are not uniquely determined, even in exact arithmetic, and very different results may be obtained if they are computed on different machines.
Unless otherwise stated in the Users' Note for your implementation, the routine may be called with the same array for arguments z and sv, in which case, if svd is returned as .TRUE., the singular values will overwrite the original contents of z; also, if wantpt=.FALSE., it may be called with the same array for arguments r and pt. However this is not standard Fortran, and may not work on all systems.
This routine is called by the least squares routine f04jgf.

9.3
Determining the Rank of A

Following the QU factorization of A, if svd is supplied as .FALSE., then the condition number of U given by
CU=UF U-1F  
is found, where .F denotes the Frobenius norm, and if CU is such that
CU×tol>1.0  
then U is regarded as singular and the singular values of A are computed. If this test is not satisfied, then the rank of A is set to n. Note that if svd is supplied as .TRUE. then this test is omitted.
When the singular values are computed, then the rank of A, r, is returned as the largest integer such that
svr>tol×sv1,  
unless sv1=0 in which case r is returned as zero. That is, singular values which satisfy svitol×sv1 are regarded as negligible because relative perturbations of order tol can make such singular values zero.

9.4
Storage Details of the QU Factorization

The kth Householder transformation matrix, Tk, used in the QU factorization is chosen to introduce the zeros into the kth column and has the form
Tk=I-2 0 u 0 uT ,  uTu=1,  
where u is an m-k+1 element vector.
In place of u the routine actually computes the vector z given by
z=2u1u.  
The first element of z is stored in zk and the remaining elements of z are overwritten on the subdiagonal elements of the kth column of a. The upper triangular matrix U is overwritten on the n by n upper triangular part of a.

10
Example

This example obtains the rank and the singular value decomposition of the 6 by 4 matrix A given by
A= 22.25 31.75 -38.25 65.50 20.00 26.75 28.50 -26.50 -15.25 24.25 27.75 18.50 27.25 10.00 3.00 2.00 -17.25 -30.75 11.25 7.50 17.25 30.75 -11.25 -7.50  
the value tol to be taken as 5×10-4.

10.1
Program Text

Program Text (f02wdfe.f90)

10.2
Program Data

Program Data (f02wdfe.d)

10.3
Program Results

Program Results (f02wdfe.r)

© The Numerical Algorithms Group Ltd, Oxford, UK. 2017