NAG Library Function Document
nag_ztpqrt (f08bpc)
1 Purpose
nag_ztpqrt (f08bpc) computes the factorization of a complex by triangular-pentagonal matrix.
2 Specification
#include <nag.h> |
#include <nagf08.h> |
void |
nag_ztpqrt (Nag_OrderType order,
Integer m,
Integer n,
Integer l,
Integer nb,
Complex a[],
Integer pda,
Complex b[],
Integer pdb,
Complex t[],
Integer pdt,
NagError *fail) |
|
3 Description
nag_ztpqrt (f08bpc) forms the
factorization of a complex
by
triangular-pentagonal matrix
,
where
is an upper triangular
by
matrix and
is an
by
pentagonal matrix consisting of an
by
rectangular matrix
on top of an
by
upper trapezoidal matrix
:
The upper trapezoidal matrix consists of the first rows of an by upper triangular matrix, where . If , is by rectangular; if and , is upper triangular.
A recursive, explicitly blocked,
factorization (see
nag_zgeqrt (f08apc)) is performed on the matrix
. The upper triangular matrix
, details of the unitary matrix
, and further details (the block reflector factors) of
are returned.
Typically the matrix or contains the matrix from the factorization of a subproblem and nag_ztpqrt (f08bpc) performs the update operation from the inclusion of matrix .
For example, consider the
factorization of an
by
matrix
with
:
,
, where
is
by
upper triangular and
is
by
rectangular (this can be performed by
nag_zgeqrt (f08apc)). Given an initial least-squares problem
where
and
are
by
matrices, we have
.
Now, adding an additional
rows to the original system gives the augmented least squares problem
where
is an
by
matrix formed by adding
rows on top of
and
is an
by
matrix formed by adding
rows on top of
.
nag_ztpqrt (f08bpc) can then be used to perform the factorization of the pentagonal matrix ; the by matrix will be zero on input and contain on output.
In the case where is by , , is by upper triangular (forming ) on top of rows of zeros (forming first rows of ). Augmentation is then performed by adding rows to the bottom of with .
4 References
Elmroth E and Gustavson F (2000) Applying Recursion to Serial and Parallel Factorization Leads to Better Performance IBM Journal of Research and Development. (Volume 44) 4 605–624
Golub G H and Van Loan C F (2012) Matrix Computations (4th Edition) Johns Hopkins University Press, Baltimore
5 Arguments
- 1:
order – Nag_OrderTypeInput
-
On entry: the
order argument specifies the two-dimensional storage scheme being used, i.e., row-major ordering or column-major ordering. C language defined storage is specified by
. See
Section 3.2.1.3 in the Essential Introduction for a more detailed explanation of the use of this argument.
Constraint:
or .
- 2:
m – IntegerInput
On entry: , the number of rows of the matrix .
Constraint:
.
- 3:
n – IntegerInput
On entry: , the number of columns of the matrix and the order of the upper triangular matrix .
Constraint:
.
- 4:
l – IntegerInput
-
On entry: , the number of rows of the trapezoidal part of (i.e., ).
Constraint:
.
- 5:
nb – IntegerInput
On entry: the explicitly chosen block-size to be used in the algorithm for computing the
factorization. See
Section 9 for details.
- 6:
a[] – ComplexInput/Output
-
Note: the dimension,
dim, of the array
a
must be at least
.
The
th element of the matrix
is stored in
- when ;
- when .
On entry: the by upper triangular matrix .
On exit: the upper triangle is overwritten by the corresponding elements of the by upper triangular matrix .
- 7:
pda – IntegerInput
-
On entry: the stride separating row or column elements (depending on the value of
order) in the array
a.
Constraint:
.
- 8:
b[] – ComplexInput/Output
-
Note: the dimension,
dim, of the array
b
must be at least
- when
;
- when
.
The
th element of the matrix
is stored in
- when ;
- when .
On entry: the by pentagonal matrix composed of an by rectangular matrix above an by upper trapezoidal matrix .
On exit: details of the unitary matrix .
- 9:
pdb – IntegerInput
-
On entry: the stride separating row or column elements (depending on the value of
order) in the array
b.
Constraints:
- if ,
;
- if , .
- 10:
t[] – ComplexOutput
-
Note: the dimension,
dim, of the array
t
must be at least
- when
;
- when
.
The
th element of the matrix
is stored in
- when ;
- when .
On exit: further details of the unitary matrix
. The number of blocks is
, where
and each block is of order
nb except for the last block, which is of order
. For each of the blocks, an upper triangular block reflector factor is computed:
. These are stored in the
by
matrix
as
.
- 11:
pdt – IntegerInput
-
On entry: the stride separating row or column elements (depending on the value of
order) in the array
t.
Constraints:
- if ,
;
- if , .
- 12:
fail – NagError *Input/Output
-
The NAG error argument (see
Section 3.6 in the Essential Introduction).
6 Error Indicators and Warnings
- NE_ALLOC_FAIL
-
Dynamic memory allocation failed.
- NE_BAD_PARAM
-
On entry, argument had an illegal value.
- NE_INT
-
On entry, .
Constraint: .
On entry, .
Constraint: .
- NE_INT_2
-
On entry, and .
Constraint: and
if , .
On entry, and .
Constraint: .
On entry, and .
Constraint: .
On entry, and .
Constraint: .
On entry, and .
Constraint: .
On entry, and .
Constraint: .
- NE_INT_3
-
On entry, , and .
Constraint: .
- NE_INTERNAL_ERROR
-
An internal error has occurred in this function. Check the function call and any array sizes. If the call is correct then please contact
NAG for assistance.
7 Accuracy
The computed factorization is the exact factorization of a nearby matrix
, where
and
is the
machine precision.
8 Parallelism and Performance
nag_ztpqrt (f08bpc) is not threaded by NAG in any implementation.
nag_ztpqrt (f08bpc) makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the
Users' Note for your implementation for any additional implementation-specific information.
The total number of floating-point operations is approximately if or if .
The block size,
nb, used by nag_ztpqrt (f08bpc) is supplied explicitly through the interface. For moderate and large sizes of matrix, the block size can have a marked effect on the efficiency of the algorithm with the optimal value being dependent on problem size and platform. A value of
is likely to achieve good efficiency and it is unlikely that an optimal value would exceed
.
To apply
to an arbitrary complex rectangular matrix
, nag_ztpqrt (f08bpc) may be followed by a call to
nag_ztpmqrt (f08bqc). For example,
nag_ztpmqrt(Nag_ColMajor,Nag_LeftSide,Nag_Trans,m,p,n,l,nb,b,pdb,
t,pdt,c,pdc,&c(n+1,1),ldc,&fail)
forms
, where
is
by
.
To form the unitary matrix
explicitly set
, initialize
to the identity matrix and make a call to
nag_ztpmqrt (f08bqc) as above.
10 Example
This example finds the basic solutions for the linear least squares problems
where
and
are the columns of the matrix
,
A
factorization is performed on the first
rows of
using
nag_zgeqrt (f08apc) after which the first
rows of
are updated by applying
using
nag_zgemqrt (f08aqc). The remaining row is added by performing a
update using nag_ztpqrt (f08bpc);
is updated by applying the new
using
nag_ztpmqrt (f08bqc); the solution is finally obtained by triangular solve using
from the updated
.
10.1 Program Text
Program Text (f08bpce.c)
10.2 Program Data
Program Data (f08bpce.d)
10.3 Program Results
Program Results (f08bpce.r)