NAG Library Routine Document

f01lhf (real_gen_blkdiag_lu)

1
Purpose

f01lhf factorizes a real almost block diagonal matrix.

2
Specification

Fortran Interface
Subroutine f01lhf ( n, nbloks, blkstr, a, lena, pivot, tol, kpivot, ifail)
Integer, Intent (In):: n, nbloks, blkstr(3,nbloks), lena
Integer, Intent (Inout):: ifail
Integer, Intent (Out):: pivot(n), kpivot
Real (Kind=nag_wp), Intent (Inout):: a(lena), tol
C Header Interface
#include <nagmk26.h>
void  f01lhf_ (const Integer *n, const Integer *nbloks, const Integer blkstr[], double a[], const Integer *lena, Integer pivot[], double *tol, Integer *kpivot, Integer *ifail)

3
Description

f01lhf factorizes a real almost block diagonal matrix, A, by row elimination with alternate row and column pivoting such that no ‘fill-in’ is produced. The code, which is derived from ARCECO described in Diaz et al. (1983), uses Level 1 and Level 2 BLAS. No three successive diagonal blocks may have columns in common and therefore the almost block diagonal matrix must have the form shown in the following diagram:
Figure 1
This routine may be followed by f04lhf, which is designed to solve sets of linear equations $AX = B$ or $A^T X = B$.

4
References

Diaz J C, Fairweather G and Keast P (1983) Fortran packages for solving certain almost block diagonal linear systems by modified alternate row and column elimination. ACM Trans. Math. Software 9, 358–375

5
Arguments

1:     n – Integer, Input
On entry: n, the order of the matrix A.
Constraint: n>0.
2:     nbloks – Integer, Input
On entry: the total number of blocks of the matrix A.
Constraint: 0 < nbloks ≤ n.
3:     blkstr(3,nbloks) – Integer array, Input
On entry: information which describes the block structure of A as follows:
  • blkstr(1,k) must contain the number of rows in the kth block, k = 1,2,…,nbloks;
  • blkstr(2,k) must contain the number of columns in the kth block, k = 1,2,…,nbloks;
  • blkstr(3,k) must contain the number of columns of overlap between the kth and (k+1)th blocks, k = 1,2,…,nbloks-1. blkstr(3,nbloks) need not be set.
The following conditions delimit the structure of A:
  • blkstr(1,k), blkstr(2,k) > 0,  k = 1,2,…,nbloks,
  • blkstr(3,k) ≥ 0,  k = 1,2,…,nbloks-1,
(there must be at least one column and one row in each block and a non-negative number of columns of overlap);
  • blkstr(3,k-1) + blkstr(3,k) ≤ blkstr(2,k),  k = 2,3,…,nbloks-1,
(the total number of columns in overlaps in each block must not exceed the number of columns in that block);
  • blkstr(2,1) ≥ blkstr(1,1),
  • $\mathrm{blkstr}(2,1) + \sum_{k=2}^{j} [\mathrm{blkstr}(2,k) - \mathrm{blkstr}(3,k-1)] \ge \sum_{k=1}^{j} \mathrm{blkstr}(1,k)$,  j = 2,3,…,nbloks-1,
  • $\sum_{k=1}^{j} [\mathrm{blkstr}(2,k) - \mathrm{blkstr}(3,k)] \le \sum_{k=1}^{j} \mathrm{blkstr}(1,k)$,  j = 1,2,…,nbloks-1,
(the index of the first column of the overlap between the jth and (j+1)th blocks must be ≤ the index of the last row of the jth block, and the index of the last column of overlap must be ≥ the index of the last row of the jth block);
  • $\sum_{k=1}^{\mathrm{nbloks}} \mathrm{blkstr}(1,k) = n$,
  • $\mathrm{blkstr}(2,1) + \sum_{k=2}^{\mathrm{nbloks}} [\mathrm{blkstr}(2,k) - \mathrm{blkstr}(3,k-1)] = n$,
(both the number of rows and the number of columns of A must equal n).
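The conditions on blkstr can be checked before calling the routine. The following is a minimal Python sketch of such a check, not part of the NAG Library: the function name check_blkstr, the representation of blkstr as a list of (rows, columns, overlap) triples and the wording of the messages are all assumptions made for illustration.

```python
def check_blkstr(n, blkstr):
    """Return a list of violated conditions; an empty list means blkstr passes.

    blkstr is a list of (rows, cols, overlap) triples, one per block,
    mirroring blkstr(1,k), blkstr(2,k), blkstr(3,k). The overlap entry of
    the last block is ignored, since it need not be set.
    """
    nb = len(blkstr)
    if not 0 < nb <= n:
        return ["need 0 < nbloks <= n"]
    rows = [b[0] for b in blkstr]
    cols = [b[1] for b in blkstr]
    ovl = [b[2] for b in blkstr]
    errors = []

    for k in range(nb):
        if rows[k] < 1 or cols[k] < 1:
            errors.append(f"block {k + 1}: needs at least one row and one column")
    for k in range(nb - 1):
        if ovl[k] < 0:
            errors.append(f"block {k + 1}: overlap must be non-negative")
    # Interior blocks: the overlaps with both neighbours must fit in the block.
    for k in range(1, nb - 1):
        if ovl[k - 1] + ovl[k] > cols[k]:
            errors.append(f"block {k + 1}: overlaps exceed the column count")

    # Conditions relating column and row indices, for j = 1, ..., nbloks-1.
    last_row = 0  # index of the last row of block j
    last_col = 0  # index of the last column of block j
    for j in range(nb - 1):
        last_row += rows[j]
        last_col += cols[j] - (ovl[j - 1] if j > 0 else 0)
        if last_col < last_row:
            errors.append(f"block {j + 1}: last column index below last row index")
        if last_col - ovl[j] > last_row:
            errors.append(f"block {j + 1}: too many columns before the overlap")

    if sum(rows) != n:
        errors.append("row counts do not sum to n")
    total_cols = cols[0] + sum(cols[k] - ovl[k - 1] for k in range(1, nb))
    if total_cols != n:
        errors.append("column counts (less overlaps) do not sum to n")
    return errors


# Hypothetical 6 x 6 structure: a 2 x 4 block sharing 2 columns with a 4 x 4 block.
print(check_blkstr(6, [(2, 4, 2), (4, 4, 0)]))  # prints []
```

The sketch collects every violation rather than stopping at the first, which is a design choice for diagnosis only; the routine itself reports such failures through ifail=1.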
4:     a(lena) – Real (Kind=nag_wp) array, Input/Output
On entry: the elements of the almost block diagonal matrix stored block by block, with each block stored column by column. The sizes of the blocks and the overlaps are defined by the argument blkstr.
If $a_{rs}$ is the first element in the kth block (row index r, column index s), then an arbitrary element $a_{ij}$ in the kth block must be stored in the array element:
$a(p_k + (j-s)\,m_k + (i-r) + 1)$
where
$p_k = \sum_{l=1}^{k-1} \mathrm{blkstr}(1,l) \times \mathrm{blkstr}(2,l)$
is the base address of the kth block, and
$m_k = \mathrm{blkstr}(1,k)$
is the number of rows of the kth block.
See Section 9 for comments on scaling.
On exit: the factorized form of the matrix.
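The storage scheme for a (block by block, each block column by column) can be illustrated with a short Python sketch. It is not NAG code: the helper names block_base_offsets, packed_index and pack_blocks are hypothetical, blkstr is assumed to be a list of (rows, columns, overlap) triples, and positions inside a block are given as 0-based row and column offsets from the block's first element.

```python
def block_base_offsets(blkstr):
    """Base address of each block: the number of elements stored before it."""
    offsets, total = [], 0
    for rows, cols, _ in blkstr:
        offsets.append(total)
        total += rows * cols
    return offsets


def packed_index(blkstr, k, row_off, col_off):
    """1-based position in a of the element of block k (1-based) at the
    given 0-based row and column offsets within that block."""
    rows, cols, _ = blkstr[k - 1]
    if not (0 <= row_off < rows and 0 <= col_off < cols):
        raise ValueError("offsets lie outside the block")
    # Column-major within the block: each full column holds `rows` elements.
    return block_base_offsets(blkstr)[k - 1] + col_off * rows + row_off + 1


def pack_blocks(blocks):
    """Flatten blocks (each a list of rows) block by block, column by column."""
    a = []
    for block in blocks:
        nrows, ncols = len(block), len(block[0])
        for j in range(ncols):
            for i in range(nrows):
                a.append(block[i][j])
    return a


# Hypothetical structure: a 2 x 4 block followed by a 4 x 4 block.
blkstr = [(2, 4, 2), (4, 4, 0)]
print(packed_index(blkstr, 2, 0, 0))  # prints 9: block 1 occupies a(1:8)
```

Overlapping columns are stored once per block that contains them, which is why the base address is a sum of full block sizes rather than a count of distinct matrix entries.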
5:     lena – Integer, Input
On entry: the dimension of the array a as declared in the (sub)program from which f01lhf is called.
Constraint: $\mathrm{lena} \ge \sum_{k=1}^{\mathrm{nbloks}} \mathrm{blkstr}(1,k) \times \mathrm{blkstr}(2,k)$.
6:     pivot(n) – Integer array, Output
On exit: details of the interchanges.
7:     tol – Real (Kind=nag_wp), Input/Output
On entry: a relative tolerance to be used to indicate whether or not the matrix is singular. For a discussion on how tol is used see Section 9. If tol is non-positive, tol is reset to 10ε, where ε is the machine precision.
On exit: unchanged unless tol ≤ 0.0 on entry, in which case it is set to 10ε.
8:     kpivot – Integer, Output
On exit: if ifail=2, kpivot contains the value k, where k is the first position on the diagonal of the matrix A where too small a pivot was detected. Otherwise kpivot is set to 0.
9:     ifail – Integer, Input/Output
On entry: ifail must be set to 0, -1 or 1. If you are unfamiliar with this argument you should refer to Section 3.4 in How to Use the NAG Library and its Documentation for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value -1 or 1 is recommended. If the output of error messages is undesirable, then the value 1 is recommended. Otherwise, if you are not familiar with this argument, the recommended value is 0. When the value -1 or 1 is used it is essential to test the value of ifail on exit.
On exit: ifail=0 unless the routine detects an error or a warning has been flagged (see Section 6).

6
Error Indicators and Warnings

If on entry ifail=0 or -1, explanatory error messages are output on the current error message unit (as defined by x04aaf).
Errors or warnings detected by the routine:
ifail=1
On entry, K=value, blkstr(2,K)=value and blkstr(1,K)=value.
Constraint: blkstr(2,K) ≥ blkstr(1,K).
On entry, K=value, blkstr(2,K)=value, blkstr(3,K)=value and blkstr(1,K)=value.
Constraint: blkstr(2,K)-blkstr(3,K) ≤ blkstr(1,K).
On entry, K=value and blkstr(1,K)=value.
Constraint: blkstr(1,K) ≥ 1.
On entry, K=value and blkstr(2,K)=value.
Constraint: blkstr(2,K) ≥ 1.
On entry, K=value and blkstr(3,K)=value.
Constraint: blkstr(3,K) ≥ 0.
On entry, K=value, blkstr(3,K)=value, blkstr(3,K-1)=value and blkstr(2,K)=value.
Constraint: blkstr(3,K)+blkstr(3,K-1) ≤ blkstr(2,K).
On entry, lena is too small. lena=value. Minimum possible dimension: value.
On entry, n=value.
Constraint: n ≥ 1.
On entry, n=value and nbloks=value.
Constraint: n ≥ nbloks.
On entry, nbloks=value.
Constraint: nbloks ≥ 1.
On entry, the following equality does not hold: blkstr(2,1)+sum(blkstr(2,k)-blkstr(3,k-1):k=2,nbloks) = n.
On entry, the following equality does not hold: sum(blkstr(1,k):k=1,nbloks) = n.
On entry, the following inequality was not satisfied for: J=value. sum(blkstr(2,k)-blkstr(3,k):k=1,J) ≤ sum(blkstr(1,k):k=1,J) ≤ blkstr(2,1)+sum(blkstr(2,k)-blkstr(3,k-1):k=2,J).
ifail=2
Factorization completed, but pivot in diagonal I was small: I=value.
ifail=-99
An unexpected error has been triggered by this routine. Please contact NAG.
See Section 3.9 in How to Use the NAG Library and its Documentation for further information.
ifail=-399
Your licence key may have expired or may not have been installed correctly.
See Section 3.8 in How to Use the NAG Library and its Documentation for further information.
ifail=-999
Dynamic memory allocation failed.
See Section 3.7 in How to Use the NAG Library and its Documentation for further information.

7
Accuracy

The accuracy of f01lhf depends on the conditioning of the matrix A.

8
Parallelism and Performance

f01lhf makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the Users' Note for your implementation for any additional implementation-specific information.

9
Further Comments

Singularity or near singularity in A is determined by the argument tol. If the absolute value of any pivot is less than tol × $a_{\max}$, where $a_{\max}$ is the maximum absolute value of an element of A, then A is said to be singular. The position on the diagonal of A of the first of any such pivots is indicated by the argument kpivot. The factorization, and the test for near singularity, will be more accurate if before entry A is scaled so that the ∞-norms of the rows and columns of A are all of approximately the same order of magnitude. (The ∞-norm is the maximum absolute value of any element in the row or column.)
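The pivot test and the scaling advice can be illustrated with a small Python sketch. It is an illustration under stated assumptions rather than the routine's implementation: the function names are hypothetical, the list of pivots is supplied by the caller (the routine does not return them in this form), the machine precision eps is passed in explicitly because its definition is implementation-dependent, and the matrix is assumed to be held as a dense list of rows.

```python
def first_small_pivot(pivots, amax, tol, eps):
    """1-based position of the first pivot with |pivot| < tol * amax, else 0.

    A non-positive tol is replaced by 10 * eps, as described for tol.
    """
    if tol <= 0.0:
        tol = 10.0 * eps
    for pos, p in enumerate(pivots, start=1):
        if abs(p) < tol * amax:
            return pos
    return 0


def inf_norm_ratio(dense):
    """Ratio of the largest to the smallest infinity-norm over all rows and
    columns of a dense matrix given as a list of rows."""
    row_norms = [max(abs(v) for v in row) for row in dense]
    col_norms = [max(abs(v) for v in col) for col in zip(*dense)]
    norms = row_norms + col_norms
    smallest = min(norms)
    return float("inf") if smallest == 0.0 else max(norms) / smallest


# Hypothetical data: the second pivot is negligible relative to amax = 4.
print(first_small_pivot([2.0, 1e-20, 3.0], 4.0, 0.0, 2.0 ** -52))  # prints 2
# A ratio far from 1 suggests rescaling rows or columns before factorizing.
print(inf_norm_ratio([[1000.0, 0.0], [0.0, 1.0]]))  # prints 1000.0
```

The return value of first_small_pivot follows the convention described for kpivot: a position on the diagonal when a small pivot is found, and 0 otherwise.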

10
Example

This example solves the set of linear equations Ax=b where
A= -1.00 -0.98 -0.79 -0.15 -1.00 -0.25 -0.87 0.35 0.78 0.31 -0.85 0.89 -0.69 -0.98 -0.76 -0.82 0.12 -0.01 0.75 0.32 -1.00 -0.53 -0.83 -0.98 -0.58 0.04 0.87 0.38 -1.00 -0.21 -0.93 -0.84 0.37 -0.94 -0.96 -1.00 -0.99 -0.91 -0.28 -0.90 0.78 -0.93 -0.76 0.48 -0.87 -0.14 -1.00 -0.59 -0.99 0.21 -0.73 -0.48 -0.93 -0.91 0.10 -0.89 -0.68 -0.09 -0.58 -0.21 0.85 -0.39 0.79 -0.71 0.39 -0.99 -0.12 -0.75 0.17 -1.37 1.29 -1.59 1.10 -1.63 -1.01 -0.27 0.08 0.61 0.54 -0.41 0.16 -0.46 -0.67 0.56 -0.99 0.16 -0.16 0.98 -0.24 -0.41 0.40 -0.93 0.70 0.43 0.71 -0.97 -0.60 -0.30 0.18 -0.47 -0.98 -0.73 0.07 0.04 -0.25 -0.92 -0.52 -0.46 -0.58 -0.89 -0.94 -0.54 -1.00 -0.36  
and
$b = (-2.92, -1.17, -1.30, -1.17, -2.10, -4.51, -1.71, -4.59, -4.19, -0.93, -3.31, 0.52, -0.12, -0.05, -0.98, -2.07, -2.73, -1.95)^T$
The exact solution is
$x = (1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1)^T$.

10.1
Program Text

Program Text (f01lhfe.f90)

10.2
Program Data

Program Data (f01lhfe.d)

10.3
Program Results

Program Results (f01lhfe.r)