naginterfaces.library.sparse.real_symm_basic_setup¶

naginterfaces.library.sparse.real_symm_basic_setup(method, precon, n, tol, maxitn, anorm, sigmax, maxits, monit, sigcmp='N', norm=None, weight='N', iterm=1, sigtol=0.01)[source]¶

real_symm_basic_setup is a setup function, the first in a suite of three functions for the iterative solution of a symmetric system of simultaneous linear equations. real_symm_basic_setup must be called before the iterative solver, real_symm_basic_solver(). The third function in the suite, real_symm_basic_diag(), can be used to return additional information about the computation.

These three functions are suitable for the solution of large sparse symmetric systems of equations.

For full information please refer to the NAG Library document for f11gd

https://support.nag.com/numeric/nl/nagdoc_31/flhtml/f11/f11gdf.html

Parameters

methodstr

The iterative method to be used.

$m e t h o d ='CG'$

Conjugate gradient method (CG).

$m e t h o d ='SYMMLQ'$

Lanczos method (SYMMLQ).

$m e t h o d ='MINRES'$

Minimum residual method (MINRES).

preconstr, length 1

Determines whether preconditioning is used.

$p r e c o n ='N'$

No preconditioning.

$p r e c o n ='P'$

Preconditioning.

nint

$n$ , the order of the matrix $A$ .

tolfloat

The tolerance $τ$ for the termination criterion.

If $t o l \leq 0.0$ , $τ = m a x (\sqrt{ϵ}, \sqrt{n} ϵ)$ is used, where $ϵ$ is the machine precision.

Otherwise $τ = m a x (t o l, 10 ϵ, \sqrt{n} ϵ)$ is used.

maxitnint

The maximum number of iterations.

anormfloat

If $a n o r m > 0.0$ , the value of ${∥ A ∥}_{p}$ to be used in the termination criterion (2) ( $i t e r m = 1$ ).

If $a n o r m \leq 0.0$ , $i t e r m = 1$ and $n o r m ='1'$ or $'I'$ , ${∥ A ∥}_{1} = {∥ A ∥}_{\infty}$ is estimated internally by real_symm_basic_solver().

If $i t e r m = 2$ , $a n o r m$ is not referenced.

It has no effect if $m e t h o d ='MINRES'$ .

sigmaxfloat

If $s i g m a x > 0.0$ , the value of $σ_{1} (¯ A) = {∥ ∥ E^{- 1} A E^{- T} ∥ ∥}_{2}^{- 1}$ .

If $s i g m a x \leq 0.0$ , $σ_{1} (¯ A)$ is estimated by real_symm_basic_solver() when either $s i g c m p ='S'$ or termination criterion (3) ( $i t e r m = 2$ ) is employed, though it will be used only in the latter case.

Otherwise, or if $m e t h o d ='MINRES'$ , $s i g m a x$ is not referenced.

maxitsint

The maximum iteration number $k = m a x i t s$ for which $σ_{1} (T_{k})$ is computed by bisection (see also Notes). If $s i g c m p ='N'$ or $s i g m a x > 0.0$ , or if $m e t h o d ='MINRES'$ , $m a x i t s$ is not referenced.

Suggested value: $m a x i t s = m i n (10, n)$ when $s i g t o l$ is of the order of its default value $(0.01)$ .

monitint

If $m o n i t > 0$ , the frequency at which a monitoring step is executed by real_symm_basic_solver(): the current solution and residual iterates will be returned by real_symm_basic_solver() and a call to real_symm_basic_diag() made possible every $m o n i t$ iterations, starting from the ( $m o n i t$ )th. Otherwise, no monitoring takes place.

There are some additional computational costs involved in monitoring the solution and residual vectors when the Lanczos method (SYMMLQ) is used.

sigcmpstr, length 1, optional

Determines whether an estimate of $σ_{1} (¯ A) = {∥ ∥ E^{- 1} A E^{- T} ∥ ∥}_{2}^{- 1}$ , the largest singular value of the preconditioned matrix of the coefficients, is to be computed using the bisection method on the sequence of tridiagonal matrices ${T_{k}}$ generated during the iteration. Note that $¯ A = A$ when a preconditioner is not used.

If $s i g m a x > 0.0$ (see $s i g m a x$ ), i.e., when $σ_{1} (¯ A)$ is supplied, the value of $s i g c m p$ is ignored.

$s i g c m p ='S'$

$σ_{1} (¯ A)$ is to be computed using the bisection method.

$s i g c m p ='N'$

The bisection method is not used.

If the termination criterion (3) is used, requiring $σ_{1} (¯ A)$ , an inexpensive estimate is computed and used (see Notes).

It is not used if $m e t h o d ='MINRES'$ .

normNone or str, length 1, optional

Note: if this argument is None then a default value will be used, determined as follows: if $i t e r m = 1$ : $'I'$ ; otherwise: $'2'$ .

If $m e t h o d ='CG'$ or $'SYMMLQ'$ , $n o r m$ defines the matrix and vector norm to be used in the termination criteria.

$n o r m ='1'$

Use the $l_{1}$ norm.

$n o r m ='I'$

Use the $l_{\infty}$ norm.

$n o r m ='2'$

Use the $l_{2}$ norm.

It has no effect if $m e t h o d ='MINRES'$ .

weightstr, length 1, optional

Specifies whether a vector $w$ of user-supplied weights is to be used in the vector norms used in the computation of termination criterion (2) ( $i t e r m = 1$ ): ${∥ v ∥}_{p}^{(w)} = {∥ ∥ v^{(w)} ∥ ∥}_{p}^{(w)}$ , where $v_{i}^{(w)} = w_{i} v_{i}$ , for $i = 1, 2, \dots, n$ . The suffix $p = 1, 2, \infty$ denotes the vector norm used, as specified by the argument $n o r m$ . Note that weights cannot be used when $i t e r m = 2$ , i.e., when criterion (3) is used.

$w e i g h t ='W'$

User-supplied weights are to be used and must be supplied on initial entry to real_symm_basic_solver().

$w e i g h t ='N'$

All weights are implicitly set equal to one. Weights do not need to be supplied on initial entry to real_symm_basic_solver().

It has no effect if $m e t h o d ='MINRES'$ .

itermint, optional

Defines the termination criterion to be used.

$i t e r m = 1$

Use the termination criterion defined in (2) (both conjugate gradient and Lanczos (SYMMLQ) methods).

$i t e r m = 2$

Use the termination criterion defined in (3) (Lanczos method (SYMMLQ) only).

It has no effect if $m e t h o d ='MINRES'$ .

sigtolfloat, optional

The tolerance used in assessing the convergence of the estimate of $σ_{1} (¯ A) = {∥ ∥ ¯ A ∥ ∥}_{2}$ when the bisection method is used.

If $s i g t o l \leq 0.0$ , the default value $s i g t o l = 0.01$ is used.

The actual value used is $m a x (s i g t o l, ϵ)$ .

If $s i g c m p ='N'$ or $s i g m a x > 0.0$ , $s i g t o l$ is not referenced.

It has no effect if $m e t h o d ='MINRES'$ .

Returns

commdict, communication object: Communication structure.

Raises

NagValueError

(errno $- 14$ )

On entry, $m o n i t = ⟨ v a l u e ⟩$ and $m a x i t n = ⟨ v a l u e ⟩$ .

Constraint: $m o n i t \leq m a x i t n$ .

(errno $- 13$ )

On entry, $s i g c m p ='S'$ , $s i g m a x \leq 0.0$ , $m a x i t s = ⟨ v a l u e ⟩$ and $m a x i t n = ⟨ v a l u e ⟩$ .

Constraint: if $s i g c m p ='S'$ and $s i g m a x \leq 0.0$ , $m a x i t s \leq m a x i t n$ .

(errno $- 13$ )

On entry, $s i g c m p ='S'$ , $s i g m a x \leq 0.0$ and $m a x i t s = ⟨ v a l u e ⟩$ .

Constraint: if $s i g c m p ='S'$ and $s i g m a x \leq 0.0$ , $m a x i t s \geq 1$ .

(errno $- 12$ )

On entry, $s i g c m p ='S'$ , $s i g m a x \leq 0.0$ and $s i g t o l = ⟨ v a l u e ⟩$ .

Constraint: if $s i g c m p ='S'$ and $s i g m a x \leq 0.0$ , $s i g t o l < 1.0$ .

(errno $- 10$ )

On entry, $i t e r m = 1$ , $n o r m ='2'$ and $a n o r m = ⟨ v a l u e ⟩$ .

Constraint: if $i t e r m = 1$ and $n o r m ='2'$ , $a n o r m > 0.0$ .

(errno $- 9$ )

On entry, $m a x i t n = ⟨ v a l u e ⟩$ .

Constraint: $m a x i t n > 0$ .

(errno $- 8$ )

On entry, $t o l = ⟨ v a l u e ⟩$ .

Constraint: $t o l < 1.0$ .

(errno $- 7$ )

On entry, $n = ⟨ v a l u e ⟩$ .

Constraint: $n > 0$ .

(errno $- 6$ )

On entry, $i t e r m = 2$ and $m e t h o d ='CG'$ .

Constraint: if $i t e r m = 2$ , $m e t h o d \neq'CG'$ .

(errno $- 6$ )

On entry, $i t e r m = 2$ and $n o r m = ⟨ v a l u e ⟩$ .

Constraint: if $i t e r m = 2$ , $n o r m ='2'$ .

(errno $- 6$ )

On entry, $i t e r m = 2$ and $w e i g h t = ⟨ v a l u e ⟩$ .

Constraint: if $i t e r m = 2$ , $w e i g h t ='N'$ .

(errno $- 6$ )

On entry, $i t e r m = ⟨ v a l u e ⟩$ .

Constraint: $i t e r m = 1$ or $2$ .

(errno $- 5$ )

On entry, $w e i g h t = ⟨ v a l u e ⟩$ .

Constraint: $w e i g h t ='N'$ or $'W'$ .

(errno $- 4$ )

On entry, $n o r m = ⟨ v a l u e ⟩$ .

Constraint: $n o r m ='1'$ , $'I'$ or $'2'$ .

(errno $- 3$ )

On entry, $s i g c m p = ⟨ v a l u e ⟩$ .

Constraint: $s i g c m p ='S'$ or $'N'$ .

(errno $- 2$ )

On entry, $p r e c o n = ⟨ v a l u e ⟩$ .

Constraint: $p r e c o n ='N'$ or $'P'$ .

(errno $- 1$ )

On entry, $m e t h o d = ⟨ v a l u e ⟩$ .

Constraint: $m e t h o d ='CG'$ , $'SYMMLQ'$ or $'MINRES'$ .

(errno $1$ )

real_symm_basic_setup has been called out of sequence: either real_symm_basic_setup has been called twice or real_symm_basic_solver() has not terminated its current task.

Notes

The suite consisting of the functions real_symm_basic_setup, real_symm_basic_solver() and real_symm_basic_diag() is designed to solve the symmetric system of simultaneous linear equations $A x = b$ of order $n$ , where $n$ is large and the matrix of the coefficients $A$ is sparse.

real_symm_basic_setup is a setup function which must be called before real_symm_basic_solver(), the iterative solver. The third function in the suite, real_symm_basic_diag() can be used to return additional information about the computation. One of the following methods can be used:

Conjugate Gradient Method (CG)

For this method (see Hestenes and Stiefel (1952), Golub and Van Loan (1996), Barrett et al. (1994) and Dias da Cunha and Hopkins (1994)), the matrix $A$ should ideally be positive definite. The application of the Conjugate Gradient method to indefinite matrices may lead to failure or to lack of convergence.
Lanczos Method (SYMMLQ)

This method, based upon the algorithm SYMMLQ (see Paige and Saunders (1975) and Barrett et al. (1994)), is suitable for both positive definite and indefinite matrices. It is more robust than the Conjugate Gradient method but less efficient when $A$ is positive definite.
Minimum Residual Method (MINRES)

This method may be used when the matrix is indefinite. It seeks to reduce the norm of the residual at each iteration and often takes fewer iterations than the other methods. It does however require slightly more memory.

The CG and SYMMLQ methods start from the residual $r_{0} = b - A x_{0}$ , where $x_{0}$ is an initial estimate for the solution (often $x_{0} = 0$ ), and generate an orthogonal basis for the Krylov subspace $s p a n {A^{k} r_{0}}$ , for $k = 0, 1, \dots,$ , by means of three-term recurrence relations (see Golub and Van Loan (1996)). A sequence of symmetric tridiagonal matrices ${T_{k}}$ is also generated. Here and in the following, the index $k$ denotes the iteration count. The resulting symmetric tridiagonal systems of equations are usually more easily solved than the original problem. A sequence of solution iterates ${x_{k}}$ is thus generated such that the sequence of the norms of the residuals ${∥ r_{k} ∥}$ converges to a required tolerance. Note that, in general, the convergence is not monotonic.

In exact arithmetic, after $n$ iterations, this process is equivalent to an orthogonal reduction of $A$ to symmetric tridiagonal form, $T_{n} = Q^{T} A Q$ ; the solution $x_{n}$ would thus achieve exact convergence. In finite-precision arithmetic, cancellation and round-off errors accumulate causing loss of orthogonality. These methods must, therefore, be viewed as genuinely iterative methods, able to converge to a solution within a prescribed tolerance.

The orthogonal basis is not formed explicitly in either method. The basic difference between the Conjugate Gradient and Lanczos methods lies in the method of solution of the resulting symmetric tridiagonal systems of equations: the conjugate gradient method is equivalent to carrying out an $L D L^{T}$ (Cholesky) factorization whereas the Lanczos method (SYMMLQ) uses an $L Q$ factorization.

Faster convergence for all the methods can be achieved using a preconditioner (see Golub and Van Loan (1996) and Barrett et al. (1994)). A preconditioner maps the original system of equations onto a different system, say

¯ A ¯ x = ¯ b,

with, hopefully, better characteristics with respect to its speed of convergence: for example, the condition number of the matrix of the coefficients can be improved or eigenvalues in its spectrum can be made to coalesce. An orthogonal basis for the Krylov subspace $s p a n {{¯ A}^{k} {¯ r}_{0}}$ , for $k = 0, 1, \dots,$ , is generated and the solution proceeds as outlined above. The algorithms used are such that the solution and residual iterates of the original system are produced, not their preconditioned counterparts. Note that an unsuitable preconditioner or no preconditioning at all may result in a very slow rate, or lack, of convergence. However, preconditioning involves a trade-off between the reduction in the number of iterations required for convergence and the additional computational costs per iteration. Also, setting up a preconditioner may involve non-negligible overheads.

A preconditioner must be symmetric and positive definite, i.e., representable by $M = E E^{T}$ , where $M$ is nonsingular, and such that $¯ A = E^{- 1} A E^{- T} \sim I_{n}$ in (1), where $I_{n}$ is the identity matrix of order $n$ . Also, we can define $¯ r = E^{- 1} r$ and $¯ x = E^{T} x$ . These are formal definitions, used only in the design of the algorithms; in practice, only the means to compute the matrix-vector products $v = A u$ and to solve the preconditioning equations $M v = u$ are required, that is, explicit information about $M$ , $E$ or their inverses is not required at any stage.

The first termination criterion

{∥ r_{k} ∥}_{p} \leq τ ({∥ b ∥}_{p} + {∥ A ∥}_{p} \times {∥ x_{k} ∥}_{p})

is available for both conjugate gradient and Lanczos (SYMMLQ) methods. In (2), $p = 1, \infty$ or $2$ and $τ$ denotes a user-specified tolerance subject to $m a x (10, \sqrt{n}) ϵ \leq τ < 1$ , where $ϵ$ is the machine precision. Facilities are provided for the estimation of the norm of the matrix of the coefficients ${∥ A ∥}_{1} = {∥ A ∥}_{\infty}$ , when this is not known in advance, used in (2), by applying Higham’s method (see Higham (1988)). Note that ${∥ A ∥}_{2}$ cannot be estimated internally. This criterion uses an error bound derived from backward error analysis to ensure that the computed solution is the exact solution of a problem as close to the original as the termination tolerance requires. Termination criteria employing bounds derived from forward error analysis could be used, but any such criteria would require information about the condition number $κ (A)$ which is not easily obtainable.

The second termination criterion

{∥ {¯ r}_{k} ∥}_{2} \leq τ m a x (1.0, {∥ b ∥}_{2} / {∥ r_{0} ∥}_{2}) ({∥ {¯ r}_{0} ∥}_{2} + σ_{1} (¯ A) \times {∥ Δ {¯ x}_{k} ∥}_{2})

is available only for the Lanczos method (SYMMLQ). In (3), $σ_{1} (¯ A) = {∥ ∥ ¯ A ∥ ∥}_{2}$ is the largest singular value of the (preconditioned) iteration matrix $¯ A$ . This termination criterion monitors the progress of the solution of the preconditioned system of equations and is less expensive to apply than criterion (2). When $σ_{1} (¯ A)$ is not supplied, facilities are provided for its estimation by $σ_{1} (¯ A) \sim {m a x}_{k} σ_{1} (T_{k})$ . The interlacing property $σ_{1} (T_{k - 1}) \leq σ_{1} (T_{k})$ and Gerschgorin’s theorem provide lower and upper bounds from which $σ_{1} (T_{k})$ can be easily computed by bisection. Alternatively, the less expensive estimate $σ_{1} (¯ A) \sim {m a x}_{k} {∥ T_{k} ∥}_{1}$ can be used, where $σ_{1} (¯ A) \leq {∥ T_{k} ∥}_{1}$ by Gerschgorin’s theorem. Note that only order of magnitude estimates are required by the termination criterion.

Termination criterion (2) is the recommended choice, despite its (small) additional costs per iteration when using the Lanczos method (SYMMLQ). Also, if the norm of the initial estimate is much larger than the norm of the solution, that is, if $∥ x_{0} ∥ ≫ ∥ x ∥$ , a dramatic loss of significant digits could result in complete lack of convergence. The use of criterion (2) will enable the detection of such a situation, and the iteration will be restarted at a suitable point. No such restart facilities are provided for criterion (3).

Optionally, a vector $w$ of user-specified weights can be used in the computation of the vector norms in termination criterion (2), i.e., ${∥ v ∥}_{p}^{(w)} = {∥ ∥ v^{(w)} ∥ ∥}_{p}^{(w)}$ , where ${(v^{(w)})}_{i} = w_{i} v_{i}$ , for $i = 1, 2, \dots, n$ . Note that the use of weights increases the computational costs.

The MINRES algorithm terminates when the norm of the residual of the preconditioned system $F$ , ${∥ F ∥}_{2} \leq τ \times {∥ ∥ ¯ A ∥ ∥}_{2} \times {∥ x_{k} ∥}_{2}$ , where $¯ A$ is the preconditioned matrix.

The termination criteria discussed are not robust in the presence of a non-trivial nullspace of $A$ , i.e., when $A$ is singular. It is then possible for ${∥ x_{k} ∥}_{p}$ to grow without limit, spuriously satisfying the termination criterion. If singularity is suspected, more robust functions can be found in submodule opt.

The sequence of calls to the functions comprising the suite is enforced: first, the setup function real_symm_basic_setup must be called, followed by the solver real_symm_basic_solver(). The diagnostic function real_symm_basic_diag() can be called either when real_symm_basic_solver() is carrying out a monitoring step or after real_symm_basic_solver() has completed its tasks. Incorrect sequencing will raise an error condition.

References

Barrett, R, Berry, M, Chan, T F, Demmel, J, Donato, J, Dongarra, J, Eijkhout, V, Pozo, R, Romine, C and Van der Vorst, H, 1994, Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, SIAM, Philadelphia

Dias da Cunha, R and Hopkins, T, 1994, PIM 1.1 — the parallel iterative method package for systems of linear equations user’s guide — Fortran 77 version, Technical Report, Computing Laboratory, University of Kent at Canterbury, Kent, UK

Golub, G H and Van Loan, C F, 1996, Matrix Computations, (3rd Edition), Johns Hopkins University Press, Baltimore

Hestenes, M and Stiefel, E, 1952, Methods of conjugate gradients for solving linear systems, J. Res. Nat. Bur. Stand. (49), 409–436

Higham, N J, 1988, FORTRAN codes for estimating the one-norm of a real or complex matrix, with applications to condition estimation, ACM Trans. Math. Software (14), 381–396

Paige, C C and Saunders, M A, 1975, Solution of sparse indefinite systems of linear equations, SIAM J. Numer. Anal. (12), 617–629

NAG and Python

Return to Front

naginterfaces.library.sparse.real_symm_basic_setup¶

naginterfaces.library.sparse.real_​symm_​basic_​setup¶

naginterfaces.library.sparse.real_symm_basic_setup¶