e04nf: FL CL CPP AD

NAG FL Interface
e04nff (qp_dense_solve_old)
e04nfa (qp_dense_solve)

Note: this routine uses optional parameters to define choices in the problem specification and in the details of the algorithm. If you wish to use default settings for all of the optional parameters, you need only read Sections 1 to 10 of this document. If, however, you wish to reset some or all of the settings please refer to Section 11 for a detailed description of the algorithm, to Section 12 for a detailed description of the specification of the optional parameters and to Section 13 for a detailed description of the monitoring information produced by the routine.

NAG Library Manual, Mark 27.1

Contents

1 Purpose

2 Specification

2.1 Specification for e04nff

2.2 Specification for e04nfa

3 Description

4 References

5 Arguments

6 Error Indicators and Warnings

7 Accuracy

8 Parallelism and Performance

9 Further Comments

9.1 Scaling

9.2 Description of the Printed Output

10 Example

10.1 Program Text

10.2 Program Data

10.3 Program Results

11 Algorithmic Details

11.1 Overview

11.2 Definition of Search Direction

11.3 Main Iteration

11.4 Choosing the Initial Working Set

12 Optional Parameters

12.1 Description of the Optional Parameters

13 Description of Monitoring Information

1 Purpose

e04nff/e04nfa solves general quadratic programming problems. It is not intended for large sparse problems.

e04nfa is a version of e04nff that has additional arguments in order to make it safe for use in multithreaded applications (see Section 5). The initialization routine e04wbf must have been called before calling e04nfa.

2 Specification

2.1 Specification for e04nff

Fortran Interface

Subroutine e04nff (n, nclin, a, lda, bl, bu, cvec, h, ldh, qphess, istate, x, iter, obj, ax, clamda, iwork, liwork, work, lwork, ifail)

Integer, Intent (In)	::	n, nclin, lda, ldh, liwork, lwork
Integer, Intent (Inout)	::	istate(n+nclin), ifail
Integer, Intent (Out)	::	iter, iwork(liwork)
Real (Kind=nag_wp), Intent (In)	::	a(lda,*), bl(n+nclin), bu(n+nclin), cvec(*), h(ldh,*)
Real (Kind=nag_wp), Intent (Inout)	::	x(n)
Real (Kind=nag_wp), Intent (Out)	::	obj, ax(max(1,nclin)), clamda(n+nclin), work(lwork)
External	::	qphess

C Header Interface

#include <nag.h>

void e04nff_ (const Integer *n, const Integer *nclin, const double a[], const Integer *lda, const double bl[], const double bu[], const double cvec[], const double h[], const Integer *ldh,
void (NAG_CALL *qphess)(const Integer *n, const Integer *jthcol, const double h[], const Integer *ldh, const double x[], double hx[]),
Integer istate[], double x[], Integer *iter, double *obj, double ax[], double clamda[], Integer iwork[], const Integer *liwork, double work[], const Integer *lwork, Integer *ifail)

2.2 Specification for e04nfa

Fortran Interface

Subroutine e04nfa (n, nclin, a, lda, bl, bu, cvec, h, ldh, qphess, istate, x, iter, obj, ax, clamda, iwork, liwork, work, lwork, iuser, ruser, lwsav, iwsav, rwsav, ifail)

Integer, Intent (In)	::	n, nclin, lda, ldh, liwork, lwork
Integer, Intent (Inout)	::	istate(n+nclin), iuser(*), iwsav(610), ifail
Integer, Intent (Out)	::	iter, iwork(liwork)
Real (Kind=nag_wp), Intent (In)	::	a(lda,*), bl(n+nclin), bu(n+nclin), cvec(*), h(ldh,*)
Real (Kind=nag_wp), Intent (Inout)	::	x(n), ruser(*), rwsav(475)
Real (Kind=nag_wp), Intent (Out)	::	obj, ax(max(1,nclin)), clamda(n+nclin), work(lwork)
Logical, Intent (Inout)	::	lwsav(120)
External	::	qphess

C Header Interface

#include <nag.h>

void e04nfa_ (const Integer *n, const Integer *nclin, const double a[], const Integer *lda, const double bl[], const double bu[], const double cvec[], const double h[], const Integer *ldh,
void (NAG_CALL *qphess)(const Integer *n, const Integer *jthcol, const double h[], const Integer *ldh, const double x[], double hx[], Integer iuser[], double ruser[], Integer iwsav[]),
Integer istate[], double x[], Integer *iter, double *obj, double ax[], double clamda[], Integer iwork[], const Integer *liwork, double work[], const Integer *lwork, Integer iuser[], double ruser[], logical lwsav[], Integer iwsav[], double rwsav[], Integer *ifail)

Before calling e04nfa, or either of the option setting routines e04nga or e04nha, e04wbf must be called. The specification for e04wbf is:

Fortran Interface

Subroutine e04wbf (rname, cwsav, lcwsav, lwsav, llwsav, iwsav, liwsav, rwsav, lrwsav, ifail)

Integer, Intent (In)	::	lcwsav, llwsav, liwsav, lrwsav
Integer, Intent (Inout)	::	ifail
Integer, Intent (Out)	::	iwsav(liwsav)
Real (Kind=nag_wp), Intent (Out)	::	rwsav(lrwsav)
Logical, Intent (Out)	::	lwsav(llwsav)
Character (*), Intent (In)	::	rname
Character (80), Intent (Out)	::	cwsav(lcwsav)

C Header Interface

#include <nag.h>

void	e04wbf_ (const char *rname, char cwsav[], const Integer *lcwsav, logical lwsav[], const Integer *llwsav, Integer iwsav[], const Integer *liwsav, double rwsav[], const Integer *lrwsav, Integer *ifail, const Charlen length_rname, const Charlen length_cwsav)

e04wbf should be called with rname = 'e04nfa'. lcwsav, llwsav, liwsav and lrwsav, the declared lengths of cwsav, lwsav, iwsav and rwsav respectively, must satisfy:

$lcwsav \geq 1$
$llwsav \geq 120$
$liwsav \geq 610$
$lrwsav \geq 475$

The contents of the arrays cwsav, lwsav, iwsav and rwsav must not be altered between calling routines e04nfa, e04nga, e04nha and e04wbf.

3 Description

e04nff/e04nfa is designed to solve a class of quadratic programming problems that are assumed to be stated in the following general form:

$$\underset{x \in R^{n}}{\text{minimize}} \; f (x) \quad \text{subject to} \quad l \leq \begin{pmatrix} x \\ A x \end{pmatrix} \leq u,$$

where $A$ is an $m_{L} \times n$ matrix and $f (x)$ may be specified in a variety of ways depending upon the particular problem to be solved. The available forms for $f (x)$ are listed in Table 1, in which the prefixes FP, LP and QP stand for ‘feasible point’, ‘linear programming’ and ‘quadratic programming’ respectively and $c$ is an $n$-element vector.

**Table 1**
Problem type	$f (x)$	Matrix $H$
FP	Not applicable	Not applicable
LP	$c^{T} x$	Not applicable
QP1	$\frac{1}{2} x^{T} H x$	symmetric
QP2	$c^{T} x + \frac{1}{2} x^{T} H x$	symmetric
QP3	$\frac{1}{2} x^{T} H^{T} H x$	$m \times n$ upper trapezoidal
QP4	$c^{T} x + \frac{1}{2} x^{T} H^{T} H x$	$m \times n$ upper trapezoidal

There is no restriction on $H$ or $H^{T} H$ apart from symmetry. If the quadratic function is convex, a global minimum is found; otherwise, a local minimum is found. The default problem type is QP2 and other objective functions are selected by using the optional parameter Problem Type. For problems of type FP, the objective function is omitted and the routine attempts to find a feasible point for the set of constraints.
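As a rough illustration of the default QP2 objective in Table 1, the following C sketch evaluates $c^{T} x + \frac{1}{2} x^{T} H x$ for a dense symmetric $H$ held in column-major order. The name qp2_objective and the 0-based indexing are assumptions made for this example; the helper is not part of the Library, which evaluates the objective internally.

```c
#include <assert.h>
#include <stddef.h>

/* Evaluate f(x) = c^T x + (1/2) x^T H x for a dense symmetric H.
 * h is column-major with leading dimension ldh, so H(i,j) is
 * h[i + j*ldh] with 0-based i and j (mirroring the Fortran layout).
 * Hypothetical helper for illustration only. */
double qp2_objective(int n, const double *c, const double *h, int ldh,
                     const double *x)
{
    assert(n > 0 && ldh >= n);
    double linear = 0.0, quad = 0.0;
    for (int j = 0; j < n; ++j) {
        linear += c[j] * x[j];
        for (int i = 0; i < n; ++i)
            quad += x[i] * h[(size_t)i + (size_t)j * (size_t)ldh] * x[j];
    }
    return linear + 0.5 * quad;
}
```

A helper of this kind can be handy for checking the value returned in obj against an independent evaluation at the final x.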

The constraints involving $A$ are called the general constraints. Note that upper and lower bounds are specified for all the variables and for all the general constraints. An equality constraint can be specified by setting $l_{i} = u_{i}$. If certain bounds are not present, the associated elements of $l$ or $u$ can be set to special values that will be treated as $- \infty$ or $+ \infty$. (See the description of the optional parameter Infinite Bound Size.)

The defining feature of a quadratic function $f (x)$ is that the second-derivative matrix $\nabla^{2} f (x)$ (the Hessian matrix) is constant. For QP1 and QP2 (the default), $\nabla^{2} f (x) = H$; for QP3 and QP4, $\nabla^{2} f (x) = H^{T} H$; and for the LP case, $\nabla^{2} f (x) = 0$. If $H$ is positive semidefinite, it is usually more efficient to use e04ncf/e04nca. If $H$ is defined as the zero matrix, e04nff/e04nfa will still attempt to solve the resulting linear programming problem; however, this can be accomplished more efficiently by setting the optional parameter Problem Type = LP, or by using e04mff/e04mfa instead.

You must supply an initial estimate of the solution.

In the QP case, you may supply $H$ either explicitly as an $m \times n$ matrix, or implicitly in a subroutine that computes the product $H x$ or $H^{T} H x$ for any given vector $x$.

In general, a successful run of e04nff/e04nfa will indicate one of three situations:

(i) a minimizer has been found;
(ii) the algorithm has terminated at a so-called dead-point; or
(iii) the problem has no bounded solution.

If a minimizer is found, and $\nabla^{2} f (x)$ is positive definite or positive semidefinite, e04nff/e04nfa will obtain a global minimizer; otherwise, the solution will be a local minimizer (which may or may not be a global minimizer). A dead-point is a point at which the necessary conditions for optimality are satisfied but the sufficient conditions are not. At such a point, a feasible direction of decrease may or may not exist, so that the point is not necessarily a local solution of the problem. Verification of optimality in such instances requires further information, and is in general an NP-hard problem (see Pardalos and Schnitger (1988)). Termination at a dead-point can occur only if $\nabla^{2} f (x)$ is not positive definite. If $\nabla^{2} f (x)$ is positive semidefinite, the dead-point will be a weak minimizer (i.e., with a unique optimal objective value, but an infinite set of optimal $x$).

The method used by e04nff/e04nfa (see Section 11) is most efficient when many constraints or bounds are active at the solution.

4 References

Gill P E, Hammarling S, Murray W, Saunders M A and Wright M H (1986) Users' guide for LSSOL (Version 1.0) Report SOL 86-1 Department of Operations Research, Stanford University

Gill P E and Murray W (1978) Numerically stable methods for quadratic programming Math. Programming 14 349–372

Gill P E, Murray W, Saunders M A and Wright M H (1984) Procedures for optimization problems with a mixture of bounds and general linear constraints ACM Trans. Math. Software 10 282–298

Gill P E, Murray W, Saunders M A and Wright M H (1989) A practical anti-cycling procedure for linearly constrained optimization Math. Programming 45 437–474

Gill P E, Murray W, Saunders M A and Wright M H (1991) Inertia-controlling methods for general quadratic programming SIAM Rev. 33 1–36

Gill P E, Murray W and Wright M H (1981) Practical Optimization Academic Press

Pardalos P M and Schnitger G (1988) Checking local optimality in constrained quadratic programming is NP-hard Operations Research Letters 7 33–35

5 Arguments

1: $n$ – Integer Input

On entry: $n$, the number of variables.

Constraint: $n > 0$.

2: $nclin$ – Integer Input

On entry: $m_{L}$, the number of general linear constraints.

Constraint: $nclin \geq 0$.

3: $a (lda, *)$ – Real (Kind=nag_wp) array Input

Note: the second dimension of the array a must be at least $n$ if $nclin > 0$ and at least $1$ if $nclin = 0$.

On entry: the $i$th row of a must contain the coefficients of the $i$th general linear constraint, for $i = 1, 2, \dots, m_{L}$.

If $nclin = 0$, a is not referenced.

4: $lda$ – Integer Input

On entry: the first dimension of the array a as declared in the (sub)program from which e04nff/e04nfa is called.

Constraint: $lda \geq \max (1, nclin)$.
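When a is filled from C, the Fortran declaration a(lda,*) corresponds to column-major storage, so the coefficient in row $i$ and column $j$ (0-based) sits at a[i + j*lda]. The sketch below, with the hypothetical name general_constraint_values, uses that layout to form $A x$, which can be compared with the values the routine returns in ax. It is an illustration under these assumptions, not Library code.

```c
#include <assert.h>
#include <stddef.h>

/* Form ax = A x, where A is nclin-by-n and stored column-major with
 * leading dimension lda (lda may exceed nclin; padding rows are skipped).
 * Hypothetical helper for illustration only. */
void general_constraint_values(int nclin, int n, const double *a, int lda,
                               const double *x, double *ax)
{
    assert(n > 0 && nclin >= 0);
    assert(lda >= (nclin > 1 ? nclin : 1)); /* lda >= max(1, nclin) */
    for (int i = 0; i < nclin; ++i)
        ax[i] = 0.0;
    for (int j = 0; j < n; ++j)
        for (int i = 0; i < nclin; ++i)
            ax[i] += a[(size_t)i + (size_t)j * (size_t)lda] * x[j];
}
```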

5: $bl (n + nclin)$ – Real (Kind=nag_wp) array Input

6: $bu (n + nclin)$ – Real (Kind=nag_wp) array Input

On entry: bl must contain the lower bounds and bu the upper bounds, for all the constraints in the following order. The first $n$ elements of each array must contain the bounds on the variables, and the next $m_{L}$ elements the bounds for the general linear constraints (if any). To specify a nonexistent lower bound (i.e., $l_{j} = - \infty$), set $bl (j) \leq - bigbnd$, and to specify a nonexistent upper bound (i.e., $u_{j} = + \infty$), set $bu (j) \geq bigbnd$; the default value of $bigbnd$ is $10^{20}$, but this may be changed by the optional parameter Infinite Bound Size. To specify the $j$th constraint as an equality, set $bl (j) = bu (j) = β$, say, where $| β | < bigbnd$.

Constraints:

$bl (j) \leq bu (j)$ , for $j = 1, 2, \dots, n + nclin$ ;
if $bl (j) = bu (j) = β$ , $| β | < bigbnd$ .
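The two constraints on bl and bu can be checked before the call. The following C sketch is one way to do so; the name first_invalid_bound and its return convention are assumptions for this example, and bigbnd is passed explicitly because its value depends on the optional parameter Infinite Bound Size.

```c
#include <assert.h>

/* Return 0 if all bound pairs are acceptable, otherwise the 1-based
 * index j of the first pair with bl(j) > bu(j), or with equal bounds
 * whose magnitude is not below bigbnd. Hypothetical helper. */
int first_invalid_bound(int n, int nclin, const double *bl,
                        const double *bu, double bigbnd)
{
    assert(n > 0 && nclin >= 0 && bigbnd > 0.0);
    for (int j = 0; j < n + nclin; ++j) {
        double mag = bl[j] < 0.0 ? -bl[j] : bl[j];
        if (bl[j] > bu[j])
            return j + 1;              /* inconsistent bounds */
        if (bl[j] == bu[j] && mag >= bigbnd)
            return j + 1;              /* equal bounds are infinite */
    }
    return 0;
}
```

The first n entries describe the variables and the remaining nclin entries the general constraints, so a returned index greater than n points at general constraint j - n.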

7: $cvec (*)$ – Real (Kind=nag_wp) array Input

Note: the dimension of the array cvec must be at least $n$ if the problem is of type LP, QP2 (the default) or QP4, and at least $0$ otherwise.

On entry: the coefficients of the explicit linear term of the objective function when the problem is of type LP, QP2 (the default) and QP4.

If the problem is of type FP, QP1, or QP3, cvec is not referenced.

8: $h (ldh, *)$ – Real (Kind=nag_wp) array Input

Note: the second dimension of the array h must be at least $n$ if it is to be used to store $H$ explicitly, and at least $1$ otherwise.

On entry: may be used to store the quadratic term $H$ of the QP objective function if desired. In some cases, you need not use h to store $H$ explicitly (see the specification of subroutine qphess). The elements of h are referenced only by subroutine qphess. The number of rows of $H$ is denoted by $m$, whose default value is $n$. (The optional parameter Hessian Rows may be used to specify a value of $m < n$.)

If the default version of qphess is used and the problem is of type QP1 or QP2 (the default), the first $m$ rows and columns of h must contain the leading $m \times m$ rows and columns of the symmetric Hessian matrix $H$. Only the diagonal and upper triangular elements of the leading $m$ rows and columns of h are referenced. The remaining elements need not be assigned.

If the default version of qphess is used and the problem is of type QP3 or QP4, the first $m$ rows of h must contain an $m \times n$ upper trapezoidal factor of the symmetric Hessian matrix $H^{T} H$. The factor need not be of full rank, i.e., some of the diagonal elements may be zero. However, as a general rule, the larger the dimension of the leading nonsingular sub-matrix of h, the fewer iterations will be required. Elements outside the upper trapezoidal part of the first $m$ rows of h need not be assigned.

If a non-default version of qphess is supplied, then in some cases it may be desirable to use a one-dimensional array to transmit data to qphess. (This is illustrated in the example program in Section 10 in e04ngf/e04nga.) h is then declared as an ldh by $1$ array, where $ldh \geq n \times (n + 1) / 2$.

In other situations, it may be desirable to compute $H x$ or $H^{T} H x$ without accessing h – for example, if $H$ or $H^{T} H$ is sparse or has special structure. The arguments h and ldh may then refer to any convenient array.

If the problem is of type FP or LP, h is not referenced.

9: $ldh$ – Integer Input

On entry: the first dimension of the array h as declared in the (sub)program from which e04nff/e04nfa is called.

Constraints:

if the problem is of type QP1, QP2 (the default), QP3 or QP4, $ldh \geq n$ or at least the value of the optional parameter Hessian Rows;
if the problem is of type FP or LP, $ldh \geq 1$ .

10: $qphess$ – Subroutine, supplied by the NAG Library or the user. External Procedure

In general, you need not provide a version of qphess, because a ‘default’ subroutine with name e04nfu/e54nfu is included in the Library. However, the algorithm of e04nff/e04nfa requires only the product of $H$ or $H^{T} H$ and a vector $x$; and in some cases you may obtain increased efficiency by providing a version of qphess that avoids the need to define the elements of the matrices $H$ or $H^{T} H$ explicitly.

qphess is not referenced if the problem is of type FP or LP, in which case qphess may be the routine e04nfu/e54nfu.

The specification of qphess for e04nff is:

Fortran Interface

Subroutine qphess (n, jthcol, h, ldh, x, hx)

Integer, Intent (In)	::	n, jthcol, ldh
Real (Kind=nag_wp), Intent (In)	::	h(ldh,*), x(n)
Real (Kind=nag_wp), Intent (Out)	::	hx(n)

C Header Interface

void	qphess (const Integer *n, const Integer *jthcol, const double h[], const Integer *ldh, const double x[], double hx[])

The specification of qphess for e04nfa is:

Fortran Interface

Subroutine qphess (n, jthcol, h, ldh, x, hx, iuser, ruser, iwsav)

Integer, Intent (In)	::	n, jthcol, ldh
Integer, Intent (Inout)	::	iuser(*), iwsav(610)
Real (Kind=nag_wp), Intent (In)	::	h(ldh,*), x(n)
Real (Kind=nag_wp), Intent (Inout)	::	ruser(*)
Real (Kind=nag_wp), Intent (Out)	::	hx(n)

C Header Interface

void	qphess (const Integer *n, const Integer *jthcol, const double h[], const Integer *ldh, const double x[], double hx[], Integer iuser[], double ruser[], Integer iwsav[])

1: $n$ – Integer Input

On entry: this is the same argument as supplied to this routine. See the description for the top level argument n.

2: $jthcol$ – Integer Input

On entry: specifies whether or not the vector $x$ is a column of the identity matrix.

$jthcol = j > 0$: The vector $x$ is the $j$ th column of the identity matrix, and hence $H x$ or $H^{T} H x$ is the $j$ th column of $H$ or $H^{T} H$ , respectively. This may in some cases require very little computation and qphess may be coded to take advantage of this. However special code is not necessary because $x$ is always stored explicitly in the array x.
$jthcol = 0$: $x$ has no special form.

3: $h (ldh, *)$ – Real (Kind=nag_wp) array Input

On entry: this is the same argument as supplied to this routine. See the description for the top level argument h.

4: $ldh$ – Integer Input

On entry: this is the same argument as supplied to this routine. See the description for the top level argument ldh.

5: $x (n)$ – Real (Kind=nag_wp) array Input

On entry: the vector $x$.

6: $hx (n)$ – Real (Kind=nag_wp) array Output

On exit: the product $H x$ if the problem is of type QP1 or QP2 (the default), or the product $H^{T} H x$ if the problem is of type QP3 or QP4.

Note: the following are additional arguments for specific use with e04nfa. Users of e04nff therefore need not read the remainder of this description.

7: $iuser (*)$ – Integer array User Workspace

8: $ruser (*)$ – Real (Kind=nag_wp) array User Workspace

qphess is called with the arguments iuser and ruser as supplied to e04nff/e04nfa. You should use the arrays iuser and ruser to supply information to qphess.

9: $iwsav (610)$ – Integer array Communication Array

iwsav contains information that is required by the default routine e54nfu.

qphess must either be a module subprogram USEd by, or declared as EXTERNAL in, the (sub)program from which e04nff/e04nfa is called. Arguments denoted as Input must not be changed by this procedure.

Note: qphess should not return floating-point NaN (Not a Number) or infinity values, since these are not handled by e04nff/e04nfa. If your code inadvertently does return any NaNs or infinities, e04nff/e04nfa is likely to produce unexpected results.
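To make the role of jthcol concrete, here is a minimal C sketch of a user-supplied qphess for e04nff and problem types QP1 or QP2. It assumes that $m = n$ (Hessian Rows left at its default), that only the upper triangle of h has been assigned, and that Integer can be stood in for by int; in a real program the Integer type and the NAG_CALL qualifier come from nag.h. The name qphess_upper is hypothetical and this is not the Library's default routine.

```c
#include <assert.h>
#include <stddef.h>

typedef int Integer; /* assumption: stand-in for the Integer type in nag.h */

/* Read H(i,j) (0-based) from the upper triangle only, using symmetry
 * when i > j. h is column-major with leading dimension ldh. */
static double sym_entry(const double h[], Integer ldh, Integer i, Integer j)
{
    if (i > j) { Integer t = i; i = j; j = t; }
    return h[(size_t)i + (size_t)j * (size_t)ldh];
}

/* Hypothetical qphess: returns hx = H x for symmetric H. */
void qphess_upper(const Integer *n, const Integer *jthcol, const double h[],
                  const Integer *ldh, const double x[], double hx[])
{
    assert(*n > 0 && *ldh >= *n);
    if (*jthcol > 0) {
        /* x is column jthcol of the identity, so hx is that column of H. */
        for (Integer i = 0; i < *n; ++i)
            hx[i] = sym_entry(h, *ldh, i, *jthcol - 1);
        return;
    }
    /* General x: full matrix-vector product. */
    for (Integer i = 0; i < *n; ++i) {
        double sum = 0.0;
        for (Integer j = 0; j < *n; ++j)
            sum += sym_entry(h, *ldh, i, j) * x[j];
        hx[i] = sum;
    }
}
```

The jthcol shortcut is optional, as the text notes, because x is always stored explicitly; the general branch alone would give the same result.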

11: $istate (n + nclin)$ – Integer array Input/Output

On entry: need not be set if the (default) optional parameter Cold Start is used.

If the optional parameter Warm Start has been chosen, istate specifies the desired status of the constraints at the start of the feasibility phase. More precisely, the first $n$ elements of istate refer to the upper and lower bounds on the variables, and the next $m_{L}$ elements refer to the general linear constraints (if any). Possible values for $istate (j)$ are as follows:

$istate (j)$	Meaning
0	The corresponding constraint should not be in the initial working set.
1	The constraint should be in the initial working set at its lower bound.
2	The constraint should be in the initial working set at its upper bound.
3	The constraint should be in the initial working set as an equality. This value must not be specified unless $bl (j) = bu (j)$ .

The values $- 2$, $- 1$ and $4$ are also acceptable but will be reset to zero by the routine. If e04nff/e04nfa has been called previously with the same values of n and nclin, istate already contains satisfactory information. (See also the description of the optional parameter Warm Start.) The routine also adjusts (if necessary) the values supplied in x to be consistent with istate.

Constraint: $- 2 \leq istate (j) \leq 4$, for $j = 1, 2, \dots, n + nclin$.

On exit: the status of the constraints in the working set at the point returned in x. The significance of each possible value of $istate (j)$ is as follows:

$istate (j)$	Meaning
$- 2$	The constraint violates its lower bound by more than the feasibility tolerance.
$- 1$	The constraint violates its upper bound by more than the feasibility tolerance.
$0$	The constraint is satisfied to within the feasibility tolerance, but is not in the working set.
$1$	This inequality constraint is included in the working set at its lower bound.
$2$	This inequality constraint is included in the working set at its upper bound.
$3$	This constraint is included in the working set as an equality. This value of istate can occur only when $bl (j) = bu (j)$ .
$4$	This corresponds to optimality being declared with $x (j)$ being temporarily fixed at its current value. This value of istate can occur only when $ifail = 1$ on exit.

12: $x (n)$ – Real (Kind=nag_wp) array Input/Output

On entry: an initial estimate of the solution.

On exit: the point at which e04nff/e04nfa terminated. If $ifail = 0$, $1$ or $4$, x contains an estimate of the solution.

13: $iter$ – Integer Output

On exit: the total number of iterations performed.

14: $obj$ – Real (Kind=nag_wp) Output

On exit: the value of the objective function at $x$ if $x$ is feasible, or the sum of infeasibilities at $x$ otherwise. If the problem is of type FP and $x$ is feasible, obj is set to zero.

15: $ax (\max (1, nclin))$ – Real (Kind=nag_wp) array Output

On exit: the final values of the linear constraints $A x$.

If $nclin = 0$, ax is not referenced.

16: $clamda (n + nclin)$ – Real (Kind=nag_wp) array Output

On exit: the values of the Lagrange multipliers for each constraint with respect to the current working set. The first $n$ elements contain the multipliers for the bound constraints on the variables, and the next $m_{L}$ elements contain the multipliers for the general linear constraints (if any). If $istate (j) = 0$ (i.e., constraint $j$ is not in the working set), $clamda (j)$ is zero. If $x$ is optimal, $clamda (j)$ should be non-negative if $istate (j) = 1$, non-positive if $istate (j) = 2$ and zero if $istate (j) = 4$.
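The sign rules above can be turned into a simple post-call check. The C sketch below is an illustration only: the name multipliers_consistent and the tolerance tol are assumptions introduced here, and the routine's own optimality tests are not reproduced.

```c
#include <assert.h>

/* Return 1 if every multiplier has the sign expected at an optimal
 * point for its istate value, and 0 otherwise. tol is a hypothetical
 * allowance for rounding error. */
int multipliers_consistent(int n, int nclin, const int *istate,
                           const double *clamda, double tol)
{
    assert(n > 0 && nclin >= 0 && tol >= 0.0);
    for (int j = 0; j < n + nclin; ++j) {
        double lam = clamda[j];
        switch (istate[j]) {
        case 0: /* not in the working set: multiplier is zero */
        case 4: /* variable temporarily fixed: multiplier is zero */
            if (lam > tol || lam < -tol) return 0;
            break;
        case 1: /* at lower bound: non-negative */
            if (lam < -tol) return 0;
            break;
        case 2: /* at upper bound: non-positive */
            if (lam > tol) return 0;
            break;
        case 3: /* equality: either sign is acceptable */
            break;
        default: /* -2 or -1: the constraint is violated */
            return 0;
        }
    }
    return 1;
}
```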

17: $iwork (liwork)$ – Integer array Workspace

18: $liwork$ – Integer Input

On entry: the dimension of the array iwork as declared in the (sub)program from which e04nff/e04nfa is called.

Constraint: $liwork \geq 2 \times n + 3$.

19: $work (lwork)$ – Real (Kind=nag_wp) array Workspace

20: $lwork$ – Integer Input

On entry: the dimension of the array work as declared in the (sub)program from which e04nff/e04nfa is called.

Constraints:

if the problem is of type QP2 (the default) or QP4,
- if $nclin > 0$ , $lwork \geq 2 \times n^{2} + 8 \times n + 5 \times nclin$ ;
- if $nclin = 0$ , $lwork \geq n^{2} + 8 \times n$ ;
if the problem is of type QP1 or QP3,
- if $nclin > 0$ , $lwork \geq 2 \times n^{2} + 7 \times n + 5 \times nclin$ ;
- if $nclin = 0$ , $lwork \geq n^{2} + 7 \times n$ ;
if the problem is of type LP,
- if $nclin = 0$ , $lwork \geq 8 \times n + 1$ ;
- if $nclin \geq n$ , $lwork \geq 2 \times n^{2} + 8 \times n + 5 \times nclin$ ;
- otherwise $lwork \geq 2 \times {(nclin + 1)}^{2} + 8 \times n + 5 \times nclin$ ;
if the problem is of type FP,
- if $nclin = 0$ , $lwork \geq 7 \times n + 1$ ;
- if $nclin \geq n$ , $lwork \geq 2 \times n^{2} + 7 \times n + 5 \times nclin$ ;
- otherwise $lwork \geq 2 \times {(nclin + 1)}^{2} + 7 \times n + 5 \times nclin$ .

The amounts of workspace provided and required are (by default) output on the current advisory message unit (as defined by x04abf). As an alternative to computing liwork and lwork from the formulas given above, you may prefer to obtain appropriate values from the output of a preliminary run with liwork and lwork set to $1$. (e04nff/e04nfa will then terminate with $ifail = 6$.)
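If you prefer to compute the workspace sizes directly, the formulas for liwork and lwork can be transcribed as follows. This is a sketch: the enum problem_type and the function names min_liwork and min_lwork are assumptions made for the example and do not exist in the Library.

```c
#include <assert.h>

/* Hypothetical tags for the problem types listed in Table 1. */
typedef enum { PT_FP, PT_LP, PT_QP1, PT_QP2, PT_QP3, PT_QP4 } problem_type;

/* Smallest liwork satisfying liwork >= 2*n + 3. */
long min_liwork(long n)
{
    assert(n > 0);
    return 2 * n + 3;
}

/* Smallest lwork satisfying the constraints stated for each type. */
long min_lwork(problem_type type, long n, long nclin)
{
    assert(n > 0 && nclin >= 0);
    /* LP, QP2 and QP4 use 8*n in the formulas; FP, QP1 and QP3 use 7*n. */
    long k = (type == PT_LP || type == PT_QP2 || type == PT_QP4) ? 8 : 7;

    if (type == PT_FP || type == PT_LP) {
        if (nclin == 0)
            return k * n + 1;
        if (nclin >= n)
            return 2 * n * n + k * n + 5 * nclin;
        return 2 * (nclin + 1) * (nclin + 1) + k * n + 5 * nclin;
    }
    /* QP1 to QP4 */
    if (nclin == 0)
        return n * n + k * n;
    return 2 * n * n + k * n + 5 * nclin;
}
```

For example, a QP2 problem with n = 7 and nclin = 7 needs liwork of at least 17 and lwork of at least 189 under these formulas.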

21: $ifail$ – Integer Input/Output

Note: for e04nfa, ifail does not occur in this position in the argument list. See the additional arguments described below.

On entry: ifail must be set to $0$, $- 1$ or $1$ to set behaviour on detection of an error; these values have no effect when no error is detected.

A value of $0$ causes the printing of an error message and program execution will be halted; otherwise program execution continues. A value of $- 1$ means that an error message is printed while a value of $1$ means that it is not.

If halting is not appropriate, the value $- 1$ or $1$ is recommended. If message printing is undesirable, then the value $1$ is recommended. Otherwise, the value $- 1$ is recommended since useful values can be provided in some output arguments even when $ifail \neq 0$ on exit. When the value $- 1$ or $1$ is used it is essential to test the value of ifail on exit.

On exit: $ifail = 0$ unless the routine detects an error or a warning has been flagged (see Section 6).

e04nff/e04nfa returns with $ifail = 0$ if $x$ is a strong local minimizer, i.e., the reduced gradient (Norm Gz; see Section 9.2) is negligible, the Lagrange multipliers (Lagr Mult; see Section 9.2) are optimal and $H_{R}$ (the reduced Hessian of $f (x)$; see Section 11.2) is positive semidefinite.

Note: the following are additional arguments for specific use with e04nfa. Users of e04nff therefore need not read the remainder of this description.

21: $iuser (*)$ – Integer array User Workspace

22: $ruser (*)$ – Real (Kind=nag_wp) array User Workspace

iuser and ruser are not used by e04nff/e04nfa, but are passed directly to qphess and may be used to pass information to this routine.

23: $lwsav (120)$ – Logical array Communication Array

24: $iwsav (610)$ – Integer array Communication Array

25: $rwsav (475)$ – Real (Kind=nag_wp) array Communication Array

The arrays lwsav, iwsav and rwsav must not be altered between calls to any of the routines e04nfa, e04nga or e04nha.

26: $ifail$ – Integer Input/Output

Note: see the argument description for ifail above.

6 Error Indicators and Warnings

If on entry $ifail = 0$ or $- 1$, explanatory error messages are output on the current error message unit (as defined by x04aaf).

Errors or warnings detected by the routine:

Note: in some cases e04nff/e04nfa may return useful information.

$ifail = 1$: Iterations terminated at a dead point.
The necessary conditions for optimality are satisfied but the sufficient conditions are not. (The reduced gradient is negligible, the Lagrange multipliers are optimal, but $H_{R}$ is singular or there are some very small multipliers.) If $\nabla^{2} f (x)$ is not positive definite, $x$ is not necessarily a local solution of the problem and verification of optimality requires further information. If $\nabla^{2} f (x)$ is positive semidefinite or the problem is of type LP, $x$ gives the global minimum value of the objective function, but the final $x$ is not unique.

Optimal solution is not unique.

$ifail = 2$: $⟨ value ⟩$ solution is unbounded.
The objective function is not bounded below in the feasible region. This value of ifail occurs if a step larger than Infinite Step Size (default value $= 10^{20}$) would have to be taken in order to continue the algorithm, or the next step would result in an element of $x$ having magnitude larger than optional parameter Infinite Bound Size (default value $= 10^{20}$).

$ifail = 3$: Cannot satisfy the working set constraints to the accuracy requested.

No feasible point for the linear constraints.
It was not possible to satisfy all the constraints to within the feasibility tolerance. In this case, the constraint violations at the final $x$ will reveal a value of the tolerance for which a feasible point will exist – for example, when the feasibility tolerance for each violated constraint exceeds its Slack (see Section 9.2) at the final point. The modified problem (with an altered feasibility tolerance) may then be solved using a Warm Start. You should check that there are no constraint redundancies. If the data for the constraints are accurate only to the absolute precision $σ$, you should ensure that the value of the optional parameter Feasibility Tolerance (default value $= \sqrt{ε}$, where $ε$ is the machine precision) is greater than $σ$. For example, if all elements of $A$ are of order unity and are accurate only to three decimal places, the Feasibility Tolerance should be at least $10^{- 3}$.

$ifail = 4$: Too many iterations.
The values of the optional parameters Feasibility Phase Iteration Limit (default value $= \max (50, 5 (n + m_{L}))$) and Optimality Phase Iteration Limit (default value $= \max (50, 5 (n + m_{L}))$) may be too small. If the method appears to be making progress (e.g., the objective function is being satisfactorily reduced), either increase the iteration limits and rerun e04nff/e04nfa or, alternatively, rerun e04nff/e04nfa using the Warm Start facility to specify the initial working set.

$ifail = 5$: Reduced Hessian exceeds assigned dimension.
The algorithm needed to expand the reduced Hessian when it was already at its maximum dimension, as specified by the optional parameter Maximum Degrees of Freedom (default value $= n$).
Rerun e04nff/e04nfa with a larger value (possibly using the Warm Start facility to specify the initial working set).

$ifail = 6$: Not enough workspace to solve problem. Workspace provided is $iwork (⟨ value ⟩)$ and $work (⟨ value ⟩)$ . To solve problem we need $iwork (⟨ value ⟩)$ and $work (⟨ value ⟩)$ .

On entry, $lda = ⟨ value ⟩$ and $nclin = ⟨ value ⟩$ .
Constraint: $lda \geq \max (1, nclin)$ .

On entry, $ldh = ⟨ value ⟩$ , $n = ⟨ value ⟩$ and $m = ⟨ value ⟩$ (where $m$ is the value of the optional parameter Hessian Rows).
Constraint: $ldh \geq \max (n, m)$ .

On entry, $ldh = ⟨ value ⟩$ .
Constraint: $ldh \geq 1$ .

On entry, $n = ⟨ value ⟩$ .
Constraint: $n > 0$ .

On entry, $nclin = ⟨ value ⟩$ .
Constraint: $nclin \geq 0$ .

On entry, the bounds on $⟨ value ⟩$ are inconsistent: $bl (⟨ value ⟩) = ⟨ value ⟩$ and $bu (⟨ value ⟩) = ⟨ value ⟩$ .

On entry, the bounds on linear constraint $⟨ value ⟩$ are inconsistent: $bl (⟨ value ⟩) = ⟨ value ⟩$ and $bu (⟨ value ⟩) = ⟨ value ⟩$ .

On entry, the bounds on nonlinear constraint $⟨ value ⟩$ are inconsistent: $bl (⟨ value ⟩) = ⟨ value ⟩$ and $bu (⟨ value ⟩) = ⟨ value ⟩$ .

On entry, the bounds on variable $⟨ value ⟩$ are inconsistent: $bl (⟨ value ⟩) = ⟨ value ⟩$ and $bu (⟨ value ⟩) = ⟨ value ⟩$ .

On entry, the equal bounds on $⟨ value ⟩$ are infinite, because $bl (⟨ value ⟩) = beta$ and $bu (⟨ value ⟩) = beta$ , but $| beta | \geq bigbnd$ : $beta = ⟨ value ⟩$ and $bigbnd = ⟨ value ⟩$ .

On entry, the equal bounds on linear constraint $⟨ value ⟩$ are infinite, because $bl (⟨ value ⟩) = beta$ and $bu (⟨ value ⟩) = beta$ , but $| beta | \geq bigbnd$ : $beta = ⟨ value ⟩$ and $bigbnd = ⟨ value ⟩$ .

On entry, the equal bounds on nonlinear constraint $⟨ value ⟩$ are infinite, because $bl (⟨ value ⟩) = beta$ and $bu (⟨ value ⟩) = beta$ , but $| beta | \geq bigbnd$ : $beta = ⟨ value ⟩$ and $bigbnd = ⟨ value ⟩$ .

On entry, the equal bounds on variable $⟨ value ⟩$ are infinite, because $bl (⟨ value ⟩) = beta$ and $bu (⟨ value ⟩) = beta$ , but $| beta | \geq bigbnd$ : $beta = ⟨ value ⟩$ and $bigbnd = ⟨ value ⟩$ .

On entry with a Cold Start, $istate (⟨ value ⟩) = ⟨ value ⟩$ .

On entry with a Warm Start, $istate (⟨ value ⟩) = ⟨ value ⟩$ .

$ifail = 7$: Problem Type not recognized. Problem abandoned.

Overflow: If the printed output before the overflow error contains a warning about serious ill-conditioning in the working set when adding the $j$th constraint, it may be possible to avoid the difficulty by increasing the magnitude of the Feasibility Tolerance (default value $= \sqrt{ε}$, where $ε$ is the machine precision) and rerunning the program. If the message recurs even after this change, the offending linearly dependent constraint (with index ‘$j$’) must be removed from the problem.

$ifail = - 99$: An unexpected error has been triggered by this routine. Please contact NAG.
See Section 7 in the Introduction to the NAG Library FL Interface for further information.

$ifail = - 399$: Your licence key may have expired or may not have been installed correctly.
See Section 8 in the Introduction to the NAG Library FL Interface for further information.

$ifail = - 999$: Dynamic memory allocation failed.
See Section 9 in the Introduction to the NAG Library FL Interface for further information.

7 Accuracy

e04nff/e04nfa implements a numerically stable active set strategy and returns solutions that are as accurate as the condition of the problem warrants on the machine.

8 Parallelism and Performance

e04nff/e04nfa makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.

Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the Users' Note for your implementation for any additional implementation-specific information.

9 Further Comments

This section contains some comments on scaling and a description of the printed output.

9.1 Scaling

Sensible scaling of the problem is likely to reduce the number of iterations required and make the problem less sensitive to perturbations in the data, thus improving the condition of the problem. In the absence of better information it is usually sensible to scale the constraints so that their Euclidean lengths are of comparable magnitude. See the E04 Chapter Introduction and Gill et al. (1981) for further information and advice.
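As an illustration of the advice above, the following sketch scales each general constraint to unit Euclidean length before the problem is passed to a solver. It is a minimal example under stated assumptions, not part of the NAG interface: the function name `scale_constraint_rows` and the list-of-rows storage are hypothetical, and bounds whose magnitude is at least `bigbnd` are left untouched so that they are still recognised as infinite.

```python
import math


def scale_constraint_rows(a, bl, bu, bigbnd=1.0e20):
    """Scale each constraint bl_i <= a_i^T x <= bu_i so that row a_i has unit length.

    Hypothetical helper: `a` is a list of rows, `bl` and `bu` hold the bounds
    on the general constraints only (not on the variables).
    """
    scaled_a, scaled_bl, scaled_bu = [], [], []
    for row, lo, hi in zip(a, bl, bu):
        norm = math.sqrt(sum(v * v for v in row))
        if norm == 0.0:
            norm = 1.0  # a zero row cannot be scaled; leave it unchanged
        scaled_a.append([v / norm for v in row])
        # Infinite bounds are kept as they are so they still compare >= bigbnd.
        scaled_bl.append(lo if abs(lo) >= bigbnd else lo / norm)
        scaled_bu.append(hi if abs(hi) >= bigbnd else hi / norm)
    return scaled_a, scaled_bl, scaled_bu
```

Dividing a row and its finite bounds by the same positive number leaves the feasible region unchanged, so only the conditioning of the problem is affected.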

9.2 Description of the Printed Output

This section describes the intermediate printout and final printout produced by e04nff/e04nfa. The intermediate printout is a subset of the monitoring information produced by the routine at every iteration (see Section 13). You can control the level of printed output (see the description of the optional parameter Print Level). Note that the intermediate printout and final printout are produced only if Print Level $\geq 10$ (the default for e04nff; by default no output is produced by e04nfa).

The following line of summary output ($< 80$ characters) is produced at every iteration. In all cases, the values of the quantities printed are those in effect on completion of the given iteration.

Itn	is the iteration count.
Step	is the step taken along the computed search direction. If a constraint is added during the current iteration, Step will be the step to the nearest constraint. When the problem is of type LP, the step can be greater than $1$ during the optimality phase.
Ninf	is the number of violated constraints (infeasibilities). This will be zero during the optimality phase.
Sinf/Objective	is the value of the current objective function. If $x$ is not feasible, Sinf gives a weighted sum of the magnitudes of constraint violations. If $x$ is feasible, Objective is the value of the objective function of (1). The output line for the final iteration of the feasibility phase (i.e., the first iteration for which Ninf is zero) will give the value of the true objective at the first feasible point. During the optimality phase the value of the objective function will be nonincreasing. During the feasibility phase the number of constraint infeasibilities will not increase until either a feasible point is found or the optimality of the multipliers implies that no feasible point exists. Once optimal multipliers are obtained the number of infeasibilities can increase, but the sum of infeasibilities will either remain constant or be reduced until the minimum sum of infeasibilities is found.
Norm Gz	is $‖ Z_{R}^{T} g_{FR} ‖$ , the Euclidean norm of the reduced gradient with respect to $Z_{R}$ . During the optimality phase, this norm will be approximately zero after a unit step. (See Sections 11.2 and 11.3.)

The final printout includes a listing of the status of every variable and constraint.

The following describes the printout for each variable. A full stop (.) is printed for any numerical value that is zero.

Varbl	gives the name (V) and index $j$, for $j = 1, 2, \dots, n$, of the variable.
State	gives the state of the variable (FR if neither bound is in the working set, EQ if a fixed variable, LL if on its lower bound, UL if on its upper bound, TF if temporarily fixed at its current value). If Value lies outside the upper or lower bounds by more than the Feasibility Tolerance, State will be ++ or -- respectively.

A key is sometimes printed before State.

A	Alternative optimum possible. The variable is active at one of its bounds, but its Lagrange multiplier is essentially zero. This means that if the variable were allowed to start moving away from its bound then there would be no change to the objective function. The values of the other free variables might change, giving a genuine alternative solution. However, if there are any degenerate variables (labelled D), the actual change might prove to be zero, since one of them could encounter a bound immediately. In either case the values of the Lagrange multipliers might also change.
D	Degenerate. The variable is free, but it is equal to (or very close to) one of its bounds.
I	Infeasible. The variable is currently violating one of its bounds by more than the Feasibility Tolerance.

Value	is the value of the variable at the final iteration.
Lower Bound	is the lower bound specified for the variable. None indicates that $bl (j) \leq - bigbnd$.
Upper Bound	is the upper bound specified for the variable. None indicates that $bu (j) \geq bigbnd$.
Lagr Mult	is the Lagrange multiplier for the associated bound. This will be zero if State is FR unless $bl (j) \leq - bigbnd$ and $bu (j) \geq bigbnd$, in which case the entry will be blank. If $x$ is optimal, the multiplier should be non-negative if State is LL and non-positive if State is UL.
Slack	is the difference between the variable Value and the nearer of its (finite) bounds $bl (j)$ and $bu (j)$. A blank entry indicates that the associated variable is not bounded (i.e., $bl (j) \leq - bigbnd$ and $bu (j) \geq bigbnd$).

The meaning of the printout for general constraints is the same as that given above for variables, with ‘variable’ replaced by ‘constraint’, $bl (j)$ and $bu (j)$ replaced by $bl (n + j)$ and $bu (n + j)$ respectively, and with the following change in the heading:

L Con	gives the name (L) and index $j$, for $j = 1, 2, \dots, n_{L}$, of the linear constraint.

Note that movement off a constraint (as opposed to a variable moving away from its bound) can be interpreted as allowing the entry in the Slack column to become positive.

Numerical values are output with a fixed number of digits; they are not guaranteed to be accurate to this precision.

10 Example

This example minimizes the quadratic function $f (x) = c^{T} x + \frac{1}{2} x^{T} H x$, where

$$c = {(- 0.02, - 0.2, - 0.2, - 0.2, - 0.2, 0.04, 0.04)}^{T},$$

$$H = \begin{pmatrix} 2 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 2 & 2 & 0 & 0 & 0 \\ 0 & 0 & 2 & 2 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & - 2 & - 2 \\ 0 & 0 & 0 & 0 & 0 & - 2 & - 2 \end{pmatrix},$$

subject to the bounds

$$\begin{matrix} - 0.01 \leq x_{1} \leq 0.01 \\ - 0.1 \leq x_{2} \leq 0.15 \\ - 0.01 \leq x_{3} \leq 0.03 \\ - 0.04 \leq x_{4} \leq 0.02 \\ - 0.1 \leq x_{5} \leq 0.05 \\ - 0.01 \leq x_{6} \\ - 0.01 \leq x_{7} \end{matrix}$$

and to the general constraints

$$\begin{matrix} x_{1} + x_{2} + x_{3} + x_{4} + x_{5} + x_{6} + x_{7} = - 0.13 \\ 0.15 x_{1} + 0.04 x_{2} + 0.02 x_{3} + 0.04 x_{4} + 0.02 x_{5} + 0.01 x_{6} + 0.03 x_{7} \leq - 0.0049 \\ 0.03 x_{1} + 0.05 x_{2} + 0.08 x_{3} + 0.02 x_{4} + 0.06 x_{5} + 0.01 x_{6} \leq - 0.0064 \\ 0.02 x_{1} + 0.04 x_{2} + 0.01 x_{3} + 0.02 x_{4} + 0.02 x_{5} \leq - 0.0037 \\ 0.02 x_{1} + 0.03 x_{2} + 0.01 x_{5} \leq - 0.0012 \\ - 0.0992 \leq 0.70 x_{1} + 0.75 x_{2} + 0.80 x_{3} + 0.75 x_{4} + 0.80 x_{5} + 0.97 x_{6} \\ - 0.003 \leq 0.02 x_{1} + 0.06 x_{2} + 0.08 x_{3} + 0.12 x_{4} + 0.02 x_{5} + 0.01 x_{6} + 0.97 x_{7} \leq 0.002 . \end{matrix}$$

The initial point, which is infeasible, is

$$x_{0} = {(- 0.01, - 0.03, 0.0, - 0.01, - 0.1, 0.02, 0.01)}^{T} .$$

The optimal solution (to five figures) is

$$x^{*} = {(- 0.01, - 0.069865, 0.018259, - 0.024261, - 0.062006, 0.013805, 0.0040665)}^{T} .$$

One bound constraint and four general constraints are active at the solution.

The document for e04ngf/e04nga includes an example program to solve the same problem using some of the optional parameters described in Section 12.
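The problem data of this example can be checked independently of the library. The sketch below is an illustration only, not the NAG example program: it stores the data in plain Python lists, evaluates $f (x)$ at the initial point and lists which bounds and general constraints are violated there. The names `objective` and `violated`, the value `BIG` used for missing bounds and the tolerance `1.0e-8` are assumptions made for the illustration.

```python
BIG = 1.0e20  # stands in for an infinite bound (assumed to match bigbnd)

c = [-0.02, -0.2, -0.2, -0.2, -0.2, 0.04, 0.04]
H = [
    [2, 0, 0, 0, 0, 0, 0],
    [0, 2, 0, 0, 0, 0, 0],
    [0, 0, 2, 2, 0, 0, 0],
    [0, 0, 2, 2, 0, 0, 0],
    [0, 0, 0, 0, 2, 0, 0],
    [0, 0, 0, 0, 0, -2, -2],
    [0, 0, 0, 0, 0, -2, -2],
]
A = [
    [1.00, 1.00, 1.00, 1.00, 1.00, 1.00, 1.00],
    [0.15, 0.04, 0.02, 0.04, 0.02, 0.01, 0.03],
    [0.03, 0.05, 0.08, 0.02, 0.06, 0.01, 0.00],
    [0.02, 0.04, 0.01, 0.02, 0.02, 0.00, 0.00],
    [0.02, 0.03, 0.00, 0.00, 0.01, 0.00, 0.00],
    [0.70, 0.75, 0.80, 0.75, 0.80, 0.97, 0.00],
    [0.02, 0.06, 0.08, 0.12, 0.02, 0.01, 0.97],
]
# First n = 7 entries bound the variables, the remaining 7 bound A x.
bl = [-0.01, -0.1, -0.01, -0.04, -0.1, -0.01, -0.01,
      -0.13, -BIG, -BIG, -BIG, -BIG, -0.0992, -0.003]
bu = [0.01, 0.15, 0.03, 0.02, 0.05, BIG, BIG,
      -0.13, -0.0049, -0.0064, -0.0037, -0.0012, BIG, 0.002]
x0 = [-0.01, -0.03, 0.0, -0.01, -0.1, 0.02, 0.01]


def objective(x):
    """Return f(x) = c^T x + 0.5 * x^T H x."""
    hx = [sum(hij * xj for hij, xj in zip(row, x)) for row in H]
    return sum(ci * xi for ci, xi in zip(c, x)) + 0.5 * sum(
        xi * hxi for xi, hxi in zip(x, hx))


def violated(x, tol=1.0e-8):
    """Return the 1-based indices (variables first, then constraints) that are violated."""
    values = list(x) + [sum(aij * xj for aij, xj in zip(row, x)) for row in A]
    return [k + 1 for k, (v, lo, hi) in enumerate(zip(values, bl, bu))
            if v < lo - tol or v > hi + tol]


print(objective(x0))
print(violated(x0))
```

Indices greater than $n = 7$ in the second printed list refer to general constraints, so an index $7 + j$ corresponds to the $j$th row of the constraint matrix.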

10.1 Program Text

Note: the following programs illustrate the use of e04nff and e04nfa.

Program Text (e04nffe.f90)

Program Text (e04nfae.f90)

10.2 Program Data

Program Data (e04nffe.d)

Program Data (e04nfae.d)

10.3 Program Results

Program Results (e04nffe.r)

Program Results (e04nfae.r)

Note: the remainder of this document is intended for more advanced users. Section 11 contains a detailed description of the algorithm which may be needed in order to understand Sections 12 and 13. Section 12 describes the optional parameters which may be set by calls to e04ngf/e04nga and/or e04nhf/e04nha. Section 13 describes the quantities which can be requested to monitor the course of the computation.

11 Algorithmic Details

This section contains a detailed description of the method used by e04nff/e04nfa.

11.1 Overview

e04nff/e04nfa is based on an inertia-controlling method that maintains a Cholesky factorization of the reduced Hessian (see below). The method is based on that of Gill and Murray (1978), and is described in detail by Gill et al. (1991). Here we briefly summarise the main features of the method. Where possible, explicit reference is made to the names of variables that are arguments of e04nff/e04nfa or appear in the printed output. e04nff/e04nfa has two phases:

(i)finding an initial feasible point by minimizing the sum of infeasibilities (the feasibility phase), and
(ii)minimizing the quadratic objective function within the feasible region (the optimality phase).

The computations in both phases are performed by the same subroutines. The two-phase nature of the algorithm is reflected by changing the function being minimized from the sum of infeasibilities to the quadratic objective function. The feasibility phase does not perform the standard simplex method (i.e., it does not necessarily find a vertex), except in the LP case when $m_{L} \leq n$. Once any iterate is feasible, all subsequent iterates remain feasible.

e04nff/e04nfa has been designed to be efficient when used to solve a sequence of related problems – for example, within a sequential quadratic programming method for nonlinearly constrained optimization (e.g., e04uff/e04ufa or e04wdf). In particular, you may specify an initial working set (the indices of the constraints believed to be satisfied exactly at the solution); see the discussion of the optional parameter Warm Start.

In general, an iterative process is required to solve a quadratic program. (For simplicity, we shall always consider a typical iteration and avoid reference to the index of the iteration.) Each new iterate $\bar{x}$ is defined by

$$\bar{x} = x + α p, \tag{1}$$

where the step length $α$ is a non-negative scalar and $p$ is called the search direction.

At each point $x$, a working set of constraints is defined to be a linearly independent subset of the constraints that are satisfied ‘exactly’ (to within the tolerance defined by the optional parameter Feasibility Tolerance). The working set is the current prediction of the constraints that hold with equality at the solution of a linearly constrained QP problem. The search direction is constructed so that the constraints in the working set remain unaltered for any value of the step length. For a bound constraint in the working set, this property is achieved by setting the corresponding element of the search direction to zero. Thus, the associated variable is fixed, and specification of the working set induces a partition of $x$ into fixed and free variables. During a given iteration, the fixed variables are effectively removed from the problem; since the relevant elements of the search direction are zero, the columns of $A$ corresponding to fixed variables may be ignored.

Let $m_{W}$ denote the number of general constraints in the working set and let $n_{FX}$ denote the number of variables fixed at one of their bounds ($m_{W}$ and $n_{FX}$ are the quantities Lin and Bnd in the monitoring file output from e04nff/e04nfa; see Section 13). Similarly, let $n_{FR}$ ($n_{FR} = n - n_{FX}$) denote the number of free variables. At every iteration, the variables are reordered so that the last $n_{FX}$ variables are fixed, with all other relevant vectors and matrices ordered accordingly.

11.2 Definition of Search Direction

Let $A_{FR}$ denote the $m_{W} \times n_{FR}$ sub-matrix of general constraints in the working set corresponding to the free variables and let $p_{FR}$ denote the search direction with respect to the free variables only. The general constraints in the working set will be unaltered by any move along $p$ if

$$A_{FR} p_{FR} = 0 . \tag{2}$$

In order to compute $p_{FR}$, the $T Q$ factorization of $A_{FR}$ is used:

$$A_{FR} Q_{FR} = (\begin{matrix} 0 & T \end{matrix}), \tag{3}$$

where $T$ is a nonsingular $m_{W} \times m_{W}$ upper triangular matrix (i.e., $t_{i j} = 0$ if $i > j$), and the nonsingular $n_{FR} \times n_{FR}$ matrix $Q_{FR}$ is the product of orthogonal transformations (see Gill et al. (1984)). If the columns of $Q_{FR}$ are partitioned so that

$$Q_{FR} = (\begin{matrix} Z & Y \end{matrix}),$$

where $Y$ is $n_{FR} \times m_{W}$, then the $n_{Z}$ $(n_{Z} = n_{FR} - m_{W})$ columns of $Z$ form a basis for the null space of $A_{FR}$. Let $n_{R}$ be an integer such that $0 \leq n_{R} \leq n_{Z}$, and let $Z_{R}$ denote a matrix whose $n_{R}$ columns are a subset of the columns of $Z$. (The integer $n_{R}$ is the quantity Zr in the monitoring output from e04nff/e04nfa. In many cases, $Z_{R}$ will include all the columns of $Z$.) The direction $p_{FR}$ will satisfy (2) if

$$p_{FR} = Z_{R} p_{R}, \tag{4}$$

where $p_{R}$ is any $n_{R}$-vector.
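A small worked instance may help fix the notation of (2) to (4). The numbers below are chosen purely for illustration and do not come from the routine: one general constraint in the working set ($m_{W} = 1$) and two free variables ($n_{FR} = 2$).

```latex
A_{FR} = \begin{pmatrix} 1 & 1 \end{pmatrix}, \qquad
Q_{FR} = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 \\ -1 & 1 \end{pmatrix}
       = \begin{pmatrix} Z & Y \end{pmatrix}, \qquad
A_{FR} Q_{FR} = \begin{pmatrix} 0 & \sqrt{2} \end{pmatrix}
              = \begin{pmatrix} 0 & T \end{pmatrix}.

% Here T = (\sqrt{2}) is 1 x 1 and nonsingular, n_Z = n_{FR} - m_W = 1, and
Z = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 \\ -1 \end{pmatrix}, \qquad
p_{FR} = Z p_{R} = \frac{p_{R}}{\sqrt{2}} \begin{pmatrix} 1 \\ -1 \end{pmatrix}
\;\Longrightarrow\;
A_{FR} p_{FR} = \frac{p_{R}}{\sqrt{2}} (1 - 1) = 0 .
```

Any scalar $p_{R}$ therefore gives a direction that leaves the working-set constraint unaltered, which is the content of (2) and (4) in this one-dimensional case.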

Let $Q$ denote the $n \times n$ matrix

$$Q = \begin{pmatrix} Q_{FR} & \\ & I_{FX} \end{pmatrix},$$

where $I_{FX}$ is the identity matrix of order $n_{FX}$. Let $H_{Q}$ and $g_{Q}$ denote the $n \times n$ transformed Hessian and the transformed gradient

$$H_{Q} = Q^{T} H Q \quad \text{and} \quad g_{Q} = Q^{T} (c + H x)$$

and let the matrix of first $n_{R}$ rows and columns of $H_{Q}$ be denoted by $H_{R}$ and the vector of the first $n_{R}$ elements of $g_{Q}$ be denoted by $g_{R}$. The quantities $H_{R}$ and $g_{R}$ are known as the reduced Hessian and reduced gradient of $f (x)$, respectively. Roughly speaking, $g_{R}$ and $H_{R}$ describe the first and second derivatives of an unconstrained problem for the calculation of $p_{R}$.

At each iteration, a triangular factorization of $H_{R}$ is available. If $H_{R}$ is positive definite, $H_{R} = R^{T} R$, where $R$ is the upper triangular Cholesky factor of $H_{R}$. If $H_{R}$ is not positive definite, $H_{R} = R^{T} D R$, where $D = diag (1, 1, \dots, 1, μ)$, with $μ \leq 0$.

The computation is arranged so that the reduced-gradient vector is a multiple of $e_{R}$, a vector of all zeros except in the last (i.e., $n_{R}$th) position. This allows the vector $p_{R}$ in (4) to be computed from a single back-substitution

$$R p_{R} = γ e_{R}, \tag{5}$$

where $γ$ is a scalar that depends on whether or not the reduced Hessian is positive definite at $x$. In the positive definite case, $x + p$ is the minimizer of the objective function subject to the constraints (bounds and general) in the working set treated as equalities. If $H_{R}$ is not positive definite, $p_{R}$ satisfies the conditions

$$p_{R}^{T} H_{R} p_{R} < 0 \quad \text{and} \quad g_{R}^{T} p_{R} \leq 0,$$

which allow the objective function to be reduced by any positive step of the form $x + α p$.
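The back-substitution in (5) and the mapping (4) are simple enough to sketch directly. The code below is an illustration under assumptions, not the routine's implementation: $R$ and $Z_{R}$ are stored as lists of rows, the scalar $γ$ is taken as given (the text above does not state its formula), and the function names are hypothetical.

```python
def solve_reduced_direction(r, gamma):
    """Solve R p_R = gamma * e_R by back-substitution.

    `r` is an upper triangular matrix (list of rows) with nonzero diagonal;
    e_R is zero except for a one in its last position.
    """
    n = len(r)
    rhs = [0.0] * n
    if n > 0:
        rhs[-1] = gamma
    p = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(r[i][j] * p[j] for j in range(i + 1, n))
        p[i] = (rhs[i] - tail) / r[i][i]
    return p


def free_direction(z_r, p_r):
    """Return p_FR = Z_R p_R, where `z_r` has one row per free variable."""
    return [sum(zij * pj for zij, pj in zip(row, p_r)) for row in z_r]


p_r = solve_reduced_direction([[2.0, 1.0], [0.0, 4.0]], 8.0)
p_fr = free_direction([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]], p_r)
```

Because the right-hand side has a single nonzero entry in the last position, only one triangular solve is needed, which is the saving the arrangement of the computation is designed to give.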

11.3 Main Iteration

If the reduced gradient is zero, $x$ is a constrained stationary point in the subspace defined by $Z$. During the feasibility phase, the reduced gradient will usually be zero only at a vertex (although it may be zero at non-vertices in the presence of constraint dependencies). During the optimality phase a zero reduced gradient implies that $x$ minimizes the quadratic objective when the constraints in the working set are treated as equalities. At a constrained stationary point, Lagrange multipliers $λ_{C}$ and $λ_{B}$ for the general and bound constraints are defined from the equations

$$A_{FR}^{T} λ_{C} = g_{FR} \quad \text{and} \quad λ_{B} = g_{FX} - A_{FX}^{T} λ_{C} . \tag{6}$$

Given a positive constant $δ$ of the order of the machine precision, a Lagrange multiplier $λ_{j}$ corresponding to an inequality constraint in the working set is said to be optimal if $λ_{j} \leq δ$ when the associated constraint is at its upper bound, or if $λ_{j} \geq - δ$ when the associated constraint is at its lower bound. If a multiplier is nonoptimal, the objective function (either the true objective or the sum of infeasibilities) can be reduced by deleting the corresponding constraint (with index Jdel; see Section 13) from the working set.
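The optimality test on the multipliers described above amounts to a sign check with a small tolerance. The following sketch is an illustration only: the function names and the flag `at_upper` are hypothetical, and the rule the routine uses to pick one constraint (Jdel) among several nonoptimal ones is not specified here, so the sketch simply lists all candidates.

```python
def multiplier_is_optimal(lam, at_upper, delta):
    """Return True if the multiplier of an inequality in the working set is optimal.

    A constraint at its upper bound needs lam <= delta; a constraint at its
    lower bound needs lam >= -delta.
    """
    return lam <= delta if at_upper else lam >= -delta


def nonoptimal_multipliers(lams, at_upper_flags, delta):
    """Return the 0-based positions of the multipliers that fail the test."""
    return [k for k, (lam, up) in enumerate(zip(lams, at_upper_flags))
            if not multiplier_is_optimal(lam, up, delta)]
```

An empty list from `nonoptimal_multipliers` at a constrained stationary point corresponds to the case in which no constraint can be deleted to reduce the objective.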

If optimal multipliers occur during the feasibility phase and the sum of infeasibilities is nonzero, there is no feasible point, and you can force e04nff/e04nfa to continue until the minimum value of the sum of infeasibilities has been found; see the discussion of the optional parameter Minimum Sum of Infeasibilities. At such a point, the Lagrange multiplier $λ_{j}$ corresponding to an inequality constraint in the working set will be such that $- (1 + δ) \leq λ_{j} \leq δ$ when the associated constraint is at its upper bound, and $- δ \leq λ_{j} \leq (1 + δ)$ when the associated constraint is at its lower bound. Lagrange multipliers for equality constraints will satisfy $| λ_{j} | \leq 1 + δ$.

If the reduced gradient is not zero, Lagrange multipliers need not be computed and the nonzero elements of the search direction $p$ are given by $Z_{R} p_{R}$ (see (4) and (5)). The choice of step length is influenced by the need to maintain feasibility with respect to the satisfied constraints. If $H_{R}$ is positive definite and $x + p$ is feasible, $α$ will be taken as unity. In this case, the reduced gradient at $\bar{x}$ will be zero, and Lagrange multipliers are computed. Otherwise, $α$ is set to $α_{M}$, the step to the ‘nearest’ constraint (with index Jadd; see Section 13), which is added to the working set at the next iteration.
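The step-length rule above can be sketched for the simplest case in which only bounds on the variables are present. This is a simplified illustration, not the routine's procedure: it ignores general constraints and the feasibility tolerance, it assumes the current point satisfies its bounds, and the function names are hypothetical. Missing bounds are represented by `math.inf`.

```python
import math


def step_to_nearest_bound(x, p, bl, bu):
    """Return (alpha_m, jadd): the largest step keeping bl <= x + alpha*p <= bu
    and the 0-based index of the bound that limits it (None if unlimited)."""
    alpha_m, jadd = math.inf, None
    for j, (xj, pj, lo, hi) in enumerate(zip(x, p, bl, bu)):
        if pj > 0.0:
            step = (hi - xj) / pj
        elif pj < 0.0:
            step = (lo - xj) / pj
        else:
            continue  # this variable does not move along p
        if step < alpha_m:
            alpha_m, jadd = step, j
    return alpha_m, jadd


def choose_step(x, p, bl, bu, positive_definite):
    """Take the unit step when H_R is positive definite and x + p is feasible;
    otherwise step to the nearest bound, which then joins the working set."""
    alpha_m, jadd = step_to_nearest_bound(x, p, bl, bu)
    if positive_definite and alpha_m >= 1.0:
        return 1.0, None
    return alpha_m, jadd
```

If `choose_step` returns an infinite step when the reduced Hessian is not positive definite, no bound limits the move along $p$, which in this simplified setting corresponds to an unbounded problem.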

Each change in the working set leads to a simple change to $A_{FR}$: if the status of a general constraint changes, a row of $A_{FR}$ is altered; if a bound constraint enters or leaves the working set, a column of $A_{FR}$ changes. Explicit representations of the matrices $T$, $Q_{FR}$ and $R$, and of the vectors $Q^{T} g$ and $Q^{T} c$, are updated recursively. The triangular factor $R$ associated with the reduced Hessian is only updated during the optimality phase.

One of the most important features of e04nff/e04nfa is its control of the conditioning of the working set, whose nearness to linear dependence is estimated by the ratio of the largest to smallest diagonal elements of the $T Q$ factor $T$ (the printed value Cond T; see Section 13). In constructing the initial working set, constraints are excluded that would result in a large value of Cond T.
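The estimate described above is cheap to compute once $T$ is available. The sketch below illustrates the ratio only; the function name is hypothetical, $T$ is stored as a list of rows, and taking absolute values of the diagonal elements is an assumption.

```python
import math


def cond_t_estimate(t):
    """Return the ratio of the largest to the smallest diagonal magnitude of T."""
    diag = [abs(t[i][i]) for i in range(len(t))]
    if not diag:
        return 1.0  # empty working set: nothing to be ill-conditioned
    smallest = min(diag)
    return math.inf if smallest == 0.0 else max(diag) / smallest
```

A large value signals that the constraints in the working set are close to being linearly dependent, which is why such constraints are kept out of the initial working set.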

e04nff/e04nfa includes a rigorous procedure that prevents the possibility of cycling at a point where the active constraints are nearly linearly dependent (see Gill et al. (1989)). The main feature of the anti-cycling procedure is that the feasibility tolerance is increased slightly at the start of every iteration. This not only allows a positive step to be taken at every iteration, but also provides, whenever possible, a choice of constraints to be added to the working set. Let $α_{M}$ denote the maximum step at which $x + α_{M} p$ does not violate any constraint by more than its feasibility tolerance. All constraints at a distance $α$ ($α \leq α_{M}$) along $p$ from the current point are then viewed as acceptable candidates for inclusion in the working set. The constraint whose normal makes the largest angle with the search direction is added to the working set.

11.4 Choosing the Initial Working Set

At the start of the optimality phase, a positive definite $H_{R}$ can be defined if enough constraints are included in the initial working set. (The matrix with no rows and columns is positive definite by definition, corresponding to the case when $A_{FR}$ contains $n_{FR}$ constraints.) The idea is to include as many general constraints as necessary to ensure that the reduced Hessian is positive definite.

Let $H_{Z}$ denote the matrix of the first $n_{Z}$ rows and columns of the matrix $H_{Q} = Q^{T} H Q$ at the beginning of the optimality phase. A partial Cholesky factorization is used to find an upper triangular matrix $R$ that is the factor of the largest positive definite leading sub-matrix of $H_{Z}$. The use of interchanges during the factorization of $H_{Z}$ tends to maximize the dimension of $R$. (The condition of $R$ may be controlled using the optional parameter Rank Tolerance.) Let $Z_{R}$ denote the columns of $Z$ corresponding to $R$, and let $Z$ be partitioned as $Z = (\begin{matrix} Z_{R} & Z_{A} \end{matrix})$. A working set for which $Z_{R}$ defines the null space can be obtained by including the rows of $Z_{A}^{T}$ as ‘artificial constraints’. Minimization of the objective function then proceeds within the subspace defined by $Z_{R}$, as described in Section 11.2.
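The idea of a partial Cholesky factorization can be sketched as follows. This is a simplified illustration and not the routine's algorithm: it performs no interchanges, it uses a fixed absolute pivot tolerance `tol` in place of the Rank Tolerance logic, and the function name is hypothetical. The factorization stops at the first pivot that is not safely positive and returns the factor of the leading block processed so far.

```python
import math


def partial_cholesky(h, tol=1.0e-12):
    """Return upper triangular R with R^T R equal to the largest leading
    sub-matrix of the symmetric matrix `h` whose pivots all exceed `tol`."""
    n = len(h)
    r = [[0.0] * n for _ in range(n)]
    for k in range(n):
        pivot = h[k][k] - sum(r[i][k] ** 2 for i in range(k))
        if pivot <= tol:
            # Leading k x k block is the largest one accepted (no interchanges).
            return [row[:k] for row in r[:k]]
        r[k][k] = math.sqrt(pivot)
        for j in range(k + 1, n):
            r[k][j] = (h[k][j] - sum(r[i][k] * r[i][j] for i in range(k))) / r[k][k]
    return r


# Applied, purely for illustration, to a matrix with a singular 4 x 4 leading block.
h = [
    [2.0, 0.0, 0.0, 0.0],
    [0.0, 2.0, 0.0, 0.0],
    [0.0, 0.0, 2.0, 2.0],
    [0.0, 0.0, 2.0, 2.0],
]
r = partial_cholesky(h)
```

In the routine the factorization is applied to $H_{Z}$ with interchanges, so the dimension of $R$ found there can be larger than the one this unpivoted sketch would report for the same matrix.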

The artificially augmented working set is given by

$${\bar{A}}_{FR} = \begin{pmatrix} Z_{A}^{T} \\ A_{FR} \end{pmatrix}, \tag{7}$$

so that $p_{FR}$ will satisfy $A_{FR} p_{FR} = 0$ and $Z_{A}^{T} p_{FR} = 0$. By definition of the $T Q$ factorization, ${\bar{A}}_{FR}$ automatically satisfies the following:

$${\bar{A}}_{FR} Q_{FR} = \begin{pmatrix} Z_{A}^{T} \\ A_{FR} \end{pmatrix} Q_{FR} = \begin{pmatrix} Z_{A}^{T} \\ A_{FR} \end{pmatrix} (\begin{matrix} Z_{R} & Z_{A} & Y \end{matrix}) = (\begin{matrix} 0 & \bar{T} \end{matrix}),$$

where

$$\bar{T} = \begin{pmatrix} I & 0 \\ 0 & T \end{pmatrix},$$

and hence the $T Q$ factorization of (7) is available trivially from $T$ and $Q_{FR}$ without additional expense.

The matrix $Z_{A}$ is not kept fixed, since its role is purely to define an appropriate null space; the $T Q$ factorization can, therefore, be updated in the normal fashion as the iterations proceed. No work is required to ‘delete’ the artificial constraints associated with $Z_{A}$ when $Z_{R}^{T} g_{FR} = 0$, since this simply involves repartitioning $Q_{FR}$. The ‘artificial’ multiplier vector associated with the rows of $Z_{A}^{T}$ is equal to $Z_{A}^{T} g_{FR}$, and the multipliers corresponding to the rows of the ‘true’ working set are the multipliers that would be obtained if the artificial constraints were not present. If an artificial constraint is ‘deleted’ from the working set, an A appears alongside the entry in the Jdel column of the monitoring file output (see Section 13).

The number of columns in $Z_{A}$ and $Z_{R}$, the Euclidean norm of $Z_{R}^{T} g_{FR}$, and the condition estimator of $R$ appear in the monitoring file output as Art, Zr, Norm Gz and Cond Rz respectively (see Section 13).

Under some circumstances, a different type of artificial constraint is used when solving a linear program. Although the algorithm of e04nff/e04nfa does not usually perform simplex steps (in the traditional sense), there is one exception: a linear program with fewer general constraints than variables (i.e., $m_{L} \leq n$). Use of the simplex method in this situation leads to savings in storage. At the starting point, the ‘natural’ working set (the set of constraints exactly or nearly satisfied at the starting point) is augmented with a suitable number of ‘temporary’ bounds, each of which has the effect of temporarily fixing a variable at its current value. In subsequent iterations, a temporary bound is treated as a standard constraint until it is deleted from the working set, in which case it is never added again. If a temporary bound is ‘deleted’ from the working set, an F (for ‘Fixed’) appears alongside the entry in the Jdel column of the monitoring file output (see Section 13).

12 Optional Parameters

Several optional parameters in e04nff/e04nfa define choices in the problem specification or the algorithm logic. In order to reduce the number of formal arguments of e04nff/e04nfa these optional parameters have associated default values that are appropriate for most problems. Therefore, you need only specify those optional parameters whose values are to be different from their default values.

The remainder of this section can be skipped if you wish to use the default values for all optional parameters.

The following is a list of the optional parameters available. A full description of each optional parameter is provided in Section 12.1.

Optional parameters may be specified by calling one, or both, of the routines e04ngf/e04nga and e04nhf/e04nha before a call to e04nff/e04nfa.

e04ngf/e04nga reads options from an external options file, with Begin and End as the first and last lines respectively and each intermediate line defining a single optional parameter. For example,

Begin 
  Print Level = 5 
End

The call

Call e04ngf/e04nga (ioptns, inform)

can then be used to read the file on unit ioptns. inform will be zero on successful exit. e04ngf/e04nga should be consulted for a full description of this method of supplying optional parameters.

e04nhf/e04nha can be called to supply options directly, one call being necessary for each optional parameter. For example,

Call e04nhf ('Print Level = 5')

e04nhf/e04nha should be consulted for a full description of this method of supplying optional parameters.

All optional parameters not specified by you are set to their default values. Optional parameters specified by you are unaltered by e04nff/e04nfa (unless they define invalid values) and so remain in effect for subsequent calls unless altered by you.

12.1 Description of the Optional Parameters

For each option, we give a summary line, a description of the optional parameter and details of constraints.

The summary line contains:

the keywords, where the minimum abbreviation of each keyword is underlined (if no characters of an optional qualifier are underlined, the qualifier may be omitted);
a parameter value, where the letters $a$ , $i$ and $r$ denote options that take character, integer and real values respectively;
the default value, where the symbol $ε$ is a generic notation for machine precision (see x02ajf).

Keywords and character values are case and white space insensitive.

Check Frequency	$i$	Default $= 50$

Every $i$th iteration, a numerical test is made to see if the current solution $x$ satisfies the constraints in the working set. If the largest residual of the constraints in the working set is judged to be too large, the current working set is refactorized and the variables are recomputed to satisfy the constraints more accurately. If $i \leq 0$, the default value is used.

Cold Start	Default
Warm Start

This option specifies how the initial working set is chosen. With a Cold Start, e04nff/e04nfa chooses the initial working set based on the values of the variables and constraints at the initial point. Broadly speaking, the initial working set will include equality constraints and bounds or inequality constraints that violate or ‘nearly’ satisfy their bounds (to within Crash Tolerance).

With a Warm Start, you must provide a valid definition of every element of the array istate. e04nff/e04nfa will override your specification of istate if necessary, so that a poor choice of the working set will not cause a fatal error. For instance, any elements of istate which are set to $- 2$, $- 1$ or $4$ will be reset to zero, as will any elements which are set to $3$ when the corresponding elements of bl and bu are not equal. A warm start will be advantageous if a good estimate of the initial working set is available, for example, when e04nff/e04nfa is called repeatedly to solve related problems.

Crash Tolerance	$r$	Default $= 0.01$

This value is used in conjunction with the optional parameter Cold Start (the default value) when e04nff/e04nfa selects an initial working set. If $0 \leq r \leq 1$, the initial working set will include (if possible) bounds or general inequality constraints that lie within $r$ of their bounds. In particular, a constraint of the form $a_{j}^{T} x \geq l$ will be included in the initial working set if $| a_{j}^{T} x - l | \leq r (1 + | l |)$. If $r < 0$ or $r > 1$, the default value is used.
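The inclusion test for a constraint of the form $a_{j}^{T} x \geq l$ can be written out directly. The sketch below is an illustration only; the function name is hypothetical and the argument `ajx` stands for the already computed value $a_{j}^{T} x$ at the initial point.

```python
def in_initial_working_set(ajx, l, r=0.01):
    """Return True if |a_j^T x - l| <= r * (1 + |l|).

    Values of r outside [0, 1] fall back to the default 0.01, mirroring the
    rule stated for Crash Tolerance.
    """
    if r < 0.0 or r > 1.0:
        r = 0.01
    return abs(ajx - l) <= r * (1.0 + abs(l))
```

The factor $1 + | l |$ makes the test relative for bounds of large magnitude and absolute for bounds near zero.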

Defaults

This special keyword may be used to reset all optional parameters to their default values.

Expand Frequency	$i$	Default $= 5$

This option is part of an anti-cycling procedure designed to guarantee progress even on highly degenerate problems.

The strategy is to force a positive step at every iteration, at the expense of violating the constraints by a small amount. Suppose that the value of the optional parameter Feasibility Tolerance is $δ$. Over a period of $i$ iterations, the feasibility tolerance actually used by e04nff/e04nfa (i.e., the working feasibility tolerance) increases from $0.5 δ$ to $δ$ (in steps of $0.5 δ / i$).

At certain stages the following ‘resetting procedure’ is used to remove constraint infeasibilities. First, all variables whose upper or lower bounds are in the working set are moved exactly onto their bounds. A count is kept of the number of nontrivial adjustments made. If the count is positive, iterative refinement is used to give variables that satisfy the working set to (essentially) machine precision. Finally, the working feasibility tolerance is reinitialized to $0.5 δ$.

If a problem requires more than $i$ iterations, the resetting procedure is invoked and a new cycle of $i$ iterations is started with $i$ incremented by $10$. (The decision to resume the feasibility phase or optimality phase is based on comparing any constraint infeasibilities with $δ$.)

The resetting procedure is also invoked when e04nff/e04nfa reaches an apparently optimal, infeasible or unbounded solution, unless this situation has already occurred twice. If any nontrivial adjustments are made, iterations are continued.

If $i \leq 0$, the default value is used. If $i \geq 9999999$, no anti-cycling procedure is invoked.
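The growth of the working feasibility tolerance over one cycle can be tabulated with a few lines of code. This is a sketch of the schedule as described above, not of the routine's internals: the function names are hypothetical, and the assumption that the tolerance is increased before each of the $i$ iterations (so that the last one uses $δ$) is an interpretation of the text.

```python
def working_tolerances(delta, i):
    """Return the working feasibility tolerance used in each of the i iterations
    of one cycle, rising from 0.5*delta towards delta in steps of 0.5*delta/i."""
    step = 0.5 * delta / i
    return [0.5 * delta + k * step for k in range(1, i + 1)]


def next_cycle_length(i):
    """After a reset, the next cycle is started with i incremented by 10."""
    return i + 10
```

With the default value 5 the first cycle covers five iterations and the second fifteen, each ending with the resetting procedure and a return of the working tolerance to $0.5 δ$.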

Feasibility Phase Iteration Limit	$i_{1}$	Default $= \max (50, 5 (n + m_{L}))$
Optimality Phase Iteration Limit	$i_{2}$	Default $= \max (50, 5 (n + m_{L}))$

For problems of type FP, the scalar $i_{1}$ specifies the maximum number of iterations allowed before termination. Setting $i_{1} = 0$ and Print Level $> 0$ means that the workspace needed will be computed and printed, but no iterations will be performed.

For problems of type LP, the maximum number of iterations allowed before termination is taken as $\max (i_{1}, i_{2})$. Setting $i_{1} = 0$, $i_{2} = 0$ and Print Level $> 0$ means that the workspace needed will be computed and printed, but no iterations will be performed.

For problems of type QP, the scalars $i_{1}$ and $i_{2}$ specify the maximum number of iterations allowed in the feasibility and optimality phases. Optimality Phase Iteration Limit is equivalent to Iteration Limit. Setting $i_{1} = 0$ and Print Level $> 0$ means that the workspace needed will be computed and printed, but no iterations will be performed.

If $i_{1} < 0$ or $i_{2} < 0$, the default value is used.
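As a quick check of the default formula, the sketch below evaluates $\max (50, 5 (n + m_{L}))$ for given problem dimensions. The function name is hypothetical; only the formula itself is taken from the summary lines above.

```python
def default_iteration_limit(n, m_l):
    """Return max(50, 5 * (n + m_L)), the default for both iteration limits."""
    return max(50, 5 * (n + m_l))


# The example in Section 10 has n = 7 variables and m_L = 7 general constraints.
example_limit = default_iteration_limit(7, 7)
```

For small problems the constant 50 dominates, so the limit only starts to grow once $n + m_{L}$ exceeds 10.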

Feasibility Tolerance

r

Default

= \sqrt{ε}

If $r \geq ε$, $r$ defines the maximum acceptable absolute violation in each constraint at a ‘feasible’ point. For example, if the variables and the coefficients in the general constraints are of order unity, and the latter are correct to about $6$ decimal digits, it would be appropriate to specify $r$ as $10^{- 6}$. If $0 \leq r < ε$, the default value is used.

e04nff/e04nfa attempts to find a feasible solution before optimizing the objective function. If the sum of infeasibilities cannot be reduced to zero, the optional parameter Minimum Sum of Infeasibilities can be used to find the minimum value of the sum. Let Sinf be the corresponding sum of infeasibilities. If Sinf is quite small, it may be appropriate to raise $r$ by a factor of $10$ or $100$. Otherwise, some error in the data should be suspected.

Note that a ‘feasible solution’ is a solution that satisfies the current constraints to within the tolerance $r$.
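The effect of the tolerance can be seen in the following Python sketch, which measures the absolute violation of a single two-sided constraint and compares it with $r$. It is an illustration only: the helper names are hypothetical, sys.float_info.epsilon is used as a stand-in for the machine precision $ε$, and negative values of $r$ (which the text does not cover) are assumed to fall back to the default as well.

```python
import math
import sys

EPS = sys.float_info.epsilon  # stand-in for the machine precision (assumption)


def effective_feasibility_tolerance(r):
    """Return r if r >= EPS, otherwise the default sqrt(EPS)."""
    # Negative r is treated like 0 <= r < EPS here; this is an assumption.
    return r if r >= EPS else math.sqrt(EPS)


def violation(value, lower, upper):
    """Absolute violation of lower <= value <= upper (zero when satisfied)."""
    return max(lower - value, value - upper, 0.0)


def is_feasible(value, lower, upper, r):
    """A constraint counts as satisfied if its violation is at most the tolerance."""
    return violation(value, lower, upper) <= effective_feasibility_tolerance(r)


# A value that exceeds its upper bound by about 5e-7:
print(is_feasible(1.0000005, 0.0, 1.0, 1e-6))  # True
print(is_feasible(1.0000005, 0.0, 1.0, 0.0))   # False (default tolerance is used)
```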

Hessian Rows

i

Default

= n

Note that this option does not apply to problems of type FP or LP.

This specifies $m$, the number of rows of the Hessian matrix $H$. The default value of $m$ is $n$, the number of variables of the problem.

If the problem is of type QP, then $m$ will usually be $n$, the number of variables. However, a value of $m$ less than $n$ is appropriate for QP3 or QP4 if $H$ is an upper trapezoidal matrix with $m$ rows. Similarly, $m$ may be used to define the dimension of a leading block of nonzeros in the Hessian matrices of QP1 or QP2. In this case the last $n - m$ rows and columns of $H$ are assumed to be zero. In the QP case $m$ should not be greater than $n$; if it is, the last $m - n$ rows of $H$ are ignored.

If $i < 0$ or $i > n$, the default value is used.
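For QP1 and QP2, the statement that the last $n - m$ rows and columns of $H$ are zero means that only the first $m$ variables contribute to the quadratic term. The Python sketch below checks this on a small example, assuming the quadratic term has the form $\frac{1}{2} x^{T} H x$; the helper names and the data are hypothetical.

```python
def half_quadratic_form(h, x):
    """Return 0.5 * x^T H x for a square matrix h given as a list of rows."""
    size = len(h)
    return 0.5 * sum(x[i] * h[i][j] * x[j] for i in range(size) for j in range(size))


def pad_with_zeros(h_lead, n):
    """Embed the leading m-by-m block in an n-by-n matrix of zeros."""
    m = len(h_lead)
    return [[h_lead[i][j] if i < m and j < m else 0.0 for j in range(n)]
            for i in range(n)]


h_lead = [[2.0, 1.0], [1.0, 4.0]]  # leading block, m = 2
x = [1.0, -1.0, 5.0]               # n = 3 variables

full = half_quadratic_form(pad_with_zeros(h_lead, 3), x)
lead = half_quadratic_form(h_lead, x[:2])
print(full, lead)  # 2.0 2.0
```

The third variable has no effect on the quadratic term, so supplying only the leading block with Hessian Rows $= 2$ describes the same function.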

Infinite Bound Size

r

Default

= 10^{20}

If $r > 0$, $r$ defines the ‘infinite’ bound $bigbnd$ in the definition of the problem constraints. Any upper bound greater than or equal to $bigbnd$ will be regarded as $+ \infty$ (and similarly any lower bound less than or equal to $- bigbnd$ will be regarded as $- \infty$). If $r \leq 0$, the default value is used.
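The treatment of large bounds can be sketched in Python as follows. The function name effective_bounds is hypothetical; the arrays play the role of the lower and upper bounds supplied to the routine, and the default of $10^{20}$ is used for $bigbnd$.

```python
def effective_bounds(lower, upper, bigbnd=1e20):
    """Replace bounds at or beyond +/- bigbnd by true infinities."""
    inf = float("inf")
    lo = [-inf if value <= -bigbnd else value for value in lower]
    hi = [inf if value >= bigbnd else value for value in upper]
    return lo, hi


lo, hi = effective_bounds([-1e20, 0.0, -1e25], [1e20, 5.0, 1e19])
print(lo)  # [-inf, 0.0, -inf]
print(hi)  # [inf, 5.0, 1e+19]
```

Note that an upper bound of $10^{19}$ is still treated as a genuine (finite) bound under the default setting.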

Infinite Step Size

r

Default

= \max (bigbnd, 10^{20})

If $r > 0$, $r$ specifies the magnitude of the change in variables that will be considered a step to an unbounded solution. (Note that an unbounded solution can occur only when the Hessian is not positive definite.) If the change in $x$ during an iteration would exceed the value of $r$, then the objective function is considered to be unbounded below in the feasible region. If $r \leq 0$, the default value is used.

Iteration Limit

i

Default

= \max (50, 5 (n + m_{L}))

Iters

i

Itns

i

See optional parameter Feasibility Phase Iteration Limit.

List

Default for e04nff

= List

Nolist

Default for e04nfa

= Nolist

Optional parameter List enables printing of each optional parameter specification as it is supplied. Nolist suppresses this printing.

Maximum Degrees of Freedom

i

Default

= n

Note that this option does not apply to problems of type FP or LP.

This places a limit on the storage allocated for the triangular factor $R$ of the reduced Hessian $H_{R}$. Ideally, $i$ should be set slightly larger than the value of $n_{R}$ expected at the solution. It need not be larger than $m_{n} + 1$, where $m_{n}$ is the number of variables that appear nonlinearly in the quadratic objective function. For many problems it can be much smaller than $m_{n}$.

For quadratic problems, a minimizer may lie on any number of constraints, so that $n_{R}$ may vary between $1$ and $n$. The default value of $i$ is, therefore, the number of variables $n$. If Hessian Rows $m$ is specified, the default value of $i$ is the same number, $m$.

Minimum Sum of Infeasibilities

a

Default

= NO

If no feasible point exists for the constraints, then this option is used to control whether or not e04nff/e04nfa will calculate a point that minimizes the constraint violations. If Minimum Sum of Infeasibilities = NO, e04nff/e04nfa will terminate as soon as it is evident that no feasible point exists for the constraints. The final point will generally not be the point at which the sum of infeasibilities is minimized. If Minimum Sum of Infeasibilities = YES, e04nff/e04nfa will continue until the sum of infeasibilities is minimized.

Monitoring File

i

Default

= - 1

If $i \geq 0$ and Print Level $\geq 5$, monitoring information produced by e04nff/e04nfa at every iteration is sent to a file with logical unit number $i$. If $i < 0$ and/or Print Level $< 5$, no monitoring information is produced.

Optimality Tolerance

r

Default

= ε^{0.8}

If $r \geq ε$, $r$ defines the tolerance used to determine if the bounds and general constraints have the right ‘sign’ for the solution to be judged to be optimal.

If $0 \leq r < ε$, the default value is used.

Print Level

i

Default for e04nff

= 10

Default for e04nfa

= 0

The value of $i$ controls the amount of printout produced by e04nff/e04nfa, as indicated below. A detailed description of the printed output is given in Section 9.2 (summary output at each iteration and the final solution) and Section 13 (monitoring information at each iteration). If $i < 0$, the default value is used.

The following printout is sent to the current advisory message unit (as defined by x04abf):

$i$	Output
$0$	No output.
$1$	The final solution only.
$5$	One line of summary output ( $< 80$ characters; see Section 9.2) for each iteration (no printout of the final solution).
$\geq 10$	The final solution and one line of summary output for each iteration.

The following printout is sent to the unit number given by the optional parameter Monitoring File:

$i$	Output
$< 5$	No output.
$\geq 5$	One long line of output ( $> 80$ characters; see Section 13) for each iteration (no printout of the final solution).
$\geq 20$	At each iteration: the Lagrange multipliers, the variables $x$ , the constraint values $A x$ and the constraint status (see istate).
$\geq 30$	At each iteration: the diagonal elements of the upper triangular matrix $T$ associated with the $T Q$ factorization (3) (see Section 11.2) of the working set and the diagonal elements of the upper triangular matrix $R$ .

If Print Level $\geq 5$ and the unit number defined by the optional parameter Monitoring File is the same as that defined by x04abf, the summary output is suppressed.

Problem Type

a

Default

= QP2

This option specifies the type of objective function to be minimized during the optimality phase. The following are the five optional keywords and the dimensions of the arrays that must be specified in order to define the objective function:

LP	h not referenced, length-n cvec required;
QP1	$h (ldh, *)$ symmetric, cvec not referenced;
QP2	$h (ldh, *)$ symmetric, length-n cvec required;
QP3	$h (ldh, *)$ upper trapezoidal, cvec not referenced;
QP4	$h (ldh, *)$ upper trapezoidal, length-n cvec required.

For problems of type FP the objective function is omitted and neither h nor cvec are referenced.

The following keywords are also acceptable. The minimum abbreviation of each keyword is underlined.

$a$	Option
Quadratic	QP2
Linear	LP
Feasible	FP

In addition, the keyword QP is equivalent to the default option QP2.

If $H = 0$ (i.e., the objective function is purely linear), the efficiency of e04nff/e04nfa may be increased by specifying $a$ as LP.
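The keyword lists for Problem Type can be summarised as a lookup table. The following Python sketch maps a keyword to its problem type and reports the structure assumed for h and whether cvec is required. It is illustrative only: the names are hypothetical, only the full keywords are recognised (minimum abbreviations are not handled), and case-insensitive matching is an assumption.

```python
# (structure of h, cvec required) for each problem type
REQUIREMENTS = {
    "FP": (None, False),
    "LP": (None, True),
    "QP1": ("symmetric", False),
    "QP2": ("symmetric", True),
    "QP3": ("upper trapezoidal", False),
    "QP4": ("upper trapezoidal", True),
}

# Alternative keywords and the option each one selects
ALIASES = {"QUADRATIC": "QP2", "LINEAR": "LP", "FEASIBLE": "FP", "QP": "QP2"}


def resolve_problem_type(keyword="QP2"):
    """Return (problem type, (h structure, cvec required)) for a keyword."""
    key = keyword.strip().upper()
    key = ALIASES.get(key, key)
    if key not in REQUIREMENTS:
        raise ValueError("unrecognised Problem Type keyword: " + keyword)
    return key, REQUIREMENTS[key]


print(resolve_problem_type("Quadratic"))  # ('QP2', ('symmetric', True))
print(resolve_problem_type("Linear"))     # ('LP', (None, True))
```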

Rank Tolerance

r

Default

= 100 ε

Note that this option does not apply to problems of type FP or LP.

This optional parameter enables you to control the condition number of the triangular factor $R$ (see Section 11). If $ρ_{i}$ denotes the function $ρ_{i} = \max \{| R_{11} |, | R_{22} |, \dots, | R_{i i} |\}$, the dimension of $R$ is defined to be the smallest index $i$ such that $| R_{i + 1, i + 1} | \leq \sqrt{r} | ρ_{i + 1} |$. If $r \leq 0$, the default value is used.
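The rule for the dimension of $R$ can be applied directly to a list of diagonal elements, as in the Python sketch below. The function name rank_dimension is hypothetical, sys.float_info.epsilon stands in for $ε$, and the sketch assumes that the full dimension is returned when no diagonal element satisfies the test.

```python
import math
import sys

EPS = sys.float_info.epsilon  # stand-in for the machine precision (assumption)


def rank_dimension(diagonal, r=100.0 * EPS):
    """Smallest i with |R_{i+1,i+1}| <= sqrt(r) * rho_{i+1}, else len(diagonal)."""
    if r <= 0.0:
        r = 100.0 * EPS  # nonpositive r selects the default
    rho = 0.0
    for i, element in enumerate(diagonal):
        # With 0-based i, element is R_{i+1,i+1} and rho becomes rho_{i+1}.
        rho = max(rho, abs(element))
        if abs(element) <= math.sqrt(r) * rho:
            return i
    return len(diagonal)


print(rank_dimension([4.0, 2.0, 1e-9, 3.0]))  # 2
print(rank_dimension([1.0, 0.5]))             # 2
```

In the first call the third diagonal element is negligible relative to the largest element seen so far, so the dimension of $R$ is taken to be $2$.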

13 Description of Monitoring Information

This section describes the long line of output ($> 80$ characters) which forms part of the monitoring information produced by e04nff/e04nfa. (See also the description of the optional parameters Monitoring File and Print Level.) You can control the level of printed output.

To aid interpretation of the printed results the following convention is used for numbering the constraints: indices $1$ through $n$ refer to the bounds on the variables and indices $n + 1$ through $n + m_{L}$ refer to the general constraints. When the status of a constraint changes, the index of the constraint is printed, along with the designation L (lower bound), U (upper bound), E (equality), F (temporarily fixed variable) or A (artificial constraint).
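The numbering convention can be decoded mechanically. The Python sketch below converts a printed constraint index into either a variable bound or a general constraint; the function name describe_constraint is hypothetical.

```python
def describe_constraint(j, n, m_l):
    """Map a printed index j to ('bound', variable) or ('general', row)."""
    if 1 <= j <= n:
        return ("bound", j)        # bound on variable j
    if n < j <= n + m_l:
        return ("general", j - n)  # general constraint number j - n
    raise ValueError("index outside 1..n + m_L")


print(describe_constraint(3, 7, 7))   # ('bound', 3)
print(describe_constraint(8, 7, 7))   # ('general', 1)
```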

When Print Level $\geq 5$ and Monitoring File $\geq 0$, the following line of output is produced at every iteration on the unit number specified by the Monitoring File. In all cases the values of the quantities printed are those in effect on completion of the given iteration.

Itn	is the iteration count.
Jdel	is the index of the constraint deleted from the working set. If Jdel is zero, no constraint was deleted.
Jadd	is the index of the constraint added to the working set. If Jadd is zero, no constraint was added.
Step	is the step taken along the computed search direction. If a constraint is added during the current iteration, Step will be the step to the nearest constraint. When the problem is of type LP, the step can be greater than $1$ during the optimality phase.
Ninf	is the number of violated constraints (infeasibilities). This will be zero during the optimality phase.
Sinf/Objective	is the value of the current objective function. If $x$ is not feasible, Sinf gives a weighted sum of the magnitudes of constraint violations. If $x$ is feasible, Objective is the value of the objective function of (1). The output line for the final iteration of the feasibility phase (i.e., the first iteration for which Ninf is zero) will give the value of the true objective at the first feasible point. During the optimality phase the value of the objective function will be nonincreasing. During the feasibility phase the number of constraint infeasibilities will not increase until either a feasible point is found or the optimality of the multipliers implies that no feasible point exists. Once optimal multipliers are obtained the number of infeasibilities can increase, but the sum of infeasibilities will either remain constant or be reduced until the minimum sum of infeasibilities is found.
Bnd	is the number of simple bound constraints in the current working set.
Lin	is the number of general linear constraints in the current working set.
Art	is the number of artificial constraints in the working set, i.e., the number of columns of $Z_{A}$ (see Section 11.4).
Zr	is the number of columns of $Z_{1}$ (see Section 11.2). Zr is the dimension of the subspace in which the objective function is currently being minimized. The value of Zr is the number of variables minus the number of constraints in the working set; i.e., $Zr = n - (Bnd + Lin + Art)$ . The value of $n_{Z}$ , the number of columns of $Z$ (see Section 11.2) can be calculated as $n_{Z} = n - (Bnd + Lin)$ . A zero value of $n_{Z}$ implies that $x$ lies at a vertex of the feasible region.
Norm Gz	is $‖ Z_{R}^{T} g_{FR} ‖$ , the Euclidean norm of the reduced gradient with respect to $Z_{R}$ . During the optimality phase, this norm will be approximately zero after a unit step.
NOpt	is the number of nonoptimal Lagrange multipliers at the current point. NOpt is not printed if the current $x$ is infeasible or no multipliers have been calculated. At a minimizer, NOpt will be zero.
Min Lm	is the value of the Lagrange multiplier associated with the deleted constraint. If Min Lm is negative, a lower bound constraint has been deleted; if Min Lm is positive, an upper bound constraint has been deleted. If no multipliers are calculated during a given iteration, Min Lm will be zero.
Cond T	is a lower bound on the condition number of the working set.
Cond Rz	is a lower bound on the condition number of the triangular factor $R$ (the Cholesky factor of the current reduced Hessian; see Section 11.2). If the problem is specified to be of type LP then Cond Rz is not printed.
Rzz	is the last diagonal element $μ$ of the matrix $D$ associated with the $R^{T} D R$ factorization of the reduced Hessian $H_{R}$ (see Section 11.2). Rzz is only printed if $H_{R}$ is not positive definite (in which case $μ \neq 1$ ). If the printed value of Rzz is small in absolute value then $H_{R}$ is approximately singular. A negative value of Rzz implies that the objective function has negative curvature on the current working set.
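The relations given for Zr and $n_{Z}$ can be checked from the printed counts Bnd, Lin and Art. The following Python sketch does this for one line of monitoring output; the function name and the sample values are hypothetical.

```python
def subspace_dimensions(n, bnd, lin, art):
    """Return (Zr, n_Z) from the working-set counts printed on one line."""
    n_z = n - (bnd + lin)      # number of columns of Z
    zr = n - (bnd + lin + art)  # number of columns of Z_1
    return zr, n_z


zr, n_z = subspace_dimensions(7, 3, 2, 1)
print(zr, n_z)  # 1 2

# n_Z = 0 indicates that x lies at a vertex of the feasible region.
print(subspace_dimensions(7, 4, 3, 0))  # (0, 0)
```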
