
NAG FL Interface
e04nqf (qpconvex2_sparse_solve)

Note: this routine uses optional parameters to define choices in the problem specification and in the details of the algorithm. If you wish to use default settings for all of the optional parameters, you need only read Sections 1 to 10 of this document. If, however, you wish to reset some or all of the settings please refer to Section 11 for a detailed description of the algorithm, to Section 12 for a detailed description of the specification of the optional parameters and to Section 13 for a detailed description of the monitoring information produced by the routine.


NAG Library Manual, Mark 27.3


Contents

1 Purpose
2 Specification
3 Description
4 References
5 Arguments
6 Error Indicators and Warnings
7 Accuracy
8 Parallelism and Performance
9 Further Comments
9.1 Description of the Printed Output
10 Example
10.1 Program Text
10.2 Program Data
10.3 Program Results
11 Algorithmic Details
11.1 Overview
11.2 Definition of the Working Set and Search Direction
11.3 Main Iteration
11.4 Miscellaneous
12 Optional Parameters
12.1 Description of the Optional Parameters
13 Description of Monitoring Information
13.1 Crash Statistics
13.2 Basis Factorization Statistics
13.3 Basis Map
13.4 Solution Output
13.4.1 The ROWS Section
13.4.2 The COLUMNS Section
13.5 The Solution File
13.6 The Summary File

1 Purpose

e04nqf solves sparse linear programming or convex quadratic programming problems. The initialization routine e04npf must have been called before calling e04nqf.

2 Specification

Fortran Interface

Subroutine e04nqf (start, qphx, m, n, ne, nname, lenc, ncolh, iobj, objadd, prob, acol, inda, loca, bl, bu, c, names, helast, hs, x, pi, rc, ns, ninf, sinf, obj, cw, lencw, iw, leniw, rw, lenrw, cuser, iuser, ruser, ifail)

Integer, Intent (In)	::	m, n, ne, nname, lenc, ncolh, iobj, inda(ne), loca(n+1), helast(n+m), lencw, leniw, lenrw
Integer, Intent (Inout)	::	hs(n+m), ns, iw(leniw), iuser(*), ifail
Integer, Intent (Out)	::	ninf
Real (Kind=nag_wp), Intent (In)	::	objadd
Real (Kind=nag_wp), Intent (Inout)	::	acol(ne), bl(n+m), bu(n+m), c(max(1,lenc)), x(n+m), rw(lenrw), ruser(*)
Real (Kind=nag_wp), Intent (Out)	::	pi(m), rc(n+m), sinf, obj
Character (1), Intent (In)	::	start
Character (8), Intent (In)	::	prob, names(nname)
Character (8), Intent (InOut)	::	cw(lencw), cuser(*)
External	::	qphx

C Header Interface

#include <nag.h>

void e04nqf_ (const char *start,
void (NAG_CALL *qphx)(const Integer *ncolh, const double x[], double hx[], const Integer *nstate, char cuser[], Integer iuser[], double ruser[], const Charlen length_cuser),
const Integer *m, const Integer *n, const Integer *ne, const Integer *nname, const Integer *lenc, const Integer *ncolh, const Integer *iobj, const double *objadd, const char *prob, double acol[], const Integer inda[], const Integer loca[], double bl[], double bu[], double c[], const char names[], const Integer helast[], Integer hs[], double x[], double pi[], double rc[], Integer *ns, Integer *ninf, double *sinf, double *obj, char cw[], const Integer *lencw, Integer iw[], const Integer *leniw, double rw[], const Integer *lenrw, char cuser[], Integer iuser[], double ruser[], Integer *ifail, const Charlen length_start, const Charlen length_prob, const Charlen length_names, const Charlen length_cw, const Charlen length_cuser)

The routine may be called by the names e04nqf or nagf_opt_qpconvex2_sparse_solve.

Before calling e04nqf or one of the option setting routines e04nrf, e04nsf, e04ntf or e04nuf, e04npf must be called.

The specification for e04npf is:

Fortran Interface

Subroutine e04npf (cw, lencw, iw, leniw, rw, lenrw, ifail)

Integer, Intent (In)	::	lencw, leniw, lenrw
Integer, Intent (Inout)	::	ifail
Integer, Intent (Out)	::	iw(leniw)
Real (Kind=nag_wp), Intent (Out)	::	rw(lenrw)
Character (8), Intent (Out)	::	cw(lencw)

C Header Interface

#include <nag.h>

void	e04npf_ (char cw[], const Integer *lencw, Integer iw[], const Integer *leniw, double rw[], const Integer *lenrw, Integer *ifail, const Charlen length_cw)

lencw, leniw and lenrw, the declared lengths of cw, iw and rw respectively, must satisfy:

$lencw \geq 600$
$leniw \geq 600$
$lenrw \geq 600$

The contents of the arrays cw, iw and rw must not be altered between calling routines e04npf, e04nqf, e04nrf, e04nsf, e04ntf and e04nuf.

After calling e04nqf you can call one or both of the routines e04nxf or e04nyf to obtain the current value of an optional parameter.

3 Description

e04nqf is designed to solve large-scale linear or quadratic programming problems of the form:

$$\underset{x \in R^{n}}{\text{minimize}} \; f(x) \quad \text{subject to} \quad l \leq \begin{pmatrix} x \\ A x \end{pmatrix} \leq u, \qquad (1)$$

where $x$ is an $n$-vector of variables, $l$ and $u$ are constant lower and upper bounds, $A$ is an $m \times n$ sparse matrix and $f(x)$ is a linear or quadratic objective function that may be specified in a variety of ways, depending upon the particular problem being solved. The optional parameter Maximize may be used to specify a problem in which $f(x)$ is maximized instead of minimized.

Upper and lower bounds are specified for all variables and constraints. This form allows full generality in specifying various types of constraint. In particular, the $j$th constraint may be defined as an equality by setting $l_{j} = u_{j}$. If certain bounds are not present, the associated elements of $l$ or $u$ may be set to special values that are treated as $-\infty$ or $+\infty$.

The possible forms for the function $f(x)$ are summarised in Table 1. The most general form for $f(x)$ is

$$f(x) = q + c^{T} x + \frac{1}{2} x^{T} H x = q + \sum_{j = 1}^{n} c_{j} x_{j} + \frac{1}{2} \sum_{i = 1}^{n} \sum_{j = 1}^{n} x_{i} H_{i j} x_{j},$$

where $q$ is a constant, $c$ is a constant $n$-vector and $H$ is a constant symmetric $n \times n$ matrix with elements $H_{i j}$. In this form, $f$ is a quadratic function of $x$ and (1) is known as a quadratic program (QP). e04nqf is suitable for all convex quadratic programs. The defining feature of a convex QP is that the matrix $H$ must be positive semidefinite, i.e., it must satisfy $x^{T} H x \geq 0$ for all $x$. If not, $f(x)$ is nonconvex and e04nqf will terminate with the error indicator $ifail = 11$. If $f(x)$ is nonconvex it may be more appropriate to call e04vhf instead.

**Table 1**
Choices for the objective function $f (x)$
Problem type	Objective function $f (x)$	Hessian matrix $H$
FP	Not applicable	$q = c = H = 0$
LP	$q + c^{T} x$	$H = 0$
QP	$q + c^{T} x + \frac{1}{2} x^{T} H x$	Symmetric positive semidefinite

If $H = 0$, then $f(x) = q + c^{T} x$ and the problem is known as a linear program (LP). In this case, rather than defining an $H$ with zero elements, you can define $H$ to have no columns by setting $ncolh = 0$ (see Section 5).

If $H = 0$, $q = 0$, and $c = 0$, there is no objective function and the problem is a feasible point problem (FP), which is equivalent to finding a point that satisfies the constraints on $x$. In the situation where no feasible point exists, several options are available for finding a point that minimizes the constraint violations (see the description of the optional parameter Elastic Mode).

e04nqf is suitable for large LPs and QPs in which the matrix $A$ is sparse, i.e., when the number of zero elements is sufficiently large that it is worthwhile using algorithms which avoid computations and storage involving zero elements. The matrix $A$ is input to e04nqf by means of the three array arguments acol, inda and loca. This allows you to specify the pattern of nonzero elements in $A$.

e04nqf exploits structure in $H$ by requiring $H$ to be defined implicitly in a subroutine that computes the product $H x$ for any given vector $x$. In many cases, the product $H x$ can be computed very efficiently for any given $x$, e.g., $H$ may be a sparse matrix, or a sum of matrices of rank-one.
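The implicit definition of $H$ can be illustrated with a minimal Python sketch (not NAG code). It assumes, purely for illustration, that $H$ is a non-negative diagonal matrix plus a sum of rank-one terms $u_{k} u_{k}^{T}$, so that $H$ is positive semidefinite; the names `make_hx`, `diag` and `rank_one_vectors` are hypothetical. The product $H x$ is formed without ever storing $H$.

```python
def make_hx(diag, rank_one_vectors):
    """Return a function computing H x for H = diag(diag) + sum_k u_k u_k^T.

    The matrix H is never formed; each rank-one term costs one dot product
    and one scaled vector addition.
    """
    def hx(x):
        # Diagonal contribution: (diag(d) x)_i = d_i * x_i
        result = [d * xi for d, xi in zip(diag, x)]
        for u in rank_one_vectors:
            dot = sum(ui * xi for ui, xi in zip(u, x))  # scalar u^T x
            for i, ui in enumerate(u):
                result[i] += dot * ui                   # add (u^T x) u
        return result
    return hx


# Hypothetical data: H = diag(1, 2, 0) + u u^T with u = (1, 1, 0)
hx = make_hx([1.0, 2.0, 0.0], [[1.0, 1.0, 0.0]])
product = hx([1.0, 2.0, 3.0])
print(product)
```

A user-supplied qphx would play the role of `hx` here, writing the result into its output array hx.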

For problems in which $A$ can be treated as a dense matrix, it is usually more efficient to use e04mff/e04mfa, e04ncf/e04nca or e04nff/e04nfa.

There is considerable flexibility allowed in the definition of $f(x)$ in Table 1. The vector $c$ defining the linear term $c^{T} x$ can be input in three ways: as a sparse row of $A$; as an explicit dense vector $c$; or as both a sparse row and an explicit vector (in which case, $c^{T} x$ will be the sum of two linear terms). When stored in $A$, $c$ is the iobjth row of $A$, which is known as the objective row. The objective row must always be a free row of $A$ in the sense that its lower and upper bounds must be $-\infty$ and $+\infty$. Storing $c$ as part of $A$ is recommended if $c$ is a sparse vector. Storing $c$ as an explicit vector is recommended for a sequence of problems, each with a different objective (see arguments c and lenc).

The upper and lower bounds on the $m$ elements of $A x$ are said to define the general constraints of the problem. Internally, e04nqf converts the general constraints to equalities by introducing a set of slack variables $s$, where $s = (s_{1}, s_{2}, \dots, s_{m})^{T}$. For example, the linear constraint $5 \leq 2 x_{1} + 3 x_{2} \leq +\infty$ is replaced by $2 x_{1} + 3 x_{2} - s_{1} = 0$, together with the bounded slack $5 \leq s_{1} \leq +\infty$. The problem defined by (1) can, therefore, be re-written in the following equivalent form:

$$\underset{x \in R^{n}, s \in R^{m}}{\text{minimize}} \; f(x) \quad \text{subject to} \quad A x - s = 0, \quad l \leq \begin{pmatrix} x \\ s \end{pmatrix} \leq u.$$

Since the slack variables $s$ are subject to the same upper and lower bounds as the elements of $A x$, the bounds on $x$ and $A x$ can simply be thought of as bounds on the combined vector $(x, s)$. (In order to indicate their special role in QP problems, the original variables $x$ are sometimes known as ‘column variables’, and the slack variables $s$ are known as ‘row variables’.)

Each LP or QP problem is solved using a two-phase iterative procedure (in which the general constraints are satisfied throughout): a feasibility phase (Phase 1), in which the sum of infeasibilities with respect to the bounds on $x$ and $s$ is minimized to find a feasible point that satisfies all constraints within a specified feasibility tolerance; and an optimality phase (Phase 2), in which $f(x)$ is minimized (or maximized) by constructing a sequence of iterates that lies within the feasible region.

Phase 1 involves solving a linear program of the form

Phase 1
	$\underset{x, s, v, w}{\text{minimize}} \; \sum_{j = 1}^{n + m} (v_{j} + w_{j})$
	$\text{subject to} \quad A x - s = 0, \quad l \leq \begin{pmatrix} x \\ s \end{pmatrix} - v + w \leq u, \quad v \geq 0, \quad w \geq 0$

which is equivalent to minimizing the sum of the constraint violations. If the constraints are feasible (i.e., at least one feasible point exists), eventually a point will be found at which both $v$ and $w$ are zero. Then the associated value of $(x, s)$ satisfies the original constraints and is used as the starting point for the Phase 2 iterations for minimizing $f(x)$.
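As a rough illustration of the quantity that Phase 1 drives to zero, the following Python sketch (not NAG code) sums the amounts by which each element of $(x, s)$ lies outside its bounds. The function name and the sample data are hypothetical, and infinite bounds are represented by Python's `inf` rather than by the library's special bound value.

```python
def sum_of_infeasibilities(values, lower, upper):
    """Sum of bound violations of the combined vector (x, s).

    Each element contributes the distance by which it falls below its
    lower bound or exceeds its upper bound; feasible elements add zero.
    """
    total = 0.0
    for val, lo, up in zip(values, lower, upper):
        lower_violation = max(lo - val, 0.0)  # positive only if val < lo
        upper_violation = max(val - up, 0.0)  # positive only if val > up
        total += lower_violation + upper_violation
    return total


inf = float("inf")
# Hypothetical point: the first element is feasible, the second exceeds its
# upper bound by 0.5, and the third is below its lower bound by 1.0.
sinf = sum_of_infeasibilities([0.5, 3.0, -2.0],
                              [0.0, -inf, -1.0],
                              [1.0, 2.5, inf])
print(sinf)
```

A value of zero corresponds to a point that satisfies every bound, which is the condition under which Phase 2 can begin.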

If the constraints are infeasible (i.e., $v \neq 0$ or $w \neq 0$ at the end of Phase 1), no solution exists for (1) and you have the option of either terminating or continuing in so-called elastic mode (see the discussion of the optional parameter Elastic Mode). In elastic mode, a ‘relaxed’ or ‘perturbed’ problem is solved in which $f(x)$ is minimized while allowing some of the bounds to become ‘elastic’, i.e., to change from their specified values. Variables subject to elastic bounds are known as elastic variables. An elastic variable is free to violate one or both of its original upper or lower bounds. You are able to assign which bounds will become elastic if elastic mode is ever started (see the argument helast in Section 5).

To make the relaxed problem meaningful, e04nqf minimizes $f(x)$ while (in some sense) finding the ‘smallest’ violation of the elastic variables. In the situation where all the variables are elastic, the relaxed problem has the form

Phase 2 ($\gamma$)
	$\underset{x, s, v, w}{\text{minimize}} \; f(x) + \gamma \sum_{j = 1}^{n + m} (v_{j} + w_{j})$
	$\text{subject to} \quad A x - s = 0, \quad l \leq \begin{pmatrix} x \\ s \end{pmatrix} - v + w \leq u, \quad v \geq 0, \quad w \geq 0,$

where $\gamma$ is a non-negative argument known as the elastic weight (see the description of the optional parameter Elastic Weight), and $f(x) + \gamma \sum_{j} (v_{j} + w_{j})$ is called the composite objective. In the more general situation where only a subset of the bounds are elastic, the $v$'s and $w$'s for the non-elastic bounds are fixed at zero.

The elastic weight can be chosen to make the composite objective behave like the original objective $f(x)$, the sum of infeasibilities, or anything in-between. If $\gamma = 0$, e04nqf will attempt to minimize $f$ subject to the (true) upper and lower bounds on the non-elastic variables (and declare the problem infeasible if the non-elastic variables cannot be made feasible).

At the other extreme, choosing $\gamma$ sufficiently large will have the effect of minimizing the sum of the violations of the elastic variables subject to the original constraints on the non-elastic variables. Choosing a large value of the elastic weight is useful for defining a ‘least-infeasible’ point for an infeasible problem.

In Phase 1 and elastic mode, all calculations involving $v$ and $w$ are done implicitly in the sense that an elastic variable $x_{j}$ is allowed to violate its lower bound (say) and an explicit value of $v$ can be recovered as $v_{j} = l_{j} - x_{j}$.

A constraint is said to be active or binding at $x$ if the associated element of either $x$ or $A x$ is equal to one of its upper or lower bounds. Since an active constraint in $A x$ has its associated slack variable at a bound, the status of both simple and general upper and lower bounds can be conveniently described in terms of the status of the variables $(x, s)$. A variable is said to be nonbasic if it is temporarily fixed at its upper or lower bound. It follows that regarding a general constraint as being active is equivalent to thinking of its associated slack as being nonbasic.

At each iteration of an active-set method, the constraints $A x - s = 0$ are (conceptually) partitioned into the form

$$B x_{B} + S x_{S} + N x_{N} = 0,$$

where $x_{N}$ consists of the nonbasic elements of $(x, s)$ and the basis matrix $B$ is square and nonsingular. The elements of $x_{B}$ and $x_{S}$ are called the basic and superbasic variables respectively; with $x_{N}$ they are a permutation of the elements of $x$ and $s$. At a QP solution, the basic and superbasic variables will lie somewhere between their lower and upper bounds, while the nonbasic variables will be equal to one of their bounds. At each iteration, $x_{S}$ is regarded as a set of independent variables that are free to move in any desired direction, namely one that will improve the value of the objective function (or sum of infeasibilities). The basic variables are then adjusted in order to ensure that $(x, s)$ continues to satisfy $A x - s = 0$. The number of superbasic variables ($n_{S}$ say), therefore, indicates the number of degrees of freedom remaining after the constraints have been satisfied. In broad terms, $n_{S}$ is a measure of how nonlinear the problem is. In particular, $n_{S}$ will always be zero for FP and LP problems.

If it appears that no improvement can be made with the current definition of $B$, $S$ and $N$, a nonbasic variable is selected to be added to $S$, and the process is repeated with the value of $n_{S}$ increased by one. At all stages, if a basic or superbasic variable encounters one of its bounds, the variable is made nonbasic and the value of $n_{S}$ is decreased by one.

Associated with each of the $m$ equality constraints $A x - s = 0$ is a dual variable $\pi_{i}$. Similarly, each variable in $(x, s)$ has an associated reduced gradient $d_{j}$ (also known as a reduced cost). The reduced gradients for the variables $x$ are the quantities $g - A^{T} \pi$, where $g$ is the gradient of the QP objective function, and the reduced gradients for the slack variables $s$ are the dual variables $\pi$. The QP subproblem is optimal if $d_{j} \geq 0$ for all nonbasic variables at their lower bounds, $d_{j} \leq 0$ for all nonbasic variables at their upper bounds and $d_{j} = 0$ for all superbasic variables. In practice, an approximate QP solution is found by slightly relaxing these conditions on $d_{j}$ (see the description of the optional parameter Optimality Tolerance).
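The sign conditions on $d_{j}$ can be expressed as a short Python sketch (not NAG code). The state labels and the absolute tolerance `tol` are hypothetical stand-ins; the way e04nqf actually applies the optional parameter Optimality Tolerance is described elsewhere in this document and may differ from this simplified test.

```python
def is_optimal(states, d, tol=1.0e-6):
    """Check the relaxed sign conditions on the reduced gradients d_j.

    states[j] is one of 'lower' (nonbasic at lower bound), 'upper'
    (nonbasic at upper bound), 'superbasic' or 'basic'.
    """
    for state, dj in zip(states, d):
        if state == "lower" and dj < -tol:
            return False          # decreasing cost available by moving up
        if state == "upper" and dj > tol:
            return False          # decreasing cost available by moving down
        if state == "superbasic" and abs(dj) > tol:
            return False          # a free direction still has nonzero slope
        # Basic variables impose no sign condition in this sketch.
    return True


states = ["lower", "upper", "superbasic", "basic"]
optimal = is_optimal(states, [0.3, -0.2, 0.0, 0.0])
not_optimal = is_optimal(states, [-0.3, -0.2, 0.0, 0.0])
print(optimal, not_optimal)
```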

The process of computing and comparing reduced gradients is known as pricing (a term first introduced in the context of the simplex method for linear programming). To ‘price’ a nonbasic variable $x_{j}$ means that the reduced gradient $d_{j}$ associated with the relevant active upper or lower bound on $x_{j}$ is computed via the formula $d_{j} = g_{j} - a_{j}^{T} \pi$, where $a_{j}$ is the $j$th column of $\begin{pmatrix} A & -I \end{pmatrix}$. (The variable selected by such a process and the corresponding value of $d_{j}$ (i.e., its reduced gradient) are the quantities +SBS and dj in the monitoring file output; see Section 9.1.) If $A$ has significantly more columns than rows (i.e., $n \gg m$), pricing can be computationally expensive. In this case, a strategy known as partial pricing can be used to compute and compare only a subset of the $d_{j}$.
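The pricing formula can be sketched in Python as follows (an illustration only, not the routine's implementation). The sketch assumes that $A$ is held column-wise in arrays laid out like acol, inda and loca with 1-based index values, and it prices every column rather than a subset; the function name `reduced_gradients` is hypothetical. For a slack column, $a_{j} = -e_{i}$ and $g_{j} = 0$, so $d_{j} = \pi_{i}$, in agreement with the statement that the reduced gradients of the slacks are the dual variables.

```python
def reduced_gradients(g, pi, acol, inda, loca):
    """Compute d_j = g_j - a_j^T pi for every column of (A  -I).

    g    : gradient with respect to the n column variables x
    pi   : the m dual variables
    acol, inda, loca : column-wise storage of A; inda and loca hold
                       1-based indices, as in the Fortran interface
    """
    n = len(g)
    d = []
    for j in range(n):
        start = loca[j] - 1        # convert 1-based start to 0-based
        stop = loca[j + 1] - 1     # one past the last entry of column j
        a_dot_pi = sum(acol[k] * pi[inda[k] - 1] for k in range(start, stop))
        d.append(g[j] - a_dot_pi)
    # Slack columns: g_j = 0 and a_j = -e_i, hence d_j = pi_i.
    d.extend(pi)
    return d


# Hypothetical 1 x 2 example: A = (2 3), g = (4, 9), pi = (2)
d = reduced_gradients([4.0, 9.0], [2.0], [2.0, 3.0], [1, 1], [1, 2, 3])
print(d)
```

Partial pricing would restrict the loop over `j` to a subset of the columns at each iteration.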

e04nqf is based on SQOPT, which is part of the SNOPT package described in Gill et al. (2005a). It uses stable numerical methods throughout and includes a reliable basis package (for maintaining sparse $L U$ factors of the basis matrix $B$), a practical anti-degeneracy procedure, efficient handling of linear constraints and bounds on the variables (by an active-set strategy), as well as automatic scaling of the constraints. Further details can be found in Section 11.

4 References

Fourer R (1982) Solving staircase linear programs by the simplex method Math. Programming 23 274–313

Gill P E and Murray W (1978) Numerically stable methods for quadratic programming Math. Programming 14 349–372

Gill P E, Murray W and Saunders M A (1995) User's guide for QPOPT 1.0: a Fortran package for quadratic programming Report SOL 95-4 Department of Operations Research, Stanford University

Gill P E, Murray W and Saunders M A (2005a) Users' guide for SQOPT 7: a Fortran package for large-scale linear and quadratic programming Report NA 05-1 Department of Mathematics, University of California, San Diego https://www.ccom.ucsd.edu/~peg/papers/sqdoc7.pdf

Gill P E, Murray W and Saunders M A (2005b) Users' guide for SNOPT 7.1: a Fortran package for large-scale nonlinear programming Report NA 05-2 Department of Mathematics, University of California, San Diego https://www.ccom.ucsd.edu/~peg/papers/sndoc7.pdf

Gill P E, Murray W, Saunders M A and Wright M H (1987) Maintaining $L U$ factors of a general sparse matrix Linear Algebra and its Applics. 88/89 239–270

Gill P E, Murray W, Saunders M A and Wright M H (1989) A practical anti-cycling procedure for linearly constrained optimization Math. Programming 45 437–474

Gill P E, Murray W, Saunders M A and Wright M H (1991) Inertia-controlling methods for general quadratic programming SIAM Rev. 33 1–36

Hall J A J and McKinnon K I M (1996) The simplest examples where the simplex method cycles and conditions where EXPAND fails to prevent cycling Report MS 96–100 Department of Mathematics and Statistics, University of Edinburgh

5 Arguments

The first $n$ entries of the arguments bl, bu, hs and x refer to the variables $x$. The last $m$ entries refer to the slacks $s$.

1: $start$ – Character(1) Input

On entry: indicates how a starting basis (and certain other items) will be obtained.

$start ='C'$: Requests that an internal Crash procedure be used to choose an initial basis, unless a Basis file is provided via optional parameters Old Basis File, Insert File or Load File.
$start ='B'$: Is the same as $start ='C'$ but is more meaningful when a Basis file is given.
$start ='W'$: Means that a basis is already defined in hs and a start point is already defined in x (probably from an earlier call).

Constraint: $start = 'B'$, $'C'$ or $'W'$.

2: $qphx$ – Subroutine, supplied by the NAG Library or the user. External Procedure

For QP problems, you must supply a version of qphx to compute the matrix product $H x$ for a given vector $x$. If $H$ has rows and columns of zeros, it is most efficient to order $x$ so that the nonlinear variables appear first. For example, if $x = (y, z)^{T}$ and only $y$ enters the objective quadratically, then

$$H x = \begin{pmatrix} H_{1} & 0 \\ 0 & 0 \end{pmatrix} \begin{pmatrix} y \\ z \end{pmatrix} = \begin{pmatrix} H_{1} y \\ 0 \end{pmatrix}. \qquad (2)$$

In this case, ncolh should be the dimension of $y$, and qphx should compute $H_{1} y$. For FP and LP problems, qphx will never be called by e04nqf and hence qphx may be the dummy routine e04nsh.

The specification of qphx is:

Fortran Interface

Subroutine qphx (ncolh, x, hx, nstate, cuser, iuser, ruser)

Integer, Intent (In)	::	ncolh, nstate
Integer, Intent (Inout)	::	iuser(*)
Real (Kind=nag_wp), Intent (In)	::	x(ncolh)
Real (Kind=nag_wp), Intent (Inout)	::	ruser(*)
Real (Kind=nag_wp), Intent (Out)	::	hx(ncolh)
Character (8), Intent (InOut)	::	cuser(*)

C Header Interface

void	qphx (const Integer *ncolh, const double x[], double hx[], const Integer *nstate, char cuser[], Integer iuser[], double ruser[], const Charlen length_cuser)

1: $ncolh$ – Integer Input

On entry: this is the same argument ncolh as supplied to e04nqf.

2: $x (ncolh)$ – Real (Kind=nag_wp) array Input

On entry: the first ncolh elements of the vector $x$.

3: $hx (ncolh)$ – Real (Kind=nag_wp) array Output

On exit: the product $H x$. If ncolh is less than the input argument n, $H x$ is really the product $H_{1} y$ in (2).

4: $nstate$ – Integer Input

On entry: allows you to save computation time if certain data must be read or calculated only once. To preserve this data for a subsequent calculation place it in one of cuser, ruser or iuser.

$nstate = 1$

e04nqf is calling qphx for the first time.

$nstate = 0$

There is nothing special about the current call of qphx.

$nstate \geq 2$

e04nqf is calling qphx for the last time. This argument setting allows you to perform some additional computation on the final solution.

$nstate = 2$: The current $x$ is optimal.
$nstate = 3$: The problem appears to be infeasible.
$nstate = 4$: The problem appears to be unbounded.
$nstate = 5$: The iterations limit was reached.

5: $cuser (*)$ – Character(8) array User Workspace

6: $iuser (*)$ – Integer array User Workspace

7: $ruser (*)$ – Real (Kind=nag_wp) array User Workspace

qphx must either be a module subprogram USEd by, or declared as EXTERNAL in, the (sub)program from which e04nqf is called. Arguments denoted as Input must not be changed by this procedure.

Note: qphx should not return floating-point NaN (Not a Number) or infinity values, since these are not handled by e04nqf. If your code inadvertently does return any NaNs or infinities, e04nqf is likely to produce unexpected results.

3: $m$ – Integer Input

On entry: $m$, the number of general linear constraints (or slacks). This is the number of rows in the linear constraint matrix $A$, including the free row (if any; see iobj). Note that $A$ must have at least one row. If your problem has no constraints, or only upper or lower bounds on the variables, then you must include a dummy row with sufficiently wide upper and lower bounds (see also acol, inda and loca).

Constraint: $m \geq 1$.

4: $n$ – Integer Input

On entry: $n$, the number of variables (excluding slacks). This is the number of columns in the linear constraint matrix $A$.

Constraint: $n \geq 1$.

5: $ne$ – Integer Input

On entry: the number of nonzero elements in $A$.

Constraint: $1 \leq ne \leq n \times m$.

6: $nname$ – Integer Input

On entry: the number of column (i.e., variable) and row names supplied in the array names.

$nname = 1$: There are no names. Default names will be used in the printed output.
$nname = n + m$: All names must be supplied.

Constraint: $nname = 1$ or $n + m$.

7: $lenc$ – Integer Input

On entry: the number of elements in the constant objective vector $c$.

If $lenc > 0$, the first lenc elements of $x$ belong to variables corresponding to the constant objective term $c$.

Constraint: $0 \leq lenc \leq n$.

8: $ncolh$ – Integer Input

On entry: $n_{H}$, the number of leading nonzero columns of the Hessian matrix $H$. For FP and LP problems, ncolh must be set to zero.

The first ncolh elements of $x$ belong to variables corresponding to the nonzero block of the QP Hessian.

Constraint: $0 \leq ncolh \leq n$.

9: $iobj$ – Integer Input

On entry: if $iobj > 0$, row iobj of $A$ is a free row containing the nonzero elements of the vector $c$ appearing in the linear objective term $c^{T} x$.

If $iobj = 0$, there is no free row, and the linear objective vector should be supplied in array c.

Constraint: $0 \leq iobj \leq m$.

10: $objadd$ – Real (Kind=nag_wp) Input

On entry: the constant $q$, to be added to the objective for printing purposes. Typically $objadd = 0.0E0$.

11: $prob$ – Character(8) Input

On entry: the name for the problem. It is used in the printed solution and in some routines that output Basis files. A blank name may be used.

12: $acol (ne)$ – Real (Kind=nag_wp) array Input

On entry: the nonzero elements of $A$, ordered by increasing column index. Note that all elements must be assigned a value in the calling program.

13: $inda (ne)$ – Integer array Input

On entry: $inda(i)$ must contain the row index of the nonzero element stored in $acol(i)$, for $i = 1, 2, \dots, ne$. Thus a pair of values $(acol(i), inda(i))$ contains a matrix element and its corresponding row index.

Note that the row indices for a column may be supplied in any order.

Constraint: $1 \leq inda(i) \leq m$, for $i = 1, 2, \dots, ne$.

14: $loca (n + 1)$ – Integer array Input

On entry: $loca(j)$ must contain the index in acol and inda of the start of the $j$th column, for $j = 1, 2, \dots, n$. Thus for $j = 1:n$, the entries of column $j$ are held in $acol(k:l)$ and their corresponding row indices are in $inda(k:l)$, where $k = loca(j)$ and $l = loca(j + 1) - 1$. To specify the $j$th column as empty, set $loca(j) = loca(j + 1)$. Note that the first and last elements of loca must be $loca(1) = 1$ and $loca(n + 1) = ne + 1$. If your problem has no constraints, or just bounds on the variables, you may include a dummy ‘free’ row with a single (zero) element by setting $ne = 1$, $acol(1) = 0.0$, $inda(1) = 1$, $loca(1) = 1$, and $loca(j) = 2$, for $j = 2:n + 1$. This row is made ‘free’ by setting its bounds to be $bl(n + 1) = -bigbnd$ and $bu(n + 1) = bigbnd$, where $bigbnd$ is the value of the optional parameter Infinite Bound Size.

Constraints:

$loca (1) = 1$ ;
$loca (j) \geq 1$ , for $j = 2, 3, \dots, n$ ;
$loca (n + 1) = ne + 1$ ;
$0 \leq loca (j + 1) - loca (j) \leq m$ , for $j = 1, 2, \dots, n$ .
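The storage scheme for acol, inda and loca can be illustrated with a small Python sketch (not NAG code) that converts a dense matrix into the three arrays. The function name `dense_to_columns` is hypothetical, the index values stored in the lists are 1-based to match the Fortran interface, and the sketch does not handle the special case of a matrix with no nonzero elements, for which the dummy free row described above would be needed.

```python
def dense_to_columns(a):
    """Convert a dense m x n matrix (list of rows) to acol, inda, loca.

    Nonzeros are stored column by column; inda holds 1-based row indices
    and loca holds the 1-based position at which each column starts.
    """
    m, n = len(a), len(a[0])
    acol, inda, loca = [], [], [1]       # loca(1) = 1 by definition
    for j in range(n):
        for i in range(m):
            if a[i][j] != 0.0:
                acol.append(a[i][j])
                inda.append(i + 1)       # 1-based row index
        loca.append(len(acol) + 1)       # start of the next column
    return acol, inda, loca


# Hypothetical 2 x 3 matrix whose second column is empty
a = [[1.0, 0.0, 2.0],
     [0.0, 0.0, 3.0]]
acol, inda, loca = dense_to_columns(a)
ne = len(acol)
print(acol, inda, loca, ne)
```

In this example the empty second column appears as two equal consecutive entries of loca, and the final entry equals $ne + 1$.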

15: $bl (n + m)$ – Real (Kind=nag_wp) array Input

On entry: $l$, the lower bounds for all the variables and general constraints, in the following order. The first n elements of bl must contain the bounds on the variables $x$, and the next m elements the bounds for the general linear constraints $A x$ (which, equivalently, are the bounds for the slacks, $s$) and the free row (if any). To fix the $j$th variable, set $bl(j) = bu(j) = \beta$, say, where $|\beta| < bigbnd$. To specify a nonexistent lower bound (i.e., $l_{j} = -\infty$), set $bl(j) \leq -bigbnd$. Here, $bigbnd$ is the value of the optional parameter Infinite Bound Size. To specify the $j$th constraint as an equality, set $bl(n + j) = bu(n + j) = \beta$, say, where $|\beta| < bigbnd$. Note that the lower bound corresponding to the free row must be set to $-\infty$ and stored in $bl(n + iobj)$.

Constraint: if $iobj > 0$, $bl(n + iobj) \leq -bigbnd$.

(See also the description for bu.)

16: $bu (n + m)$ – Real (Kind=nag_wp) array Input

On entry: $u$, the upper bounds for all the variables and general constraints, in the following order. The first n elements of bu must contain the bounds on the variables $x$, and the next m elements the bounds for the general linear constraints $A x$ (which, equivalently, are the bounds for the slacks, $s$) and the free row (if any). To specify a nonexistent upper bound (i.e., $u_{j} = +\infty$), set $bu(j) \geq bigbnd$. Note that the upper bound corresponding to the free row must be set to $+\infty$ and stored in $bu(n + iobj)$.

Constraints:

if $iobj > 0$ , $bu (n + iobj) \geq bigbnd$ ;
otherwise $bl (i) \leq bu (i)$ .
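The layout of bl and bu can be sketched in Python as follows (an illustration only, not NAG code). The helper `build_bounds` and its use of `None` to mark a missing bound are hypothetical, and the constant `BIGBND` is an assumed value standing in for the optional parameter Infinite Bound Size; check Section 12 for the value actually in force.

```python
BIGBND = 1.0e20  # assumed stand-in for the optional parameter Infinite Bound Size


def build_bounds(var_bounds, row_bounds, iobj=0, bigbnd=BIGBND):
    """Assemble bl and bu of length n + m from (lower, upper) pairs.

    var_bounds : n pairs for the variables x
    row_bounds : m pairs for the rows of A x (the slacks s)
    iobj       : 1-based index of the free objective row, or 0 if none
    A bound given as None is treated as nonexistent.
    """
    bl, bu = [], []
    for lower, upper in list(var_bounds) + list(row_bounds):
        bl.append(-bigbnd if lower is None else lower)
        bu.append(bigbnd if upper is None else upper)
    if iobj > 0:
        n = len(var_bounds)
        # The free row must be unbounded in both directions.
        bl[n + iobj - 1] = -bigbnd
        bu[n + iobj - 1] = bigbnd
    return bl, bu


# Hypothetical problem: x1 >= 0, x2 fixed at 2, row 1 >= 5, row 2 is the objective row
bl, bu = build_bounds([(0.0, None), (2.0, 2.0)],
                      [(5.0, None), (0.0, 0.0)],
                      iobj=2)
print(bl, bu)
```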

17: $c (\max (1, lenc))$ – Real (Kind=nag_wp) array Input

Note: the dimension of the array c must be at least $\max(1, lenc)$ if $iobj \neq 0$, and at least $1$ otherwise.

On entry: contains the explicit objective vector $c$ (if any). If the problem is of type FP, or if $lenc = 0$, c is not referenced. (In that case, c may be dimensioned (1), or it could be any convenient array.)

18: $names (nname)$ – Character(8) array Input

On entry: the optional column and row names, respectively.

If $nname = 1$, names is not referenced and the printed output will use default names for the columns and rows.

If $nname = n + m$, the first n elements must contain the names for the columns and the next m elements must contain the names for the rows. Note that the name for the free row (if any) must be stored in $names(n + iobj)$.

19: $helast (n + m)$ – Integer array Input

On entry: defines which variables are to be treated as being elastic in elastic mode. The allowed values of helast are:

$helast (j)$	Status in elastic mode
$0$	Variable $j$ is non-elastic and cannot be infeasible
$1$	Variable $j$ can violate its lower bound
$2$	Variable $j$ can violate its upper bound
$3$	Variable $j$ can violate either its lower or upper bound

helast need not be assigned if optional parameter Elastic Mode $= 0$.

Constraint: if Elastic Mode $\neq 0$, $helast(j) = 0, 1, 2, 3$, for $j = 1, 2, \dots, n + m$.

20: $hs (n + m)$ – Integer array Input/Output

On entry: if $start = 'C'$ or $'B'$, and a Basis file of some sort is to be input (see the description of the optional parameters Old Basis File, Insert File or Load File), then hs and x need not be set at all.

If $start = 'C'$ or $'B'$ and there is no Basis file, the first n elements of hs and x must specify the initial states and values, respectively, of the variables $x$. (The slacks $s$ need not be initialized.) An internal Crash procedure is then used to select an initial basis matrix $B$. The initial basis matrix will be triangular (neglecting certain small elements in each column). It is chosen from various rows and columns of $\begin{pmatrix} A & -I \end{pmatrix}$. Possible values for $hs(j)$ are as follows:

$hs (j)$	State of $x (j)$ during Crash procedure
$0$ or $1$	Eligible for the basis
$2$	Ignored
$3$	Eligible for the basis (given preference over $0$ or $1$ )
$4$ or $5$	Ignored

If nothing special is known about the problem, or there is no wish to provide special information, you may set $hs(j) = 0$ and $x(j) = 0.0$, for $j = 1, 2, \dots, n$. All variables will then be eligible for the initial basis. Less trivially, to say that the $j$th variable will probably be equal to one of its bounds, set $hs(j) = 4$ and $x(j) = bl(j)$ or $hs(j) = 5$ and $x(j) = bu(j)$ as appropriate.

Following the Crash procedure, variables for which $hs(j) = 2$ are made superbasic. Other variables not selected for the basis are then made nonbasic at the value $x(j)$ if $bl(j) \leq x(j) \leq bu(j)$, or at the value $bl(j)$ or $bu(j)$ closest to $x(j)$.

If $start = 'W'$, hs and x must specify the initial states and values, respectively, of the variables and slacks $(x, s)$. If e04nqf has been called previously with the same values of n and m, hs already contains satisfactory information.

Constraints:

if $start ='C'$ or $'B'$ , $0 \leq hs (j) \leq 5$ , for $j = 1, 2, \dots, n$ ;
if $start ='W'$ , $0 \leq hs (j) \leq 3$ , for $j = 1, 2, \dots, n + m$ .
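
The rule for placing variables that Crash leaves out of the basis amounts to clamping each value into its bound interval. The following Python sketch illustrates that rule only; the function name `nonbasic_value` is hypothetical and is not part of the NAG Library.

```python
def nonbasic_value(xj, blj, buj):
    """Value at which a variable left out of the basis is made nonbasic.

    Keeps xj when bl(j) <= x(j) <= bu(j); otherwise returns whichever of
    bl(j) or bu(j) is closest to xj. Illustrative only.
    """
    if blj <= xj <= buj:
        return xj
    return blj if xj < blj else buj


print(nonbasic_value(5.0, 0.0, 10.0))   # inside the bounds: kept
print(nonbasic_value(-3.0, 0.0, 10.0))  # below bl(j): moved to bl(j)
print(nonbasic_value(12.0, 0.0, 10.0))  # above bu(j): moved to bu(j)
```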

On exit: the final states of the variables and slacks

(x, s)

. The significance of each possible value of

hs (j)

is as follows:

$hs (j)$	State of variable $j$	Normal value of $x (j)$
$0$	Nonbasic	$bl (j)$
$1$	Nonbasic	$bu (j)$
$2$	Superbasic	Between $bl (j)$ and $bu (j)$
$3$	Basic	Between $bl (j)$ and $bu (j)$

If $ninf = 0$, basic and superbasic variables may be outside their bounds by as much as the value of the optional parameter Feasibility Tolerance. Note that unless the optional parameter $Scale Option = 0$ is specified, the optional parameter Feasibility Tolerance applies to the variables of the scaled problem. In this case, the variables of the original problem may be as much as $0.1$ outside their bounds, but this is unlikely unless the problem is very badly scaled.

Very occasionally some nonbasic variables may be outside their bounds by as much as the optional parameter Feasibility Tolerance, and there may be some nonbasic variables for which $x (j)$ lies strictly between its bounds.

If $ninf > 0$, some basic and superbasic variables may be outside their bounds by an arbitrary amount (bounded by sinf if $Scale Option = 0$).

21: $x (n + m)$ – Real (Kind=nag_wp) array Input/Output

On entry: the initial values of the variables $x$, and, if $start ='W'$, the slacks $s$, i.e., $(x, s)$. (See the description for argument hs.)

On exit: the final values of the variables and slacks $(x, s)$.

22: $pi (m)$ – Real (Kind=nag_wp) array Output

On exit: contains the dual variables

π

(a set of Lagrange multipliers (shadow prices) for the general constraints).

23: $rc (n + m)$ – Real (Kind=nag_wp) array Output

On exit: contains the reduced costs, $g - {(\begin{matrix} A & - I \end{matrix})}^{T} π$. The vector $g$ is the gradient of the objective if x is feasible; otherwise, it is the gradient of the Phase 1 objective. In the former case, $g (i) = 0$, for $i = n + 1 : n + m$, hence $rc (n + 1 : n + m) = π$.
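
As an illustration of the reduced-cost formula, the following Python sketch evaluates $g - {(\begin{matrix} A & - I \end{matrix})}^{T} π$ for a small dense matrix. The function name `reduced_costs` and the data are hypothetical; the routine itself works with the sparse arrays acol, inda and loca. The slack part of the result reproduces $π$ whenever the slack components of $g$ are zero.

```python
def reduced_costs(A, g, pi):
    """Return rc = g - (A  -I)^T pi as a list of length n + m.

    A  : m-by-n constraint matrix as a list of rows (dense, for clarity)
    g  : gradient of length n + m (slack components last)
    pi : dual variables of length m
    """
    m, n = len(A), len(A[0])
    rc = []
    for j in range(n):                      # columns of A
        rc.append(g[j] - sum(A[i][j] * pi[i] for i in range(m)))
    for i in range(m):                      # columns of -I
        rc.append(g[n + i] + pi[i])
    return rc


A = [[1.0, 2.0, 0.0],
     [0.0, 1.0, 3.0]]
g = [4.0, 1.0, -2.0, 0.0, 0.0]   # slack part of g is zero at a feasible point
pi = [0.5, -1.0]
rc = reduced_costs(A, g, pi)
print(rc[:3])   # g - A^T pi for the n variables
print(rc[3:])   # equals pi for the m slacks
```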

24: $ns$ – Integer Input/Output

On entry:

n_{S}

, the number of superbasics. For QP problems, ns need not be specified if

start ='C'

, but must retain its value from a previous call when

start ='W'

. For FP and LP problems, ns need not be initialized.

On exit: the final number of superbasics. This will be zero for FP and LP problems.

25: $ninf$ – Integer Output

On exit: the number of infeasibilities.

26: $sinf$ – Real (Kind=nag_wp) Output

On exit: the sum of the scaled infeasibilities. This will be zero if $ninf = 0$, and is most meaningful when $Scale Option = 0$.

27: $obj$ – Real (Kind=nag_wp) Output

On exit: the value of the objective function.

If $ninf = 0$, obj includes the quadratic objective term $\frac{1}{2} x^{T} H x$ (if any).

If $ninf > 0$, obj is just the linear objective term $c^{T} x$ (if any).

For FP problems, obj is set to zero.

Note that obj does not include contributions from the constant term objadd or the objective row, if any.

28: $cw (lencw)$ – Character(8) array Communication Array

29: $lencw$ – Integer Input

On entry: the dimension of the array cw as declared in the (sub)program from which e04nqf is called.

Constraint:

lencw \geq 600

30: $iw (leniw)$ – Integer array Communication Array

31: $leniw$ – Integer Input

On entry: the dimension of the array iw as declared in the (sub)program from which e04nqf is called.

Constraint:

leniw \geq 600

32: $rw (lenrw)$ – Real (Kind=nag_wp) array Communication Array

33: $lenrw$ – Integer Input

On entry: the dimension of the array rw as declared in the (sub)program from which e04nqf is called.

Constraint:

lenrw \geq 600

34: $cuser (*)$ – Character(8) array User Workspace

cuser is not used by e04nqf, but is passed directly to qphx and may be used to pass information to this routine.

35: $iuser (*)$ – Integer array User Workspace

iuser is not used by e04nqf, but is passed directly to qphx and may be used to pass information to this routine.

36: $ruser (*)$ – Real (Kind=nag_wp) array User Workspace

ruser is not used by e04nqf, but is passed directly to qphx and may be used to pass information to this routine.

37: $ifail$ – Integer Input/Output

On entry: ifail must be set to $0$, $- 1$ or $1$ to set behaviour on detection of an error; these values have no effect when no error is detected.

A value of $0$ causes the printing of an error message and program execution will be halted; otherwise program execution continues. A value of $- 1$ means that an error message is printed while a value of $1$ means that it is not.

If halting is not appropriate, the value $- 1$ or $1$ is recommended. If message printing is undesirable, then the value $1$ is recommended. Otherwise, the value $- 1$ is recommended since useful values can be provided in some output arguments even when $ifail \neq 0$ on exit. When the value $- 1$ or $1$ is used it is essential to test the value of ifail on exit.

On exit:

ifail = 0

unless the routine detects an error or a warning has been flagged (see Section 6).

e04nqf returns with

ifail = 0

if the reduced gradient (rgNorm; see Section 9.1) is negligible, the Lagrange multipliers (Lagr Mult; see Section 9.1) are optimal,

x

satisfies the constraints to the accuracy requested by the value of the optional parameter Feasibility Tolerance and the reduced Hessian factor

R

(see Section 11.2) is nonsingular.

6 Error Indicators and Warnings

If on entry $ifail = 0$ or $- 1$, explanatory error messages are output on the current error message unit (as defined by x04aaf).

Errors or warnings detected by the routine:

Note: in some cases e04nqf may return useful information.

$ifail = 1$: On entry, $lencw = ⟨ value ⟩$ .
Constraint: $lencw \geq 600$ .

On entry, $leniw = ⟨ value ⟩$ .
Constraint: $leniw \geq 600$ .

On entry, $lenrw = ⟨ value ⟩$ .
Constraint: $lenrw \geq 600$ .

The initialization routine e04npf has not been called.

$ifail = 2$: An error has occurred in the basis package, perhaps indicating incorrect setup of arrays inda and loca. Set the optional parameter Print File and examine the output carefully for further information.

Basis file dimensions do not match this problem.

On entry, bounds bl and bu for $⟨ value ⟩$ are equal and infinite: $bl = bu = ⟨ value ⟩$ and $infbnd = ⟨ value ⟩$ .

On entry, bounds for $⟨ value ⟩$ are inconsistent. $bl = ⟨ value ⟩$ and $bu = ⟨ value ⟩$ .

On entry, $iobj = ⟨ value ⟩$ and $m = ⟨ value ⟩$ .
Constraint: $0 \leq iobj \leq m$ .

On entry, $lenc = ⟨ value ⟩$ and $n = ⟨ value ⟩$ .
Constraint: $0 \leq lenc \leq n$ .

On entry, $loca (1) = ⟨ value ⟩$ , $loca (⟨ value ⟩) = ⟨ value ⟩$ , $ne = ⟨ value ⟩$ .
Constraint: $loca (1) = 1$ or $loca (⟨ value ⟩) = ne + 1$ .

On entry, $m = ⟨ value ⟩$ .
Constraint: $m \geq ⟨ value ⟩$ .

On entry, $n = ⟨ value ⟩$ .
Constraint: $n \geq ⟨ value ⟩$ .

On entry, $ncolh = ⟨ value ⟩$ and $n = ⟨ value ⟩$ .
Constraint: $0 \leq ncolh \leq n$ .

On entry, $ne = ⟨ value ⟩$ , $n = ⟨ value ⟩$ and $m = ⟨ value ⟩$ .
Constraint: $1 \leq ne \leq n \times m$ .

On entry, ne is not equal to the number of nonzeros in acol. $ne = ⟨ value ⟩$ , nonzeros in $acol = ⟨ value ⟩$ .

On entry, $nname = ⟨ value ⟩$ , $n = ⟨ value ⟩$ and $m = ⟨ value ⟩$ .
Constraint: $nname = 1$ or $n + m$ .

On entry, row index $⟨ value ⟩$ in $inda (⟨ value ⟩)$ is outside the range $1$ to $m = ⟨ value ⟩$ .

On entry, $start = ⟨ value ⟩$ .
Constraint: $start ='B'$ , $'C'$ or $'W'$ .

$ifail = 3$: The requested accuracy could not be achieved.

$ifail = 4$: Weak solution found – the solution is not unique.

$ifail = 5$: The linear constraints appear to be infeasible.

The problem appears to be infeasible. Infeasibilities have been minimized.

The problem appears to be infeasible. Nonlinear infeasibilities have been minimized.

The problem appears to be infeasible. The linear equality constraints could not be satisfied.
The problem is infeasible. The general constraints cannot all be satisfied simultaneously to within the value of the optional parameter Feasibility Tolerance.
Feasibility is measured with respect to the upper and lower bounds on the variables and slacks. The message tells us that among all the points satisfying the general constraints $A x - s = 0$ , there is apparently no point that satisfies the bounds on $x$ and $s$ . Violations as small as the Feasibility Tolerance are ignored, but at least one component of $x$ or $s$ violates a bound by more than the tolerance.
Note: although the objective function is the sum of infeasibilities (when $ninf > 0$ ), this sum will not necessarily have been minimized when $Elastic Mode = 1$ .
If $Elastic Mode \neq 0$ , e04nqf will optimize the QP objective and the sum of infeasibilities, suitably weighted using the optional parameter Elastic Weight. The routine will tend to determine a ‘good’ infeasible point if the elastic weight is sufficiently large.

$ifail = 6$: The problem appears to be unbounded. The constraint violation limit has been reached.

The problem appears to be unbounded. The objective function is unbounded.
The problem is unbounded (or badly scaled). For a minimization problem, the objective function is not bounded below in the feasible region.
For linear problems, unboundedness is detected by the simplex method when a nonbasic variable can be increased or decreased by an arbitrary amount without causing a basic variable to violate a bound. Consider adding an upper or lower bound to the variable. Also, examine the constraints that have nonzeros in the associated column, to see if they have been formulated as intended.
Very rarely, the scaling of the problem could be so poor that numerical error will give an erroneous indication of unboundedness. Consider using the optional parameter Scale Option.

$ifail = 7$: Iteration limit reached.

Major iteration limit reached.
Too many iterations. The value of the optional parameter Iterations Limit is too small.
The Iterations Limit was exceeded before the required solution could be found. Check the iteration log to be sure that progress was being made. If so, restart the run using a Basis file that was saved at the end of the run.

$ifail = 8$: The value of the optional parameter Superbasics Limit is too small.
The current set of basic and superbasic variables have been optimized as much as possible and a pricing operation is necessary to continue, but there are already Superbasics Limit superbasics (and no room for any more).
In general, raise the Superbasics Limit $s$ by a reasonable amount, bearing in mind the storage needed for the reduced Hessian (see Section 11.2). (The Reduced Hessian Dimension $h$ will also increase to $s$ unless specified otherwise, and the associated storage will be about $\frac{1}{2} s^{2}$ words.) In some cases you may have to set $h < s$ to conserve storage, but beware that the rate of convergence will probably fall off severely.

$ifail = 9$: The basis is singular after several attempts to factorize it (and add slacks where necessary).
Either the problem is badly scaled or the value of the optional parameter LU Factor Tolerance is too large.

$ifail = 10$: Numerical difficulties have been encountered and no further progress can be made.
Numerical error in trying to satisfy the general constraints. The basis is very ill-conditioned.
An $L U$ factorization of the basis has just been obtained and used to recompute the basic variables $x_{B}$ , given the present values of the superbasic and nonbasic variables. However, a row check has revealed that the resulting solution does not satisfy the current constraints $A x - s = 0$ sufficiently well.
This probably means that the current basis is very ill-conditioned. Request the Scale Option if there are any linear constraints and variables.
For certain highly structured basis matrices (notably those with band structure), a systematic growth may occur in the factor $U$ . Consult the description of Umax, Umin and Growth in Section 13, and set the optional parameter LU Factor Tolerance to $2.0$ (or possibly even smaller, but not less than $1.0$ ).

$ifail = 11$: Error in qphx: the QP Hessian is indefinite.
An indefinite matrix was detected during the computation of the reduced Hessian factor $R$ (see Section 11.2). This may be caused by $H$ being indefinite. Check also that qphx has been coded correctly and that all relevant elements of $H x$ have been assigned their correct values. If qphx is coded correctly and $H$ is positive semidefinite, the failure may be caused by ill conditioning. Try reducing the values of the optional parameters LU Factor Tolerance and LU Update Tolerance. If there are very large values in $H$ , check the scaling of the variables and constraints.

$ifail = 12$: Internal memory allocation failed when attempting to obtain workspace sizes $⟨ value ⟩$ , $⟨ value ⟩$ and $⟨ value ⟩$ . Please contact NAG.

$ifail = 13$: Internal memory allocation was insufficient. Please contact NAG.

$ifail = 14$: An error has occurred in the basis package, perhaps indicating incorrect setup of arrays inda and loca. Set the optional parameter Print File and examine the output carefully for further information.

$ifail = 15$: An unexpected error has occurred. Set the optional parameter Print File and examine the output carefully for further information.

$ifail = - 99$: An unexpected error has been triggered by this routine. Please contact NAG.
See Section 7 in the Introduction to the NAG Library FL Interface for further information.

$ifail = - 399$: Your licence key may have expired or may not have been installed correctly.
See Section 8 in the Introduction to the NAG Library FL Interface for further information.

$ifail = - 999$: Dynamic memory allocation failed.
See Section 9 in the Introduction to the NAG Library FL Interface for further information.
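
Because ifail must be tested on exit whenever $- 1$ or $1$ is used on entry, calling code usually branches on the returned value. The following Python sketch is a lookup table that paraphrases the headings of this section; the function name `describe_ifail` and the wording of the summaries are illustrative and are not produced by the library.

```python
def describe_ifail(ifail):
    """Map an exit value of ifail to a short paraphrase of Section 6."""
    summaries = {
        0: "no error or warning",
        1: "workspace length below 600 or e04npf not called",
        2: "invalid input data or basis package error",
        3: "requested accuracy could not be achieved",
        4: "weak solution found (solution not unique)",
        5: "problem appears to be infeasible",
        6: "problem appears to be unbounded",
        7: "iteration limit reached",
        8: "Superbasics Limit too small",
        9: "basis is singular",
        10: "numerical difficulties, no further progress",
        11: "QP Hessian is indefinite",
        12: "internal memory allocation failed",
        13: "internal memory allocation insufficient",
        14: "error in the basis package",
        15: "unexpected error",
        -99: "unexpected error, contact NAG",
        -399: "licence key problem",
        -999: "dynamic memory allocation failed",
    }
    return summaries.get(ifail, "value not listed in Section 6")


for code in (0, 5, 6, 42):
    print(code, describe_ifail(code))
```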

7 Accuracy

e04nqf implements a numerically stable active-set strategy and returns solutions that are as accurate as the condition of the problem warrants on the machine.

8 Parallelism and Performance

e04nqf makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.

Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the Users' Note for your implementation for any additional implementation-specific information.

9 Further Comments

This section contains a description of the printed output.

9.1 Description of the Printed Output

If $Print Level > 0$, one line of information is output to the Print File every $k$th iteration, where $k$ is the specified Print Frequency. A heading is printed before the first such line following a basis factorization. The heading contains the items described below. In this description, a pricing operation is defined to be the process by which one or more nonbasic variables are selected to become superbasic (in addition to those already in the superbasic set). The variable selected will be denoted by jq. If the problem is purely linear, variable jq will usually become basic immediately (unless it should happen to reach its opposite bound and return to the nonbasic set).

If optional parameter Partial Price is in effect, variable jq is selected from $A_{pp}$ or $I_{pp}$, the ppth segments of the constraint matrix $(\begin{matrix} A & - I \end{matrix})$.

Label	Description
Itn	is the iteration count.
pp	is the partial-price indicator. The variable selected by the last pricing operation came from the ppth partition of $A$ and $- I$ . Note that pp is reset to zero whenever the basis is refactorized.
dj	is the value of the reduced gradient (or reduced cost) for the variable selected by the pricing operation at the start of the current iteration. Algebraically, dj is $d_{j} = g_{j} - π^{T} a_{j}$ , for $j = jq$ , where $g_{j}$ is the gradient of the current objective function, $π$ is the vector of dual variables, and $a_{j}$ is the $j$ th column of the constraint matrix $(\begin{matrix} A & - I \end{matrix})$ . Note that dj is the norm of the reduced-gradient vector at the start of the iteration, just after the pricing operation.
+SBS	is the variable jq selected by the pricing operation to be added to the superbasic set.
-SBS	is the variable chosen to leave the superbasic set. It has become basic if the entry under -BS is nonzero, otherwise it becomes nonbasic.
-BS	is the variable removed from the basis to become nonbasic.
Step	is the value of the step length $α$ taken along the current search direction $p$ . The variables $x$ have just been changed to $x + α p$ . If a variable is made superbasic during the current iteration (i.e., +SBS is positive), Step will be the step to the nearest bound. During the optimality phase, the step can be greater than unity only if the reduced Hessian is not positive definite.
Pivot	is the $r$ th element of a vector $y$ satisfying $B y = a_{q}$ whenever $a_{q}$ (the $q$ th column of the constraint matrix $(\begin{matrix} A & - I \end{matrix})$ ) replaces the $r$ th column of the basis matrix $B$ . Wherever possible, Step is chosen so as to avoid extremely small values of Pivot (since they may cause the basis to be nearly singular). In extreme cases, it may be necessary to increase the value of the optional parameter Pivot Tolerance to exclude very small elements of $y$ from consideration during the computation of Step.
nInf	is the number of violated constraints (infeasibilities) before the present iteration. This number will not increase unless iterations are in elastic mode.
sInf	is the sum of infeasibilities before the present iteration. It will usually decrease at each nonzero step, but if nInf decreases by $2$ or more, sInf may occasionally increase. However, in elastic mode it will decrease monotonically.
Objective	is the value of the current objective function after the present iteration. Note, if Elastic Mode is $2$ , the heading is Composite Obj.
L+U	L is the number of nonzeros in the basis factor $L$ . Immediately after a basis factorization $B = L U$ , L contains lenL (see Section 13). Further nonzeros are added to L when various columns of $B$ are later replaced. (Thus, L increases monotonically.) U is the number of nonzeros in the basis factor $U$ . Immediately after a basis factorization $B = L U$ , U contains lenU (see Section 13). As columns of $B$ are replaced, the matrix $U$ is maintained explicitly (in sparse form). The value of U may fluctuate up or down; in general, it will tend to increase.
ncp	is the number of compressions required to recover workspace in the data structure for $U$ . This includes the number of compressions needed during the previous basis factorization. Normally, ncp should increase very slowly.

The following will be output if the problem is QP or if the superbasic set is non-empty.

Label	Description
rgNorm	is the largest reduced-gradient among the superbasic variables after the current iteration. During the optimality phase, this will be approximately zero after a unit step.
nS	is the current number of superbasic variables.
condHz	is a lower bound on the condition number of the reduced Hessian (see Section 11.2). The larger this number, the more difficult the problem. Attention should be given to the scaling of the variables and the constraints to guard against high values of condHz.

10 Example

This example minimizes the quadratic function $f (x) = c^{T} x + \frac{1}{2} x^{T} H x$, where

c = {(- 200.0, - 2000.0, - 2000.0, - 2000.0, - 2000.0, 400.0, 400.0)}^{T}

and

H = (\begin{matrix} 2 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 2 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 2 & 2 & 0 & 0 & 0 \\ 0 & 0 & 2 & 2 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 2 & 2 \\ 0 & 0 & 0 & 0 & 0 & 2 & 2 \end{matrix})

subject to the bounds

\begin{matrix} 0 \leq x_{1} \leq 200 \\ 0 \leq x_{2} \leq 2500 \\ 400 \leq x_{3} \leq 800 \\ 100 \leq x_{4} \leq 700 \\ 0 \leq x_{5} \leq 1500 \\ 0 \leq x_{6} \\ 0 \leq x_{7} \end{matrix}

and to the linear constraints

\begin{matrix} x_{1} + x_{2} + x_{3} + x_{4} + x_{5} + x_{6} + x_{7} = 2000 \\ 0.15 x_{1} + 0.04 x_{2} + 0.02 x_{3} + 0.04 x_{4} + 0.02 x_{5} + 0.01 x_{6} + 0.03 x_{7} \leq 60 \\ 0.03 x_{1} + 0.05 x_{2} + 0.08 x_{3} + 0.02 x_{4} + 0.06 x_{5} + 0.01 x_{6} \leq 100 \\ 0.02 x_{1} + 0.04 x_{2} + 0.01 x_{3} + 0.02 x_{4} + 0.02 x_{5} \leq 40 \\ 0.02 x_{1} + 0.03 x_{2} + 0.01 x_{5} \leq 30 \\ 1500 \leq 0.70 x_{1} + 0.75 x_{2} + 0.80 x_{3} + 0.75 x_{4} + 0.80 x_{5} + 0.97 x_{6} \\ 250 \leq 0.02 x_{1} + 0.06 x_{2} + 0.08 x_{3} + 0.12 x_{4} + 0.02 x_{5} + 0.01 x_{6} + 0.97 x_{7} \leq 300 . \end{matrix}

The initial point, which is infeasible, is

x_{0} = {(0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0)}^{T} .

The optimal solution (to five figures) is

x^{*} = {(0.0, 349.40, 648.85, 172.85, 407.52, 271.36, 150.02)}^{T} .

One bound constraint and four linear constraints are active at the solution. Note that the Hessian matrix $H$ is positive semidefinite.
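
The data of this example can be checked independently of the library. The following Python sketch uses plain lists and makes no NAG calls; the helper `count_active` and the tolerance `tol` are assumptions, with `tol` chosen to absorb the five-figure rounding of $x^{*}$. It evaluates the bounds and linear constraints at $x^{*}$ and counts those that are active.

```python
INF = float("inf")

# Rows of A and their bounds lo <= A x <= hi, transcribed from the example.
A = [
    [1.00, 1.00, 1.00, 1.00, 1.00, 1.00, 1.00],
    [0.15, 0.04, 0.02, 0.04, 0.02, 0.01, 0.03],
    [0.03, 0.05, 0.08, 0.02, 0.06, 0.01, 0.00],
    [0.02, 0.04, 0.01, 0.02, 0.02, 0.00, 0.00],
    [0.02, 0.03, 0.00, 0.00, 0.01, 0.00, 0.00],
    [0.70, 0.75, 0.80, 0.75, 0.80, 0.97, 0.00],
    [0.02, 0.06, 0.08, 0.12, 0.02, 0.01, 0.97],
]
row_lo = [2000.0, -INF, -INF, -INF, -INF, 1500.0, 250.0]
row_hi = [2000.0, 60.0, 100.0, 40.0, 30.0, INF, 300.0]
x_lo = [0.0, 0.0, 400.0, 100.0, 0.0, 0.0, 0.0]
x_hi = [200.0, 2500.0, 800.0, 700.0, 1500.0, INF, INF]
x_star = [0.0, 349.40, 648.85, 172.85, 407.52, 271.36, 150.02]


def count_active(values, lo, hi, tol):
    """Check lo - tol <= values <= hi + tol; return (feasible, number active)."""
    feasible = all(l - tol <= v <= h + tol for v, l, h in zip(values, lo, hi))
    active = sum(1 for v, l, h in zip(values, lo, hi)
                 if abs(v - l) <= tol or abs(v - h) <= tol)
    return feasible, active


tol = 0.05  # assumed tolerance; x_star is quoted to five figures only
Ax = [sum(a * xj for a, xj in zip(row, x_star)) for row in A]
rows_ok, rows_active = count_active(Ax, row_lo, row_hi, tol)
bnds_ok, bnds_active = count_active(x_star, x_lo, x_hi, tol)
print("linear constraints feasible:", rows_ok, "active:", rows_active)
print("bounds feasible:", bnds_ok, "active:", bnds_active)
```

With this tolerance the sketch reports four active linear constraints (rows 1, 3, 6 and 7) and one active bound ($x_{1} = 0$), in agreement with the statement above.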

Note: the remainder of this document is intended for more advanced users. Section 11 contains a detailed description of the algorithm which may be needed in order to understand Sections 12 and 13. Section 12 describes the optional parameters which may be set by calls to e04nrf, e04nsf, e04ntf and/or e04nuf. Section 13 describes the quantities which can be requested to monitor the course of the computation.

11 Algorithmic Details

This section contains a detailed description of the method used by e04nqf.

11.1 Overview

e04nqf is based on an inertia-controlling method that maintains a Cholesky factorization of the reduced Hessian (see below). The method is similar to that of Gill and Murray (1978), and is described in detail by Gill et al. (1991). Here we briefly summarise the main features of the method. Where possible, explicit reference is made to the names of variables that are arguments of the routine or appear in the printed output.

The method used has two distinct phases: finding an initial feasible point by minimizing the sum of infeasibilities (the feasibility phase), and minimizing the quadratic objective function within the feasible region (the optimality phase). The computations in both phases are performed by the same subroutines. The two-phase nature of the algorithm is reflected by changing the function being minimized from the sum of infeasibilities (the printed quantity sInf; see Section 9.1) to the quadratic objective function (the printed quantity Objective; see Section 9.1).

In general, an iterative process is required to solve a quadratic program. Given an iterate

(x, s)

in both the original variables

x

and the slack variables

s

, a new iterate

(\bar{x}, \bar{s})

is defined by

(\begin{matrix} \bar{x} \\ \bar{s} \end{matrix}) = (\begin{matrix} x \\ s \end{matrix}) + α p,

(3)

where the step length

α

is a non-negative scalar (the printed quantity Step; see Section 9.1), and

p

is called the search direction. (For simplicity, we shall consider a typical iteration and avoid reference to the index of the iteration.) Once an iterate is feasible (i.e., satisfies the constraints), all subsequent iterates remain feasible.

11.2 Definition of the Working Set and Search Direction

At each iterate

(x, s)

, a working set of constraints is defined to be a linearly independent subset of the constraints that are satisfied ‘exactly’ (to within the value of the optional parameter Feasibility Tolerance). The working set is the current prediction of the constraints that hold with equality at a solution of the LP or QP problem. Let

m_{W}

denote the number of constraints in the working set (including bounds), and let

W

denote the associated

m_{W} \times (n + m)

working set matrix consisting of the

m_{W}

gradients of the working set constraints.

The search direction is defined so that constraints in the working set remain unaltered for any value of the step length. It follows that

p

must satisfy the identity

W p = 0 .

(4)

This characterisation allows $p$ to be computed using any $(n + m) \times n_{Z}$ full-rank matrix $Z$ that spans the null space of $W$. (Thus, $n_{Z} = (n + m) - m_{W}$ and $W Z = 0$.) The null space matrix $Z$ is defined from a sparse $L U$ factorization of part of $W$ (see (7) and (8)). The direction $p$ will satisfy (4) if

p = Z p_{Z},

(5)

where

p_{Z}

is any

n_{Z}

-vector.

The working set contains the constraints $A x - s = 0$ and a subset of the upper and lower bounds on the variables $(x, s)$. Since the gradient of a bound constraint $x_{j} \geq l_{j}$ or $x_{j} \leq u_{j}$ is a vector of all zeros except for $\pm 1$ in position $j$, it follows that the working set matrix contains the rows of $(\begin{matrix} A & - I \end{matrix})$ and the unit rows associated with the upper and lower bounds in the working set.

The working set matrix

W

can be represented in terms of a certain column partition of the matrix

(\begin{matrix} A & - I \end{matrix})

by (conceptually) partitioning the constraints

A x - s = 0

so that

B x_{B} + S x_{S} + N x_{N} = 0,

(6)

where $B$ is a square nonsingular basis and $x_{B}$, $x_{S}$ and $x_{N}$ are the basic, superbasic and nonbasic variables respectively. The nonbasic variables are equal to their upper or lower bounds at

(x, s)

, and the superbasic variables are independent variables that are chosen to improve the value of the current objective function. The number of superbasic variables is

n_{S}

(the printed quantity nS; see Section 9.1). Given values of

x_{N}

and

x_{S}

, the basic variables

x_{B}

are adjusted so that

(x, s)

satisfies (6).

If $P$ is a permutation matrix such that $(\begin{matrix} A & - I \end{matrix}) P = (\begin{matrix} B & S & N \end{matrix})$, then $W$ satisfies

W P = (\begin{matrix} B & S & N \\ 0 & 0 & I_{N} \end{matrix}),

(7)

where $I_{N}$ is the identity matrix with the same number of columns as $N$.

The null space matrix

Z

is defined from a sparse

L U

factorization of part of

W

. In particular,

Z

is maintained in ‘reduced gradient’ form, using the LUSOL package (see Gill et al. (1991)) to maintain sparse

L U

factors of the basis matrix

B

as the

B S N

partition changes. Given the permutation

P

, the null space basis is given by

Z = P (\begin{matrix} - B^{- 1} S \\ I \\ 0 \end{matrix}) .

(8)

This matrix is used only as an operator, i.e., it is never computed explicitly. Products of the form $Z v$ and $Z^{T} g$ are obtained by solving with $B$ or $B^{T}$. This choice of $Z$ implies that $n_{Z}$, the number of ‘degrees of freedom’ at $(x, s)$, is the same as $n_{S}$, the number of superbasic variables.
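
The operator form of $Z$ in (8) can be illustrated with a small dense example. The Python sketch below assumes $P = I$, uses exact rational arithmetic and a dense Gaussian elimination in place of the sparse $L U$ factors, and introduces the hypothetical helpers `solve` and `apply_Z`. It forms $Z v$ by a solve with $B$ and checks that the result is annihilated by the rows $(\begin{matrix} B & S & N \end{matrix})$ while its nonbasic part is zero, so that $W Z v = 0$.

```python
from fractions import Fraction


def solve(B, rhs):
    """Solve B y = rhs by dense Gaussian elimination with partial pivoting."""
    n = len(B)
    M = [[Fraction(v) for v in row] + [Fraction(r)] for row, r in zip(B, rhs)]
    for k in range(n):
        piv = max(range(k, n), key=lambda i: abs(M[i][k]))
        M[k], M[piv] = M[piv], M[k]
        for i in range(k + 1, n):
            f = M[i][k] / M[k][k]
            M[i] = [a - f * b for a, b in zip(M[i], M[k])]
    y = [Fraction(0)] * n
    for k in reversed(range(n)):
        y[k] = (M[k][n] - sum(M[k][j] * y[j] for j in range(k + 1, n))) / M[k][k]
    return y


def apply_Z(B, S, v, n_nonbasic):
    """Return Z v = (-B^{-1} S v, v, 0) without forming Z (P = identity)."""
    Sv = [sum(Fraction(s) * vj for s, vj in zip(row, v)) for row in S]
    x_basic = [-y for y in solve(B, Sv)]
    return x_basic + [Fraction(vj) for vj in v] + [Fraction(0)] * n_nonbasic


B = [[2, 1], [1, 3]]     # basic columns
S = [[1], [2]]           # one superbasic column
N = [[1, 0], [0, -1]]    # two nonbasic columns
p = apply_Z(B, S, [1], n_nonbasic=2)

# Rows of (B S N) applied to p; together with the zero nonbasic part this is W p.
BSN = [rb + rs + rn for rb, rs, rn in zip(B, S, N)]
residual = [sum(a * pj for a, pj in zip(row, p)) for row in BSN]
print(p)
print(residual)
```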

Let

g_{Z}

and

H_{Z}

denote the reduced gradient and reduced Hessian of the objective function:

g_{Z} = Z^{T} g and H_{Z} = Z^{T} H Z,

(9)

where

g

is the objective gradient at

(x, s)

. Roughly speaking,

g_{Z}

and

H_{Z}

describe the first and second derivatives of an

n_{S}

-dimensional unconstrained problem for the calculation of

p_{Z}

. (The condition estimator of

H_{Z}

is the quantity condHz in the monitoring file output; see Section 9.1.)

At each iteration, an upper triangular factor

R

is available such that

H_{Z} = R^{T} R

. Normally,

R

is computed from

R^{T} R = Z^{T} H Z

at the start of the optimality phase and then updated as the QP working set changes. For efficiency, the dimension of

R

should not be excessive (say,

n_{S} \leq 1000

). This is guaranteed if the number of nonlinear variables is ‘moderate’.

If the QP problem contains linear variables,

H

is positive semidefinite and

R

may be singular with at least one zero diagonal element. However, an inertia-controlling strategy is used to ensure that only the last diagonal element of

R

can be zero. (See Gill et al. (1991) for a discussion of a similar strategy for indefinite quadratic programming.)

If the initial

R

is singular, enough variables are fixed at their current value to give a nonsingular

R

. This is equivalent to including temporary bound constraints in the working set. Thereafter,

R

can become singular only when a constraint is deleted from the working set (in which case no further constraints are deleted until

R

becomes nonsingular).

11.3 Main Iteration

If the reduced gradient is zero,

(x, s)

is a constrained stationary point on the working set. During the feasibility phase, the reduced gradient will usually be zero only at a vertex (although it may be zero elsewhere in the presence of constraint dependencies). During the optimality phase, a zero reduced gradient implies that

x

minimizes the quadratic objective function when the constraints in the working set are treated as equalities. At a constrained stationary point, Lagrange multipliers

λ

are defined from the equations

W^{T} λ = g (x) .

(10)

A Lagrange multiplier, $λ_{j}$, corresponding to an inequality constraint in the working set is said to be optimal if $λ_{j} \leq σ$ when the associated constraint is at its upper bound, or if $λ_{j} \geq - σ$ when the associated constraint is at its lower bound, where $σ$ depends on the value of the optional parameter Optimality Tolerance. If a multiplier is nonoptimal, the objective function (either the true objective or the sum of infeasibilities) can be reduced by continuing the minimization with the corresponding constraint excluded from the working set. (This step is sometimes referred to as ‘deleting’ a constraint from the working set.) If optimal multipliers occur during the feasibility phase but the sum of infeasibilities is nonzero, there is no feasible point and the routine terminates immediately with $ifail = 5$ (see Section 6).
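
The sign test on the multipliers can be written down directly. In the Python sketch below, the function name `multiplier_is_optimal` is hypothetical, and `sigma` stands for the tolerance $σ$, whose precise dependence on Optimality Tolerance is not reproduced here.

```python
def multiplier_is_optimal(lam, at_upper, sigma):
    """Optimality test for the multiplier of an inequality in the working set.

    lam      : Lagrange multiplier lambda_j
    at_upper : True if the constraint is at its upper bound, False if at lower
    sigma    : non-negative tolerance (depends on Optimality Tolerance)
    """
    return lam <= sigma if at_upper else lam >= -sigma


sigma = 1.0e-6
print(multiplier_is_optimal(-0.3, at_upper=True, sigma=sigma))   # optimal
print(multiplier_is_optimal(0.3, at_upper=True, sigma=sigma))    # nonoptimal
print(multiplier_is_optimal(0.3, at_upper=False, sigma=sigma))   # optimal
print(multiplier_is_optimal(-0.3, at_upper=False, sigma=sigma))  # nonoptimal
```

A constraint whose multiplier fails this test is a candidate for deletion from the working set.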

The special form (7) of the working set allows the multiplier vector

λ

, the solution of (10), to be written in terms of the vector

d = (\begin{matrix} g \\ 0 \end{matrix}) - (\begin{matrix} A^{T} \\ - I \end{matrix}) π = (\begin{matrix} g - A^{T} π \\ π \end{matrix}),

(11)

where

π

satisfies the equations

B^{T} π = g_{B}

, and

g_{B}

denotes the basic elements of

g

. The elements of

π

are the Lagrange multipliers

λ_{j}

associated with the equality constraints

A x - s = 0

. The vector

d_{N}

of nonbasic elements of

d

consists of the Lagrange multipliers

λ_{j}

associated with the upper and lower bound constraints in the working set. The vector

d_{S}

of superbasic elements of

d

is the reduced gradient

g_{Z}

in (9). The vector

d_{B}

of basic elements of

d

is zero, by construction. (The Euclidean norm of $d_{S}$ and the final values of $d_{S}$, $g$ and $π$ are the quantities rgNorm, Reduced Gradnt, Obj Gradient and Dual Activity in the monitoring file output; see Section 13.)
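
The structure of $d$ in (11) can be verified on a small example. The Python sketch below uses made-up data with $m = 2$ and $n = 3$, takes the first two columns of $(\begin{matrix} A & - I \end{matrix})$ as basic, and solves $B^{T} π = g_{B}$ with the hypothetical helper `solve2` (Cramer's rule, adequate only for this $2 \times 2$ illustration). The basic elements of $d$ come out as zero and the slack elements reproduce $π$.

```python
from fractions import Fraction as F


def solve2(M, b):
    """Solve a 2-by-2 system M y = b by Cramer's rule."""
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    return [(b[0] * M[1][1] - M[0][1] * b[1]) / det,
            (M[0][0] * b[1] - b[0] * M[1][0]) / det]


# Columns of (A  -I) for m = 2, n = 3; columns 0 and 1 are taken as basic.
cols = [[F(2), F(0)], [F(1), F(3)], [F(1), F(2)], [F(-1), F(0)], [F(0), F(-1)]]
g = [F(1), F(2), F(3), F(0), F(0)]   # slack components of g are zero
basic = [0, 1]

# B^T has the basic columns of (A  -I) as its rows.
BT = [cols[j] for j in basic]
pi = solve2(BT, [g[j] for j in basic])

# d_j = g_j - pi^T a_j for every column a_j of (A  -I).
d = [g[j] - sum(p * a for p, a in zip(pi, cols[j])) for j in range(len(cols))]
print(pi)
print(d)
```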

If the reduced gradient is not zero, Lagrange multipliers need not be computed and the search direction is given by

p = Z p_{Z}

(see (8) and (12)). The step length is chosen to maintain feasibility with respect to the satisfied constraints.

There are two possible choices for

p_{Z}

, depending on whether or not

H_{Z}

is singular. If

H_{Z}

is nonsingular,

R

is nonsingular and

p_{Z}

in (5) is computed from the equations

R^{T} R p_{Z} = - g_{Z},

(12)

where

g_{Z}

is the reduced gradient at

x

. In this case,

(x, s) + p

is the minimizer of the objective function subject to the working set constraints being treated as equalities. If

(x, s) + p

is feasible,

α

is defined to be unity. In this case, the reduced gradient at

(\bar{x}, \bar{s})

will be zero, and Lagrange multipliers are computed at the next iteration. Otherwise,

α

is set to

α_{N}

, the step to the ‘nearest’ constraint along

p

. This constraint is then added to the working set at the next iteration.

If $H_{Z}$ is singular, then $R$ must also be singular, and an inertia-controlling strategy is used to ensure that only the last diagonal element of $R$ is zero. (See Gill et al. (1991) for a discussion of a similar strategy for indefinite quadratic programming.) In this case,

p_{Z}

satisfies

p_{Z}^{T} H_{Z} p_{Z} = 0 and g_{Z}^{T} p_{Z} \leq 0,

(13)

which allows the objective function to be reduced by any step of the form

(x, s) + α p

, where

α > 0

. The vector

p = Z p_{Z}

is a direction of unbounded descent for the QP problem in the sense that the QP objective is linear and decreases without bound along

p

. If no finite step of the form

(x, s) + α p

(where

α > 0

) reaches a constraint not in the working set, the QP problem is unbounded and the routine terminates immediately with

ifail = 6

. Otherwise,

α

is defined as the maximum feasible step along

p

and a constraint active at

(x, s) + α p

is added to the working set for the next iteration.
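The step-length rule in both cases can be sketched as follows. This is a minimal illustration, not the routine's implementation: it assumes simple bounds on the combined vector $(x, s)$, ignores the working feasibility tolerance, and the function names are hypothetical.

```python
import math

def max_feasible_step(z, p, lower, upper):
    """Largest alpha >= 0 such that lower <= z + alpha * p <= upper.

    z holds the current (x, s); an infinite result means that no
    constraint is reached along p (the unbounded case).
    """
    alpha_n = math.inf
    for zj, pj, lj, uj in zip(z, p, lower, upper):
        if pj > 0 and uj < math.inf:
            alpha_n = min(alpha_n, (uj - zj) / pj)
        elif pj < 0 and lj > -math.inf:
            alpha_n = min(alpha_n, (lj - zj) / pj)
    return alpha_n

def step_length(z, p, lower, upper, hz_singular=False):
    """Unit step if feasible (nonsingular case), else step to nearest bound."""
    alpha_n = max_feasible_step(z, p, lower, upper)
    if hz_singular:
        return alpha_n          # may be math.inf: problem unbounded
    return min(1.0, alpha_n)
```

A step shorter than unity means that the blocking bound becomes a candidate for the working set at the next iteration.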

e04nqf makes explicit allowance for infeasible constraints. Infeasible linear constraints are detected first by solving a problem of the form

\underset{x, v, w}{\text{minimize}} \quad e^{T} (v + w) \quad \text{subject to} \quad l \leq (\begin{matrix} x \\ G x - v + w \end{matrix}) \leq u, \quad v \geq 0, \quad w \geq 0,

(14)

where

e^{T} = (1, 1, \dots, 1)

. This is equivalent to minimizing the sum of the general linear constraint violations subject to the simple bounds. (In the linear programming literature, the approach is often called elastic programming.)
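For a fixed $x$, the smallest elastic variables $v$ and $w$ in (14) are just the amounts by which each row of $G x$ overshoots its upper bound or undershoots its lower bound. The sketch below illustrates this under that assumption; the function name is hypothetical, and the routine itself also minimizes over $x$.

```python
def sum_of_infeasibilities(gx, lower, upper):
    """Return (v, w, e^T (v + w)) for fixed row activities gx = G x.

    v[i] is the violation of the upper bound on row i and w[i] the
    violation of the lower bound, so lower <= gx - v + w <= upper.
    """
    v = [max(g - u, 0.0) for g, u in zip(gx, upper)]
    w = [max(l - g, 0.0) for g, l in zip(gx, lower)]
    return v, w, sum(v) + sum(w)
```

A zero total means that the general constraints are satisfied at that $x$.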

11.4 Miscellaneous

If the basis matrix is not chosen carefully, the condition of the null space matrix

Z

in (8) could be arbitrarily high. To guard against this, the routine implements a ‘basis repair’ feature in which the LUSOL package (see Gill et al. (1991)) is used to compute the rectangular factorization

{(\begin{matrix} B & S \end{matrix})}^{T} = L U,

(15)

returning just the permutation

P

that makes

P L P^{T}

unit lower triangular. The pivot tolerance is set to require

{| P L P^{T} |}_{i j} \leq 2

, and the permutation is used to define

P

in (7). It can be shown that

‖ Z ‖

is likely to be little more than unity. Hence,

Z

should be well-conditioned regardless of the condition of

W

. This feature is applied at the beginning of the optimality phase if a potential

B - S

ordering is known.

The EXPAND procedure (see Gill et al. (1989)) is used to reduce the possibility of cycling at a point where the active constraints are nearly linearly dependent. Although there is no absolute guarantee that cycling will not occur, the probability of cycling is extremely small (see Hall and McKinnon (1996)). The main feature of EXPAND is that the feasibility tolerance is increased at the start of every iteration. This allows a positive step to be taken at every iteration, perhaps at the expense of violating the bounds on

(x, s)

by a small amount.

Suppose that the value of the optional parameter Feasibility Tolerance is $δ$. Over a period of $K$ iterations (where $K$ is the value of the optional parameter Expand Frequency), the feasibility tolerance actually used by the routine (i.e., the working feasibility tolerance) increases from $0.5 δ$ to $δ$ (in steps of $0.5 δ / K$).

At certain stages the following ‘resetting procedure’ is used to remove small constraint infeasibilities. First, all nonbasic variables are moved exactly onto their bounds. A count is kept of the number of nontrivial adjustments made. If the count is nonzero, the basic variables are recomputed. Finally, the working feasibility tolerance is reinitialized to $0.5 δ$.

If a problem requires more than $K$ iterations, the resetting procedure is invoked and a new cycle of iterations is started. (The decision to resume the feasibility phase or optimality phase is based on comparing any constraint infeasibilities with $δ$.)

The resetting procedure is also invoked when the routine reaches an apparently optimal, infeasible or unbounded solution, unless this situation has already occurred twice. If any nontrivial adjustments are made, iterations are continued.
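The growth and reset of the working feasibility tolerance can be sketched as below. This only illustrates the schedule described above: the exact point within an iteration at which the tolerance is increased or reset is an assumption, the adjustment of the nonbasic variables is omitted, and the function name is hypothetical.

```python
def working_tolerances(delta, k, n_iter):
    """Working feasibility tolerance used at each of n_iter iterations.

    The tolerance starts at 0.5 * delta, grows by 0.5 * delta / k at the
    start of every iteration, and is reinitialized after k iterations.
    """
    step = 0.5 * delta / k
    tol = 0.5 * delta
    since_reset = 0
    history = []
    for _ in range(n_iter):
        if since_reset == k:        # resetting procedure: start a new cycle
            tol = 0.5 * delta
            since_reset = 0
        tol += step
        since_reset += 1
        history.append(tol)
    return history
```

With `delta = 1.0e-6` and `k = 4`, the tolerance reaches `delta` at the fourth iteration and restarts just above `0.5 * delta` at the fifth.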

The EXPAND procedure not only allows a positive step to be taken at every iteration, but also provides a potential choice of constraints to be added to the working set. All constraints at a distance

α

(where

α \leq α_{N}

) along

p

from the current point are then viewed as acceptable candidates for inclusion in the working set. The constraint whose normal makes the largest angle with the search direction is added to the working set. This strategy helps keep the basis matrix

B

well-conditioned.

12 Optional Parameters

Several optional parameters in e04nqf define choices in the problem specification or the algorithm logic. In order to reduce the number of formal arguments of e04nqf these optional parameters have associated default values that are appropriate for most problems. Therefore, you need only specify those optional parameters whose values are to be different from their default values.

The remainder of this section can be skipped if you wish to use the default values for all optional parameters.

The following is a list of the optional parameters available. A full description of each optional parameter is provided in Section 12.1.

Backup Basis File
Check Frequency
Crash Option
Crash Tolerance
Defaults
Dump File
Elastic Mode
Elastic Objective
Elastic Weight
Expand Frequency
Factorization Frequency
Feasibility Tolerance
Feasible Point
Infinite Bound Size
Insert File
Iterations Limit
List
Load File
LU Complete Pivoting
LU Density Tolerance
LU Factor Tolerance
LU Partial Pivoting
LU Rook Pivoting
LU Singularity Tolerance
LU Update Tolerance
Maximize
Minimize
New Basis File
Nolist
Old Basis File
Optimality Tolerance
Partial Price
Pivot Tolerance
Print File
Print Frequency
Print Level
Punch File
QPSolver CG
QPSolver Cholesky
QPSolver QN
Reduced Hessian Dimension
Save Frequency
Scale Option
Scale Print
Scale Tolerance
Solution File
Solution No
Solution Yes
Summary File
Summary Frequency
Superbasics Limit
Suppress Parameters
System Information No
System Information Yes
Timing Level
Unbounded Step Size

Optional parameters may be specified by calling one or more of the routines e04nrf, e04nsf, e04ntf and e04nuf before a call to e04nqf, but after a call to e04npf.

e04nrf reads options from an external options file, with Begin and End as the first and last lines respectively and each intermediate line defining a single optional parameter. For example,

Begin
   Print Level = 5
End

The call

Call e04nrf (ispecs, cw, iw, rw, ifail)

can then be used to read the file on unit ispecs. ifail will be zero on successful exit. e04nrf should be consulted for a full description of this method of supplying optional parameters.

e04nsf, e04ntf or e04nuf can be called to supply options directly, one call being necessary for each optional parameter. e04nsf, e04ntf or e04nuf should be consulted for a full description of this method of supplying optional parameters.

All optional parameters not specified by you are set to their default values. Optional parameters specified by you are unaltered by e04nqf (unless they define invalid values) and so remain in effect for subsequent calls unless altered by you.

12.1 Description of the Optional Parameters

For each option, we give a summary line, a description of the optional parameter and details of constraints.

The summary line contains:

the keywords;
a parameter value, where the letters $a$, $i$ and $r$ denote options that take character, integer and real values respectively;
the default value, which is used whenever the condition $| i | \geq 100000000$ is satisfied, and where the symbol $ε$ is a generic notation for machine precision (see x02ajf);
The variable $bigbnd$ holds the value of Infinite Bound Size.

Keywords and character values are case and white space insensitive.

Check Frequency

i

Default

= 60

Every

i

th iteration after the most recent basis factorization, a numerical test is made to see if the current solution

(x, s)

satisfies the linear constraints

A x - s = 0

. If the largest element of the residual vector

r = A x - s

is judged to be too large, the current basis is refactorized and the basic variables recomputed to satisfy the constraints more accurately. If

i \leq 0

, the value

i = 99999999

is used and effectively no checks are made.

Check Frequency = 1

is useful for debugging purposes, but otherwise this option should not be needed.

Crash Option

i

Default

= 3

Crash Tolerance

r

Default

= 0.1

Note that these options do not apply when start $=$ 'W' (see Section 5).

If start $=$ 'C', an internal Crash procedure is used to select an initial basis from various rows and columns of the constraint matrix $(\begin{matrix} A & - I \end{matrix})$. The value of $i$ determines which rows and columns of $A$ are initially eligible for the basis, and how many times the Crash procedure is called. Columns of $- I$ are used to pad the basis where necessary.

$i$	Meaning
$0$	The initial basis contains only slack variables: $B = I$ .
$1$	The Crash procedure is called once, looking for a triangular basis in all rows and columns of the matrix $A$ .
$2$	The Crash procedure is called once, looking for a triangular basis in rows.
$3$	The Crash procedure is called twice, treating linear equalities and linear inequalities separately.

If $i \geq 1$, certain slacks on inequality rows are selected for the basis first. (If $i \geq 2$, numerical values are used to exclude slacks that are close to a bound.) The Crash procedure then makes several passes through the columns of $A$, searching for a basis matrix that is essentially triangular. A column is assigned to ‘pivot’ on a particular row if the column contains a suitably large element in a row that has not yet been assigned. (The pivot elements ultimately form the diagonals of the triangular basis.) For remaining unassigned rows, slack variables are inserted to complete the basis.

The Crash Tolerance allows the Crash procedure to ignore certain ‘small’ nonzero elements in each column of $A$. If $a_{\max}$ is the largest element in column $j$, other nonzeros $a_{i j}$ in the column are ignored if $| a_{i j} | \leq a_{\max} \times r$. (To be meaningful, $r$ should be in the range $0 \leq r < 1$.)

When $r > 0.0$, the basis obtained by the Crash procedure may not be strictly triangular, but it is likely to be nonsingular and almost triangular. The intention is to obtain a starting basis containing more columns of $A$ and fewer (arbitrary) slacks. A feasible solution may be reached sooner on some problems.

For example, suppose the first $m$ columns of $A$ form the matrix shown under LU Factor Tolerance; i.e., a tridiagonal matrix with entries $- 1$, $4$, $- 1$. To help the Crash procedure choose all $m$ columns for the initial basis, we would specify a Crash Tolerance of $r$ for some value of $r > 0.5$.
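The effect of the Crash Tolerance on a single column can be sketched as follows. This is a minimal illustration of the rule $| a_{i j} | \leq a_{\max} \times r$ only; it assumes that $a_{\max}$ is the largest element in magnitude, and the function name is hypothetical.

```python
def eligible_entries(column, r):
    """Row indices of the nonzeros in a column that are not ignored.

    A nonzero a_ij is ignored if |a_ij| <= a_max * r, where a_max is
    taken here to be the largest magnitude in the column (an assumption).
    """
    a_max = max(abs(a) for a in column)
    return [i for i, a in enumerate(column) if a != 0 and abs(a) > a_max * r]

# Interior column of the tridiagonal example: entries -1, 4, -1.
column = [-1.0, 4.0, -1.0]
print(eligible_entries(column, 0.1))   # off-diagonals are still considered
print(eligible_entries(column, 0.6))   # only the diagonal entry remains
```

With the off-diagonal entries ignored, each column of the example looks like a single pivot on its diagonal, which is what allows all $m$ columns to be chosen.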

Defaults

This special keyword may be used to reset all optional parameters to their default values.

Dump File

i_{1}

Default

= 0

Load File

i_{2}

Default

= 0

Optional parameters Dump File and Load File are similar to optional parameters Punch File and Insert File, but they record solution information in a manner that is more direct and more easily modified. A full description of information recorded in optional parameters Dump File and Load File is given in Gill et al. (2005a).

If $i_{1} > 0$, the last solution obtained will be output to the file with unit number $i_{1}$.

If $i_{2} > 0$, the Load File containing basis information will be read. The file will usually have been output previously as a Dump File. The file will not be accessed if optional parameters Old Basis File or Insert File are specified.

Elastic Mode

i

Default

= 1

This argument determines if (and when) elastic mode is to be started. Three elastic modes are available as follows:

$i$	Meaning
$0$	Elastic mode is never invoked. e04nqf will terminate as soon as infeasibility is detected. There may be other points with significantly smaller sums of infeasibilities.
$1$	Elastic mode is invoked only if the constraints are found to be infeasible (the default). If the constraints are infeasible, continue in elastic mode with the composite objective determined by the values of the optional parameters Elastic Objective and Elastic Weight.
$2$	The iterations start and remain in elastic mode. This option allows you to minimize the composite objective function directly without first performing Phase 1 iterations. The success of this option will depend critically on your choice of Elastic Weight. If Elastic Weight is sufficiently large and the constraints are feasible, the minimizer of the composite objective and the solution of the original problem are identical. However, if the Elastic Weight is not sufficiently large, the minimizer of the composite function may be infeasible, even if a feasible point exists.

Elastic Objective

i

Default

= 1

This determines the form of the composite objective $f (x) + γ \sum_{j} (v_{j} + w_{j})$ in Phase 2 ($γ \geq 0$). Three types of composite objectives are available.

$i$	Meaning
$0$	Include only the true objective $f (x)$ in the composite objective. This option sets $γ = 0$ in the composite objective and allows e04nqf to ignore the elastic bounds and find a solution that minimizes $f (x)$ subject to the non-elastic constraints. This option is useful if there are some ‘soft’ constraints that you would like to ignore if the constraints are infeasible.
$1$	Use a composite objective defined with $γ$ determined by the value of Elastic Weight. This value is intended to be used in conjunction with $Elastic Mode = 2$ .
$2$	Include only the elastic variables in the composite objective. The elastics are weighted by $γ = 1$ . This choice minimizes the violations of the elastic variables at the expense of possibly increasing the true objective. This option can be used to find a point that minimizes the sum of the violations of a subset of constraints specified by the input array helast.

Elastic Weight

r

Default

= 1.0

This defines the value of $γ$ in the composite objective in Phase 2 ($γ \geq 0$).

At each iteration of elastic mode, the composite objective is defined to be

\text{minimize} \quad σ f (x) + γ \, (\text{sum of infeasibilities});

where $σ = 1$ for Minimize, $σ = −1$ for Maximize, and $f (x)$ is the quadratic objective.

Note that the effect of $γ$ is not disabled once a feasible point is obtained.

Expand Frequency

i

Default

= 10000

This option is part of an anti-cycling procedure (see Section 11.4) designed to allow progress even on highly degenerate problems.

The strategy is to force a positive step at every iteration, at the expense of violating the constraints by a small amount. Suppose that the value of the optional parameter Feasibility Tolerance is $δ$. Over a period of $i$ iterations, the feasibility tolerance actually used by e04nqf (i.e., the working feasibility tolerance) increases from $0.5 δ$ to $δ$ (in steps of $0.5 δ / i$).

Increasing the value of $i$ helps reduce the number of slightly infeasible nonbasic variables (most of which are eliminated during the resetting procedure). However, it also diminishes the freedom to choose a large pivot element (see the description of the optional parameter Pivot Tolerance).

If $i \leq 0$, the value $i = 99999999$ is used and effectively no anti-cycling procedure is invoked.

Factorization Frequency

i

Default

= 100 (LP) or 50 (QP)

If $i > 0$, at most $i$ basis changes will occur between factorizations of the basis matrix.

For LP problems, the basis factors are usually updated at every iteration. Higher values of

i

may be more efficient on problems that are extremely sparse and well scaled.

For QP problems, fewer basis updates will occur as the solution is approached. The number of iterations between basis factorizations will, therefore, increase. During these iterations a test is made regularly according to the value of optional parameter Check Frequency to ensure that the linear constraints

A x - s = 0

are satisfied. Occasionally, the basis will be refactorized before the limit of

i

updates is reached. If

i \leq 0

, the default value is used.

Feasibility Tolerance

r

Default

= \max {10^{−6}, \sqrt{ε}}

A feasible problem is one in which all variables satisfy their upper and lower bounds to within the absolute tolerance $r$. (This includes slack variables. Hence, the general constraints are also satisfied to within $r$.)

e04nqf attempts to find a feasible solution before optimizing the objective function. If the sum of infeasibilities cannot be reduced to zero, the problem is assumed to be infeasible. Let sInf be the corresponding sum of infeasibilities. If sInf is quite small, it may be appropriate to raise $r$ by a factor of $10$ or $100$. Otherwise, some error in the data should be suspected.

Note that if sInf is not small and you have not asked e04nqf to minimize the violations of the elastic variables (i.e., you have not specified Elastic Objective $= 2$), there may be other points that have a significantly smaller sum of infeasibilities. e04nqf will not attempt to find the solution that minimizes the sum unless Elastic Objective $= 2$.

Infinite Bound Size

r

Default

= 10^{20}

If $r \geq 0$, $r$ defines the ‘infinite’ bound $infbnd$ in the definition of the problem constraints. Any upper bound greater than or equal to $infbnd$ will be regarded as $+ \infty$ (and similarly any lower bound less than or equal to $- infbnd$ will be regarded as $- \infty$). If $r < 0$, the default value is used.

Iterations Limit

i

Default

= \max {10000, 10 \max {m, n}}

The value of

i

specifies the maximum number of iterations allowed before termination. Setting

i = 0

and

Print Level > 0

means that: the workspace needed to start solving the problem will be computed and printed; and feasibility and optimality will be checked. No iterations will be performed. If

i < 0

, the default value is used.

LU Density Tolerance

r_{1}

Default

= 0.6

LU Singularity Tolerance

r_{2}

Default

= ε^{\frac{2}{3}}

The density tolerance

r_{1}

is used during

L U

factorization of the basis matrix. Columns of

L

and rows of

U

are formed one at a time, and the remaining rows and columns of the basis are altered appropriately. At any stage, if the density of the remaining matrix exceeds

r_{1}

, the Markowitz strategy for choosing pivots is terminated. The remaining matrix is factored by a dense

L U

procedure. Raising the density tolerance towards

1.0

may give slightly sparser

L U

factors, with a slight increase in factorization time.

If $r_{2} > 0$, $r_{2}$ defines the singularity tolerance used to guard against ill-conditioned basis matrices. After $B$ is refactorized, the diagonal elements of $U$ are tested as follows. If $| u_{j j} | \leq r_{2}$ or $| u_{j j} | < r_{2} \max_{i} | u_{i j} |$, the $j$th column of the basis is replaced by the corresponding slack variable. If $r_{2} \leq 0$, the default value is used.
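The diagonal test can be sketched as below for a small dense upper triangular factor. This is an illustration of the two inequalities only, not of the LUSOL data structures; the function name is hypothetical.

```python
def singular_columns(u, r2):
    """Indices j for which the diagonal of U fails the singularity test.

    Column j is flagged if |u_jj| <= r2 (absolute test) or
    |u_jj| < r2 * max_i |u_ij| (test relative to the rest of the column).
    u is a dense square matrix stored as a list of rows.
    """
    n = len(u)
    flagged = []
    for j in range(n):
        ujj = abs(u[j][j])
        col_max = max(abs(u[i][j]) for i in range(n))
        if ujj <= r2 or ujj < r2 * col_max:
            flagged.append(j)
    return flagged
```

In the routine, a flagged basis column is replaced by the corresponding slack variable, as described above.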

LU Factor Tolerance

r_{1}

Default

= 100.0

LU Update Tolerance

r_{2}

Default

= 10.0

The values of

r_{1}

and

r_{2}

affect the stability and sparsity of the basis factorization

B = L U

, during refactorization and updates respectively. The lower triangular matrix

L

is a product of matrices of the form

(\begin{matrix} 1 \\ μ & 1 \end{matrix})

where the multipliers

μ

will satisfy

| μ | \leq r_{i}

. The default values of

r_{1}

and

r_{2}

usually strike a good compromise between stability and sparsity. They must satisfy $r_{1}, r_{2} \geq 1.0$.

For large and relatively dense problems, $r_{1} = 10.0$ or $5.0$ (say) may give a useful improvement in stability without impairing sparsity to a serious degree.

For certain very regular structures (e.g., band matrices) it may be necessary to reduce $r_{1}$ and/or $r_{2}$ in order to achieve stability. For example, if the columns of $A$ include a sub-matrix of the form

(\begin{matrix} 4 & −1 & & & & \\ −1 & 4 & −1 & & & \\ & −1 & 4 & −1 & & \\ & & \ddots & \ddots & \ddots & \\ & & & −1 & 4 & −1 \\ & & & & −1 & 4 \end{matrix}),

one should set both $r_{1}$ and $r_{2}$ to values in the range $1.0 \leq r_{i} < 4.0$.

LU Partial Pivoting

Default

LU Complete Pivoting

LU Rook Pivoting

The

L U

factorization implements a Markowitz-type search for pivots that locally minimize the fill-in subject to a threshold pivoting stability criterion. The default option is to use threshold partial pivoting. The optional parameters LU Complete Pivoting and LU Rook Pivoting are more expensive but more stable and better at revealing rank, as long as the LU Factor Tolerance is not too large (say $< 2.0$).

Minimize

Default

Maximize

Feasible Point

This option specifies the required direction of the optimization. It applies to both linear and nonlinear terms (if any) in the objective function. Note that if two problems are the same except that one minimizes

f (x)

and the other maximizes

- f (x)

, their solutions will be the same but the signs of the dual variables

π_{i}

and the reduced gradients

d_{j}

(see Section 11.3) will be reversed.

The optional parameter Feasible Point means ‘ignore the objective function, while finding a feasible point for the linear constraints’. It can be used to check that the constraints are feasible without altering the call to e04nqf.

New Basis File

i_{1}

Default

= 0

Backup Basis File

i_{2}

Default

= 0

Save Frequency

i_{3}

Default

= 100

Optional parameters New Basis File and Backup Basis File are sometimes referred to as basis maps. They contain the most compact representation of the state of each variable. They are intended for restarting the solution of a problem at a point that was reached by an earlier run. For nontrivial problems, it is advisable to save basis maps at the end of a run, in order to restart the run if necessary.

If $i_{1} > 0$, a basis map will be saved on file $i_{1}$ every $i_{3}$th iteration, where $i_{3}$ is the Save Frequency. The first record of the file will contain the word PROCEEDING if the run is still in progress. A basis map will also be saved at the end of a run, with some other word indicating the final solution status.

Use of

i_{2} > 0

is intended as a safeguard against losing the results of a long run. Suppose that a New Basis File is being saved every

100

(Save Frequency) iterations, and that e04nqf is about to save such a basis at iteration

2000

. It is conceivable that the run may be interrupted during the next few milliseconds (in the middle of the save). In this case the Basis file will be corrupted and the run will have been essentially wasted.

To eliminate this risk, both a New Basis File and a Backup Basis File may be specified. The following would be suitable for the above example:

Backup Basis File 11
New Basis File 12

The current basis will then be saved every $100$ iterations, first on New Basis File and then immediately on Backup Basis File. If the run is interrupted at iteration $2000$ during the save on New Basis File, there will still be a usable basis on Backup Basis File (corresponding to iteration $1900$).

Note that a new basis will be saved in New Basis File at the end of a run if it terminates normally, but it will not be saved in Backup Basis File. In the above example, if an optimum solution is found at iteration $2050$ (or if the iteration limit is $2050$), the final basis on New Basis File will correspond to iteration $2050$, but the last basis saved on Backup Basis File will be the one for iteration $2000$.

A full description of information recorded in New Basis File and Backup Basis File is given in Gill et al. (2005a).

Nolist

Default

List

Optional parameter List enables printing of each optional parameter specification as it is supplied. Nolist suppresses this printing.

Old Basis File

i

Default

= 0

If $i > 0$, the basis map information will be obtained from this file. The file will usually have been output previously as a New Basis File or Backup Basis File. A full description of information recorded in New Basis File and Backup Basis File is given in Gill et al. (2005a).

The file will not be acceptable if the number of rows or columns in the problem has been altered.

Optimality Tolerance

r

Default

= \max {10^{−6}, \sqrt{ε}}

This is used to judge the size of the reduced gradients

d_{j} = g_{j} - a_{j}^{T} π

, where

g_{j}

is the

j

th component of the gradient,

a_{j}

is the associated column of the constraint matrix

(\begin{matrix} A & - I \end{matrix})

, and

π

is the set of dual variables.

By construction, the reduced gradients for basic variables are always zero. The problem will be declared optimal if the reduced gradients for nonbasic variables at their lower or upper bounds satisfy

d_{j} / ‖ π ‖ \geq - r or d_{j} / ‖ π ‖ \leq r

respectively, and if

| d_{j} | / ‖ π ‖ \leq r

for superbasic variables.

In the above tests,

‖ π ‖

is a measure of the size of the dual variables. It is included to make the tests independent of a scale factor on the objective function. The quantity

‖ π ‖

actually used is defined by

‖ π ‖ = \max (σ / \sqrt{m}, 1), \quad \text{where} \quad σ = \sum_{i = 1}^{m} | π_{i} |,

so that only large scale factors are allowed for.
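The tests above can be sketched as follows. This is an illustration of the stated inequalities only; the status labels `"lower"`, `"upper"`, `"superbasic"` and `"basic"` and the function names are hypothetical and are not part of the e04nqf interface.

```python
import math

def pi_norm(pi):
    """Scaled size of the dual variables: max(sigma / sqrt(m), 1)."""
    sigma = sum(abs(p) for p in pi)
    return max(sigma / math.sqrt(len(pi)), 1.0)

def is_optimal(d, status, pi, r):
    """Apply the reduced-gradient tests with Optimality Tolerance r.

    d[j] is the reduced gradient of variable j and status[j] says how the
    variable is currently held. Basic variables are skipped because their
    reduced gradients are zero by construction.
    """
    scale = pi_norm(pi)
    for dj, sj in zip(d, status):
        t = dj / scale
        if sj == "lower" and t < -r:
            return False
        if sj == "upper" and t > r:
            return False
        if sj == "superbasic" and abs(t) > r:
            return False
    return True
```

Because of the division by `pi_norm(pi)`, multiplying the objective (and hence $d$ and $π$) by a large factor leaves the outcome of the test unchanged.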

If the objective is scaled down to be very small, the optimality test reduces to comparing $d_{j}$ against $0.01 r$.

Partial Price

i

Default

= 10 (LP) or 1 (QP)

This option is recommended for large FP or LP problems that have significantly more variables than constraints (i.e.,

n ≫ m

). It reduces the work required for each pricing operation (i.e., when a nonbasic variable is selected to enter the basis). If $i = 1$, all columns of the constraint matrix $(\begin{matrix} A & - I \end{matrix})$ are searched. If $i > 1$, $A$ and $I$ are partitioned to give

i

roughly equal segments

A_{j}, I_{j}

, for

j = 1, 2, \dots, i

(modulo

i

). If the previous pricing search was successful on

A_{j - 1}, I_{j - 1}

, the next search begins on the segments

A_{j}

and

I_{j}

. If a reduced gradient is found that is larger than some dynamic tolerance, the variable with the largest such reduced gradient (of appropriate sign) is selected to enter the basis. If nothing is found, the search continues on the next segments

A_{j + 1}, I_{j + 1}

, and so on. If

i \leq 0

, the default value is used.
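The cyclic segment search can be sketched as below. This is a simplified illustration: it treats all columns as one list rather than partitioning $A$ and $I$ separately, it assumes that `merit[j]` already holds the size of the reduced gradient of column $j$ with the appropriate sign (zero if the column is not a candidate), and the function names and the fixed tolerance `tol` are hypothetical.

```python
def segments(n_cols, i):
    """Split column indices 0..n_cols-1 into i roughly equal segments."""
    bounds = [(k * n_cols) // i for k in range(i + 1)]
    return [range(bounds[k], bounds[k + 1]) for k in range(i)]

def partial_price(merit, i, start_segment, tol):
    """Search segments cyclically, starting at start_segment.

    Returns (column, segment) for the first segment that contains a merit
    value larger than tol, choosing the largest value in that segment, or
    (None, None) if no segment contains one.
    """
    segs = segments(len(merit), i)
    for offset in range(i):
        k = (start_segment + offset) % i
        best = max(segs[k], key=lambda j: merit[j], default=None)
        if best is not None and merit[best] > tol:
            return best, k
    return None, None
```

Only one segment is examined when the search succeeds immediately, which is where the saving over a full pricing pass comes from.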

Pivot Tolerance

r

Default

= ε^{\frac{2}{3}}

Broadly speaking, the pivot tolerance is used to prevent columns entering the basis if they would cause the basis to become almost singular.

When

x

changes to

x + α p

for some search direction

p

, a ‘ratio test’ determines which component of

x

reaches an upper or lower bound first. The corresponding element of

p

is called the pivot element. Elements of $p$ are ignored (and, therefore, cannot be pivot elements) if they are smaller than the pivot tolerance $r$.

It is common for two or more variables to reach a bound at essentially the same time. In such cases, the optional parameter Feasibility Tolerance (say

t

) provides some freedom to maximize the pivot element and thereby improve numerical stability. Excessively small values of

t

should, therefore, not be specified. To a lesser extent, the optional parameter Expand Frequency (say

f

) also provides some freedom to maximize the pivot element. Excessively large values of

f

should, therefore, not be specified.

Print File

i

Default

= 0

If $i > 0$, the following information is output to file $i$ during the solution of each problem:

–a listing of the optional parameters;
–some statistics about the problem;
–the amount of storage available for the $L U$ factorization of the basis matrix;
–notes about the initial basis resulting from a Crash procedure or a Basis file;
–the iteration log;
–basis factorization statistics;
–the exit ifail condition and some statistics about the solution obtained;
–the printed solution, if requested.

The last four items are described in Sections 9 and 13. Further brief output may be directed to the Summary File.

Print Frequency

i

Default

= 100

If $i > 0$, one line of the iteration log will be printed every $i$th iteration. A value such as $i = 10$ is suggested for those interested only in the final solution. If $i \leq 0$, the value of $i = 99999999$ is used and effectively no lines of the iteration log are printed.

Print Level

i

Default

= 1

This controls the amount of printing produced by e04nqf as follows.

$i$	Meaning
0	No output except error messages. If you want to suppress all output, set $Print File = 0$ .
$= 1$	The set of selected options, problem statistics, summary of the scaling procedure, information about the initial basis resulting from a Crash or a Basis file, a single line of output at each iteration (controlled by the optional parameter Print Frequency), and the exit condition with a summary of the final solution.
$\geq 10$	Basis factorization statistics.

Punch File

i_{1}

Default

= 0

Insert File

i_{2}

Default

= 0

These files provide compatibility with commercial mathematical programming systems. The Punch File from a previous run may be used as an Insert File for a later run on the same problem. A full description of information recorded in Insert File and Punch File is given in Gill et al. (2005a).

If $i_{1} > 0$, the final solution obtained will be output to file $i_{1}$. For linear programs, this format is compatible with various commercial systems.

If $i_{2} > 0$, the Insert File containing basis information will be read. The file will usually have been output previously as a Punch File. The file will not be accessed if Old Basis File is specified.

QPSolver Cholesky

Default

QPSolver CG

QPSolver QN

Specifies the active-set algorithm used to solve the quadratic program in Phase 2 (

γ

). QPSolver Cholesky holds the full Cholesky factor

R

of the reduced Hessian

Z^{T} H Z

. As the QP iterations proceed, the dimension of

R

changes with the number of superbasic variables. If the number of superbasic variables needs to increase beyond the value of Reduced Hessian Dimension, the reduced Hessian cannot be stored and the solver switches to QPSolver CG. The Cholesky solver is reactivated if the number of superbasics stabilizes at a value less than Reduced Hessian Dimension.

QPSolver QN solves the QP using a quasi-Newton method. In this case,

R

is the factor of a quasi-Newton approximate Hessian.

QPSolver CG uses an active-set method similar to QPSolver QN, but uses the conjugate-gradient method to solve all systems involving the reduced Hessian.

The Cholesky QP solver is the most robust, but may require a significant amount of computation if there are many superbasics.

The quasi-Newton QP solver does not require computation of the exact

R

at the start of Phase 2 (

γ

). It may be appropriate when the number of superbasics is large but relatively few iterations are needed to reach a solution (e.g., if e04nqf is called with a Warm Start).

The conjugate-gradient QP solver is appropriate for problems with many degrees of freedom (say, more than

2000

superbasics).

Reduced Hessian Dimension

i

Default

= 1 (LP) or \min (2000, n_{H} + 1, n) (QP)

This specifies that an $i \times i$ triangular matrix $R$ (to define the reduced Hessian according to $R^{T} R = Z^{T} H Z$) is to be available for use by the Cholesky QP solver.

Scale Option

i

Default

= 2

Scale Tolerance

r

Default

= 0.9

Scale Print

Three scale options are available as follows:

$i$	Meaning
0	No scaling. This is recommended if it is known that $x$ and the constraint matrix never have very large elements (say, larger than $100$ ).
1	The constraints and variables are scaled by an iterative procedure that attempts to make the matrix coefficients as close as possible to $1.0$ (see Fourer (1982)). This will sometimes improve the performance of the solution procedures.
2	The constraints and variables are scaled by the iterative procedure. Also, a certain additional scaling is performed that may be helpful if the right-hand side $b$ or the solution $x$ is large. This takes into account columns of $(\begin{matrix} A & - I \end{matrix})$ that are fixed or have positive lower bounds or negative upper bounds.

Optional parameter Scale Tolerance affects how many passes might be needed through the constraint matrix. On each pass, the scaling procedure computes the ratio of the largest and smallest nonzero coefficients in each column:

$$ρ_{j} = \max_{i} | a_{i j} | / \min_{i} | a_{i j} | \quad (a_{i j} \neq 0) .$$

If $\max_{j} ρ_{j}$ is less than $r$ times its previous value, another scaling pass is performed to adjust the row and column scales. Raising $r$ from $0.9$ to $0.99$ (say) usually increases the number of scaling passes through $A$. At most $10$ passes are made. The value of $r$ should lie in the range $0 < r < 1$.
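The pass criterion above can be illustrated with a small sketch. It is not the library's implementation: the dense list-of-lists matrix, the function names `column_ratios` and `needs_another_pass`, and the treatment of an all-zero column are assumptions made only for illustration.

```python
def column_ratios(rows):
    """Return rho_j = max_i |a_ij| / min_i |a_ij| over the nonzeros of each column j."""
    ncols = len(rows[0])
    ratios = []
    for j in range(ncols):
        nonzeros = [abs(row[j]) for row in rows if row[j] != 0.0]
        # Assumption: a column with no nonzeros is given the neutral ratio 1.
        ratios.append(max(nonzeros) / min(nonzeros) if nonzeros else 1.0)
    return ratios


def needs_another_pass(previous_max_ratio, current_max_ratio, r=0.9):
    """Test described in the text: continue while max rho_j < r * (previous value)."""
    return current_max_ratio < r * previous_max_ratio


# Hypothetical 3 x 3 constraint matrix.
A = [[1.0, 200.0, 0.0],
     [0.5, 0.0, 3.0],
     [0.0, 4.0, 6.0]]
rho = column_ratios(A)
print(rho, max(rho))
```

With these helpers, a drop in the largest ratio from 50 to 47 does not trigger another pass when $r = 0.9$ but does when $r = 0.99$, which is why raising $r$ tends to increase the number of passes.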

Scale Print causes the row scales $r (i)$ and column scales $c (j)$ to be printed to Print File, if System Information Yes has been specified. The scaled matrix coefficients are ${\bar{a}}_{i j} = a_{i j} c (j) / r (i)$, and the scaled bounds on the variables and slacks are ${\bar{l}}_{j} = l_{j} / c (j)$ and ${\bar{u}}_{j} = u_{j} / c (j)$, where $c (j) = r (j - n)$ if $j > n$.
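As a worked illustration of these formulas, the following sketch applies given row and column scales to a small dense problem. The function name `scale_problem` and the data are hypothetical, indices are 0-based rather than 1-based, and infinite bounds are not treated specially.

```python
def scale_problem(A, lower, upper, r, c):
    """Apply row scales r(i) and column scales c(j) as in the formulas above.

    lower/upper hold bounds for the n variables followed by the m slacks.
    """
    m, n = len(A), len(A[0])
    # c(j) = r(j - n) for the slack columns j > n (1-based in the text).
    c_full = list(c) + list(r)
    A_bar = [[A[i][j] * c[j] / r[i] for j in range(n)] for i in range(m)]
    l_bar = [lower[j] / c_full[j] for j in range(n + m)]
    u_bar = [upper[j] / c_full[j] for j in range(n + m)]
    return A_bar, l_bar, u_bar


# Hypothetical data: 2 constraints, 2 variables.
A = [[2.0, 8.0],
     [4.0, 1.0]]
r = [2.0, 4.0]          # row scales
c = [0.5, 2.0]          # column scales
lower = [0.0, 0.0, -4.0, -8.0]
upper = [1.0, 4.0, 4.0, 8.0]
A_bar, l_bar, u_bar = scale_problem(A, lower, upper, r, c)
print(A_bar, l_bar, u_bar)
```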

Solution Yes  Default
Solution No

This option determines if the final obtained solution is to be output to the Print File. Note that the Solution File option operates independently.

Solution File  $i$  Default $= 0$

If $i > 0$, the final solution will be output to file $i$ (whether optimal or not).

To see more significant digits in the printed solution, it will sometimes be useful to make $i$ refer to the system Print File.

Summary File  $i_{1}$  Default $= 0$
Summary Frequency  $i_{2}$  Default $= 100$

If $i_{1} > 0$, the Summary File is output to file $i_{1}$, including a line of the iteration log every $i_{2}$th iteration. In an interactive environment, it is useful to direct this output to the terminal, to allow a run to be monitored online. (If something looks wrong, the run can be manually terminated.) Further details are given in Section 13. If $i_{2} \leq 0$, the value of $i_{2} = 99999999$ is used and effectively no checks are made.

Superbasics Limit  $i$  Default $= 1$ (LP) or $\min \{n_{H} + 1, n\}$ (QP)

This places a limit on the storage allocated for superbasic variables. Ideally, $i$ should be set slightly larger than the ‘number of degrees of freedom’ expected at an optimal solution.

For linear programs, an optimum is normally a basic solution with no degrees of freedom. (The number of variables lying strictly between their bounds is no more than $m$, the number of general constraints.) The default value of $i$ is, therefore, $1$.

For quadratic problems, the number of degrees of freedom is often called the ‘number of independent variables’. Normally, $i$ need not be greater than $n_{H} + 1$, where $n_{H}$ is the number of leading nonzero columns of $H$. For many problems, $i$ may be considerably smaller than $n_{H}$. This will save storage if $n_{H}$ is very large.
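The default values quoted for Superbasics Limit and Reduced Hessian Dimension can be written out as a short sketch. The function names are hypothetical, and treating a problem as an LP exactly when $n_{H} = 0$ is an assumption made here for illustration.

```python
def default_superbasics_limit(n, n_h):
    """Default quoted above: 1 for an LP, otherwise min(n_H + 1, n)."""
    # Assumption: the problem is an LP exactly when n_h == 0.
    return 1 if n_h == 0 else min(n_h + 1, n)


def default_reduced_hessian_dimension(n, n_h):
    """Default quoted above: 1 for an LP, otherwise min(2000, n_H + 1, n)."""
    return 1 if n_h == 0 else min(2000, n_h + 1, n)


# Hypothetical problem sizes: n variables, n_h leading nonzero columns of H.
print(default_superbasics_limit(5000, 3000))          # QP with a large Hessian
print(default_reduced_hessian_dimension(5000, 3000))  # capped at 2000
```

For the sizes shown, the superbasics limit exceeds the reduced Hessian dimension, which is the situation in which the solver may have to switch from the Cholesky solver to QPSolver CG.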

Suppress Parameters

Normally e04nqf prints the options file as it is being read, and then prints a complete list of the available keywords and their final values. The optional parameter Suppress Parameters tells e04nqf not to print the full list.

System Information No  Default
System Information Yes

This option prints additional information on the progress of major and minor iterations, and Crash statistics. See Section 13.

Timing Level  $i$  Default $= 0$

If $i > 0$, some timing information will be output to the Print file, if Print File $> 0$.

Unbounded Step Size  $r$  Default $= infbnd$

If $r > 0$, $r$ specifies the magnitude of the change in variables that will be considered a step to an unbounded solution. (Note that an unbounded solution can occur only when the Hessian is not positive definite.) If the change in $x$ during an iteration would exceed the value of $r$, the objective function is considered to be unbounded below in the feasible region. If $r \leq 0$, the default value is used. See Infinite Bound Size for the definition of $infbnd$.

13 Description of Monitoring Information

This section describes the intermediate and final printout that constitute the monitoring information produced by e04nqf. (See also the description of the optional parameters Print File and Print Level.) You can control the level of printed output.

13.1 Crash Statistics

When Print Level $\geq 10$, Print File $> 0$ and System Information Yes has been specified, the following lines of intermediate printout (less than $120$ characters) are produced on the unit number specified by optional parameter Print File whenever start = 'C' (see Section 5). They refer to the number of columns selected by the Crash procedure during each of several passes through $A$, whilst searching for a triangular basis matrix.

Label	Description
Slacks	is the number of slacks selected initially.
Free cols	is the number of free columns in the basis, including those whose bounds are rather far apart.
Preferred	is the number of ‘preferred’ columns in the basis (i.e., $hs (j) = 3$ for some $j \leq n$ ). It will be a subset of the columns for which $hs (j) = 3$ was specified.
Unit	is the number of unit columns in the basis.
Double	is the number of double columns in the basis.
Triangle	is the number of triangular columns in the basis.
Pad	is the number of slacks used to pad the basis (to make it a nonsingular triangle).

13.2 Basis Factorization Statistics

When Print Level $\geq 10$ and Print File $> 0$, the first seven items of intermediate printout in the list below are produced on the unit number specified by optional parameter Print File whenever the matrix $B$ or $B_{S} = {(\begin{matrix} B & S \end{matrix})}^{T}$ is factorized. Gaussian elimination is used to compute an $L U$ factorization of $B$ or $B_{S}$, where $P L P^{T}$ is a lower triangular matrix and $P U Q$ is an upper triangular matrix for some permutation matrices $P$ and $Q$. The factorization is stabilized in the manner described under the optional parameter LU Factor Tolerance. In addition, if System Information Yes has been specified, the entries from Elems onwards are also output.

Label	Description
Factor	the number of factorizations since the start of the run.
Demand	a code giving the reason for the present factorization, as follows:

Code	Meaning
0	First $L U$ factorization.
1	The number of updates reached the Factorization Frequency.
2	The nonzeros in the updated factors have increased significantly.
7	Not enough storage to update factors.
10	Row residuals are too large (see the description of the optional parameter Check Frequency).
11	Ill-conditioning has caused inconsistent results.

Itn	is the current minor iteration number.
Nonlin	is the number of nonlinear variables in the current basis $B$.
Linear	is the number of linear variables in $B$.
Slacks	is the number of slack variables in $B$.
B, BR, BS or BT factorize	is the type of $L U$ factorization.

B	periodic factorization of the basis $B$ .
BR	more careful rank-revealing factorization of $B$ using threshold rook pivoting. This occurs mainly at the start, if the first basis factors seem singular or ill-conditioned. Followed by a normal B factorize.
BS	$B_{S}$ is factorized to choose a well-conditioned $B$ from the current $(B S)$ . Followed by a normal B factorize.
BT	same as BS except the current $B$ is tried first and accepted if it appears to be not much more ill-conditioned than after the previous BS factorize.

m	is the number of rows in $B$ or $B_{S}$.
n	is the number of columns in $B$ or $B_{S}$. Preceded by ‘=’ or ‘>’ respectively.

Elems	is the number of nonzero elements in $B$ or $B_{S}$.
Amax	is the largest nonzero in $B$ or $B_{S}$.
Density	is the percentage nonzero density of $B$ or $B_{S}$.
Merit/MerRP/MerCP	Merit is the average Markowitz merit count for the elements chosen to be the diagonals of $P U Q$. Each merit count is defined to be $(c - 1) (r - 1)$ where $c$ and $r$ are the number of nonzeros in the column and row containing the element at the time it is selected to be the next diagonal. Merit is the average of n such quantities. It gives an indication of how much work was required to preserve sparsity during the factorization. If LU Complete Pivoting or LU Rook Pivoting has been selected, this heading is changed to MerCP or MerRP, respectively.

lenL	is the number of nonzeros in $L$.
L+U	is the number of nonzeros representing the basis factors $L$ and $U$. Immediately after a basis factorization $B = L U$, this is lenL+lenU, the number of subdiagonal elements in the columns of a lower triangular matrix and the number of diagonal and superdiagonal elements in the rows of an upper-triangular matrix. Further nonzeros are added to L when various columns of $B$ are later replaced. As columns of $B$ are replaced, the matrix $U$ is maintained explicitly (in sparse form). The value of L will steadily increase, whereas the value of U may fluctuate up or down. Thus the value of L+U may fluctuate up or down (in general, it will tend to increase).

Cmpressns	is the number of times the data structure holding the partially factored matrix needed to be compressed to recover unused storage. Ideally this number should be zero. If it is more than $3$ or $4$, the amount of workspace available to e04nqf should be increased for efficiency.

Incres	is the percentage increase in the number of nonzeros in $L$ and $U$ relative to the number of nonzeros in $B$ or $B_{S}$.
Utri	is the number of triangular rows of $B$ or $B_{S}$ at the top of $U$.
lenU	the number of nonzeros in $U$, including its diagonals.

Ltol	is the largest subdiagonal element allowed in $L$. This is the specified LU Factor Tolerance or a smaller value that is currently being used for greater stability.
Umax	the maximum nonzero element in $U$.
Ugrwth	is the ratio $Umax / Amax$, which ideally should not be substantially larger than $10.0$ or $100.0$. If it is orders of magnitude larger, it may be advisable to reduce the LU Factor Tolerance to $5.0$, $4.0$, $3.0$ or $2.0$, say (but bigger than $1.0$). As long as Lmax is not large (say $5.0$ or less), $\max (Amax, Umax) / DUmin$ gives an estimate of the condition number of $B$. If this is extremely large, the basis is nearly singular. Slacks are used to replace suspect columns of $B$ and the modified basis is refactored.

Ltri	is the number of triangular columns of $B$ or $B_{S}$ at the left of $L$.
dense1	is the number of columns remaining when the density of the basis matrix being factorized reached $0.3$.
Lmax	is the actual maximum subdiagonal element in $L$ (bounded by Ltol).
Akmax	is the largest nonzero generated at any stage of the $L U$ factorization. (Values much larger than Amax indicate instability.) Akmax is not printed if LU Partial Pivoting is selected.
Agrwth	is the ratio $Akmax / Amax$. Values much larger than $100$ (say) indicate instability. Agrwth is not printed if LU Partial Pivoting is selected.
bump	is the size of the block to be factorized nontrivially after the triangular rows and columns of $B$ or $B_{S}$ have been removed.
dense2	is the number of columns remaining when the density of the basis matrix being factorized reached $0.6$. (The Markowitz pivot strategy searches fewer columns at that stage.)
DUmax	is the largest diagonal of $P U Q$.
DUmin	is the smallest diagonal of $P U Q$.
condU	the ratio $DUmax / DUmin$, which estimates the condition number of $U$ (and of $B$ if Ltol is less than $5.0$, say).
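The derived quantities in this list (Merit, Ugrwth, condU and the condition estimate for $B$) are simple ratios of the printed values. The sketch below shows how they relate. The function names and the sample numbers are hypothetical, and the values would in practice be read from the printout rather than computed by the caller.

```python
def markowitz_merit(pivot_counts):
    """Average of (c - 1) * (r - 1) over the chosen pivots.

    pivot_counts holds one (c, r) pair per pivot: the nonzero counts of the
    pivot's column and row at the time the pivot was selected.
    """
    return sum((c - 1) * (r - 1) for c, r in pivot_counts) / len(pivot_counts)


def factor_diagnostics(amax, umax, dumax, dumin):
    """Ratios described above, formed from Amax, Umax, DUmax and DUmin."""
    return {
        "Ugrwth": umax / amax,                  # ideally not much above 10 to 100
        "condU": dumax / dumin,                 # estimate of cond(U)
        "condB_est": max(amax, umax) / dumin,   # meaningful when Lmax is small
    }


# Hypothetical values as they might be read from the printout.
merit = markowitz_merit([(1, 1), (2, 3), (3, 3)])
diag = factor_diagnostics(amax=4.0, umax=8.0, dumax=8.0, dumin=0.5)
print(merit, diag)
```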

13.3 Basis Map

When Print Level $\geq 10$ and Print File $> 0$, the following lines of intermediate printout (less than $80$ characters) are produced on the unit number specified by optional parameter Print File. They refer to the elements of the names array (see Section 5).

Label	Description
Name	gives the name for the problem (blank if problem unnamed).
Infeasibilities	gives the number of infeasibilities. Printed only if the final point is infeasible.
Objective Value	gives the objective value at the final point (or the value of the sum of infeasibilities). Printed only if the final point is feasible.
Status	gives the exit status for the problem (i.e., Optimal soln, Weak soln, Unbounded, Infeasible, Excess itns, Error condn or Feasble soln) followed by details of the direction of the optimization (i.e., (Min) or (Max)).
Iteration	gives the iteration number when the file was created.
Superbasics	gives the number of superbasic variables.
Objective	gives the name of the free row for the problem (blank if objective unnamed).
RHS	gives the name of the constraint right-hand side for the problem (blank if objective unnamed).
Ranges	gives the name of the ranges for the problem (blank if objective unnamed).
Bounds	gives the name of the bounds for the problem (blank if objective unnamed).

13.4 Solution Output

At the end of a run, the final solution will be output to the Print file. Some header information appears first to identify the problem and the final state of the optimization procedure. A ROWS section and a COLUMNS section then follow, giving one line of information for each row and column.

13.4.1 The ROWS section

General constraints take the form $l \leq A x \leq u$. The $i$th constraint is, therefore, of the form

$$α \leq ν_{i} x \leq β,$$

where $ν_{i}$ is the $i$th row of $A$.

Internally, the constraints take the form $A x - s = 0$, where $s$ is the set of slack variables (which happen to satisfy the bounds $l \leq s \leq u$). For the $i$th constraint, the slack variable $s_{i}$ is directly available, and it is sometimes convenient to refer to its state. It should satisfy $α \leq s_{i} \leq β$. A fullstop (.) is printed for any numerical value that is exactly zero.

Label	Description
Number	is the value of $n + i$. (This is used internally to refer to $s_{i}$ in the intermediate output.)
Row	gives the name of $ν_{i}$.
State	the state of $ν_{i}$ (the state of $s_{i}$ relative to the bounds $α$ and $β$). The various states possible are as follows:

LL	$s_{i}$ is nonbasic at its lower limit, $α$ .
UL	$s_{i}$ is nonbasic at its upper limit, $β$ .
EQ	$s_{i}$ is nonbasic and fixed at the value $α = β$ .
FR	$s_{i}$ is nonbasic and currently zero, even though it is free to take any value between its bounds $α$ and $β$ .
BS	$s_{i}$ is basic.
SBS	$s_{i}$ is superbasic.

A key is sometimes printed before State. Note that unless the optional parameter Scale Option $= 0$ is specified, the tests for assigning a key are applied to the variables of the scaled problem.

A	Alternative optimum possible. The variable is nonbasic, but its reduced gradient is essentially zero. This means that if the variable were allowed to start moving away from its bound, there would be no change in the value of the objective function. The values of the other free variables might change, giving a genuine alternative solution. However, if there are any degenerate variables (labelled D), the actual change might prove to be zero, since one of them could encounter a bound immediately. In either case, the values of the Lagrange multipliers might also change.
D	Degenerate. The variable is basic or superbasic, but it is equal (or very close) to one of its bounds.
I	Infeasible. The variable is basic or superbasic and is currently violating one of its bounds by more than the value of the Feasibility Tolerance.
N	Not precisely optimal. If the slack is superbasic, the dual variable $π_{i}$ is not sufficiently small, as measured by the Optimality Tolerance. If the slack is nonbasic, $π_{i}$ is not sufficiently positive or negative. If a loose Optimality Tolerance has been used, or if iterations were terminated before optimality, this key might be helpful in deciding whether or not to restart the run.

Activity	is the value of $ν_{i} x$ at the final iterate.
Slack Activity	is the value by which the row differs from its nearest bound. (For the free row (if any), it is set to Activity.)
Lower Limit	$α$, the lower bound specified for the variable $s_{i}$. None indicates that $bl (j) \leq - infbnd$.
Upper Limit	$β$, the upper bound specified for the variable $s_{i}$. None indicates that $bu (j) \geq infbnd$.
Dual Activity	is the value of the dual variable $π_{i}$ (the Lagrange multiplier for $ν_{i}$; see Section 11.3). For FP problems, $π_{i}$ is set to zero.
i	gives the index $i$ of the $i$th row.

13.4.2 The COLUMNS Section

Let the $j$th component of $x$ be the variable $x_{j}$ and assume that it satisfies the bounds $α \leq x_{j} \leq β$. A fullstop (.) is printed for any numerical value that is exactly zero.

Label	Description
Number	is the column number $j$. (This is used internally to refer to $x_{j}$ in the intermediate output.)
Column	gives the name of $x_{j}$.
State	the state of $x_{j}$ relative to the bounds $α$ and $β$. The various states possible are as follows:

LL	$x_{j}$ is nonbasic at its lower limit, $α$ .
UL	$x_{j}$ is nonbasic at its upper limit, $β$ .
EQ	$x_{j}$ is nonbasic and fixed at the value $α = β$ .
FR	$x_{j}$ is nonbasic and currently zero, even though it is free to take any value between its bounds $α$ and $β$ .
BS	$x_{j}$ is basic.
SBS	$x_{j}$ is superbasic.

A key is sometimes printed before State. Note that unless the optional parameter Scale Option $= 0$ is specified, the tests for assigning a key are applied to the variables of the scaled problem.

A	Alternative optimum possible. The variable is nonbasic, but its reduced gradient is essentially zero. This means that if the variable were allowed to start moving away from its bound, there would be no change in the value of the objective function. The values of the other free variables might change, giving a genuine alternative solution. However, if there are any degenerate variables (labelled D), the actual change might prove to be zero, since one of them could encounter a bound immediately. In either case, the values of the Lagrange multipliers might also change.
D	Degenerate. The variable is basic or superbasic, but it is equal (or very close) to one of its bounds.
I	Infeasible. The variable is basic or superbasic and is currently violating one of its bounds by more than the value of the Feasibility Tolerance.
N	Not precisely optimal. If the slack is superbasic, the dual variable $π_{i}$ is not sufficiently small, as measured by the Optimality Tolerance. If the slack is nonbasic, $π_{i}$ is not sufficiently positive or negative. If a loose Optimality Tolerance has been used, or if iterations were terminated before optimality, this key might be helpful in deciding whether or not to restart the run.

Activity	is the value of $x_{j}$ at the final iterate.
Obj Gradient	is the value of $g_{j}$ at the final iterate. For FP problems, $g_{j}$ is set to zero.
Lower Limit	is the lower bound specified for the variable. None indicates that $bl (j) \leq - infbnd$.
Upper Limit	is the upper bound specified for the variable. None indicates that $bu (j) \geq infbnd$.
Reduced Gradnt	is the value of $d_{j}$ at the final iterate (see Section 11.3). For FP problems, $d_{j}$ is set to zero.
m + j	is the value of $m + j$.

Note: if two problems are the same except that one minimizes $f (x)$ and the other maximizes $- f (x)$, their solutions will be the same but the signs of the dual variables $π_{i}$ and the reduced gradients $d_{j}$ will be reversed.

13.5 The Solution File

If a positive Solution File is specified, the information contained in a printed solution may also be output to the relevant file (which may be the Print file if so desired). Infinite Upper and Lower limits appear as $\pm 10^{20}$ rather than None. Other real values are output with format 1pe16.6. The maximum line length is $111$ characters, including what would be the carriage-control character if the file were printed.

A Solution file is intended to be read from disk by a self-contained program that extracts and saves certain values as required for possible further computation. Typically the first $14$ lines would be ignored. Each subsequent line may be read using

Format (i8, 2x, 2a4, 1x, a1, 1x, a3, 5e16.6, i7)

adapted to suit the occasion. The end of the ROWS section is marked by a line that starts with a $1$ and is otherwise blank. If this and the next $4$ lines are skipped, the COLUMNS section (see Section 13.4.2) can then be read under the same format. (There should be no need to use any BACKSPACE statements.)
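For readers who post-process a Solution file outside Fortran, the fixed-width layout implied by the format above can be sliced directly. The following is a minimal sketch, not part of the library. The function names, the dictionary keys and the sample line are hypothetical, and the field meanings (number, name, key, state, five real values, trailing index) are assumed from the ROWS and COLUMNS descriptions in Section 13.4.

```python
# Field widths taken from Format (i8, 2x, 2a4, 1x, a1, 1x, a3, 5e16.6, i7);
# together they account for the 111-character maximum line length.
LINE_LENGTH = 111


def parse_solution_line(line):
    """Split one ROWS/COLUMNS line of a Solution file into its fields."""
    line = line.rstrip("\n").ljust(LINE_LENGTH)
    pos = 0

    def take(width):
        nonlocal pos
        field = line[pos:pos + width]
        pos += width
        return field

    number = int(take(8))            # i8
    take(2)                          # 2x
    name = take(8).strip()           # 2a4
    take(1)                          # 1x
    key = take(1).strip()            # a1
    take(1)                          # 1x
    state = take(3).strip()          # a3
    # 5e16.6; accept Fortran-style D exponents as well as E.
    reals = [float(take(16).upper().replace("D", "E")) for _ in range(5)]
    index = int(take(7))             # i7
    return {"number": number, "name": name, "key": key,
            "state": state, "values": reals, "index": index}


def is_rows_terminator(line):
    """True for the line that starts with '1' and is otherwise blank."""
    return line[:1] == "1" and line[1:].strip() == ""


# Hypothetical line built to match the format, for demonstration only.
sample = (f"{12:8d}  {'COL1':<8} {'A':1} {'BS':>3}"
          + "".join(f"{v:16.6E}" for v in (1.5, -2.0, 0.0, 1e20, 0.25))
          + f"{3:7d}")
record = parse_solution_line(sample)
print(record)
```

A complete reader would skip the header lines, call `parse_solution_line` until `is_rows_terminator` returns true, skip that line and the next $4$ lines, and then continue with the COLUMNS section.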

13.6 The Summary File

If Summary File $f$ is specified with $f > 0$, certain brief information will be output to unit $f$. When e04nqf is run interactively, unit $f$ will usually be the terminal. For batch jobs a disk file should be used, to retain a concise log of each run if desired. (A Summary File is more easily perused than the associated Print file.)

A Summary file (like the Print file) is not rewound after a problem has been processed. The maximum line length is $72$ characters, including a carriage-control character in column 1.

The following information is included:

1. The optional parameters supplied via the option setting routines, if any;
2. The Basis file loaded, if any;
3. The status of the solution after each basis factorization (whether feasible; the objective value; the number of function calls so far);
4. The same information every $k$th iteration, where $k$ is the specified Summary Frequency;
5. Warnings and error messages;
6. The exit condition and a summary of the final solution.

Item 4 is preceded by a blank line, but item 5 is not.

The meaning of the printout for linear constraints is the same as that given above for variables, with ‘variable’ replaced by ‘constraint’, $n$ replaced by $m$, $names (j)$ replaced by $names (n + j)$, $bl (j)$ and $bu (j)$ replaced by $bl (n + j)$ and $bu (n + j)$ respectively, and with the following change in the heading:

Constrnt	gives the name of the linear constraint.

Note that movement off a constraint (as opposed to a variable moving away from its bound) can be interpreted as allowing the entry in the Residual column to become positive.

Numerical values are output with a fixed number of digits; they are not guaranteed to be accurate to this precision.
