naginterfaces.library.opt.qpconvex2_sparse_solve¶

naginterfaces.library.opt.qpconvex2_sparse_solve(start, m, ncolh, iobj, objadd, prob, acol, inda, loca, bl, bu, names, helast, hs, x, ns, comm, qphx=None, c=None, data=None, io_manager=None)[source]¶

qpconvex2_sparse_solve solves sparse linear programming or convex quadratic programming problems. The initialization function qpconvex2_sparse_init() must have been called before calling qpconvex2_sparse_solve.

Note: this function uses optional algorithmic parameters, see also: qpconvex2_sparse_option_file(), qpconvex2_sparse_option_string(), qpconvex2_sparse_option_integer_set(), qpconvex2_sparse_option_double_set(), qpconvex2_sparse_init(), qpconvex2_sparse_option_integer_get(), qpconvex2_sparse_option_double_get().

For full information please refer to the NAG Library document for e04nq

https://support.nag.com/numeric/nl/nagdoc_30.3/flhtml/e04/e04nqf.html

Parameters

startstr, length 1

Indicates how a starting basis (and certain other items) will be obtained.

$s t a r t ='C'$

Requests that an internal Crash procedure be used to choose an initial basis, unless a Basis file is provided via options ‘Old Basis File’, ‘Insert File’ or ‘Load File’.

$s t a r t ='B'$

Is the same as $s t a r t ='C'$ but is more meaningful when a Basis file is given.

$s t a r t ='W'$

Means that a basis is already defined in $h s$ and a start point is already defined in $x$ (probably from an earlier call).

mint

$m$ , the number of general linear constraints (or slacks). This is the number of rows in the linear constraint matrix $A$ , including the free row (if any; see $i o b j$ ). Note that $A$ must have at least one row. If your problem has no constraints, or only upper or lower bounds on the variables, then you must include a dummy row with sufficiently wide upper and lower bounds (see also $a c o l$ , $i n d a$ and $l o c a$ ).

ncolhint

$n_{H}$ , the number of leading nonzero columns of the Hessian matrix $H$ . For FP and LP problems, $n c o l h$ must be set to zero.

The first $n c o l h$ elements of $x$ belong to variables corresponding to the nonzero block of the QP Hessian.

iobjint

If $i o b j > 0$ , row $i o b j$ of $A$ is a free row containing the nonzero elements of the vector $c$ appearing in the linear objective term $c^{T} x$ .

If $i o b j = 0$ , there is no free row, and the linear objective vector should be supplied in array $c$ .

objaddfloat

The constant $q$ , to be added to the objective for printing purposes. Typically $o b j a d d = 0.0 e 0$ .

probstr, length 8

The name for the problem. It is used in the printed solution and in some functions that output Basis files. A blank name may be used.

acolfloat, array-like, shape $(ne)$

The nonzero elements of $A$ , ordered by increasing column index. Note that all elements must be assigned a value in the calling program.

indaint, array-like, shape $(ne)$

$i n d a [i - 1]$ must contain the row index of the nonzero element stored in $a c o l [i - 1]$ , for $i = 1, 2, \dots, ne$ . Thus a pair of values $(a c o l [i - 1], i n d a [i - 1])$ contains a matrix element and its corresponding row index.

Note that the row indices for a column may be supplied in any order.

locaint, array-like, shape $(n + 1)$

$l o c a [j - 1]$ must contain the index in $a c o l$ and $i n d a$ of the start of the $j$ th column, for $j = 1, 2, \dots, n$ . Thus for $j = 1 : n$ , the entries of column $j$ are held in $a c o l [k - 1 : l]$ and their corresponding row indices are in $i n d a [k - 1 : l]$ , where $k = l o c a [j - 1]$ and $l = l o c a [j] - 1$ . To specify the $j$ th column as empty, set $l o c a [j - 1] = l o c a [j]$ . Note that the first and last elements of $l o c a$ must be $l o c a [0] = 1$ and $l o c a [n] = ne + 1$ . If your problem has no constraints, or just bounds on the variables, you may include a dummy ‘free’ row with a single (zero) element by setting $ne = 1$ , $a c o l [0] = 0.0$ , $i n d a [0] = 1$ , $l o c a [0] = 1$ , and $l o c a [j - 1] = 2$ , for $j = 2 : n + 1$ . This row is made ‘free’ by setting its bounds to be $b l [n] = - bigbnd$ and $b u [n] = bigbnd$ , where $bigbnd$ is the value of the option ‘Infinite Bound Size’.

blfloat, array-like, shape $(n + m)$

$l$ , the lower bounds for all the variables and general constraints, in the following order. The first $n$ elements of $b l$ must contain the bounds on the variables $x$ , and the next $m$ elements the bounds for the general linear constraints $A x$ (which, equivalently, are the bounds for the slacks, $s$ ) and the free row (if any). To fix the $j$ th variable, set $b l [j - 1] = b u [j - 1] = β$ , say, where $| β | < bigbnd$ . To specify a nonexistent lower bound (i.e., $l_{j} = - \infty$ ), set $b l [j - 1] \leq - bigbnd$ . Here, $bigbnd$ is the value of the option ‘Infinite Bound Size’. To specify the $j$ th constraint as an equality, set $b l [n + j - 1] = b u [n + j - 1] = β$ , say, where $| β | < bigbnd$ . Note that the lower bound corresponding to the free row must be set to $- \infty$ and stored in $b l [n + i o b j - 1]$ .

bufloat, array-like, shape $(n + m)$

$u$ , the upper bounds for all the variables and general constraints, in the following order. The first $n$ elements of $b u$ must contain the bounds on the variables $x$ , and the next $m$ elements the bounds for the general linear constraints $A x$ (which, equivalently, are the bounds for the slacks, $s$ ) and the free row (if any). To specify a nonexistent upper bound (i.e., $u_{j} = + \infty$ ), set $b u [j - 1] \geq bigbnd$ . Note that the upper bound corresponding to the free row must be set to $+ \infty$ and stored in $b u [n + i o b j - 1]$ .

namesstr, length 8, array-like, shape $(nname)$

The optional column and row names, respectively.

If $nname = 1$ , $n a m e s$ is not referenced and the printed output will use default names for the columns and rows.

If $nname = n + m$ , the first $n$ elements must contain the names for the columns and the next $m$ elements must contain the names for the rows.

Note that the name for the free row (if any) must be stored in $n a m e s [n + i o b j - 1]$ .

helastint, array-like, shape $(n + m)$

Defines which variables are to be treated as being elastic in elastic mode. The allowed values of $h e l a s t$ are:

$h e l a s t [j - 1]$	Status in elastic mode
$0$	Variable $j$ is non-elastic and cannot be infeasible
$1$	Variable $j$ can violate its lower bound
$2$	Variable $j$ can violate its upper bound
$3$	Variable $j$ can violate either its lower or upper bound

$h e l a s t$ need not be assigned if option $‘Elastic Mode' = 0$ .

hsint, array-like, shape $(n + m)$

If $s t a r t ='C'$ or $'B'$ , and a Basis file of some sort is to be input (see the description of the options ‘Old Basis File’, ‘Insert File’ or ‘Load File’), then $h s$ and $x$ need not be set at all.

If $s t a r t ='C'$ or $'B'$ and there is no Basis file, the first $n$ elements of $h s$ and $x$ must specify the initial states and values, respectively, of the variables $x$ . (The slacks $s$ need not be initialized.) An internal Crash procedure is then used to select an initial basis matrix $B$ .

The initial basis matrix will be triangular (neglecting certain small elements in each column).

It is chosen from various rows and columns of $(\begin{matrix} A & - I \end{matrix})$ .

Possible values for $h s [j - 1]$ are as follows:

$h s [j - 1]$	State of $x [j - 1]$ during Crash procedure
$0$ or $1$	Eligible for the basis
$2$	Ignored
$3$	Eligible for the basis (given preference over $0$ or $1$ )
$4$ or $5$	Ignored

If nothing special is known about the problem, or there is no wish to provide special information, you may set $h s [j - 1] = 0$ and $x [j - 1] = 0.0$ , for $j = 1, 2, \dots, n$ .

All variables will then be eligible for the initial basis.

Less trivially, to say that the $j$ th variable will probably be equal to one of its bounds, set $h s [j - 1] = 4$ and $x [j - 1] = b l [j - 1]$ or $h s [j - 1] = 5$ and $x [j - 1] = b u [j - 1]$ as appropriate.

Following the Crash procedure, variables for which $h s [j - 1] = 2$ are made superbasic.

Other variables not selected for the basis are then made nonbasic at the value $x [j - 1]$ if $b l [j - 1] \leq x [j - 1] \leq b u [j - 1]$ , or at the value $b l [j - 1]$ or $b u [j - 1]$ closest to $x [j - 1]$ .

If $s t a r t ='W'$ , $h s$ and $x$ must specify the initial states and values, respectively, of the variables and slacks $(x, s)$ .

If qpconvex2_sparse_solve has been called previously with the same values of $n$ and $m$ , $h s$ already contains satisfactory information.

xfloat, array-like, shape $(n + m)$

The initial values of the variables $x$ , and, if $s t a r t ='W'$ , the slacks $s$ , i.e., $(x, s)$ . (See the description for argument $h s$ .)

nsint

$n_{S}$ , the number of superbasics. For QP problems, $n s$ need not be specified if $s t a r t ='C'$ , but must retain its value from a previous call when $s t a r t ='W'$ . For FP and LP problems, $n s$ need not be initialized.

commdict, communication object, modified in place

Communication structure.

This argument must have been initialized by a prior call to qpconvex2_sparse_init().

qphxNone or callable hx = qphx(x, nstate, data=None), optional

Note: if this argument is None then a NAG-supplied facility will be used.

For QP problems, you must supply a version of $q p h x$ to compute the matrix product $H x$ for a given vector $x$ .

If $H$ has rows and columns of zeros, it is most efficient to order $x$ so that the nonlinear variables appear first.

For example, if $x = {(y, z)}^{T}$ and only $y$ enters the objective quadratically, then

\begin{matrix} H x = (\begin{matrix} H_{1} & 0 0 & 0 \end{matrix}) (\begin{matrix} y z \end{matrix}) = (\begin{matrix} H_{1} y 0 \end{matrix}) . \end{matrix}

In this case, $n c o l h$ should be the dimension of $y$ , and $q p h x$ should compute $H_{1} y$ .

For FP and LP problems, $q p h x$ will never be called by qpconvex2_sparse_solve and hence $q p h x$ may be None.

Parameters

xfloat, ndarray, shape $(ncolh)$

The first $n c o l h$ elements of the vector $x$ .

nstateint

Allows you to save computation time if certain data must be read or calculated only once. To preserve this data for a subsequent calculation place it in one of $c u s e r$ , $r u s e r$ or $i u s e r$ .

$n s t a t e = 1$

qpconvex2_sparse_solve is calling $q p h x$ for the first time.

$n s t a t e = 0$

There is nothing special about the current call of $q p h x$ .

$n s t a t e \geq 2$

qpconvex2_sparse_solve is calling $q p h x$ for the last time. This argument setting allows you to perform some additional computation on the final solution.

$n s t a t e = 2$

The current $x$ is optimal.

$n s t a t e = 3$

The problem appears to be infeasible.

$n s t a t e = 4$

The problem appears to be unbounded.

$n s t a t e = 5$

The iterations limit was reached.

dataarbitrary, optional, modifiable in place

User-communication data for callback functions.

Returns

hxfloat, array-like, shape $(ncolh)$: The product $H x$ . If $n c o l h$ is less than the input argument $n$ , $H x$ is really the product $H_{1} y$ in [equation].

cNone or float, array-like, shape $(lenc)$ , optional

Contains the explicit objective vector $c$ (if any). If the problem is of type FP, or if $lenc = 0$ , $c$ is not referenced. (In that case, $c$ may be dimensioned (1), or it could be any convenient array.)

dataarbitrary, optional

User-communication data for callback functions.

io_managerFileObjManager, optional

Manager for I/O in this routine.

Returns

hsint, ndarray, shape $(n + m)$

The final states of the variables and slacks $(x, s)$ . The significance of each possible value of $h s [j - 1]$ is as follows:

$h s [j - 1]$	State of variable $j$	Normal value of $x [j - 1]$
$0$	Nonbasic	$b l [j - 1]$
$1$	Nonbasic	$b u [j - 1]$
$2$	Superbasic	Between $b l [j - 1]$ and $b u [j - 1]$
$3$	Basic	Between $b l [j - 1]$ and $b u [j - 1]$

If $n i n f = 0$ , basic and superbasic variables may be outside their bounds by as much as the value of the option ‘Feasibility Tolerance’.

Note that unless the option $‘Scale Option' = 0$ is specified, the option ‘Feasibility Tolerance’ applies to the variables of the scaled problem.

In this case, the variables of the original problem may be as much as $0.1$ outside their bounds, but this is unlikely unless the problem is very badly scaled.

Very occasionally some nonbasic variables may be outside their bounds by as much as the option ‘Feasibility Tolerance’, and there may be some nonbasic variables for which $x [j - 1]$ lies strictly between its bounds.

If $n i n f > 0$ , some basic and superbasic variables may be outside their bounds by an arbitrary amount (bounded by $s i n f$ if $‘Scale Option' = 0$ ).

xfloat, ndarray, shape $(n + m)$

The final values of the variables and slacks $(x, s)$ .

pifloat, ndarray, shape $(m)$

Contains the dual variables $π$ (a set of Lagrange multipliers (shadow prices) for the general constraints).

rcfloat, ndarray, shape $(n + m)$

Contains the reduced costs, $g - {(\begin{matrix} A & - I \end{matrix})}^{T} π$ . The vector $g$ is the gradient of the objective if $x$ is feasible; otherwise, it is the gradient of the Phase 1 objective. In the former case, $g (i) = 0$ , for $i = n + 1 : m$ , hence $r c [n + 1 : m - 1] = π$ .

nsint

The final number of superbasics. This will be zero for FP and LP problems.

ninfint

The number of infeasibilities.

sinffloat

The sum of the scaled infeasibilities. This will be zero if $n i n f = 0$ , and is most meaningful when $‘Scale Option' = 0$ .

objfloat

The value of the objective function.

If $n i n f = 0$ , $o b j$ includes the quadratic objective term $\frac{1}{2} x^{T} H x$ (if any).

If $n i n f > 0$ , $o b j$ is just the linear objective term $c^{T} x$ (if any).

For FP problems, $o b j$ is set to zero.

Note that $o b j$ does not include contributions from the constant term $o b j a d d$ or the objective row, if any.

Other Parameters

‘Check Frequency’int

Default $= 60$

Every $i$ th iteration after the most recent basis factorization, a numerical test is made to see if the current solution $(x, s)$ satisfies the linear constraints $A x - s = 0$ . If the largest element of the residual vector $r = A x - s$ is judged to be too large, the current basis is refactorized and the basic variables recomputed to satisfy the constraints more accurately. If $i \leq 0$ , the value $i = 99999999$ is used and effectively no checks are made.

$‘Check Frequency' = 1$ is useful for debugging purposes, but otherwise this option should not be needed.

‘Crash Option’int

Default $= 3$

Note that these options do not apply when $s t a r t ='W'$ (see Parameters).

If $s t a r t ='C'$ , an internal Crash procedure is used to select an initial basis from various rows and columns of the constraint matrix $(\begin{matrix} A & - I \end{matrix})$ . The value of $i$ determines which rows and columns of $A$ are initially eligible for the basis, and how many times the Crash procedure is called. Columns of $- I$ are used to pad the basis where necessary.

$i$	Meaning
$0$	The initial basis contains only slack variables: $B = I$ .
$1$	The Crash procedure is called once, looking for a triangular basis in all rows and columns of the matrix $A$ .
$2$	The Crash procedure is called once, looking for a triangular basis in rows.
$3$	The Crash procedure is called twice, treating linear equalities and linear inequalities separately.

If $i \geq 1$ , certain slacks on inequality rows are selected for the basis first. (If $i \geq 2$ , numerical values are used to exclude slacks that are close to a bound.) The Crash procedure then makes several passes through the columns of $A$ , searching for a basis matrix that is essentially triangular. A column is assigned to ‘pivot’ on a particular row if the column contains a suitably large element in a row that has not yet been assigned. (The pivot elements ultimately form the diagonals of the triangular basis.) For remaining unassigned rows, slack variables are inserted to complete the basis.

The ‘Crash Tolerance’ allows the Crash procedure to ignore certain ‘small’ nonzero elements in each column of $A$ . If $a_{m a x}$ is the largest element in column $j$ , other nonzeros $a_{i j}$ in the column are ignored if $∣ ∣ a_{i j} ∣ ∣ \leq a_{m a x} \times r$ . (To be meaningful, $r$ should be in the range $0 \leq r < 1$ .)

When $r > 0.0$ , the basis obtained by the Crash procedure may not be strictly triangular, but it is likely to be nonsingular and almost triangular. The intention is to obtain a starting basis containing more columns of $A$ and fewer (arbitrary) slacks. A feasible solution may be reached sooner on some problems.

For example, suppose the first $m$ columns of $A$ form the matrix shown under ‘LU Factor Tolerance’; i.e., a tridiagonal matrix with entries $- 1$ , $4$ , $- 1$ . To help the Crash procedure choose all $m$ columns for the initial basis, we would specify a ‘Crash Tolerance’ of $r$ for some value of $r > 0.5$ .

‘Crash Tolerance’float

Default $= 0.1$

Note that these options do not apply when $s t a r t ='W'$ (see Parameters).

If $s t a r t ='C'$ , an internal Crash procedure is used to select an initial basis from various rows and columns of the constraint matrix $(\begin{matrix} A & - I \end{matrix})$ . The value of $i$ determines which rows and columns of $A$ are initially eligible for the basis, and how many times the Crash procedure is called. Columns of $- I$ are used to pad the basis where necessary.

$i$	Meaning
$0$	The initial basis contains only slack variables: $B = I$ .
$1$	The Crash procedure is called once, looking for a triangular basis in all rows and columns of the matrix $A$ .
$2$	The Crash procedure is called once, looking for a triangular basis in rows.
$3$	The Crash procedure is called twice, treating linear equalities and linear inequalities separately.

If $i \geq 1$ , certain slacks on inequality rows are selected for the basis first. (If $i \geq 2$ , numerical values are used to exclude slacks that are close to a bound.) The Crash procedure then makes several passes through the columns of $A$ , searching for a basis matrix that is essentially triangular. A column is assigned to ‘pivot’ on a particular row if the column contains a suitably large element in a row that has not yet been assigned. (The pivot elements ultimately form the diagonals of the triangular basis.) For remaining unassigned rows, slack variables are inserted to complete the basis.

The ‘Crash Tolerance’ allows the Crash procedure to ignore certain ‘small’ nonzero elements in each column of $A$ . If $a_{m a x}$ is the largest element in column $j$ , other nonzeros $a_{i j}$ in the column are ignored if $∣ ∣ a_{i j} ∣ ∣ \leq a_{m a x} \times r$ . (To be meaningful, $r$ should be in the range $0 \leq r < 1$ .)

When $r > 0.0$ , the basis obtained by the Crash procedure may not be strictly triangular, but it is likely to be nonsingular and almost triangular. The intention is to obtain a starting basis containing more columns of $A$ and fewer (arbitrary) slacks. A feasible solution may be reached sooner on some problems.

For example, suppose the first $m$ columns of $A$ form the matrix shown under ‘LU Factor Tolerance’; i.e., a tridiagonal matrix with entries $- 1$ , $4$ , $- 1$ . To help the Crash procedure choose all $m$ columns for the initial basis, we would specify a ‘Crash Tolerance’ of $r$ for some value of $r > 0.5$ .

‘Defaults’valueless

This special keyword may be used to reset all options to their default values.

‘Dump File’int

Default $= 0$

Options ‘Dump File’ and ‘Load File’ are similar to options ‘Punch File’ and ‘Insert File’, but they record solution information in a manner that is more direct and more easily modified. A full description of information recorded in options ‘Dump File’ and ‘Load File’ is given in Gill et al. (2005a).

If $i_{1} > 0$ , the last solution obtained will be output to the file with unit number $i$ .

If $i_{2} > 0$ , the ‘Load File’ containing basis information will be read. The file will usually have been output previously as a ‘Dump File’. The file will not be accessed if options ‘Old Basis File’ or ‘Insert File’ are specified.

‘Load File’int

Default $= 0$

Options ‘Dump File’ and ‘Load File’ are similar to options ‘Punch File’ and ‘Insert File’, but they record solution information in a manner that is more direct and more easily modified. A full description of information recorded in options ‘Dump File’ and ‘Load File’ is given in Gill et al. (2005a).

If $i_{1} > 0$ , the last solution obtained will be output to the file with unit number $i$ .

If $i_{2} > 0$ , the ‘Load File’ containing basis information will be read. The file will usually have been output previously as a ‘Dump File’. The file will not be accessed if options ‘Old Basis File’ or ‘Insert File’ are specified.

‘Elastic Mode’int

Default $= 1$

This argument determines if (and when) elastic mode is to be started. Three elastic modes are available as follows:

$i$	Meaning
$0$	Elastic mode is never invoked. `qpconvex2_sparse_solve` will terminate as soon as infeasibility is detected. There may be other points with significantly smaller sums of infeasibilities.
$1$	Elastic mode is invoked only if the constraints are found to be infeasible (the default). If the constraints are infeasible, continue in elastic mode with the composite objective determined by the values of the options ‘Elastic Objective’ and ‘Elastic Weight’.
$2$	The iterations start and remain in elastic mode. This option allows you to minimize the composite objective function directly without first performing Phase 1 iterations. The success of this option will depend critically on your choice of ‘Elastic Weight’. If ‘Elastic Weight’ is sufficiently large and the constraints are feasible, the minimizer of the composite objective and the solution of the original problem are identical. However, if the ‘Elastic Weight’ is not sufficiently large, the minimizer of the composite function may be infeasible, even if a feasible point exists.

‘Elastic Objective’int

Default $= 1$

This determines the form of the composite objective $f (x) + γ \sum_{j} (v_{j} + w_{j})$ in Phase 2 ( $γ$ ). Three types of composite objectives are available.

$i$	Meaning
$0$	Include only the true objective $f (x)$ in the composite objective. This option sets $γ = 0$ in the composite objective and allows `qpconvex2_sparse_solve` to ignore the elastic bounds and find a solution that minimizes $f (x)$ subject to the non-elastic constraints. This option is useful if there are some ‘soft’ constraints that you would like to ignore if the constraints are infeasible.
$1$	Use a composite objective defined with $γ$ determined by the value of ‘Elastic Weight’. This value is intended to be used in conjunction with $‘Elastic Mode' = 2$ .
$2$	Include only the elastic variables in the composite objective. The elastics are weighted by $γ = 1$ . This choice minimizes the violations of the elastic variables at the expense of possibly increasing the true objective. This option can be used to find a point that minimizes the sum of the violations of a subset of constraints specified by the input array $h e l a s t$ .

‘Elastic Weight’float

Default $= 1.0$

This defines the value of $γ$ in the composite objective in Phase 2 ( $γ$ ).

At each iteration of elastic mode, the composite objective is defined to be

m i n i m i z e σ f (x) + γ (sum of infeasibilities);

where $σ = 1$ for ‘Minimize’, $σ = - 1$ for ‘Maximize’, and $f (x)$ is the quadratic objective.

Note that the effect of $γ$ is not disabled once a feasible point is obtained.

‘Expand Frequency’int

Default $= 10000$

This option is part of an anti-cycling procedure (see Miscellaneous) designed to allow progress even on highly degenerate problems.

The strategy is to force a positive step at every iteration, at the expense of violating the constraints by a small amount. Suppose that the value of the option ‘Feasibility Tolerance’ is $δ$ . Over a period of $i$ iterations, the feasibility tolerance actually used by qpconvex2_sparse_solve (i.e., the working feasibility tolerance) increases from $0.5 δ$ to $δ$ (in steps of $0.5 δ / i$ ).

Increasing the value of $i$ helps reduce the number of slightly infeasible nonbasic variables (most of which are eliminated during the resetting procedure). However, it also diminishes the freedom to choose a large pivot element (see the description of the option ‘Pivot Tolerance’).

If $i \leq 0$ , the value $i = 99999999$ is used and effectively no anti-cycling procedure is invoked.

‘Factorization Frequency’int

Default $= 100 (L P)$ or $50 (Q P)$

If $i > 0$ , at most $i$ basis changes will occur between factorizations of the basis matrix.

For LP problems, the basis factors are usually updated at every iteration. Higher values of $i$ may be more efficient on problems that are extremely sparse and well scaled.

For QP problems, fewer basis updates will occur as the solution is approached. The number of iterations between basis factorizations will, therefore, increase. During these iterations a test is made regularly according to the value of option ‘Check Frequency’ to ensure that the linear constraints $A x - s = 0$ are satisfied. Occasionally, the basis will be refactorized before the limit of $i$ updates is reached. If $i \leq 0$ , the default value is used.

‘Feasibility Tolerance’float

Default $= m a x {10^{- 6}, \sqrt{ϵ}}$

A feasible problem is one in which all variables satisfy their upper and lower bounds to within the absolute tolerance $r$ . (This includes slack variables. Hence, the general constraints are also satisfied to within $r$ .)

qpconvex2_sparse_solve attempts to find a feasible solution before optimizing the objective function. If the sum of infeasibilities cannot be reduced to zero, the problem is assumed to be infeasible. Let sInf be the corresponding sum of infeasibilities. If sInf is quite small, it may be appropriate to raise $r$ by a factor of $10$ or $100$ . Otherwise, some error in the data should be suspected.

Note that if sInf is not small and you have not asked qpconvex2_sparse_solve to minimize the violations of the elastic variables (i.e., you have not specified $‘Elastic Objective' = 2$ ), there may be other points that have a significantly smaller sum of infeasibilities. qpconvex2_sparse_solve will not attempt to find the solution that minimizes the sum unless $‘Elastic Objective' = 2$ .

If the constraints and variables have been scaled (see the description of the option ‘Scale Option’), then feasibility is defined in terms of the scaled problem (since it is more likely to be meaningful).

‘Infinite Bound Size’float

Default $= 10^{20}$

If $r \geq 0$ , $r$ defines the ‘infinite’ bound $infbnd$ in the definition of the problem constraints. Any upper bound greater than or equal to $infbnd$ will be regarded as $+ \infty$ (and similarly any lower bound less than or equal to $- infbnd$ will be regarded as $- \infty$ ). If $r < 0$ , the default value is used.

‘Iterations Limit’int

Default $= m a x {10000, 10 m a x {m, n}}$

The value of $i$ specifies the maximum number of iterations allowed before termination. Setting $i = 0$ and $‘Print Level' > 0$ means that: the workspace needed to start solving the problem will be computed and printed; and feasibility and optimality will be checked. No iterations will be performed. If $i < 0$ , the default value is used.

‘LU Density Tolerance’float

Default $= 0.6$

The density tolerance $r_{1}$ is used during $L U$ factorization of the basis matrix. Columns of $L$ and rows of $U$ are formed one at a time, and the remaining rows and columns of the basis are altered appropriately. At any stage, if the density of the remaining matrix exceeds $r_{1}$ , the Markowitz strategy for choosing pivots is terminated. The remaining matrix is factored by a dense $L U$ procedure. Raising the density tolerance towards $1.0$ may give slightly sparser $L U$ factors, with a slight increase in factorization time.

If $r_{2} > 0$ , $r_{2}$ defines the singularity tolerance used to guard against ill-conditioned basis matrices. After $B$ is refactorized, the diagonal elements of $U$ are tested as follows. If $∣ ∣ u_{j j} ∣ ∣ \leq r_{2}$ or $∣ ∣ u_{j j} ∣ ∣ < r_{2} {m a x}_{i} ∣ ∣ u_{i j} ∣ ∣$ , the $j$ th column of the basis is replaced by the corresponding slack variable. If $r_{2} \leq 0$ , the default value is used.

‘LU Singularity Tolerance’float

Default $= ϵ^{\frac{2}{3}}$

The density tolerance $r_{1}$ is used during $L U$ factorization of the basis matrix. Columns of $L$ and rows of $U$ are formed one at a time, and the remaining rows and columns of the basis are altered appropriately. At any stage, if the density of the remaining matrix exceeds $r_{1}$ , the Markowitz strategy for choosing pivots is terminated. The remaining matrix is factored by a dense $L U$ procedure. Raising the density tolerance towards $1.0$ may give slightly sparser $L U$ factors, with a slight increase in factorization time.

If $r_{2} > 0$ , $r_{2}$ defines the singularity tolerance used to guard against ill-conditioned basis matrices. After $B$ is refactorized, the diagonal elements of $U$ are tested as follows. If $∣ ∣ u_{j j} ∣ ∣ \leq r_{2}$ or $∣ ∣ u_{j j} ∣ ∣ < r_{2} {m a x}_{i} ∣ ∣ u_{i j} ∣ ∣$ , the $j$ th column of the basis is replaced by the corresponding slack variable. If $r_{2} \leq 0$ , the default value is used.

‘LU Factor Tolerance’float

Default $= 100.0$

The values of $r_{1}$ and $r_{2}$ affect the stability and sparsity of the basis factorization $B = L U$ , during refactorization and updates respectively. The lower triangular matrix $L$ is a product of matrices of the form

\begin{matrix} (\begin{matrix} 1 μ & 1 \end{matrix}) \end{matrix}

where the multipliers $μ$ will satisfy $| μ | \leq r_{i}$ . The default values of $r_{1}$ and $r_{2}$ usually strike a good compromise between stability and sparsity. They must satisfy $r_{1}$ , $r_{2} \geq 1.0$ .

For large and relatively dense problems, $r_{1} = 10.0$ or $5.0$ (say) may give a useful improvement in stability without impairing sparsity to a serious degree.

For certain very regular structures (e.g., band matrices) it may be necessary to reduce $r_{1} and/or r_{2}$ in order to achieve stability. For example, if the columns of $A$ include a sub-matrix of the form

\begin{matrix} ⎛ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ \begin{matrix} 4 & - 1 - 1 & 4 & - 1 - 1 & 4 & - 1 \dots & \dots & \dots - 1 & 4 & - 1 - 1 & 4 \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎠, \end{matrix}

one should set both $r_{1}$ and $r_{2}$ to values in the range $1.0 \leq r_{i} < 4.0$ .

‘LU Update Tolerance’float

Default $= 10.0$

The values of $r_{1}$ and $r_{2}$ affect the stability and sparsity of the basis factorization $B = L U$ , during refactorization and updates respectively. The lower triangular matrix $L$ is a product of matrices of the form

\begin{matrix} (\begin{matrix} 1 μ & 1 \end{matrix}) \end{matrix}

where the multipliers $μ$ will satisfy $| μ | \leq r_{i}$ . The default values of $r_{1}$ and $r_{2}$ usually strike a good compromise between stability and sparsity. They must satisfy $r_{1}$ , $r_{2} \geq 1.0$ .

For large and relatively dense problems, $r_{1} = 10.0$ or $5.0$ (say) may give a useful improvement in stability without impairing sparsity to a serious degree.

For certain very regular structures (e.g., band matrices) it may be necessary to reduce $r_{1} and/or r_{2}$ in order to achieve stability. For example, if the columns of $A$ include a sub-matrix of the form

\begin{matrix} ⎛ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ \begin{matrix} 4 & - 1 - 1 & 4 & - 1 - 1 & 4 & - 1 \dots & \dots & \dots - 1 & 4 & - 1 - 1 & 4 \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎠, \end{matrix}

one should set both $r_{1}$ and $r_{2}$ to values in the range $1.0 \leq r_{i} < 4.0$ .

‘LU Partial Pivoting’valueless

Default

The $L U$ factorization implements a Markowitz-type search for pivots that locally minimize the fill-in subject to a threshold pivoting stability criterion. The default option is to use threshold partial pivoting. The options ‘LU Complete Pivoting’ and ‘LU Rook Pivoting’ are more expensive but more stable and better at revealing rank, as long as the ‘LU Factor Tolerance’ is not too large (say $< 2.0$ ).

‘LU Complete Pivoting’valueless

The $L U$ factorization implements a Markowitz-type search for pivots that locally minimize the fill-in subject to a threshold pivoting stability criterion. The default option is to use threshold partial pivoting. The options ‘LU Complete Pivoting’ and ‘LU Rook Pivoting’ are more expensive but more stable and better at revealing rank, as long as the ‘LU Factor Tolerance’ is not too large (say $< 2.0$ ).

‘LU Rook Pivoting’valueless

The $L U$ factorization implements a Markowitz-type search for pivots that locally minimize the fill-in subject to a threshold pivoting stability criterion. The default option is to use threshold partial pivoting. The options ‘LU Complete Pivoting’ and ‘LU Rook Pivoting’ are more expensive but more stable and better at revealing rank, as long as the ‘LU Factor Tolerance’ is not too large (say $< 2.0$ ).

‘Minimize’valueless

Default

This option specifies the required direction of the optimization. It applies to both linear and nonlinear terms (if any) in the objective function. Note that if two problems are the same except that one minimizes $f (x)$ and the other maximizes $- f (x)$ , their solutions will be the same but the signs of the dual variables $π_{i}$ and the reduced gradients $d_{j}$ (see Main Iteration) will be reversed.

The option ‘Feasible Point’ means ‘ignore the objective function, while finding a feasible point for the linear constraints’. It can be used to check that the constraints are feasible without altering the call to qpconvex2_sparse_solve.

‘Maximize’valueless

This option specifies the required direction of the optimization. It applies to both linear and nonlinear terms (if any) in the objective function. Note that if two problems are the same except that one minimizes $f (x)$ and the other maximizes $- f (x)$ , their solutions will be the same but the signs of the dual variables $π_{i}$ and the reduced gradients $d_{j}$ (see Main Iteration) will be reversed.

The option ‘Feasible Point’ means ‘ignore the objective function, while finding a feasible point for the linear constraints’. It can be used to check that the constraints are feasible without altering the call to qpconvex2_sparse_solve.

‘Feasible Point’valueless

This option specifies the required direction of the optimization. It applies to both linear and nonlinear terms (if any) in the objective function. Note that if two problems are the same except that one minimizes $f (x)$ and the other maximizes $- f (x)$ , their solutions will be the same but the signs of the dual variables $π_{i}$ and the reduced gradients $d_{j}$ (see Main Iteration) will be reversed.

The option ‘Feasible Point’ means ‘ignore the objective function, while finding a feasible point for the linear constraints’. It can be used to check that the constraints are feasible without altering the call to qpconvex2_sparse_solve.

‘New Basis File’int

Default $= 0$

Options ‘New Basis File’ and ‘Backup Basis File’ are sometimes referred to as basis maps. They contain the most compact representation of the state of each variable. They are intended for restarting the solution of a problem at a point that was reached by an earlier run. For nontrivial problems, it is advisable to save basis maps at the end of a run, in order to restart the run if necessary.

If $i_{1} > 0$ , a basis map will be saved on file $i_{1}$ every $i_{3}$ th iteration, where $i_{3}$ is the ‘Save Frequency’. The first record of the file will contain the word PROCEEDING if the run is still in progress. A basis map will also be saved at the end of a run, with some other word indicating the final solution status.

Use of $i_{2} > 0$ is intended as a safeguard against losing the results of a long run. Suppose that a ‘New Basis File’ is being saved every $100$ (‘Save Frequency’) iterations, and that qpconvex2_sparse_solve is about to save such a basis at iteration $2000$ . It is conceivable that the run may be interrupted during the next few milliseconds (in the middle of the save). In this case the Basis file will be corrupted and the run will have been essentially wasted.

To eliminate this risk, both a ‘New Basis File’ and a ‘Backup Basis File’ may be specified.

The current basis will then be saved every $100$ iterations, first on ‘New Basis File’ and then immediately on ‘Backup Basis File’. If the run is interrupted at iteration $2000$ during the save on ‘New Basis File’, there will still be a usable basis on ‘Backup Basis File’ (corresponding to iteration $1900$ ).

Note that a new basis will be saved in ‘New Basis File’ at the end of a run if it terminates normally, but it will not be saved in ‘Backup Basis File’. In the above example, if an optimum solution is found at iteration $2050$ (or if the iteration limit is $2050$ ), the final basis on ‘New Basis File’ will correspond to iteration $2050$ , but the last basis saved on ‘Backup Basis File’ will be the one for iteration $2000$ .

A full description of information recorded in ‘New Basis File’ and ‘Backup Basis File’ is given in Gill et al. (2005a).

‘Backup Basis File’int

Default $= 0$

Options ‘New Basis File’ and ‘Backup Basis File’ are sometimes referred to as basis maps. They contain the most compact representation of the state of each variable. They are intended for restarting the solution of a problem at a point that was reached by an earlier run. For nontrivial problems, it is advisable to save basis maps at the end of a run, in order to restart the run if necessary.

If $i_{1} > 0$ , a basis map will be saved on file $i_{1}$ every $i_{3}$ th iteration, where $i_{3}$ is the ‘Save Frequency’. The first record of the file will contain the word PROCEEDING if the run is still in progress. A basis map will also be saved at the end of a run, with some other word indicating the final solution status.

Use of $i_{2} > 0$ is intended as a safeguard against losing the results of a long run. Suppose that a ‘New Basis File’ is being saved every $100$ (‘Save Frequency’) iterations, and that qpconvex2_sparse_solve is about to save such a basis at iteration $2000$ . It is conceivable that the run may be interrupted during the next few milliseconds (in the middle of the save). In this case the Basis file will be corrupted and the run will have been essentially wasted.

To eliminate this risk, both a ‘New Basis File’ and a ‘Backup Basis File’ may be specified.

The current basis will then be saved every $100$ iterations, first on ‘New Basis File’ and then immediately on ‘Backup Basis File’. If the run is interrupted at iteration $2000$ during the save on ‘New Basis File’, there will still be a usable basis on ‘Backup Basis File’ (corresponding to iteration $1900$ ).

Note that a new basis will be saved in ‘New Basis File’ at the end of a run if it terminates normally, but it will not be saved in ‘Backup Basis File’. In the above example, if an optimum solution is found at iteration $2050$ (or if the iteration limit is $2050$ ), the final basis on ‘New Basis File’ will correspond to iteration $2050$ , but the last basis saved on ‘Backup Basis File’ will be the one for iteration $2000$ .

A full description of information recorded in ‘New Basis File’ and ‘Backup Basis File’ is given in Gill et al. (2005a).

‘Save Frequency’int

Default $= 100$

Options ‘New Basis File’ and ‘Backup Basis File’ are sometimes referred to as basis maps. They contain the most compact representation of the state of each variable. They are intended for restarting the solution of a problem at a point that was reached by an earlier run. For nontrivial problems, it is advisable to save basis maps at the end of a run, in order to restart the run if necessary.

If $i_{1} > 0$ , a basis map will be saved on file $i_{1}$ every $i_{3}$ th iteration, where $i_{3}$ is the ‘Save Frequency’. The first record of the file will contain the word PROCEEDING if the run is still in progress. A basis map will also be saved at the end of a run, with some other word indicating the final solution status.

Use of $i_{2} > 0$ is intended as a safeguard against losing the results of a long run. Suppose that a ‘New Basis File’ is being saved every $100$ (‘Save Frequency’) iterations, and that qpconvex2_sparse_solve is about to save such a basis at iteration $2000$ . It is conceivable that the run may be interrupted during the next few milliseconds (in the middle of the save). In this case the Basis file will be corrupted and the run will have been essentially wasted.

To eliminate this risk, both a ‘New Basis File’ and a ‘Backup Basis File’ may be specified.

The current basis will then be saved every $100$ iterations, first on ‘New Basis File’ and then immediately on ‘Backup Basis File’. If the run is interrupted at iteration $2000$ during the save on ‘New Basis File’, there will still be a usable basis on ‘Backup Basis File’ (corresponding to iteration $1900$ ).

Note that a new basis will be saved in ‘New Basis File’ at the end of a run if it terminates normally, but it will not be saved in ‘Backup Basis File’. In the above example, if an optimum solution is found at iteration $2050$ (or if the iteration limit is $2050$ ), the final basis on ‘New Basis File’ will correspond to iteration $2050$ , but the last basis saved on ‘Backup Basis File’ will be the one for iteration $2000$ .

A full description of information recorded in ‘New Basis File’ and ‘Backup Basis File’ is given in Gill et al. (2005a).

‘Nolist’valueless

Default

Option ‘List’ enables printing of each option specification as it is supplied. ‘Nolist’ suppresses this printing.

‘List’valueless

Option ‘List’ enables printing of each option specification as it is supplied. ‘Nolist’ suppresses this printing.

‘Old Basis File’int

Default $= 0$

If $i > 0$ , the basis maps information will be obtained from this file. The file will usually have been output previously as a ‘New Basis File’ or ‘Backup Basis File’. A full description of information recorded in ‘New Basis File’ and ‘Backup Basis File’ is given in Gill et al. (2005a).

The file will not be acceptable if the number of rows or columns in the problem has been altered.

‘Optimality Tolerance’float

Default $= m a x {10^{- 6}, \sqrt{ϵ}}$

This is used to judge the size of the reduced gradients $d_{j} = g_{j} - a_{j}^{T} π$ , where $g_{j}$ is the $j$ th component of the gradient, $a_{j}$ is the associated column of the constraint matrix $(\begin{matrix} A & - I \end{matrix})$ , and $π$ is the set of dual variables.

By construction, the reduced gradients for basic variables are always zero. The problem will be declared optimal if the reduced gradients for nonbasic variables at their lower or upper bounds satisfy

d_{j} / ∥ π ∥ \geq - r or d_{j} / ∥ π ∥ \leq r

respectively, and if $∣ ∣ d_{j} ∣ ∣ / ∥ π ∥ \leq r$ for superbasic variables.

In the above tests, $∥ π ∥$ is a measure of the size of the dual variables. It is included to make the tests independent of a scale factor on the objective function. The quantity $∥ π ∥$ actually used is defined by

∥ π ∥ = m a x (σ / \sqrt{m}, 1), where σ = m \sum i = 1 | π_{i} |,

so that only large scale factors are allowed for.

If the objective is scaled down to be very small, the optimality test reduces to comparing $d_{j}$ against $0.01 r$ .

‘Partial Price’int

Default $= 10 (L P)$ or $1 (Q P)$

This option is recommended for large FP or LP problems that have significantly more variables than constraints (i.e., $n ≫ m$ ). It reduces the work required for each pricing operation (i.e., when a nonbasic variable is selected to enter the basis). If $i = 1$ , all columns of the constraint matrix $(\begin{matrix} A & - I \end{matrix})$ are searched. If $i > 1$ , $A$ and $I$ are partitioned to give $i$ roughly equal segments $A_{j}, I_{j}$ , for $j = 1, 2, \dots, i$ (modulo $i$ ). If the previous pricing search was successful on $A_{j - 1}, I_{j - 1}$ , the next search begins on the segments $A_{j}$ and $I_{j}$ . If a reduced gradient is found that is larger than some dynamic tolerance, the variable with the largest such reduced gradient (of appropriate sign) is selected to enter the basis. If nothing is found, the search continues on the next segments $A_{j + 1}, I_{j + 1}$ , and so on. If $i \leq 0$ , the default value is used.

‘Pivot Tolerance’float

Default $= ϵ^{\frac{2}{3}}$

Broadly speaking, the pivot tolerance is used to prevent columns entering the basis if they would cause the basis to become almost singular.

When $x$ changes to $x + α p$ for some search direction $p$ , a ‘ratio test’ determines which component of $x$ reaches an upper or lower bound first. The corresponding element of $p$ is called the pivot element. Elements of $p$ are ignored (and, therefore, cannot be pivot elements) if they are smaller than the pivot tolerance $r$ .

It is common for two or more variables to reach a bound at essentially the same time. In such cases, the option ‘Feasibility Tolerance’ (say $t$ ) provides some freedom to maximize the pivot element and thereby improve numerical stability. Excessively small values of $t$ should, therefore, not be specified. To a lesser extent, the option ‘Expand Frequency’ (say $f$ ) also provides some freedom to maximize the pivot element. Excessively large values of $f$ should, therefore, not be specified.

‘Print File’int

Default $= 0$

If $i > 0$ , the following information is output to $i$ during the solution of each problem:

a listing of the options;
some statistics about the problem;
the amount of storage available for the $L U$ factorization of the basis matrix;
notes about the initial basis resulting from a Crash procedure or a Basis file;
the iteration log;
basis factorization statistics;
the exit $errno$ condition and some statistics about the solution obtained;
the printed solution, if requested.

The last four items are described in Further Comments and Monitoring Information. Further brief output may be directed to the ‘Summary File’.

‘Print Frequency’int

Default $= 100$

If $i > 0$ , one line of the iteration log will be printed every $i$ th iteration. A value such as $i = 10$ is suggested for those interested only in the final solution. If $i \leq 0$ , the value of $i = 99999999$ is used and effectively no checks are made.

‘Print Level’int

Default $= 1$

This controls the amount of printing produced by qpconvex2_sparse_solve as follows.

$i$	Meaning
0	No output except error messages. If you want to suppress all output, set $‘Print File' = 0$ .
$= 1$	The set of selected options, problem statistics, summary of the scaling procedure, information about the initial basis resulting from a Crash or a Basis file, a single line of output at each iteration (controlled by the option ‘Print Frequency’), and the exit condition with a summary of the final solution.
$\geq 10$	Basis factorization statistics.

‘Punch File’int

Default $= 0$

These files provide compatibility with commercial mathematical programming systems. The ‘Punch File’ from a previous run may be used as an ‘Insert File’ for a later run on the same problem. A full description of information recorded in ‘Insert File’ and ‘Punch File’ is given in Gill et al. (2005a).

If $i_{1} > 0$ , the final solution obtained will be output to file $i_{1}$ . For linear programs, this format is compatible with various commercial systems.

If $i_{2} > 0$ , the ‘Insert File’ containing basis information will be read. The file will usually have been output previously as a ‘Punch File’. The file will not be accessed if ‘Old Basis File’ is specified.

‘Insert File’int

Default $= 0$

These files provide compatibility with commercial mathematical programming systems. The ‘Punch File’ from a previous run may be used as an ‘Insert File’ for a later run on the same problem. A full description of information recorded in ‘Insert File’ and ‘Punch File’ is given in Gill et al. (2005a).

If $i_{1} > 0$ , the final solution obtained will be output to file $i_{1}$ . For linear programs, this format is compatible with various commercial systems.

If $i_{2} > 0$ , the ‘Insert File’ containing basis information will be read. The file will usually have been output previously as a ‘Punch File’. The file will not be accessed if ‘Old Basis File’ is specified.

‘QPSolver Cholesky’valueless

Default

Specifies the active-set algorithm used to solve the quadratic program in Phase 2 ( $γ$ ). ‘QPSolver Cholesky’ holds the full Cholesky factor $R$ of the reduced Hessian $Z^{T} H Z$ . As the QP iterations proceed, the dimension of $R$ changes with the number of superbasic variables. If the number of superbasic variables needs to increase beyond the value of ‘Reduced Hessian Dimension’, the reduced Hessian cannot be stored and the solver switches to ‘QPSolver CG’. The Cholesky solver is reactivated if the number of superbasics stabilizes at a value less than ‘Reduced Hessian Dimension’.

‘QPSolver QN’ solves the QP using a quasi-Newton method. In this case, $R$ is the factor of a quasi-Newton approximate Hessian.

‘QPSolver CG’ uses an active-set method similar to ‘QPSolver QN’, but uses the conjugate-gradient method to solve all systems involving the reduced Hessian.

The Cholesky QP solver is the most robust, but may require a significant amount of computation if there are many superbasics.

The quasi-Newton QP solver does not require computation of the exact $R$ at the start of Phase 2 ( $γ$ ). It may be appropriate when the number of superbasics is large but relatively few iterations are needed to reach a solution (e.g., if qpconvex2_sparse_solve is called with a Warm Start).

The conjugate-gradient QP solver is appropriate for problems with many degrees of freedom (say, more than $2000$ superbasics).

‘QPSolver CG’valueless

Specifies the active-set algorithm used to solve the quadratic program in Phase 2 ( $γ$ ). ‘QPSolver Cholesky’ holds the full Cholesky factor $R$ of the reduced Hessian $Z^{T} H Z$ . As the QP iterations proceed, the dimension of $R$ changes with the number of superbasic variables. If the number of superbasic variables needs to increase beyond the value of ‘Reduced Hessian Dimension’, the reduced Hessian cannot be stored and the solver switches to ‘QPSolver CG’. The Cholesky solver is reactivated if the number of superbasics stabilizes at a value less than ‘Reduced Hessian Dimension’.

‘QPSolver QN’ solves the QP using a quasi-Newton method. In this case, $R$ is the factor of a quasi-Newton approximate Hessian.

‘QPSolver CG’ uses an active-set method similar to ‘QPSolver QN’, but uses the conjugate-gradient method to solve all systems involving the reduced Hessian.

The Cholesky QP solver is the most robust, but may require a significant amount of computation if there are many superbasics.

The quasi-Newton QP solver does not require computation of the exact $R$ at the start of Phase 2 ( $γ$ ). It may be appropriate when the number of superbasics is large but relatively few iterations are needed to reach a solution (e.g., if qpconvex2_sparse_solve is called with a Warm Start).

The conjugate-gradient QP solver is appropriate for problems with many degrees of freedom (say, more than $2000$ superbasics).

‘QPSolver QN’valueless

Specifies the active-set algorithm used to solve the quadratic program in Phase 2 ( $γ$ ). ‘QPSolver Cholesky’ holds the full Cholesky factor $R$ of the reduced Hessian $Z^{T} H Z$ . As the QP iterations proceed, the dimension of $R$ changes with the number of superbasic variables. If the number of superbasic variables needs to increase beyond the value of ‘Reduced Hessian Dimension’, the reduced Hessian cannot be stored and the solver switches to ‘QPSolver CG’. The Cholesky solver is reactivated if the number of superbasics stabilizes at a value less than ‘Reduced Hessian Dimension’.

‘QPSolver QN’ solves the QP using a quasi-Newton method. In this case, $R$ is the factor of a quasi-Newton approximate Hessian.

‘QPSolver CG’ uses an active-set method similar to ‘QPSolver QN’, but uses the conjugate-gradient method to solve all systems involving the reduced Hessian.

The Cholesky QP solver is the most robust, but may require a significant amount of computation if there are many superbasics.

The quasi-Newton QP solver does not require computation of the exact $R$ at the start of Phase 2 ( $γ$ ). It may be appropriate when the number of superbasics is large but relatively few iterations are needed to reach a solution (e.g., if qpconvex2_sparse_solve is called with a Warm Start).

The conjugate-gradient QP solver is appropriate for problems with many degrees of freedom (say, more than $2000$ superbasics).

‘Reduced Hessian Dimension’int

Default $= 1 (L P) or m i n (2000, n_{H} + 1, n) (Q P)$

This specifies that an $i \times i$ triangular matrix $R$ (to define the reduced Hessian according to $R^{T} R = Z^{T} H Z$ ). is to be available for use by the Cholesky QP solver.

‘Scale Option’int

Default $= 2$

Three scale options are available as follows:

$i$	Meaning
0	No scaling. This is recommended if it is known that $x$ and the constraint matrix never have very large elements (say, larger than $100$ ).
1	The constraints and variables are scaled by an iterative procedure that attempts to make the matrix coefficients as close as possible to $1.0$ (see Fourer (1982)). This will sometimes improve the performance of the solution procedures.
2	The constraints and variables are scaled by the iterative procedure. Also, a certain additional scaling is performed that may be helpful if the right-hand side $b$ or the solution $x$ is large. This takes into account columns of $(\begin{matrix} A & - I \end{matrix})$ that are fixed or have positive lower bounds or negative upper bounds.

Option ‘Scale Tolerance’ affects how many passes might be needed through the constraint matrix. On each pass, the scaling procedure computes the ratio of the largest and smallest nonzero coefficients in each column:

ρ_{j} = {m a x}_{j} ∣ ∣ a_{i j} ∣ ∣ / {m i n}_{i} ∣ ∣ a_{i j} ∣ ∣ (a_{i j} \neq 0) .

If ${m a x}_{j} ρ_{j}$ is less than $r$ times its previous value, another scaling pass is performed to adjust the row and column scales. Raising $r$ from $0.9$ to $0.99$ (say) usually increases the number of scaling passes through $A$ . At most $10$ passes are made. The value of $r$ should lie in the range $0 < r < 1$ .

‘Scale Print’ causes the row scales $r (i)$ and column scales $c (j)$ to be printed to ‘Print File’, if ‘System Information Yes’ has been specified. The scaled matrix coefficients are ${¯ a}_{i j} = a_{i j} c (j) / r (i)$ , and the scaled bounds on the variables and slacks are ${¯ l}_{j} = l_{j} / c (j)$ , ${¯ u}_{j} = u_{j} / c (j)$ , where $c (j) = r (j - n)$ if $j > n$ .

‘Scale Tolerance’float

Default $= 0.9$

Three scale options are available as follows:

$i$	Meaning
0	No scaling. This is recommended if it is known that $x$ and the constraint matrix never have very large elements (say, larger than $100$ ).
1	The constraints and variables are scaled by an iterative procedure that attempts to make the matrix coefficients as close as possible to $1.0$ (see Fourer (1982)). This will sometimes improve the performance of the solution procedures.
2	The constraints and variables are scaled by the iterative procedure. Also, a certain additional scaling is performed that may be helpful if the right-hand side $b$ or the solution $x$ is large. This takes into account columns of $(\begin{matrix} A & - I \end{matrix})$ that are fixed or have positive lower bounds or negative upper bounds.

Option ‘Scale Tolerance’ affects how many passes might be needed through the constraint matrix. On each pass, the scaling procedure computes the ratio of the largest and smallest nonzero coefficients in each column:

ρ_{j} = {m a x}_{j} ∣ ∣ a_{i j} ∣ ∣ / {m i n}_{i} ∣ ∣ a_{i j} ∣ ∣ (a_{i j} \neq 0) .

If ${m a x}_{j} ρ_{j}$ is less than $r$ times its previous value, another scaling pass is performed to adjust the row and column scales. Raising $r$ from $0.9$ to $0.99$ (say) usually increases the number of scaling passes through $A$ . At most $10$ passes are made. The value of $r$ should lie in the range $0 < r < 1$ .

‘Scale Print’ causes the row scales $r (i)$ and column scales $c (j)$ to be printed to ‘Print File’, if ‘System Information Yes’ has been specified. The scaled matrix coefficients are ${¯ a}_{i j} = a_{i j} c (j) / r (i)$ , and the scaled bounds on the variables and slacks are ${¯ l}_{j} = l_{j} / c (j)$ , ${¯ u}_{j} = u_{j} / c (j)$ , where $c (j) = r (j - n)$ if $j > n$ .

‘Scale Print’valueless

Three scale options are available as follows:

$i$	Meaning
0	No scaling. This is recommended if it is known that $x$ and the constraint matrix never have very large elements (say, larger than $100$ ).
1	The constraints and variables are scaled by an iterative procedure that attempts to make the matrix coefficients as close as possible to $1.0$ (see Fourer (1982)). This will sometimes improve the performance of the solution procedures.
2	The constraints and variables are scaled by the iterative procedure. Also, a certain additional scaling is performed that may be helpful if the right-hand side $b$ or the solution $x$ is large. This takes into account columns of $(\begin{matrix} A & - I \end{matrix})$ that are fixed or have positive lower bounds or negative upper bounds.

Option ‘Scale Tolerance’ affects how many passes might be needed through the constraint matrix. On each pass, the scaling procedure computes the ratio of the largest and smallest nonzero coefficients in each column:

ρ_{j} = {m a x}_{j} ∣ ∣ a_{i j} ∣ ∣ / {m i n}_{i} ∣ ∣ a_{i j} ∣ ∣ (a_{i j} \neq 0) .

If ${m a x}_{j} ρ_{j}$ is less than $r$ times its previous value, another scaling pass is performed to adjust the row and column scales. Raising $r$ from $0.9$ to $0.99$ (say) usually increases the number of scaling passes through $A$ . At most $10$ passes are made. The value of $r$ should lie in the range $0 < r < 1$ .

‘Scale Print’ causes the row scales $r (i)$ and column scales $c (j)$ to be printed to ‘Print File’, if ‘System Information Yes’ has been specified. The scaled matrix coefficients are ${¯ a}_{i j} = a_{i j} c (j) / r (i)$ , and the scaled bounds on the variables and slacks are ${¯ l}_{j} = l_{j} / c (j)$ , ${¯ u}_{j} = u_{j} / c (j)$ , where $c (j) = r (j - n)$ if $j > n$ .

‘Solution Yes’valueless

Default

This option determines if the final obtained solution is to be output to the ‘Print File’. Note that the ‘Solution File’ option operates independently.

‘Solution No’valueless

This option determines if the final obtained solution is to be output to the ‘Print File’. Note that the ‘Solution File’ option operates independently.

‘Solution File’int

Default $= 0$

If $i > 0$ , the final solution will be output to file $i$ (whether optimal or not).

To see more significant digits in the printed solution, it will sometimes be useful to make $i$ refer to the system ‘Print File’.

‘Summary File’int

Default $= 0$

If $i_{1} > 0$ , the ‘Summary File’ is output to file $i_{1}$ , including a line of the iteration log every $i_{2}$ th iteration. In an interactive environment, it is useful to direct this output to the terminal, to allow a run to be monitored online. (If something looks wrong, the run can be manually terminated.) Further details are given in Monitoring Information. If $i_{2} \leq 0$ , the value of $i_{2} = 99999999$ is used and effectively no checks are made.

‘Summary Frequency’int

Default $= 100$

If $i_{1} > 0$ , the ‘Summary File’ is output to file $i_{1}$ , including a line of the iteration log every $i_{2}$ th iteration. In an interactive environment, it is useful to direct this output to the terminal, to allow a run to be monitored online. (If something looks wrong, the run can be manually terminated.) Further details are given in Monitoring Information. If $i_{2} \leq 0$ , the value of $i_{2} = 99999999$ is used and effectively no checks are made.

‘Superbasics Limit’int

Default $= 1 (L P) or m i n {n_{H} + 1, n} (Q P)$

This places a limit on the storage allocated for superbasic variables. Ideally, $i$ should be set slightly larger than the ‘number of degrees of freedom’ expected at an optimal solution.

For linear programs, an optimum is normally a basic solution with no degrees of freedom. (The number of variables lying strictly between their bounds is no more than $m$ , the number of general constraints.) The default value of $i$ is, therefore, $1$ .

For quadratic problems, the number of degrees of freedom is often called the ‘number of independent variables’. Normally, $i$ need not be greater than $n_{H} + 1$ , where $n_{H}$ is the number of leading nonzero columns of $H$ . For many problems, $i$ may be considerably smaller than $n_{H}$ . This will save storage if $n_{H}$ is very large.

‘Suppress Parameters’valueless

Normally qpconvex2_sparse_solve prints the options file as it is being read, and then prints a complete list of the available keywords and their final values. The option ‘Suppress Parameters’ tells qpconvex2_sparse_solve not to print the full list.

‘System Information No’valueless

Default

This option prints additional information on the progress of major and minor iterations, and Crash statistics. See Monitoring Information.

‘System Information Yes’valueless

This option prints additional information on the progress of major and minor iterations, and Crash statistics. See Monitoring Information.

‘Timing Level’int

Default $= 0$

If $i > 0$ , some timing information will be output to the Print file, if $‘Print File' > 0$ .

‘Unbounded Step Size’float

Default $= infbnd$

If $r > 0$ , $r$ specifies the magnitude of the change in variables that will be considered a step to an unbounded solution. (Note that an unbounded solution can occur only when the Hessian is not positive definite.) If the change in $x$ during an iteration would exceed the value of $r$ , the objective function is considered to be unbounded below in the feasible region. If $r \leq 0$ , the default value is used. See ‘Infinite Bound Size’ for the definition of $infbnd$ .

Raises

NagValueError

(errno $1$ )

The initialization function qpconvex2_sparse_init() has not been called.

(errno $2$ )

On entry, $s t a r t = ⟨ v a l u e ⟩$ .

Constraint: $s t a r t ='B'$ , $'C'$ or $'W'$ .

(errno $2$ )

On entry, $ne = ⟨ v a l u e ⟩$ , $n = ⟨ v a l u e ⟩$ and $m = ⟨ v a l u e ⟩$ .

Constraint: $1 \leq ne \leq n \times m$ .

(errno $2$ )

On entry, $lenc = ⟨ v a l u e ⟩$ and $n = ⟨ v a l u e ⟩$ .

Constraint: $0 \leq lenc \leq n$ .

(errno $2$ )

An error has occurred in the basis package, perhaps indicating incorrect setup of arrays $i n d a$ and $l o c a$ . Set the option ‘Print File’ and examine the output carefully for further information.

(errno $2$ )

On entry, $n c o l h = ⟨ v a l u e ⟩$ and $n = ⟨ v a l u e ⟩$ .

Constraint: $0 \leq n c o l h \leq n$ .

(errno $2$ )

On entry, $nname = ⟨ v a l u e ⟩$ , $n = ⟨ v a l u e ⟩$ and $m = ⟨ v a l u e ⟩$ .

Constraint: $nname = 1$ or $n + m$ .

(errno $2$ )

On entry, $i o b j = ⟨ v a l u e ⟩$ and $m = ⟨ v a l u e ⟩$ .

Constraint: $0 \leq i o b j \leq m$ .

(errno $2$ )

On entry, $l o c a [0] = ⟨ v a l u e ⟩$ , $l o c a [⟨ v a l u e ⟩] = ⟨ v a l u e ⟩$ , $ne = ⟨ v a l u e ⟩$ .

Constraint: $l o c a [0] = 1$ or $l o c a [⟨ v a l u e ⟩] = ne + 1$ .

(errno $2$ )

On entry, row index $⟨ v a l u e ⟩$ in $i n d a [⟨ v a l u e ⟩]$ is outside the range $1$ to $m = ⟨ v a l u e ⟩$ .

(errno $2$ )

On entry, $ne$ is not equal to the number of nonzeros in $a c o l$ . $ne = ⟨ v a l u e ⟩$ , nonzeros in $a c o l = ⟨ v a l u e ⟩$ .

(errno $2$ )

On entry, bounds $b l$ and $b u$ for $⟨ v a l u e ⟩$ are equal and infinite: $b l = b u = ⟨ v a l u e ⟩$ and $infbnd = ⟨ v a l u e ⟩$ .

(errno $2$ )

On entry, bounds for $⟨ v a l u e ⟩$ are inconsistent. $b l = ⟨ v a l u e ⟩$ and $b u = ⟨ v a l u e ⟩$ .

(errno $2$ )

On entry, bounds $b l$ and $b u$ for $⟨ v a l u e ⟩$ $⟨ v a l u e ⟩$ are equal and infinite. $b l = b u = ⟨ v a l u e ⟩$ and $infbnd = ⟨ v a l u e ⟩$ .

(errno $2$ )

On entry, bounds for $⟨ v a l u e ⟩$ $⟨ v a l u e ⟩$ are inconsistent. $b l = ⟨ v a l u e ⟩$ and $b u = ⟨ v a l u e ⟩$ .

(errno $2$ )

Basis file dimensions do not match this problem.

(errno $2$ )

On entry, $m = ⟨ v a l u e ⟩$ .

Constraint: $m \geq ⟨ v a l u e ⟩$ .

(errno $2$ )

On entry, $n = ⟨ v a l u e ⟩$ .

Constraint: $n \geq ⟨ v a l u e ⟩$ .

(errno $12$ )

Internal memory allocation failed when attempting to obtain workspace sizes $⟨ v a l u e ⟩$ , $⟨ v a l u e ⟩$ and $⟨ v a l u e ⟩$ . Please contact NAG.

(errno $13$ )

Internal memory allocation was insufficient. Please contact NAG.

(errno $14$ )

An error has occurred in the basis package, perhaps indicating incorrect setup of arrays $i n d a$ and $l o c a$ . Set the option ‘Print File’ and examine the output carefully for further information.

(errno $15$ )

An unexpected error has occurred. Set the option ‘Print File’ and examine the output carefully for further information.

Warns

NagAlgorithmicWarning

(errno $4$ ): Weak solution found – the solution is not unique.
(errno $5$ ): The linear constraints appear to be infeasible.
(errno $5$ ): The problem appears to be infeasible. The linear equality constraints could not be satisfied.
(errno $5$ ): The problem appears to be infeasible. Nonlinear infeasibilites have been minimized.
(errno $5$ ): The problem appears to be infeasible. Infeasibilites have been minimized.
(errno $6$ ): The problem appears to be unbounded. The objective function is unbounded.
(errno $6$ ): The problem appears to be unbounded. The constraint violation limit has been reached.

NagAlgorithmicMajorWarning

(errno $3$ ): The requested accuracy could not be achieved.
(errno $7$ ): Iteration limit reached.
(errno $7$ ): Major iteration limit reached.
(errno $8$ ): The value of the option ‘Superbasics Limit’ is too small.
(errno $9$ ): The basis is singular after several attempts to factorize it (and add slacks where necessary).
(errno $10$ ): Numerical difficulties have been encountered and no further progress can be made.
(errno $11$ ): Error in $q p h x$ : the QP Hessian is indefinite.

Notes

qpconvex2_sparse_solve is designed to solve large-scale linear or quadratic programming problems of the form:

\begin{matrix} {m i n i m i z e}_{x \in R^{n}} f (x) subject to l \leq (\begin{matrix} x A x \end{matrix}) \leq u, \end{matrix}

where $x$ is an $n$ -vector of variables, $l$ and $u$ are constant lower and upper bounds, $A$ is an $m \times n$ sparse matrix and $f (x)$ is a linear or quadratic objective function that may be specified in a variety of ways, depending upon the particular problem being solved. The option ‘Maximize’ may be used to specify a problem in which $f (x)$ is maximized instead of minimized.

Upper and lower bounds are specified for all variables and constraints. This form allows full generality in specifying various types of constraint. In particular, the $j$ th constraint may be defined as an equality by setting $l_{j} = u_{j}$ . If certain bounds are not present, the associated elements of $l$ or $u$ may be set to special values that are treated as $- \infty$ or $+ \infty$ .

The possible forms for the function $f (x)$ are summarised in Table [label omitted]. The most general form for $f (x)$ is

f (x) = q + c^{T} x + \frac{1}{2} x^{T} H x = q + n \sum j = 1 c_{j} x_{j} + \frac{1}{2} n \sum i = 1 n \sum j = 1 x_{i} H_{i j} x_{j}

where $q$ is a constant, $c$ is a constant $n$ -vector and $H$ is a constant symmetric $n \times n$ matrix with elements ${H_{i j}}$ . In this form, $f$ is a quadratic function of $x$ and (1) is known as a quadratic program (QP). qpconvex2_sparse_solve is suitable for all convex quadratic programs. The defining feature of a convex QP is that the matrix $H$ must be positive semidefinite, i.e., it must satisfy $x^{T} H x \geq 0$ for all $x$ . If not, $f (x)$ is nonconvex and qpconvex2_sparse_solve will terminate with the error indicator $e r r n o$ = 11. If $f (x)$ is nonconvex it may be more appropriate to call handle_solve_ssqp() instead.

Problem type	Objective function $f (x)$	Hessian matrix $H$
FP	Not applicable	$q = c = H = 0$
LP	$q + c^{T} x$	$H = 0$
QP	$q + c^{T} x + \frac{1}{2} x^{T} H x$	Symmetric positive semidefinite

If $H = 0$ , then $f (x) = q + c^{T} x$ and the problem is known as a linear program (LP). In this case, rather than defining an $H$ with zero elements, you can define $H$ to have no columns by setting $n c o l h = 0$ (see Parameters).

If $H = 0$ , $q = 0$ , and $c = 0$ , there is no objective function and the problem is a feasible point problem (FP), which is equivalent to finding a point that satisfies the constraints on $x$ . In the situation where no feasible point exists, several options are available for finding a point that minimizes the constraint violations (see the description of the option ‘Elastic Mode’).

qpconvex2_sparse_solve is suitable for large LPs and QPs in which the matrix $A$ is sparse, i.e., when the number of zero elements is sufficiently large that it is worthwhile using algorithms which avoid computations and storage involving zero elements. The matrix $A$ is input to qpconvex2_sparse_solve by means of the three array arguments $a c o l$ , $i n d a$ and $l o c a$ . This allows you to specify the pattern of nonzero elements in $A$ .

qpconvex2_sparse_solve exploits structure in $H$ by requiring $H$ to be defined implicitly in a function that computes the product $H x$ for any given vector $x$ . In many cases, the product $H x$ can be computed very efficiently for any given $x$ , e.g., $H$ may be a sparse matrix, or a sum of matrices of rank-one.

For problems in which $A$ can be treated as a dense matrix, it is usually more efficient to use lp_solve(), lsq_lincon_solve() or qp_dense_solve().

There is considerable flexibility allowed in the definition of $f (x)$ in Table [label omitted]. The vector $c$ defining the linear term $c^{T} x$ can be input in three ways: as a sparse row of $A$ ; as an explicit dense vector $c$ ; or as both a sparse row and an explicit vector (in which case, $c^{T} x$ will be the sum of two linear terms). When stored in $A$ , $c$ is the $i o b j$ th row of $A$ , which is known as the objective row. The objective row must always be a free row of $A$ in the sense that its lower and upper bounds must be $- \infty$ and $+ \infty$ . Storing $c$ as part of $A$ is recommended if $c$ is a sparse vector. Storing $c$ as an explicit vector is recommended for a sequence of problems, each with a different objective (see arguments $c$ and $lenc$ ).

The upper and lower bounds on the $m$ elements of $A x$ are said to define the general constraints of the problem. Internally, qpconvex2_sparse_solve converts the general constraints to equalities by introducing a set of slack variables $s$ , where $s = {(s_{1}, s_{2}, \dots, s_{m})}_{1}^{T}$ . For example, the linear constraint $5 \leq 2 x_{1} + 3 x_{2} \leq + \infty$ is replaced by $2 x_{1} + 3 x_{2} - s_{1} = 0$ , together with the bounded slack $5 \leq s_{1} \leq + \infty$ . The problem defined by (1) can, therefore, be re-written in the following equivalent form:

\begin{matrix} {m i n i m i z e}_{x \in R^{n}, s \in R^{m}} f (x) subject to A x - s = 0, l \leq (\begin{matrix} x s \end{matrix}) \leq u . \end{matrix}

Since the slack variables $s$ are subject to the same upper and lower bounds as the elements of $A x$ , the bounds on $x$ and $A x$ can simply be thought of as bounds on the combined vector $(x, s)$ . (In order to indicate their special role in QP problems, the original variables $x$ are sometimes known as ‘column variables’, and the slack variables $s$ are known as ‘row variables’.)

Each LP or QP problem is solved using a two-phase iterative procedure (in which the general constraints are satisfied throughout): a feasibility phase (Phase 1), in which the sum of infeasibilities with respect to the bounds on $x$ and $s$ is minimized to find a feasible point that satisfies all constraints within a specified feasibility tolerance; and an optimality phase (Phase 2), in which $f (x)$ is minimized (or maximized) by constructing a sequence of iterates that lies within the feasible region.

Phase 1 involves solving a linear program of the form

Phase 1
	${m i n i m i z e}_{x, s, v, w} \sum_{j = 1}^{n + m} (v_{j} + w_{j})$
	$subject to A x - s = 0, l \leq (\begin{matrix} x s \end{matrix}) - v + w \leq u, v \geq 0, w \geq 0$

which is equivalent to minimizing the sum of the constraint violations. If the constraints are feasible (i.e., at least one feasible point exists), eventually a point will be found at which both $v$ and $w$ are zero. Then the associated value of $(x, s)$ satisfies the original constraints and is used as the starting point for the Phase 2 iterations for minimizing $f (x)$ .

If the constraints are infeasible (i.e., $v \neq 0$ or $w \neq 0$ at the end of Phase 1), no solution exists for (1) and you have the option of either terminating or continuing in so-called elastic mode (see the discussion of the option ‘Elastic Mode’). In elastic mode, a ‘relaxed’ or ‘perturbed’ problem is solved in which $f (x)$ is minimized while allowing some of the bounds to become ‘elastic’, i.e., to change from their specified values. Variables subject to elastic bounds are known as elastic variables. An elastic variable is free to violate one or both of its original upper or lower bounds. You are able to assign which bounds will become elastic if elastic mode is ever started (see the argument $h e l a s t$ in Parameters).

To make the relaxed problem meaningful, qpconvex2_sparse_solve minimizes $f (x)$ while (in some sense) finding the ‘smallest’ violation of the elastic variables. In the situation where all the variables are elastic, the relaxed problem has the form

Phase 2 ( $γ$ )
	${m i n i m i z e}_{x, s, v, w} f (x) + γ \sum_{j = 1}^{n + m} (v_{j} + w_{j})$
	$subject to A x - s = 0, l \leq (\begin{matrix} x s \end{matrix}) - v + w \leq u, v \geq 0, w \geq 0$ ,

where $γ$ is a non-negative argument known as the elastic weight (see the description of the option ‘Elastic Weight’), and $f (x) + γ \sum_{j} (v_{j} + w_{j})$ is called the composite objective. In the more general situation where only a subset of the bounds are elastic, the $v$ ’s and $w$ ’s for the non-elastic bounds are fixed at zero.

The elastic weight can be chosen to make the composite objective behave like the original objective $f (x)$ , the sum of infeasibilities, or anything in-between. If $γ = 0$ , qpconvex2_sparse_solve will attempt to minimize $f$ subject to the (true) upper and lower bounds on the non-elastic variables (and declare the problem infeasible if the non-elastic variables cannot be made feasible).

At the other extreme, choosing $γ$ sufficiently large will have the effect of minimizing the sum of the violations of the elastic variables subject to the original constraints on the non-elastic variables. Choosing a large value of the elastic weight is useful for defining a ‘least-infeasible’ point for an infeasible problem.

In Phase 1 and elastic mode, all calculations involving $v$ and $w$ are done implicitly in the sense that an elastic variable $x_{j}$ is allowed to violate its lower bound (say) and an explicit value of $v$ can be recovered as $v_{j} = l_{j} - x_{j}$ .

A constraint is said to be active or binding at $x$ if the associated element of either $x$ or $A x$ is equal to one of its upper or lower bounds. Since an active constraint in $A x$ has its associated slack variable at a bound, the status of both simple and general upper and lower bounds can be conveniently described in terms of the status of the variables $(x, s)$ . A variable is said to be nonbasic if it is temporarily fixed at its upper or lower bound. It follows that regarding a general constraint as being active is equivalent to thinking of its associated slack as being nonbasic.

At each iteration of an active-set method, the constraints $A x - s = 0$ are (conceptually) partitioned into the form

B x_{B} + S x_{S} + N x_{N} = 0,

where $x_{N}$ consists of the nonbasic elements of $(x, s)$ and the basis matrix $B$ is square and nonsingular. The elements of $x_{B}$ and $x_{S}$ are called the basic and superbasic variables respectively; with $x_{N}$ they are a permutation of the elements of $x$ and $s$ . At a QP solution, the basic and superbasic variables will lie somewhere between their upper or lower bounds, while the nonbasic variables will be equal to one of their bounds. At each iteration, $x_{S}$ is regarded as a set of independent variables that are free to move in any desired direction, namely one that will improve the value of the objective function (or sum of infeasibilities). The basic variables are then adjusted in order to ensure that $(x, s)$ continues to satisfy $A x - s = 0$ . The number of superbasic variables ( $n_{S}$ say), therefore, indicates the number of degrees of freedom remaining after the constraints have been satisfied. In broad terms, $n_{S}$ is a measure of how nonlinear the problem is. In particular, $n_{S}$ will always be zero for FP and LP problems.

If it appears that no improvement can be made with the current definition of $B$ , $S$ and $N$ , a nonbasic variable is selected to be added to $S$ , and the process is repeated with the value of $n_{S}$ increased by one. At all stages, if a basic or superbasic variable encounters one of its bounds, the variable is made nonbasic and the value of $n_{S}$ is decreased by one.

Associated with each of the $m$ equality constraints $A x - s = 0$ is a dual variable $π_{i}$ . Similarly, each variable in $(x, s)$ has an associated reduced gradient $d_{j}$ (also known as a reduced cost). The reduced gradients for the variables $x$ are the quantities $g - A^{T} π$ , where $g$ is the gradient of the QP objective function, and the reduced gradients for the slack variables $s$ are the dual variables $π$ . The QP subproblem is optimal if $d_{j} \geq 0$ for all nonbasic variables at their lower bounds, $d_{j} \leq 0$ for all nonbasic variables at their upper bounds and $d_{j} = 0$ for all superbasic variables. In practice, an approximate QP solution is found by slightly relaxing these conditions on $d_{j}$ (see the description of the option ‘Optimality Tolerance’).

The process of computing and comparing reduced gradients is known as pricing (a term first introduced in the context of the simplex method for linear programming). To ‘price’ a nonbasic variable $x_{j}$ means that the reduced gradient $d_{j}$ associated with the relevant active upper or lower bound on $x_{j}$ is computed via the formula $d_{j} = g_{j} - a_{j}^{T} π$ , where $a_{j}$ is the $j$ th column of $(\begin{matrix} A & - I \end{matrix})$ . (The variable selected by such a process and the corresponding value of $d_{j}$ (i.e., its reduced gradient) are the quantities +SBS and dj in the monitoring file output; see Further Comments.) If $A$ has significantly more columns than rows (i.e., $n ≫ m$ ), pricing can be computationally expensive. In this case, a strategy known as partial pricing can be used to compute and compare only a subset of the $d_{j}$ s.

qpconvex2_sparse_solve is based on SQOPT, which is part of the SNOPT package described in Gill et al. (2005a). It uses stable numerical methods throughout and includes a reliable basis package (for maintaining sparse $L U$ factors of the basis matrix $B$ ), a practical anti-degeneracy procedure, efficient handling of linear constraints and bounds on the variables (by an active-set strategy), as well as automatic scaling of the constraints. Further details can be found in Algorithmic Details.

References

Fourer, R, 1982, Solving staircase linear programs by the simplex method, Math. Programming (23), 274–313

Gill, P E and Murray, W, 1978, Numerically stable methods for quadratic programming, Math. Programming (14), 349–372

Gill, P E, Murray, W and Saunders, M A, 1995, User’s guide for QPOPT 1.0: a Fortran package for quadratic programming, Report SOL 95-4, Department of Operations Research, Stanford University

Gill, P E, Murray, W and Saunders, M A, 2005, Users’ guide for SQOPT 7: a Fortran package for large-scale linear and quadratic programming, Report NA 05-1, Department of Mathematics, University of California, San Diego, https://www.ccom.ucsd.edu/~peg/papers/sqdoc7.pdf

Gill, P E, Murray, W and Saunders, M A, 2005, Users’ guide for SNOPT 7.1: a Fortran package for large-scale linear nonlinear programming, Report NA 05-2, Department of Mathematics, University of California, San Diego, https://www.ccom.ucsd.edu/~peg/papers/sndoc7.pdf

Gill, P E, Murray, W, Saunders, M A and Wright, M H, 1987, Maintaining $L U$ factors of a general sparse matrix, Linear Algebra and its Applics. (88/89), 239–270

Gill, P E, Murray, W, Saunders, M A and Wright, M H, 1989, A practical anti-cycling procedure for linearly constrained optimization, Math. Programming (45), 437–474

Gill, P E, Murray, W, Saunders, M A and Wright, M H, 1991, Inertia-controlling methods for general quadratic programming, SIAM Rev. (33), 1–36

Hall, J A J and McKinnon, K I M, 1996, The simplest examples where the simplex method cycles and conditions where EXPAND fails to prevent cycling, Report MS, 96–100, Department of Mathematics and Statistics, University of Edinburgh

NAG and Python

Return to Front

naginterfaces.library.opt.qpconvex2_sparse_solve¶

naginterfaces.library.opt.qpconvex2_​sparse_​solve¶

naginterfaces.library.opt.qpconvex2_sparse_solve¶