NAG CL Interface
e04ncc (lsq_lincon_solve)
1
Purpose
e04ncc solves linearly constrained linear least squares problems and convex quadratic programming problems. It is not intended for large sparse problems.
2
Specification
void |
e04ncc (Integer m,
Integer n,
Integer nclin,
const double a[],
Integer tda,
const double bl[],
const double bu[],
const double cvec[],
double b[],
double h[],
Integer tdh,
Integer kx[],
double x[],
double *objf,
Nag_E04_Opt *options,
Nag_Comm *comm,
NagError *fail) |
|
The function may be called by the names: e04ncc, nag_opt_lsq_lincon_solve or nag_opt_lin_lsq.
3
Description
e04ncc is designed to solve a class of quadratic programming problems stated in the following general form:
where
is an
by
matrix and the objective function
may be specified in a variety of ways depending upon the particular problem to be solved. The available forms for
are listed in
Table 1 below, in which the prefixes FP, LP, QP and LS stand for ‘feasible point’, ‘linear programming’, ‘quadratic programming’ and ‘least squares’ respectively,
is an
element vector,
is an
element vector, and
denotes the Euclidean length of
.
Problem Type |
|
Matrix |
FP |
Not applicable |
Not applicable |
LP |
|
Not applicable |
QP1 |
|
by symmetric positive semidefinite |
QP2 |
|
by symmetric positive semidefinite |
QP3 |
|
by upper trapezoidal |
QP4 |
|
by upper trapezoidal |
LS1 |
|
by |
LS2 |
|
by |
LS3 |
|
by upper trapezoidal |
LS4 |
|
by upper trapezoidal |
Table 1
For problems of type LS,
is referred to as the
least squares matrix, or the
matrix of observations, and
as the
vector of observations. The default problem type is LS1, and other objective functions are selected by using the optional parameter
(see
Section 12.2).
When is upper trapezoidal it will usually be the case that , so that is upper triangular, but full generality has been allowed for in the specification of the problem. The upper trapezoidal form is intended for cases where a previous factorization, such as a factorization, has been performed.
The constraints involving
are called the
general constraints. Note that upper and lower bounds are specified for all the variables and for all the general constraints. An equality constraint can be specified by setting
. If certain bounds are not present, the associated elements of
or
can be set to special values that will be treated as
or
. (See the description of the optional parameter
in
Section 12.2.
The function
is a quadratic function, whose defining feature is that its second-derivative matrix
(the
Hessian matrix) is constant. For the LP case,
; for QP1 and QP2,
; and for QP3, QP4 and LS problems,
and the Hessian matrix is positive semidefinite (positive definite if
is full rank), so that
is convex. If
is defined as the zero matrix,
e04ncc will solve the resulting linear programming problem; however, this can be accomplished more efficiently by using
e04mfc.
Problems of type QP3 and QP4 for which is not in upper trapezoidal form should be solved as problems of type LS1 and LS2 respectively, with .
You must supply an initial estimate of the solution.
If is of full rank then e04ncc will obtain the unique (global) minimum. If is not of full rank then the solution may still be a global minimum if all active constraints have nonzero Lagrange multipliers. Otherwise the solution obtained will be either a weak minimum (i.e., with a unique optimal objective value, but an infinite set of optimal ), or else the objective function is unbounded below in the feasible region. The last case can only occur when contains an explicit linear term (as in problems LP, QP2, QP4, LS2 and LS4).
The method used by
e04ncc is described in detail in
Section 11.
4
References
Gill P E, Hammarling S, Murray W, Saunders M A and Wright M H (1986) Users' guide for LSSOL (Version 1.0) Report SOL 86-1 Department of Operations Research, Stanford University
Gill P E, Murray W, Saunders M A and Wright M H (1984) Procedures for optimization problems with a mixture of bounds and general linear constraints ACM Trans. Math. Software 10 282–298
Gill P E, Murray W and Wright M H (1981) Practical Optimization Academic Press
Stoer J (1971) On the numerical solution of constrained least squares problems SIAM J. Numer. Anal. 8 382–411
5
Arguments
-
1:
– Integer
Input
-
On entry:
, the number of rows in the matrix
. If the problem is of type FP or LP,
m is not referenced and is assumed to be zero. The default type is LS1; other problem types can be specified using the optional parameter
, see
Section 12.2.
If the problem is of type QP,
m will usually be
, the number of variables. However, a value of
m less than
is appropriate for problem type QP3 or QP4 if
is an upper trapezoidal matrix with
rows. Similarly,
m may be used to define the dimension of a leading block of nonzeros in the Hessian matrices of QP1 or QP2. In QP cases,
should not be greater than
; if it is, the last
rows of
are ignored.
If the problem is a least squares problem (in particular, the default type LS1),
m is also the dimension of the array
b. Note that all possibilities (
,
and
) are allowed in this case.
Constraint:
if problem is not FP or LP.
-
2:
– Integer
Input
-
On entry: , the number of variables.
Constraint:
.
-
3:
– Integer
Input
-
On entry: , the number of general linear constraints.
Constraint:
.
-
4:
– const double
Input
-
Note: the th element of the matrix is stored in .
On entry: the
th row of
a must contain the coefficients of the
th general linear constraint (the
th row of
), for
. If
then the array
a is not referenced.
-
5:
– Integer
Input
-
On entry: the stride separating matrix column elements in the array
a.
Constraint:
if .
-
6:
– const double
Input
-
7:
– const double
Input
-
On entry:
bl must contain the lower bounds and
bu the upper bounds, for all the constraints in the following order. The first
elements of each array must contain the bounds on the variables, and the next
elements the bounds for the general linear constraints (if any). To specify a nonexistent lower bound (i.e.,
), set
, and to specify a nonexistent upper bound (i.e.,
), set
, where
is one of the optional parameters (default value
(see
Section 12.2). To specify the
th constraint as an equality, set
, say, where
.
Constraints:
- , for ;
- if , .
-
8:
– const double
Input
-
On entry: the coefficients of the explicit linear term of the objective function when the problem is of type LP, QP2, QP4, LS2 or LS4.
If the problem is of type FP, QP1, QP3, LS1 (the default) or LS3,
cvec is not referenced and may be
NULL.
-
9:
– double
Input/Output
-
On entry: the elements of the vector of observations.
On exit: the transformed residual vector of equation
(10).
b is referenced only in the case of least squares problem types (in particular, the default type LS1. For other problem types,
b is not referenced and may be
NULL.
-
10:
– double
Input/Output
-
Note: the th element of the matrix is stored in .
On entry: the array
h must contain the matrix
as specified in
Table 1 (see
Section 3).
For problems QP1 and QP2, the first
rows and columns of
h must contain the leading
by
rows and columns of the symmetric Hessian matrix. Only the diagonal and upper triangular elements of the leading
rows and columns of
h are referenced. The remaining elements are assumed to be zero and need not be assigned.
For problems QP3, QP4, LS3 and LS4, the first
rows of
h must contain an
by
upper trapezoidal factor of either the Hessian or the least squares matrix, ordered according to the array
kx (see below). The factor need not be of full rank, i.e., some of the diagonals may be zero. However, as a general rule, the larger the dimension of the leading nonsingular sub-matrix of
, the fewer iterations will be required. Elements outside the upper trapezoidal part of the first
rows of
are assumed to be zero and need not be assigned.
If a constrained least squares problem contains a very large number of observations, storage limitations may prevent storage of the entire least squares matrix. In such cases, you should transform the original into a triangular matrix before the call to e04ncc and solve as type LS3 or LS4.
On exit: by default,
h contains the upper triangular Cholesky factor
of equation
(8), with columns ordered as indicated by
kx (see below). If the optional parameter
(see
Section 12.2), and the problem is one of the LS or QP types,
h contains the upper triangular Cholesky factor of the Hessian matrix
, with columns ordered as indicated by
kx (see below). In either case, this matrix may be used to obtain the variance-covariance matrix or to recover the upper triangular factor of the original least squares matrix.
If the problem is of type FP or LP,
h is not referenced and may be
NULL.
-
11:
– Integer
Input
-
On entry: the stride separating matrix column elements in the array
h.
Constraint:
.
-
12:
– Integer
Input/Output
-
On entry: for problems of type QP3, QP4, LS3 or LS4 the array
kx must specify the order of the columns of the matrix
with respect to the ordering of
x. Thus if column
of
is the column associated with the variable
then
.
If the problem is of any other type then the array
kx need not be initialized.
Constraints:
- , for ;
- if , .
On exit: defines the order of the columns of
with respect to the ordering of
x, as described above.
-
13:
– double
Input/Output
-
On entry: an initial estimate of the solution.
On exit: the point at which
e04ncc terminated. If
,
NW_SOLN_NOT_UNIQUE or
NW_NOT_FEASIBLE,
x contains an estimate of the solution.
-
14:
– double *
Output
-
On exit: the value of the objective function at
if
is feasible, or the sum of infeasibilities at
otherwise. If the problem is of type FP and
is feasible,
objf is set to zero.
-
15:
– Nag_E04_Opt *
Input/Output
-
On entry/exit: a pointer to a structure of type Nag_E04_Opt whose members are optional parameters for
e04ncc. These structure members offer the means of adjusting some of the argument values of the algorithm and on output will supply further details of the results. A description of the members of
options is given below in
Section 12. Some of the results returned in
options can be used by
e04ncc to perform a ‘warm start’ (see the member
in
Section 12.2).
If any of these optional parameters are required then the structure
options should be declared and initialized by a call to
e04xxc and supplied as an argument to
e04ncc. However, if the optional parameters are not required the NAG defined null pointer,
E04_DEFAULT, can be used in the function call.
-
16:
– Nag_Comm *
Input/Output
-
Note: comm is a NAG defined type (see
Section 3.1.1 in the Introduction to the NAG Library CL Interface).
On entry/exit: structure containing pointers for communication with an optional user-defined printing function; see
Section 12.3.1 for details. If you do not need to make use of this communication feature the null pointer
NAGCOMM_NULL may be used in the call to
e04ncc;
comm will then be declared internally for use in calls to user-supplied functions.
-
17:
– NagError *
Input/Output
-
The NAG error argument (see
Section 7 in the Introduction to the NAG Library CL Interface).
6
Error Indicators and Warnings
- NE_2_INT_ARG_LT
-
On entry, while . These arguments must satisfy .
On entry, while . These arguments must satisfy .
- NE_ALLOC_FAIL
-
Dynamic memory allocation failed.
- NE_ARRAY_CONS
-
The contents of array
kx are not valid. Constraint: must contain a permutation of integers
.
- NE_B_NULL
-
but argument NULL.
- NE_BAD_PARAM
-
On entry, argument had an illegal value.
On entry, argument had an illegal value.
On entry, argument had an illegal value.
- NE_BOUND
-
The lower bound for variable (array element ) is greater than the upper bound.
- NE_BOUND_LCON
-
The lower bound for linear constraint (array element ) is greater than the upper bound.
- NE_CVEC_NULL
-
but argument NULL.
- NE_CYCLING
-
The algorithm could be cycling, since a total of 50 changes were made to the working set without altering . Check the detailed iteration printout for a repeated pattern of constraint deletions and additions.
If a sequence of constraint changes is being repeated, the iterates are probably cycling. (
e04ncc does not contain a method that is guaranteed to avoid cycling; such a method would be combinatorial in nature.) Cycling may occur in two circumstances: at a constrained stationary point where there are some small or zero Lagrange multipliers; or at a point (usually a vertex) where the constraints that are satisfied exactly are nearly linearly dependent. In the latter case, you have the option of identifying the offending dependent constraints and removing them from the problem, or restarting the run with a larger value of the optional parameter
(default value
, where
is the
machine precision; see
Section 12.2). If this error exit occurs but no suspicious pattern of constraint changes can be observed, it may be worthwhile to restart with the final
(with optional parameter
or
).
- NE_H_NULL_QP
-
but argument
NULL. This problem type requires an array to be supplied in argument
h.
- NE_INT_ARG_LT
-
On entry, .
Constraint: .
On entry, .
Constraint: .
On entry, .
Constraint: .
- NE_INTERNAL_ERROR
-
An internal error has occurred in this function. Check the function call
and any array sizes. If the call is correct then please contact
NAG for
assistance.
- NE_INVALID_INT_RANGE_1
-
Value given to is not valid. Correct range is .
Value given to is not valid. Correct range is .
- NE_INVALID_REAL_RANGE_F
-
Value given to is not valid. Correct range is .
Value given to is not valid. Correct range is .
Value given to is not valid. Correct range is .
- NE_INVALID_REAL_RANGE_FF
-
Value given to is not valid. Correct range is .
Value given to is not valid. Correct range is .
- NE_NOT_APPEND_FILE
-
Cannot open file for appending.
- NE_NOT_CLOSE_FILE
-
Cannot close file .
- NE_OPT_NOT_INIT
-
Options structure not initialized.
- NE_STATE_VAL
-
is out of range. .
- NE_UNBOUNDED
-
Solution appears to be unbounded.
This error indicator implies that a step as large as optional parameter
(default value
; see
Section 12.2) would have to be taken in order to continue the algorithm. This situation can occur only when
is singular, there is an explicit linear term, and at least one variable has no upper or lower bound.
- NE_WARM_START
-
but pointer NULL.
- NE_WRITE_ERROR
-
Error occurred when writing to file .
- NW_NOT_FEASIBLE
-
No feasible point was found for the linear constraints.
It was not possible to satisfy all the constraints to within the feasibility tolerance. In this case, the constraint violations at the final
will reveal a value of the tolerance for which a feasible point will exist – for example, if the feasibility tolerance for each violated constraint exceeds its
Residual (see
Section 12.3) at the final point. The modified problem (with an altered value of the optional feasibility tolerance,
) may then be solved using optional parameter
(see
Section 12.2). You should check that there are no constraint redundancies. If the data for the constraints are accurate only to the absolute precision
, you should ensure that the value of
is
greater than
. For example, if all elements of
are of order unity and are accurate only to three decimal places,
should be at least
.
- NW_OVERFLOW_WARN
-
Serious ill conditioning in the working set after adding constraint . Overflow may occur in subsequent iterations.
If overflow occurs preceded by this warning then serious ill conditioning has probably occurred in the working set when adding a constraint. It may be possible to avoid the difficulty by increasing the magnitude of the optional parameter and re-running the program. If the message recurs even after this change, the offending linearly dependent constraint must be removed from the problem.
- NW_SOLN_NOT_UNIQUE
-
Optimal solution is not unique.
The point in
x is a weak local minimum, i.e., the projected gradient is negligible, the Lagrange multipliers are optimal, but either
(see
Section 11.3) is singular or there is a small multiplier. This means that
is not unique.
- NW_TOO_MANY_ITER
-
The maximum number of iterations, , have been performed.
The limiting number of iterations (determined by the optional parameters
and
, see
Section 12.2) was reached before normal termination occurred. If the method appears to be making progress (e.g., the objective function is being satisfactorily reduced), either increase the iteration limits or, alternatively, rerun
e04ncc using the optional parameter
to specify the initial working set. If the iteration limit is already large, but some of the constraints could be nearly linearly dependent, check the extended iteration printout (see
Section 12.3) for a repeated pattern of constraints entering and leaving the working set. (Near-dependencies are often indicated by wide variations in size in the diagonal elements of the matrix
(see
Section 11.2), which will be printed if optional parameter
(default value
; see
Section 12.2.) In this case, the algorithm could be cycling (see the comments below for
).
7
Accuracy
e04ncc implements a numerically stable active set strategy and returns solutions that are as accurate as the condition of the problem warrants on the machine.
8
Parallelism and Performance
e04ncc is not threaded in any implementation.
9.1
Termination Criteria
e04ncc exits with
if
is a strong local minimizer, i.e., the reduced gradient is negligible, the Lagrange multipliers are optimal (see
Section 12.3) and
(see
Section 11.3) is nonsingular.
9.2
Scaling
Sensible scaling of the problem is likely to reduce the number of iterations required and make the problem less sensitive to perturbations in the data, thus improving the condition of the problem. In the absence of better information it is usually sensible to make the Euclidean lengths of each constraint of comparable magnitude. See the
E04 Chapter Introduction and
Gill et al. (1981) for further information and advice.
10
Example
To minimize the quadratic function
, where
subject to the bounds
and to the general constraints
The initial point, which is feasible, is
and
.
The optimal solution (to five figures) is
and
. Three bound constraints and two general constraints are active at the solution. Note that, although the Hessian matrix is positive semidefinite, the point
is unique.
This example illustrates the use of the
options structure. Since the problem is of type QP2 (as described in
Section 3) and the default value of the optional parameter
, it is necessary to reset this argument to
in order to solve the problem. This is achieved by declaring the
options structure and initializing it by calling
e04xxc. Then
is assigned directly, before calling
e04ncc. On return from
e04ncc,
e04xzc is used to free the memory assigned to the pointers in the options structure. You must
not use the standard C function
free() for this purpose.
10.1
Program Text
10.2
Program Data
10.3
Program Results
11
Further Description
This section gives a detailed description of the algorithm used in
e04ncc. This, and possibly the next section,
Section 12, may be omitted if the more sophisticated features of the algorithm and software are not currently of interest.
11.1
Overview
e04ncc is based on an inertia-controlling method that maintains a Cholesky factorization of the reduced Hessian (see below). The method is based on that of Gill and Murray (1978) and is described in detail by
Gill et al. (1981). Here we briefly summarise the main features of the method.
e04ncc uses essentially the same algorithm as the subroutine LSSOL described in
Gill et al. (1986). It is based on a two-phase (primal) quadratic programming method with features to exploit the convexity of the objective function due to
Gill et al. (1984). (In the full-rank case, the method is related to that of Stoer, see
Stoer (1971).)
e04ncc has two phases: finding an initial feasible point by minimizing the sum of infeasibilities (the
feasibility phase), and minimizing the quadratic objective function within the feasible region (the
optimality phase). The two-phase nature of the algorithm is reflected by changing the function being minimized from the sum of infeasibilities to the quadratic objective function. The feasibility phase does
not perform the standard simplex method (i.e., it does not necessarily find a vertex), except in the LP case when
. Once any iterate is feasible, all subsequent iterates remain feasible.
e04ncc has been designed to be efficient when used to solve a
sequence of related problems — for example, within a sequential quadratic programming method for nonlinearly constrained optimization (e.g.,
e04ucc). In particular, you may specify an initial working set (the indices of the constraints believed to be satisfied exactly at the solution); see the discussion of the optional parameter
in
Section 12.2.
In general, an iterative process is required to solve a quadratic program. (For simplicity, we shall always consider a typical iteration and avoid reference to the index of the iteration.) Each new iterate
is defined by
where the
step length is a non-negative scalar, and
is called the
search direction.At each point
, a
working set of constraints is defined to be a linearly independent subset of the constraints that are satisfied ‘exactly’ (to within the tolerance defined by the optional parameter
; see
Section 12.2). The working set is the current prediction of the constraints that hold with equality at a solution of
(1). The search direction is constructed so that the constraints in the working set remain
unaltered for any value of the step length. For a bound constraint in the working set, this property is achieved by setting the corresponding element of the search direction to zero. Thus, the associated variable is
fixed, and specification of the working set induces a partition of
into
fixed and
free variables. During a given iteration, the fixed variables are effectively removed from the problem; since the relevant elements of the search direction are zero, the columns of
corresponding to fixed variables may be ignored.
Let
denote the number of general constraints in the working set and let
denote the number of variables fixed at one of their bounds (
and
are the quantities
Lin and
Bnd in the extended iteration printout from
e04ncc; see
Section 12.3). Similarly, let
denote the number of free variables. At every iteration,
the variables are re-ordered so that the last variables are fixed, with all other relevant vectors and matrices ordered accordingly. The order of the variables is indicated by the contents of the array
kx on exit (see
Section 5).
11.2
Definition of the Search Direction
Let
denote the
by
sub-matrix of general constraints in the working set corresponding to the free variables, and let
denote the search direction with respect to the free variables only. The general constraints in the working set will be unaltered by any move along
if
In order to compute
, the
factorization of
is used:
where
is a nonsingular
by
reverse-triangular matrix (i.e.,
if
), and the nonsingular
by
matrix
is the product of orthogonal transformations (see
Gill et al. (1984)). If the columns of
are partitioned so that
where
is
by
, then the
columns of
form a basis for the null space of
. Let
be an integer such that
, and let
denote a matrix whose
columns are a subset of the columns of
. (The integer
is the quantity
Zr in the extended iteration printout from
e04ncc; see
Section 12.3. In many cases,
will include
all the columns of
.) The direction
will satisfy
(3) if
where
is any
-vector.
11.3
The Main Iteration
Let
denote the
by
matrix
where
is the identity matrix of order
. Let
denote an
by
upper triangular matrix (the
Cholesky factor) such that
and let the matrix of the first
rows and columns of
be denoted by
. (The matrix
in
(8) is the Hessian with its rows and columns permuted so that the free variables come first.)
The definition of
in
(6) depends on whether or not the matrix
is singular at
. In the nonsingular case,
satisfies the equations
where
denotes the vector
and
denotes the objective gradient. (The norm of
is the printed quantity
Norm Gf; see
Section 12.3.) When
is defined by
(9),
is the minimizer of the objective function subject to the constraints (bounds and general) in the working set treated as equalities. In general, a vector
is available such that
, which allows
to be computed from a single back-substitution
. For example, when solving problem LS1,
comprises the first
elements of the
transformed residual vector
which is recurred from one iteration to the next, where
is an orthogonal matrix.
In the singular case,
is defined such that
This vector has the property that the objective function is linear along and may be reduced by any step of the form , where .
The vector
is known as the
projected gradient at
. If the projected gradient is zero,
is a constrained stationary point in the subspace defined by
. During the feasibility phase, the projected gradient will usually be zero only at a vertex (although it may be zero at non-vertices in the presence of constraint dependencies). During the optimality phase, a zero projected gradient implies that
minimizes the quadratic objective when the constraints in the working set are treated as equalities. At a constrained stationary point, Lagrange multipliers
and
for the general and bound constraints are defined from the equations
Given a positive constant
of the order of the
machine precision, the Lagrange multiplier
corresponding to an inequality constraint in the working set is said to be
optimal if
when the associated constraint is at its
upper bound, or if
when the associated constraint is at its
lower bound. If a multiplier is non-optimal, the objective function (either the true objective or the sum of infeasibilities) can be reduced by deleting the corresponding constraint (with index
Jdel; see
Section 12.3) from the working set.
If optimal multipliers occur during the feasibility phase and the sum of infeasibilities is nonzero, there is no feasible point, and e04ncc will continue until the minimum value of the sum of infeasibilities has been found. At this point, the Lagrange multiplier corresponding to an inequality constraint in the working set will be such that when the associated constraint is at its upper bound, and when the associated constraint is at its lower bound. Lagrange multipliers for equality constraints will satisfy .
The choice of step length is based on remaining feasible with respect to the satisfied constraints. If
is nonsingular and
is feasible,
will be taken as unity. In this case, the projected gradient at
will be zero, and Lagrange multipliers are computed. Otherwise,
is set to
, the step to the ‘nearest’ constraint (with index
Jadd; see
Section 12.3), which is added to the working set at the next iteration.
If
is not input as a triangular matrix, it is overwritten by a triangular matrix
satisfying
(8) obtained using the Cholesky factorization in the QP case, or the
factorization in the LS case. Column interchanges are used in both cases, and an estimate is made of the rank of the triangular factor. Thereafter, the dependent rows of
are eliminated from the problem.
Each change in the working set leads to a simple change to
: if the status of a general constraint changes, a
row of
is altered; if a bound constraint enters or leaves the working set, a
column of
changes. Explicit representations are recurred of the matrices
and
; and of vectors
,
and
, which are related by the formulae
and
Note that the triangular factor
associated with the Hessian of the original problem is updated during both the optimality
and the feasibility phases.
The treatment of the singular case depends critically on the following feature of the matrix updating schemes used in
e04ncc: if a given factor
is nonsingular, it can become singular during subsequent iterations only when a constraint leaves the working set, in which case only its last diagonal element can become zero. This property implies that a vector satisfying
(11) may be found using the single back-substitution
, where
is the matrix
with a unit last diagonal, and
is a vector of all zeros except in the last position. If the Hessian matrix
is singular, the matrix
(and hence
) may be singular at the start of the optimality phase. However,
will be nonsingular if enough constraints are included in the initial working set. (The matrix with no rows and columns is positive definite by definition, corresponding to the case when
contains
constraints.) The idea is to include as many general constraints as necessary to ensure a nonsingular
.
At the beginning of each phase, an upper triangular matrix
is determined that is the largest nonsingular leading sub-matrix of
. The use of interchanges during the factorization of
tends to maximize the dimension of
. (The rank of
is estimated using the optional parameter
; see
Section 12.2.) Let
denote the columns of
corresponding to
, and let
be partitioned as
. A working set for which
defines the null space can be obtained by including
the rows of as ‘artificial constraints’. Minimization of the objective function then proceeds within the subspace defined by
.
The artificially augmented working set is given by
so that
will satisfy
and
. By definition of the
factorization,
automatically satisfies the following:
where
and hence the
factorization of
(13) requires no additional work.
The matrix need not be kept fixed, since its role is purely to define an appropriate null space; the factorization can therefore be updated in the normal fashion as the iterations proceed. No work is required to ‘delete’ the artificial constraints associated with when , since this simply involves repartitioning . When deciding which constraint to delete, the ‘artificial’ multiplier vector associated with the rows of is equal to , and the multipliers corresponding to the rows of the ‘true’ working set are the multipliers that would be obtained if the temporary constraints were not present.
The number of columns in
and
, the Euclidean norm of
, and the condition estimator of
appear in the extended iteration printout as
Art,
Zr,
Norm Gz and
Cond Rz respectively (see
Section 12.3).
Although the algorithm of e04ncc does not perform simplex steps in general, there is one exception: a linear program with fewer general constraints than variables (i.e., ). (Use of the simplex method in this situation leads to savings in storage.) At the starting point, the ‘natural’ working set (the set of constraints exactly or nearly satisfied at the starting point) is augmented with a suitable number of ‘temporary’ bounds, each of which has the effect of temporarily fixing a variable at its current value. In subsequent iterations, a temporary bound is treated as a standard constraint until it is deleted from the working set, in which case it is never added again.
One of the most important features of
e04ncc is its control of the conditioning of the working set, whose nearness to linear dependence is estimated by the ratio of the largest to smallest diagonals of the
factor
(the printed value
Cond T; see
Section 12.3). In constructing the initial working set, constraints are excluded that would result in a large value of
Cond T. Thereafter,
e04ncc allows constraints to be violated by as much as a user-specified feasibility tolerance (see
,
Section 12.2) in order to provide, whenever possible, a
choice of constraints to be added to the working set at a given iteration. Let
denote the maximum step at which
does not violate any constraint by more than its feasibility tolerance. All constraints at distance
along
from the current point are then viewed as acceptable candidates for inclusion in the working set. The constraint whose normal makes the largest angle with the search direction is added to the working set. In order to ensure that the new iterate satisfies the constraints in the working set as accurately as possible, the step taken is the exact distance to the newly added constraint. As a consequence, negative steps are occasionally permitted, since the current iterate may violate the constraint to be added by as much as the feasibility tolerance.
12
Optional Parameters
A number of optional input and output arguments to
e04ncc are available through the structure argument
options, type Nag_E04_Opt. An argument may be selected by assigning an appropriate value to the relevant structure member; those arguments not selected will be assigned default values. If no use is to be made of any of the optional parameters you should use the NAG defined null pointer,
E04_DEFAULT, in place of
options when calling
e04ncc; the default settings will then be used for all arguments.
Before assigning values to
options directly the structure
must be initialized by a call to the function
e04xxc. Values may then be assigned to the structure members in the normal C manner.
After return from
e04ncc, the
options structure may only be re-used for future calls of
e04ncc if the dimensions of the new problem are the same. Otherwise, the structure must be cleared by a call of
e04xzc) and re-initialized by a call of
e04xxc before future calls. Failure to do this will result in unpredictable behaviour.
Option settings may also be read from a text file using the function
e04xyc in which case initialization of the
options structure will be performed automatically if not already done. Any subsequent direct assignment to the
options structure must
not be preceded by initialization.
If assignment of functions and memory to pointers in the
options structure is required, then this must be done directly in the calling program; they cannot be assigned using
e04xyc.
12.1
Optional Parameter Checklist and Default Values
For easy reference, the following list shows the members of
options which are valid for
e04ncc together with their default values where relevant. The number
is a generic notation for
machine precision (see
X02AJC).
Nag_ProblemType prob |
Nag_LS1 |
Nag_Start start |
|
Boolean list |
Nag_TRUE |
Nag_PrintType print_level |
Nag_Soln_Iter |
|
char outfile[512] |
stdout |
void (*print_fun)() |
NULL |
|
Integer fmax_iter |
|
Integer max_iter |
|
double crash_tol |
0.01 |
|
double ftol |
|
double inf_bound |
|
double inf_step |
|
double rank_tol |
or |
Integer *state |
size |
double *ax |
size nclin |
double *lambda |
size |
Boolean hessian |
Nag_FALSE |
Integer iter |
|
12.2
Description of the Optional Parameters
prob – Nag_ProblemType | | Default |
On entry: specifies the type of objective function to be minimized during the optimality phase. The following are the ten possible values of
and the size of the arrays
h,
kx,
b and
cvec that are required to define the objective function:
|
h,
b and cvec not referenced; |
|
h and b not referenced,
cvec of size n; |
|
h of size , symmetric,
b and cvec not referenced; |
|
h of size , symmetric,
b not referenced,
cvec of size n; |
|
h of size ,
upper trapezoidal,
b and cvec not referenced; |
|
h of size , upper trapezoidal,
b not referenced,
cvec of size n. |
|
h of size ,
b of size m,
cvec not referenced; |
|
h of size ,
b of size m,
cvec of size n; |
|
h of size ,
upper trapezoidal,
b of size m,
cvec not referenced; |
|
h of size ,
upper trapezoidal,
b of size m,
cvec of size n. |
The array
kx of size
n must be supplied for all problem types but need only be initialized for types
,
,
and
. If
, i.e., the objective function is purely linear, the efficiency of
e04ncc may be increased by specifying
.
Constraint:
, , , , , , , , or .
start – Nag_Start | | Default |
On entry: specifies how the initial working set is chosen. With
,
e04ncc chooses the initial working set based on the values of the variables and constraints at the initial point. Broadly speaking, the initial working set will include equality constraints and bounds or inequality constraints that violate or ‘nearly’ satisfy their bounds (to within the value of the optional parameter
; see below).
With
, you must provide a valid definition of every array element of the optional parameter
(see below).
e04ncc will override your specification of
if necessary, so that a poor choice of the working set will not cause a fatal error. For instance, any elements of
which are set to
,
or 4 will be reset to zero, as will any elements which are set to 3 when the corresponding elements of
bl and
bu are not equal. A warm start will be advantageous if a good estimate of the initial working set is available – for example, when
e04ncc is called repeatedly to solve related problems.
Constraint:
or .
list – Nag_Boolean | | Default |
On entry: if the argument settings in the call to e04ncc will be printed.
print_level – Nag_PrintType | | Default |
On entry: the level of results printout produced by
e04ncc. The following values are available:
|
No output. |
|
The final solution. |
|
One line of output for each iteration. |
|
A longer line of output for each iteration with more information (line exceeds 80 characters). |
|
The final solution and one line of output for each iteration. |
|
The final solution and one long line of output for each iteration (line exceeds 80 characters). |
|
As with the Lagrange multipliers, the variables , the constraint values and the constraint status also printed at each iteration. |
|
As with the diagonal elements of the matrix associated with the factorization (see (4) in Section 11.2) of the working set, and the diagonal elements of the upper triangular matrix printed at each iteration. |
Details of each level of results printout are described in
Section 12.3.
Constraint:
, , , , , , or .
outfile – const char[512] | | Default |
On entry: the name of the file to which results should be printed. If then the stdout stream is used.
print_fun – pointer to function | | Default NULL |
On entry: printing function defined by you; the prototype of
is
void(*print_fun)(const Nag_Search_State *st, Nag_Comm *comm);
See
Section 12.3.1 below for further details.
fmax_iter – Integer | | Default |
max_iter – Integer | | Default
|
On entry:
and
specify the maximum number of iterations allowed in the feasibility and optimality phase, respectively.
If you wish to check that a call to e04ncc is correct before attempting to solve the problem in full then may be set to . No iterations will then be performed but all initialization prior to the first iteration will be done and a listing of argument settings will be output, if optional parameter (the default setting).
Constraints:
- ;
- .
crash_tol – double | | Default |
On entry: is used when optional parameter (the default) and e04ncc selects an initial working set. The initial working set will include (if possible) bounds or general inequality constraints that lie within of their bounds. In particular, a constraint of the form will be included in the initial working set if .
Constraint:
.
ftol – double | | Default |
On entry: defines the maximum acceptable
absolute violation in each constraint at a ‘feasible’ point. For example, if the variables and the coefficients in the general constraints are of order unity, and the latter are correct to about 6 decimal digits, it would be appropriate to specify
as
.
e04ncc attempts to find a feasible solution before optimizing the objective function. If the sum of infeasibilities cannot be reduced to zero, e04ncc finds the minimum value of the sum. Let Sinf be the corresponding sum of infeasibilities. If Sinf is quite small, it may be appropriate to raise by a factor of 10 or . Otherwise, some error in the data should be suspected.
Note that a ‘feasible solution’ is a solution that satisfies the current constraints to within the feasibility tolerance .
Constraint:
.
inf_bound – double | | Default |
On entry: defines the ‘infinite’ bound in the definition of the problem constraints. Any upper bound greater than or equal to will be regarded as (and similarly any lower bound less than or equal to will be regarded as ).
Constraint:
.
inf_step – double | | Default |
On entry: specifies the magnitude of the change in variables that will be considered a step to an unbounded solution. (Note that an unbounded solution can occur only when the Hessian is singular and the objective contains an explicit linear term.) If the change in during an iteration would exceed the value of , the objective function is considered to be unbounded below in the feasible region.
Constraint:
.
rank_tol – double | | Default or |
The default value is for problem types QP1, LS1 and LS3 but is for other QP and LS problem types. This option does not apply to FP or LP problem types.
On entry:
enables you to control the estimate of the triangular factor
(see
Section 11.3). If
denotes the function
, the rank of
is defined to be smallest index
such that
.
Constraint:
.
state – Integer * | | Default memory |
On entry:
need not be set if the default option of
is used as
values of memory will be automatically allocated by
e04ncc.
If the option
has been chosen,
must point to a minimum of
elements of memory. This memory will already be available if the
options structure has been used in a previous call to
e04ncc from the calling program, with
and the same values of
n and
nclin. If a previous call has not been made sufficient memory must be allocated to
by you.
When a warm start is chosen
should specify the status of the constraints at the start of the feasibility phase. More precisely, the first
elements of
refer to the upper and lower bounds on the variables, and the next
elements refer to the general linear constraints (if any). Possible values for
are as follows:
|
Meaning |
0 |
The constraint should not be in the initial working set. |
1 |
The constraint should be in the initial working set at its lower bound. |
2 |
The constraint should be in the initial working set at its upper bound. |
3 |
The constraint should be in the initial working set as an equality. This value should only be specified if . |
The values
,
and 4 are also acceptable but will be reset to zero by the function, as will any elements which are set to 3 when the corresponding elements of
bu and
bl are not equal. If
e04ncc has been called previously with the same values of
n and
nclin,
already contains satisfactory information. (See also the description of the optional parameter
.) The function also adjusts (if necessary) the values supplied in
x to be consistent with the values supplied in
.
Constraint:
, for .
On exit: the status of the constraints in the working set at the point returned in
x. The significance of each possible value of
is as follows:
|
Meaning |
|
The constraint violates its lower bound by more than the feasibility tolerance. |
|
The constraint violates its upper bound by more than the feasibility tolerance. |
|
The constraint is satisfied to within the feasibility tolerance, but is not in the working set. |
|
This inequality constraint is included in the working set at its lower bound. |
|
This inequality constraint is included in the working set at its upper bound. |
|
This constraint is included in the working set as an equality. This value of can occur only when . |
|
This corresponds to optimality being declared with being temporarily fixed at its current value. This value of can only occur when . |
ax – double * | | Default memory |
On entry:
nclin values of memory will be automatically allocated by
e04ncc and this is the recommended method of use of
. However you may supply memory from the calling program.
On exit: if , points to the final values of the linear constraints .
lambda – double * | | Default memory |
On entry: values of memory will be automatically allocated by e04ncc and this is the recommended method of use of . However you may supply memory from the calling program.
On exit: the values of the Lagrange multipliers for each constraint with respect to the current working set. The first elements contain the multipliers for the bound constraints on the variables, and the next elements contain the multipliers for the general linear constraints (if any). If (i.e., constraint is not in the working set), is zero. If is optimal, should be non-negative if , non-positive if and zero if .
hessian – Nag_Boolean | | Default |
On entry: controls the contents of the argument
h on return from
e04ncc.
e04ncc works exclusively with the transformed and reordered matrix
(8), and hence extra computation is required to form the Hessian itself. If the optional parameter
,
h contains the Cholesky factor of the matrix
with columns ordered as indicated by
kx (see
Section 5). If
,
h contains the Cholesky factor of the Hessian matrix
, with columns ordered as indicated by
kx.
On exit: the total number of iterations performed in the feasibility phase and (if appropriate) the optimality phase.
12.3
Description of Printed Output
The level of printed output can be controlled with the structure members
and
(see
Section 12.2). If
then the argument values to
e04ncc are listed, whereas the printout of results is governed by the value of
. The default of
provides a single line of output at each iteration and the final result. This section describes all of the possible levels of results printout available from
e04ncc.
To aid interpretation of the printed results, the following convention is used for numbering the constraints: indices 1 to refer to the bounds on the variables, and indices to refer to the general constraints.
When or the following line of output is produced at every iteration. In all cases, the values of the quantities printed are those in effect on completion of the given iteration.
Itn |
is the iteration count. |
Step |
is the step taken along the computed search direction. If a constraint is added during the current iteration, Step will be the step to the nearest constraint. During the optimality phase, the step can be greater than only if the factor is singular (see Section 11.3). |
Ninf |
is the number of violated constraints (infeasibilities). This will be zero during the optimality phase. |
Sinf/Objective |
is the value of the current objective function. If is not feasible, Sinf gives a weighted sum of the magnitudes of constraint violations. If is feasible, Objective is the value of the objective function. The output line for the final iteration of the feasibility phase (i.e., the first iteration for which Ninf is zero) will give the value of the true objective at the first feasible point. |
|
During the optimality phase, the value of the objective function will be non-increasing. During the feasibility phase, the number of constraint infeasibilities will not increase until either a feasible point is found, or the optimality of the multipliers implies that no feasible point exists. Once optimal multipliers are obtained, the number of infeasibilities can increase, but the sum of infeasibilities will either remain constant or be reduced until the minimum sum of infeasibilities is found. |
Norm Gz |
, the Euclidean norm of the reduced gradient with respect to (see Section 11.3). During the optimality phase, this norm will be approximately zero after a unit step. |
If , , or the line of printout is extended to give the following additional information. (Note that this longer line extends over more than 80 characters.)
Jdel |
is the index of the constraint deleted from the working set, along with the designation L (lower bound), U (upper bound), E (equality), F (temporarily fixed variable) or A (artificial constraint). If Jdel is zero, no constraint was deleted. |
Jadd |
is the index of the constraint added to the working set, along with a designation as for Jdel. If Jadd is zero, no constraint was added. |
Bnd |
is the number of simple bound constraints in the current working set. |
Lin |
is the number of general linear constraints in the current working set. |
Art |
is the number of artificial constraints in the working set, i.e., the number of columns of (see Section 11.3). |
Zr |
is the number of columns of (see Section 11.2). Zr is the dimension of the subspace in which the objective function is currently being minimized. The value of Zr is the number of variables minus the number of constraints in the working set; i.e., . |
|
The value of , the number of columns of (see Section 11) can be calculated as . A zero value of implies that lies at a vertex of the feasible region. |
Norm Gf |
is the Euclidean norm of the gradient function with respect to the free variables, i.e., variables not currently held at a bound. |
Cond T |
is a lower bound on the condition number of the working set. |
Cond Rz |
is a lower bound on the condition number of the triangular factor (the first Zr rows and columns of the factor ). |
When
or
more detailed results are given at each iteration. For the setting
additional values output are:
Value of x |
is the value of currently held in x. |
State |
is the current value of associated with . |
Value of Ax |
is the value of currently held in . |
State |
is the current value of associated with . |
Also printed are the Lagrange Multipliers for the bound constraints, linear constraints and artificial constraints.
If then the diagonals of and are also output at each iteration.
When
,
,
,
or
the final printout from
e04ncc includes a listing of the status of every variable and constraint. The following describes the printout for each variable.
Varbl |
gives the name (V) and index , for of the variable. |
State |
gives the state of the variable (FR if neither bound is in the working set, EQ if a fixed variable, LL if on its lower bound, UL if on its upper bound, TF if temporarily fixed at its current value). If Value lies outside the upper or lower bounds by more than the optional parameter (default value , where is the machine precision; see Section 12.2), State will be ++ or -- respectively. |
|
A key is sometimes printed before State to give some additional information about the state of a variable.
A |
Alternative optimum possible. The variable is active at one of its bounds, but its Lagrange Multiplier is essentially zero. This means that if the variable were allowed to start moving away from its bound, there would be no change to the objective function. The values of the other free variables might change, giving a genuine alternative solution. However, if there are any degenerate variables (labelled D), the actual change might prove to be zero, since one of them could encounter a bound immediately. In either case, the values of the Lagrange multipliers might also change. |
D |
Degenerate. The variable is free, but it is equal to (or very close to) one of its bounds. |
I |
Infeasible. The variable is currently violating one of its bounds by more than . |
|
Value |
is the value of the variable at the final iteration. |
Lower bound |
is the lower bound specified for variable . (None indicates that , where is the optional parameter.) |
Upper bound |
is the upper bound specified for variable . (None indicates that , where is the optional parameter.) |
Lagr mult |
is the value of the Lagrange multiplier for the associated bound. This will be zero if State is FR unless and , in which case the entry will be blank. If is optimal, the multiplier should be non-negative if State is LL, and non-positive if State is UL. |
Residual |
is the difference between the variable Value and the nearer of its (finite) bounds and . A blank entry indicates that the associated variable is not bounded (i.e., and ). |
The meaning of the printout for general constraints is the same as that given above for variables, with ‘variable’ replaced by ‘constraint’,
and
replaced by
and
respectively, and with the following change in the heading:
L Con |
the name (L) and index , for of the linear constraint. |
Note that movement off a constraint (as opposed to a variable moving away from its bound) can be interpreted as allowing the entry in the
Residual column to become positive.
Numerical values are output with a fixed number of digits; they are not guaranteed to be accurate to this precision.
If then printout will be suppressed; you can print the final solution when e04ncc returns to the calling program.
12.3.1
Output of results via a user-defined printing function
You may also specify your own print function for output of iteration results and the final solution by use of the function pointer, which has prototype
void (*print_fun)(const Nag_Search_State *st, Nag_Comm *comm)
The rest of this section can be skipped if you wish to use the default printing facilities.
When a user-defined function is assigned to
this will be called in preference to the internal print function of
e04ncc. Calls to the user-defined function are again controlled by means of the
member. Information is provided through
st and
comm, the two structure arguments to
.
If then the results from the last iteration of e04ncc are provided through st. Note that will be called with only if , , , , or . The following members of st are set:
- n – Integer
-
The number of variables.
- nclin – Integer
-
The number of linear constraints.
- iter – Integer
-
The iteration count.
- jdel – Integer
-
Index of constraint deleted from the working set.
- jadd – Integer
-
Index of constraint added to the working set.
- step – double
-
The step taken along the computed search direction.
- ninf – Integer
-
The number of violated constraints (infeasibilities).
- f – double
-
The current value of the objective function if ; otherwise, is a weighted sum of the magnitudes of constraint violations.
- bnd – Integer
-
Number of bound constraints in the working set.
- lin – Integer
-
Number of general linear constraints in the working set.
- nart – Integer
-
Number of artificial constraints in the working set (see
Section 11.3).
- nrank – Integer
-
The rank of the upper triangular matrix
(see
Section 11.3).
- nrz – Integer
-
Number of columns of
(see
Section 11.2).
- norm_gz – double
-
Euclidean norm of the reduced gradient,
(see
Section 11.3).
- norm_gf – double
-
Euclidean norm of the gradient function with respect to the free variables.
- cond_t – double
-
A lower bound on the condition number of the working set.
- cond_r – double
-
A lower bound on the condition number of the triangular factor
(see
Section 11.3).
- x – double *
-
The components of the current point , for .
- ax – double *
-
If , the components of the linear constraints .
- state – Integer *
-
contains the status of the
variables and
general linear constraints. See
Section 12.2 for a description of the possible status values.
- diagt – double *
-
If , the elements in the diagonal of the matrix .
- diagr – double *
-
If , the first elements of the diagonal of the upper triangular matrix .
If then the Lagrange multipliers have been updated and the following members of st are set:
- bnd – Integer
-
The number of bound constraints in the working set.
- kx – Integer *
- bclambda – double *
-
Indices of the bound constraints in the working set, with associated multipliers. is the index of the constraint with multiplier , for .
- lin – Integer
-
The number of linear constraints in the working set.
- kactive – Integer *
- lambda – double *
-
Indices of the linear constraints in the working set, with associated multipliers. is the index of the constraint with multiplier , for .
- nart – Integer
-
The number of artificial constraints in the working set (see
Section 11.3).
- gq – double *
-
, for , hold the multipliers for the artificial constraints.
If then the final result from e04ncc is available and the following members of st are set:
- n – Integer
-
The number of variables.
- nclin – Integer
-
The number of linear constraints.
- iter – Integer
-
The iteration count.
- x – double *
-
The components of the final point , for .
- feasible – Nag_Boolean
-
Will be Nag_TRUE if the final point is feasible.
- f – double
-
The final value of the objective function if is Nag_TRUE; otherwise, the sum of infeasibilities. If the problem is of type FP and is feasible then is set to zero.
- ax – double *
-
If , the components of the final linear constraint activities, .
- state – Integer *
-
Contains the final status of the
variables and
general linear constraints. See
Section 12.2 for a description of the possible status values.
- lambda – double *
-
Contains the final values of the Lagrange multipliers.
- bl – double *
-
Contains the lower bounds.
- bu – double *
-
Contains the upper bounds.
- endstate – Nag_EndState
-
The state of termination of
e04ncc. Possible values of
and their correspondence to the exit value of
fail are:
The relevant members of the structure
comm are:
- it_prt – Nag_Boolean
-
Will be Nag_TRUE when the print function is called with the result of the current iteration.
- sol_prt – Nag_Boolean
-
Will be Nag_TRUE when the print function is called with the final result.
- new_lm – Nag_Boolean
-
Will be Nag_TRUE when the Lagrange multipliers have been updated.
- user – double *
- iuser – Integer *
- p – Pointer
-
Pointers for communication of user information. If used they must be allocated memory either before entry to e04ncc or during a call to . The type Pointer will be void * with a C compiler that defines void * and char * otherwise.