Note:this routine usesoptional parametersto define choices in the problem specification and in the details of the algorithm. If you wish to use default settings for all of the optional parameters, you need only read Sections 1 to 10 of this document. If, however, you wish to reset some or all of the settings please refer to Section 11 for a detailed description of the algorithm and to Section 12 for a detailed description of the specification of the optional parameters.
e04svf is a solver from the NAG optimization modelling suite for problems such as,
Quadratic Programming (QP)
, linear
Semidefinite Programming (SDP)
and semidefinite programming with bilinear matrix inequalities (BMI-SDP).
The routine may be called by the names e04svf or nagf_opt_handle_solve_pennon.
3Description
e04svf serves as a solver for compatible problems stored as a handle. The handle points to an internal data structure which defines the problem and serves as a means of communication for routines in the NAG optimization modelling suite. First, the problem handle is initialized by calling e04raf. Then some of the routines e04ref,e04rff,e04rhf,e04rjf,e04rnfore04rpf may be used to formulate the objective function, (standard) constraints and matrix constraints of the problem. Once the problem is fully set, the handle may be passed to the solver. When the handle is no longer needed, e04rzf should be called to destroy it and deallocate the memory held within. See Section 3.1 in the E04 Chapter Introduction for more details about the NAG optimization modelling suite.
Problems which can be defined this way are, for example, (generally nonconvex)
Quadratic Programming (QP)
(1)
linear semidefinite programming problems (SDP)
(2)
or semidefinite programming problems with bilinear matrix inequalities (BMI-SDP)
(3)
Here , and are -dimensional vectors, is a symmetric matrix, , are -dimensional vectors, is a general rectangular matrix and , are symmetric matrices. The expression stands for a constraint on eigenvalues of a symmetric matrix , namely, all the eigenvalues should be non-negative, i.e., the matrix should be positive semidefinite. See relevant routines in the suite for more details on the problem formulation.
The solver is based on a generalized Augmented Lagrangian method with a suitable choice of standard and matrix penalty functions. For a detailed description of the algorithm see Section 11. Under standard assumptions on the problem (Slater constraint qualification, boundedness of the objective function on the feasible set, see Stingl (2006) for details) the algorithm converges to a local solution. In case of convex problems such as linear SDP or convex QP, this is the global solution. The solver is suitable for both small dense and large-scale sparse problems.
The algorithm behaviour and solver strategy can be modified by various optional parameters (see Section 12) which can be set by e04zmfande04zpf anytime between the initialization of the handle by e04raf and a call to the solver. Once the solver has finished, options may be modified for the next solve. The solver may be called repeatedly with various starting points and/or optional parameters.
There are several optional parameters with a multiple choice where the default choice is (for example, Hessian Density). This value means that the decision over the option is left to the solver based on the structure of the problem. Option getter e04znf can be called to retrieve the choice of these options as well as on any other options.
Optional parameter Task may be used to switch the problem to maximization or to ignore the objective function and find only a feasible point.
Optional parameter Monitor Frequency may be used to turn on the monitor mode of the solver. The solver invoked in this mode pauses regularly even before the optimal point is found to allow monitoring the progress from the calling program. All the important error measures and statistics are available in the calling program which may terminate the solver early if desired (see argument inform).
3.1Structure of the Lagrangian Multipliers
The algorithm works internally with estimates of both the decision variables, denoted by , and the Lagrangian multipliers (dual variables) for standard and matrix constraints, denoted by and , respectively. You may provide initial estimates, request approximations during the run (the monitor mode turned on) and obtain the final values. The Lagrangian multipliers are split into two arrays, the multipliers for (standard) constraints are stored in array u and multipliers for matrix constraints in array ua. Both arrays need to conform to the structure of the constraints.
If the simple bounds were defined (e04rhf was successfully called), the first elements of u belong to the corresponding Lagrangian multipliers, interleaving a multiplier for the lower and for the upper bound for each . If any of the bounds were set to infinity, the corresponding Lagrangian multipliers are set to and may be ignored.
Similarly, the following elements of u belong to multipliers for the linear constraints, if formulated by e04rjf. The organization is the same, i.e., the multipliers for each constraint for the lower and upper bounds are alternated and zeroes are used for any missing (infinite bound) constraint.
A Lagrangian multiplier for a matrix constraint (one block) of dimension is a dense symmetric matrix of the same dimension. All multipliers are stored next to each other in array ua in the same order as the matrix constraints were defined by e04rnfande04rpf. The lower triangle of each is stored in the packed column order (see Section 3.3.2 in the F07 Chapter Introduction). For example, if there are four matrix constraints of dimensions , , , , the dimension of array ua should be . The first elements belong to the packed lower triangle of , followed by six elements of and one element for each and . See for example Section 10 in e04rdf.
3.2Approximation of the Lagrangian Multipliers
By the nature of the algorithm, all inequality Lagrangian multiplier are always kept positive during the computational process. This applies even to Lagrangian multipliers of inactive constraints at the solution. They will only be close to zero although they would normally be equal to zero exactly. This is one of the major differences between results from solvers based on the active set method (such as e04nqf) and others, such as, e04svf or interior point methods. As a consequence, the initial estimate of the multipliers (if provided) might be adjusted by the solver to be sufficiently positive, also the estimates returned during the intermediate exits might only be a very crude approximation to their final values as they do not satisfy all the Karush–Kuhn–Tucker (KKT) conditions.
Another difference is that e04nqf merges multipliers for both lower and upper inequality into one element whose sign determines the inequality because there can be at most one active constraint and multiplier for the inactive is exact zero. Negative multipliers are associated with the upper bounds and positive with the lower bounds. On the other hand, e04svf works with both multipliers at the same time so they are returned in two elements, one for the lower bound, the other for the upper bound (see Section 3.1). An equivalent result can be achieved by subtracting the upper bound multiplier from the lower one. This holds even when equalities are interpreted as two inequalities (see optional parameter Transform Constraints).
4References
Ben–Tal A and Zibulevsky M (1997) Penalty/barrier multiplier methods for convex programming problems SIAM Journal on Optimization7 347–366
Fujisawa K, Kojima M, Nakata K (1997) Exploiting sparsity in primal-dual interior-point method for semidefinite programming Math. Programming79 235–253
Hogg J D and Scott J A (2011) HSL MA97: a bit-compatible multifrontal code for sparse symmetric systems RAL Technical Report. RAL-TR-2011-024
Kočvara M and Stingl M (2003) PENNON – a code for convex nonlinear and semidefinite programming Optimization Methods and Software18(3) 317–333
Kočvara M and Stingl M (2007) On the solution of large-scale SDP problems by the modified barrier method using iterative solvers Math. Programming (Series B)109(2–3) 413–444
Mittelmann H D (2003) An independent benchmarking of SDP and SOCP solvers Math. Programming95 407–430
Stingl M (2006) On the Solution of Nonlinear Semidefinite Programs by Augmented Lagrangian Methods, PhD thesis Institute of Applied Mathematics II, Friedrich–Alexander University of Erlangen–Nuremberg
5Arguments
1: – Type (c_ptr)Input
On entry: the handle to the problem. It needs to be initialized (e.g., by e04raf) and to hold a problem formulation compatible with e04svf. It must not be changed between calls to the NAG optimization modelling suite.
2: – IntegerInput
On entry: , the current number of decision variables in the model.
3: – Real (Kind=nag_wp) arrayInput/Output
Note: intermediate stops take place only if .
On entry: if (the default), , the initial estimate of the variables ; otherwise, x need not be set.
On intermediate exit:
the value of the variables at the end of the current outer iteration.
If , u will not be referenced; otherwise, it needs to match the dimension of constraints defined by e04rhfande04rjf as explained in Section 3.1.
Constraint:
.
5: – Real (Kind=nag_wp) arrayInput/Output
Note: intermediate stops take place only if .
if , u holds Lagrangian multipliers (dual variables) for (standard) constraints, i.e., simple bounds defined by e04rhf and a set of linear constraints defined by e04rjf. Either their initial estimates, intermediate approximations or final values, see Section 3.1.
On entry: if (the default is ), , the initial estimate of the Lagrangian multipliers ; otherwise, u need not be set.
On intermediate exit:
the estimate of the multipliers at the end of the current outer iteration.
On intermediate re-entry: the input is ignored.
On exit: the final value of multipliers .
6: – IntegerInput
On entry: the dimension of array uc. If , uc will not be referenced. This argument is reserved for future releases of the NAG Library which will allow definition of second-order cone constraints. It needs to be set to at the moment.
Constraint:
.
7: – Real (Kind=nag_wp) arrayInput/Output
uc is reserved for future releases of the NAG Library which will allow definition of second-order cone constraints. It is not referenced at the moment.
8: – IntegerInput
On entry: the dimension of array ua. If , ua will not be referenced; otherwise, it needs to match the total number of nonzeros in all matrix Lagrangian multipliers (constraints defined by e04rnfande04rpf) as explained in Section 3.1.
Constraint:
.
9: – Real (Kind=nag_wp) arrayInput/Output
Note: intermediate stops take place only if .
If , ua holds the Lagrangian multipliers for matrix constraints defined by e04rnfande04rpf, see Section 3.1.
On entry: if (the default is ), , the initial estimate of the matrix Lagrangian multipliers ; otherwise, ua need not be set.
On intermediate exit:
the estimate of the matrix multipliers at the end of the outer iteration.
On intermediate re-entry: the input is ignored.
On final exit: the final estimate of the multipliers .
10: – Real (Kind=nag_wp) arrayOutput
On intermediate or final entry: error measures and various indicators (see Section 11 for details) at the end of the current (or final) outer iteration as given in the table below:
On intermediate or final exit: solver statistics at the end of the current (or final) outer iteration as given in the table below. Note that time statistics is provided only if Stats Time is set (the default is ), the measured time is returned in seconds.
Number of the outer iterations.
Total number of the inner iterations.
Total number of the linesearch steps.
Number of evaluations of the augmented Lagrangian , (see (8)).
Number of evaluations of .
Number of evaluations of .
Reserved for future use.
Total running time of the solver.
Total running time of the solver without evaluations of the user's functions and monitoring stops.
Time spent in the inner iterations.
Time spent in Lagrangian multipliers updates.
Time spent in penalty parameters updates.
Time spent in matrix feasibility computation.
Time of evaluations of .
Time of evaluations of .
Time of evaluations of .
Time of factorizations of the Newton system.
Time of factorizations of the matrix constraints.
–
reserved for future use.
12: – IntegerInput/Output
Note: intermediate stops take place only if .
On initial entry: no effect.
On intermediate exit:
.
On intermediate re-entry: if set to , solving the current problem is terminated and the routine returns ; otherwise, the routine continues.
On final exit: .
13: – IntegerInput/Output
On entry: ifail must be set to , or to set behaviour on detection of an error; these values have no effect when no error is detected.
A value of causes the printing of an error message and program execution will be halted; otherwise program execution continues. A value of means that an error message is printed while a value of means that it is not.
If halting is not appropriate, the value or is recommended. If message printing is undesirable, then the value is recommended. Otherwise, the value is recommended since useful values can be provided in some output arguments even when on exit. When the value or is used it is essential to test the value of ifail on exit.
On exit: unless the routine detects an error or a warning has been flagged (see Section 6).
6Error Indicators and Warnings
If on entry or , explanatory error messages are output on the current error message unit (as defined by x04aaf).
Errors or warnings detected by the routine:
Note: in some cases e04svf may return useful information.
The supplied handle does not define a valid handle to the data structure for the NAG optimization modelling suite. It has not been properly initialized or it has been corrupted.
This solver does not support the model defined in the handle.
The problem is already being solved.
On entry, , expected .
Constraint: nvar must match the current number of variables of the model in the handle.
On entry, . nnzu does not match the size of the Lagrangian multipliers for (standard) constraints.
or .
On entry, . nnzu does not match the size of the Lagrangian multipliers for (standard) constraints.
when there are no (standard) constraints.
On entry, . nnzua does not match the size of the Lagrangian multipliers for matrix constraints.
or .
On entry, . nnzua does not match the size of the Lagrangian multipliers for matrix constraints.
when there are no matrix constraints.
On entry, . nnzuc does not match the size of the Lagrangian multipliers for second-order cone constraints.
when there are no second-order cone constraints.
User requested termination during a monitoring step.
The current starting point is unusable.
The starting point , either provided by you (if , the default) or the automatic estimate (if ), must not be extremely infeasible in the matrix constraints (infeasibility of order and higher) and all the functions used in the problem formulation must be evaluatable.
In the unlikely case this error is triggered, it is necessary to provide a better estimate of the initial values.
Outer iteration limit has been reached.
The requested accuracy is not achieved.
If is left to the default, this error indicates numerical difficulties. Consider whether the stopping tolerances (, , ) are set too low or optional parameters affecting the behaviour of the penalty updates (, or ) have been modified inadvisedly. The iteration log should reveal more about the misbehaviour. Providing a different starting point might be of help in certain situations.
The inner subproblem could not be solved to the required accuracy. Inner iteration limit has been reached.
The inner subproblem could not be solved to the required accuracy. Limited progress in the inner subproblem triggered a stop (heuristic inner stop criteria).
The inner subproblem could not be solved to the required accuracy. Line search or another internal component failed.
A problem with the convergence of the inner subproblem is typically a sign of numerical difficulties of the whole algorithm. The inner subproblem might be stopped before reaching the required accuracy because of the , a heuristic detected no progress in the inner iterations (if , default) or if an internal component failed (for example, line search was unable to find a suitable step). The algorithm tries to recover, however, it might give up after several attempts with one of these error messages.
If it occurs in the very early iterations, consider increasing and possibly or which should ease the first iterations. An occurrence in later iterations indicates numerical difficulties typically due to scaling and/or ill-conditioning or the problem is close to infeasible. Reducing the tolerance on the stopping criteria or increasing might be of limited help.
Unable to make progress, the algorithm was stopped.
This error is returned if the solver cannot decrease the duality gap over a range of iterations. This can be due to the scaling of the problem or the problem might be close to primal or dual infeasibility.
The algorithm converged to a suboptimal solution. The full accuracy was not achieved. The solution should still be usable.
This error may be reported only if (default). The solver predicted that it is unable to reach a better estimate of the solution. However, the error measures indicate that the point is a reasonable approximation. Typically, only the norm of the gradient of the Lagrangian (optimality) does not fully satisfy the requested tolerance whereas the others are well below the tolerance.
Setting will disallow this error but it is unlikely that the algorithm would reach a better solution.
The problem was found to be infeasible during preprocessing.
One or more of the constraints (or its part after preprocessing) violates the constraints by more than ().
The problem was found unbounded during preprocessing.
The objective function consists of an unrestricted ray and thus the problem does not have a solution.
The problem seems to be infeasible, the algorithm was stopped.
Whilst the algorithm cannot definitively detect that the problem is infeasible, several indirect indicators suggest that it might be the case.
The problem seems to be unbounded, the algorithm was stopped.
Whilst the algorithm cannot definitively detect that the problem is unbounded, several indirect indicators (such as a rapid decrease in the objective function and a lack of convergence in the inner subproblem) suggest that this might be the case. A good scaling of the objective function is always highly recommended to avoid situations when unusual behavior triggers falsely this error exit.
An unexpected error has been triggered by this routine. Please
contact NAG.
See Section 7 in the Introduction to the NAG Library FL Interface for further information.
Your licence key may have expired or may not have been installed correctly.
See Section 8 in the Introduction to the NAG Library FL Interface for further information.
Dynamic memory allocation failed.
See Section 9 in the Introduction to the NAG Library FL Interface for further information.
If on the final exit, the returned point satisfies Karush–Kuhn–Tucker (KKT) conditions to the requested accuracy (under the default settings close to ) and thus it is a good estimate of a local solution. If , some of the convergence conditions were not fully satisfied but the point still seems to be a reasonable estimate and should be usable. Please refer to Section 11.2 and the description of the particular options.
8Parallelism and Performance
e04svf is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
e04svf makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this routine. Please also consult the Users' Note for your implementation for any additional implementation-specific information.
9Further Comments
9.1Description of the Printed Output
The solver can print information to give an overview of the problem and of the progress of the computation. The output may be send to two independent
unit numbers
which are set by optional parameters Print File and Monitoring File. Optional parameters Print Level, Print Options and Monitoring Level determine the exposed level of detail. This allows, for example, to generate a detailed log in a file while the condensed information is displayed on the screen.
By default (, ), five sections are printed to the standard output: a header, a list of options, problem statistics, an iteration log and a summary.
Header
The header is a message indicating the start of the solver. It should look like:
The list shows all options of the solver, each displayed on one line. The line contains the option name, its current value and an indicator for how it was set. The options left at their defaults are noted by ‘d’, the ones you set are noted by ‘U’ and the options reset by the solver by ‘S’. The solver will automatically set options which are set to or options which are not possible to satisfy in the given context (e.g., requesting DIMACS Measures for a nonlinear problem). Note that the output format is compatible with the file format expected by e04zpf. The output might look as follows:
Outer Iteration Limit = 20 * U
Stop Tolerance 1 = 1.00000E-06 * d
Stop Tolerance 2 = 1.00000E-07 * d
Hessian Density = Dense * S
Problem statistics
The statistics about the size of the problem shows how the problem is represented internally, i.e., it reflects any changes imposed by preprocessing (for example, removed fixed and disabled variables or constant feasible constraints) and problem transformations (see, for example, Presolve Block Detect). It may look like:
Problem Statistics
No of variables 7 (+0 disabled, +1 fixed)
free (unconstrained) 0
bounded 7
No of lin. constraints 8 (+0 disabled, +1 removed)
nonzeroes 41
No of matrix inequal. 4
detected matrix inq. 3 (+1 constant)
linear 3
nonlinear 0
max. dimension 5
detected normal inq. 1
linear 1
nonlinear 0
Objective function Linear
Iteration log
If , the status of each major iteration is condensed to one line. The line shows the major iteration number ( represents the starting point), the current objective value, KKT measures (optimality, feasibility and complementarity), minimal penalty and the number of inner iterations performed. Note that all these values are also available in and . The output might look as follows:
Occasionally, a one letter flag is printed at the end of the line indicating that the inner subproblem was not solved to the required accuracy. The possibilities are M for maximum number of inner iterations, L for difficulties in the line search and ! when a heuristic stop took place. Repeated troubles in the subproblems may lead to . The output below had which was not enough in the first subproblem (first outer iteration).
All KKT measures should normally converge to zero as the algorithm progresses and once the requested accuracy (Stop Tolerance 2) is achieved, the solver stops. However, the convergence is not necessarilly monotonic. The penalty parameters are decreased each major iteration which should improve overall the feasibility of the problem. This also increases the ill-conditioning which might lead to a higher number of inner iterations. A very high number of inner iterations usually signals numerical difficulties. See Section 11 for the algorithmic details.
If , each major iteration produces significantly more detailed output comprising detailed error measures and output from every inner iteration. The output is self-explanatory so is not featured here in detail.
Summary
Once the solver finishes, a detailed summary is produced. An example is shown below:
It starts with the status line of the overall result which matches the ifail value. It is followed by the final objective value and the error measures (including DIMACS Measures if turned on). Iteration counters, numbers of evaluations of the Augmented Lagrangian function and timing of the routine conclude the section. The timing of the algorithm is displayed only if Stats Time is set.
9.2Internal Changes
Internal changes have been made to this routine as follows:
At Mark 26.1: e04svf cannot, at the
moment, handle fixed variables in the model. You are now able to
define such a model and e04svf will return
in this case.
At Mark 27.1: e04svf can now handle fixed variables in the model. Relevant error code has been removed.
For details of all known issues which have been reported for the NAG Library please refer to the Known Issues.
10Example
Semidefinite Programming has many applications in several fields of mathematics, such as, combinatorial optimization, finance, statistics, control theory or structural optimization. However, these applications seldom come in the form of (2) or (3). Usually a reformulation is needed or even a relaxation is employed to achieve the desired formulation. This is also the case of the Lovász function computed in this example. See also e04raf for links to further examples in the NAG optimization modelling suite.
The Lovász function (or also called number) of an undirected graph is an important quantity in combinatorial optimization. It gives an upper bound to Shannon capacity of the graph and is also related to the clique number and the chromatic number of the complement of which are NP-hard problems.
The function can be expressed in various ways, here we use the following:
where and denotes the space of real symmetric matrices. This eigenvalue optimization problem is easy to reformulate as an SDP problem by introducing an artificial variable as follows:
Finally, this can be written as (2)) which is formulated in the example:
where is a matrix of all ones and is a matrix of all zeros except and .
The example also demonstrates how to set the optional parameters and how to retrieve them.
The data file stores the Petersen graph whose is .
For simplicity, we will use the following problem formulation; its connection to (SDP) and (BMI-SDP) is easy to see:
(4)
where , , are functions from to and is a matrix function from to . Here denotes the space of real symmetric matrices and , stands for a constraint on eigenvalues of , namely the matrix should be positive semidefinite. Furthermore, we define the inner product on . The index will be omitted whenever the dimension is clear from the context. Finally, for and , denotes the directional derivative of with respect to in direction .
11.1Overview
The algorithm is based on a (generalized) augmented Lagrangian approach and on a suitable choice of smooth penalty/barrier functions for (standard) inequality constraints and for constraints on matrix eigenvalues. By means of we define a penalty/barrier function for matrix inequalities as follows.
Let have an eigenvalue decomposition where . We define matrix function for as
(5)
Both and satisfy a number of assumptions (see Kočvara and Stingl (2003)) guaranteeing, in particular, that for any ,
(6)
Further in the text, we use simplified notation .
Thus for any , , problem (4) has the same solution as the following augmented problem
(7)
The Lagrangian of (7) can be viewed as a (generalized) augmented Lagrangian of (4):
(8)
where , and
, , are Lagrange multipliers associated with the (standard) inequalities and equalities and the matrix inequality constraints, respectively.
The algorithm combines ideas of the (exterior) penalty and (interior) barrier methods with the augmented Lagrangian method, it can be defined as follows:
Algorithm 1 (Outer Loop)
Let , , and be given. Let , , . For repeat until a stopping criteria or maximum number of iterations is reached:
(i)Find , satisfying
(9)
(ii)Update Lagrangian multipliers
(iii)Update penalty parameters and inner problem stopping criteria
Step (i) of Algorithm 1, further referred as the inner problem, is the most time-consuming and thus the choice of the solver for (9) is critical for the overall efficiency of the method. See Section 11.4 below.
The inequality Lagrangian multipliers update in step (ii) is motivated by the fact that if , solve (9) exactly in iteration , we obtain
Details can be found, for example, in Stingl (2006).
In practise, numerical studies showed that it is not advantageous to do the full updates of multipliers , . Firstly, big changes in the multipliers may lead to a large number of iterations in subsequent solution of (9) and, secondly, the multipliers might become ill-conditioned after a few steps and the algorithm suffers from numerical instabilities. To overcome these difficulties, a restricted update is performed instead.
New Lagrangian multipliers for (standard) inequalities , for are limited not to violate the following bound
A similar strategy is applied to the matrix multipliers as well. For (see Umat Update Restriction) set
The penalty parameters in step (iii) are updated by some constant factor dependent on the initial penalty parameters and P Update Speed. The update process is stopped when and are reached (see P Min, Pmat Min).
Additional details about the multiplier and penalty update strategies, as well as local and global convergence properties under standard assumptions can be found in an extensive study Stingl (2006).
11.2Stopping Criteria
Algorithm 1 is stopped when all the stopping criteria are satisfied to the requested accuracy, these are:
(10)
(11)
and these based on Karush–Kuhn–Tucker (KKT) error measures, to keep the notation simple, formulation (4) is assumed and iteration index is dropped:
Note that if , only the feasibility is taken into account.
There is an option for linear SDP problems to switch from stopping criteria based on the KKT conditions to DIMACS Measures, see Mittelmann (2003). This is the default choice. To keep the notation readable, these are defined here only for the following simpler formulation of linear SDP rather than (2):
(15)
In this case the algorithm stops when:
(16)
where denote the adjoint operator to , .
They can be viewed as a scaled version of the KKT conditions. represents the (scaled) norm of the gradient of the Lagrangian, and the dual and primal infeasibility, respectively, and and measure the duality gap and the complementary slackness. Note that in this solver by definition and is automaticaly zero because the formulation involves slack variables which are not used here.
11.3Choice of penalty functions and
To treat the (standard) inequality constraints , we use the penalty/barrier function proposed by Ben–Tal and Zibulevsky (1997):
with default .
The choice of (and thus of ) is motivated by the complexity of the evaluation of and its derivatives. If is defined as
it is possible to avoid the explicit eigenvalue decomposition in (5) as it can be seen in the formulae below (note that index is omitted):
(17)
where
(18)
For details follow Kočvara and Stingl (2003). Note that, in particular, formula (17) requires nontrivial computational resources even if careful handling of the sparsity of partial derivatives of is implemented. e04svf uses a set of strategies described in Fujisawa et al. (1997) adapted for parallel computation.
11.4Solution of the inner problem
This section describes solving of the inner problem (step (i) of Algorithm 1). We attempt to find an approximate solution of the following system (in and ) up to the given precision :
(19)
where the penalty parameters , as well as the Lagrangian multipliers and are fixed.
A linesearch SQP framework is used due to its desirable convergence properties. It can be stated as follows.
Algorithm 2 (Inner Loop)
Let , be given (typically as the solution from the previous outer iteration), , , , and fixed. For
System (20) is solved by the factorization routine MA97 (see Hogg and Scott (2011), in combination with an inertia correction strategy described in Stingl (2006). The step length selection is guided by Linesearch Mode.
If there are no equality constraints in the problem, the unconstrained minimization in Step (i) of Algorithm 1 simplifies to the modified Newton method with line-search (for details, see Kočvara and Stingl (2003)). Alternatively, the equality constraints can be converted to two inequalities which would be treated with the remaining constraints (see Transform Constraints).
12Optional Parameters
Several optional parameters in e04svf define choices in the problem specification or the algorithm logic. In order to reduce the number of formal arguments of e04svf these optional parameters have associated default values that are appropriate for most problems. Therefore, you need only specify those optional parameters whose values are to be different from their default values.
The remainder of this section can be skipped if you wish to use the default values for all optional parameters.
The optional parameters can be changed by calling e04zmf anytime between the initialization of the handle and the call to the solver. Modification of the optional parameters during intermediate monitoring stops is not allowed. Once the solver finishes, the optional parameters can be altered again for the next solve.
If any options are set by the solver (typically those with the choice of ), their value can be retrieved by e04znf. If the solver is called again, any such arguments are reset to their default values and the decision is made again.
The following is a list of the optional parameters available. A full description of each optional parameter is provided in Section 12.1.
For each option, we give a summary line, a description of the optional parameter and details of constraints.
The summary line contains:
the keywords, where the minimum abbreviation of each keyword is underlined;
a parameter value,
where the letters , and denote options that take character, integer and real values respectively;
the default value, where the symbol is a generic notation for machine precision (see x02ajf).
All options accept the value to return single options to their default states.
Keywords and character values are case and white space insensitive.
Defaults
This special keyword may be used to reset all optional parameters to their default values. Any value given with this keyword will be ignored.
DIMACS Measures
Default
If the problem is a linear semidefinite programming problem, this parameter specifies if DIMACS error measures (see Section 11.2) should be computed and/or checked. In other cases, this option reverts to automatically.
Constraint: , or .
Hessian Density
Default
This optional parameter guides the solver on how the Hessian matrix of augmented Lagrangian should be built. Option leaves the decision to the solver and it is the recommended option. Setting it to bypasses the autodetection and the Hessian is always built as a dense matrix. Option instructs the solver to use a sparse storage and factorization of the matrix if possible.
Constraint: , or
Infinite Bound Size
Default
This defines the ‘infinite’ bound in the definition of the problem constraints. Any upper bound greater than or equal to will be regarded as (and similarly any lower bound less than or equal to will be regarded as ). Note that a modification of this optional parameter does not influence constraints which have already been defined; only the constraints formulated after the change will be affected.
Constraint: .
Initial P
Default
This optional parameter defines the choice of the penalty optional parameters , , see Algorithm 1.
The penalty optional parameters are chosen automatically as set by optional parameter Init Value P, Init Value Pmat and subject to automatic scaling. Note that might be increased so that the penalty function is defined for all matrix constraints at the starting point.
The penalty optional parameters are kept from the previous run of the solver if possible. If not, this options reverts to . Note that even if the matrix penalty optional parameters are the same as in the previous run, they are still subject to a possible increase so that the penalty function is well defined at the starting point.
Constraint: or .
Initial U
Default
This parameter guides the solver on which initial Lagrangian multipliers are to be used.
The Lagrangian multipliers are chosen automatically as set by automatic scaling.
The values of arrays u and ua (if provided) are used as the initial Lagrangian multipliers subject to automatic adjustments. If one or the other array is not provided, the choice for missing data is as in .
The Lagrangian multipliers are kept from the previous run of the solver. If this option is set for the first run or optional parameters change the approach of the solver, the choice automatically reverts to . This might be useful if the solver is hot started, for example, to achieve higher precision of the solution.
Constraint: , or .
Initial X
Default
This parameter guides which starting point is to be used.
The starting point is chosen automatically so that it satisfies simple bounds on the variables or as a zero vector. Input of argument x is ignored.
Initial values of argument x are used as a starting point.
Constraint: or .
Init Value P
Default
This parameter defines the value , the initial penalty optional parameter for (standard) inequalities. A low value of the penalty causes the solution of the inner problem to be closer to the feasible region and thus to the desirable result. However, it also increases ill-conditioning of the system. It is not advisable to set the penalty too low unless a good starting point is provided.
Constraint: .
Init Value Pmat
Default
The value of this option suggests , the initial penalty optional parameter for matrix inequalities. It is similar to Init Value P (and the same advice applies), however, gets increased automatically if the matrix constraints are more infeasible than the actual penalty optional parameter.
Constraint: .
Inner Iteration Limit
Default
The maximum number of the inner iterations (Newton steps) to be performed by Algorithm 2 in each outer iteration. Setting the option too low might lead to . Values higher than are unlikely to improve convergence.
Constraint: .
Inner Stop Criteria
Default
The precision for the solution of the inner subproblem is determined in Algorithm 1 and under typical circumstances Algorithm 2 is expected to reach this precision within the given Inner Iteration Limit. If any problems are detected and , Algorithm 2 is allowed to stop before reaching the requested precision or the Inner Iteration Limit. This usually saves many unfruitful iterations and the solver may recover in the following iterations. If you suspect that the heuristic problem detection is not suitable for your problem, setting disallows such behaviour.
Constraint: or .
Inner Stop Tolerance
Default
This option sets the required precision for the first inner problem solved by Algorithm 2. The precison of the solution of the inner problem does not need to be very high in the first outer iterations and it is automatically adjusted through the outer iterations to reach the optimality limit in the last one.
Setting too restrictive (too low) causes an increase of the number of inner iterations needed in the first outer iterations and might lead to . In certain cases it might be helpful to use a more relaxed (higher) and increase P Update Speed which should reduce the number of inner iterations needed at the beginning of the computation in exchange for a possibly higher number of the outer iterations.
Constraint: .
Linesearch Mode
Default
This controls the step size selection in Algorithm 2. If (the default for linear problems), unit steps are taken where possible and the step shortening takes place only to avoid undefined regions for the matrix penalty function (see (17)). This may be used for linear problems but it is not recommended for any nonlinear ones. If , Armijo backtracking linesearch is used instead which is a fairly basic linesearch. If , a cubic safe guarded linesearch based on Goldstein condition is employed, this is the recommended (and default) choice for nonlinear problems.
Constraint: , , or .
List
Default
This parameter may be set to if you wish to turn on printing of each optional parameter specification as it is supplied.
Constraint: or
Monitor Frequency
Default
If , the solver returns to you at the end of every th outer iteration. During these intermediate exits, the current point x and Lagrangian multipliers u, ua (if requested) are provided as well as the statistics and error measures (rinfo, stats). Argument inform helps to distinguish between intermediate and final exits and also allows immediate termination.
If , the solver stops only once on the final point and no intermediate exits are made.
Constraint: .
Monitoring File
Default
If , the
unit number
for the secondary (monitoring) output. If set to , no secondary output is provided. The following information is output to the unit:
–a listing of the optional parameters;
–problem statistics, the iteration log and the final status as set by Monitoring Level.
Constraint: .
Monitoring Level
Default
This parameter sets the amount of information detail that will be printed by the solver to the secondary output. The meaning of the levels is the same as with Print Level.
Constraint: .
Outer Iteration Limit
Default
The maximum number of the outer iterations to be performed by Algorithm 1. If , no iteration is performed, only quantities needed in the stopping criteria are computed and returned in rinfo. This might be useful in connection with and to check optimality of the given point. However, note that the rules for possible modifications of the starting point still apply, see u and ua. Setting the option too low might lead to .
Constraint: .
P Min
Default
This controls , the lowest possible penalty value used for (standard) inequalities. In general, very small values of the penalty optional parameters cause ill-conditioning which might lead to numerical difficulties. On the other hand, very high prevents the algorithm from reaching the requested accuracy on the feasibility. Under normal circumstances, the default value is recommended.
Constraint: .
Pmat Min
Default
This is an equivalent of P Min for the minimal matrix penalty optional parameter . The same advice applies.
Constraint: .
Preference
Default
This option affects how contributions from the matrix constraints (17) to the system Hessian matrix are computed. The default option of should be suitable in most cases. However, dealing with matrix constraints of a very high dimension may cause noticable memory overhead and switching to may be required.
Constraint: or .
Presolve Block Detect
Default
If , the matrix constraints are checked during preprocessoring to determine if they can be split into smaller independent ones, thus speeding up the solver.
Constraint: or .
Print File
Default
If , the
unit number
for the primary output of the solver. If , the primary output is completely turned off independently of other settings. The default value is the advisory message unit number as defined by x04abf at the time of the optional parameters initialization, e.g., at the initialization of the handle. The following information is output to the unit:
–a listing of optional parameters if set by Print Options;
–problem statistics, the iteration log and the final status from the solver as set by Print Level.
Constraint: .
Print Level
Default
This parameter defines how detailed information should be printed by the solver to the primary output.
Output
No output from the solver
Only the final status and the objective value
Problem statistics, one line per outer iteration showing the progress of the solution, final status and statistics
As level but detailed output of the outer iterations is provided and brief overview of the inner iterations
,
As level but details of the inner iterations are printed as well
Constraint: .
Print Options
Default
If , a listing of optional parameters will be printed to the primary output.
Constraint: or .
P Update Speed
Default
This option affects the rate at which the penalty optional parameters are updated (Algorithm 1, step (iii)) and thus indirectly influences the overall number of outer iterations. Its value can be interpretted as the typical number of outer iterations needed to get from the initial penalty values , half-way to the and . Values smaller than causes a very agressive penalty update strategy which might lead to the increased number of inner iterations and possibly to numerical difficulties. On the other hand, values higher than produce a relatively conservative approach which leads to a higher number of the outer iterations.
If the solver encounters difficulties on your problem, a higher value might help. If your problem is working fine, setting a lower value might increase the speed.
Constraint: .
Stats Time
Default
This parameter turns on timings of various parts of the algorithm to give a better overview of where most of the time is spent. This might be helpful for a choice of different solving approaches. It is possible to choose between CPU and wall clock time. Choice is equivalent to .
Constraint: , , or .
Stop Criteria
Default
If , the solver is allowed to stop prematurely with a suboptimal solution, , if it predicts that a better estimate of the solution cannot be reached. This is the recommended option.
Constraint: or .
Stop Tolerance 1
Default
This option defines used as a tolerance for the relative duality gap (10) and the relative precision (11), see Section 11.2.
Constraint: .
Stop Tolerance 2
Default
This option sets the value which is used for optimality (12) and complementarity (14) tests from KKT conditions or if for all DIMACS error measures instead. See Section 11.2.
Constraint: .
Stop Tolerance Feasibility
Default
This parameter places an acceptance limit on the feasibility of the solution (13), . See Section 11.2.
Constraint: .
Task
Default
This parameter specifies the required direction of the optimization. If , the objective function (if set) is ignored and the algorithm stops as soon as a feasible point is found with respect to the given tolerance. If no objective function was set, Task reverts to automatically.
Constraint: , or .
Transform Constraints
Default
This parameter controls how equality constraints are treated by the solver. If , all equality constraints from (4) are treated as two inequalities and , see Section 11.4. This is the default and the only option in this release for equality constrained problems.
Constraint: , or .
U Update Restriction
Default
This defines the value giving the bounds on the updates of Lagrangian multipliers for (standard) inequalities between the outer iterations. Values close to limit the changes of the multipliers and serve as a kind of smoothing, lower values allow more significant changes.
Based on numerical experience, big variation in the multipliers may lead to a large number of iterations in the subsequent step and might disturb the convergence due to ill-conditioning.
It might be worth experimenting with the value on your particular problem. Mid range values are recommended over the more extremal ones.
Constraint: .
Umat Update Restriction
Default
This is an equivalent of U Update Restriction for matrix constraints, denoted as in Section 11.1. The advice above applies equally.