NAG CL Interface e04ptc (handle_solve_socp_ipm)
Note:this function usesoptional parametersto define choices in the problem specification and in the details of the algorithm. If you wish to use default settings for all of the optional parameters, you need only read Sections 1 to 10 of this document. If, however, you wish to reset some or all of the settings please refer to Section 11 for a detailed description of the algorithm and to Section 12 for a detailed description of the specification of the optional parameters.
e04ptc is a solver from the NAG optimization modelling suite for large-scale Second-order Cone Programming (SOCP) problems. It is based on an interior point method (IPM).
where $\mathcal{K}={\mathcal{K}}^{{n}_{1}}\times \cdots \times {\mathcal{K}}^{{n}_{r}}\times {\mathbb{R}}^{{n}_{l}}$ is a Cartesian product of $r$ quadratic (second-order type) cones and ${n}_{l}$-dimensional real space, and $n={\sum}_{i=1}^{r}{n}_{i}+{n}_{l}$ is the number of decision variables. Here $c$, $x$, ${l}_{x}$ and ${u}_{x}$ are $n$-dimensional vectors, $A$ is an $m\times n$ sparse matrix, and ${l}_{A}$ and ${u}_{A}$ are $m$-dimensional vectors. Note that $x\in \mathcal{K}$ partitions subsets of variables into quadratic cones and each ${\mathcal{K}}^{{n}_{i}}$ can be either a quadratic cone or a rotated quadratic cone. These are defined as follows:
e04ptc solves SOCP problems stored as a handle. The handle points to an internal data structure which defines the problem and serves as a means of communication for functions in the NAG optimization modelling suite. First, the problem handle is initialized by calling e04rac. Then some of the functions e04rbc,e04rec,e04rfc,e04rhc,e04rjc,e04rscore04rtc may be called to formulate the quadratic cones, linear objective function, quadratic objective function, quadratic constraints, bounds of the variables, and the block of linear constraints, respectively. Alternatively, the whole model can be loaded from a file by e04sac. When the handle is no longer needed, e04rzc should be called to destroy it and deallocate the memory held within. See Section 4.1 in the E04 Chapter Introduction for more details about the NAG optimization modelling suite.
The solver method can be modified by various optional parameters (see Section 12) which can be set by e04zmcande04zpc anytime between the initialization of the handle and a call to the solver. Once the solver has finished, options may be modified for the next solve. The solver may be called repeatedly with various optional parameters.
The optional parameter ${\mathbf{Task}}$ may be used to switch the problem to maximization or to ignore the objective function and find only a feasible point.
Several options may have significant impact on the performance of the solver. Even if the defaults were chosen to suit the majority of problems, it is recommended that you experiment in order to find the most suitable set of options for a particular problem, see Sections 11 and 12 for further details.
3.1Structure of the Lagrangian Multipliers
The algorithm works internally with estimates of both the decision variables, denoted by $x$, and the Lagrangian multipliers (dual variables), denoted by $u$ for bound and linear constraints, and $uc$ for quadratic cone constraints.
If the simple bounds have been defined (e04rhc was successfully called), the first $2n$ elements of $u$ belong to the corresponding Lagrangian multipliers, interleaving a multiplier for the lower and the upper bound for each ${x}_{i}$. If any of the bounds were set to infinity, the corresponding Lagrangian multipliers are set to $0$ and may be ignored.
Similarly, the following $2m$ elements of $u$ belong to multipliers for the linear constraints (if e04rjc has been successfully called). The organization is the same, i.e., the multipliers for each constraint for the lower and upper bounds are alternated and zeros are used for any missing (infinite bound) constraints.
If convex quadratic constraints have been defined successfully by e04rscore04rtc, denote the number of such constraints as $\mathrm{nq}$, then the following $2\mathrm{nq}$ elements of $u$ belong to multipliers for the convex quadratic constraints. The organization is the same as linear constraints.
Some solvers merge multipliers for both lower and upper inequality into one element whose sign determines the inequality. Negative multipliers are associated with the upper bounds and positive with the lower bounds. An equivalent result can be achieved with this storage scheme by subtracting the upper bound multiplier from the lower one. This is also consistent with equality constraints.
Finally, the elements of $uc$ are the corresponding Lagrangian multipliers for the variables in the quadratic cone constraints that have been defined by e04rbc. All multipliers are stored next to each other in array uc in the same order as the cone constraints were defined by e04rbc. For example, if the first cone constraint contains variables ${x}_{4}$, ${x}_{2}$, ${x}_{3}$ and the second cone constraint contains variables ${x}_{1}$, ${x}_{7}$, ${x}_{6}$, ${x}_{5}$, then the dimension of array uc must be $7$ and the first $3$ elements are the corresponding Lagrangian multipliers for the cone composed of ${x}_{4}$, ${x}_{2}$, ${x}_{3}$, followed by $4$ elements that are the corresponding Lagrangian multipliers for the cone of ${x}_{1}$, ${x}_{7}$, ${x}_{6}$, ${x}_{5}$.
4References
Alizadeh F and Goldfarb D (2003) Second-order cone programming Mathematical programming95(1) 3–51
Andersen E D, Roos C and Terlaky T (2003) On implementing a primal-dual interior-point method for conic quadratic optimization Mathematical programming95(2) 249–277
Goldfarb D and Scheinberg K (2005) Product-form Cholesky factorization in interior point methods for second-order cone programming Mathematical programming103(1) 153–179
Goldman A J and Tucker A W (1956) Theory of linear programming Linear inequalities and related systems38 53–97
Hogg J D and Scott J A (2011) HSL MA97: a bit-compatible multifrontal code for sparse symmetric systems RAL Technical Report. RAL-TR-2011-024
HSL (2011) A collection of Fortran codes for large-scale scientific computation http://www.hsl.rl.ac.uk/
Karypis G and Kumar V (1998) A fast and high quality multilevel scheme for partitioning irregular graphs SIAM J. Sci. Comput.20(1) 359–392
Lobo M S, Vandenberghe L, Boyd S and Levret H (1998) Applications of second-order cone programming Linear Algebra and its Applications284(1-3) 193–228
Lustig I J, Marsten R E and Shanno D F (1992) On implementing Mehrotra's predictor–corrector interior-point method for linear programming SIAM J. Optim.2(3) 435–449
Mehrotra S (1992) On the implementation of a primal-dual interior point method SIAM J. Optim.2 575–601
Nesterov Y E and Todd M J (1997) Self-scaled barriers and interior-point methods for convex programming Mathematics of Operations research22(1) 1–42
Nesterov Y E and Todd M J (1998) Primal-dual interior-point methods for self-scaled cones SIAM J. Optim.8(2) 324–364
Nocedal J and Wright S J (2006) Numerical Optimization (2nd Edition) Springer Series in Operations Research, Springer, New York
Sturm J F (2002) Implementation of Interior Point Methods for Mixed Semidefinite and Second Order Cone Optimization Problems Optimization Methods and Software17(6) 151–171
Xu X, Hung P-F and Ye Y (1996) A simplified homogeneous and self-dual linear programming algorithm and its implementation Annals of Operations Research62(1) 151–171
5Arguments
1: $\mathbf{handle}$ – void *Input
On entry: the handle to the problem. It needs to be initialized (e.g., by e04rac) and to hold a problem formulation compatible with e04ptc. It must not be changed between calls to the NAG optimization modelling suite.
2: $\mathbf{nvar}$ – IntegerInput
On entry: $n$, the current number of decision variables $x$ in the model.
If ${\mathbf{nnzu}}=0$, u will not be referenced; otherwise, it needs to match the dimension of constraints defined by e04rhcande04rjc as explained in Section 3.1.
Note: if ${\mathbf{nnzu}}>0$, u holds Lagrange multipliers (dual variables) for the bound constraints and linear constraints. If ${\mathbf{nnzu}}=0$, u will not be referenced and may be NULL.
On entry: the input of u is reserved for future releases of the NAG Library and it is ignored at the moment.
If ${\mathbf{nnzuc}}=0$, uc will not be referenced; otherwise, it needs to match the total number of cone variables defined by e04rbc as explained in Section 3.1.
Note: if ${\mathbf{nnzuc}}>0$, uc holds Lagrange multipliers (dual variables) for second-order cones as defined by e04rbc. If ${\mathbf{nnzuc}}=0$, uc will not be referenced and may be NULL.
On entry: the input of uc is reserved for future releases of the NAG Library and it is ignored at the moment.
On exit: the final values of the variables $\mathit{uc}$.
On exit: solver statistics as given in the table below. Note that times are measured in seconds, see optional parameter ${\mathbf{Stats\; Time}}$.
$0$
Number of iterations.
$1$
Not referenced.
$2$
Total number of iterative refinements performed.
$3$
Value of the perturbation added to the diagonal in the normal equation formulation or the augmented system formulation.
$4$
Total number of factorizations performed.
$5$
Total time spent in the solver.
$6$
Time spent in the presolve phase.
$7$
Time spent in the last iteration.
$8$
Total time spent factorizing the system matrix.
$9$
Total time spent backsolving the system matrix.
$10$
Not referenced.
$11$
Time spent in the initialization phase.
$12$
Number of nonzeros in the system matrix.
$13$
Number of nonzeros in the system matrix factor.
$14$
Maximum error of the backsolve.
$15$
Number of columns in $A$ considered dense by the solver.
$16$
Number of conic constraints considered dense by the solver.
$17$–$99$
Reserved for future use.
10: $\mathbf{monit}$ – function, supplied by the userExternal Function
monit is provided to enable you to monitor the progress of the optimization. It is invoked at the end of every $i$th iteration where $i$ is given by the optional parameter ${\mathbf{SOCP\; Monitor\; Frequency}}$ (the default is $0$, monit is not called).
On entry: the handle to the problem as provided on entry to e04ptc. It may be used to query the model during the solve, and extract the current approximation of the solution by e04rxc.
On entry: solver statistics at the end of the current iteration as described in stats, however, elements $3$, $5$, $9$, $10$ and $15$ refer to the quantities in the last iteration rather than accumulated over all iterations through the whole algorithm run.
4: $\mathbf{comm}$ – Nag_Comm *
Pointer to structure of type Nag_Comm; the following members are relevant to monit.
user – double *
iuser – Integer *
p – Pointer
The type Pointer will be void *. Before calling e04ptc you may allocate memory and initialize these pointers with various quantities for use by monit when called from e04ptc (see Section 3.1.1 in the Introduction to the NAG Library CL Interface).
5: $\mathbf{inform}$ – Integer *Input/Output
On entry: a non-negative value.
On exit: must be set to a value describing the action to be taken by the solver on return from monit. Specifically, if the value is negative the solution of the current problem will terminate immediately with ${\mathbf{fail}}\mathbf{.}\mathbf{code}=$NE_USER_STOP; otherwise, computations will continue.
11: $\mathbf{comm}$ – Nag_Comm *
The NAG communication argument (see Section 3.1.1 in the Introduction to the NAG Library CL Interface).
12: $\mathbf{fail}$ – NagError *Input/Output
The NAG error argument (see Section 7 in the Introduction to the NAG Library CL Interface).
6Error Indicators and Warnings
NE_ALLOC_FAIL
Dynamic memory allocation failed.
See Section 3.1.2 in the Introduction to the NAG Library CL Interface for further information.
NE_BAD_PARAM
On entry, argument $\u27e8\mathit{\text{value}}\u27e9$ had an illegal value.
NE_DIM_MATCH
On entry, ${\mathbf{nnzu}}=\u27e8\mathit{\text{value}}\u27e9$. nnzu does not match the size of the Lagrangian multipliers for constraints. The correct value is $0$ for no constraints.
On entry, ${\mathbf{nnzu}}=\u27e8\mathit{\text{value}}\u27e9$. nnzu does not match the size of the Lagrangian multipliers for constraints. The correct value is either $0$ or $\u27e8\mathit{\text{value}}\u27e9$.
On entry, ${\mathbf{nnzuc}}=\u27e8\mathit{\text{value}}\u27e9$. nnzuc does not match the size of the Lagrangian multipliers for second-order cone constraints. ${\mathbf{nnzuc}}=0$ when there are no second-order cone constraints.
On entry, ${\mathbf{nnzuc}}=\u27e8\mathit{\text{value}}\u27e9$. nnzuc does not match the size of the Lagrangian multipliers for second-order cone constraints. The correct value is either $0$ or $\u27e8\mathit{\text{value}}\u27e9$.
NE_HANDLE
The supplied handle does not define a valid handle to the data structure for the NAG optimization modelling suite. It has not been properly initialized or it has been corrupted.
NE_INFEASIBLE
The problem was found to be primal infeasible.
NE_INTERNAL_ERROR
An internal error has occurred in this function. Check the function call and any array sizes. If the call is correct then please contact NAG for assistance.
See Section 7.5 in the Introduction to the NAG Library CL Interface for further information.
NE_NO_IMPROVEMENT
No progress, stopping early.
The solver predicted that it is unable to make further progress and stopped prematurely. This might be due to the scaling of the problem, its conditioning or numerical difficulties.
NE_NO_LICENCE
Your licence key may have expired or may not have been installed correctly.
See Section 8 in the Introduction to the NAG Library CL Interface for further information.
NE_PHASE
The problem is already being solved.
NE_REF_MATCH
On entry, ${\mathbf{nvar}}=\u27e8\mathit{\text{value}}\u27e9$, expected $\mathrm{value}=\u27e8\mathit{\text{value}}\u27e9$.
Constraint: nvar must match the current number of variables of the model in the handle.
NE_SETUP_ERROR
This solver does not support the model defined in the handle.
NE_TIME_LIMIT
The solver terminated after the maximum time allowed was exceeded.
Maximum number of seconds exceeded. Use optional parameter ${\mathbf{Time\; Limit}}$ to reset the limit.
NE_TOO_MANY_ITER
Maximum number of iterations exceeded.
NE_UNBOUNDED
The problem was found to be dual infeasible.
This error indicates that the primal problem is unbounded or infeasible.
NE_USER_STOP
User requested termination during a monitoring step.
NW_NOT_CONVERGED
Suboptimal solution.
The solver predicted that it is unable to reach a better estimate of the solution. However, the error measures indicate that the point is a reasonable approximation.
7Accuracy
The accuracy of the solution is determined by optional parameters ${\mathbf{SOCP\; Stop\; Tolerance}}$ and ${\mathbf{SOCP\; Stop\; Tolerance\; 2}}$
If ${\mathbf{fail}}\mathbf{.}\mathbf{code}=$ NE_NOERROR on the final exit, the returned point satisfies Karush–Kuhn–Tucker (KKT) conditions to the requested accuracy (under the default settings close to $\sqrt{\epsilon}$) and thus it is a good estimate of the solution. If ${\mathbf{fail}}\mathbf{.}\mathbf{code}=$NW_NOT_CONVERGED, some of the convergence conditions were not fully satisfied but the point is a reasonable estimate and still usable. Please refer to Section 11.5 and the description of the particular options.
8Parallelism and Performance
Background information to multithreading can be found in the Multithreading documentation.
e04ptc is threaded by NAG for parallel execution in multithreaded implementations of the NAG Library.
e04ptc makes calls to BLAS and/or LAPACK routines, which may be threaded within the vendor library used by this implementation. Consult the documentation for the vendor library for further information.
Please consult the X06 Chapter Introduction for information on how to control and interrogate the OpenMP environment used within this function. Please also consult the Users' Note for your implementation for any additional implementation-specific information.
9Further Comments
9.1Formulating Problems as SOCPs
This SOCP solver can solve several common convex problems covering a large variety of applications. However, in certain cases a reformulation is needed. In this section, we cover QCQP, norm minimization problems and robust linear programming, see Alizadeh and Goldfarb (2003) and Lobo et al. (1998) for further details.
Convex Quadratically Constrained Quadratic Programming (QCQP) appears in applications such as modern portfolio theory and wireless sensor network localization. The general convex QCQP problem has the following form
where
${P}_{i}\in {\mathbb{R}}^{n\times n}$,
for $i=0,\dots ,p$, are symmetric and positive semidefinite matrices, hence there exist matrices ${F}_{i}\in {\mathbb{R}}^{{k}_{i}\times n}$ such that
In many practical problems this decomposition is already available. Otherwise it needs to be computed, for example, via Cholesky or eigenvalue decomposition, such as f07fdc for positive definite matrices, and f07kdcorf08fcc for positive semidefinite matrices. Let's introduce new artificial variables ${t}_{i}$ such that ${t}_{i}+{q}_{i}^{\mathrm{T}}x+{r}_{i}=0$, then we have an equivalent characterization of cone constraints as
$$\frac{1}{2}{x}^{\mathrm{T}}{P}_{i}x+{q}_{i}^{\mathrm{T}}x+{r}_{i}\le 0\text{\hspace{1em} \u27f7 \hspace{1em}}{t}_{i}+{q}_{i}^{\mathrm{T}}x+{r}_{i}=0\text{\hspace{0.5em} and \hspace{0.5em}}\frac{1}{2}{\Vert {F}_{i}x\Vert}_{2}^{2}\le {t}_{i}\text{.}$$
By the definition of rotated quadratic cone (3) we have
Two functions e04rscande04rtc can be used to define convex quadratic objective function and constraints directly and the solver will tranform them to SOCP automatically. If matrix (${P}_{i}$) in quadratic term is close to singular, it's also recommended that the users follow the procedure above to factorize and transform to SOCP so that small eigenvalues of the matrix can be taken out accordding to the users' needs to achieve numerical stability in the solver.
9.1.2Norm Minimization Problems
Consider the following problem that minimizes the sum of Euclidean norms
where ${A}_{i}\in {\mathbb{R}}^{{n}_{i}\times n}$ and ${b}_{i}\in {\mathbb{R}}^{{n}_{i}}$. Problem (6) can be formulated as SOCP by introducing $r$ auxiliary variables ${t}_{i}$, for $i=1,\dots ,r$, and adding constraints
where $A\in {\mathbb{R}}^{m\times n}$ and $b\in {\mathbb{R}}^{m}$, introduce an auxiliary variable $u\in {\mathbb{R}}^{m}$ such that $Ax+b=u$, then problem (11) is transformed to
where $c$ and ${b}_{i}$ are given but there is some uncertainty in parameter ${a}_{i}$. In such a situation you might want to solve problem (15) in the worst-case sense, i.e., find the best solution $x$ with respect to the most adverse choice of ${a}_{i}$. Introducing uncertainty set to some or all your data and solving the problem in the worst-case scenario helps to avoid high sensitivity of your results even for a small perturbation in the input data. Assume ${a}_{i}$ are known to lie in given ellipsoids around its known centre ${\overline{a}}_{i}$
where ${P}_{i}\in {\mathbb{R}}^{n\times n}$ are positive semidefinite matrices, this problem is also known as robust linear programming which can be modelled as
$$\begin{array}{ll}\underset{x\in {\mathbb{R}}^{n}}{\mathrm{minimize}}\phantom{\rule{0.25em}{0ex}}& {c}^{\mathrm{T}}x\\ \text{subject to}& {a}_{i}^{\mathrm{T}}x\le {b}_{i}\text{, \hspace{1em} for all \hspace{0.5em}}{a}_{i}\in {\mathcal{E}}_{i}\text{, \hspace{1em}}i=1,\dots ,r\text{.}\end{array}$$
(16)
Constraints
$${a}_{i}^{\mathrm{T}}x\le {b}_{i}\text{, \hspace{1em} for all \hspace{0.5em}}{a}_{i}\in {\mathcal{E}}_{i}\text{, \hspace{1em}}i=1,\dots ,r$$
Note we can also get SOCP formulation if there is some uncertainty or variation in the parameters $c$ and ${b}_{i}$ following a similar procedure.
9.2Description of the Printed Output
The solver can print information to give an overview of the problem and of the progress of the computation. The output may be sent to two independent streams (files) which are set by optional parameters ${\mathbf{Print\; File}}$ and ${\mathbf{Monitoring\; File}}$. Optional parameters ${\mathbf{Print\; Level}}$, ${\mathbf{Monitoring\; Level}}$, ${\mathbf{Print\; Solution}}$ and ${\mathbf{Print\; Options}}$ determine the exposed level of detail. This allows, for example, a detailed log file to be generated while the condensed information is displayed on the screen.
By default (${\mathbf{Print\; File}}=6$, ${\mathbf{Print\; Level}}=2$), six sections are printed to the standard output:
Header
Optional parameters list (if ${\mathbf{Print\; Options}}=\mathrm{YES}$)
The header is a message indicating the start of the solver. It should look like:
------------------------------------------------
E04PT, Interior point method for SOCP problems
------------------------------------------------
Optional parameters list
The list shows all options of the solver, each displayed on one line. The output contains the option name, its current value and an indicator for how it was set. The options unchanged from the default setting are noted by ‘d’, options you set are noted by ‘U’, and options reset by the solver are noted by ‘S’. Note that the output format is compatible with the file format expected by e04zpc. The output might look as follows:
Socp Iteration Limit = 100 * d
Socp Max Iterative Refinement = 9 * d
Socp Presolve = Yes * d
Socp Scaling = None * d
Problem statistics
If ${\mathbf{Print\; Level}}\ge 2$, statistics on the original and the presolved problems are printed. More detailed statistics, as well as a list of the presolve operations, are also printed for ${\mathbf{Print\; Level}}$$3$ or above, for example:
Problem Statistics
No of variables 3
free (unconstrained) 1
bounded 2
No of lin. constraints 2
nonzeroes 6
No of quad.constraints 0
No of cones 1
biggest cone size 3
Objective function Linear
Presolved Problem Measures
No of variables 7
No of lin. constraints 4
nonzeroes 12
No of cones 1
Iteration log
If ${\mathbf{Print\; Level}}\ge 2$, the solver prints the status of each iteration.
If ${\mathbf{Print\; Level}}=2$, the output shows the iteration number ($0$ represents the starting point), the current primal and dual objective value, convergence measures (primal infeasibility, dual infeasibility and duality gap defined in Section 11.5.1) and the value of the additional variable $\tau $ (see Section 11.1). The output might look as follows:
If ${\mathbf{Print\; Level}}=3$, the solver also prints for each iteration ${\rho}_{A}$ (defined in Section 11.5.1), the value of the variable $\kappa $ (see Section 11.1), the step size, the maximum error of the backsolves performed as well as the total number of iterative refinements performed. The output takes the following form:
Occasionally, when numerical instabilities are too big, the solver will restart the iteration and switch to an augmented system formulation. In such cases the letters RS will be printed in the information column (I).
If ${\mathbf{Print\; Level}}>3$, each iteration produces more information that expands over several lines. This additional information contains:
the method used (normal equation, augmented system);
the number of factorizations performed at the current iteration;
the type of factorization performed (Cholesky, Bunch–Parlett);
the value of the perturbation added to the diagonal in the normal equation formulation or on the zero block in the augmented system formulation;
the total time spent in the iteration if ${\mathbf{Stats\; Time}}$ is not set to $\mathrm{NO}$.
The output might look as follows:
------------ Details of Iteration 1 ------------
method Augmented System
iterative refinements 9
factorizations 1
matrix type Bunch-Parlett
diagonal perturbation 7.00E-08
time iteration 0.05 sec
--------------------------------------------------
Summary
Once the solver finishes, a detailed summary is produced:
-------------------------------------------------
Status: converged, an optimal solution found
-------------------------------------------------
Final primal objective value 2.688878E+01
Final dual objective value 2.688878E+01
Absolute primal infeasibility 2.264154E-07
Relative primal infeasibility 6.788104E-09
Absolute dual infeasibility 7.639479E-09
Relative dual infeasibility 1.371539E-09
Absolute complementarity gap 2.558237E-08
Relative complementarity gap 8.342957E-10
Iterations 8
It starts with the status line of the overall result which matches the fail value and is followed by the final primal and dual objective values as well as the error measures and iteration count.
Optionally, if ${\mathbf{Stats\; Time}}=\mathrm{YES}$, the timings of the different parts of the algorithm are displayed. It might look as follows:
If ${\mathbf{Print\; Solution}}=\mathrm{X}$, the values of the primal variables and their bounds on the primary and secondary outputs. It might look as follows:
This example demonstrates how to define and solve a Second-order Cone Programming (SOCP). As described in Section 9.1, SOCP has many applications, however, a reformulation might be needed. See e04rsc where a convex quadratically constrained quadratic programming problem is defined and solved via SOCP. See also e04sac where the input is read from a file.
This section contains the description of the underlying algorithms used in e04ptc, which implements the standard primal-dual path-following interior point method with Nesterov–Todd scaling and self-dual embedding. For further details, see Nesterov and Todd (1998), Nesterov and Todd (1997) and Andersen et al. (2003).
For simplicity, we consider the following primal Second-order Cone Programming (SOCP) formulation
where $c$, $x\in {\mathbb{R}}^{n}$, $b\in {\mathbb{R}}^{m}$, $A\in {\mathbb{R}}^{m\times n}$ with full row rank, and $\overline{\mathcal{K}}={\mathcal{K}}^{{n}_{1}}\times \cdots \times {\mathcal{K}}^{{n}_{r}}\times {\mathbb{R}}_{+}^{{n}_{l}}$. The dual formulation for problem (19) is given by
where $y$ and $z$ denote the dual variables and $\overline{\mathcal{K}}$ is as defined above (it is a self-dual cone). Solutions of the primal (19) and dual (20) problem are connected by the strong duality theory (see, for example, Nocedal and Wright (2006)) and are characterized by the first-order optimality conditions, the so-called Karush–Kuhn–Tucker (KKT) conditions, which are stated as follows:
where $\circ $ is the multiplication operator defined in a special case of a so-called Euclidean Jordan algebra $({\mathbb{R}}^{n},\circ )$ with the following definition
If (19) and (20) have a strictly feasible solution (i.e., there is a feasible solution $(\hat{x},(\hat{y},\hat{z}))$ such that $\hat{x}\in \mathrm{int}\overline{\mathcal{K}}$ and $\hat{z}\in \mathrm{int}\overline{\mathcal{K}}$), then they both have optimal solutions and the duality gap is zero. Moreover, a feasible solution pair $({x}^{*},{y}^{*},{z}^{*})$ is optimal if, and only if, the KKT conditions (21) hold at this point, see Alizadeh and Goldfarb (2003) for more details.
The underlying algorithm applies an iterative method to find an optimal solution $({x}^{*},{y}^{*},{z}^{*})$ of the system (21) employing variants of Newton's method and modifying the search direction and step length so that the cone constraints are preserved at every iteration.
11.1Homogeneous Self-Dual Algorithm
The homogeneous and self-dual (HSD) model was first studied by Goldman and Tucker (1956) for linear programming and simplified by Xu et al. (1996). Then a generalization of HSD was employed to solve SOCP problems by Andersen et al. (2003) and Sturm (2002). As its name suggests, the HSD model and its dual are equivalent. Self-dual formulations embed the original problem (19) in a larger conic optimization problem such that the latter is primal and dual feasible, with known feasible points, and from which solution we can extract optimal solutions or certificates of infeasibility of the original problem.
We define the homogeneous and self-dual model for problem (19) as follows:
The model (23) can be viewed as a self-dual optimization problem with a zero objective function. If $(\hat{x},\hat{\tau},\hat{y},\hat{z},\hat{\kappa})$ is any feasible solution to (23), then if $\hat{\tau}>0$, a primal-dual optimal solution to (19) and (20) is given by
and the duality gap is given by ${c}^{\mathrm{T}}{x}^{*}-{b}^{\mathrm{T}}{y}^{*}=\hat{\kappa}/\hat{\tau}=0$. The homogeneous algorithm is an application of the primal-dual method for the computation of a feasible solution to (23). In order to achieve this, we follow the guideline of path-following interior point method and define a central path that is a smooth curve connecting an initial interior point and a complementary solution. So the set of nonlinear equations
defines the central path of the homogeneous model parameterized by $\gamma \in [0,1]$, $({x}^{0},{z}^{0},{y}^{0},{\tau}^{0},{\kappa}^{0})$ is an initial feasible point and $\mu $ has the expression $\mu \u2254\frac{{x}^{\mathrm{T}}s+\tau \kappa}{r+1}$ where $r$ is the number of cones.
11.2The Nesterov–Todd Search Direction
The Newton search direction is only guaranteed to be well-defined in a narrow neighbourhood around the central path. The search direction corresponds to applying Newton's method to (24) in a scaled space and then scaling the resulting search direction back to the original space so that it is well-defined. A matrix $W$ is a scaling matrix if it satisfies the conditions $W\succ 0$ and $WQW=Q$ where $W\succ 0$ means $W$ is symmetric and positive definite and $Q$ is a symmetric block diagonal matrix composed by so called reflection matrices ${Q}_{i}$ with the following definition:
It is easy to see that if we scale $x$ to $Wx$, $z$ to ${W}^{\mathrm{-1}}z$, $A$ to $A{W}^{\mathrm{-1}}$, and $c$ to ${W}^{\mathrm{-1}}c$, the resulting primal and dual pair is equivalent to (19) and (20), see Alizadeh and Goldfarb (2003) for more details.
An important issue is the choice of the scaling matrix $W$. According to Andersen et al. (2003), the best results are obtained for the Nesterov–Todd (NT) scaling suggested by Nesterov and Todd (1997). In the NT scaling, $W$ is chosen such that
When Newton's method is applied to the perturbed complementarity conditions in (24), the quadratic terms are neglected. Instead of neglecting the quadratic term Mehrotra (1992) suggested using a second-order correction of the search direction which increases the efficiency of the algorithm significantly in practice (Lustig et al. (1992)).
To implement this idea, we first solve (24) for $\gamma =0$ to get an affine scaling direction and a maximum step size ${\alpha}_{n}^{\text{max}}$ to the boundary. Then use these directions to estimate the quadratic terms
$$W\Delta x\circ {W}^{\mathrm{-1}}\Delta z\text{\hspace{1em} and \hspace{1em}}\Delta \tau \Delta \kappa $$
from (24) and use ${\alpha}_{n}^{\text{max}}$ to choose
where $\delta \in (0,1)$ is a constant. Therefore, we can choose $\gamma $ dynamically depending on how much progress can be made in the pure Newton (affine scaling) direction.
11.4Solving the KKT System
The solution of the Newton system of equations (25) is the most computationally costly operation. To reduce the system, we need the following definition. Associated with each vector $x=({x}_{0};\overline{x})\in {\mathbb{R}}^{n}$ there is an arrow-shaped matrix $\mathit{Arw}\left(x\right)$ defined as:
Linear systems (26) and (27) are systems of $m+n$ variables, symmetric and indefinite. Submatrix $W$ is block diagonal and positive definite. Note that systems (26) and (27) have the same coefficient matrix so we only need to perform factorization once per iteration.
The system (27) can be further reduced by eliminating ${g}_{1}$ and ${h}_{1}$, to a positive definite system usually called normal equations defined as
Typically, formulation (28) is preferred for many problems as the system matrix can be factorized by a sparse Cholesky. However, this brings some well-known disadvantages: ill-conditioning of the system is often observed during the final stages of the algorithm. If matrix $A$ contains dense columns (columns with relatively many nonzeros), then $A{W}^{\mathrm{-2}}{A}^{\mathrm{T}}$ has many nonzeros, which in turn makes the factorization expensive. On the other hand, solving the augmented system by Bunch–Parlett type factorization is usually slower, but it normally avoids the fill-in caused by dense columns.
e04ptc can detect and handle dense columns in the KKT system effectively. Since matrix ${W}^{\mathrm{-2}}$ in (28) is block diagonal, so dense columns also come as a linear combination of some columns in $A$. Depending on the number and the density of the ‘dense’ columns, the solver may either choose to directly use an augmented system formulation or to treat these columns separately in a product-form Cholesky factorization as described by Goldfarb and Scheinberg (2005). It is also possible to manually override the automatic choice via the optional parameter ${\mathbf{SOCP\; System\; Formulation}}$ and let the solver use a normal equations or an augmented system formulation.
Badly scaled optimal solutions may present numerical challenges, therefore, iterative refinement is employed for reducing the roundoff errors produced during the solution of the system. When the condition number of the system $A{W}^{\mathrm{-2}}{A}^{\mathrm{T}}$ prevents the satisfactory use of iterative refinement, e04ptc switches automatically to an augmented system formulation, reporting RS (Restart) in the last column of the iteration log (I). Furthermore, e04ptc provides several scaling techniques to adjust the numerical characteristics of the problem data, see optional parameter ${\mathbf{SOCP\; Scaling}}$.
Finally, factorization of the system matrix can degrade sparsity, so the resulting fill-in can be large, therefore, several ordering techniques are included to minimize it. e04ptc uses Harwell packages MA97 (see Hogg and Scott (2011) and HSL (2011)) for the underlying sparse linear algebra factorization and MC68 approximate minimum degree algorithm, and METIS (Karypis and Kumar (1998)) nested dissection algorithm for the ordering.
11.5Stopping Criteria
11.5.1Convergence-optimal termination
To measure the infeasibility, the following measures
are defined to measure the relative reduction in the primal, dual and gap infeasibility, respectively. In addition, an extra measure is considered to quantify the accuracy in the objective function, which is given by
The iteration is considered nearly feasible and optimal, and the interior point algorithm is stopped when the following conditions
$$\mathrm{max}\phantom{\rule{0.125em}{0ex}}({\rho}_{P},{\rho}_{D})\le {\epsilon}_{1}\text{\hspace{1em} and \hspace{1em}}{\rho}_{A}\le {\epsilon}_{2}$$
are satisfied. Here ${\epsilon}_{1}$ and ${\epsilon}_{2}$ may be set using ${\mathbf{SOCP\; Stop\; Tolerance}}$ and ${\mathbf{SOCP\; Stop\; Tolerance\; 2}}$, respectively.
Premature termination is triggered and the returned solution is considered as an optimal solution if the current iteration exhibits fast convergence and the optimality measures lie within a small range of desired precision. In particular, the self-dual algorithm is stopped if the above termination conditions are met within a small factor and $\tau >1000\kappa $. This measure is tracked after the first $10$ iterations.
In addition, the solver stops prematurely and reports suboptimal solution when it predicts that the current estimate of the solution will not be improved in subsequent iterations. In most cases the returned solution should be acceptable.
11.5.2Infeasibility/Unboundedness Detection
The problem is concluded to be primal or dual infeasible if one of the following conditions hold:
1.$\mathrm{max}\phantom{\rule{0.125em}{0ex}}({\rho}_{P},{\rho}_{D},{\rho}_{G})\le {\epsilon}_{1}\text{\hspace{1em} and \hspace{1em}}\tau \le {\epsilon}_{2}\mathrm{max}\phantom{\rule{0.125em}{0ex}}(1,\kappa )$.
2.$\mu \le {\epsilon}_{2}{\mu}_{0}\text{\hspace{1em} and \hspace{1em}}\tau \le {\epsilon}_{2}\mathrm{max}\phantom{\rule{0.125em}{0ex}}(1,\kappa )$.
Then the problem is declared dual infeasible if ${c}^{\mathrm{T}}x<0$ or primal infeasible otherwise.
11.6Further Details
e04ptc includes an advance preprocessing phase (called presolve) to reduce the dimensions of the problem before passing it to the solver. The reduction in problem size generally improves the behaviour of the solver, shortening the total computation time. In addition, infeasibility may also be detected during preprocessing. The default behaviour of the presolve can be modified by optional parameter ${\mathbf{SOCP\; Presolve}}$.
12Optional Parameters
Several optional parameters in e04ptc define choices in the problem specification or the algorithm logic. In order to reduce the number of formal arguments of e04ptc these optional parameters have associated default values that are appropriate for most problems. Therefore, you need only specify those optional parameters whose values are to be different from their default values.
The remainder of this section can be skipped if you wish to use the default values for all optional parameters.
The optional parameters can be changed by calling e04zmc anytime between the initialization of the handle and the call to the solver. Modification of the optional parameters during intermediate monitoring stops is not allowed. Once the solver finishes, the optional parameters can be altered again for the next solve.
The option values can be retrieved by calling e04znc.
The following is a list of the optional parameters available. A full description of each optional parameter is provided in Section 12.1.
For each option, we give a summary line, a description of the optional parameter and details of constraints.
The summary line contains:
the keywords;
a parameter value,
where the letters $a$, $i$ and $r$ denote options that take character, integer and real values respectively;
the default value, where the symbol $\epsilon $ is a generic notation for machine precision (see X02AJC).
All options accept the value $\mathrm{DEFAULT}$ to return single options to their default states.
Keywords and character values are case and white space insensitive.
Defaults
This special keyword may be used to reset all optional parameters to their default values. Any argument value given with this keyword will be ignored.
Infinite Bound Size
$r$
Default $\text{}={10}^{20}$
This defines the ‘infinite’ bound $\mathit{bigbnd}$ in the definition of the problem constraints. Any upper bound greater than or equal to $\mathit{bigbnd}$ will
be regarded as $+\infty $ (and similarly any lower bound less than or equal to $-\mathit{bigbnd}$ will be regarded as $-\infty $). Note that a modification of this optional parameter does not influence constraints which have already been defined; only the constraints formulated after the change will be affected.
(See Section 3.1.1 in the Introduction to the NAG Library CL Interface for further information on NAG data types.)
If $i\ge 0$, the
Nag_FileID number (as returned from x04acc)
for the secondary (monitoring) output. If set to $\mathrm{-1}$, no secondary output is provided. The following information is output to the unit:
–a listing of the optional parameters;
–problem statistics, the iteration log, and the final status as set by ${\mathbf{Monitoring\; Level}}$;
–the solution if set by ${\mathbf{Print\; Solution}}$.
This parameter sets the amount of information detail that will be printed by the solver to the secondary output. The meaning of the levels is the same as with ${\mathbf{Print\; Level}}$.
(See Section 3.1.1 in the Introduction to the NAG Library CL Interface for further information on NAG data types.)
If $i\ge 0$, the
Nag_FileID number (as returned from x04acc, stdout as the default)
for the primary output of the solver. If ${\mathbf{Print\; File}}=\mathrm{-1}$, the primary output is completely turned off independently of other settings. The following information is output to the unit:
–a listing of optional parameters if set by ${\mathbf{Print\; Options}}$;
–problem statistics, the iteration log, and the final status from the solver as set by ${\mathbf{Print\; Level}}$;
–the solution if set by ${\mathbf{Print\; Solution}}$.
This parameter defines how detailed information should be printed by the solver to the primary output.
$\mathit{i}$
Output
$0$
No output from the solver
$1$
Only the final status and the primal and dual objective value
$2$
Problem statistics, one line per iteration showing the progress of the solution with respect to the convergence measures, final status and statistics
$3$
As level $2$ but each iteration line is longer, including step lengths and errors
$4,5$
As level $3$ but further details of each iteration are presented
Constraint: $0\le {\mathbf{Print\; Level}}\le 5$.
Print Options
$a$
Default $=\mathrm{YES}$
If ${\mathbf{Print\; Options}}=\mathrm{YES}$, a listing of optional parameters will be printed to the primary output.
Constraint: ${\mathbf{Print\; Options}}=\mathrm{YES}$ or $\mathrm{NO}$.
Print Solution
$a$
Default $=\mathrm{NO}$
If ${\mathbf{Print\; Solution}}=\mathrm{X}$, the final values of the primal variables are printed on the primary and secondary outputs.
If ${\mathbf{Print\; Solution}}=\mathrm{YES}$ or $\mathrm{ALL}$, in addition to the primal variables, the final values of the dual variables are printed on the primary and secondary outputs.
Constraint: ${\mathbf{Print\; Solution}}=\mathrm{YES}$, $\mathrm{NO}$, $\mathrm{X}$ or $\mathrm{ALL}$.
SOCP Iteration Limit
$i$
Default $\text{}=100$
The maximum number of iterations to be performed by e04ptc. Setting the option too low might lead to ${\mathbf{fail}}\mathbf{.}\mathbf{code}=$NE_TOO_MANY_ITER.
This parameter defines the frequency of how often function monit is called. If $i>0$, the solver calls monit at the end of every $i$th iteration. If it is set to $0$, the function is not called at all.
This parameter allows you to reduce the level of presolving of the problem or turn it off completely. If the presolver is turned off, the solver will try to handle the problem as given by you. In such a case, the presence of fixed variables or linear dependencies in the constraint matrix can cause numerical instabilities to occur. In normal circumstances, it is recommended to use the full presolve which is the default.
Constraint: ${\mathbf{SOCP\; Presolve}}=\mathrm{FULL}$, $\mathrm{BASIC}$ or $\mathrm{NO}$.
SOCP Scaling
$a$
Default $=\mathrm{NONE}$
This parameter controls the type of scaling to be applied on the constraint matrix $A$ before solving the problem. More precisely, the scaling procedure will try to find diagonal matrices ${D}_{1}$ and ${D}_{2}$ such that the values in ${D}_{1}A{D}_{2}$ are of a similar order of magnitude. The solver is less likely to run into numerical difficulties when the constraint matrix is well scaled.
Constraint: ${\mathbf{SOCP\; Scaling}}=\mathrm{ARITHMETIC}$, $\mathrm{GEOMETRIC}$ or $\mathrm{NONE}$.
SOCP Stop Tolerance
$r$
Default $=\sqrt{\epsilon}$
This parameter sets the value ${\epsilon}_{1}$ which is the tolerance for the convergence measures in the stopping criteria, see Section 11.5.
As described in Section 11.4, e04ptc can internally work either with the normal equations formulation (28) or with the augmented system (26) and (27). A brief discussion of advantages and disadvantages is presented in (27). Setting the option value to $\mathrm{AUTO}$ leaves the decision to the solver based on the structure of the constraints and it is the recommended value. This will typically lead to the normal equations formulation unless there are many dense columns or the system is significantly cheaper to factorize as the augmented system. Note that in some cases even if ${\mathbf{SOCP\; System\; Formulation}}=\mathrm{NORMAL\; EQUATIONS}$ the solver might switch the formulation through the computation to the augmented system due to numerical instabilities or computational cost.
This parameter allows you to turn on timings of various parts of the algorithm to give a better overview of where most of the time is spent. This might be helpful for a choice of different solving approaches. It is possible to choose between CPU and wall clock time. Choice $\mathrm{YES}$ is equivalent to $\mathrm{WALL\; CLOCK}$.
Constraint: ${\mathbf{Stats\; Time}}=\mathrm{YES}$, $\mathrm{NO}$, $\mathrm{CPU}$ or $\mathrm{WALL\; CLOCK}$.
Task
$a$
Default $=\mathrm{MINIMIZE}$
This parameter specifies the required direction of the optimization. If ${\mathbf{Task}}=\mathrm{FEASIBLE\; POINT}$, the objective function (if set) is ignored and the algorithm stops as soon as a feasible point is found with respect to the given tolerance. If no objective function is set, ${\mathbf{Task}}$ reverts to $\mathrm{FEASIBLE\; POINT}$ automatically.
Constraint: ${\mathbf{Task}}=\mathrm{MINIMIZE}$, $\mathrm{MAXIMIZE}$ or $\mathrm{FEASIBLE\; POINT}$.
Time Limit
$r$
Default $\text{}={10}^{6}$
A limit to the number of seconds that the solver can use to solve one problem. If during the convergence check this limit is exceeded, the solver will terminate with ${\mathbf{fail}}\mathbf{.}\mathbf{code}=$NE_TIME_LIMIT.