naginterfaces.library.opt.handle_solve_bxnl¶

naginterfaces.library.opt.handle_solve_bxnl(handle, lsqfun, lsqgrd, x, nres, lsqhes=None, lsqhprd=None, monit=None, data=None, io_manager=None, spiked_sorder='C')[source]¶

handle_solve_bxnl is a bound-constrained nonlinear least squares trust region solver (BXNL) from the NAG optimization modelling suite aimed for small to medium-scale problems.

Note: this function uses optional algorithmic parameters, see also: handle_opt_set(), handle_opt_get().

For full information please refer to the NAG Library document for e04gg

https://support.nag.com/numeric/nl/nagdoc_30.3/flhtml/e04/e04ggf.html

Parameters

handleHandle

The handle to the problem. It needs to be initialized (e.g., by handle_init()) and to hold a problem formulation compatible with handle_solve_bxnl. It must not be changed between calls to the NAG optimization modelling suite.

lsqfuncallable (rx, inform) = lsqfun(x, nres, inform, data=None)

$l s q f u n$ must evaluate the value of the nonlinear residuals, $r_{i} (x) := y_{i} - ϕ (t_{i}; x), i = 1, \dots, n_{r e s}$ , at a specified point $x$ .

Parameters

xfloat, ndarray, shape $(nvar)$: $x$ , the vector of variable values at which the residuals, $r_{i}$ , are to be evaluated.
nresint: $n_{res}$ , the current number of residuals in the model.
informint: A non-negative value.
dataarbitrary, optional, modifiable in place: User-communication data for callback functions.

Returns

rxfloat, array-like, shape $(n r e s)$: The value of the residual vector, $r (x)$ , evaluated at $x$ .
informint: May be used to indicate that some residuals could not be computed at the requested point. This can be done by setting $i n f o r m$ to a negative value. The solver will attempt a rescue procedure and request an alternative point. If the rescue procedure fails, the solver will exit with $e r r n o$ = 25.

lsqgrdcallable inform = lsqgrd(x, nres, rdx, inform, data=None)

$l s q g r d$ evaluates the residual gradients, $\nabla r_{i} (x)$ , at a specified point $x$ .

Parameters

xfloat, ndarray, shape $(nvar)$

$x$ , the vector of variable values at which the residual gradients, $\nabla r_{i} (x)$ , are to be evaluated.

nresint

$n_{res}$ , the current number of residuals in the model.

rdxfloat, ndarray, shape $(nnzrd)$ , to be modified in place

On entry: the elements should only be assigned and not referenced.

On exit: the vector containing the nonzero residual gradients evaluated at $x$ ,

\nabla r (x) = [\nabla r_{1} (x), \nabla r_{2} (x), \dots, \nabla r_{n_{res}} (x)],

where

\nabla r_{i} (x) = {[\frac{\partial r_{i} (x)}{\partial x_{1}}, \dots, \frac{\partial r_{i} (x)}{\partial x_{n_{var}}}]}^{T} .

The elements must be stored in the same order as the defined sparsity pattern provided in the call to handle_set_nlnls().

informint

A non-negative value.

dataarbitrary, optional, modifiable in place

User-communication data for callback functions.

Returns

informint: May be used to indicate that the residual gradients could not be computed at the requested point. This can be done by setting $i n f o r m$ to a negative value. The solver will attempt a rescue procedure and request an alternative point. If the rescue procedure fails, the solver will exit with $e r r n o$ = 25.

xfloat, array-like, shape $(nvar)$

$x_{0}$ , the initial estimates of the variables, $x$ .

nresint

$n_{res}$ , the current number of residuals in the model.

lsqhesNone or callable inform = lsqhes(x, lamda, hx, inform, data=None), optional

Note: if this argument is None then a NAG-supplied facility will be used.

$l s q h e s$ evaluates the residual Hessians, $\nabla^{2} r_{i} (x)$ , at a specified point $x$ .

By default, the option $‘Bxnl Use Second Derivatives' ='NO'$ and $l s q h e s$ is never called. $l s q h e s$ may be None.

This function will only be called if the option $‘Bxnl Use Second Derivatives' ='YES'$ and if the model (see Models) requires second order information.

Under these circumstances, if you do not provide a valid $l s q h e s$ the solver will terminate with either $e r r n o$ = 6 or $e r r n o$ = 21.

Parameters

xfloat, ndarray, shape $(nvar)$

$x$ , the vector of decision variables at the current iteration.

lamdafloat, ndarray, shape $(nres)$

$λ$ , the vector containing the (weighted) residuals at $x$ , $λ_{i} = w_{i} r_{i} (x)$ . See [equation] and Residual Weights.

hxfloat, ndarray, shape $(nvar, nvar)$ , to be modified in place

On entry: the elements should only be assigned and not referenced.

On exit: a dense square (symmetric) matrix containing the weighted sum of residual Hessians,

H (x) = n_{res} \sum i = 1 λ_{i} \nabla^{2} r_{i} (x),

where

\begin{matrix} \nabla^{2} r_{i} (x) = ⎛ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎜ ⎝ \begin{matrix} \frac{\partial^{2}}{\partial_{x_{1}} \partial_{x_{1}}} r_{i} (x) & \frac{\partial^{2}}{\partial_{x_{1}} \partial_{x_{2}}} r_{i} (x) & \dots & \frac{\partial^{2}}{\partial_{x_{1}} \partial_{x_{n_{v a r}}}} r_{i} (x) \frac{\partial^{2}}{\partial_{x_{2}} \partial_{x_{1}}} r_{i} (x) & \frac{\partial^{2}}{\partial_{x_{2}} \partial_{x_{2}}} r_{i} (x) & \dots & \frac{\partial^{2}}{\partial_{x_{2}} \partial_{x_{n_{v a r}}}} r_{i} (x) ⋮ & ⋮ & ⋱ & ⋮ \frac{\partial^{2}}{\partial_{x_{n_{v a r}}} \partial_{x_{1}}} r_{i} (x) & \frac{\partial^{2}}{\partial_{x_{n_{v a r}}} \partial_{x_{2}}} r_{i} (x) & \dots & \frac{\partial^{2}}{\partial_{x_{n_{v a r}}} \partial_{x_{n_{v a r}}}} r_{i} (x) \end{matrix} ⎞ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎟ ⎠, \end{matrix}

is also a dense square (symmetric) matrix containing the $i$ th residual Hessian evaluated at the point $x$ . All matrix elements must be provided: both upper and lower triangular parts.

informint

A non-negative value.

dataarbitrary, optional, modifiable in place

User-communication data for callback functions.

Returns

informint: May be used to indicate that one or more elements of the residual Hessian could not be computed at the requested point. This can be done by setting $i n f o r m$ to a negative value. The solver will attempt a rescue procedure and if the rescue procedure fails, the solver will exit with $e r r n o$ = 25.

lsqhprdNone or callable (hxy, inform) = lsqhprd(x, y, hxy, inform, data=None), optional

Note: if this argument is None then a NAG-supplied facility will be used.

$l s q h p r d$ evaluates the residual Hessians, $\nabla^{2} r_{i} (x)$ , at a specified point, $x$ , and performs matrix-vector products with a given vector, $y$ , returning the dense matrix $[\nabla^{2} r_{1} (x) y, \nabla^{2} r_{2} (x) y, \dots, \nabla^{2} r_{n_{res}} (x) y]$ .

If you do not supply this function, it may be None.

Parameters

xfloat, ndarray, shape $(nvar)$: $x$ , the vector of decision variables at the current iteration.
yfloat, ndarray, shape $(nvar)$: $y$ , the vector used to perform the required matrix-vector products.
hxyfloat, ndarray, shape $(nvar, nres)$: The elements should only be assigned and not referenced.
informint: The first call to $l s q h p r d$ will have a non-zero value and can be used to optimize your code in order to avoid recalculations of common quantities when evaluating the Hessians. For all other instances $i n f o r m$ will have a value of zero. This notification argument may be safely ignored if such optimization is not required.
dataarbitrary, optional, modifiable in place: User-communication data for callback functions.

Returns

hxyfloat, array-like, shape $(nvar, nres)$: A dense matrix of size $n_{var} \times n_{res}$ containing the following matrix-vector products,

$H (x, y) = [\nabla^{2} r_{1} (x) y, \nabla^{2} r_{2} (x) y, \dots, \nabla^{2} r_{n_{res}} (x) y] .$
informint: May be used to indicate that one or more elements of the residual Hessian could not be computed at the requested point. This can be done by setting $i n f o r m$ to a negative value. The solver will attempt a rescue procedure and if the rescue procedure fails, the solver will exit with $e r r n o$ = 25. The value of $i n f o r m$ returned on the first call is ignored.

monitNone or callable monit(x, rinfo, stats, data=None), optional

Note: if this argument is None then a NAG-supplied facility will be used.

$m o n i t$ is provided to enable monitoring of the progress of the optimization and, if necessary, to halt the optimization process.

If no monitoring is required, $m o n i t$ may be None.

$m o n i t$ is called at the end of every $i$ th step where $i$ is controlled by the option ‘Bxnl Monitor Frequency’ (the default value is $0$ , $m o n i t$ is not called).

Parameters

xfloat, ndarray, shape $(nvar)$: The current best point.
rinfofloat, ndarray, shape $(100)$: Best objective value computed and various indicators (the values are as described in the main argument $r i n f o$ ).
statsfloat, ndarray, shape $(100)$: Solver statistics at monitoring steps or at the end of the current iteration (the values are as described in the main argument $s t a t s$ ).
dataarbitrary, optional, modifiable in place: User-communication data for callback functions.

dataarbitrary, optional

User-communication data for callback functions.

io_managerFileObjManager, optional

Manager for I/O in this routine.

spiked_sorderstr, optional

If $h x y$ in $l s q h p r d$ is spiked (i.e., has unit extent in all but one dimension, or has size $1$ ), $s p i k e d_s o r d e r$ selects the storage order to associate with it in the NAG Engine:

spiked_sorder = $'C'$: row-major storage will be used;
spiked_sorder = $'F'$: column-major storage will be used.

Returns

xfloat, ndarray, shape $(nvar)$

The final values of the variables, $x$ .

rxfloat, ndarray, shape $(n r e s)$

The values of the residuals at the final point given in $x$ .

rinfofloat, ndarray, shape $(100)$

Objective value and various indicators at monitoring steps or at the end of the final iteration. The measures are given in the table below:

$0$	Objective function value, $f (x)$ .
$1$	Norm of the projected gradient at the current iterate, see PG STEP in Bound Constraints and [equation] in Stopping Criteria.
$2$	Norm of the scaled projected gradient at the current iterate, see [equation] in Stopping Criteria
$3$	Norm of the step between the current and previous iterate.
$4$	Convergence tests result. A scalar value between $0 - 7$ indicates whether a convergence test has passed. Specifically, $1$ indicates small objective test passed, $2$ indicates small (scaled) gradient test passed, $4$ indicates small step test passed. In the case where two or more tests passed, they are accumulated.
$5$	Norm of the current iterate $x$ . If regularization is requested, then this value was used in the regularization and it might differ from $∥ x ∥$ if $x$ has fixed or disabled elements.
$6$ – $99$	Reserved for future use.

statsfloat, ndarray, shape $(100)$

Solver statistics at monitoring steps or at the end of the final iteration as given in the table below:

$0$	Number of iterations performed.
$1$	Total number of calls to the objective function $l s q f u n$ .
$2$	Total number of calls to the objective gradient function $l s q g r d$ .
$3$	Total number of calls to the objective Hessian function $l s q h e s$ .
$4$	Total time in seconds spent in the solver. It includes time spent in user-supplied subroutines.
$5$	Number of calls to the objective function $l s q f u n$ required by linesearch steps.
$6$	Number of calls to the objective gradient function $l s q g r d$ required by linesearch steps.
$7$	Number of calls to the objective function $l s q f u n$ required by projected gradient steps.
$8$	Number of calls to the objective gradient function $l s q g r d$ required by projected gradient steps.
$9$	Number of inner iterations performed, see option $‘Bxnl Model' ='TENSOR-NEWTON'$ .
$10$	Number of linesearch iterations performed.
$11$	Number of projected gradient iterations performed.
$12$	Total number of calls to the objective auxiliary Hessian function $l s q h p r d$ .
$13$ – $99$	Reserved for future use.

Other Parameters

‘Defaults’valueless

This special keyword may be used to reset all options to their default values. Any value given with this keyword will be ignored.

‘Bxnl Basereg Pow’float

Default $= 2.0$

This argument defines the regularization power $p$ in [equation] and for the tensor Newton subproblem (when $‘Bxnl Tn Method' ='IMPLICIT'$ ). Some values are restricted depending on the type of regularization specified, see ‘Bxnl Basereg Type’ for more details.

Constraint: $‘Bxnl Basereg Pow' > 0$ .

‘Bxnl Basereg Term’float

Default $= 0.01$

This argument defines the regularization term $σ$ in [equation] and for the tensor Newton subproblem (when $‘Bxnl Tn Method' ='IMPLICIT'$ ).

Constraint: $‘Bxnl Basereg Term' > 0$ .

‘Bxnl Basereg Type’str

Default $='NONE'$

This argument specifies the method used to incorporate the regularizer into [equation] and optionally into the tensor Newton subproblem (when $‘Bxnl Model' ='Tensor-Newton'$ and $‘Bxnl Tn Method' ='IMPLICIT'$ ).

The option $‘Bxnl Basereg Type' ='EXPAND-NVAR-DOF'$ reformulates the original problem by expanding it with $n_{var}$ degrees of freedom that is subsequently solved. For the case $‘Bxnl Basereg Type' ='EXPAND-1-DOF'$ the residual vector is extended with a new term of the form $\frac{σ}{p} {∥ x ∥}_{2}^{p}$ ; for this method a value of $p = 3$ is recommended.

If $‘Bxnl Basereg Type' ='EXPAND-NVAR-DOF'$ then the regularization power term $p$ must be $2.0$ , that is $‘Bxnl Basereg Pow' ='2.0'$ . For further details see Subproblems.

Constraint: $‘Bxnl Basereg Type' ='NONE'$ , $'EXPAND-NVAR-DOF'$ or $'EXPAND-1-DOF'$ .

‘Bxnl Save Covariance Matrix’str

Default $='NO'$

This argument indicates to the solver to store the covariance matrix into the handle.

If $‘Bxnl Save Covariance Matrix' ='YES'$ then the lower triangle part of the covariance matrix is stored in packed column order (see the F07 Introduction) into the handle and can be retrieved via handle_set_get_real() using $cmdstr ='COVARIANCE MATRIX'$ with $lrarr = (n_{var} \times (n_{var} + 1)) / 2$ .

In the special case where $‘Bxnl Save Covariance Matrix' ='VARIANCE'$ , only the diagonal elements of the covariance matrix are stored in the handle and can be retrieved via handle_set_get_real() using $cmdstr ='VARIANCE'$ with $lrarr = n_{var}$ .

Similarly, if $‘Bxnl Save Covariance Matrix' ='HESSIAN'$ then the lower triangle part of the matrix $H (x) = \nabla r (x) {\nabla r (x)}^{T} = {J (x)}^{T} J (x)$ is stored in packed column order into the handle and can be retrieved via handle_set_get_real() using $cmdstr ='HESSIAN MATRIX'$ with $lrarr = (n_{var} \times (n_{var} + 1)) / 2$ .

Limitations: If the number of enabled residuals is not greater than the number of enabled variables, or the pseudo-inverse of $H (x)$ could not be calculated, then the covariance matrix (variance vector) is not stored in the handle and will not be available.

For more information on how the covariance matrix is estimated, see lsq_uncon_covariance().

Constraint: $‘Bxnl Save Covariance Matrix' ='NO'$ , $'YES'$ , $'VARIANCE'$ or $'HESSIAN'$ .

‘Bxnl Stop Abs Tol Fun’float

Default $= ϵ^{\frac{1}{2}}$

This argument specifies the relative tolerance for the error test, specifically, it sets the value of $ϵ_{abs}^{f}$ of equation [equation] in Stopping Criteria. Setting ‘Bxnl Stop Abs Tol Fun’ to a large value may cause the solver to stop prematurely with a suboptimal solution.

Constraint: $‘Bxnl Stop Abs Tol Fun' > 0$ .

‘Bxnl Stop Abs Tol Grd’float

Default $= 2.2 ϵ^{\frac{1}{3}}$

This argument specifies the relative tolerance for the gradient test, specifically, it sets the value of $ϵ_{abs}^{g}$ of equation [equation] in Stopping Criteria. Setting ‘Bxnl Stop Abs Tol Grd’ to a large value may cause the solver to stop prematurely with a suboptimal solution.

Constraint: $‘Bxnl Stop Abs Tol Grd' > 0$ .

‘Bxnl Stop Rel Tol Fun’float

Default $= ϵ^{\frac{1}{2}}$

This argument specifies the relative tolerance for the error test, specifically, it sets the value of $ϵ_{rel}^{f}$ of equation [equation] in Stopping Criteria. Setting ‘Bxnl Stop Rel Tol Fun’ to a large value may cause the solver to stop prematurely with a suboptimal solution.

Constraint: $‘Bxnl Stop Rel Tol Fun' > 0$ .

‘Bxnl Stop Rel Tol Grd’float

Default $= ϵ^{\frac{1}{2}}$

This argument specifies the relative tolerance for the gradient test, specifically, it sets the value of $ϵ_{rel}^{g}$ of equation [equation] in Stopping Criteria. Setting ‘Bxnl Stop Rel Tol Grd’ to a large value may cause the solver to stop prematurely with a suboptimal solution.

Constraint: $‘Bxnl Stop Rel Tol Grd' > 0$ .

‘Bxnl Stop Step Tol’float

Default $= 2 ϵ$

Specifies the stopping tolerance for the step length test, specifically, it sets the value for $ϵ_{step}$ of equation [equation] in Stopping Criteria. Setting ‘Bxnl Stop Step Tol’ to a large value may cause the solver to stop prematurely with a suboptimal solution.

Under certain circumstances, e.g., when in doubt of the quality of the first - or second-order derivatives, in the event of the solver exiting with a successful step length test, it is recommended to verify that either the error or the gradient norm is acceptably small.

Constraint: $‘Bxnl Stop Step Tol' > 0$ .

‘Bxnl Reg Order’str

Default $='AUTO'$

This argument specifies the order of the regularization $p$ in [equation] used when $‘Bxnl Glob Method' ='Reg'$ .

Some values for $p$ are restricted depending on the method chosen in ‘Bxnl Nlls Method’, see Regularization for more details.

Constraint: $‘Bxnl Reg Order' ='AUTO'$ , $'QUADRATIC'$ or $'CUBIC'$ .

‘Bxnl Glob Method’str

Default $='TR'$

This argument specifies the globalization method used to estimate the next step $s_{k}$ . It also determines the class of subproblem to solve. The trust region subproblem finds the step by minimizing the specified model withing a given radius. On the other hand, when $‘Bxnl Glob Method' = R E G$ , the problem is reformulated by adding an aditional regularization term and minimized in order to find the next step $s_{k}$ . See Subproblems for more details.

Constraint: $‘Bxnl Glob Method' ='TR'$ or $'REG'$ .

‘Bxnl Nlls Method’str

Default $='GALAHAD'$

This argument defines the method used to estimate the next step $s_{k}$ in $x_{k + 1} = x_{k} + s_{k}$ . It only applies to $‘Bxnl Model' ='GAUSS-NEWTON'$ , $'QUASI-NEWTON'$ or $'HYBRID'$ . When the globalization technique chosen is trust region ( $‘Bxnl Glob Method' = T R$ ) the methods for ‘Bxnl Nlls Method’ available are Powell’s dogleg method, a generalized eigenvalue method (AINT) Adachi et al. (2015), a variant of Moré–Sorensen’s method, and GALAHAD’s DTRS method. Otherwise, when the globalization method chosen is via regularization ( $‘Bxnl Glob Method' = R E G$ ) the methods available are comprised by a linear system solver and GALAHAD’s DRQS method. See Subproblems for more details.

Constraint: $‘Bxnl Nlls Method' ='POWELL-DOGLEG'$ , $'AINT'$ , $'MORE-SORENSEN'$ , $'LINEAR SOLVER'$ or $'GALAHAD'$ .

‘Bxnl Model’str

Default $='HYBRID'$

This argument specifies which model is used to approximate the objective function and estimate the next point that reduces the error. This is one of the most important options and should be chosen according to the problem characteristics. The models are briefly described in Models.

Constraint: $‘Bxnl Model' ='GAUSS-NEWTON'$ , $'QUASI-NEWTON'$ , $'HYBRID'$ or $'TENSOR-NEWTON'$ .

‘Bxnl Tn Method’str

Default $='MIN-1-VAR'$

This argument specifies how to solve the subproblem and find the next step $s_{k}$ for the tensor Newton model, $‘Bxnl Model' ='TENSOR-NEWTON'$ . The subproblems are solved using a range of regularization schemes. See Tensor Newton subproblem.

Constraint: $‘Bxnl Tn Method' ='IMPLICIT'$ , $'MIN-1-VAR'$ , $'MIN-NVAR'$ , $'ADD-1-VAR'$ or $'ADD-NVAR'$ .

‘Bxnl Use Second Derivatives’str

Default $='NO'$

This argument indicates whether the weighted sum of residual Hessians are available through the call-back $l s q h e s$ . If $‘Bxnl Use Second Derivatives' ='NO'$ and the specified model in ‘Bxnl Model’ requires user-suppied second derivatives, then the solver will terminate with $e r r n o$ = 6.

Constraint: $‘Bxnl Use Second Derivatives' ='YES'$ or $'NO'$ .

‘Bxnl Use Weights’str

Default $='NO'$

This argument indicates whether to use a weighted nonlinear least square model. If $‘Bxnl Use Weights' ='YES'$ then the weights $w_{i} > 0, i = 1, \dots, n_{res}$ in [equation] must be supplied by you via handle_set_get_real(). If weights are to be used, then all $n_{res}$ elements must be provided, see Residual Weights. If the solver is expecting to use weights but they are not provided or have non-positive values, then the solver will terminate with $e r r n o$ = 11.

Constraint: $‘Bxnl Use Second Derivatives' ='YES'$ or $'NO'$ .

‘Bxnl Iteration Limit’int

Default $= 1000$

This argument specifies the maximum amount of major iterations the solver is alloted. If this limit is reached, then the solver will terminate with $e r r n o$ = 22.

Constraint: $‘Bxnl Iteration Limit' \geq 1$ .

‘Bxnl Monitor Frequency’int

Default $= 0$

If $‘Bxnl Monitor Frequency' > 0$ , the user-supplied function $m o n i t$ will be called at the end of every $i$ th step for monitoring purposes.

Constraint: $‘Bxnl Monitor Frequency' \geq 0$ .

‘Bxnl Print Header’int

Default $= 30$

This argument defines, in number of iterations, the frequency with which to print the iteration log header.

Constraint: $‘Bxnl Print Header' \geq 1$ .

‘Infinite Bound Size’float

Default $= 10^{20}$

This defines the ‘infinite’ bound $bigbnd$ in the definition of the problem constraints. Any upper bound greater than or equal to $bigbnd$ will be regarded as $+ \infty$ (and similarly any lower bound less than or equal to $- bigbnd$ will be regarded as $- \infty$ ). Note that a modification of this option does not influence constraints which have already been defined; only the constraints formulated after the change will be affected.

Constraint: $‘Infinite Bound Size' \geq 1000$ .

‘Monitoring File’int

Default $= - 1$

If $i \geq 0$ , the unit number for the secondary (monitoring) output. If $‘Monitoring File' = - 1$ , no secondary output is provided. The information output to this unit is controlled by ‘Monitoring Level’.

Constraint: $‘Monitoring File' \geq - 1$ .

‘Monitoring Level’int

Default $= 4$

This argument sets the amount of information detail that will be printed by the solver to the secondary output. The meaning of the levels is the same as for ‘Print Level’.

Constraint: $0 \leq ‘Monitoring Level' \leq 5$ .

‘Print File’int

Default $= advisory message unit number$

If $i \geq 0$ , the unit number for the primary output of the solver. If $‘Print File' = - 1$ , the primary output is completely turned off independently of other settings. The default value is the advisory message unit number at the time of the options initialization, e.g., at the initialization of the handle. The information output to this unit is controlled by ‘Print Level’.

Constraint: $‘Print File' \geq - 1$ .

‘Print Level’int

Default $= 2$

This argument defines how detailed information should be printed by the solver to the primary and secondary output.

$i$	Output
$0$	No output from the solver.
$1$	The Header and Summary.
$2$ , $3$ , $4$ , $5$	Additionally, the Iteration log.

Constraint: $0 \leq ‘Print Level' \leq 5$ .

‘Print Options’str

Default $='YES'$

If $‘Print Options' ='YES'$ , a listing of options will be printed to the primary output and is always printed to the secondary output.

Constraint: $‘Print Options' ='YES'$ or $'NO'$ .

‘Print Solution’str

Default $='NO'$

If $‘Print Solution' ='X'$ , the final values of the primal variables are printed on the primary and secondary outputs.

If $‘Print Solution' ='YES'$ or $'ALL'$ , in addition to the primal variables, the final values of the dual variables are printed on the primary and secondary outputs.

Constraint: $‘Print Solution' ='YES'$ , $'NO'$ , $'X'$ or $'ALL'$ .

‘Stats Time’str

Default $='NO'$

This argument turns on timing. This might be helpful for a choice of different solving approaches. It is possible to choose between CPU and wall clock time. Choice ‘YES’ is equivalent to ‘WALL CLOCK’.

Constraint: $‘Stats Time' ='YES'$ , $'NO'$ , $'CPU'$ or $'WALL CLOCK'$ .

‘Time Limit’float

Default $= 10^{6}$

A limit to the number of seconds that the solver can use to solve one problem. If at the end of an iteration this limit is exceeded, the solver will terminate with $e r r n o$ = 23.

Constraint: $‘Time Limit' > 0$ .

Raises

NagValueError

(errno $1$ )

$h a n d l e$ has not been initialized.

(errno $1$ )

$h a n d l e$ does not belong to the NAG optimization modelling suite, has not been initialized properly or is corrupted.

(errno $1$ )

$h a n d l e$ has not been initialized properly or is corrupted.

(errno $2$ )

This solver does not support the model defined in the handle.

(errno $2$ )

The problem is already being solved.

(errno $3$ )

Unsupported option combinations.

(errno $3$ )

Unsupported model and method chosen.

(errno $4$ )

On entry, $nvar = ⟨ v a l u e ⟩$ , expected $v a l u e = ⟨ v a l u e ⟩$ .

Constraint: $nvar$ must match the current number of variables of the model in the $h a n d l e$ .

(errno $4$ )

The information supplied does not match with that previously stored.

On entry, $n r e s = ⟨ v a l u e ⟩$ must match that given during the definition of the objective in the $h a n d l e$ , i.e., $⟨ v a l u e ⟩$ .

(errno $4$ )

There are no decision variables. $nvar$ must be greater than zero.

(errno $6$ )

Exact second derivatives needed for tensor model.

(errno $11$ )

Data for residual weights not found or is invalid.

(errno $21$ )

The current starting point is unusable.

Warns

NagAlgorithmicMajorWarning

(errno $18$ ): Numerical difficulties encountered and solver was terminated.
(errno $19$ ): Iteration limit reached while solving a subproblem.
(errno $19$ ): Line Search failed.
(errno $22$ ): Maximum number of iterations reached.
(errno $23$ ): The solver terminated after the maximum time allowed was exceeded.
(errno $24$ ): The solver was terminated because no further progress could be achieved.
(errno $25$ ): Invalid number detected in user-supplied function and recovery failed.

NagCallbackTerminateWarning

(errno $20$ ): User requested termination during a monitoring step.

Notes

handle_solve_bxnl computes a solution $x$ to the nonlinear least squares problem

\begin{matrix} \begin{matrix} {minimize}_{x \in R^{n_{var}}} & f (x) = \frac{1}{2} \sum_{i = 1}^{n_{res}} {[w_{i} r_{i} (x)]}^{2} + \frac{σ}{p} {∥ x ∥}_{2}^{p} subject to & l_{x} \leq x \leq u_{x}, \end{matrix} \end{matrix}

where $r_{i} (x), i = 1, \dots, n_{res}$ , are smooth nonlinear functions called residuals, $w_{i}, i = 1, \dots, n_{res}$ are weights (by default they are all defined to $1$ , see Residual Weights on how to change them), and the rightmost element represents the regularization term with argument $σ \geq 0$ and power $p > 0$ (by default the regularization term is not used, see Algorithmic Details on how to enable it). The constraint elements $l_{x}$ and $u_{x}$ are $n_{var}$ -dimensional vectors defining the bounds on the variables.

Typically in a calibration or data fitting context, the residuals will be defined as the difference between the observed values $y_{i}$ at $t_{i}$ and the values provided by a nonlinear model $ϕ (t; x)$ , i.e., $r_{i} (x) := y_{i} - ϕ (t_{i}; x)$ . If these residuals (errors) follow a Gaussian distribution, then the values of the optimal parameter vector $x^{*}$ are the maximum likelihood estimates. For a description of the various algorithms implemented for solving this problem see Algorithmic Details. It is also recommended that you read the E04 Introduction.

handle_solve_bxnl serves as a solver for problems stored as a handle. The handle points to an internal data structure which defines the problem and serves as a means of communication for functions in the NAG optimization modelling suite. First, the problem handle is initialized by calling handle_init(). A nonlinear least square residual objective can be added by calling handle_set_nlnls() and, optionally, (simple) box constraints can be defined by calling handle_set_simplebounds(). It should be noted that handle_solve_bxnl internally works with a dense representation of the residual Jacobian even if a sparse structure was defined in the call to handle_set_nlnls(). Once the problem is fully described, the handle may be passed to the solver handle_solve_bxnl. When the handle is not needed anymore, handle_free() should be called to destroy it and deallocate the memory held within. For more information refer to the NAG optimization modelling suite in the E04 Introduction.

The algorithm is based on the trust region framework and its behaviour can be modified by various options (see Other Parameters) which can be set by handle_opt_set() and handle_opt_set_file() anytime between the initialization of the handle by handle_init() and a call to the solver. Once the solver has finished, options may be modified for the next solve. The solver may be called repeatedly with various starting points and/or options. The option getter handle_opt_get() can be called to retrieve the current value of any option.

Several options might have significant impact on the performance of the solver. Even though the defaults were chosen to suit the majority of anticipated problems, it is recommended that you experiment with the option settings to find the most suitable set of options for a particular problem, see Algorithmic Details and Other Parameters for further details.

References

Adachi, S, Iwata, S, Nakatsukasa, Y, and Takeda, A, 2015, Solving the trust region subproblem by a generalized eigenvalue problem, Technical report, METR 2015-14., Mathematical Engineering, The University of Tokyo, https://www.keisu.t.u-tokyo.ac.jp/data/2015/METR15-14.pdf

Conn, A R, Gould, N I M and Toint, Ph L, 2000, Trust Region Methods, SIAM, Philadephia

Gould, N I M, Orban, D, and Toint, Ph L, 2003, GALAHAD, a library of thread-safe Fortran 90 packages for large-scale nonlinear optimization, ACM Transactions on Mathematical Software (TOMS) (29(4)), 353–372

Gould, N I M, Rees, T, and Scott, J A, 2017, A higher order method for solving nonlinear least-squares problems, Technical report, RAL-P-1027-010, RAL Library. STFC Rutherford Appleton Laboratory, http://www.numerical.rl.ac.uk/people/rees/pdf/RAL-P-2017-010.pdf

Kanzow, C, Yamashita, N, and Fukushima, M, 2004, Levenberg-Marquardt methods with strong local convergence properties for solving nonlinear equations with convex constraints, Journal of Computational and Applied Mathematics (174), 375–397

Lanczos, C, 1956, Applied Analysis, 272–280, Prentice Hall, Englewood Cliffs, NJ, USA

Nielsen, H B, 1999, Damping parameter in Marquadt’s Method, Technical report TR IMM-REP-1999-05., Department of Mathematical Modelling, Technical University of Denmark, http://www2.imm.dtu.dk/documents/ftp/tr99/tr05_99.pdf

Nocedal, J and Wright, S J, 2006, Numerical Optimization, (2nd Edition), Springer Series in Operations Research, Springer, New York

NAG and Python

Return to Front

naginterfaces.library.opt.handle_solve_bxnl¶

naginterfaces.library.opt.handle_​solve_​bxnl¶

naginterfaces.library.opt.handle_solve_bxnl¶