NAG Library Routine Document
E04LYF
1 Purpose
E04LYF is an easy-to-use modified-Newton algorithm for finding a minimum of a function, subject to fixed upper and lower bounds on the independent variables, when first and second derivatives of are available. It is intended for functions which are continuous and which have continuous first and second derivatives (although it will usually work even if the derivatives have occasional discontinuities).
2 Specification
SUBROUTINE E04LYF ( |
N, IBOUND, FUNCT2, HESS2, BL, BU, X, F, G, IW, LIW, W, LW, IUSER, RUSER, IFAIL) |
INTEGER |
N, IBOUND, IW(LIW), LIW, LW, IUSER(*), IFAIL |
REAL (KIND=nag_wp) |
BL(N), BU(N), X(N), F, G(N), W(LW), RUSER(*) |
EXTERNAL |
FUNCT2, HESS2 |
|
3 Description
E04LYF is applicable to problems of the form:
when first and second derivatives of
are available.
Special provision is made for problems which actually have no bounds on the , problems which have only non-negativity bounds and problems in which and . You must supply a subroutine to calculate the values of and its first derivatives at any point and a subroutine to calculate the second derivatives.
From a starting point you supplied there is generated, on the basis of estimates of the curvature of , a sequence of feasible points which is intended to converge to a local minimum of the constrained function.
4 References
Gill P E and Murray W (1976) Minimization subject to bounds on the variables NPL Report NAC 72 National Physical Laboratory
5 Parameters
- 1: N – INTEGERInput
On entry: the number of independent variables.
Constraint:
.
- 2: IBOUND – INTEGERInput
On entry: indicates whether the facility for dealing with bounds of special forms is to be used. It must be set to one of the following values:
- If you are supplying all the and individually.
- If there are no bounds on any .
- If all the bounds are of the form .
- If and .
Constraint:
.
- 3: FUNCT2 – SUBROUTINE, supplied by the user.External Procedure
You must supply this routine to calculate the values of the function
and its first derivatives
at any point
. It should be tested separately before being used in conjunction with E04LYF (see the
E04 Chapter Introduction).
The specification of
FUNCT2 is:
INTEGER |
N, IUSER(*) |
REAL (KIND=nag_wp) |
XC(N), FC, GC(N), RUSER(*) |
|
- 1: N – INTEGERInput
On entry: the number of variables.
- 2: XC(N) – REAL (KIND=nag_wp) arrayInput
On entry: the point at which the function and its derivatives are required.
- 3: FC – REAL (KIND=nag_wp)Output
On exit: the value of the function at the current point .
- 4: GC(N) – REAL (KIND=nag_wp) arrayOutput
On exit: must be set to the value of the first derivative at the point , for .
- 5: IUSER() – INTEGER arrayUser Workspace
- 6: RUSER() – REAL (KIND=nag_wp) arrayUser Workspace
-
FUNCT2 is called with the parameters
IUSER and
RUSER as supplied to E04LYF. You are free to use the arrays
IUSER and
RUSER to supply information to
FUNCT2 as an alternative to using COMMON global variables.
FUNCT2 must either be a module subprogram USEd by, or declared as EXTERNAL in, the (sub)program from which E04LYF is called. Parameters denoted as
Input must
not be changed by this procedure.
- 4: HESS2 – SUBROUTINE, supplied by the user.External Procedure
You must supply this routine to evaluate the elements
of the matrix of second derivatives of
at any point
. It should be tested separately before being used in conjunction with E04LYF (see the
E04 Chapter Introduction).
The specification of
HESS2 is:
INTEGER |
N, LH, IUSER(*) |
REAL (KIND=nag_wp) |
XC(N), HESLC(LH), HESDC(N), RUSER(*) |
|
- 1: N – INTEGERInput
On entry: the number of variables.
- 2: XC(N) – REAL (KIND=nag_wp) arrayInput
On entry: the point at which the derivatives are required.
- 3: HESLC(LH) – REAL (KIND=nag_wp) arrayOutput
On exit:
HESS2 must place the strict lower triangle of the second derivative matrix
in
HESLC, stored by rows, i.e., set
, for
and
. (The upper triangle is not required because the matrix is symmetric.)
- 4: LH – INTEGERInput
On entry: the length of the array
HESLC.
- 5: HESDC(N) – REAL (KIND=nag_wp) arrayOutput
On exit: must contain the diagonal elements of the second derivative matrix, i.e., set
, for .
- 6: IUSER() – INTEGER arrayUser Workspace
- 7: RUSER() – REAL (KIND=nag_wp) arrayUser Workspace
-
HESS2 is called with the parameters
IUSER and
RUSER as supplied to E04LYF. You are free to use the arrays
IUSER and
RUSER to supply information to
HESS2 as an alternative to using COMMON global variables.
HESS2 must either be a module subprogram USEd by, or declared as EXTERNAL in, the (sub)program from which E04LYF is called. Parameters denoted as
Input must
not be changed by this procedure.
- 5: BL(N) – REAL (KIND=nag_wp) arrayInput/Output
On entry: the lower bounds
.
If
IBOUND is set to
,
must be set to
, for
. (If a lower bound is not specified for any
, the corresponding
should be set to
.)
If
IBOUND is set to
, you must set
to
; E04LYF will then set the remaining elements of
BL equal to
.
On exit: the lower bounds actually used by E04LYF.
- 6: BU(N) – REAL (KIND=nag_wp) arrayInput/Output
On entry: the upper bounds
.
If
IBOUND is set to
,
must be set to
, for
. (If an upper bound is not specified for any
the corresponding
should be set to
.)
If
IBOUND is set to
, you must set
to
; E04LYF will then set the remaining elements of
BU equal to
.
On exit: the upper bounds actually used by E04LYF.
- 7: X(N) – REAL (KIND=nag_wp) arrayInput/Output
On entry: must be set to a guess at the th component of the position of the minimum, for . The routine checks the gradient and the Hessian matrix at the starting point, and is more likely to detect any error in your programming if the initial are nonzero and mutually distinct.
On exit: the lowest point found during the calculations. Thus, if on exit, is the th component of the position of the minimum.
- 8: F – REAL (KIND=nag_wp)Output
On exit: the value of
corresponding to the final point stored in
X.
- 9: G(N) – REAL (KIND=nag_wp) arrayOutput
On exit: the value of
corresponding to the final point stored in
X, for
; the value of
for variables not on a bound should normally be close to zero.
- 10: IW(LIW) – INTEGER arrayWorkspace
- 11: LIW – INTEGERInput
On entry: the dimension of the array
IW as declared in the (sub)program from which E04LYF is called.
Constraint:
.
- 12: W(LW) – REAL (KIND=nag_wp) arrayWorkspace
- 13: LW – INTEGERInput
On entry: the dimension of the array
W as declared in the (sub)program from which E04LYF is called.
Constraint:
.
- 14: IUSER() – INTEGER arrayUser Workspace
- 15: RUSER() – REAL (KIND=nag_wp) arrayUser Workspace
-
IUSER and
RUSER are not used by E04LYF, but are passed directly to
FUNCT2 and
HESS2 and may be used to pass information to these routines as an alternative to using COMMON global variables.
- 16: IFAIL – INTEGERInput/Output
-
On entry:
IFAIL must be set to
,
. If you are unfamiliar with this parameter you should refer to
Section 3.3 in the Essential Introduction for details.
For environments where it might be inappropriate to halt program execution when an error is detected, the value
is recommended. If the output of error messages is undesirable, then the value
is recommended. Otherwise, because for this routine the values of the output parameters may be useful even if
on exit, the recommended value is
.
When the value is used it is essential to test the value of IFAIL on exit.
On exit:
unless the routine detects an error or a warning has been flagged (see
Section 6).
6 Error Indicators and Warnings
If on entry
or
, explanatory error messages are output on the current error message unit (as defined by
X04AAF).
Note: E04LYF may return useful information for one or more of the following detected errors or warnings.
Errors or warnings detected by the routine:
-
On entry, | , |
or | , |
or | , |
or | and for some , |
or | and , |
or | , |
or | . |
There have been
function evaluations, yet the algorithm does not seem to be converging. The calculations can be restarted from the final point held in
X. The error may also indicate that
has no minimum.
The conditions for a minimum have not all been met but a lower point could not be found and the algorithm has failed.
Not used. (This value of the parameter is included so as to make the significance of etc. consistent in the easy-to-use routines.)
There is some doubt about whether the point
found by E04LYF is a minimum. The degree of confidence in the result decreases as
IFAIL increases. Thus, when
it is probable that the final
gives a good estimate of the position of a minimum, but when
it is very unlikely that the routine has found a minimum.
In the search for a minimum, the modulus of one of the variables has become very large
. This indicates that there is a mistake in user-supplied subroutines
FUNCT2 or
HESS2, that your problem has no finite solution, or that the problem needs rescaling (see
Section 8).
It is very likely that you have made an error in forming the gradient.
It is very likely that you have made an error in forming the second derivatives.
If you are dissatisfied with the result (e.g., because , , or ), it is worth restarting the calculations from a different starting point (not the point at which the failure occurred) in order to avoid the region which caused the failure.
7 Accuracy
When a successful exit is made then, for a computer with a mantissa of decimals, one would expect to get about decimals accuracy in , and about decimals accuracy in , provided the problem is reasonably well scaled.
The number of iterations required depends on the number of variables, the behaviour of
and the distance of the starting point from the solution. The number of operations performed in an iteration of E04LYF is roughly proportional to
. In addition, each iteration makes one call of
HESS2 and at least one call of
FUNCT2. So, unless
, the gradient vector and the matrix of second derivatives can be evaluated very quickly, the run time will be dominated by the time spent in user-supplied subroutines
FUNCT2 and
HESS2.
Ideally the problem should be scaled so that at the solution the value of and the corresponding values of are each in the range , and so that at points a unit distance away from the solution, is approximately a unit value greater than at the minimum. It is unlikely that you will be able to follow these recommendations very closely, but it is worth trying (by guesswork), as sensible scaling will reduce the difficulty of the minimization problem, so that E04LYF will take less computer time.
9 Example
A program to minimize
subject to
starting from the initial guess
. (In practice, it is worth trying to make user-supplied subroutines
FUNCT2 and
HESS2 as efficient as possible. This has not been done in the example program for reasons of clarity.)
9.1 Program Text
Program Text (e04lyfe.f90)
9.2 Program Data
None.
9.3 Program Results
Program Results (e04lyfe.r)