e04ggc is part of the new NAG optimization modelling suite (see Section 4.1 in the E04 Chapter Introduction), therefore the definition of the nonlinear residual function values and gradients need to be split into two separate subroutines. e04ggc offers a significant improvement in performance over e04gbc as well as additional functionality, such as the addition of variable bounds and user-evaluation recovery, amongst many others.
e04gbc is a comprehensive algorithm for finding an unconstrained minimum of a sum of squares of nonlinear functions in variables . First derivatives are required.
e04gbc is intended for objective functions which have continuous first and second derivatives (although it will usually work even if the derivatives have occasional discontinuities).
The function may be called by the names: e04gbc, nag_opt_lsq_uncon_quasi_deriv_comp or nag_opt_lsq_deriv.
3Description
e04gbc is applicable to problems of the form:
where and . (The functions are often referred to as ‘residuals’.) You must supply a function to calculate the values of the and their first derivatives at any point .
From a starting point e04gbc generates a sequence of points which is intended to converge to a local minimum of . The sequence of points is given by
where the vector is a direction of search, and is chosen such that is approximately a minimum with respect to .
The vector used depends upon the reduction in the sum of squares obtained during the last iteration. If the sum of squares was sufficiently reduced, then is the Gauss–Newton direction; otherwise the second derivatives of the are taken into account using a quasi-Newton updating scheme.
The method is designed to ensure that steady progress is made whatever the starting point, and to have the rapid ultimate convergence of Newton's method.
4References
Gill P E and Murray W (1978) Algorithms for the solution of the nonlinear least squares problem SIAM J. Numer. Anal.15 977–992
5Arguments
1: – IntegerInput
On entry: , the number of residuals, .
2: – IntegerInput
On entry: , the number of variables, .
Constraint:
.
3: – function, supplied by the userExternal Function
lsqfun must calculate the vector of values and their first derivatives at any point . (However, if you do not wish to calculate the residuals at a particular , there is the option of setting an argument to cause e04gbc to terminate immediately.)
On entry: the numbers and of residuals and variables, respectively.
3: – const doubleInput
On entry: the point at which the values of the and the are required.
4: – doubleOutput
On exit: unless on entry, or is reset to a negative number, then must contain the value of at the point , for .
5: – doubleOutput
On exit: unless on entry, or is reset to a negative number, then must contain the value of the first derivative at the point , for and .
6: – IntegerInput
On entry: the stride separating matrix column elements in the array fjac.
7: – Nag_Comm *
Pointer to structure of type Nag_Comm; the following members are relevant to lsqfun.
flag – IntegerInput/Output
On entry: contains , 1 or . The value 0 indicates that only the residuals need to be evaluated, the value 1 indicates that only the Jacobian matrix needs to be evaluated, and the value 2 indicates that both the residuals and the Jacobian matrix must be calculated. (If the default value of the optional parameter is used (i.e., ), then lsqfun will always be called with set to 2.)
On exit: if lsqfun resets to some negative number then e04gbc will terminate immediately with the error indicator NE_USER_STOP. If fail is supplied to e04gbc, will be set to the user's setting of .
first – Nag_BooleanInput
On entry: will be set to Nag_TRUE on the first call to lsqfun and Nag_FALSE for all subsequent calls.
nf – IntegerInput
On entry: the number of calls made to lsqfun including the current one.
user – double *
iuser – Integer *
p – Pointer
The type Pointer will be void * with a C compiler that defines void * and char * otherwise. Before calling e04gbc these pointers may be allocated memory and initialized with various quantities for use by lsqfun when called from e04gbc.
Note:lsqfun should not return floating-point NaN (Not a Number) or infinity values, since these are not handled by e04gbc. If your code inadvertently does return any NaNs or infinities, e04gbc is likely to produce unexpected results.
Note: lsqfun should be tested separately before being used in conjunction with e04gbc. Function e04yac may be used to check the derivatives.
4: – doubleInput/Output
On entry: must be set to a guess at the th component of the position of the minimum, for .
On exit: the final point . On successful exit, is the th component of the estimated position of the minimum.
5: – double *Output
On exit: the value of , the sum of squares of the residuals , at the final point given in x.
6: – doubleOutput
On exit: is the value of the residual at the final point given in x, for .
7: – doubleOutput
On exit: contains the value of the first derivative at the final point given in x, for and .
8: – IntegerInput
On entry: the stride separating matrix column elements in the array fjac.
Constraint:
.
9: – Nag_E04_Opt *Input/Output
On entry/exit: a pointer to a structure of type Nag_E04_Opt whose members are optional parameters for e04gbc. These structure members offer the means of adjusting some of the argument values of the algorithm and on output will supply further details of the results. A description of the members of options is given in Section 11.2.
If any of these optional parameters are required then the structure options should be declared and initialized by a call to e04xxc and supplied as an argument to e04gbc. However, if the optional parameters are not required the NAG defined null pointer, E04_DEFAULT, can be used in the function call.
10: – Nag_Comm *Input/Output
Note:comm is a NAG defined type (see Section 3.1.1 in the Introduction to the NAG Library CL Interface).
On entry/exit: structure containing pointers for communication to the user-supplied function; see the above description of lsqfun for details. If you do not need to make use of this communication feature the null pointer NAGCOMM_NULL may be used in the call to e04gbc; comm will then be declared internally for use in calls to the user-supplied function.
11: – NagError *Input/Output
The NAG error argument (see Section 7 in the Introduction to the NAG Library CL Interface).
The exits NW_TOO_MANY_ITER, NW_COND_MIN, and NE_SVD_FAIL may also be caused by mistakes in lsqfun, by the formulation of the problem or by an awkward function. If there are no such mistakes it is worth restarting the calculations from a different starting point (not the point at which the failure occurred) in order to avoid the region which caused the failure.
NE_2_INT_ARG_LT
On entry, while . These arguments must satisfy .
On entry, while . These arguments must satisfy .
On entry, while . These arguments must satisfy .
NE_2_REAL_ARG_LT
On entry, while . These arguments must satisfy .
NE_ALLOC_FAIL
Dynamic memory allocation failed.
NE_BAD_PARAM
On entry, argument had an illegal value.
On entry, argument had an illegal value.
NE_DERIV_ERRORS
Large errors were found in the derivatives of the objective function.
You should check carefully the derivation and programming of expressions for the , because it is very unlikely that lsqfun is calculating them correctly.
NE_INT_ARG_LT
On entry, .
Constraint: .
NE_INVALID_INT_RANGE_1
Value given to not valid. Correct range is .
NE_INVALID_REAL_RANGE_EF
Value given to not valid. Correct range is .
NE_INVALID_REAL_RANGE_FF
Value given to not valid. Correct range is .
NE_NOT_APPEND_FILE
Cannot open file for appending.
NE_NOT_CLOSE_FILE
Cannot close file .
NE_OPT_NOT_INIT
Options structure not initialized.
NE_SVD_FAIL
The computation of the singular value decomposition of the Jacobian matrix has failed to converge in a reasonable number of sub-iterations.
It may be worth applying e04gbc again starting with an initial approximation which is not too close to the point at which the failure occurred.
NE_USER_STOP
User requested termination, user flag value .
This exit occurs if you set to a negative value in lsqfun. If fail is supplied the value of will be the same as your setting of .
NE_WRITE_ERROR
Error occurred when writing to file .
NW_COND_MIN
The conditions for a minimum have not all been satisfied, but a lower point could not be found.
This could be because has been set so small that rounding errors in the evaluation of the residuals make attainment of the convergence conditions impossible.
See Section 7 for further information.
NW_TOO_MANY_ITER
The maximum number of iterations, , have been performed.
If steady reductions in the sum of squares, , were monitored up to the point where this exit occurred, then the exit probably occurred simply because was set too small, so the calculations should be restarted from the final point held in x. This exit may also indicate that has no minimum.
7Accuracy
If the problem is reasonably well scaled and a successful exit is made, then, for a computer with a mantissa of decimals, one would expect to get about decimals accuracy in the components of and between (if is of order 1 at the minimum) and (if is close to zero at the minimum) decimals accuracy in .
A successful exit () is made from e04gbc when (B1, B2 and B3) or B4 or B5 hold, where
and where , and the optional parameter are as defined in Section 11.2, while and are the values of and its vector of first derivatives at .
If then the vector in x on exit, , is almost certainly an estimate of , the position of the minimum to the accuracy specified by .
If , then may still be a good estimate of , but to verify this you should make the following checks. If
(a)the sequence converges to at a superlinear or a fast linear rate, and
(b),
where denotes transpose, then it is almost certain that is a close approximation to the minimum. When (b) is true, then usually is a close approximation to .
Further suggestions about confirmation of a computed solution are given in the E04 Chapter Introduction.
8Parallelism and Performance
Background information to multithreading can be found in the Multithreading documentation.
e04gbc is not threaded in any implementation.
9Further Comments
The number of iterations required depends on the number of variables, the number of residuals, the behaviour of , the accuracy demanded and the distance of the starting point from the solution. The number of multiplications performed per iteration of e04gbc varies, but for is approximately . In addition, each iteration makes at least one call of lsqfun. So, unless the residuals can be evaluated very quickly, the run time will be dominated by the time spent in lsqfun.
Ideally, the problem should be scaled so that, at the solution, and the corresponding values of the are each in the range , and so that at points one unit away from the solution, differs from its value at the solution by approximately one unit. This will usually imply that the Hessian matrix of at the solution is well-conditioned. It is unlikely that you will be able to follow these recommendations very closely, but it is worth trying (by guesswork), as sensible scaling will reduce the difficulty of the minimization problem, so that e04gbc will take less computer time.
When the sum of squares represents the goodness-of-fit of a nonlinear model to observed data, elements of the variance-covariance matrix of the estimated regression coefficients can be computed by a subsequent call to e04ycc, using information returned in the arrays and . See e04ycc for further details.
10Example
This example finds the least squares estimates of , and in the model
using the 15 sets of data given in the following table.
0.14
1.0
15.0
1.0
0.18
2.0
14.0
2.0
0.22
3.0
13.0
3.0
0.25
4.0
12.0
4.0
0.29
5.0
11.0
5.0
0.32
6.0
10.0
6.0
0.35
7.0
9.0
7.0
0.39
8.0
8.0
8.0
0.37
9.0
7.0
7.0
0.58
10.0
6.0
6.0
0.73
11.0
5.0
5.0
0.96
12.0
4.0
4.0
1.34
13.0
3.0
3.0
2.10
14.0
2.0
2.0
4.39
15.0
1.0
1.0
The program uses (, , 1.5) as the initial guess at the position of the minimum.
The program shows the use of certain optional parameters, with some option values being assigned directly within the program text and by reading values from a data file.
The options structure is declared and initialized by e04xxc. A value is then assigned directly to options and three further options are read from the data file by use of e04xyc. The memory freeing function e04xzc is used to free the memory assigned to the pointers in the option structure. You must not use the standard C function free() for this purpose.
A number of optional input and output arguments to e04gbc are available through the structure argument options, type Nag_E04_Opt. An argument may be selected by assigning an appropriate value to the relevant structure member; those arguments not selected will be assigned default values. If no use is to be made of any of the optional parameters you should use the NAG defined null pointer, E04_DEFAULT, in place of options when calling e04gbc; the default settings will then be used for all arguments.
Before assigning values to options directly the structure must be initialized by a call to the function e04xxc. Values may then be assigned to the structure members in the normal C manner.
After return from e04gbc, the options structure may only be re-used for future calls of e04gbc if the dimensions of the new problem are the same. Otherwise, the structure must be cleared by a call of e04xzc) and re-initialized by a call of e04xxc before future calls. Failure to do this will result in unpredictable behaviour.
Optional parameter settings may also be read from a text file using the function e04xyc in which case initialization of the options structure will be performed automatically if not already done. Any subsequent direct assignment to the options structure must not be preceded by initialization.
If assignment of functions and memory to pointers in the options structure is required, this must be done directly in the calling program. They cannot be assigned using e04xyc.
11.1Optional Parameter Checklist and Default Values
For easy reference, the following list shows the members of options which are valid for e04gbc together with their default values where relevant. The number is a generic notation for machine precision (see X02AJC).
On entry: if a check of the derivatives defined by lsqfun will be made at the starting point x. The derivative check is carried out by a call to e04yac. A starting point of or should be avoided if this test is to be meaningful, but if either of these starting points is necessary then e04yac should be used to check lsqfun at a different point prior to calling e04gbc.
max_iter – Integer
Default
On entry: the limit on the number of iterations allowed before termination.
Constraint:
.
optim_tol – double
Default
On entry: the accuracy in to which the solution is required. If is the true value of at the minimum, then , the estimated position prior to a normal exit, is such that
where . For example, if the elements of are not much larger than in modulus and if , then is usually accurate to about five decimal places. (For further details see Section 7.) If and the variables are scaled roughly as described in Section 9 and is the machine precision, then a setting of order will usually be appropriate.
Constraint:
.
minlin – Nag_LinFun
Default
On entry: specifies whether the linear minimizations (i.e., minimizations of with respect to ) are to be performed by a function which just requires the evaluation of the , , or by a function which also requires the first derivatives of the , .
It will often be possible to evaluate the first derivatives of the residuals in about the same amount of computer time that is required for the evaluation of the residuals themselves – if this is so then e04gbc should be called with set to . However, if the evaluation of the derivatives takes more than about four times as long as the evaluation of the residuals, then a setting of will usually be preferable. If in doubt, use the default setting as it is slightly more robust.
Constraint:
or .
linesearch_tol – double
Default . (If , default )
If then the default value of will be changed from to if .
On entry: specifies how accurately the linear minimizations are to be performed.
Every iteration of e04gbc involves a linear minimization, i.e., minimization of with respect to . The minimum with respect to will be located more accurately for small values of (say 0.01) than for large values (say 0.9). Although accurate linear minimizations will generally reduce the number of iterations performed by e04gbc, they will increase the number of calls of lsqfun made each iteration. On balance it is usually more efficient to perform a low accuracy minimization.
Constraint:
.
step_max – double
Default
On entry: an estimate of the Euclidean distance between the solution and the starting point supplied. (For maximum efficiency, a slight overestimate is preferable.) e04gbc will ensure that, for each iteration,
where is the iteration number. Thus, if the problem has more than one solution, e04gbc is most likely to find the one nearest to the starting point. On difficult problems, a realistic choice can prevent the sequence entering a region where the problem is ill-behaved and can help avoid overflow in the evaluation of . However, an underestimate of can lead to inefficiency.
Constraint:
.
s – double *
Default memory
On entry: n values of memory will be automatically allocated by e04gbc and this is the recommended method of use of . However, you may supply memory from the calling program.
On exit: the singular values of the Jacobian matrix at the final point. Thus may be useful as information about the structure of your problem.
v – double *
Default memory
On entry: values of memory will be automatically allocated by e04gbc and this is the recommended method of use of . However, you may supply memory from the calling program.
On exit: the matrix associated with the singular value decomposition
of the Jacobian matrix at the final point, stored by rows. This matrix may be useful for statistical purposes, since it is the matrix of orthonormalized eigenvectors of .
tdv – Integer
Default
On entry: if memory is supplied then must contain the last dimension of the array assigned to as declared in the function from which e04gbc is called.
On exit: the trailing dimension used by . If the NAG default memory allocation has been used this value will be n.
Constraint:
.
grade – Integer
On exit: the grade of the Jacobian at the final point. e04gbc estimates the dimension of the subspace for which the Jacobian matrix can be used as a valid approximation to the curvature (see Gill and Murray (1978)); this estimate is called the grade.
iter – Integer
On exit: the number of iterations which have been performed in e04gbc.
nf – Integer
On exit: the number of times the residuals have been evaluated (i.e., the number of calls of lsqfun).
11.3Description of Printed Output
The level of printed output can be controlled with the structure members and (see Section 11.2). If then the argument values to e04gbc are listed, whereas the printout of results is governed by the value of . The default of provides a single line of output at each iteration and the final result. This section describes all of the possible levels of results printout available from e04gbc.
When or a single line of output is produced on completion of each iteration, this gives the following values:
the step taken along the computed search direction .
When more detailed results are given at each iteration. Additional values output are:
Grade
the grade of the Jacobian matrix. (See description of , Section 9.)
x
the current point .
g
the current gradient of .
Singular values
the singular values of the current approximation to the Jacobian matrix.
If , or the final result consists of:
x
the final point .
g
the gradient of at the final point.
Residuals
the values of the residuals at the final point.
Sum of squares
the value of , the sum of squares of the residuals at the final point.
If then printout will be suppressed; you can print the final solution when e04gbc returns to the calling program.
11.3.1Output of results via a user-defined printing function
You may also specify your own print function for output of iteration results and the final solution by use of the function pointer, which has prototype
void (*print_fun)(const Nag_Search State *st, Nag_Comm *comm);
The rest of this section can be skipped if the default printing facilities provide the required functionality.
When a user-defined function is assigned to this will be called in preference to the internal print function of e04gbc. Calls to the user-defined function are again controlled by means of the member. Information is provided through st and comm, the two structure arguments to . The structure member is relevant in this context. If then the results from the last iteration of e04gbc are in the following members of st:
m – Integer
The number of residuals.
n – Integer
The number of variables.
x – double *
Points to the memory locations holding the current point .
fvec – double *
Points to the memory locations holding the values of the residuals at the current point .
fjac – double *
Points to memory locations. contains the value of , for and at the current point .
tdfjac – Integer
The trailing dimension for .
step – double
The step taken along the search direction .
xk_norm – double
The Euclidean norm of .
g – double *
Points to the memory locations holding the gradient of at the current point .
grade – Integer
The grade of the Jacobian matrix.
s – double *
Points to the memory locations holding the singular values of the current Jacobian.
Will be Nag_TRUE when the print function is called with the result of the current iteration.
sol_prt – Nag_Boolean
Will be Nag_TRUE when the print function is called with the final result.
user – double *
iuser – Integer *
p – Pointer
Pointers for communication of user information. If used they must be allocated memory either before entry to e04gbc or during a call to lsqfun or . The type Pointer will be void * with a C compiler that defines void * and char * otherwise.