PDF version (NAG web site
, 64-bit version, 64-bit version)
NAG Toolbox: nag_fit_2dspline_sctr (e02dd)
Purpose
nag_fit_2dspline_sctr (e02dd) computes a bicubic spline approximation to a set of scattered data. The knots of the spline are located automatically, but a single argument must be specified to control the trade-off between closeness of fit and smoothness of fit.
Syntax
[
nx,
lamda,
ny,
mu,
c,
fp,
rank,
wrk,
ifail] = e02dd(
start,
x,
y,
f,
w,
s,
nx,
lamda,
ny,
mu,
wrk, 'm',
m, 'nxest',
nxest, 'nyest',
nyest)
[
nx,
lamda,
ny,
mu,
c,
fp,
rank,
wrk,
ifail] = nag_fit_2dspline_sctr(
start,
x,
y,
f,
w,
s,
nx,
lamda,
ny,
mu,
wrk, 'm',
m, 'nxest',
nxest, 'nyest',
nyest)
Note: the interface to this routine has changed since earlier releases of the toolbox:
At Mark 22: |
lwrk was removed from the interface |
Description
nag_fit_2dspline_sctr (e02dd) determines a smooth bicubic spline approximation to the set of data points with weights , for .
The approximation domain is considered to be the rectangle , where and denote the lowest and highest data values of .
The spline is given in the B-spline representation
where
and
denote normalized cubic B-splines, the former defined on the knots
to
and the latter on the knots
to
. For further details, see
Hayes and Halliday (1974) for bicubic splines and
de Boor (1972) for normalized B-splines.
The total numbers
and
of these knots and their values
and
are chosen automatically by the function. The knots
and
are the interior knots; they divide the approximation domain
into
subpanels
, for
and
. Then, much as in the curve case (see
nag_fit_1dspline_auto (e02be)), the coefficients
are determined as the solution of the following constrained minimization problem:
minimize
subject to the constraint
where: |
|
is a measure of the (lack of) smoothness of . Its value depends on the discontinuity jumps in across the boundaries of the subpanels. It is zero only when there are no discontinuities and is positive otherwise, increasing with the size of the jumps (see Dierckx (1981b) for details). |
|
|
denotes the weighted residual
, |
and |
|
is a non-negative number to be specified by you. |
By means of the argument
, ‘the smoothing factor’, you will then control the balance between smoothness and closeness of fit, as measured by the sum of squares of residuals in
(3). If
is too large, the spline will be too smooth and signal will be lost (underfit); if
is too small, the spline will pick up too much noise (overfit). In the extreme cases the method would return an interpolating spline
if
were set to zero, and returns the least squares bicubic polynomial
if
is set very large. Experimenting with
-values between these two extremes should result in a good compromise. (See
Choice of for advice on choice of
.) Note however, that this function, unlike
nag_fit_1dspline_auto (e02be) and
nag_fit_2dspline_grid (e02dc), does not allow
to be set exactly to zero: to compute an interpolant to scattered data,
nag_interp_2d_scat (e01sa) or
nag_interp_2d_scat_shep (e01sg) should be used.
The method employed is outlined in
Outline of method used and fully described in
Dierckx (1981a) and
Dierckx (1981b). It involves an adaptive strategy for locating the knots of the bicubic spline (depending on the function underlying the data and on the value of
), and an iterative method for solving the constrained minimization problem once the knots have been determined.
References
de Boor C (1972) On calculating with B-splines J. Approx. Theory 6 50–62
Dierckx P (1981a) An improved algorithm for curve fitting with spline functions Report TW54 Department of Computer Science, Katholieke Univerciteit Leuven
Dierckx P (1981b) An algorithm for surface fitting with spline functions IMA J. Numer. Anal. 1 267–283
Hayes J G and Halliday J (1974) The least squares fitting of cubic spline surfaces to general data sets J. Inst. Math. Appl. 14 89–103
Peters G and Wilkinson J H (1970) The least squares problem and pseudo-inverses Comput. J. 13 309–316
Reinsch C H (1967) Smoothing by spline functions Numer. Math. 10 177–183
Parameters
Compulsory Input Parameters
- 1:
– string (length ≥ 1)
-
Determines whether calculations are to be performed afresh (Cold Start) or whether knots found in previous calls are to be used as an initial estimate of knot placement (Warm Start).
- The function will build up the knot set starting with no interior knots. No values need be assigned to the arguments nx, ny, lamda, mu or wrk.
- The function will restart the knot-placing strategy using the knots found in a previous call of the function. In this case, the arguments nx, ny, lamda, mu and wrk must be unchanged from that previous call. This warm start can save much time in determining a satisfactory set of knots for the given value of s. This is particularly useful when different smoothing factors are used for the same dataset.
Constraint:
or .
- 2:
– double array
- 3:
– double array
- 4:
– double array
-
, , must be set to the coordinates of , the th data point, for . The order of the data points is immaterial.
- 5:
– double array
-
must be set to
, the
th value in the set of weights, for
. Zero weights are permitted and the corresponding points are ignored, except when determining
,
,
and
(see
Restriction of the approximation domain). For advice on the choice of weights, see
Weighting of data points in the E02 Chapter Introduction.
Constraint:
the number of data points with nonzero weight must be at least .
- 6:
– double scalar
-
The smoothing factor,
s.
For advice on the choice of
s, see
Description and
Choice of .
Constraint:
.
- 7:
– int64int32nag_int scalar
-
If the warm start option is used, the value of
nx must be left unchanged from the previous call.
- 8:
– double array
-
If the warm start option is used, the values must be left unchanged from the previous call.
- 9:
– int64int32nag_int scalar
-
If the warm start option is used, the value of
ny must be left unchanged from the previous call.
- 10:
– double array
-
If the warm start option is used, the values must be left unchanged from the previous call.
- 11:
– double array
-
lwrk, the dimension of the array, must satisfy the constraint
,
where , and ...
If the warm start option is used, on entry, the value of must be left unchanged from the previous call.
This array is used as workspace.
Optional Input Parameters
- 1:
– int64int32nag_int scalar
-
Default:
the dimension of the arrays
x,
y,
f,
w. (An error is raised if these dimensions are not equal.)
, the number of data points.
The number of data points with nonzero weight (see
w) must be at least
.
- 2:
– int64int32nag_int scalar
- 3:
– int64int32nag_int scalar
-
Default:
the dimension of the arrays
lamda,
mu. (An error is raised if these dimensions are not equal.)
An upper bound for the number of knots
and
required in the
- and
-directions respectively.
In most practical situations,
is sufficient. See also
Choice of nxest and nyest.
Constraint:
and .
Output Parameters
- 1:
– int64int32nag_int scalar
-
The total number of knots, , of the computed spline with respect to the variable.
- 2:
– double array
-
Contains the complete set of knots
associated with the
variable, i.e., the interior knots
as well as the additional knots
and
needed for the B-spline representation (where
and
are as described in
Description).
- 3:
– int64int32nag_int scalar
-
The total number of knots, , of the computed spline with respect to the variable.
- 4:
– double array
-
Contains the complete set of knots
associated with the
variable, i.e., the interior knots
as well as the additional knots
and
needed for the B-spline representation (where
and
are as described in
Description).
- 5:
– double array
-
The coefficients of the spline approximation.
is the coefficient
defined in
Description.
- 6:
– double scalar
-
The weighted sum of squared residuals,
, of the computed spline approximation.
fp should equal
s within a relative tolerance of
unless
, when the spline has no interior knots and so is simply a bicubic polynomial. For knots to be inserted,
s must be set to a value below the value of
fp produced in this case.
- 7:
– int64int32nag_int scalar
-
Gives the rank of the system of equations used to compute the final spline (as determined by a suitable machine-dependent threshold). When
, the solution is unique; otherwise the system is rank-deficient and the minimum-norm solution is computed. The latter case may be caused by too small a value of
s.
- 8:
– double array
-
This array is used as workspace.
- 9:
– int64int32nag_int scalar
unless the function detects an error (see
Error Indicators and Warnings).
Error Indicators and Warnings
Errors or warnings detected by the function:
-
-
On entry, | or , |
or | the number of data points with nonzero weight , |
or | , |
or | , |
or | , |
or | , where , and , |
or | . |
-
-
On entry, either all the , for , are equal, or all the , for , are equal.
-
-
The number of knots required is greater than allowed by
nxest and
nyest. Try increasing
nxest and/or
nyest and, if necessary, supplying larger arrays for the arguments
lamda,
mu,
c,
wrk and
iwrk. However, if
nxest and
nyest are already large, say
nxest,
, then this error exit may indicate that
s is too small.
-
-
No more knots can be added because the number of B-spline coefficients
already exceeds the number of data points
m. This error exit may occur if either of
s or
m is too small.
-
-
No more knots can be added because the additional knot would (quasi) coincide with an old one. This error exit may occur if too large a weight has been given to an inaccurate data point, or if
s is too small.
-
-
The iterative process used to compute the coefficients of the approximating spline has failed to converge. This error exit may occur if
s has been set very small. If the error persists with increased
s, contact
NAG.
-
-
lwrk is too small; the function needs to compute the minimal least squares solution of a rank-deficient system of linear equations, but there is not enough workspace. There is no approximation returned but, having saved the information contained in
nx,
lamda,
ny,
mu and
wrk, and having adjusted the value of
lwrk and the dimension of array
wrk accordingly, you can continue at the point the program was left by calling
nag_fit_2dspline_sctr (e02dd) with
. Note that the requested value for
lwrk is only large enough for the current phase of the algorithm. If the function is restarted with
lwrk set to the minimum value requested, a larger request may be made at a later stage of the computation. See
Arguments for the upper bound on
lwrk. On soft failure, the minimum requested value for
lwrk is returned in
and the safe value for
lwrk is returned in
.
-
An unexpected error has been triggered by this routine. Please
contact
NAG.
-
Your licence key may have expired or may not have been installed correctly.
-
Dynamic memory allocation failed.
If
,
,
or
, a spline approximation is returned, but it fails to satisfy the fitting criterion (see
(2) and
(3) in
Description – perhaps only by a small amount, however.
Accuracy
On successful exit, the approximation returned is such that its weighted sum of squared residuals
fp is equal to the smoothing factor
, up to a specified relative tolerance of
– except that if
and
,
fp may be significantly less than
: in this case the computed spline is simply the least squares bicubic polynomial approximation of degree
, i.e., a spline with no interior knots.
Further Comments
Timing
The time taken for a call of nag_fit_2dspline_sctr (e02dd) depends on the complexity of the shape of the data, the value of the smoothing factor , and the number of data points. If nag_fit_2dspline_sctr (e02dd) is to be called for different values of , much time can be saved by setting after the first call.
It should be noted that choosing very small considerably increases computation time.
Choice of s
If the weights have been correctly chosen (see
Weighting of data points in the E02 Chapter Introduction), the standard deviation of
would be the same for all
, equal to
, say. In this case, choosing the smoothing factor
in the range
, as suggested by
Reinsch (1967), is likely to give a good start in the search for a satisfactory value. Otherwise, experimenting with different values of
will be required from the start.
In that case, in view of computation time and memory requirements, it is recommended to start with a very large value for
and so determine the least squares bicubic polynomial; the value returned for
fp, call it
, gives an upper bound for
. Then progressively decrease the value of
to obtain closer fits – say by a factor of
in the beginning, i.e.,
,
, and so on, and more carefully as the approximation shows more details.
To choose
very small is strongly discouraged. This considerably increases computation time and memory requirements. It may also cause rank-deficiency (as indicated by the argument
rank) and endanger numerical stability.
The number of knots of the spline returned, and their location, generally depend on the value of and on the behaviour of the function underlying the data. However, if nag_fit_2dspline_sctr (e02dd) is called with , the knots returned may also depend on the smoothing factors of the previous calls. Therefore if, after a number of trials with different values of and , a fit can finally be accepted as satisfactory, it may be worthwhile to call nag_fit_2dspline_sctr (e02dd) once more with the selected value for but now using . Often, nag_fit_2dspline_sctr (e02dd) then returns an approximation with the same quality of fit but with fewer knots, which is therefore better if data reduction is also important.
Choice of nxest and nyest
The number of knots may also depend on the upper bounds
nxest and
nyest. Indeed, if at a certain stage in
nag_fit_2dspline_sctr (e02dd) the number of knots in one direction (say
) has reached the value of its upper bound (
nxest), then from that moment on all subsequent knots are added in the other
direction. This may indicate that the value of
nxest is too small. On the other hand, it gives you the option of limiting the number of knots the function locates in any direction. For example, by setting
(the lowest allowable value for
nxest), you can indicate that you want an approximation which is a simple cubic polynomial in the variable
.
Restriction of the approximation domain
The fit obtained is not defined outside the rectangle . The reason for taking the extreme data values of and for these four knots is that, as is usual in data fitting, the fit cannot be expected to give satisfactory values outside the data region. If, nevertheless, you require values over a larger rectangle, this can be achieved by augmenting the data with two artificial data points and with zero weight, where denotes the enlarged rectangle.
Outline of method used
First suitable knot sets are built up in stages (starting with no interior knots in the case of a cold start but with the knot set found in a previous call if a warm start is chosen). At each stage, a bicubic spline is fitted to the data by least squares and
, the sum of squares of residuals, is computed. If
, a new knot is added to one knot set or the other so as to reduce
at the next stage. The new knot is located in an interval where the fit is particularly poor. Sooner or later, we find that
and at that point the knot sets are accepted. The function then goes on to compute a spline which has these knot sets and which satisfies the full fitting criterion specified by
(2) and
(3). The theoretical solution has
. The function computes the spline by an iterative scheme which is ended when
within a relative tolerance of
. The main part of each iteration consists of a linear least squares computation of special form, done in a similarly stable and efficient manner as in
nag_fit_2dspline_panel (e02da). As there also, the minimal least squares solution is computed wherever the linear system is found to be rank-deficient.
An exception occurs when the function finds at the start that, even with no interior knots (), the least squares spline already has its sum of squares of residuals . In this case, since this spline (which is simply a bicubic polynomial) also has an optimal value for the smoothness measure , namely zero, it is returned at once as the (trivial) solution. It will usually mean that has been chosen too large.
For further details of the algorithm and its use see
Dierckx (1981b).
Evaluation of Computed Spline
The values of the computed spline at the points
, for
, may be obtained in the double array
ff (see
nag_fit_2dspline_evalv (e02de)), of length at least
, by the following call:
[ff, ifail] = e02de(x, y, lamda, mu, c);
where
and the coordinates
,
are stored in
,
.
PX and
PY have the same values as
nx and
ny as output from
nag_fit_2dspline_sctr (e02dd), and
LAMDA,
MU and
C have the same values as
lamda,
mu and
c output from
nag_fit_2dspline_sctr (e02dd).
WRK is a double workspace array of length at least
, and
IWRK is an integer workspace array of length at least
.
To evaluate the computed spline on a
by
rectangular grid of points in the
-
plane, which is defined by the
coordinates stored in
, for
, and the
coordinates stored in
, for
, returning the results in the double array
ff (see
nag_fit_2dspline_evalm (e02df)) which is of length at least
, the following call may be used:
[fg, ifail] = e02df(tx, ty, lamda, mu, c);
where
,
.
LAMDA,
MU and
C have the same values as
lamda,
mu and
c output from
nag_fit_2dspline_sctr (e02dd).
WRK is a double workspace array of length at least
, where
and
.
IWRK is an integer workspace array of length at least
if
, or
otherwise.
The result of the spline evaluated at grid point is returned in element of the array FG.
Example
This example reads in a value of
m, followed by a set of
m data points
and their weights
. It then calls
nag_fit_2dspline_sctr (e02dd) to compute a bicubic spline approximation for one specified value of
s, and prints the values of the computed knots and B-spline coefficients. Finally it evaluates the spline at a small sample of points on a rectangular grid.
Open in the MATLAB editor:
e02dd_example
function e02dd_example
fprintf('e02dd example results\n\n');
d = [11.16 1.24 22.15;
12.85 3.06 22.11;
19.85 10.72 7.97;
19.72 1.39 16.83;
15.91 7.74 15.30;
0.00 20.00 34.60;
20.87 20.00 5.74;
3.45 12.78 41.24;
14.26 17.87 10.74;
17.43 3.46 18.60;
22.80 12.39 5.47;
7.58 1.98 29.87;
25.00 11.87 4.40;
0.00 0.00 58.20;
9.66 20.00 4.73;
5.22 14.66 40.36;
17.25 19.57 6.43;
25.00 3.87 8.74;
12.13 10.79 13.71;
22.23 6.21 10.25;
11.52 8.53 15.74;
15.20 0.00 21.60;
7.54 10.69 19.31;
17.32 13.78 12.11;
2.14 15.03 53.10;
0.51 8.37 49.43;
22.69 19.63 3.25;
5.47 17.13 28.63;
21.67 14.36 5.52;
3.31 0.33 44.08];
x = d(:,1);
y = d(:,2);
f = d(:,3);
w = ones(size(x));
start = 'C';
s = 10;
nx = int64(0);
lamda = zeros(14,1);
ny = int64(0);
mu = zeros(14,1);
wrk = zeros(11016, 1);
[nx, lamda, ny, mu, c, fp, rank, wrk, ifail] = ...
e02dd( ...
start, x, y, f, w, s, nx, lamda, ny, mu, wrk);
fprintf('\nCalling with smoothing factor S = %5.2f\n', s);
fprintf('Rank deficiency = %4d\n\n',(nx-4)*(ny-4)-rank);
fprintf('Knots: lamda mu\n');
for j = 4:max(nx,ny)-3
if j<=min(nx,ny)-3
fprintf('%4d%10.4f%10.4f\n', j, lamda(j), mu(j));
elseif j<=nx-3
fprintf('%4d%10.4f\n', j, lamda(j));
else
fprintf('%4d%20.4f\n', j, mu(j));
end
end
cp = c(1:(ny-4)*(nx-4));
cp = reshape(cp,[ny-4,nx-4]);
fprintf('\nB-spline coefficients:\n');
disp(cp);
fprintf('Weighted sum of squared residuals = %7.4f\n', fp);
if fp==0
fprintf('(The spline is an interpolating spline)\n');
elseif nx==8 && ny==8
fprintf('(The spline is the weighted least-squares bi-cubic polynomial)\n');
end
fprintf('\n');
mx = [3:21];
my = [2:17];
[ff, ifail] = e02df( ...
mx, my, lamda(1:nx), mu(1:ny), c);
fig1 = figure;
ff = reshape(ff,[16,19]);
meshc(mx,my,ff);
xlabel('x');
ylabel('y');
title('Least-squares bi-cubic spline fit of scattered data');
view(32,40);
e02dd example results
Calling with smoothing factor S = 10.00
Rank deficiency = 0
Knots: lamda mu
4 0.0000 0.0000
5 9.7575 9.0008
6 18.2582 20.0000
7 25.0000
B-spline coefficients:
58.1559 46.3067 6.0058 31.9987 5.8554 -23.7779
63.7813 46.7449 33.3668 18.2980 14.3600 15.9518
40.8392 -33.7898 5.1688 13.0954 -4.1317 19.3683
75.4362 111.9175 6.9393 17.3287 7.0928 -13.2436
34.6068 -42.6140 25.2015 -1.9641 10.3721 -9.0871
Weighted sum of squared residuals = 10.0021
PDF version (NAG web site
, 64-bit version, 64-bit version)
© The Numerical Algorithms Group Ltd, Oxford, UK. 2009–2015