This chapter provides routines for the numerical evaluation of definite integrals in one or more dimensions and for evaluating weights and abscissae of integration rules.
2Background to the Problems
The routines in this chapter are designed to estimate:
(a)the value of a one-dimensional definite integral of the form
(1)
where is defined by you, either at a set of points , for , where , or in the form of a function; and the limits of integration may be finite or infinite.
Some methods are specially designed for integrands of the form
(2)
which contain a factor , called the weight-function, of a specific form. These methods take full account of any peculiar behaviour attributable to the factor.
(b)the values of the one-dimensional indefinite integrals arising from (1) where the ranges of integration are interior to the interval .
(c)the value of a multidimensional definite integral of the form
(3)
where is a function defined by you and is some region of -dimensional space.
The simplest form of is the -rectangle defined by
(4)
where and are constants. When and are functions of (), the region can easily be transformed to the rectangular form (see page 266 of Davis and Rabinowitz (1975)). Some of the methods described incorporate the transformation procedure.
2.1One-dimensional Integrals
To estimate the value of a one-dimensional integral, a quadrature rule uses an approximation in the form of a weighted sum of integrand values, i.e.,
(5)
The points within the interval are known as the abscissae, and the are known as the weights.
More generally, if the integrand has the form (2), the corresponding formula is
(6)
If the integrand is known only at a fixed set of points, these points must be used as the abscissae, and the weighted sum is calculated using finite difference methods. However, if the functional form of the integrand is known, so that its value at any abscissa is easily obtained, then a wide variety of quadrature rules are available, each characterised by its choice of abscissae and the corresponding weights.
The appropriate rule to use will depend on the interval – whether finite or otherwise – and on the form of any factor in the integrand. A suitable value of depends on the general behaviour of ; or of , if there is a factor present.
Among possible rules, we mention particularly the Gaussian formulae, which employ a distribution of abscissae which is optimal for or of polynomial form.
The choice of basic rules constitutes one of the principles on which methods for one-dimensional integrals may be classified. The other major basis of classification is the implementation strategy, of which some types are now presented.
(a)Single rule evaluation procedures
A fixed number of abscissae, , is used. This number and the particular rule chosen uniquely determine the weights and abscissae. No estimate is made of the accuracy of the result.
(b)Automatic procedures
The number of abscissae, , within is gradually increased until consistency is achieved to within a level of accuracy (absolute or relative) you requested. There are essentially two ways of doing this; hybrid forms of these two methods are also possible:
(i)whole interval procedures (non-adaptive)
A series of rules using increasing values of are successively applied over the whole interval . It is clearly more economical if abscissae already used for a lower value of can be used again as part of a higher-order formula. This principle is known as optimal extension. There is no overlap between the abscissae used in Gaussian formulae of different orders. However, the Kronrod formulae are designed to give an optimal -point formula by adding points to an -point Gauss formula. Further extensions have been developed by Patterson.
(ii)adaptive procedures
The interval is repeatedly divided into a number of sub-intervals, and integration rules are applied separately to each sub-interval. Typically, the subdivision process will be carried further in the neighbourhood of a sharp peak in the integrand than where the curve is smooth. Thus, the distribution of abscissae is adapted to the shape of the integrand.
Subdivision raises the problem of what constitutes an acceptable accuracy in each sub-interval. The usual global acceptability criterion demands that the sum of the absolute values of the error estimates in the sub-intervals should meet the conditions required of the error over the whole interval. Automatic extrapolation over several levels of subdivision may eliminate the effects of some types of singularities.
An ideal general-purpose method would be an automatic method which could be used for a wide variety of integrands, was efficient (i.e., required the use of as few abscissae as possible), and was reliable (i.e., always gave results to within the requested accuracy). Complete reliability is unobtainable, and generally higher reliability is obtained at the expense of efficiency, and vice versa. It must therefore be emphasized that the automatic routines in this chapter cannot be assumed to be reliable. In general, however, the reliability is very high.
2.2Multidimensional Integrals
A distinction must be made between cases of moderately low dimensionality (say, up to or dimensions), and those of higher dimensionality. Where the number of dimensions is limited, a one-dimensional method may be applied to each dimension, according to some suitable strategy, and high accuracy may be obtainable (using product rules). However, the number of integrand evaluations rises very rapidly with the number of dimensions, so that the accuracy obtainable with an acceptable amount of computational labour is limited; for example a product of -point rules in dimensions would require more than integrand evaluations. Special techniques such as the Monte Carlo methods can be used to deal with high dimensions.
(a)Products of one-dimensional rules
Using a two-dimensional integral as an example, we have
(7)
(8)
where and are the weights and abscissae of the rules used in the respective dimensions.
A different one-dimensional rule may be used for each dimension, as appropriate to the range and any weight function present, and a different strategy may be used, as appropriate to the integrand behaviour as a function of each independent variable.
For a rule-evaluation strategy in all dimensions, the formula (8) is applied in a straightforward manner. For automatic strategies (i.e., attempting to attain a requested accuracy), there is a problem in deciding what accuracy must be requested in the inner integral(s). Reference to formula (7) shows that the presence of a limited but random error in the -integration for different values of can produce a ‘jagged’ function of , which may be difficult to integrate to the desired accuracy and for this reason products of automatic one-dimensional routines should be used with caution (see Lyness (1983)).
(b)Monte Carlo methods
These are based on estimating the mean value of the integrand sampled at points chosen from an appropriate statistical distribution function. Usually a variance reducing procedure is incorporated to combat the fundamentally slow rate of convergence of the rudimentary form of the technique. These methods can be effective by comparison with alternative methods when the integrand contains singularities or is erratic in some way, but they are of quite limited accuracy.
(c)Number theoretic methods
These are based on the work of Korobov and Conroy and operate by exploiting implicitly the properties of the Fourier expansion of the integrand. Special rules, constructed from so-called optimal coefficients, give a particularly uniform distribution of the points throughout -dimensional space and from their number theoretic properties minimize the error on a prescribed class of integrals. The method can be combined with the Monte Carlo procedure.
(d)Sag–Szekeres method
By transformation this method seeks to induce properties into the integrand which make it accurately integrable by the trapezoidal rule. The transformation also allows effective control over the number of integrand evaluations.
(e)Sparse grid methods
Given a set of one-dimensional quadrature rules of increasing levels of accuracy, the sparse grid method constructs an approximation to a multidimensional integral using -dimensional tensor products of the differences between rules of adjacent levels. This provides a lower theoretical accuracy than the methods in (a), the full grid approach, which is nonetheless still sufficient for various classes of sufficiently smooth integrands. Furthermore, it requries substantially fewer evaluations than the full grid approach. Specifically, if a one-dimensional quadrature rule has points, the full grid will require function evaluations, whereas the sparse grid of level will require . Hence a sparse grid approach is computationally feasible even for integrals over .
Sparse grid methods are deterministic, and may be viewed as automatic whole domain procedures if their level is allowed to increase.
(f)Automatic adaptive procedures
An automatic adaptive strategy in several dimensions normally involves division of the region into subregions, concentrating the divisions in those parts of the region where the integrand is worst behaved. It is difficult to arrange with any generality for variable limits in the inner integral(s). For this reason, some methods use a region where all the limits are constants; this is called a hyper-rectangle. Integrals over regions defined by variable or infinite limits may be handled by transformation to a hyper-rectangle. Integrals over regions so irregular that such a transformation is not feasible may be handled by surrounding the region by an appropriate hyper-rectangle and defining the integrand to be zero outside the desired region. Such a technique should always be followed by a Monte Carlo method for integration.
The method used locally in each subregion produced by the adaptive subdivision process is usually one of three types: Monte Carlo, number theoretic or deterministic. Deterministic methods are usually the most rapidly convergent but are often expensive to use for high dimensionality and not as robust as the other techniques.
3Recommendations on Choice and Use of Available Routines
This section is divided into five subsections. The first subsection illustrates the difference between direct and reverse communication routines. The second subsection highlights the different levels of vectorization provided by different interfaces.
Sections 3.3, 3.3.2 and 3.4 consider in turn routines for: one-dimensional integrals over a finite interval, and over a semi-infinite or an infinite interval; and multidimensional integrals. Within each sub-section, routines are classified by the type of method, which ranges from simple rule evaluation to automatic adaptive algorithms. The recommendations apply particularly when the primary objective is simply to compute the value of one or more integrals, and in these cases the automatic adaptive routines are generally the most convenient and reliable, although also the most expensive in computing time.
Note however that in some circumstances it may be counter-productive to use an automatic routine. If the results of the quadrature are to be used in turn as input to a further computation (e.g., an ‘outer’ quadrature or an optimization problem), then this further computation may be adversely affected by the ‘jagged performance profile’ of an automatic routine; a simple rule-evaluation routine may provide much better overall performance. For further guidance, the article by Lyness (1983) is recommended.
3.1Direct and Reverse Communication
Routines in this chapter which evaluate an integral value may be classified as either direct communication or reverse communication. See Section 7 in How to Use the NAG Library for a description of these terms.
Currently in this chapter the only routine explicitly using reverse communication is d01raf.
3.2Choice of Interface
This section concerns the design of the interface for the provision of abscissae, and the subsequent collection of calculated information, typically integrand evaluations. Vectorized interfaces typically allow for more efficient operation.
(a)Single abscissa interfaces
The algorithm will provide a single abscissa at which information is required. These are typically the most simple to use, although they may be significantly less efficient than a vectorized equivalent. Most of the algorithms in this chapter are of this type.
The algorithm will return a set of abscissae, at all of which information is required. While these are more complicated to use, they are typically more efficient than a non-vectorized equivalent. They reduce the overhead of function calls, allow the avoidance of repetition of computations common to each of the integrand evaluations, and offer greater scope for vectorization and parallelization of your code.
These are routines which allow for multiple integrals to be estimated simultaneously. As with (b) above, these are more complicated to use than single integral routines, however they can provide higher efficiency, particularly if several integrals require the same subcalculations at the same abscissae. They are most efficient if integrals which are supplied together are expected to have similar behaviour over the domain, particularly when the algorithm is adaptive.
If is defined numerically at four or more points, then the Gill–Miller finite difference method (d01gaf) should be used. The interval of integration is taken to coincide with the range of values of the points supplied. It is in the nature of this problem that any routine may be unreliable. In order to check results independently and so as to provide an alternative technique you may fit the integrand by Chebyshev series using e02adf and then use routine e02ajf to evaluate its integral (which need not be restricted to the range of the integration points, as is the case for d01gaf). A further alternative is to fit a cubic spline to the data using e02baf and then to evaluate its integral using e02bdf.
(b)Integrand defined as a function
If the functional form of is known, then one of the following approaches should be taken. They are arranged in the order from most specific to most general, hence the first applicable procedure in the list will be the most efficient.
However, if you do not wish to make any assumptions about the integrand, the most reliable routines to use will be
d01atf (or d01ajf), d01auf (or d01akf), d01alf, d01rgf or d01raf, although these will in general be less efficient for simple integrals.
(i)Rule-evaluation routines
If is known to be sufficiently well behaved (more precisely, can be closely approximated by a polynomial of moderate degree), a Gaussian routine with a suitable number of abscissae may be used.
d01bcford01tbf
with d01fbf may be used if it is required to examine the weights and abscissae.
d01tbf
is faster and more accurate, whereas
d01bcf
is more general. d01uaf uses the same quadrature rules as d01tbf, and may be used if you do not explicitly require the weights and abscissae.
If is well behaved, apart from a weight-function of the form
d01bcfandd01tbf
generate weights and abscissae for specific Gauss rules. Weights and abscissae for other quadrature formulae may be computed using routines d01tdford01tef. Wherever possible use d01tdf in preference to d01tef. The former however requires information that may not be readily available.
(ii)Automatic whole-interval routines
If is reasonably smooth, and the required accuracy is not too high, the automatic whole interval
routines d01arfandd01bdf
may be used. Additionally, d01esf with may be used with an appropriate transformation from the unit interval.
d01bdf uses the Gauss -point rule, with the point Kronrod extension, and the subsequent and point Patterson extensions if required.
d01esf supports multiple simultaneous integrals, and has a vectorized interface. Either high order Gauss–Patterson rules (of size , for ), or high order Clenshaw-Curtis rules (of size , for ). Gauss–Patterson rules possess greater polynomial accuracy, whereas Clenshaw–Curtis rules are often well suited to oscillatory integrals.
d01arf incorporates the same high order Gauss–Patterson rules as d01esf, and is the only routine that may be used for indefinite integration.
(iii)Automatic adaptive routines
Firstly, several routines are available for integrands of the form where is a ‘smooth’ function (i.e., has no singularities, sharp peaks or violent oscillations in the interval of integration) and is a weight function of one of the following forms.
2.if : use
d01aqf
(this integral is called the Hilbert transform of );
3.if or : use
d01anf
(this routine can also handle certain types of singularities in ).
Secondly, there are multiple routines for general , using different strategies.
d01atf (and d01ajf), and d01auf (and d01akf)
use the strategy of Piessens et al. (1983), using repeated bisection of the interval, and in the first case the -algorithm (Wynn (1956)), to improve the integral estimate. This can cope with singularities away from the end points, provided singular points do not occur as abscissae,
d01auf tends to perform better than d01atf
on more oscillatory integrals.
d01alf
uses the same subdivision strategy as
d01atf
over a set of initial interval segments determined by supplied break-points. It is hence suitable for integrals with discontinuities (including switches in definition) or sharp peaks occuring at known points. Such integrals may also be approximated using other routines which do not allow break-points, although such integrals should be evaluated over each of the sub-intervals seperately.
d01raf again uses the strategy of Piessens et al. (1983), and provides the functionality of
d01alf,d01atfandd01auf
in a reverse communication framework. It also supports multiple integrals and uses a vectorized interface for the abscissae. Hence it is likely to be more efficient if several similar integrals are required to be evaluated over the same domain. Furthermore, its behaviour can be tailored through the use of optional parameters.
d01ahf uses the strategy of Patterson (1968) and the -algorithm to adaptively evaluate the integral in question. It tends to be more efficient than the bisection based algorithms, although these tend to be more robust when singularities occur away from the end points.
d01rgf uses another adaptive scheme due to Gonnet (2010). This attempts to match the quadrature rule to the underlying integrand as well as subdividing the domain. Further, it can explicitly deal with singular points at abscissae, should NaN's or ∞ be returned by the user-supplied (sub)routine, provided the generation of these does not cause the program to halt (see Chapter X07).
3.3.2Over a Semi-infinite or Infinite Interval
(a)Integrand defined at a set of points
If is defined numerically at four or more points, and the portion of the integral lying outside the range of the points supplied may be neglected, then the Gill–Miller finite difference method, d01gaf, should be used.
(b)Integrand defined as a function
(i)Rule evaluation routines
If behaves approximately like a polynomial in , apart from a weight function of the form:
1. (semi-infinite interval, lower limit finite); or
2. (semi-infinite interval, upper limit finite); or
3. (infinite interval),
or if behaves approximately like a polynomial in (semi-infinite range), then the Gaussian routines may be used.
d01uaf
may be used if it is not required to examine the weights and abscissae.
d01bcford01tbf
with d01fbf may be used if it is required to examine the weights and abscissae.
d01tbf
is faster and more accurate, whereas
d01bcf
is more general.
d01ubf returns an approximation to the specific problem .
(ii)Automatic adaptive routines
d01amf
may be used, except for integrands which decay slowly towards an infinite end point, and oscillate in sign over the entire range. For this class, it may be possible to calculate the integral by integrating between the zeros and invoking some extrapolation process (see c06baf).
d01asf
may be used for integrals involving weight functions of the form and over a semi-infinite interval (lower limit finite).
The following alternative procedures are mentioned for completeness, though their use will rarely be necessary.
1.If the integrand decays rapidly towards an infinite end point, a finite cut-off may be chosen, and the finite range methods applied.
2.If the only irregularities occur in the finite part (apart from a singularity at the finite limit, with which
d01amf
can cope), the range may be divided, with
d01amf used on the infinite part.
3.A transformation to finite range may be employed, e.g.,
will transform to while for infinite ranges we have
If the integrand behaves badly on and well on or vice versa it is better to compute it as . This saves computing unnecessary function values in the semi-infinite range where the function is well behaved.
3.4Multidimensional Integrals
A number of techniques are available in this area and the choice depends to a large extent on the dimension and the required accuracy. It can be advantageous to use more than one technique as a confirmation of accuracy, particularly for high-dimensional integrations. Several routines include a transformation procedure, using a user-supplied subroutine, which allows general product regions to be easily dealt with in terms of conversion to the standard -cube region.
(a)Products of one-dimensional rules (suitable for up to about dimensions)
If is known to be a sufficiently well behaved function of each variable , apart possibly from weight functions of the types provided, a product of Gaussian rules may be used. These are provided by
d01bcford01tbf
with d01fbf. Rules for finite, semi-infinite and infinite ranges are included.
For two-dimensional integrals only, unless the integrand is very badly behaved, the automatic whole-interval product procedure of d01daf may be used. The limits of the inner integral may be user-specified functions of the outer variable. Infinite limits may be handled by transformation (see Section 3.3.2); end point singularities introduced by transformation should not be troublesome, as the integrand value will not be required on the boundary of the region.
If none of these routines proves suitable and convenient, the one-dimensional routines may be used recursively. For example, the two-dimensional integral
may be expressed as
The user-supplied code to evaluate will call the integration routine for the -integration, which will call more user-supplied code for as a function of ( being effectively a constant).
The reverse communication routine d01raf may be used by itself in a pseudo-recursive manner, in that it may be called to evaluate an inner integral for the integrand value of an outer integral also being calculated by d01raf.
(b)Sag–Szekeres method
Two routines are based on this method.
d01fdf is particularly suitable for integrals of very large dimension although the accuracy is generally not high. It allows integration over either the general product region (with built-in transformation to the -cube) or the -sphere. Although no error estimate is provided, two adjustable arguments may be varied for checking purposes or may be used to tune the algorithm to particular integrals.
d01jaf is also based on the Sag–Szekeres method and integrates over the -sphere. It uses improved transformations which may be varied according to the behaviour of the integrand. Although it can yield very accurate results it can only practically be employed for dimensions not exceeding .
(c)Number Theoretic method
Two subroutines are based on this method, d01gcf and a vectorized equivalent d01gdf.
Algorithms of this type carry out multidimensional integration using the Korobov–Conroy method over a product region with built-in transformation to the -cube. A stochastic modification of this method is incorporated into the routines in this Library, hybridising the technique with the Monte Carlo procedure. An error estimate is provided in terms of the statistical standard error. A number of pre-computed optimal coefficient rules for up to dimensions are provided; others can be computed using d01gyfandd01gzf. Like the Sag–Szekeres method it is suitable for large dimensional integrals although the accuracy is not high.
d01gcf requires a function to be provided to evaluate the value of the integrand at a single abscissa, and a subroutine to return the upper and lower limits of integration in a given dimension.
d01gdf has a vectorized interface which can result in faster execution, especially on vector-processing machines. You are required to provide two subroutines, the first to return an array of values of the integrand at each of an array of points, and the second to evaluate the limits of integration at each of an array of points. This reduces the overhead of function calls, avoids repetitions of computations common to each of the evaluations of the integral and limits of integration, and offers greater scope for vectorization of your code.
(d)A combinatorial extrapolation method
d01paf computes a sequence of approximations and an error estimate to the integral of a function over a multidimensional simplex using a combinatorial method with extrapolation.
(e)Sparse Grid method
d01esf implements a sparse grid quadrature scheme for the integration of a vector of multidimensional integrals over the unit hypercube,
The routine uses a vectorized interface, which returns a set of points at which the integrands must be evaluated in a sparse storage format for efficiency.
Other domains can be readily integrated over by using an appropriate mapping inside the provided subroutine for evaluating the integrands. It is suitable for up to , although no upper bound on the number of dimensions is enforced. It will also evaluate one-dimensional integrals, although in this case the sparse grid used is in fact the full grid.
The routine uses optional parameters, set and queried using the routines d01zkfandd01zlf respectively. Amongst other options, these allow the parallelization of the routine to be controlled.
d01gbf
is an adaptive Monte Carlo routine. This routine is usually slow and not recommended for high-accuracy work. It is a robust routine that can often be used for low-accuracy results with highly irregular integrands or when is large.
d01fcf
is an adaptive deterministic routine. Convergence is fast for well behaved integrands. Highly accurate results can often be obtained for between and , using significantly fewer integrand evaluations than would be required by
d01gbf.
The routine will usually work when the integrand is mildly singular and for should be used before
d01gbf.
If it is known in advance that the integrand is highly irregular, it is best to compare results from at least two different routines.
There are many problems for which one or both of the routines will require large amounts of computing time to obtain even moderately accurate results. The amount of computing time is controlled by the number of integrand evaluations you have allowed, and you should set this argument carefully, with reference to the time available and the accuracy desired.
d01eaf extends the technique of d01fcf to integrate adaptively more than one integrand, that is to calculate the set of integrals
for a set of similar integrands where .
4Decision Trees
Tree 1: One-dimensional integrals over a finite interval
Note:d01atf,d01auf,d01rafandd01rgf are likely to be more efficient due to their vectorized interfaces than d01ajfandd01akf, which use a more conventional user-interface, consistent with other routines in the chapter.
Tree 2: One-dimensional integrals over a semi-infinite or infinite interval
Is the functional form of the integrand known?
Are you concerned with efficiency for simple integrands?
Is the integrand smooth (polynomial-like) with no exceptions?
Is the integrand smooth (polynomial-like) apart from weight function (semi-infinite range) or (infinite range) or is the integrand polynomial-like in ? (semi-infinite range)?