Integer type:  int32  int64  nag_int  show int32  show int32  show int64  show int64  show nag_int  show nag_int

Chapter Contents
Chapter Introduction
NAG Toolbox

# NAG Toolbox: nag_rand_field_1d_user_setup (g05zm)

## Purpose

nag_rand_field_1d_user_setup (g05zm) performs the setup required in order to simulate stationary Gaussian random fields in one dimension, for a user-defined variogram, using the circulant embedding method. Specifically, the eigenvalues of the extended covariance matrix (or embedding matrix) are calculated, and their square roots output, for use by nag_rand_field_1d_generate (g05zp), which simulates the random field.

## Syntax

[lam, xx, m, approx, rho, icount, eig, user, ifail] = g05zm(ns, xmin, xmax, var, cov1, 'maxm', maxm, 'pad', pad, 'icorr', icorr, 'user', user)
[lam, xx, m, approx, rho, icount, eig, user, ifail] = nag_rand_field_1d_user_setup(ns, xmin, xmax, var, cov1, 'maxm', maxm, 'pad', pad, 'icorr', icorr, 'user', user)

## Description

A one-dimensional random field $Z\left(x\right)$ in $ℝ$ is a function which is random at every point $x\in ℝ$, so $Z\left(x\right)$ is a random variable for each $x$. The random field has a mean function $\mu \left(x\right)=𝔼\left[Z\left(x\right)\right]$ and a symmetric positive semidefinite covariance function $C\left(x,y\right)=𝔼\left[\left(Z\left(x\right)-\mu \left(x\right)\right)\left(Z\left(y\right)-\mu \left(y\right)\right)\right]$. $Z\left(x\right)$ is a Gaussian random field if for any choice of $n\in ℕ$ and ${x}_{1},\dots ,{x}_{n}\in ℝ$, the random vector ${\left[Z\left({x}_{1}\right),\dots ,Z\left({x}_{n}\right)\right]}^{\mathrm{T}}$ follows a multivariate Normal distribution, which would have a mean vector $\stackrel{~}{\mathbf{\mu }}$ with entries ${\stackrel{~}{\mu }}_{i}=\mu \left({x}_{i}\right)$ and a covariance matrix $\stackrel{~}{C}$ with entries ${\stackrel{~}{C}}_{ij}=C\left({x}_{i},{x}_{j}\right)$. A Gaussian random field $Z\left(x\right)$ is stationary if $\mu \left(x\right)$ is constant for all $x\in ℝ$ and $C\left(x,y\right)=C\left(x+a,y+a\right)$ for all $x,y,a\in ℝ$ and hence we can express the covariance function $C\left(x,y\right)$ as a function $\gamma$ of one variable: $C\left(x,y\right)=\gamma \left(x-y\right)$. $\gamma$ is known as a variogram (or more correctly, a semivariogram) and includes the multiplicative factor ${\sigma }^{2}$ representing the variance such that $\gamma \left(0\right)={\sigma }^{2}$.
The functions nag_rand_field_1d_user_setup (g05zm) and nag_rand_field_1d_generate (g05zp) are used to simulate a one-dimensional stationary Gaussian random field, with mean function zero and variogram $\gamma \left(x\right)$, over an interval $\left[{x}_{\mathrm{min}},{x}_{\mathrm{max}}\right]$, using an equally spaced set of $N$ points on the interval. The problem reduces to sampling a Normal random vector $\mathbf{X}$ of size $N$, with mean vector zero and a symmetric Toeplitz covariance matrix $A$. Since $A$ is in general expensive to factorize, a technique known as the circulant embedding method is used. $A$ is embedded into a larger, symmetric circulant matrix $B$ of size $M\ge 2\left(N-1\right)$, which can now be factorized as $B=W\Lambda {W}^{*}={R}^{*}R$, where $W$ is the Fourier matrix (${W}^{*}$ is the complex conjugate of $W$), $\Lambda$ is the diagonal matrix containing the eigenvalues of $B$ and $R={\Lambda }^{\frac{1}{2}}{W}^{*}$. $B$ is known as the embedding matrix. The eigenvalues can be calculated by performing a discrete Fourier transform of the first row (or column) of $B$ and multiplying by $M$, and so only the first row (or column) of $B$ is needed – the whole matrix does not need to be formed.
As long as all of the values of $\Lambda$ are non-negative (i.e., $B$ is positive semidefinite), $B$ is a covariance matrix for a random vector $\mathbf{Y}$, two samples of which can now be simulated from the real and imaginary parts of ${R}^{*}\left(\mathbf{U}+i\mathbf{V}\right)$, where $\mathbf{U}$ and $\mathbf{V}$ have elements from the standard Normal distribution. Since ${R}^{*}\left(\mathbf{U}+i\mathbf{V}\right)=W{\Lambda }^{\frac{1}{2}}\left(\mathbf{U}+i\mathbf{V}\right)$, this calculation can be done using a discrete Fourier transform of the vector ${\Lambda }^{\frac{1}{2}}\left(\mathbf{U}+i\mathbf{V}\right)$. Two samples of the random vector $\mathbf{X}$ can now be recovered by taking the first $N$ elements of each sample of $\mathbf{Y}$ – because the original covariance matrix $A$ is embedded in $B$, $\mathbf{X}$ will have the correct distribution.
If $B$ is not positive semidefinite, larger embedding matrices $B$ can be tried; however if the size of the matrix would have to be larger than maxm, an approximation procedure is used. We write $\Lambda ={\Lambda }_{+}+{\Lambda }_{-}$, where ${\Lambda }_{+}$ and ${\Lambda }_{-}$ contain the non-negative and negative eigenvalues of $B$ respectively. Then $B$ is replaced by $\rho {B}_{+}$ where ${B}_{+}=W{\Lambda }_{+}{W}^{*}$ and $\rho \in \left(0,1\right]$ is a scaling factor. The error $\epsilon$ in approximating the distribution of the random field is given by
 $ε= 1-ρ 2 trace⁡Λ + ρ2 trace⁡Λ- M .$
Three choices for $\rho$ are available, and are determined by the input argument icorr:
• setting ${\mathbf{icorr}}=0$ sets
 $ρ= trace⁡Λ trace⁡Λ+ ,$
• setting ${\mathbf{icorr}}=1$ sets
 $ρ= trace⁡Λ trace⁡Λ+ ,$
• setting ${\mathbf{icorr}}=2$ sets $\rho =1$.
nag_rand_field_1d_user_setup (g05zm) finds a suitable positive semidefinite embedding matrix $B$ and outputs its size, m, and the square roots of its eigenvalues in lam. If approximation is used, information regarding the accuracy of the approximation is output. Note that only the first row (or column) of $B$ is actually formed and stored.

## References

Dietrich C R and Newsam G N (1997) Fast and exact simulation of stationary Gaussian processes through circulant embedding of the covariance matrix SIAM J. Sci. Comput. 18 1088–1107
Schlather M (1999) Introduction to positive definite functions and to unconditional simulation of random fields Technical Report ST 99–10 Lancaster University
Wood A T A and Chan G (1994) Simulation of stationary Gaussian processes in ${\left[0,1\right]}^{d}$ Journal of Computational and Graphical Statistics 3(4) 409–432

## Parameters

### Compulsory Input Parameters

1:     $\mathrm{ns}$int64int32nag_int scalar
The number of sample points to be generated in realizations of the random field.
Constraint: ${\mathbf{ns}}\ge 1$.
2:     $\mathrm{xmin}$ – double scalar
The lower bound for the interval over which the random field is to be simulated.
Constraint: ${\mathbf{xmin}}<{\mathbf{xmax}}$.
3:     $\mathrm{xmax}$ – double scalar
The upper bound for the interval over which the random field is to be simulated.
Constraint: ${\mathbf{xmin}}<{\mathbf{xmax}}$.
4:     $\mathrm{var}$ – double scalar
The multiplicative factor ${\sigma }^{2}$ of the variogram $\gamma \left(x\right)$.
Constraint: ${\mathbf{var}}\ge 0.0$.
5:     $\mathrm{cov1}$ – function handle or string containing name of m-file
cov1 must evaluate the variogram $\gamma \left(x\right)$, without the multiplicative factor ${\sigma }^{2}$, for all $x\ge 0$. The value returned in gamma is multiplied internally by var.
[gamma, user] = cov1(x, user)

Input Parameters

1:     $\mathrm{x}$ – double scalar
The value $x$ at which the variogram $\gamma \left(x\right)$ is to be evaluated.
2:     $\mathrm{user}$ – Any MATLAB object
cov1 is called from nag_rand_field_1d_user_setup (g05zm) with the object supplied to nag_rand_field_1d_user_setup (g05zm).

Output Parameters

1:     $\mathrm{gamma}$ – double scalar
The value of the variogram $\frac{\gamma \left(x\right)}{{\sigma }^{2}}$.
2:     $\mathrm{user}$ – Any MATLAB object

### Optional Input Parameters

1:     $\mathrm{maxm}$int64int32nag_int scalar
Default: ${2}^{3+⌈{\mathrm{log}}_{2}\left({\mathbf{ns}}-1\right)⌉}$
The maximum size of the circulant matrix to use. For example, if the embedding matrix is to be allowed to double in size three times before the approximation procedure is used, then choose ${\mathbf{maxm}}={2}^{k+2}$ where $k=1+⌈{\mathrm{log}}_{2}\left({\mathbf{ns}}-1\right)⌉$.
Constraint: ${\mathbf{maxm}}\ge {2}^{k}$, where $k$ is the smallest integer satisfying ${2}^{k}\ge 2\left({\mathbf{ns}}-1\right)$ .
2:     $\mathrm{pad}$int64int32nag_int scalar
Default: ${\mathbf{pad}}=1$
Determines whether the embedding matrix is padded with zeros, or padded with values of the variogram. The choice of padding may affect how big the embedding matrix must be in order to be positive semidefinite.
${\mathbf{pad}}=0$
The embedding matrix is padded with zeros.
${\mathbf{pad}}=1$
The embedding matrix is padded with values of the variogram.
Constraint: ${\mathbf{pad}}=0$ or $1$.
3:     $\mathrm{icorr}$int64int32nag_int scalar
Default: ${\mathbf{icorr}}=0$
Determines which approximation to implement if required, as described in Description.
Constraint: ${\mathbf{icorr}}=0$, $1$ or $2$.
4:     $\mathrm{user}$ – Any MATLAB object
user is not used by nag_rand_field_1d_user_setup (g05zm), but is passed to cov1. Note that for large objects it may be more efficient to use a global variable which is accessible from the m-files than to use user.

### Output Parameters

1:     $\mathrm{lam}\left({\mathbf{maxm}}\right)$ – double array
Contains the square roots of the eigenvalues of the embedding matrix.
2:     $\mathrm{xx}\left({\mathbf{ns}}\right)$ – double array
The points at which values of the random field will be output.
3:     $\mathrm{m}$int64int32nag_int scalar
The size of the embedding matrix.
4:     $\mathrm{approx}$int64int32nag_int scalar
Indicates whether approximation was used.
${\mathbf{approx}}=0$
No approximation was used.
${\mathbf{approx}}=1$
Approximation was used.
5:     $\mathrm{rho}$ – double scalar
Indicates the scaling of the covariance matrix. ${\mathbf{rho}}=1.0$ unless approximation was used with ${\mathbf{icorr}}=0$ or $1$.
6:     $\mathrm{icount}$int64int32nag_int scalar
Indicates the number of negative eigenvalues in the embedding matrix which have had to be set to zero.
7:     $\mathrm{eig}\left(3\right)$ – double array
Indicates information about the negative eigenvalues in the embedding matrix which have had to be set to zero. ${\mathbf{eig}}\left(1\right)$ contains the smallest eigenvalue, ${\mathbf{eig}}\left(2\right)$ contains the sum of the squares of the negative eigenvalues, and ${\mathbf{eig}}\left(3\right)$ contains the sum of the absolute values of the negative eigenvalues.
8:     $\mathrm{user}$ – Any MATLAB object
9:     $\mathrm{ifail}$int64int32nag_int scalar
${\mathbf{ifail}}={\mathbf{0}}$ unless the function detects an error (see Error Indicators and Warnings).

## Error Indicators and Warnings

Errors or warnings detected by the function:
${\mathbf{ifail}}=1$
Constraint: ${\mathbf{ns}}\ge 1$.
${\mathbf{ifail}}=2$
Constraint: ${\mathbf{xmin}}<{\mathbf{xmax}}$.
${\mathbf{ifail}}=4$
Constraint: the minimum calculated value for maxm is $_$.
Where the minimum calculated value is given by ${2}^{k}$, where $k$ is the smallest integer satisfying ${2}^{k}\ge 2\left({\mathbf{ns}}-1\right)$.
${\mathbf{ifail}}=5$
Constraint: ${\mathbf{var}}\ge 0.0$.
${\mathbf{ifail}}=7$
Constraint: ${\mathbf{pad}}=0$ or $1$.
${\mathbf{ifail}}=8$
Constraint: ${\mathbf{icorr}}=0$, $1$ or $2$.
${\mathbf{ifail}}=-99$
${\mathbf{ifail}}=-399$
Your licence key may have expired or may not have been installed correctly.
${\mathbf{ifail}}=-999$
Dynamic memory allocation failed.

## Accuracy

If on exit ${\mathbf{approx}}=1$, see the comments in Description regarding the quality of approximation; increase the value of maxm to attempt to avoid approximation.

None.

## Example

This example calls nag_rand_field_1d_user_setup (g05zm) to calculate the eigenvalues of the embedding matrix for $8$ sample points of a random field characterized by the symmetric stable variogram:
 $γx = σ2 exp - x′ ν ,$
where ${x}^{\prime }=\frac{x}{\ell }$, and $\ell$ and $\nu$ are parameters.
It should be noted that the symmetric stable variogram is one of the pre-defined variograms available in nag_rand_field_1d_predef_setup (g05zn). It is used here purely for illustrative purposes.
```function g05zm_example

fprintf('g05zm example results\n\n');

% Random field variance
var = 0.5;
% Domain endpoints
xmin = -1;
xmax = 1;
% Scaling factor rho = 1
icorr = int64(2);
% Number of sample points
ns = int64(8);

% Put covariance parameters in communication array
l    = 0.1;
nu   = 1.2;
user = [l, nu];

% Get square roots of the eigenvalues of the embedding matrix
[lam, xx, m, approx, rho, icount, eig, user, ifail] = ...
g05zm(...
ns, xmin, xmax, var, @cov1, 'icorr', icorr, 'user', user);

fprintf('\nSize of embedding matrix = %d\n\n', m);

% Display approximation information if approximation used
if approx == 1
fprintf('Approximation required\n\n');
fprintf('rho = %10.5f\n', rho);
fprintf('eig = %10.5f%10.5f%10.5f\n', eig(1:3));
fprintf('icount = %d\n', icount);
else
fprintf('Approximation not required\n\n');
end

% Display square roots of the eigenvalues of the embedding matrix
fprintf('Square roots of eigenvalues of embedding matrix:\n');
fprintf('%9.5f%9.5f%9.5f%9.5f\n',lam(1:m));

function [gam, user] = cov1(x, user)
if x == 0
gam = 1;
else
l  = user(1);
nu = user(2);
gam = exp(-(abs(x)/l)^nu);
end
```
```g05zm example results

Size of embedding matrix = 16

Approximation not required

Square roots of eigenvalues of embedding matrix:
0.74207  0.73932  0.73150  0.71991
0.70639  0.69304  0.68184  0.67442
0.67182  0.67442  0.68184  0.69304
0.70639  0.71991  0.73150  0.73932
```