d03ub:: Partial Differential Equations (NAG Toolbox)

nag_pde_3d_ellip_fd_iter (d03ub) determines the approximate change vector

s

corresponding to a given residual

r

, i.e., it determines an approximate solution to a set of equations

M s = r

(2)

where

M

is a square

(n_{1} \times n_{2} \times n_{3})

(n_{1} \times n_{2} \times n_{3})

matrix and

r

is a known vector of length

(n_{1} \times n_{2} \times n_{3})

. The set of equations (2) must be of seven-diagonal form

a_{i j k} s_{i j, k - 1} + b_{i j k} s_{i, j - 1, k} + c_{i j k} s_{i - 1, j k} + d_{i j k} s_{i j k} + e_{i j k} s_{i + 1, j k} + f_{i j k} s_{i, j + 1, k} + g_{i j k} s_{i j, k + 1} = r_{i j k}

for

i = 1, 2, \dots, n_{1}

j = 1, 2, \dots, n_{2}

and

k = 1, 2, \dots, n_{3}

, provided that

d_{i j k} \neq 0.0

. Indeed, if

d_{i j k} = 0.0

, then the equation is assumed to be

s_{i j k} = r_{i j k} .

The calling program supplies the current residual

r

at each iteration and the coefficients of the seven-point molecule system of equations on which the update procedure is based. The function performs one iteration, using the approximate

L U

factorization of the Strongly Implicit Procedure with the necessary acceleration argument adjustment, to calculate the approximate solution

s

of the set of equations (2). The change

s

overwrites the residual array for return to the calling program. The calling program must combine this change stored in

r

with the old approximation to obtain the new approximate solution for

t

. It must then recalculate the residuals and, if the accuracy requirements have not been satisfied, commence the next iterative cycle.

Clearly there is no requirement that the iterative update matrix passed in the form of the seven-diagonal element arrays a, b, c, d, e, f and g is the same as that used to calculate the residuals, and therefore the one governing the problem. However, the convergence may be impaired if they are not equal. Indeed, if the system of equations (1) is not precisely of the seven-diagonal form illustrated above but has a few additional terms, then the methods of deferred or defect correction can be employed. The residual is calculated by the calling program using the full system of equations, but the update formula is based on a seven-diagonal system (2) of the form given above. For example, the solution of a system of eleven-diagonal equations each involving the combination of terms with

t_{i \pm 1, j \pm 1, k}, t_{i \pm 1, j, k}, t_{i, j \pm 1, k}, t_{i, j, k \pm 1}

and

t_{i j k}

could use the seven-diagonal coefficients on which to base the update, provided these incorporate the major features of the equations.

For problems in which the solution is not unique, in the sense that an arbitrary constant can be added to the solution (for example Poisson's equation with all Neumann boundary conditions), the calling program should subtract a typical nodal value from the whole solution

t

at every iteration to keep rounding errors to a minimum for those cases when convergence is slow. For such problems there is generally an associated compatibility condition. For the example mentioned this compatibility condition equates the total net source within the region (i.e., the source integrated over the region) with the total net outflow across the boundaries defined by the Neumann conditions (i.e., the normal derivative integrated along the whole boundary). It is very important that the algebraic equations derived to model such a problem accurately implement the compatibility condition. If they do not, a net source or sink is very likely to be represented by the set of algebraic equations and no steady-state solution of the equations exists.

References

Parameters

Compulsory Input Parameters

Optional Input Parameters

Output Parameters

Error Indicators and Warnings

Accuracy

The improvement in accuracy for each iteration, i.e., on each call, depends on the size of the system and on the condition of the update matrix characterised by the seven-diagonal coefficient arrays. The ultimate accuracy obtainable depends on the above factors and on the machine precision. However, since nag_pde_3d_ellip_fd_iter (d03ub) works with residuals and the update vector, the calling program can, in most cases where at each iteration all the residuals are usually of about the same size, calculate the residuals from extended precision values of the function, source term and equation coefficients if greater accuracy is required. The rate of convergence obtained with the Strongly Implicit Procedure is not always smooth because of the cyclic use of nine acceleration arguments. The convergence may become slow with very large problems. The final accuracy obtained can be judged approximately from the rate of convergence determined from the changes to the dependent variable

t

and in particular the change on the last iteration.

Further Comments

Example

This example solves Laplace's equation in a rectangular box with a non-uniform grid spacing in the

x

y

and

z

coordinate directions and with Dirichlet boundary conditions specifying the function on the surfaces of the box equal to

e^{(1.0 + x) / y (n_{2})} \times \cos (\sqrt{2} y / y (n_{2})) \times e^{(- 1.0 - z) / y (n_{2})} .

Note that this is the same problem as that solved in the example for nag_pde_3d_ellip_fd (d03ec). The differences in the maximum residuals obtained at each iteration between the two test runs are explained by the fact that in nag_pde_3d_ellip_fd (d03ec) the residual at each node is normalized by dividing by the central coefficient, whereas this normalization has not been used in the example program for nag_pde_3d_ellip_fd_iter (d03ub).

function d03ub_example


fprintf('d03ub example results\n\n');

n = [16 20 24];
b = [ 6 10 15];
delta = b./(n.*(n-1));
for j = 1:n(1)
  x(j) = (j*(j-1))*delta(1);
end
for j = 1:n(2)
  y(j) = (j*(j-1))*delta(2);
end
for j = 1:n(3)
  z(j) = (j*(j-1))*delta(3);
end

a = zeros(n);
b = a; c = a; e = a; f = a; g = a;
q = a; t = a;

aparam = 1;
it     = int64(1);

% Set up difference equation coefficients, source terms and
% initial approximation

% Specification for internal nodes
ii = 2:n(1)-1;
jj = 2:n(2)-1;
kk = 2:n(3)-1;
for k = kk
  a(ii,jj,k) = 2/((z(k)-z(k-1))*(z(k+1)-z(k-1)));
  g(ii,jj,k) = 2/((z(k+1)-z(k))*(z(k+1)-z(k-1)));
end
for j = jj
  b(ii,j,kk) = 2/((y(j)-y(j-1))*(y(j+1)-y(j-1)));
  f(ii,j,kk) = 2/((y(j+1)-y(j))*(y(j+1)-y(j-1)));
end
for i = ii
  c(i,jj,kk) = 2/((x(i)-x(i-1))*(x(i+1)-x(i-1)));
  e(i,jj,kk) = 2/((x(i+1)-x(i))*(x(i+1)-x(i-1)));
end
d = -a - b - c - e - f - g;

% Specification for boundary nodes
ex1 = exp((x(1)+1)/y(n(2)));
ex2 = exp((x(n(1))+1)/y(n(2)));
for j = 1:n(2);
  cy1 = cos(sqrt(2)*y(j)/y(n(2)));
  q(1,   j,:) = ex1*cy1*exp((-z(:)-1)/y(n(2)));;
  q(n(1),j,:) = ex2*cy1*exp((-z(:)-1)/y(n(2)));;
end
cy1 = cos(sqrt(2)*y(1)/y(n(2)));
cy2 = cos(sqrt(2)*y(n(2))/y(n(2)));
for i = 1:n(1);
  ex1 = exp((x(i)+1)/y(n(2)));
  q(i,1,   :) = ex1*cy1*exp((-z(:)-1)/y(n(2)));;
  q(i,n(2),:) = ex1*cy2*exp((-z(:)-1)/y(n(2)));;
end
ez1 = exp((-z(1)-1)/y(n(2)));
ez2 = exp((-z(n(3))-1)/y(n(2)));
for i = 1:n(1);
  ex1 = exp((x(i)+1)/y(n(2)));
  q(i,:,1)    = ex1*cos(sqrt(2)*y(:)/y(n(2)))*ez1;
  q(i,:,n(3)) = ex1*cos(sqrt(2)*y(:)/y(n(2)))*ez2;
end

nits = 10;
n = int64(n);

% Iterative loop
for it = int64(1:nits)
  [r] = resid(n(1),n(2),n(3),a,b,c,d,e,f,g,q,t);
  [r, ifail] =  d03ub( ...
                       n(1), n(2), n(3), a, b, c, d, e, f, g, aparam, it, r);
  t = t + r;
end

for i = 1:n(3)
  nrms(i) = norm(r(:,:,i));
end
  
fprintf('Final residual after %d iterations = %10.1e\n',nits,norm(nrms));
fprintf('\nApproximate solution is:\n\n');
for j=1:4:n(3)
  fprintf('            z = %7.4f\n    x/y',z(j));
  fprintf('%8.2f',x(1:4:n(1)));
  for i=1:4:n(2)
    fprintf('\n%8.2f',y(i));
    fprintf('%8.3f',t(1:4:n(1),i,j));
  end
  fprintf('\n\n');
end

fig1 = figure;
hold on
mesh(y,x,t(:,:,1));
mesh(y,x,t(:,:,9));
mesh(y,x,t(:,:,17));
mesh(y,x,t(:,:,24));
z1 = sprintf('z = %5.2f',z(1));
z2 = sprintf('z = %5.2f',z(9));
z3 = sprintf('z = %5.2f',z(17));
z4 = sprintf('z = %5.2f',z(24));
title({'Solution of 3D Laplace''s Equation in a Box',
       'Solutions in the xy-plane for various z values'});
xlabel('y');
ylabel('x');
zlabel('U(x,y,z=Z)');
legend(z1,z2,z3,z4);
view(-27,16);
hold off



function [r] = resid(n1,n2,n3,a,b,c,d,e,f,g,q,t)
r  = zeros(n1, n2, n3);
for k = 1:n3
  for j = 1:n2
    for i = 1:n1
      if (d(i,j,k) ~= 0)
        % Seven point molecule formula
        r(i,j,k) = q(i,j,k) - a(i,j,k)*t(i,j,k-1) - b(i,j,k)*t(i,j-1,k) - ...
                   c(i,j,k)*t(i-1,j,k) - d(i,j,k)*t(i,j,k) - ...
                   e(i,j,k)*t(i+1,j,k) - f(i,j,k)*t(i,j+1,k) - ...
                   g(i,j,k)*t(i,j,k+1);
      else
        % Explicit equation
        r(i,j,k) = q(i,j,k) - t(i,j,k);
      end
    end
  end
end

d03ub example results

Final residual after 10 iterations =    1.4e-04

Approximate solution is:

            z =  0.0000
    x/y    0.00    0.50    1.80    3.90
    0.00   1.000   1.051   1.197   1.477
    0.53   0.997   1.048   1.194   1.473
    1.89   0.964   1.014   1.154   1.424
    4.11   0.836   0.879   1.001   1.235
    7.16   0.530   0.557   0.634   0.783

            z =  0.5435
    x/y    0.00    0.50    1.80    3.90
    0.00   0.947   0.996   1.134   1.399
    0.53   0.944   0.993   1.131   1.395
    1.89   0.913   0.960   1.093   1.349
    4.11   0.792   0.833   0.948   1.170
    7.16   0.502   0.528   0.601   0.741

            z =  1.9565
    x/y    0.00    0.50    1.80    3.90
    0.00   0.822   0.864   0.984   1.215
    0.53   0.820   0.862   0.982   1.211
    1.89   0.793   0.834   0.949   1.171
    4.11   0.688   0.723   0.823   1.016
    7.16   0.436   0.458   0.522   0.644

            z =  4.2391
    x/y    0.00    0.50    1.80    3.90
    0.00   0.654   0.688   0.784   0.967
    0.53   0.653   0.686   0.781   0.964
    1.89   0.631   0.664   0.756   0.932
    4.11   0.547   0.575   0.655   0.808
    7.16   0.347   0.365   0.415   0.512

            z =  7.3913
    x/y    0.00    0.50    1.80    3.90
    0.00   0.478   0.502   0.572   0.705
    0.53   0.476   0.501   0.570   0.703
    1.89   0.460   0.484   0.551   0.680
    4.11   0.399   0.420   0.478   0.590
    7.16   0.253   0.266   0.303   0.374

            z = 11.4130
    x/y    0.00    0.50    1.80    3.90
    0.00   0.319   0.336   0.382   0.472
    0.53   0.319   0.335   0.381   0.470
    1.89   0.308   0.324   0.369   0.455
    4.11   0.267   0.281   0.320   0.395
    7.16   0.169   0.178   0.203   0.250

On entry,	$n1 < 2$ ,
or	$n2 < 2$ ,
or	$n3 < 2$ .

On entry,	$lda < n1$ ,
or	$sda < n2$ .

NAG Toolbox: nag_pde_3d_ellip_fd_iter (d03ub)

▸▿ Contents

Purpose

Syntax

Description