Table 1 lists the iteration counts and computation
times () for the three problems.
The global grid consists of
grid cells and it
was divided into
subdomains.
The first two rows concern the algorithm for accurate solution of
subdomains. The iteration counts for extremely accurate subdomain
solution with tolerance
and
are the same.
From this we conclude that the subdomain solution accuracy with
tolerance
is accurate enough for this algorithm and test problem.
Table 1: Lowering the accuracy of subdomain solution. I is the algorithm
for accurate solution of subdomains, II for inaccurate solution
using GMRES, III is for IBLUD
The other rows are for the GCR based algorithm. Note that
in the special case of IBLUD postconditioning we have also listed
the more efficient GMRES acceleration.
As the subdomain solution accuracy is lowered the number of iterations
increases only slightly. Because of this the computing time drops
significantly for lower subdomain accuracies. Only for the special case of
IBLUD postconditioning, the number of iterations is significantly higher.
This rise in number of iterations does however not outweigh the reduction
in work by computing only
instead of doing GMRES. The computing
time with IBLUD postconditioning is by far the lowest. Note that the
amount of additional work in GCR acceleration compared to GMRES
acceleration can be significant. The only difference with GMRES is
that GCR needs some additional vector updates and requires some
more memory. A more efficient implementation, see [25], will
certainly be considered for the incompressible Navier-Stokes equations.
Mathematically,
the algorithms are the same for IBLUD postconditioning.
An important observation is that the GCR algorithm for inaccurate
solution of subdomains requires fewer iterations than the algorithm
for accurate subdomain solution using the same subdomain solution accuracy.
To show that this difference is not caused by the different definition of
the residual, we compare the computed solutions of the Poisson equation
with the exact solution
.
Table 2 shows
the maximum norms and 2-norms of the difference with the exact solution.
The 2-norm is defined as .
Clearly, the solutions obtained with both algorithms have approximately
the same accuracy. Only for large subdomain accuracy (giving very
large computing time) the algorithm for accurate solution of subdomains
gives a more accurate solution.
This verifies our earlier claim that the solution obtained with
GMRES acceleration used with accurate solution
of subdomains is sensitive to the subdomain solution accuracy.
This sensitivity is not present with the GCR based acceleration procedure
used with inaccurate solution of subdomains.
Table 2: Accuracy of the solution to the Poisson problem using
(I) the algorithm for accurate solution of subdomains,
(II) the algorithm for inaccurate solution of subdomains using
GMRES and
(III) for IBLUD
We see that with inaccurate solution of subdomains by ILUD (the IBLUD postconditioning), we can reduce computing time is reduced by a factor 2-6 compared to accurate solution of subdomains. In [6], a comparison between accurate and inaccurate solution was made based on the number of iterations only. This led to the conclusion that the simple accelerated Schwarz algorithm using accurate subdomain solution was a competitive method compared to single-domain ILU preconditioned GMRES. The basis of analysis in the present paper is that we do less work per iteration and therefore we allow some more iterations. Therefore, although the number of iterations is machine and implementation independent it should not be used as a basis for comparison. Computing time is more suitable to compare algorithms but may give different results depending on the implementation and machine architecture.
The most impressive reductions of computing time are obtained for the Poisson equation, which is also the most expensive part of the multi-block Navier-Stokes problem, see [4]. For the difficult recirculating flow problem we obtain a reduction of a factor of 2. The simpler uniform flow problem shows a reduction of a factor of 4.