One of the main reasons for investigating inaccurate solution of subdomain problems is to reduce the excessive computing times observed in the multi-block incompressible Navier-Stokes solver [10], and to bring them closer to single-block solution time. This also gives better prospects for parallel computing.
Table 2 lists the number of iterations and computing times for single-block solution of the Poisseuille flow on an grid and the single-block cylinder grid with 10800 cells from Figure 5. The results are given for GMRES subdomain solution using RILU() postconditioning.
Table 2: Single-block solution using GMRES with RILU() postconditioning
Table 3 shows a comparison of single-block solution and multi-block solution of the momentum equations for different decompositions of the domain.
Table 3: Results with various decompositions into subdomains for the momentum
equations, multiplicative algorithm.
Table 4 shows a comparison of single-block solution and multi-block solution of the pressure equations for different decompositions of the domain. It is important to note that with the optimized restarted GCR method with RIBLU() postconditioning (the bottom row in Table 4), the maximum dimension of the Krylov space had to be increased to 40 to obtain convergence within 200 iterations/time step of the pressure equation.
Table 4: Results with various decompositions into subdomains for the pressure
equations, multiplicative algorithm.
Comparing Tables 2 and 3, we see that for small numbers of subdomains, computing time for the momentum equations can be reduced to below that of single-block solution for method II. However, for larger numbers of blocks this is not the case, which is possibly due to superlinear convergence of the subdomain solvers, see further on. Method III is faster than method II for the cylinder problem, but not for the Poisseuille problem. An explanation is that the time step for the cylinder problem () is much smaller than for the Poisseuille flow () which increases the diagonal of the momentum matrix considerably and improves convergence.
Comparing Tables 2 and 4, we see that for small numbers of subdomains, the computing time with method II with inaccurate subdomain solution is still a factor 2-3 larger than with single block solution. Also, for the Poisseuille flow in Table 4 inaccurate subdomain solution does not always provide a speedup. Method III leads to growing iteration count and computing times for increasing numbers of blocks. The reason for the bad performance of the RIBLU() postconditioner is that for close to , it performs much worse with respect to single domain RILU() than for , see Section 6.3. We also see that the Jackson & Robinson truncation strategy is quite effective in reducing iteration count compared to restarted GCR. The optimizations in GCR do not outweigh this increase in iteration count.
A possible reason for the modest reduction in computing time by method II for larger numbers of blocks is the following. For larger numbers of blocks the subdomains are smaller and therefore superlinear convergence of the subdomain GMRES solver can occur earlier. For example, in case of superlinear convergence, lowering the subdomain solution accuracy from to might only save a single subdomain GMRES iteration (out of say 6 iterations). The subdomain solution accuracy therefore only gives a small reduction of work needed to solve subdomains, but it may still cause a significant increase in the number of GCR iterations in the outer loop.
The results in Table 3 and Table 4 confirm the remark in Section 3.4 that the constant C in Theorem 2 does not depend on the number of blocks for the multiplicative algorithm: the ratio of the number of iterations needed with and does not increase as the domain is decomposed into more subdomains.