- Research
- Open Access
- Published:
Parallel algorithm for convection–diffusion system based on least-squares procedure
SpringerPlus volume 5, Article number: 1690 (2016)
Abstract
Combining subspace correction method with least-squares finite element procedure, we construct a new overlapping domain decomposition parallel algorithm for solving the first-order time-dependent convection–diffusion system. This algorithm is fully parallel. We analyze the convergence of approximate solution, and study the dependence of the convergent rate on the spacial mesh size, time increment, iteration number and sub-domains overlapping degree. Both theoretical analysis and numerical results suggest that only one or two iterations are needed to reach to given accuracy at each time step.
Background
In this paper, we consider the following initial-boundary value problem for time-dependent convection–diffusion system:
where \(\varOmega \) is an open bounded domain \(\mathbf {R}^d\) \((1\le d\le 3)\), with a Lipschitz continuous boundary \(\varGamma =\varGamma _D\cup \varGamma _N\); and \(\nu \) is the unit vector normal to \(\varGamma _N\); the flow field \(\mathbf {b}=(b_1,b_2,\ldots ,b_d)^T\); the source term \(q=q(x,t)\ge 0\) and exterior flow function \(f=f(x,t)\) are some given functions; the coefficient \(c=c(x)\) is positive function and the diffusion coefficient matrix \(A=(a(i,j))_{d\times d}\) is a symmetric uniformly positive definite matrix, i.e., there exist some positive constants \(c_*\) and \(a_*\) such that
This type of partial differential equation arises in many important fields, such as the mathematical modeling of aerodynamics, porous medium fluid flow, fluid dynamics (e.g. Euler equations, Navier-Stokes equations), meteorology, and semiconductor devices. Many numerical methods have been established to simulate this problem, for example, finite element and finite difference method, Eulerian–Lagrangian localized adjoint method Celia et al. (1990). The streamline diffusion finite element method Hughes and Brooks (1979), least-squares mixed element methods Yang (1999, 2000, 2002), Zhang and Guo (2012), Zhang et al. (2011) and Zhang (2009), and so on. Generally, these numerical procedures result in a large scale of algebraic system, so it is very important and useful to develop effective parallel algorithms both in engineering applications and mathematical analysis.
Recently domain decomposition parallel computation has become a powerful tool for solving a large scale system of partial differential equations. A lot of work has been done on domain decomposition parallel algorithms, for example, see Beilina (2016), Bramble et al. (1990, 1991), Cai (1989), Dolean et al. (2008, 2015), Dryja and Widlund (1987), Lu et al. (1991), Ma et al. (2009), Tarek (2008), Xu (1989, 1992, 2001) and Yang (2010). But many parallel algorithms based on overlapping domain decomposition are iterative algorithms so that many iteration steps are needed to reach given accuracy, which leads to much more global amount of computational work. On the basis of the idea of the parallel subspace correction method proposed by Xu (1989, 1992, 2001), the first author of this paper and Yang established a new parallel algorithm combined with characteristic finite element scheme, finite difference scheme and least-square scheme for one dimensional convection–diffusion problem in Zhang et al. (2011) and Zhang and Yang (2011a, b), where both theoretical analysis and numerical results suggest that when overlapping degree has a positive lower bound independent of mesh size, only one or two iterative times is needed to reach the optimal convergence precision at each time level.
In this paper, using the same technique as in Zhang et al. (2011), Zhang and Yang (2011) and Zhang and Yang (2011), we establish a new parallel algorithm for solving the convection–diffusion system. Here the arbitrary dimensional problem is considered, unlike in Zhang and Yang (2011) only one dimensional model was studied. And the different least-squares finite element scheme from the one in Zhang and Yang (2011) is used to obtain the optimal \(L^2\)-norm error estimate. The partition of unity is applied to distribute the corrections in the overlapping domains reasonably in this parallel algotithm. We analyze the convergence of approximate solution, and study the dependence of the convergent rate on the spacial mesh size, time increment, iteration number and sub-domains overlapping degree. Both theoretical analysis and numerical experiments indicate the full parallelization of the algorithms and very good approximate property.
Parallel algorithm
Throughout this paper we use usual definitions and notations of Sobolev spaces as in Adams (1975). Let \(W^{k,p}\,(\varOmega )\,(k\ge 0,\,1\le p\le \infty )\) be Sobolev spaces defined on \(\varOmega \) with usual norms \(\Vert \cdot \Vert _{W^{k,p}\,(\varOmega )}\) and \(H^k(\varOmega )=W^{k,2}\,(\varOmega )\). Define inner products as follows:
Introduce the spaces \(\mathcal {W}=\{\omega \in [L^2(\varOmega )]^d;\,\nabla \cdot {\omega }\in L^2(\varOmega ),\, \omega \cdot \nu =0\,\,\text {on} \,\,\varGamma _N\}\) and \(\mathcal {V}=\{v\in H^1(\varOmega );\,\,v=0\,\,\text {on}\,\,\varGamma _D\}\). Make a time partition \(0=t_0<t_1<\cdots<t_{M-1}<t_M=T\) and set \(\tau _n=t_n-t_{n-1}\) and \(\tau =\max \nolimits _{1\le n\le M}\tau _n\). Let \(w^n(x)=w(x,t_n)\). By use of the difference technique with first-order accuracy to discretize the first-order system (1), we can rewrite the system (1) as follows [see Yang (1999)]
where
To construct parallel subspace correction algorithm, we firstly make a domain decomposition. Assume that \(\{\varOmega '_i\}^N_{i=1}\) is a non-overlapping domain decomposition of \(\varOmega \). In order to obtain an overlapping domain decomposition, we extend each subregion \(\varOmega '_i\) to a larger region \(\varOmega _i\) such that \(\varOmega '_i\subset \varOmega _i\subset \varOmega \) and \( dist(\partial \varOmega '_i \backslash \partial \varOmega ,\partial \varOmega _i \backslash \partial \varOmega )\ge H\) for each \(1 \le i \le N\), where \(H >0\) is called as overlapping degree. Let \(\mathcal {T}_{h_u}\) and \(\mathcal {T}_{h_\sigma }\) be two families of quasi-regular finite element partitions of the domain \(\varOmega \) such that the elements in the partitions have the diameters bounded by \(h_u\) and \(h_\sigma \), respectively. Assume that \(\mathcal {T}_{h_u,i}=\mathcal {T}_{h_u}\bigcap \varOmega _i\) and \(\mathcal {T}_{h_\sigma ,i}=\mathcal {T}_{h_\sigma } \bigcap \varOmega _i\) just are one finite element partition of \(\varOmega _i\) for \( 1 \le i \le N\). Let \( {\mathcal W}_{h_\sigma }\subset \mathcal {W}\), and \({\mathcal V}_{h_u}\subset \mathcal {V}\) be piecewise r-degree and k-degree polynomial spaces defined on the partitions \(\mathcal {T}_{h_\sigma }\) and \(\mathcal {T}_{h_u}\), respectively.
Denote by \(\tilde{A}\) the inverse of A and define a bilinear form
Based on (3) and Yang (1999), we get the standard least-squares finite element procedure:
Least-squares scheme Given an initial approximation \((\varrho ^0_h,w^0_h)\in {\mathcal W}_{h_\sigma }\times {\mathcal V}_{h_u}\). For \(n=1,2,\ldots ,M\), seek \((\varrho ^n_h,w^n_h)\in {\mathcal W}_{h_\sigma }\times {\mathcal V}_{h_u}\) such that
In the following part of this section, we propose the parallel domain decomposition algorithm of the system (4). Define finite element sub-spaces:
and
It is clear that
and
It is easily seen that there exists a finite open covering family \(\{O^i\}^N_{i=1}\) of the domain \(\varOmega \) such that \(O^i\cap \varOmega \subset \varOmega _i\). We know that there exists a partition of unity \(\{\varphi _i\}^N_{i=1}\) (see Toselli and Widlund (2005), Lemma 3.4) such that
Let \(\varphi ^i_{h_u}\) and \(\varphi ^i_{h_\sigma }\) be the nodal piecewise linear interpolation of \(\varphi _i\) on the finite element meshes \({\mathcal T}_{h_u}\) and \({\mathcal T}_{h_\sigma }\), and \({\mathcal I}_{h_u}\) and \({\mathcal I}_{h_\sigma }\) be the interpolating operators on \(\mathcal {V}_{h_u}\) and \(\mathcal {W}_{h_\sigma }\).
Based on (4), we formulate the parallel subspace correction algorithm.
Parallel algorithm Let m denote the iteration number at each time step. Give an initial approximation \((\sigma ^0_h,u^{0}_h)=(\varrho ^0_h,w^0_h) \in {\mathcal W}_{h_\sigma }\times {\mathcal V}_{h_u}\). For \(n=1,\ 2,\ \ldots , M\), seek \((\sigma ^n_h,u^n_h)\in {\mathcal W}_{h_\sigma }\times {\mathcal V}_{h_u}\) by four steps:
Step 1. Set \((\tilde{\sigma }^n_{0},\tilde{u}^n_{0})=(\sigma ^{n-1}_h,u^{n-1}_h)\) and \(j:=1\).
Step 2. For \(i=1,2,\ldots ,N,\) seek \((\varepsilon ^i_{j},e^i_{j})\in {\mathcal W}^i_{h_\sigma }\times {\mathcal V}^i_{h_u}\), in parallel, such that
Step 3. Set corrections
Step 4. If \( j < m \), then set \(j:=j+1\) and return the step 2; or set
and then return back to the first step to start iteration at the next time step.
Some lemmas and main result
In the following sections, we denote by K and \(\delta \) some general constants and small positive constants independent of the mesh parameters H, \(h_\sigma \) \(h_u\) and \(\tau \), which may be different at different occurrences. Let
In order to analyze the convergence of parallel algorithm, we introduce projection operators \(P^i_{h_\sigma }{:}\,{\mathcal W}_{h_\sigma }\rightarrow {\mathcal W}^i_{h_\sigma }\) and \(Q^i_{h_u}:\,{\mathcal V}_{h_u}\rightarrow {\mathcal V}^i_{h_u}\) such that
Now, we give some important lemmas which are used to analyze the convergence of parallel algorithm.
We assume that finite element spaces \(\mathcal {W}_{h_\sigma }\) and \(\mathcal {V}_{h_u}\) have the inverse property and approximate properties [see Ciarlet (1978)] that there exist some integers \(r,\,r_1,\,k>0\), such that, for \(1\le q\le \infty \) and \(\forall \,\omega \in H(\text {div};\varOmega )\cap [W^{r+1,q}(\varOmega )]^d\),
Based on Theorem 3.3 in Yang (1999), the following result can be read:
Lemma 1
Let \((\sigma ,u)\) and \((\varrho ^n_h,w^n_h)\) be the solutions of (1) and least-squares scheme, respectively. Then there holds the a priori error estimate
Lemma 2
[See Yang (2001)] For any function \(\varphi \in W^{1,\infty }(\varOmega )\) and \(\omega _h\in \mathcal {W}_{h_\sigma }\), we have the following estimate
where \(d=1,2,3.\)
Lemma 3
For \(1\le i\le N\), we have
Proof
Using Lemma 2, we know that
This is the first inequality of (8).
In addition, by using the technique of Theorem 3.1 in Yang (2001), we can easily obtain
That is the second inequality of (8). The proof of Lemma 3 is completed. \(\square \)
Lemma 4
The following estimate
holds for each \(( \psi ,w)\) and \(( \omega ,v)\) in \({\mathcal W}_{h_\sigma } \times {\mathcal V}_{h_u}\).
Proof
It is easily seen that
and
and
Hence we have
Noting that
we have
and
Substituting these estimates into (10) leads to (9). This ends the proof of Lemma 4. \(\square \)
For parallel algorithm , we have the following convergence result:
Theorem 1
Let \((\sigma ,u)\) and \((\sigma ^n_h,u^n_h)\) are the solutions of the system (1) and parallel algorithm, respectively. If \(h^{2m}=O(\tau )\), then there holds the following a priori error estimate
where \(h=\max (h_\sigma ,h_u)\).
Proof of Theorem 1
It is easily seen that parallel algorithm is also equivalent to use an iteration with initial values \(({\sigma }^{n-1}_h,u^{n-1}_h)\) to solve the following equation: \((\hat{\sigma }^{n}_h,\hat{u}^n_h) \in {\mathcal W}_{h_\sigma } \times {\mathcal V}_{h_u}\) such that for any \((\omega _h,v_h) \in {\mathcal W}_{h_\sigma } \times {\mathcal V}_{h_u}\)
From (12) we have
Let \(\theta ^n=u^n_h-w^n_h,\) \(\rho ^n=w^n_h-u^n,\) \(\pi ^n=\sigma ^n_h-\varrho ^n_h\) and \(\eta ^n=\varrho ^n_h-\sigma ^n.\) Subtracting (4) from (13), we can get
Lemma 5
For parallel algorithm, we have the estimate
Proof
From (5), we have
In addition, from parallel algorithm we can obtain the following equation
Taking \((\omega _h,v_h)=(\tilde{\sigma }^n_j-\hat{\sigma }^{n}_h,\tilde{u}^n_j-\hat{u}^n_h)\) in (17) and using Lemma 4, we have
Thus, we have
That is the inequality (15). This ends the proof of Lemma 5. \(\square \)
Hence, we need to estimate the bounds of \({\sigma }^{n-1}_h-\hat{\sigma }^{n}_h\) and \(u^{n-1}_h-\hat{u}^n_h\).
Lemma 6
For parallel algorithm, we have the following estimate
Proof
From (14) we have
Taking \((\omega _h,v_h)=(\hat{\sigma }^{n}_h-{\sigma }^{n-1}_h,\hat{u}^n_h-u^{n-1}_h)\) in (21) and using the inequality \(ab\le \frac{1}{\delta }a^2+\delta b^2\), we can obtain
Hence, when we choose sufficiently small \(\delta \), we can obtain the estimate (20). This ends the proof of Lemma 6. \(\square \)
Finally, we prove Theorem 1.
Proof
Let \((\omega _h,v_h)=(\pi ^n,\theta ^n-\theta ^{n-1})\) in (14), we have
Since
we have
Next, we estimate the terms on the right-hand side of the error equation (22).
It is clear that
and
Substituting the above estimates into (22) and then summing it up from 1 to n, we get
Applying a known inequality
and discrete Gronwall’s lemma to (23), we derive that
Using Lemma 1, we can obtain the estimate (11). The proof of Theorem 1 is complete. \(\square \)
Numerical results
As in Zhang and Yang (2011), we first consider the one dimensional convection–diffusion problem:
We divide the domain [0, 1] into three sub-domains: \(\varOmega _1=\left[0,\frac{1}{3}+\frac{H}{2}\right],\) \(\varOmega _2=\left[\frac{1}{3}-\frac{H}{2},\frac{2 }{3}+\frac{H}{2}\right]\) and \(\varOmega _3=\left[\frac{2 }{3}-\frac{H}{2},1\right]\), where H is the overlapping degree (see Fig. 1).
We use piecewise linear polynomial spaces, set \(h_u=h_\sigma =h\) and take the linear unit decomposition functions as in Zhang et al. (2011). We define the \(L^2\)-norm error as follows:
and the \(L^\infty \)-norm error
Experiment I In this experiment, the exact solution is chosen as \(u=e^t\sin ^2\pi x\). Set \(T=1\), and \(b=1\). For different parameters a, h, \(\tau \) and the iterative number m at each time step, we give \(L^2\)-norm errors and the \(L^\infty \)-norm errors in Tables 1, 2 and 3. These numerical results suggest that we can get a good result for convection–diffusion problem using parallel algorithm , even iterating only one or two cycle at each time step. Moreover, these numerical results also imply that the errors caused by decomposing domain decrease as the discretization parameters h and \(\tau \) decrease and increase as the overlapping degree H becomes small, which are coincided with our theoretical result.
Experiment II As in Zhang and Yang (2011), we select the right-hand side function with complex structure and the initial condition as follows:
Choosing \(H=1/12,\ h=\tau =1/48\), \(b=1,\) and \(a=1e{-}4,\) we observe numerical results at different time (see Figs. 2, 3). We use “ * ” to denote \(u_h\) and \( \sigma _h\), the values of the parallel algorithm and use “ - ” to denote \(w_h\) and \(\varrho _h\), the values of least-squares algorithm. These figures clearly show that \(u_h,\ \sigma _h\) approximate to \(w_h\) and \(\varrho _h\) at different time, respectively, which is coincided with our theoretical analysis.
Next, we consider the two dimensional convection–diffusion problem:
where \(\varOmega =[0,1]\times [0,1],\,\mathbf {A}=a\mathbf E \), \(\mathbf E \) is the unit matrix, and \(\mathbf b =(1,1)^T\). We divide \(\varOmega \) into four sub-domains: \(\varOmega _1=[0,0.6]\times [0,0.6]\), \(\varOmega _2=[0.4,1]\times [0,0.6]\), \(\varOmega _3=[0,0.6]\times [0.4,1]\), \(\varOmega _4=[0.4,1]\times [0.4,1]\), see Fig. 4.
In this section, we use piecewise linear polynomial spaces. And We take the linear unit decomposition functions \(\{\varphi _i\}^4_{i=1}\) as follows:
Experiment III Here we still select the same right-hand side function with complex structure and the initial condition as in Zhang and Yang (2011),
Set \(H=0.2\), \(h=\tau =1/40\), and \(a=1e{-}2\), \( T=1.0\), \( m=1\). We can get Figs. 5, 6 and 7. These results suggest that the values \(u_h\), \( \varvec{\sigma }_h=(\sigma ^1_h,\sigma ^2_h)\) by parallel algorithm approximate to \(w_h\) and the values \(\rho _h=(\rho ^1_h,\rho ^2_h)\) by least-squares scheme respectively, which implies that our method is valid for two-dimensional problem.
Conclusions
In this paper, combined subspace correction method with least-squares mixed element procedure, a new class of parallel domain decomposition algorithm is proposed to solve convection–diffusion problem. The convergence of approximate solution, and the dependence of the convergent rate on the spacial mesh size, time increment, iteration number and sub-domains overlapping degree are studied. Both theoretical analysis and numerical experiments indicate the full parallelization of the algorithms and very good approximate property.
In fact, though we consider the convection–diffusion problem in this paper, we can extend our method to other complex problems, e.g. saltwater intrusion problem, aerodynamic problems, nuclear waste disposal, etc., which are our future work.
References
Adams RA (1975) Sobolev spaces. Academic, New York
Beilina L (2016) Domain decomposition finite element/finite difference method for the conductivity reconstruction in a hyperbolic equation. Communications in Nonlinear Science and Numerical Simulation, Elsevier
Bramble JH, Pasciak JE, Xu J (1990) Parallel multilevel preconditioners. Math Comput 55:1–22
Bramble JH, Pasciak JE, Xu J (1991) Convergence estimates for product iterative methods with application to domain decomposition. Math Comput 57:1–21
Cai XC (1989) Some domain decomposition algorithms for nonselfadjont elliptic and parabolic partial differential equations. Ph. D. thesis, Courant Institute
Celia MA, Russell TF, Herrera I, Ewing RE (1990) An Eulerian–Lagrangian localized adjoint method for the advection-diffusion equation. Adv Water Resour 13:187–206
Ciarlet PG (1978) The finite element methods for elliptic problems. North-Holland, New York
Dolean V, Lanteri S, Perrussel R (2008) A domain decomposition method for solving the three-dimensional time-harmonic Maxwell equations discretized by discontinuous Galerkin methods. J Comput Phys 227(3):2044–2072
Dolean V, Jolive P, Nataf F (2015) An introduction to domain decomposition methods: algorithms, theory, and parallel implementation. SIAM
Dryja M, Widlund OB (1987) An additive variant of Schwarz alternating methods for many subregions. Tech. Report 339 Dept. of Comp. Sci. Coutant Institute
Hughes TJR, Brooks AN (1979) A multidimensional upwind scheme with no crosswind diffusion. In: Hughes T.J.R. (Ed.), Finite element methods for convection dominated flows 34:19–35
Lu T, Shih TM, Liem CB (1991) Two synchronous parallel algorithms for partial differential equations. J Comput Math 9(4):74–85
Ma K, Sun T, Yang DP (2009) Parallel Galerkin domain decomposition procedure for parabolic equation on general domain. Numer Methods Partial Differ Equ 25(5):622–636
Tarek M (2008) Domain decomposition methods for the numerical solution of partial differential equations. Lecture Notes in Computational Science and Engineering, Springer
Toselli A, Widlund O (2005) Domain decomposition methods-algorithms and theory. Springer-Verlag, Berlin Heidelberg
Xu J (1989) Theory of Multilevel Methods. Ph. D. thesis, Cornell University
Xu J (1992) Iterative methods by space decomposition and subspace correction: A unifying approach. SIAM Review 34:581–613
Xu J (2001) The method of subspace corrections. J Comput Appl Math 128:335–362
Yang DP (1999) Some least-squares Garlerkin procedures for first-order time-dependent convection–diffusion system. Comput Methods Appl Mech Eng 180:81–95
Yang DP (2000) Analysis of least-squares mixed finite element methods for nonlinear nonstationay convection–diffusion problems. Math Comput 69:929–963
Yang DP (2001) A splitting positive definite mixed element method for miscible displacement of compressible flow in porous media. Numer Methods Partial Differ Equ 17:229–249
Yang DP (2002) Least-squares mixed finite element methods for non-linear parabolic problems. J Comput Math 20:153–164
Yang DP (2010) Parallel domain decomposition procedures of improved D-D type for parabolic problems. J Comput Appl Math 233:2779–2794
Zhang JS (2009) Least-squares mixed finite element method for Sobolev equation. Chin J Eng Math 26(4):749–752
Zhang JS, Yang DP, Fu H, Guo H (2011) Parallel characteristic finite element method for time-dependent convection–diffusion problem. Numer Linear Algebra Appl 18(4):695–705
Zhang JS, Yang DP, Zhu J (2011) Two new least-squares mixed finite element procedures for convection-dominated Sobolev equations. Appl Math J Chin Univ 26(4):401–411
Zhang JS, Yang DP (2011) Parallel characteristic finite difference methods for convection–diffusion equations. Numer Methods Partial Differ Equ 27:854–866
Zhang JS, Yang DP (2011) Parallel least-squares finite element method for time-dependent convection–diffusion system. Computing 91:217–240
Zhang JS, Guo H (2012) A split least-squares characteristic mixed element method for nonlinear nonstationary convection–diffusion problem. Int J Comput Math 89(7):932–943
Authors' contributions
JSZ designed the algorithm and wrote the main manuscript text. HFF and HG did some the experiments and prepared all the figures. All the authors contributed to the discussion of the results. All authors read and approved the final manuscript.
Acknowledgements
Jiansong Zhang’s work was partially supported by the National Nature Science Foundation of China (11126084, 11401588), China Scholarship Council, and the Natural Science Foundation of Shandong Province (ZR2014AQ005), the Fundamental Research Funds for the Central Universities and the Young Talent Attraction fellowship from the Brazilian National Council for Scientific and Technological Development (CNPq). Hui Guo and Hongfei Fu’s work was was partially supported by the Fundamental Research Funds for the Central Universities.
Competing interests
The authors declare that they have no competing interests.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Zhang, J., Guo, H., Fu, H. et al. Parallel algorithm for convection–diffusion system based on least-squares procedure. SpringerPlus 5, 1690 (2016). https://doi.org/10.1186/s40064-016-3333-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s40064-016-3333-8
Keywords
- Overlapping domain decomposition
- Parallel subspace correction
- Least-squares
- Convection–diffusion system
Mathematics Subject Classification
- 65M55
- 65M60
- 65M12
- 65M15