This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
Computer simulation has shown its great power in solving electromagnetic (EM) problems and become an indispensable tool in engineering design and analysis. For such simulations, surface integral equation methods (SIE) are widely used, and the method of moments (MoM) using SIE is regarded as an effective and popular method [1, 2]. Nowadays, accurate electromagnetic scattering analysis of electrically large objects is of vital importance in many engineering applications. For the scatterer with large electric size, high-frequency methods like physical optics (PO) [3] and uniform theory of diffraction (UTD) [4] fail to provide sufficient accuracy, while high-accuracy methods like MoM and finite element method (FEM) are restricted from their large amount of memory requirement. Even the multilevel fast multipole algorithm (MLFMA) [5–7] can hardly handle the problem of thousands of wavelengths in the condition of limited computing resources.
In order to solve the problem, two approaches, large-scale parallel computing on supercomputers [8] and domain decomposition method (DDM), are concerned. Parallel computing technique can save the computational time to a large extent, in which the most frequently used parallel programming model is the message passing interface (MPI). But for MoM, it is still not easy to simulate a model of hundreds of wavelengths due to limited computational resources. Recent years’ in-depth study has been conducted in DDM based on finite elements (FEM), finite difference, and integral equation methods [9–11]. Among them, the DDM based on integral equations has successfully simulated scattering problems of PEC objects and radiation problems of antenna, which seems to be a method both feasible and efficient for solving complex and large problems. Basically, the strategies of DDM can be categorized into two types: one is to divide the whole solution region into many subdomains, and each subdomain is enclosed by a closed surface, named enclosed DDM (E-DDM) [12–15]. The other is also to divide the whole solution region into many subdomains, but each subdomain is not required to be enclosed by a closed surface, named unclosed-DDM (U-DDM). More recently, a so-called U-DDM-based discontinuous Galerkin (DG) method has been successfully developed to implement DDM of the SIE MoM and the volume integral equation (VIE) MoM [16–19]. It has been shown that this DG method using Krylov iterative methods is attractive for complex multiscale EM problems. However, so far, this powerful DG has not been derived and investigated for the challenging problems of scattering from electrically large objects.
The aim of this paper is to derive an efficient formulation of E-DDM for analysing EM scattering from electrically large PEC objects whose MLFMA models are too large for the user’s computer to accommodate. In this paper, the marriage of an efficient parallel integral equation-based nonoverlapping domain decomposition method (IE-NDDM) and high performance of computing (HPC) is presented. On the basis of the first-order Robin-type transmission conditions (TC) [20], an explicit boundary condition is applied to ensure the continuity of electric currents on the touching faces between two subdomains. With the aid of the TC, the unphysical reflection at subdomain interfaces is suppressed, and each subdomain is allowed to be meshed independently, providing unprecedented flexibility and convenience for the process of mesh generation. The proposed method allows all the computer resources to be used to a single subdomain at a time, the advantage of which is that the imbalanced workload caused by the unevenly divided subdomains is reduced, and at the same time the scale of the problem to be solved is greatly enlarged. For the solution of the interior of each subdomain and the coupling between subdomains, parallel MLFMA is used to speed up the matrix-vector multiplications, and to avoid the storage of the mutual impedance, different subdomains exchange the coupling in the manner of near-field instead of traditional MLFMA with weight functions expanded in the disaggregation stage. An improved adaptive direction partitioning scheme is applied to obtain a good load balance for MLFMA, further improving the capability of the proposed IE-NDDM. Based on the proposed algorithms, we have also developed a hybrid message passing interface (MPI)/OpenMP parallel implementation to fully exploit the recent success of multicore processors and massively parallel distributed memory supercomputers.
The remaining part of this paper is organized as follows. First, we outline the basic principle of IE-NDDM. Then, the parallelization strategy for the iterative process and MLFMA solver is designed, respectively. The numerical results are presented in Section 4 followed by the conclusion.
2. Nonoverlapping Domain Decomposition Method
2.1. Notations
Compared with overlapped DDM, nonoverlapping DDM only adds touching faces between neighbouring subdomains to make each of them closed, and CFIE can be applied to obtain faster convergence compared to EFIE when using the MLFMA solver. Without loss of generality, consider a PEC object divided into two identical nonoverlapping subdomains which is illuminated by an incident plane wave
When the field point
By applying the transmission condition to the adjacent subdomains, the continuity of electric currents is ensured. Thus, it allows the solution of nonoverlapping DDM equals to that of the original problem.
2.2. Galerkin Discrete Formulation
In order to simplify the presentation, we still consider a decomposed PEC object with two subdomains shown in Figure 1. To solve (1), the RWG functions [21] are used to expand the surface current
Based on the idea of nonoverlapping DDM, the touching face
In this paper, an explicit boundary condition
Generally, for the
At this point, the unknown current coefficients in (5) and (6), the mutual-interaction matrices
Note that during the execution of the algorithm, the global model equation (8) does not have to be fully constructed since the fast algorithm MLFMA will be applied.
2.3. Resolution of the DDM Linear System
For the proposed IE-NDDM, the Gauss–Seidel scheme ought to be adapted so as to allow all the computer resources to be used to a single subdomain problem at a time whose whole model is too large for the current computer resources to accommodate. The unknown coefficients on a subdomain will be successively updated by solving the local model equation defined on the subdomain until convergence. The convergence rate is improved by using the explicit boundary condition in (7), and it will be verified in the numerical results.
Given the tolerance
The iterative steps continue, and the residual error
It is worth pointing out that at the
In this paper, a hybrid MPI/OpenMP strategy is also implemented. Take the very large problem for example; we can use
2.4. Acceleration of IE-NDDM Using Parallel MLFMA
For electrically large problems, the interior of each subdomain and the coupling between subdomains are accelerated using MLFMA to further reduce the memory consumption and computational complexity, and that is to speed up far interactions of
The spatial partitioning and direction partitioning are the key factors for parallelization of MLFMA. Simply using spatial partitioning at lower levels while using direction partitioning at higher levels and setting a transition level in the middle level fails to obtain ideal scalability when the number of processes exceeds
[figures omitted; refer to PDF]
3. Results and Discussion
In this section, numerical examples are given to study the performance of the proposed IE-NDDM, including the accuracy, the parallel efficiency, and the convergence behaviour. The computing platform used in the section is the HPC cluster from Xidian University (XD-HPC). Each node has two 12-core Intel Xeon 2690-v2 2.2 GHz CPUs and 64 GB memory connected with each other by 56 Gbps InfiniBand network. The parallel GMRES is combined with a block-diagonal preconditioner as the inner iterative solver for MLFMA. The residual error for inner iterative convergence is set to 3.0 × 10−3.
3.1. Performance Analysis for a PEC Sphere
First, the accuracy of the proposed parallel IE-NDDM is validated through comparison with the result of Mie series and MLFMA. A PEC sphere divided into four subdomains is simulated in this example with each colour representing one subdomain, the model of which is shown in Figure 5. The diameter of the sphere is 3 m, and the frequency of the incident plane wave is 3 GHz. The whole sphere is modelled by CFIE discretised with a total of 1030251 unknowns. The bistatic RCS results are plotted in Figure 6. From the comparison, it can be seen that the results from the IE-NDDM agree well with those from Mie series and MLFMA. Figure 7 compares the convergence behaviour of the proposed IE-NDDM by employing different boundary conditions. As can be seen, a faster convergence rate is obtained when we use the boundary condition proposed in this paper which takes 8 steps to converge while the traditional one-order Robin-type transmission condition takes 16 steps.
[figure omitted; refer to PDF] [figure omitted; refer to PDF] [figure omitted; refer to PDF]3.2. Performance Analysis for a Metallic Missile
Here, we take a metallic missile as an example to study the computation of current distribution on it and the strong scalability. The incident plane wave propagates towards head (
In the strong scaling experiment, we examine the solution time required for the missile problem by increasing the number of cores. In parallel computation, six MPI processes are assigned with one computing node, in which four OpenMP threads are used in each MPI process. The computing node in use has two 12-core Intel Xeon E5 2.2 GHz processors. We still consider EM scattering from the missile at 10 GHz and increase the number of computing nodes from 1 to 16. Thus, the total computing cores increase from 24 to 384. The timings of various simulations are given in Figure 11. We see that the speedup is super-linear between 48 and 192 cores. At peak performance on 384 cores, the speedup relative to 24 cores is approximately a twelve times decrease in simulation time.
[figure omitted; refer to PDF]3.3. Performance Analysis for the Metallic 737 Aircraft
This example is an investigation of the RCS of a real-life Boeing 737 aircraft and further validates the accuracy and efficiency of the proposed IE-NDDM. The aircraft is divided into five subdomains shown in Figure 12 with each colour representing one subdomain. The incident plane wave is toward the nose (
The computational statistics for solving each subdomain and the entire problem are recorded in Table 1. Eight computing nodes are used, and six MPI processes are assigned with one computing node, in which four OpenMP threads are used in each MPI process. We can observe that the parallel IE-NDDM algorithm leads to over 73.7% memory reduction compared with MLFMA. The bistatic RCS results computed by the parallel IE-NDDM algorithm are plotted in Figure 13, in which the MLFMA results are also given for comparison. From the comparison, it can be seen that the results using IE-NDDM agree well with those from MLFMA. Thus, it provides a theoretical support for the solution of larger scattering objects.
Table 1
Computational statistics of aircraft.
| Method | Unknowns | Peak storage (GB) | CPU cores |
|---|---|---|---|
| IE-NDDM | 6729015 ① | 123.8 | 192 |
| 8818704 ② | |||
| 5946264 ③ | |||
| 5751930 ④ | |||
| 9000189 ⑤ | |||
| MLFMA | 36040785 | 470.7 | 192 |
[figures omitted; refer to PDF]
3.4. Performance Analysis for a Metallic Aircraft of 2000 Wavelengths
In this example, an extremely electrically large problem, the scattering characteristics of another aircraft, has been solved by the parallel IE-NDDM algorithm. The incident plane wave is toward the bow, and the polarization direction is
[figures omitted; refer to PDF]
4. Conclusions
In this paper, an IE-NDDM using the special explicit boundary condition has been established for analysing EM scattering from electrically large PEC objects whose MLFMA models are too large for the user’s computer to accommodate. The parallel multilevel fast multipole algorithm is used to accelerate the computation of the subdomains and the coupling between them, and the improved adaptive partitioning scheme enhanced its parallel efficiency. The coupling between different subdomains is computed in the manner of near field, and Lagrange polynomial interpolation is applied to the calculation of the translation matrix, which significantly reduces storage and CPU time. In the case of limited resources, the engineering problem of the scattering characteristics of thousands of wavelengths is solved.
Conflicts of Interest
The authors declare that there is no conflict of interest regarding the publication of this paper.
Acknowledgments
This work was supported in part by the China Postdoctoral Science Foundation funded project under Grant 2017M613068, in part by the National Key Research and Development Program of China under Grant 2017YFB0202102, in part by the National Key Research and Development Program of China under Grant 2016YFE0121600, in part by the National High Technology Research and Development Program of China (863 Program) under Grant 2014AA01A302, in part by the Key Research and Development Program of Shandong Province under Grant 2015GGX101028, and in part by the Special Program for Applied Research on Super Computation of the NSFC-Guangdong Joint Fund (the second phase) under Grant No. U1501501.
[1] R. F. Harrington, "Boundary integral formulations for homogeneous material bodies," Journal of Electromagnetic Waves and Applications, vol. 3 no. 1,DOI: 10.1163/156939389X00016, 1989.
[2] L. N. Medgyesi-Mitschang, J. M. Putnam, M. B. Gedera, "Generalized method of moments for three-dimensional penetrable scatterers," Journal of the Optical Society of America A, vol. 11 no. 4, pp. 1383-1398, DOI: 10.1364/JOSAA.11.001383, 1994.
[3] L. Pandolfo, P. de Vita, M. Bandinelli, G. Carluccio, M. Albani, "A flexible, general-purpose code based on the iterative physical optics algorithm: analyzing electromagnetic scattering in electrically large scenarios. [EM Programmer’s Notebook]," IEEE Antennas and Propagation Magazine, vol. 59 no. 5, pp. 150-158, DOI: 10.1109/MAP.2017.2731210, 2017.
[4] G. Karagounis, D. de Zutter, D. Vande Ginste, "A hybrid MLFMM–UTD method for the solution of very large 2-D electromagnetic problems," IEEE Transactions on Antennas and Propagation, vol. 64 no. 1, pp. 224-234, DOI: 10.1109/TAP.2015.2501812, 2016.
[5] J. Song, Cai-Cheng Lu, Weng Cho Chew, "Multilevel fast multipole algorithm for electromagnetic scattering by large complex objects," IEEE Transactions on Antennas and Propagation, vol. 45 no. 10, pp. 1488-1493, DOI: 10.1109/8.633855, 1997.
[6] B. Dembart, E. Yip, "A 3D fast multipole method for electromagnetics with multiple levels," 11th Annual Review of Progress in Applied Computational Electromagnetics, pp. 621-628, .
[7] Z. P. Nie, J. Hu, "The fast multipole methods for vector analysis of scattering from 3-dimensional objects with complex structure," Acta Electronica Sinica, vol. 27 no. 6, pp. 104-109, 1999.
[8] Y. Chen, S. Zuo, Y. Zhang, X. Zhao, H. Zhang, "Large-scale parallel method of moments on CPU/MIC heterogeneous clusters," IEEE Transactions on Antennas and Propagation, vol. 65 no. 7, pp. 3782-3787, DOI: 10.1109/TAP.2017.2700871, 2017.
[9] Z. Lou, J. M. Jin, "A novel dual-field time-domain finite-element domain-decomposition method for computational electromagnetics," IEEE Transactions on Antennas and Propagation, vol. 54 no. 6, pp. 1850-1862, DOI: 10.1109/TAP.2006.875922, 2006.
[10] Z. H. Lai, J. F. Kiang, R. Mittra, "A domain decomposition finite difference time domain (FDTD) method for scattering problem from very large rough surfaces," IEEE Transactions on Antennas and Propagation, vol. 63 no. 10, pp. 4468-4476, DOI: 10.1109/TAP.2015.2466466, 2015.
[11] Q. Su, Z. J. Gu, Z. Lin, X. W. Zhao, Y. Zhang, "Analysis of scattering characteristics of one-thousand-wavelength object using parallel IE-NDDM," Journal of Xidian University, vol. 45 no. 6, 2018.
[12] M. K. Li, W. C. Chew, "Wave-field interaction with complex structures using equivalence principle algorithm," IEEE Transactions on Antennas and Propagation, vol. 55 no. 1, pp. 130-138, DOI: 10.1109/TAP.2006.888453, 2007.
[13] M. K. Li, W. C. Chew, "Multiscale simulation of complex structures using equivalence principle algorithm with high-order field point sampling scheme," IEEE Transactions on Antennas and Propagation, vol. 56 no. 8, pp. 2389-2397, DOI: 10.1109/TAP.2008.926785, 2008.
[14] Z. Peng, X. C. Wang, J. F. Lee, "Integral equation based domain decomposition method for solving electromagnetic wave scattering from non-penetrable objects," IEEE Transactions on Antennas and Propagation, vol. 59 no. 9, pp. 3328-3338, DOI: 10.1109/TAP.2011.2161542, 2011.
[15] Z. Peng, K. H. Lim, J. F. Lee, "Computations of electromagnetic wave scattering from penetrable composite targets using a surface integral equation method with multiple traces," IEEE Transactions on Antennas and Propagation, vol. 61 no. 1, pp. 256-270, DOI: 10.1109/TAP.2012.2220098, 2013.
[16] Z. Peng, K. H. Lim, J. F. Lee, "A discontinuous Galerkin surface integral equation method for electromagnetic wave scattering from nonpenetrable targets," IEEE Transactions on Antennas and Propagation, vol. 61 no. 7, pp. 3617-3628, DOI: 10.1109/TAP.2013.2258394, 2013.
[17] Z. Peng, R. Hiptmair, Y. Shao, B. MacKie-Mason, "Domain decomposition preconditioning for surface integral equations in solving challenging electromagnetic scattering problems," IEEE Transactions on Antennas and Propagation, vol. 64 no. 1, pp. 210-223, DOI: 10.1109/TAP.2015.2500908, 2016.
[18] L. M. Zhang, X. Q. Sheng, "A discontinuous Galerkin volume integral equation method for scattering from inhomogeneous objects," IEEE Transactions on Antennas and Propagation, vol. 63 no. 12, pp. 5661-5667, DOI: 10.1109/TAP.2015.2490254, 2015.
[19] L. M. Zhang, X. Q. Sheng, "Discontinuous Galerkin volume integral equation solution of scattering from inhomogeneous dielectric objects by using the SWG basis function," IEEE Transactions on Antennas and Propagation, vol. 65 no. 3, pp. 1500-1504, DOI: 10.1109/TAP.2016.2647686, 2017.
[20] K. Zhao, V. Rawat, J. F. Lee, "A domain decomposition method for electromagnetic radiation and scattering analysis of multi-target problems," IEEE Transactions on Antennas and Propagation, vol. 56 no. 8, pp. 2211-2221, DOI: 10.1109/TAP.2008.926774, 2008.
[21] S. Rao, D. Wilton, A. Glisson, "Electromagnetic scattering by surfaces of arbitrary shape," IEEE Transactions on Antennas and Propagation, vol. 30 no. 3, pp. 409-418, DOI: 10.1109/TAP.1982.1142818, 1982.
[22] M. H. Han, Y. X. Peng, S. K. Li, F. J. Chen, "Implementation of parallel computing for electromagnetic scattering on Linux cluster," Journal of Computer Research and Development, vol. 42 no. 6, pp. 1085-1088, DOI: 10.1360/crad20050628, 2005.
[23] J. Dong, S. L. Chai, J. J. Mao, "Parallel solving large scale electromagnetic computing problem on super computer," High Performance Computing Application Conference 2005, pp. 329-338, .
[24] L. Gurel, O. Ergul, "Efficient solution of the combined-field integral equation with the parallel multilevel fast multipole algorithm," 2007 Computational Electromagnetics Workshop, pp. 16-21, DOI: 10.1109/cem.2007.4387644, .
[25] O. Ergul, L. Gurel, Parallelization of MLFMA for the Solution of Large-Scale Electromagnetics Problems, 2014.
[26] B. Michiels, J. Fostier, I. Bogaert, D. de Zutter, "Weak scalability analysis of the distributed-memory parallel MLFMA," IEEE Transactions on Antennas and Propagation, vol. 61 no. 11, pp. 5567-5574, DOI: 10.1109/TAP.2013.2278078, 2013.
[27] X. Zhao, S.-W. Ting, Y. Zhang, "Parallelization of half-space MLFMA using adaptive direction partitioning strategy," IEEE Antennas and Wireless Propagation Letters, vol. 13 no. 6, pp. 1203-1206, DOI: 10.1109/LAWP.2014.2331699, 2014.
[28] J. Song, W. C. Chew, "Interpolation of translation matrix in MLFMA," Microwave and Optical Technology Letters, vol. 30 no. 2, pp. 109-114, DOI: 10.1002/mop.1234, 2001.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2019 Qin Su et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. http://creativecommons.org/licenses/by/4.0/
Abstract
In this paper, a parallel nonoverlapping and nonconformal domain decomposition method (DDM) is proposed for fast and accurate analysis of electrically large objects in the condition of limited resources. The formulation of nonoverlapping DDM for PEC bodies is derived from combined-field integral equation (CFIE), and an explicit boundary condition is applied to ensure the continuity of electric currents across the boundary. A parallel multilevel fast multipole algorithm (MLFMA) is extended to accelerate matrix-vector multiplications of subdomains as well as the coupling between them, and the coupling between different subdomains is computed in the manner of near field to avoid the storage of the mutual impedance. An improved adaptive direction partitioning scheme is applied to the oct-tree of MLFMA to achieve high parallel efficiency. Numerical examples demonstrate that the proposed method is able to simulate realistic problems with a maximum dimension greater than 2000 wavelengths.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details
; Gu, Zongjing 1 ; Zhai, Chang 1 ; Lin, Zhongchao 1
; Wu, Weijun 2 1 Shaanxi Key Laboratory of Large Scale Electromagnetic Computing, Xidian University, Xi’an, Shaanxi 710071, China
2 Science and Technology on Electromagnetic Compatibility Laboratory, China Ship Development and Design Center, Wuhan 430064, China





