Haoxiang Wu,
Xunwang Zhao,
Zhongchao Lin,
Yu Zhang,
Qi Zhang
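Shaanxi Key Laboratory of Large Scale Electromagnetic Computing, Xidian University, Xi'an 710071, China
Funding: National Key Research and Development Program of China (2017YFB0202102, 2016YFE0121600); China Postdoctoral Science Foundation (2017M613068)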
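Details
Author biographies: Zongjing Gu: male, born in 1989, Ph.D. candidate; research interests: computational electromagnetics, large-scale parallel method of moments, and domain decomposition algorithms
Haoxiang Wu: male, born in 1995, master's student; research interests: computational electromagnetics and large-scale parallel method of moments
Xunwang Zhao: male, born in 1983, associate professor; research interest: analysis of large airborne antenna arrays
Zhongchao Lin: male, born in 1988, lecturer; research interest: computational electromagnetics
Yu Zhang: male, born in 1978, professor; research interests: computational electromagnetics and large-scale parallel algorithms
Corresponding author: Xunwang Zhao, xwzhao@mail.xidian.edu.cn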
CLC number: TN820
Metrics
Article views: 1177
HTML full-text views: 473
PDF downloads: 37
Citations: 0
Publication history
Received: 2018-06-04
Revised: 2018-12-13
Published online: 2018-12-19
Issue date: 2019-04-01
Parallel MoM Using the Six Hundred Thousand Cores on Domestically-made and Many-core Supercomputer
Zongjing GU, Haoxiang WU,
Xunwang ZHAO,
Zhongchao LIN,
Yu ZHANG,
Qi ZHANG
Shaanxi Key Laboratory of Large Scale Electromagnetic Computing, Xidian University, Xi'an 710071, China
Funds: The National Key Research and Development Program of China (2017YFB0202102, 2016YFE0121600); The China Postdoctoral Science Foundation (2017M613068)
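Abstract (translated from the Chinese original)
To make electromagnetic computation secure, reliable, and independently controllable, this paper develops a large-scale parallel method of moments (MoM) on the domestically made many-core supercomputer "Tianhe-2". To relieve the communication pressure on the cluster during large-scale parallel computation and to accelerate the solution of the MoM integral equation, the paper analyzes the diagonally dominant character of the matrix obtained by discretizing the MoM electric field integral equation and proposes a new LU decomposition algorithm, the Block Diagonal matrix Pivoting LU decomposition (BDPLU) algorithm. The algorithm reduces the computation in panel column factorization and, more importantly, completely eliminates the MPI communication overhead of pivoting. With BDPLU, the parallel MoM scales beyond 6×10^5 CPU cores, currently the largest parallel MoM computation achieved on a domestic supercomputing platform, and the parallel efficiency of the matrix solution reaches up to 51.95%. Numerical results show that the parallel MoM can solve large-scale electromagnetic problems accurately and efficiently on domestic supercomputing platforms.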
Keywords (translated): Method of moments (MoM) /
LU decomposition /
Domestically made supercomputer /
6×10^5 cores
Abstract: To achieve safe, reliable, and independently controllable electromagnetic computing, a large-scale parallel MoM is developed on the domestically made many-core supercomputer platform "Tianhe-2". To reduce the communication pressure on the computer cluster and to accelerate the solution of the MoM integral equation during large-scale parallel computation, a new LU decomposition algorithm, the Block Diagonal matrix Pivoting LU decomposition (BDPLU) algorithm, is proposed by analyzing the diagonally dominant character of the matrix generated by discretizing the electric field integral equation of MoM. The BDPLU algorithm reduces the amount of computation in panel factorization. More importantly, it completely eliminates MPI communication during pivoting. Using the BDPLU algorithm, the parallel MoM scales beyond 6×10^5 CPU cores, which is currently the largest parallel MoM computation on a domestically made many-core supercomputing platform, and the parallel efficiency of the matrix solution reaches up to 51.95%. Numerical results show that the parallel MoM can accurately and efficiently solve large-scale electromagnetic problems on domestic supercomputing platforms.
Key words: Method of Moments (MoM) /
LU decomposition /
Domestically-made supercomputer /
6×10^5 cores
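The pivoting idea described in the abstract can be illustrated with a small serial sketch. It is not the authors' implementation: the function names `bdplu` and `lu_product` and the block-size parameter `nb` are hypothetical, and the sketch assumes that "block diagonal pivoting" means the pivot search for each column is confined to the rows of the current diagonal block. In a distributed LU factorization, a pivot search over the full column typically requires communication among the processes that own that column; confining the search to one diagonal block would keep it local to a single process, which is plausible only because the matrix is diagonally dominant as the abstract states.

```python
def bdplu(a, nb):
    """LU factorization with the pivot search limited to each diagonal block.

    Illustrative serial sketch (assumption, not the paper's code).
    a  : square matrix as a list of rows (real or complex entries)
    nb : block size; pivots for columns k0..k1-1 are chosen only from
         rows k0..k1-1, i.e. inside the diagonal block
    Returns (lu, perm) where lu packs the unit-lower factor L (below the
    diagonal) and U (on and above it), and perm satisfies A[perm] = L U.
    """
    n = len(a)
    lu = [list(row) for row in a]
    perm = list(range(n))
    for k0 in range(0, n, nb):
        k1 = min(k0 + nb, n)
        for k in range(k0, k1):
            # Pivot search restricted to the diagonal block: rows k..k1-1 only.
            p = max(range(k, k1), key=lambda r: abs(lu[r][k]))
            if lu[p][k] == 0:
                raise ZeroDivisionError("zero pivot inside diagonal block")
            if p != k:
                lu[k], lu[p] = lu[p], lu[k]
                perm[k], perm[p] = perm[p], perm[k]
            # Eliminate below the pivot across all remaining rows.
            for i in range(k + 1, n):
                lu[i][k] /= lu[k][k]
                m = lu[i][k]
                for j in range(k + 1, n):
                    lu[i][j] -= m * lu[k][j]
    return lu, perm


def lu_product(lu):
    """Multiply the packed factors back together and return L U."""
    n = len(lu)
    out = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            s = 0
            for k in range(min(i, j) + 1):
                l_ik = 1 if k == i else lu[i][k]  # unit diagonal of L
                s += l_ik * lu[k][j]
            out[i][j] = s
    return out


if __name__ == "__main__":
    # Small diagonally dominant complex test matrix (made-up values).
    a = [[4 + 1j, 1, 0.5, 0],
         [1, 5, 1j, 0.5],
         [0.5, 1j, 6, 1],
         [0, 0.5, 1, 4 - 1j]]
    lu, perm = bdplu(a, nb=2)
    print("row permutation:", perm)
```

Because rows are only ever exchanged within a diagonal block, the permutation never moves a row across a block boundary. With standard partial pivoting the search range would instead be rows k..n-1.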
Full-text PDF download link:
https://jeit.ac.cn/article/exportPdf?id=1acba4ea-5cf9-416b-b569-79eb66fe39ff