Computer Science ›› 2024, Vol. 51 ›› Issue (9): 31-39.doi: 10.11896/jsjkx.230300188

• High Performance Computing • Previous Articles     Next Articles

Heterogeneous Parallel Computing and Performance Optimization for DSMC/PIC Coupled Simulation Based on MPI+CUDA

LIN Yongzhen1,2, XU Chuanfu1,2, QIU Haozhong1,2, WANG Qingsong1,2, WANG Zhenghua2, YANG Fuxiang3,4, LI Jie3   

  1. 1 Institute for Quantum Information & State Key Laboratory of High Performance Computing,National University of Defense Technology,Chang-sha 410000,China
    2 College of Computer,National University of Defense Technology,Changsha 410000,China
    3 College of Aerospace Science and Engineering,National University of Defense Technology,Changsha 410000,China
    4 Army Military Transportation University,Bengbu,Anhui 233000,China
  • Received:2023-03-23 Revised:2023-06-29 Online:2024-09-15 Published:2024-09-10
  • About author:LIN Yongzhen,born in 1997,postgra-duate.His main research interests include parallel computing and applications.
    XU Chuanfu,born in 1980,Ph.D,asso-ciate researcher,master supervisor.His main research interests include parallel computing and large-scale science and engineering computing.

Abstract: DSMC/PIC coupled simulation is an important high-performance computing application that demands efficient parallel computing for large-scale simulations.Due to the dynamic injection and migration of particles,DSMC/PIC coupled simulations based on MPI parallelism often suffer from large communication overheads and are difficult to achieve load balancing.To address these issues,we design and implement efficient MPI+CUDA heterogeneous parallel algorithm based on the self-developed DSMC/PIC simulation software.Combining the characteristics of the GPU architecture and the DSMC/PIC computation,we conduct a series of performance optimizations,including GPU memory access optimization,GPU thread workload optimization,CPU-GPU data transmission optimization,and DSMC/PIC data conflict optimization.We perform large-scale DSMC/PIC coupled he-terogeneous parallel simulations on NVIDIA V100 and A100 GPUs in the Beijing Beilong Super Cloud HPC system for the pulsed vacuum arc plasma jet application with billions of particles.Compared to the original pure MPI parallelism,the GPU heterogeneous parallelism significantly reduce simulation time,with a speedup of 550% on two GPU cards compared to 192 cores of the CPU,while maintaining better strong scalability.

Key words: Coupled DSMC/PIC, Particle simulation, Heterogeneous parallel, MPI+CUDA

CLC Number: 

  • TP391
[1]BIRD G A.Molecular gas dynamics[J].NASA STI/ReconTechnical Report A,1976,76:40225.
[2]BIRDSALL C K,LANGDON A B.Plasma physics via computer simulation[M].CRC Press,2004.
[3] HOCKNEY R W,EASTWOOD J W.Computer simulationusing particles[J].SIAM Review,1983,3(25):1025102.
[4] BIRD G A.Direct simulation and the Boltzmann equation[J].The Physics of Fluids,1970,13(11):2676-2681.
[5]COPPLESTONE S,ORTWEIN P,MUNZ C D,et al.Coupled PIC-DSMC simulations of a laser-driven plasma expansion[C]//High Performance Computing in Science and Engineering'15:Transactions of the High Performance Computing Center,Stuttgart(HLRS) 2015.Springer International Publishing,2016:689-701.
[6] COPPLESTONE S,MUNZ C D,PFEIFFER M.PIC-DSMCsimulations of plasma plume expansions with ionization and recombination processes[C]//2016 IEEE International Confe-rence on Plasma Science(ICOPS).IEEE,2016.
[7]SMITH B D,BOYD I D,KAMHAWI H,et al.Hybrid-PICmodeling of a high-voltage,high-specific-impulse hall thruster[C]//49th AIAA/ASME/SAE/ASEE Joint Propulsion Con-ference.2013:3887.
[8]KORKUT B,LI Z,LEVIN D A.3-D simulation of ion thruster plumes using octree adaptive mesh refinement[J].IEEE Tran-sactions on Plasma Science,2015,43(5):1706-1721.
[9]BRIEDA L,ZHUANG T S,KEIDAR M.Near plume modeling of a micro cathode arc thruster[C]//49th AIAA/ASME/SAE/ ASEE Joint Propulsion Conference.2013:4120.
[10]TACCOGNA F,MINELLI P,BRUNO D,et al.Kinetic divertor modeling[J].Chemical Physics,2012,398:27-32.
[11]GLEASON-GONZÁLEZ C,VAROUTIS S,HAUER V,et al.Simulation of neutral gas flow in a tokamak divertor using the Direct Simulation Monte Carlo method[J].Fusion Engineering and Design,2014,89(7/8):1042-1047.
[12]XU C,ZHANG L,DENG X,et al.Balancing cpu-gpu collaborative high-order cfd simulations on the tianhe-1a supercomputer[C]//2014 IEEE 28th International Parallel and DistributedProcessing Symposium.IEEE,2014:725-734.
[13]XU C,DENG X,ZHANG L,et al.Collaborating CPU and GPU for large-scale high-order CFD simulations with complex grids on the TianHe-1A supercomputer[J].Journal of Computational Physics,2014,278:275-297.
[14]LI J,INGHAM D,MA L,et al.Numerical simulation of thechemical combination and dissociation reactions of neutral particles in a rarefied plasma arc jet[J].IEEE Transactions on Plasma Science,2017,45(3):461-471.
[15]SU Y,LI J,WANG H,et al.Numerical simulation of chemical reactions on rarefied plasma plume by DSMC method[J].IEEE Transactions on Plasma Science,2021,49(3):1214-1226.
[16]BIRD G A.Definition of mean free path for real gases[J].The Physics of fluids,1983,26(11):3222-3223.
[17]BIRD G A.Molecular gas dynamics and the direct simulation of gas flows[M].Oxford University Press,1994.
[18]BORIS J P.Relativistic plasma simulation-optimization of a hybrid code[C]//International Conference on Numerical Simulation of Plasmas.1970:3-67.
[19]LIMA E R A,TAVARES F W,BISCAIA JR E C.Finite volume solution of the modified Poisson-Boltzmann equation for two colloidal particles[J].Physical Chemistry Chemical Physics,2007,9(24):3174-3180.
[20]QIU H Z.Parallel computing of DSMC/PIC Hybrid Numerical Simulation for Three-dimensional Unsteady pulsed Vacuum Arc Plasma Plumes [C]//2021 HPC China.2021:483-491.
[21]QIU H,XU C,LI D,et al.Parallelizing and Balancing Coupled DSMC/PIC for Large-scale Particle Simulations[C]//2022 IEEE International Parallel and Distributed Processing Sympo-sium(IPDPS).IEEE,2022:390-401.
[1] WANG Qingsong, QIU Haozhong, LIN Yongzhen, YANG Fuxiang, LI Jie, WANG Zhenghua, XU Chuanfu. Large-scale Efficient Hybrid Parallel Computing for DSMC/PIC Coupled Simulation [J]. Computer Science, 2023, 50(11A): 230300146-9.
[2] DING Yue, XU Chuanfu, QIU Haozhong, DAI Weixi, WANG Qingsong, LIN Yongzhen, WANG Zhenghua. Study on Cross-platform Heterogeneous Parallel Computing for Lattice Boltzmann Multi-phase Flow Simulations Based on SYCL [J]. Computer Science, 2023, 50(11): 32-40.
[3] YANG Wang-dong, WANG Hao-tian, ZHANG Yu-feng, LIN Sheng-le, CAI Qin-yun. Survey of Heterogeneous Hybrid Parallel Computing [J]. Computer Science, 2020, 47(8): 5-16.
[4] LIU Xu, MO Ze-yao, AN Heng-bin, CAO Xiao-lin and ZHANG Ai-qing. Automatic Load Modeling Algorithm Based on Real Time Measuring [J]. Computer Science, 2015, 42(1): 63-66.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!