Computer Science ›› 2020, Vol. 47 ›› Issue (4): 13-17.doi: 10.11896/jsjkx.191000010

• Computer Architecture • Previous Articles     Next Articles

Extreme-scale Simulation Based LBM Computing Fluid Dynamics Simulations

LV Xiao-jing1, LIU Zhao2, CHU Xue-sen3, SHI Shu-peng1, MENG Hong-song1, HUANG Zhen-chun2   

  1. 1 National Supercomputing Center in Wuxi,Wuxi,Jiangsu 214072,China;
    2 Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China;
    3 China Ship Scientific Research Center,Wuxi,Jiangsu 214072,China
  • Received:2019-09-08 Online:2020-04-15 Published:2020-04-15
  • Contact: LIU Zhao,born in 1986,Ph.D,is a member of China Computer Federation.His main research interests include high performance computing and computer architecture.
  • About author:LV Xiao-jing,born in 1989,postgradua-te,is a member of China Computer Fede-ration.Her main research interests include parallel algorithm and application.
  • Supported by:
    This work was supported by National Key R&D Program of China (2017YFB0203602).

Abstract: Lattice Boltzmann Method (LBM) is a computational fluid dynamics method based on mesoscopic simulation scales and has been widely used in theoretical research and processing engineering problems.Improving the parallel simulation capability of LBM Computing Fluid software is an important study for high performance computing and applications.The research aims to design and implement a set of highly efficient extended LBM computational fluid dynamics software based on the “Sunway TaihuLight” supercomputing system.According to the architecture of domestic multi-core processor SW26010,several parallel optimization multi-level parallelism techniques to boost the simulation speed and improve the scalability of SWLBM are designed,including date reuse of 19-point stencil,vectorization of collision process and communication overlap computing.Based on these parallel optimization schemes,the numerical simulation with over 10million cores and up to 5.6trillion grids is tested and the SWLBM software can bring up to 172x speed up and achieve a sustained floating of 4.7 PFlops.Compared with the million-core 10000*10000*5000 grid wind filed simulation,the SWLBM machine has a core efficiency of 87%.Test results show that SWLBM has the ability to provide practical large-scale parallel simulation solutions for industrial applications.

Key words: Lattice Boltzmann method, Multi-level parallelism, Parallel optimization, SW26010

CLC Number: 

  • TP391
[1]GAGLIANO A,NOCERA F,PATANIA F,et al.Assessment of micro-wind turbines performance in the urban environments:an aided methodology through geographical information systems[J].International Journal of Energy and Environmental Engineering,2013,4(1):43.
[2]KRAUSE M J,GENGENBACH T,HEUVELINE V.Hybridparallel simulations of fluid flows in complex geometries:application to the human lungs[M]//Euro-Par 2010 Parallel Processing Workshops.Berlin:Springer,2011:209-216.
[3]GÖTZ J,IGLBERGER K,STÜRMER M,et al.Direct numerical simulation of particulate flows on 294912 processor cores[C]//2010 ACM/IEEE International Conference for High Performance Computing,Networking,Storage and Analysis.New Orleans,LA,USA:IEEE,2010.
[4]SCHORNBAUM F,RÜDE U.Massively parallel algorithms for the lattice boltzmann method on NonUniform grids[J].SIAM Journal on Scientific Computing,2016,38(2):C96-C126.
[5]FIETZ J,KRAUSE M J,SCHULZ C,et al.Optimized hybridparallel lattice boltzmann fluid flow simulations on complex geometries[M]//Euro-Par 2012 Parallel Processing.Berlin:Springer,2012:818-829.
[6]ONODERA N,AOKI T,SHIMOKAWABE T.Large-scale LES wind simulation using lattice Boltzmann method for a 10km× 10km area in metropolitan Tokyo[J].TSUBAME e-Science Journal Global Scientific Information and Computing Center,2003,9:1-8.
[7]BAILEY P,MYRE J,WALSH S D C,et al.Accelerating lattice boltzmann fluid flow simulations using graphics processors[C]//2009 International Conference on Parallel Processing.Vienna:IEEE,2009.
[8]CRIMI G,MANTOVANI F,PIVANTI M,et al.Early experience on porting and running a lattice boltzmann code on the xeon-phi Co-processor[J].Procedia Computer Science,2013,18:551-560.
[9]YANG C,ZHENG W M,XUE W,et al.A peta-scalable CPU-GPU algorithm for global atmospheric simulations[J].ACM SIGPLAN Notices,2013,48(8):1.
[1] ZHU Yu, PANG Jian-min, XU Jin-long, TAO Xiao-han, WANG Jun. Adaptive Tiling Size Algorithm for 3D Stencil Computation on SW26010 Many-core Processor [J]. Computer Science, 2021, 48(6): 10-18.
[2] HE Ya-ru, PANG Jian-min, XU Jin-long, ZHU Yu, TAO Xiao-han. Implementation and Optimization of Floyd Parallel Algorithm Based on Sunway Platform [J]. Computer Science, 2021, 48(6): 34-40.
[3] LIU Xiao-nan, JING Li-na, WANG Li-xin, WANG Mei-ling. Large-scale Quantum Fourier Transform Simulation Based on SW26010 [J]. Computer Science, 2020, 47(8): 93-97.
[4] YUAN Xin-hui, LIN Rong-fen, WEI Di, YIN Wan-wang, XU Jin-xiu. Optimization of BFS on Domestic Heterogeneous Many-core Processor SW26010 [J]. Computer Science, 2020, 47(8): 98-104.
[5] WEI Lin-jing, NING Lu-lu, GUO Bin, HOU Zhen-xing, GAN Shi-run. K-mediods Cluster Mining and Parallel Optimization Based on Shuffled Frog Leaping Algorithm [J]. Computer Science, 2020, 47(10): 126-129.
[6] XU Chuan-fu,WANG Xi,LIU Shu,CHEN Shi-zhao,LIN Yu. Large-scale High-performance Lattice Boltzmann Multi-phase Flow Simulations Based on Python [J]. Computer Science, 2020, 47(1): 17-23.
[7] XU Lei, CHEN Rong-liang, CAI Xiao-chuan. Scalable Parallel Finite Volume Lattice Boltzmann Method Based on Unstructured Grid [J]. Computer Science, 2019, 46(8): 84-88.
[8] YANG Si-yan,HE Guo-qi,LIU Ru-yi. Video Stitching Algorithm Based on SIFT and Its Optimization [J]. Computer Science, 2019, 46(7): 286-291.
[9] NI Hong, LIU Xin. Many-core Optimization for Sparse Triangular Solver Under Unstructured Grids [J]. Computer Science, 2019, 46(6A): 518-522.
[10] TAO Xiao-han, PANG Jian-min, GAO Wei, WANG Qi, YAO Jin-yang. Performance Optimization of FT Program Based on SW26010 Processor [J]. Computer Science, 2019, 46(4): 321-328.
[11] LIU Yu-cheng, Richard·DING, ZHANG Ying-chao. Research on Pan-real-time Problem of Medical Detection Based on BPNNs Recognition Algorithm [J]. Computer Science, 2018, 45(6): 301-307.
[12] JIN Xin, LI Long-wei, SU Guo-hua, LIU Xiao-lei and JI Jia-nan. Prediction about Network Security Situation of Electric Power Telecommunication Based on Spark Framework and PSO Algorithm [J]. Computer Science, 2017, 44(Z6): 366-371.
[13] JIANG Wen-chao, LIN Sui, WANG Duo-qiang, LI Dong-ming and JIN Hai. Three-level Parallel Optimization and Application of Calculix in TH-2 Super-computing Environments [J]. Computer Science, 2017, 44(3): 32-35.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!