Computer Science ›› 2020, Vol. 47 ›› Issue (1): 24-30.doi: 10.11896/jsjkx.181102176

• Computer Architecture • Previous Articles     Next Articles

Research on Adaptation of CFD Software Based on Many-core Architecture of 100P Domestic Supercomputing System

LI Fang1,LI Zhi-hui2,XU Jin-xiu1,FAN Hao1,CHU Xue-sen3,LI Xin-liang4   

  1. (Jiangnan Institute of Computing Technology,Wuxi,Jiangsu 214083,China)1;
    (National Laboratory of Computational Fluid Dynamics,Beijing 100191,China)2;
    (China Ship Scientific Research Center,Wuxi,Jiangsu 214081,China)3;
    (Institute of Mechanics,Chinese Academy of Sciences,Beijing 100190,China)4
  • Received:2018-11-26 Published:2020-01-19
  • About author:LI Fang,born in 1980,Ph.D,associate researcher.Her main research interests include computational fuid dynamics and high-performance parallel computation and application;LI Zhi-hui,borin in 1968,Ph.D,professor,doctoral supervisor.His main research interests include computable modeling on nonlinear deforming and destroying mechanism of metal truss structure,numerical forecast of flight track and high-performance parallel computation and application.
  • Supported by:
    This work was supported by the Project of Manned Space Engineering Technology (2018-14),Major Research Plan of the National Natural Science Foundation of China (91530319) and National Basic Research Program of China (2014CB744100).

Abstract: Domestic many-core super computing system provides two program languages with different program difficulty.Adaptation to many-core architecture of CFD software decides which program language should be used.Firstly,this paper briefly introduced the many-core architecture,program model and program languages.And then challenges on the adaptation of CFD software were analyzed,including data relativity of implicit method,solving of big parse linear equations,many grid method and unstructured grids.For each challenge,corresponding countermeasure was provided too.At last,the paper provided the speedup ratio of some typical software of fluid dynamics based on theory analysis and experiments.Facts prove that most CFD softwares adapt well to domestic many-core architecture and can use simple program language to get better parallel ration on million cores.

Key words: Domestic, Many-core architecture, Software of computational fluid dynamics, Adaptation, Program language, Parallel algorithm

CLC Number: 

  • TP311
[1]ZHENG F,LI H L,LV H,et al.Cooperative computing techniques for a deeply fused and heterogeneous many-core processor architecture[J].Journal of Computer Science and Technology,2015,30(1):145-162.
[2]FU H H,LIAO J F,YANG J Z,et al.The Sunway Taihulight supercomputer:system and applications[J].Science China Information Sciences,2016,59(7):72-91.
[3]YANG C,XUE W,FU H H,et al.10m-core scalable fully- implicit solver for nonhydrostatic atmospheric dynamics[C]∥Proceedings of the International Conference for High Performance Computing,Networking,Storage and Analysis.IEEE,2016:6-15.
[4]ZHANG J,ZHOU C B,WANG Y G,et al.Extreme-Scale Phase Field Simulations of Coarsening Dynamics on the Sunway TaihuLight Supercomputer [C]∥International Conference for High Performance Computing,Networking,Storage and Analysis.IEEE,2016:34-45.
[5]FU H H,XUE W,YANG C,et al.Redesigning CAM-SE for Peta-Scale Climate Modeling Performance on Sunway TaihuLight [C]∥High Performance Computing,Networking,Storage and Analysis.IEEE,2017:4-12.
[6]FU H H,LIAO J F,YANG J Z,et al.15-Pflops Nonlinear Earthquake Simulation on Sunway TaihuLight:Enabling Depiction of Realistic 10 Hz Scenarios[C]∥High Performance Computing,Networking,Storage and Analysis.IEEE,2017:102-117.
[7]QIAO F L,ZHAO W,YIN X Q,et al.A highly effective global surface wave numerical simulation with ultra-high resolution[C]∥High Performance Computing,Networking,Storage and Analysis.IEEE,2016:46-56.
[8]HOU C F,XU J,WANG P,et al.Efficient GPU-accelerated molecular dynamics simulation of solid covalent crystals[J].MOLECULAR SIMULATION,2012,38(1):8-15.
[9]HOU C F,XU J,WANG P,et al.Petascale molecular dynamics simulation of crystalline silicon on Tianhe-1A[J].International Journal of High Performance Computing Applications,2013,27(3):307-317.
[10]LI D,XU Z M,LI S,et al.A survey on information diffusion in online social networks [J].Chinese Journal of Computers,2014,37(1):189-206.
[11]LIN H,TANG X C,YU B W,et al.Scalable Graph Traversal on Sunway TaihuLight with Ten Million Cores[C]∥2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS).IEEE Computer Society,2017.
[12]LIN J,XU Z G,NUKADA A.Optimizations of Two Compute-bound Scientific Kernels on SW26010 Many-core Processor[C]∥Proceedings of the 46th International Conference on Pa-rallel Processing.IEEE,2017.
[13]XU Z G,LIN J,MATSUOKA S.Benchmarking Sunway SW26010 Manycore Processor[C]∥Proceedings of The Seventh International Workshop on Accelerators and Hybrid Exascale Systems (AsHES) (IPDPS workshop).Orlando,USA,2017.
[14]AN H.Pipelining Computation and Data Reuse Strategies for Scaling GROMACS on the Sunway Many-core Processor[C]∥18th International Conference on Algorithms and Architectures for Parallel Processing(ICA3PP-2018).IEEE,2018.
[15]YOU H T,ZHANG L B,MAO Z H.OpenACC2.0 VS OpenMP4.0 Comparation of Two Popular Programming Language Based on Compilation Instructions[J].High Performance Computing,2014,227:20-25.
[17]LIAO J F.Redesigning CAM-SE for Peta-Scale Climate Mode- ling Performance on Sunway TaihuLight [D].Beijing:Tsinghua University,2017.
[18]AO Y L.Research on Key Optimizations of Sparse Matrix and Stencil Computation for the Domestic Large Many-core System[D].Beijing:University of Chinese Academy of Sciences,2017.
[19]NI H.Research on Heterogeneous parallel computing technology of CFD in unstructured grids[D].Wuxi:The 56th Institute of PLA,2018.
[20]LI Z Z.Research on parallel multi grid of unstructured grids
[D].Changsha:National University of Defense Technology,2012.
[21]MENG D L,WEN M H,WEI J W.Porting and Optimizing OpenFOAM on Sunway TaihuLight System.Computer Science,2017,10(44):64-70.
[22]LIN H.Extreme-scale graph analysis on heterogeneous architecture[D].Beijing:Tsinghua University,2017.
[23]XU J X,YOU H T.Application of Many-core Programming Language OpenACC in Solving of Boltzmann Equations[J].High Performance Computing,2016(2):7-12.
[24]LI Z H,ZHANG H X.Parallel Computing of Three Dimension Complex Gas Motion Flow[J].Journal of Aerodynamics,2010,28(1):7-16.
[1] NI Hong, LIU Xin. Many-core Optimization for Sparse Triangular Solver Under Unstructured Grids [J]. Computer Science, 2019, 46(6A): 518-522.
[2] CHEN Da-shuang, LI Ying-shan, WU Hong. Self-adaptive EM Phase Noise Suppression Algorithm in F-OFDM System [J]. Computer Science, 2019, 46(2): 68-75.
[3] XU Shu-yan, HAN Li-xin, XU Guo-xia. Domain Adaptation Algorithm Based on Tensor Decomposition [J]. Computer Science, 2019, 46(12): 89-94.
[4] SHI Zheng, XU Ming-xing. Database of Chinese Domestic Films for Fox-office Revenue Forecasting [J]. Computer Science, 2019, 46(11A): 149-152.
[5] ZHAO Yun-tao, CHEN Jing-cheng, LI Wei-gang. Multi-objective Grey Wolf Optimization Hybrid Adaptive Differential Evolution Mechanism [J]. Computer Science, 2019, 46(11A): 83-88.
[6] ZHU Chao, WU Su-ping. Parallel Harris Feature Point Detection Algorithm [J]. Computer Science, 2019, 46(11A): 289-293.
[7] HU Meng-yuan, HUANG Hong-yun, DING Zuo-hua. Ensemble Model for Software Defect Prediction [J]. Computer Science, 2019, 46(11): 176-180.
[8] XU Qi-ze, HAN Wen-ting, CHEN Jun-shi, AN Hong. Optimization of Breadth-first Search Algorithm Based on Many-core Platform [J]. Computer Science, 2019, 46(1): 314-319.
[9] ZHANG Ying-jie, ZHU Xue-feng. Review of Pattern Driven Software Architecture Design [J]. Computer Science, 2018, 45(11A): 48-52.
[10] LOU Hao-feng, ZHANG Duan. Gaussian Process Assisted CMA-ES Application in Medical Image Registration [J]. Computer Science, 2018, 45(11A): 234-237,262.
[11] LIU Wei, CHENG Cong-cong, PEI Meng-li, SHE Wei. Adaptive Blind Watermarking Algorithm Based on Sub-block Segmentation [J]. Computer Science, 2018, 45(10): 178-182, 216.
[12] ZHOU Jie and LI Wen-jing. Research on Parallel Algorithm of Petri Net Based on Three-layer Mixed Programming Model [J]. Computer Science, 2017, 44(Z11): 586-591, 595.
[13] ZHU Kai-long, LU Yu-liang and ZHANG Yan-qing. Study on Calculation Method for Internet Topological Parameters Based on MapReduce [J]. Computer Science, 2017, 44(6): 80-82.
[14] TANG Bing, Laurent BOBELIN and HE Hai-wu. Parallel Algorithm of Nonnegative Matrix Factorization Based on Hybrid MPI and OpenMP Programming Model [J]. Computer Science, 2017, 44(3): 51-54.
[15] ZHANG Chun-yun, QIN Peng-da and YIN Yi-long. Self-adaptation Multi-gram Weight Learning Strategy for Sentence Representation Based on Convolutional Neural Network [J]. Computer Science, 2017, 44(1): 60-64.
Full text



[1] LIU Bo-yi, TANG Xiang-yan and CHENG Jie-ren. Recognition Method for Corn Borer Based on Templates Matching in Muliple Growth Periods[J]. Computer Science, 2018, 45(4): 106 -111, 142 .
[2] YANG Yu-qi, ZHANG Guo-an and JIN Xi-long. Dual-cluster-head Routing Protocol Based on Vehicle Density in VANETs[J]. Computer Science, 2018, 45(4): 126 -130 .
[3] PANG Bo, JIN Qian-kun, HENIGULI·Wu Mai Er and QI Xing-bin. Routing Scheme Based on Network Slicing and ILP Model in SDN[J]. Computer Science, 2018, 45(4): 143 -147 .
[4] DAI Wen-jing, YUAN Jia-bin. Survey on Hidden Subgroup Problem[J]. Computer Science, 2018, 45(6): 1 -8 .
[5] DU Xiu-li, GU Bin-bin, HU Xing, QIU Shao-ming and CHEN Bo. Support Similarity between Lines Based CoSaMP Algorithm for Image Reconstruction[J]. Computer Science, 2018, 45(4): 306 -311 .
[6] HU Qing-cheng, ZHANG Yong, XING Chun-xiao. K-clique Heuristic Algorithm for Influence Maximization in Social Network[J]. Computer Science, 2018, 45(6): 32 -35 .
[7] ZHANG Jing, ZHOU An-min, LIU Liang, JIA Peng and LIU Lu-ping. Review of Crash Exploitability Analysis Methods[J]. Computer Science, 2018, 45(5): 5 -14, 23 .
[8] SHEN Xia-jiong, ZHANG Jun-tao, HAN Dao-jun. Short-term Traffic Flow Prediction Model Based on Gradient Boosting Regression Tree[J]. Computer Science, 2018, 45(6): 222 -227,264 .
[9] LV Ju-jian, ZHAO Hui-min, CHEN Rong-jun, LI Jian-hong. Unsupervised Active Learning Based on Adaptive Sparse Neighbors Reconstruction[J]. Computer Science, 2018, 45(6): 251 -258 .
[10] LIU Yu-cheng, Richard·DING, ZHANG Ying-chao. Research on Pan-real-time Problem of Medical Detection Based on BPNNs Recognition Algorithm[J]. Computer Science, 2018, 45(6): 301 -307 .