Computer Science ›› 2025, Vol. 52 ›› Issue (5): 101-108.doi: 10.11896/jsjkx.241100169

• High Performance Computing • Previous Articles     Next Articles

Investigation on Load Balancing Strategies for Lattice Boltzmann Method with Local Grid Refinement

HUANG Chenxi, LI Jiahui, YAN Hui, ZHONG Ying, LU Yutong   

  1. School of Computer Science and Engineering,Sun Yat-Sen University,Guangzhou 510006,China
    National Supercomputing Center in Guangzhou,Guangzhou 510006,China
  • Received:2024-11-27 Revised:2025-02-28 Online:2025-05-15 Published:2025-05-12
  • About author:HUANG Chenxi,born in 1994,postgraduate.His main research interests include computational fluid dynamics and high-performance computing.
    LI Jiahui,born in 1984,Ph.D.His main research interests include computational fluid dynamics and high-performance computing.

Abstract: The Lattice Boltzmann method(LBM) with local mesh refinement is widely used in large-scale unsteady computational fluid dynamics problems.Although the local mesh refinement method can effectively reduce the computational workload,it also brings serious load balancing issues.Especially in large-scale parallel computing,the choice of load balancing algorithms has a significant impact on the overall computational efficiency.This paper used a self-developed LBM program to compare the implementations of Palabos and to explore the static load balancing algorithm based on the Lattice Boltzmann method with local mesh refinement.This paper implementes various load balancing strategies based on global load and stratified load,uses greedy algorithms and space-filling curve algorithms,and proposes optimization of the load balancing of computing nodes.It uses the flow around a sphere and the DrivAer case as test cases for testing and comparison.Firstly,the results show the performance based on the stratified load strategy is better than that based on the global load strategy.At the same time,a comparative analysis reveals that the greedy algorithm demostrates superior scalability,whereas the space-filling curve algorithmexhibits higher efficiency when operating with a limited number of process.Finally,based on layered load,a hybrid algorithm is implemented by combining greedy algorithm and space filling curve algorithm,which achieves the best performance when the number of processes is larger.

Key words: Lattice Boltzmann method, Load balancing, Local grid refinement, Parallel computing, Node-wise load balancing

CLC Number: 

  • TP319
[1]HE Y L,WANG Y,LI Q.Lattice Boltzmann method:Theory and applications [M].Beijing:Science Press,2009.
[2]CHEN S,DOOLEN G D.Lattice Boltzmann Method for Fluid Flows[J].Annual Review of Fluid Mechanics,1998,30(1):329-364.
[3]AN B,SANG W M.The numerical stu-dy of lattice Boltzmann method based on different grid structure[J].Chinese Journal of Theoretical and Applied Mechanics,2013,45(5):699-706.
[4]AN B,MENG X Y,YANG S J,et al.Research on the latticeBoltzmann algorithm for grid refinement based on non-uniform rectangular grid[J].Chinese Journal of Theoretical and Applied Mechanics,2023,55(10):2288-2296
[5]MARVRIPLIS D J.Multigrid solution of the steady state lattice Boltzmann equation[J].Computers & Fluids,2006,35:793-804.
[6]PATIL D V,PREMNATH K N,BANERJEE S.Multigrid lattice Boltzmann method for accelerated solution of elliptic equations[J].Journal of Computational Physics,2014,265:172-194.
[7]FILIPPOVA O,HÄNEL D.Grid Refinement for Lattice-BGK Models[J].Journal of Computational Physics,1998,147(1):219-228.
[8]DUPUIS A,CHOPARD B.Theory and applications of an alternative lattice Boltzmann grid refinement algorithm[J].Physical Review E,2003,67(6):066707.
[9]LAGRAVA D,MALASPINAS O,LATT J,et al.Advances inmulti-domain lattice Boltzmann grid refinement[J].Journal of Computational Physics,2012,231(14):4808-4822.
[10]LATT J,MALASPINAS O,KONTAXAKIS D,et al.Palabos:Parallel Lattice Boltzmann Solver[J].Computers & Mathema-tics with Applications,2021,81:334-350.
[11]HASERT M,MASILAMANI K,ZIMNY S,et al.Complex fluid simulations with the parallel tree-based Lattice Boltzmann solver Musubi[J].Journal of Computational Science,2014,5(5):784-794.
[12]FEICHTINGER C,DONATH S,KÖSTLER H,et al.WaLBerla:HPC software design for computational engineering simulations[J].Journal of Computational Science,2011,2(2):105-112.
[13]ROHDE M,KANDHAI D,DERKSEN J J,et al.A generic,mass conservative local grid refinement technique for lattice-Boltzmann schemes[J].Numerical Methods in Fluids,2006,51(4):439-468.
[14]KARYPIS G,KUMAR V.A fast and high quality multilevelscheme for partitioning irregular graphs[J].SIAM Journal on Scientific Computing,1998,20(1):359-392.
[15]CAMPBELL P M,DEVINE K D,FLAHERTY J E,et al.TERESCO,Dynamic octree load balancing using space-filling curves[R].Williams College Department of Computer Science Technical Report.2003.
[16]MORTON G M.A computer oriented geodetic data base and a new technique in the file sequencing[J]International Business Machines Ltd.,1966,20.
[17]BORRELL R,OYARZUN G,DOSIMONT D,et al.ParallelSFC-based mesh partitioning and load balancing[C]// 2019 IEEE/ACM 10th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems(ScalA).2019:72-78.
[18]WHITE A T,CHONG C K.Rotational invariance in the three-dimensional lattice Boltzmann method is dependent on the choice of lattice[J].Journal of Computational Physics,2011,230(16):6367-6378.
[19]GEIER M,SCHÖNHERR M,PASQUALI A,et al.The cumulant lattice Boltzmann equation in three dimensions:Theory and validation[J].Computers & Mathematics with Applications,2015,70(4):507-547.
[20]SIMON H D,TENG S H.How Good is Recursive Bisection?[J].SIAM Journal of Scientific Computing,1997,18(5):1436-1445.
[21]DRIESSCHE R V,ROOSE D.An improved spectral bisectionalgorithm and its application to dynamic load balancing[J].Parallel Computing,1995,21(1):29-48.
[22]CLIFT R,GRACE J R,WEBER M E.Bubbles,drops,and particles[M].Academic Press,1978.
[23]HEFT A I,INDINGER T,ADAMS N A.Introduction of a new realistic generic car model for aerodynamic investigations[R].SAE Technical Paper,2012.
[1] LIAO Qiucheng, ZHOU Yang, LIN Xinhua. Metrics and Tools for Evaluating the Deviation in Parallel Timing [J]. Computer Science, 2025, 52(5): 41-49.
[2] ZHENG Longhai, XIAO Bohuai, YAO Zewei, CHEN Xing, MO Yuchang. Graph Reinforcement Learning Based Multi-edge Cooperative Load Balancing Method [J]. Computer Science, 2025, 52(3): 338-348.
[3] ZHANG Manjing, HE Yulin, LI Xu, HUANG Zhexue. Distributed Two-stage Clustering Method Based on Node Sampling [J]. Computer Science, 2025, 52(2): 134-144.
[4] WANG Yijie, GAO Guoju, SUN Yu'e, HUANG He. Flow Cardinality Estimation Method Based on Distributed Sketch in SDN [J]. Computer Science, 2025, 52(2): 268-278.
[5] XU He, ZHOU Tao, LI Peng, QIN Fangfang, JI Yimu. LU Parallel Decomposition Optimization Algorithm Based on Kunpeng Processor [J]. Computer Science, 2024, 51(9): 51-58.
[6] LIAO Qihua, NIE Kai, HAN Lin, CHEN Mengyao, XIE Wenbing. Tile Selection Algorithm Based on Data Locality [J]. Computer Science, 2024, 51(12): 100-109.
[7] ZHONG Zhenyu, LIN Yongliang, WANG Haotian, LI Dongwen, SUN Yufei, ZHANG Yuzhi. Automatic Pipeline Parallel Training Framework for General-purpose Computing Devices [J]. Computer Science, 2024, 51(12): 129-136.
[8] LI Siyao, LI Shanglin, LUO Jingzhi. Parallel Computing of Reentry Vehicle Trajectory by Multiple Shooting Method Based onOPENMP [J]. Computer Science, 2024, 51(11A): 231000019-6.
[9] HE Weilong, SU Lingli, GUO Bingxuan, LI Maosen, HAO Yan. Research and Implementation of Dynamic Scene 3D Perception Technology Based on BinocularEstimation [J]. Computer Science, 2024, 51(11A): 240300045-8.
[10] YANG Zheming, ZUO Lulu, JI Wen. Joint Optimization Method for Node Deployment and Resource Allocation Based on End-EdgeCollaboration [J]. Computer Science, 2024, 51(11A): 240200010-7.
[11] PENG Weidong, GUO Wei, WEI Lin. Reconfigurable Computing System for Parallel Implementation of SVM Training Based on FPGA [J]. Computer Science, 2024, 51(11A): 231100120-7.
[12] WANG Xiaozhong, ZHANG Zuyu. Multi Level Parallel Computing for SW26010 Discontinuous Galerkin Finite Element Algorithm [J]. Computer Science, 2024, 51(11A): 240700055-5.
[13] FU Xiong, FANG Lei, WANG Junchang. Edge Server Placement for Energy Consumption and Load Balancing [J]. Computer Science, 2023, 50(6A): 220300088-5.
[14] ZHAI Xulun, ZHANG Yongguang, JIN Anzhao, QIANG Wei, LI Mengbing. Parallel DVB-RCS2 Turbo Decoding on Multi-core CPU [J]. Computer Science, 2023, 50(6): 22-28.
[15] XIE Haoshan, LIU Xiaonan, ZHAO Chenyan, LIU Zhengyu. Simulation Implementation of HHL Algorithm Based on Songshan Supercomputer System [J]. Computer Science, 2023, 50(6): 74-80.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!