Computer Science ›› 2025, Vol. 52 ›› Issue (5): 101-108.doi: 10.11896/jsjkx.241100169

• High Performance Computing • Previous Articles     Next Articles

Investigation on Load Balancing Strategies for Lattice Boltzmann Method with Local Grid Refinement

HUANG Chenxi, LI Jiahui, YAN Hui, ZHONG Ying, LU Yutong   

  1. School of Computer Science and Engineering,Sun Yat-Sen University,Guangzhou 510006,China
    National Supercomputing Center in Guangzhou,Guangzhou 510006,China
  • Received:2024-11-27 Revised:2025-02-28 Online:2025-05-15 Published:2025-05-12
  • About author:HUANG Chenxi,born in 1994,postgraduate.His main research interests include computational fluid dynamics and high-performance computing.
    LI Jiahui,born in 1984,Ph.D.His main research interests include computational fluid dynamics and high-performance computing.

Abstract: The Lattice Boltzmann method(LBM) with local mesh refinement is widely used in large-scale unsteady computational fluid dynamics problems.Although the local mesh refinement method can effectively reduce the computational workload,it also brings serious load balancing issues.Especially in large-scale parallel computing,the choice of load balancing algorithms has a significant impact on the overall computational efficiency.This paper used a self-developed LBM program to compare the implementations of Palabos and to explore the static load balancing algorithm based on the Lattice Boltzmann method with local mesh refinement.This paper implementes various load balancing strategies based on global load and stratified load,uses greedy algorithms and space-filling curve algorithms,and proposes optimization of the load balancing of computing nodes.It uses the flow around a sphere and the DrivAer case as test cases for testing and comparison.Firstly,the results show the performance based on the stratified load strategy is better than that based on the global load strategy.At the same time,a comparative analysis reveals that the greedy algorithm demostrates superior scalability,whereas the space-filling curve algorithmexhibits higher efficiency when operating with a limited number of process.Finally,based on layered load,a hybrid algorithm is implemented by combining greedy algorithm and space filling curve algorithm,which achieves the best performance when the number of processes is larger.

Key words: Lattice Boltzmann method, Load balancing, Local grid refinement, Parallel computing, Node-wise load balancing

CLC Number: 

  • TP319
[1]HE Y L,WANG Y,LI Q.Lattice Boltzmann method:Theory and applications [M].Beijing:Science Press,2009.
[2]CHEN S,DOOLEN G D.Lattice Boltzmann Method for Fluid Flows[J].Annual Review of Fluid Mechanics,1998,30(1):329-364.
[3]AN B,SANG W M.The numerical stu-dy of lattice Boltzmann method based on different grid structure[J].Chinese Journal of Theoretical and Applied Mechanics,2013,45(5):699-706.
[4]AN B,MENG X Y,YANG S J,et al.Research on the latticeBoltzmann algorithm for grid refinement based on non-uniform rectangular grid[J].Chinese Journal of Theoretical and Applied Mechanics,2023,55(10):2288-2296
[5]MARVRIPLIS D J.Multigrid solution of the steady state lattice Boltzmann equation[J].Computers & Fluids,2006,35:793-804.
[6]PATIL D V,PREMNATH K N,BANERJEE S.Multigrid lattice Boltzmann method for accelerated solution of elliptic equations[J].Journal of Computational Physics,2014,265:172-194.
[7]FILIPPOVA O,HÄNEL D.Grid Refinement for Lattice-BGK Models[J].Journal of Computational Physics,1998,147(1):219-228.
[8]DUPUIS A,CHOPARD B.Theory and applications of an alternative lattice Boltzmann grid refinement algorithm[J].Physical Review E,2003,67(6):066707.
[9]LAGRAVA D,MALASPINAS O,LATT J,et al.Advances inmulti-domain lattice Boltzmann grid refinement[J].Journal of Computational Physics,2012,231(14):4808-4822.
[10]LATT J,MALASPINAS O,KONTAXAKIS D,et al.Palabos:Parallel Lattice Boltzmann Solver[J].Computers & Mathema-tics with Applications,2021,81:334-350.
[11]HASERT M,MASILAMANI K,ZIMNY S,et al.Complex fluid simulations with the parallel tree-based Lattice Boltzmann solver Musubi[J].Journal of Computational Science,2014,5(5):784-794.
[12]FEICHTINGER C,DONATH S,KÖSTLER H,et al.WaLBerla:HPC software design for computational engineering simulations[J].Journal of Computational Science,2011,2(2):105-112.
[13]ROHDE M,KANDHAI D,DERKSEN J J,et al.A generic,mass conservative local grid refinement technique for lattice-Boltzmann schemes[J].Numerical Methods in Fluids,2006,51(4):439-468.
[14]KARYPIS G,KUMAR V.A fast and high quality multilevelscheme for partitioning irregular graphs[J].SIAM Journal on Scientific Computing,1998,20(1):359-392.
[15]CAMPBELL P M,DEVINE K D,FLAHERTY J E,et al.TERESCO,Dynamic octree load balancing using space-filling curves[R].Williams College Department of Computer Science Technical Report.2003.
[16]MORTON G M.A computer oriented geodetic data base and a new technique in the file sequencing[J]International Business Machines Ltd.,1966,20.
[17]BORRELL R,OYARZUN G,DOSIMONT D,et al.ParallelSFC-based mesh partitioning and load balancing[C]// 2019 IEEE/ACM 10th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems(ScalA).2019:72-78.
[18]WHITE A T,CHONG C K.Rotational invariance in the three-dimensional lattice Boltzmann method is dependent on the choice of lattice[J].Journal of Computational Physics,2011,230(16):6367-6378.
[19]GEIER M,SCHÖNHERR M,PASQUALI A,et al.The cumulant lattice Boltzmann equation in three dimensions:Theory and validation[J].Computers & Mathematics with Applications,2015,70(4):507-547.
[20]SIMON H D,TENG S H.How Good is Recursive Bisection?[J].SIAM Journal of Scientific Computing,1997,18(5):1436-1445.
[21]DRIESSCHE R V,ROOSE D.An improved spectral bisectionalgorithm and its application to dynamic load balancing[J].Parallel Computing,1995,21(1):29-48.
[22]CLIFT R,GRACE J R,WEBER M E.Bubbles,drops,and particles[M].Academic Press,1978.
[23]HEFT A I,INDINGER T,ADAMS N A.Introduction of a new realistic generic car model for aerodynamic investigations[R].SAE Technical Paper,2012.
[1] LI Fan, WU Yahui, DENG Su, MA Wubin, ZHOU Haohao. Load Balancing Task Allocation Strategy for User-oriented Mobile Crowdsensing [J]. Computer Science, 2026, 53(2): 379-386.
[2] JI Liguang, ZHOU Bei, YANG Hongru, ZHOU Yuchang, CUI Mengqi, XU Jinchen. Parallel Detection Method of Maximum Floating-point Error Based on Gridding Particle SwarmOptimization Algorithm [J]. Computer Science, 2026, 53(2): 124-132.
[3] XU Jinlong, WANG Gengwu, HAN Lin, NIE Kai, LI Haoran, CHEN Mengyao, LIU Haohao. Research on Parallel Scheduling Strategy Optimization Technology Based on Sunway Compiler [J]. Computer Science, 2025, 52(9): 137-143.
[4] ZHOU Kai, WANG Kai, ZHU Yuhang, PU Liming, LIU Shuxin, ZHOU Deqiang. Customized Container Scheduling Strategy Based on GMM [J]. Computer Science, 2025, 52(6): 346-354.
[5] LIAO Qiucheng, ZHOU Yang, LIN Xinhua. Metrics and Tools for Evaluating the Deviation in Parallel Timing [J]. Computer Science, 2025, 52(5): 41-49.
[6] ZHENG Longhai, XIAO Bohuai, YAO Zewei, CHEN Xing, MO Yuchang. Graph Reinforcement Learning Based Multi-edge Cooperative Load Balancing Method [J]. Computer Science, 2025, 52(3): 338-348.
[7] ZHANG Manjing, HE Yulin, LI Xu, HUANG Zhexue. Distributed Two-stage Clustering Method Based on Node Sampling [J]. Computer Science, 2025, 52(2): 134-144.
[8] WANG Yijie, GAO Guoju, SUN Yu'e, HUANG He. Flow Cardinality Estimation Method Based on Distributed Sketch in SDN [J]. Computer Science, 2025, 52(2): 268-278.
[9] AI Yuan, LI Jiahao, ZHAO Yitao, HU Kai. Optimization of Blockchain Dynamic Sharding and Cross-shard Transaction Protocol Based on Greedy Strategy [J]. Computer Science, 2025, 52(11A): 250100133-8.
[10] WEI Debin, ZHANG Yi, XU Pingduo, WANG Xinrui. Multipath Routing Algorithm for Satellite Networks Based on Convolutional Twin Delay Deep Deterministic Policy Gradient [J]. Computer Science, 2025, 52(11): 280-288.
[11] SUN Shiquan, YE Miao, ZHU Cheng, WANG Yong, JIANG Qiuxiang. Performance Optimization of Wireless Edge Storage System Based on SDN and Drone Assistance in Disaster Scenarios [J]. Computer Science, 2025, 52(11): 306-319.
[12] XU He, ZHOU Tao, LI Peng, QIN Fangfang, JI Yimu. LU Parallel Decomposition Optimization Algorithm Based on Kunpeng Processor [J]. Computer Science, 2024, 51(9): 51-58.
[13] LIAO Qihua, NIE Kai, HAN Lin, CHEN Mengyao, XIE Wenbing. Tile Selection Algorithm Based on Data Locality [J]. Computer Science, 2024, 51(12): 100-109.
[14] ZHONG Zhenyu, LIN Yongliang, WANG Haotian, LI Dongwen, SUN Yufei, ZHANG Yuzhi. Automatic Pipeline Parallel Training Framework for General-purpose Computing Devices [J]. Computer Science, 2024, 51(12): 129-136.
[15] YANG Zheming, ZUO Lulu, JI Wen. Joint Optimization Method for Node Deployment and Resource Allocation Based on End-EdgeCollaboration [J]. Computer Science, 2024, 51(11A): 240200010-7.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!