Computer Science, 2020, Vol. 47, Issue (6A): 318-324. doi: 10.11896/JsJkx.191100012

• Computer Network •

Load Balancing Strategy of Distributed Messaging System for Cloud Services

GAO Zi-yan and WANG Yong   

  1. College of Computer Science and Technology, Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
  • Published: 2020-07-07
  • About author: GAO Zi-yan, bachelor's degree. Her main research interests include distributed computing and big data.
    WANG Yong, born in 1974, Ph.D., associate professor. His main research interests include parallel and distributed computing.

Abstract: To address load skew between nodes in distributed messaging systems for cloud services, a dynamic load balancing strategy based on replica roles is proposed and applied to Apache Kafka, the distributed streaming platform. Because a messaging system reads, writes, and stores messages, the algorithm uses CPU, disk, and bytes in/out as the main load factors of a node, and applies a corresponding Leadership Movement strategy or Replica Movement strategy depending on the load type. The feasibility of the algorithm is demonstrated in terms of time cost, space cost, and service availability, and the influence of the algorithm's parameters on its execution is discussed. Experimental results show that the algorithm keeps the resource usage of every node in the cluster at or below the specified threshold. Compared with the default system, the standard deviation of the cluster's CPU occupancy rate decreases by 72.1%, that of the disk occupancy rate by 86.1%, that of the bytes-in rate by 79.2%, and that of the bytes-out rate by 63.9%. The optimization effect is significant.
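The two quantities the abstract reports, nodes exceeding a usage threshold and the reduction in the standard deviation of per-node load, can be illustrated with a minimal sketch. This is not the authors' algorithm: the function names, broker names, utilization values, and threshold are all hypothetical and chosen only for illustration, and the population standard deviation is an assumption, since the abstract does not say which form is used.

```python
from statistics import pstdev


def overloaded_nodes(usage, threshold):
    """Return the ids of nodes whose usage exceeds the threshold, sorted."""
    return sorted(node for node, value in usage.items() if value > threshold)


def stdev_reduction(before, after):
    """Percentage reduction in the population standard deviation of load."""
    sd_before = pstdev(before.values())
    sd_after = pstdev(after.values())
    if sd_before == 0:
        return 0.0  # load was already perfectly even; nothing to reduce
    return (sd_before - sd_after) / sd_before * 100.0


# Hypothetical CPU occupancy per broker (fractions of 1.0), before and after
# some rebalancing step. These numbers are invented, not from the paper.
cpu_before = {"broker-1": 0.90, "broker-2": 0.30, "broker-3": 0.30}
cpu_after = {"broker-1": 0.55, "broker-2": 0.50, "broker-3": 0.45}
CPU_THRESHOLD = 0.80  # hypothetical threshold

if __name__ == "__main__":
    print("over threshold before:", overloaded_nodes(cpu_before, CPU_THRESHOLD))
    print("over threshold after: ", overloaded_nodes(cpu_after, CPU_THRESHOLD))
    print("std dev reduction (%):", round(stdev_reduction(cpu_before, cpu_after), 1))
```

The same two helpers could be applied to disk occupancy and to bytes-in and bytes-out rates, each with its own threshold.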

Key words: Apache Kafka, Cloud service, Distributed messaging system, Load balancing, Multi-replica mechanism

CLC Number: 

  • TP393.4