Computer Science ›› 2024, Vol. 51 ›› Issue (2): 293-299.doi: 10.11896/jsjkx.230100031

• Computer Network • Previous Articles     Next Articles

Study on Deep Reinforcement Learning for Energy-aware Virtual Machine Scheduling

WANG Yangmin, HU Chengyu, YAN Xuesong, ZENG Deze   

  1. School of Computer Science,China University of Geosciences(Wuhan),Wuhan 430078,China
  • Received:2023-01-06 Revised:2023-04-12 Online:2024-02-15 Published:2024-02-22
  • About author:WANG Yangmin,born in 1999,postgraduate.His main research intrests include reinforcement learning and evolution computation.HU Chengyu, born in 1978,Ph.D.professor,is a member of CCF(No.40126S).His main research interests include evolutionary algorithm,reinforcement learning and cloud computing.

Abstract: With the rapid development of computer technology,cloud computing technology has become one of the best ways to solve users’ storage and computing power demands.Among them,dynamic virtual machine scheduling based on NUMA architecture has become a hot topic in academia and industry.However,in current research,heuristic algorithms are difficult to schedule virtual machines in real time,and most of the literatures do not consider the energy consumption caused by virtual machine sche-duling under NUMA architecture.This paper proposes a service migration framework of large-scale mobile cloud center virtual machine based on deep reinforcement learning,and constructs the energy consumption model under NUMA architecture.Hierarchical adaptive sampling soft actor critic(HASAC) is proposed.In the cloud computing scenario,the proposed algorithm is compared with the classical deep reinforcement learning methods.Experiment results show that the improved algorithm proposed in this paper can handle more user requests in different scenarios,and consumes less energy.In addition,experiments on various strategies in the algorithm prove the effectiveness of the proposed strategy.

Key words: NUMA architecture, Deep learning, Reinforcement learning, Energy perception, Layered buffer

CLC Number: 

  • TP181
[1]DUAN W,HU M.Overview of Cloud Computing System Reliability Research[J].Computer Research and Development,2020,57(1):102-123.
[2]HE D,CHEN L,LIANG J,et al.NUMA-Aware ContentionScheduling on Multicore Systems [C]//2021 16th International Conference on Intelligent Systems and Knowledge Engineering(ISKE).IEEE,2021:452-457.
[3]CHENG Y,CHEN W,WANG Z.Performance-monitoring-based traffic-aware virtual machine deployment on NUMA systems [J].IEEE Systems Journal,2015,11(2):973-982.
[4]BISWAI N K,BANERJEE S,BISWAS U.An approach towards development of new linear regression prediction model for reduced energy consumption and SLA violation in the domain of green cloud computing[J].Sustainable Energy Technologies and Assessments,2021,45:101087.
[5]MA O,JIANG X,JIN G.Heuristic dynamic threshold algorithm for priority inversion suppression of LEDBAT protocol [J].Computer Research and Development,2020,57(6):1292-1301.
[6]REN H,WANG Y,XU C.Smig-rl:An evolutionary migration framework for cloud services based on deep reinforcement lear-ning[J].ACM Transactions on Internet Technology(TOIT),2020,20(4):1-18.
[7]KRUEKAEW B,KIMPAN W.Enhancing of artificial bee colony algorithm for virtual machine scheduling and load balancing problem in cloud computing[J].International Journal of Computational Intelligence Systems,2020,13(1):496-510.
[8]RAGMANI A,ELOMRI A,ABGHOUR N,et al.FACO:A hybrid fuzzy ant colony optimization algorithm for virtual machine scheduling in high-performance cloud computing[J].Journal of Ambient Intelligence and Humanized Computing,2020,11:3975-3987.
[9]TARAFDAR A,KHATUA S,DAS R K.QoS aware energy efficient VM consolidation techniques for a virtualized data center[C]//2018 IEEE/ACM 11th International Conference on Utility and Cloud Computing(UCC).IEEE,2018:114-123.
[10]WANG Y,LIU H,ZHENG W,et al.Multi-objective workflow scheduling with deep-Q-network-based multi-agent reinforcement learning[J].IEEE Access,2019,7:39974-39982.
[11]ZHANG S,WU T,PAN M,et al.A-SARSA:A predictive container auto-scaling algorithm based on reinforcement learning [C]//2020 IEEE International Conference on Web Services(ICWS).IEEE,2020:489-497.
[12]LIU L,XU J,YU H,et al.Joint Admission Control and Provisioning for Virtual Machines[C]//2015 IEEE International Conference on Communications(ICC).London,UK,2015:332-337.
[13]HU K,LIN W,HUANG T,et al.Virtual Machine Consolidation for NUMA Systems:A Hybrid Heuristic Grey Wolf Approach [C]//2020 IEEE 26th International Conference on Parallel and Distributed Systems(ICPADS).IEEE,2020:569-576.
[14]SHENG J,HU Y,ZHOU W.Learning to schedule multi-NUMA virtual machines via reinforcement learning [J].Pattern Recognition,2022,121:108254.
[15]OH Y,KIMIN L,SHIN J.Learning to Sample with Local and Global Contexts in Experience Replay Buffers [C]//International Conference on Learning Representations(ICLR).2021.
[1] CHEN Runhuan, DAI Hua, ZHENG Guineng, LI Hui , YANG Geng. Urban Electricity Load Forecasting Method Based on Discrepancy Compensation and Short-termSampling Contrastive Loss [J]. Computer Science, 2024, 51(4): 158-164.
[2] LIN Binwei, YU Zhiyong, HUANG Fangwan, GUO Xianwei. Data Completion and Prediction of Street Parking Spaces Based on Transformer [J]. Computer Science, 2024, 51(4): 165-173.
[3] SONG Hao, MAO Kuanmin, ZHU Zhou. Algorithm of Stereo Matching Based on GAANET [J]. Computer Science, 2024, 51(4): 229-235.
[4] XUE Jinqiang, WU Qin. Progressive Multi-stage Image Denoising Algorithm Combining Convolutional Neural Network and
Multi-layer Perceptron
[J]. Computer Science, 2024, 51(4): 243-253.
[5] SHI Dianxi, HU Haomeng, SONG Linna, YANG Huanhuan, OUYANG Qianying, TAN Jiefu , CHEN Ying. Multi-agent Reinforcement Learning Method Based on Observation Reconstruction [J]. Computer Science, 2024, 51(4): 280-290.
[6] ZHAO Miao, XIE Liang, LIN Wenjing, XU Haijiao. Deep Reinforcement Learning Portfolio Model Based on Dynamic Selectors [J]. Computer Science, 2024, 51(4): 344-352.
[7] HUANG Kun, SUN Weiwei. Traffic Speed Forecasting Algorithm Based on Missing Data [J]. Computer Science, 2024, 51(3): 72-80.
[8] WANG Yao, LUO Junren, ZHOU Yanzhong, GU Xueqiang, ZHANG Wanpeng. Review of Reinforcement Learning and Evolutionary Computation Methods for StrategyExploration [J]. Computer Science, 2024, 51(3): 183-197.
[9] ZHENG Cheng, SHI Jingwei, WEI Suhua, CHENG Jiaming. Dual Feature Adaptive Fusion Network Based on Dependency Type Pruning for Aspect-basedSentiment Analysis [J]. Computer Science, 2024, 51(3): 205-213.
[10] CHEN Jinyin, LI Xiao, JIN Haibo, CHEN Ruoxi, ZHENG Haibin, LI Hu. CheatKD:Knowledge Distillation Backdoor Attack Method Based on Poisoned Neuronal Assimilation [J]. Computer Science, 2024, 51(3): 351-359.
[11] WANG Yan, WANG Tianjing, SHEN Hang, BAI Guangwei. Optimal Penetration Path Generation Based on Maximum Entropy Reinforcement Learning [J]. Computer Science, 2024, 51(3): 360-367.
[12] HUANG Wenke, TENG Fei, WANG Zidan, FENG Li. Image Segmentation Based on Deep Learning:A Survey [J]. Computer Science, 2024, 51(2): 107-116.
[13] CAI Jiacheng, DONG Fangmin, SUN Shuifa, TANG Yongheng. Unsupervised Learning of Monocular Depth Estimation:A Survey [J]. Computer Science, 2024, 51(2): 117-134.
[14] ZHANG Feng, HUANG Shixin, HUA Qiang, DONG Chunru. Novel Image Classification Model Based on Depth-wise Convolution Neural Network andVisual Transformer [J]. Computer Science, 2024, 51(2): 196-204.
[15] LI Junwei, LIU Quan, XU Yapeng. Option-Critic Algorithm Based on Mutual Information Optimization [J]. Computer Science, 2024, 51(2): 252-258.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!