Computer Science ›› 2025, Vol. 52 ›› Issue (6A): 240800050-8.doi: 10.11896/jsjkx.240800050

• Network & Communication • Previous Articles     Next Articles

Optimization Strategy of Task Offloading Based on Meta Reinforcement Learning

ZHAO Chanchan, YANG Xingchen, SHI Bao, LYU Fei, LIU Libin   

  1. School of Information Engineering,Inner Mongolia University of Technology,Hohhot 010080,China
  • Online:2025-06-16 Published:2025-06-12
  • About author:ZHAO Chanchan,born in 1982,Ph.D,associate professor.Her main research interests include mobile edge computing and blockchain.
    SHI Bao,born in 1982,Ph.D,associate professor.His main research interest isimage processing.
  • Supported by:
    Natural Science Foundation of Inner Mongolia Autonomous Region(2023LHMS06016) and Basic Scientific Research Business Fee Project of Universities Directly Under the Inner Mongolia Autonomous Region(JY20240010,JY20230082).

Abstract: With the rapid development of edge computing,task offloading has become a crucial strategy for enhancing system performance and resource utilization.Existing deep learning-based offloading methods face challenges in real-world applications,such as low sample efficiency and poor adaptability to new environments.To address these issues,a task offloading method based on meta-reinforcement learning(MRL-PPO) is proposed,aiming to effectively solve the efficient offloading of heterogeneous tasks in edge computing while minimizing task delay and energy consumption.A sequence-to-sequence(Seq2Seq) network with an attention mechanism is designed,modeling offloading tasks as a directed acyclic graph(DAG).The encoder encodes the offloading tasks,and the decoder outputs different offloading decisions based on the context vector,addressing the complexity of network training caused by varying task sequence dimensions.The attention mechanism allows the model to dynamically focus on key features of the offloading tasks,improving decision accuracy and efficiency.To optimize the performance of the PPO algorithm in complex environments,an intrinsic reward learning algorithm is introduced.Experimental results demonstrate that the proposed algorithm outperforms existing methods in different tasks,and can quickly adapt to new environments,effectively reducing delay and energy consumption during task processing.

Key words: Edge computing, Meta reinforcement learning, Task offloading, Seq2Seq network, Attention mechanism

CLC Number: 

  • TP391
[1]YANG W Y,JIA X,SHAO C.Meta-Reinforcement Learning-based Vehicle Task Offloading in Internet of Vehicles(IoV)[C]//2023 3rd International Symposium on ComputerTechno-logy and Information Science(ISCTIS).Chengdu,China,2023:330-333.
[2]SHI W S,SUN H,CAO J,et al.Edge Computing-An Emerging Computing Model for the Internet of Everything Era [J].Journal of Computer Research and Development,2017,54(5):907-924.
[3]SHARMA N,GHOSH A,MISRA R,et al.Deep Meta Q-Learning Based Multi-Task Offloading in Edge-Cloud Systems[J].IEEE Transactions on Mobile Computing,2024,23(4):2583-2598.
[4]HO T M,NGUYEN K K.Joint Server Selection,CooperativeOffloading and Handover in Multi-Access Edge Computing Wireless Network:A Deep Reinforcement Learning Approach[J].IEEE Transactions on Mobile Computing,2022,21(7):2421-2435.
[5]TRINH B,MUNTEAN G M.A Deep Reinforcement Learning-Based Offloading Scheme for Multi-Access Edge Computing-Supported eXtended Reality Systems[J].IEEE Transactions on Vehicular Technology,2023,72(1):1254-1264.
[6]FEMENIAS G,RIERA-PALOU F.Mobile Edge Computing Aided Cell-Free Massive MIMO Networks[J].IEEE Transactions on MOBILE Computing,2024,23(2):1246-1261.
[7]JIANG H,SHI D X,XUE,CHAO,et al.Multi-agent deep reinforcement learning with type-based hierarchical group communication[J].Applied Intelligence,2021,51(8):5793-5808.
[8]SONG T.Opportunistic Task Offloading in UAV-assisted Mobile Edge Computing:A Deep Reinforcement Learning Approach[C]//2023 14th International Conference on Information and Communication Technology Convergence(ICTC).Jeju Island,Korea,Republic of,2023:881-884.
[9]WANG J,WANG H.A Secure Data Offloading Strategy forUAV Wireless Networks Based on Improved Ant Colony Algorithms[C]//2022 3rd International Conference on Electronics,Communications and Information Technology(CECIT).Sanya,China,2022:57-61.
[10]LV W,YANG P,ZHENG T,et al.Energy Consumption and QoS-Aware Co-Offloading for Vehicular Edge Computing[J].IEEE Internet of Things Journal,2023,10(6):5214-5225.
[11]WU H X,GENG J W,BAI X J,et al.Deep reinforcement lear-ning-based online task offloading in mobile edge computing networks[J].Information Sciences,2024,654.
[12]GONG B C,JIANG X W.Dependent Task-Offloading Strategy Based on Deep Reinforcement Learning in Mobile Edge Computing[J].Wireless Communications & Mobile Computing,2023,2023.
[13]CHEN J W,YANG Y J,WANG C Y,et al.Multitask Offloading Strategy Optimization Based on Directed Acyclic Graphs for Edge Computing[J].IEEE Internet of Things Journal,2021,9(12):9367-9378.
[14]WANG J,HU J,MIN G Y,et al.Dependent Task Offloading for Edge Computing based on Deep Reinforcement Learning[J].IEEE Transactions on Computers,2022,71(10):2449-2461.
[15]GUO M,HU X,CHEN Y R,et al.Joint Scheduling and Offloading Schemes for Multiple Interdependent Computation Tasks in Mobile Edge Computing[J].IEEE Internet of Things Journal,2024,11(4):5718-5730.
[16]HOMA M,MEHMET B,LUTFIYE D A.Handover-EnabledDynamic Computation Offloading for Vehicular Edge Computing Networks[J].IEEE Transactions on Vehicular Technology,2023,72(7):9394-9405.
[17]LI Y N,LI J B,LV Z Q,et al.GASTO:A Fast Adaptive Graph Learning Framework for Edge Computing Empowered Task Offloading[J].IEEE Transactions on Network and Service Management,2023,20(2):932-944.
[18]ARABNEJAD H,BARBOSA J.List Scheduling Algorithm for Heterogeneous Systems by an Optimistic Cost Table[J].IEEE Transactions on Parallel and Distributed Systems,2013,25(3):682-694.
[1] GUAN Xin, YANG Xueyong, YANG Xiaolin, MENG Xiangfu. Tumor Mutation Prediction Model of Lung Adenocarcinoma Based on Pathological [J]. Computer Science, 2025, 52(6A): 240700010-8.
[2] TAN Jiahui, WEN Chenyan, HUANG Wei, HU Kai. CT Image Segmentation of Intracranial Hemorrhage Based on ESC-TransUNet Network [J]. Computer Science, 2025, 52(6A): 240700030-9.
[3] CHEN Xianglong, LI Haijun. LST-ARBunet:An Improved Deep Learning Algorithm for Nodule Segmentation in Lung CT Images [J]. Computer Science, 2025, 52(6A): 240600020-10.
[4] WANG Rong , ZOU Shuping, HAO Pengfei, GUO Jiawei, SHU Peng. Sand Dust Image Enhancement Method Based on Multi-cascaded Attention Interaction [J]. Computer Science, 2025, 52(6A): 240800048-7.
[5] WANG Baohui, GAO Zhan, XU Lin, TAN Yingjie. Research and Implementation of Mine Gas Concentration Prediction Algorithm Based on Deep Learning [J]. Computer Science, 2025, 52(6A): 240400188-7.
[6] ZHENG Chuangrui, DENG Xiuqin, CHEN Lei. Traffic Prediction Model Based on Decoupled Adaptive Dynamic Graph Convolution [J]. Computer Science, 2025, 52(6A): 240400149-8.
[7] HONG Yi, SHEN Shikai, SHE Yumei, YANG Bin, DAI Fei, WANG Jianxiao, ZHANG Liyi. Multivariate Time Series Prediction Based on Dynamic Graph Learning and Attention Mechanism [J]. Computer Science, 2025, 52(6A): 240700047-8.
[8] TENG Minjun, SUN Tengzhong, LI Yanchen, CHEN Yuan, SONG Mofei. Internet Application User Profiling Analysis Based on Selection State Space Graph Neural Network [J]. Computer Science, 2025, 52(6A): 240900060-8.
[9] LI Daicheng, LI Han, LIU Zheyu, GONG Shiheng. MacBERT Based Chinese Named Entity Recognition Fusion with Dependent Syntactic Information and Multi-view Lexical Information [J]. Computer Science, 2025, 52(6A): 240600121-8.
[10] HUANG Bocheng, WANG Xiaolong, AN Guocheng, ZHANG Tao. Transmission Line Fault Identification Method Based on Transfer Learning and Improved YOLOv8s [J]. Computer Science, 2025, 52(6A): 240800044-8.
[11] WU Zhihua, CHENG Jianghua, LIU Tong, CAI Yahui, CHENG Bang, PAN Lehao. Human Target Detection Algorithm for Low-quality Laser Through-window Imaging [J]. Computer Science, 2025, 52(6A): 240600069-6.
[12] ZENG Fanyun, LIAN Hechun, FENG Shanshan, WANG Qingmei. Material SEM Image Retrieval Method Based on Multi-scale Features and Enhanced HybridAttention Mechanism [J]. Computer Science, 2025, 52(6A): 240800014-7.
[13] HOU Zhexiao, LI Bicheng, CAI Bingyan, XU Yifei. High Quality Image Generation Method Based on Improved Diffusion Model [J]. Computer Science, 2025, 52(6A): 240500094-9.
[14] DING Xuxing, ZHOU Xueding, QIAN Qiang, REN Yueyue, FENG Youhong. High-precision and Real-time Detection Algorithm for Photovoltaic Glass Edge Defects Based onFeature Reuse and Cheap Operation [J]. Computer Science, 2025, 52(6A): 240400146-10.
[15] ZHAO Chanchan, WEI Xiaomin, SHI Bao, LYU Fei, LIU Libin, ZHANG Ziyang. Edge Computing Based Approach for Node Trust Evaluation in Blockchain Networks [J]. Computer Science, 2025, 52(6A): 240600153-8.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!