计算机科学 ›› 2022, Vol. 49 ›› Issue (12): 332-339.doi: 10.11896/jsjkx.210900042

• 计算机网络 • 上一篇    下一篇

一体化网络多终端接入智能路由技术

许逸铭, 马礼, 傅颖勋, 李阳, 马东超   

  1. 北方工业大学信息学院 北京100144
  • 收稿日期:2021-09-06 修回日期:2022-05-12 发布日期:2022-12-14
  • 通讯作者: 马礼(mali@ncut.edu.cn)
  • 作者简介:(17801088312@163.com)
  • 基金资助:
    国家重点研发计划(2018YFB1800302);国家自然科学基金(62001007);北京市自然科学基金(KZ201810009011,4202020,19L2021)

Intelligent Routing Technology for Multi-terminal Access in Integrated Network

XU Yi-ming, MA Li, FU Ying-xun, LI Yang, MA Dong-chao   

  1. School of Information Science and Technology,North China University of Technology,Beijing 100144,China
  • Received:2021-09-06 Revised:2022-05-12 Published:2022-12-14
  • About author:XU Yi-ming,born in 1996,postgra-duate.His main research interests include software defined network and reinforcement learning.MA Li,born in 1968,Ph.D,professor,is a distinguished member of China Computer Federation.His main research interests include advanced computing technology,distributed computing,multi-agent system and IoT system.
  • Supported by:
    National Key Research and Development Program of China(2018YFB1800302),National Natural Science Foundation of China(62001007) and Natural Science Foundation of Beijing,China(KZ201810009011,4202020,19L2021).

摘要: 针对一体化异构网络中大量终端设备接入网络造成网络流量剧烈波动引发的网络负载均衡问题,提出了一种基于强化学习的智能路由算法TDANRA。TDANRA算法通过软件定义网络技术获取细粒度、高精度的网络流量状态参数,根据网络流量状态与链路带宽利用率阈值调整机制自动生成实时的路由策略,指导网络中流量的转发,从而解决网络流量剧烈波动的问题。仿真实验结果表明,TDANRA算法可以在大量终端设备接入网络的情况下实现网络流量的负载均衡,降低端到端传输时延与数据丢包率。

关键词: 软件定义网络, 强化学习, 负载均衡, 智能路由, 链路带宽利用率阈值调整机制

Abstract: Aiming at the problem of network load balancing caused by the drastic fluctuation of network traffic caused by the access of a large number of terminal devices in the integrated heterogeneous network,an intelligent routing algorithm TDANRA based on reinforcement learning is proposed.Fine-grained and high-precision network traffic status parameters are obtained by software-defined network technology,TDANRA algorithm automatically generates real-time routing policies based on network traffic status and link bandwidth utilization threshold adjustment mechanism to guide the forwarding of network traffic,so as to solve the problem of drastic fluctuation of network traffic.Simulation results show that TDANRA algorithm can realize load ba-lancing of network traffic and reduce end-to-end transmission delay and data packet loss rate when a large number of terminal devices are connected to the network.

Key words: Software defined network, Reinforcement learning, Load balancing, Intelligent routing, Link bandwidth usage thres-hold adjustment mechanism

中图分类号: 

  • TP393
[1]FOOLADIVANDA D,ROSENBERG C.Joint Resource Allocation and User Association for Heterogeneous Wireless Cellular Networks[J].IEEE Transactions on Wireless Communications,2013,12(1):248-257.
[2]WANG G,ZHONG Y,HAN T,et al.On the Effect of Spatio-Temporal Fluctuation of Traffic in Wireless Networks[C]//IEEE Global Communications Conference,GLOBECOM 2018.Abu Dhabi,United Arab Emirates:IEEE,2018:1-7.
[3]TEFERA G,SHE K,CHEN M,et al.Congestion-aware adaptive decentralised computation offloading and caching for multi-access edge computing networks[J].IET Communications,2020,14(19):3410-3419.
[4]YU S,CHEN X,ZHOU Z,et al.When Deep Re-inforcementLearning Meets Federated Learning:Intelligent Multitimescale Resource Management for Multiaccess Edge Computing in 5G Ultradense Network[J].IEEE Internet of Things Journal,2021,8(4):2238-2251.
[5]ZORZI M,ZANELLA A,TESTOLIN A,et al.COBANETS:A new paradigm for cognitive communications systems[C]//2016 International Conference on Computing,Networking and Communications,ICNC 2016.Kauai,HI,USA:IEEE Computer So-ciety,2016:1-7.
[6]MCKEOWN N,ANDERSON T E,BALAKRISHNAN H,et al.OpenFlow:enabling innovation in campus networks[J].Computer Communication Review,2008,38(2):69-74.
[7]WANG X,DENG Q,REN J,et al.The Joint Optimization of Online Traffic Matrix Measurement and Traffic Engineering For Software-Defined Networks[J].IEEE/ACM Transactions on Networking,2020,28(1):234-247.
[8]MOSHREF M,YU M,GOVINDAN R,et al.DREAM:dynamic resource allocation for software-defined measurement[C]//Proceedings of the 2014 ACM Conference onSIGCOMM(SIG-COMM ’14).Association for Computing Machinery,2014:419-430.
[9]MALBOUBI M,WANG L,CHUAH C N,et al.Intelligent SDN based traffic(de)Aggregation and Measurement Paradigm(iSTAMP)[C]//IEEE Conference on Computer Communications.2014:934-942.
[10]AKYILDIZ I F,LEE A,WANG P,et al.Research challenges for traffic engineering in software defined networks[J].IEEE Network,2016,30(3):52-58.
[11]NAEEM F,TARIQ M,POOR H V.SDN-Enabled Energy-Efficient Routing Optimization Framework for Industrial Internet of Things[J].IEEE Transactions on Industrial Informatics,2021,17(8):5660-5667.
[12]DENG G,WANG K.An Application-aware QoS Routing Algorithm for SDN-based IoT Networking[C]//2018 IEEE Symposium on Computers and Communications,ISCC 2018.Natal,Brazil:IEEE,2018:186-191.
[13]TANG F,YANG L T,TANG C,et al.A Dynamical and Load-Balanced Flow Scheduling Approach for Big Data Centers in Clouds[J].IEEE Transactions on Cloud Computing,2018,6(4):915-928.
[14]GUO Y,WANG W,ZHANG H,et al.Traffic Engineering inhybrid Software Defined Network via Reinforcement Learning [J].Journal of Network and Computer Applications,2021,189:103116.
[15]LIU W X,CAI J,CHEN Q C,et al.DRL-R:Deep reinforcement learning approach for intelligent routing in software-defined data-center networks[J].Journal of Network and Computer Applications,2021,177:102865.
[16]YAO H,MAI T,XU X,et al.NetworkAI:An Intelligent Network Architecture for Self-Learning Control Strategies in Software Defined Networks[J].IEEE Internet of Things Journal,2018,5(6):4319-4327.
[17]SUN P,GUO Z,LAN J,et al.ScaleDRL:A Scalable Deep Reinforcement Learning Approach for Traffic Engineering in SDN with Pinning Control[J].Computer Networks,2021,190:107891.
[18]CHEN Y R,REZAPOUR A,TZENG W G,et al.RL-Routing:An SDN Routing Algorithm Based on Deep Reinforcement Learning[J].IEEE Transactions on Network Science and Engineering,2020,7(4):3185-3199.
[19]FUJIMOTO S,HOOF H V,MEGER D.Addressing Function Approximation Error in Actor-Critic Methods[C]//Proceedings of the 35th International Conference on Machine Learning,ICML 2018.2018:1582-1591.
[20]SUTTON R S.Learning to Predict by the Methods of Temporal Differences[J].Machine Learning,1988,3:9-44.
[21]SILVER D,LEVER G,HEESS N,et al.Deterministic Policy Gradient Algorithms[C]//Proceedings of the 31th International Conference on Machine Learning,ICML 2014.2014,32:387-395.
[22]ZHANG J,YE M,GUO Z,et al.CFR-RL:Traffic Engineering With Reinforcement Learning in SDN[J].IEEE Journal on Selected Areas in Communications,2020,38(10):2249-2259.
[1] 熊丽琴, 曹雷, 赖俊, 陈希亮.
基于值分解的多智能体深度强化学习综述
Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization
计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112
[2] 刘兴光, 周力, 刘琰, 张晓瀛, 谭翔, 魏急波.
基于边缘智能的频谱地图构建与分发方法
Construction and Distribution Method of REM Based on Edge Intelligence
计算机科学, 2022, 49(9): 236-241. https://doi.org/10.11896/jsjkx.220400148
[3] 袁唯淋, 罗俊仁, 陆丽娜, 陈佳星, 张万鹏, 陈璟.
智能博弈对抗方法:博弈论与强化学习综合视角对比分析
Methods in Adversarial Intelligent Game:A Holistic Comparative Analysis from Perspective of Game Theory and Reinforcement Learning
计算机科学, 2022, 49(8): 191-204. https://doi.org/10.11896/jsjkx.220200174
[4] 史殿习, 赵琛然, 张耀文, 杨绍武, 张拥军.
基于多智能体强化学习的端到端合作的自适应奖励方法
Adaptive Reward Method for End-to-End Cooperation Based on Multi-agent Reinforcement Learning
计算机科学, 2022, 49(8): 247-256. https://doi.org/10.11896/jsjkx.210700100
[5] 于滨, 李学华, 潘春雨, 李娜.
基于深度强化学习的边云协同资源分配算法
Edge-Cloud Collaborative Resource Allocation Algorithm Based on Deep Reinforcement Learning
计算机科学, 2022, 49(7): 248-253. https://doi.org/10.11896/jsjkx.210400219
[6] 李梦菲, 毛莺池, 屠子健, 王瑄, 徐淑芳.
基于深度确定性策略梯度的服务器可靠性任务卸载策略
Server-reliability Task Offloading Strategy Based on Deep Deterministic Policy Gradient
计算机科学, 2022, 49(7): 271-279. https://doi.org/10.11896/jsjkx.210600040
[7] 田真真, 蒋维, 郑炳旭, 孟利民.
基于服务器集群的负载均衡优化调度算法
Load Balancing Optimization Scheduling Algorithm Based on Server Cluster
计算机科学, 2022, 49(6A): 639-644. https://doi.org/10.11896/jsjkx.210800071
[8] 谢万城, 李斌, 代玥玥.
空中智能反射面辅助边缘计算中基于PPO的任务卸载方案
PPO Based Task Offloading Scheme in Aerial Reconfigurable Intelligent Surface-assisted Edge Computing
计算机科学, 2022, 49(6): 3-11. https://doi.org/10.11896/jsjkx.220100249
[9] 洪志理, 赖俊, 曹雷, 陈希亮, 徐志雄.
基于遗憾探索的竞争网络强化学习智能推荐方法研究
Study on Intelligent Recommendation Method of Dueling Network Reinforcement Learning Based on Regret Exploration
计算机科学, 2022, 49(6): 149-157. https://doi.org/10.11896/jsjkx.210600226
[10] 郭雨欣, 陈秀宏.
融合BERT词嵌入表示和主题信息增强的自动摘要模型
Automatic Summarization Model Combining BERT Word Embedding Representation and Topic Information Enhancement
计算机科学, 2022, 49(6): 313-318. https://doi.org/10.11896/jsjkx.210400101
[11] 范静宇, 刘全.
基于随机加权三重Q学习的异策略最大熵强化学习算法
Off-policy Maximum Entropy Deep Reinforcement Learning Algorithm Based on RandomlyWeighted Triple Q -Learning
计算机科学, 2022, 49(6): 335-341. https://doi.org/10.11896/jsjkx.210300081
[12] 张佳能, 李辉, 吴昊霖, 王壮.
一种平衡探索和利用的优先经验回放方法
Exploration and Exploitation Balanced Experience Replay
计算机科学, 2022, 49(5): 179-185. https://doi.org/10.11896/jsjkx.210300084
[13] 高捷, 刘沙, 黄则强, 郑天宇, 刘鑫, 漆锋滨.
基于国产众核处理器的深度神经网络算子加速库优化
Deep Neural Network Operator Acceleration Library Optimization Based on Domestic Many-core Processor
计算机科学, 2022, 49(5): 355-362. https://doi.org/10.11896/jsjkx.210500226
[14] 李鹏, 易修文, 齐德康, 段哲文, 李天瑞.
一种基于深度学习的供热策略优化方法
Heating Strategy Optimization Method Based on Deep Learning
计算机科学, 2022, 49(4): 263-268. https://doi.org/10.11896/jsjkx.210300155
[15] 欧阳卓, 周思源, 吕勇, 谭国平, 张悦, 项亮亮.
基于深度强化学习的无信号灯交叉路口车辆控制
DRL-based Vehicle Control Strategy for Signal-free Intersections
计算机科学, 2022, 49(3): 46-51. https://doi.org/10.11896/jsjkx.210700010
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!