基于卷积双延迟深度确定性策略梯度的卫星网络多路径路由算法

doi:10.11896/jsjkx.240800161

Abstract

Abstract: In the satellite network,due to the influence of geographical location and people's living habits,the difference in the needs of users in the satellite coverage area will cause the load imbalance of the satellite network.A multi-path routing algorithm based onconvolutional double-delay deep deterministic policy gradient(CTD3-MR) is proposed for the above problem.Under the SDN structure,CTD3 is deployed in the controller as the agent,and the dynamically changing links' residual bandwidth,transmission delay,packet loss rate and spatiotemporal level are trained as the network state input agent,and the output action is used as the network link weight,and the weighted sum of the maximum link bandwidth utilization,average end-to-end delay and network packet loss rate is used as the reward function to adjust the action.After the agent training converges,the controller obtains the k-shortest path according to the network link weight output by the agent,and takes the path weight ratio as the path traffic allocation ratio to generate an optimal routing strategy and forward it to the satellite for multipath transmission.Finally,CTD3-MR is compared with TD3,TMR and ECMP routing algorithms.Experimental results show that compared with other routing algorithms,CTD3-MR reduces the average end-to-end delay by at least 7.64%,the packet loss rate by 28.65%,the maximum link bandwidth utilization by 11.44%,and the traffic distribution index by 5.82%,which improves the network load balancing performance.

Key words: Satellite networks, Software defined networking, Multipath routing, Deep reinforcement learning, Load balancing

CLC Number:

TP393

WEI Debin, ZHANG Yi, XU Pingduo, WANG Xinrui. Multipath Routing Algorithm for Satellite Networks Based on Convolutional Twin Delay Deep Deterministic Policy Gradient[J].Computer Science, 2025, 52(11): 280-288.

References

[1]HUANG Y,JIANG X,CHEN S,et al.Pheromone incentivized intelligent multipath traffic scheduling approach for LEO satellite networks[J].IEEE Transactions on Wireless Communications,2022,21(8):5889-5902.
[2]ZENG G,ZHAN Y,PAN X.Failure-tolerant and low-latencytelecommand in mega-constellations:The redundant multi-path routing[J].IEEE Access,2021,9:34975-34985.
[3]WANG C,REN Z,CHENG W,et al.CDMR:effective computing-dependent multi-path routing strategies in satellite and terrestrial integrated networks[J].IEEE Transactions on Network Science and Engineering,2022,9(5):3715-3730.
[4]YOU Q D,WANG X Y,JIN X H,et al.Multipath Routing Algorithm for Satellite Network for Fast Information Backhaul[J].Geomatics and Engineering Science of Wuhan University(Engineering Science),2019,52(8):747-752.
[5]TANG F,ZHANG H,YANG L T.Multipath cooperative routing with efficient acknowledgement for LEO satellite networks[J].IEEE Transactions on Mobile Computing,2018,18(1):179-192.
[6]QI H,GUO Y,HOU D,et al.SDN-based dynamic multi-pathrouting strategy for satellite networks[J].Future Generation Computer Systems,2022,133:254-265.
[7]LIU X,ZHOU H,ZHANG Z,et al.Multipath Cooperative Routing in Ultra-Dense LEO Satellite Networks:A Deep Reinforcement Learning-Based Approach[J].IEEE Internet of Things Journal,2025,12(2):1789-1804.
[8]LI J H,LI H W,LAI Z Q,et al.DetSpace:Distributed congestion free routing for global deterministic network in mega-constellations[J].China Communications,2024,21(12):203-216.
[9]WEI W,FU L,GU H,et al.Iris:Towards Intelligent Reliable Routing for Software Defined Satellite Networks[J].IEEE Transactions on Communications,2025,73(1):454-468.
[10]WANG H,RAN Y Y,ZHAO L,et al.Dynamic Routing Algorithm for Low Earth Orbit Satellite Network Based on Deep Graph Reinforcement Learning[J].Journal of Chongqing University of Posts and Telecommunications:Natural Science Edition,2023,35(4):596-605.
[11]DONG F,SONG J,ZHANG Y,et al.DRL-Based Load-Balancing Routing Scheme for 6G Space-Air-Ground Integrated Networks[J].Remote Sensing,2023,15(11):2801.
[12]LI X T,ZHANG Y S.An Artificial Intelligence Routing Method for SDN Networks Suitable for Low-Earth Orbit Satellites[J].Electronic Measurement Technology,2020,43(22):109-114.
[13]WANG C,WANG H,WANG W.A Two-Hops State-AwareRouting Strategy Based on Deep Reinforcement Learning for LEO Satellite Networks[J].Electronics,2019,8(9):920.
[14]XU G,ZHAO Y,RAN Y,et al.Spatial Location Aided Fully-Distributed Dynamic Routing for Large-Scale LEO Satellite Networks[J].IEEE Communications Letters,2022,26(12):3034-3038.
[15]CHU K,CHENG S,ZHU L.A robust routing strategy based on deep reinforcement learning for mega satellite constellations[J].Electronics Letters,2023,59(11):e12820.
[16]BAI Q.Research on Load Balancing Routing Strategy for Satellite Network Based on Uncertainty Theory[D].Nanjing:Nanjing University of Posts and Telecommunications,2020.
[17]NA Z Y,PAN Z,LIU X,et al.Distributed routing strategybased on machine learning for LEO satellite network[J].Wireless Communications and Mobile Computing,2018(3/4):1-10.
[18]WEI D B,CAO Y,YANG L,et al.A Routing Strategy for LEO Satellite Network Based on Spatiotemporal Hierarchy[J].Journal of Aeronautics and Astronautics,2023,44(16):204-214.
[19]WEI L H,LIU G W,LIU Y,et al.Research on Satellite Internet Routing Optimization Based on Deep Reinforcement Learning[J].Space-Earth Integrated Information Network,2022,3(3):65-71.
[20]CHIM T W,YEUNG K L,LUI K S.Traffic distribution overequal-cost-multi-paths[J].Computer Networks,2005,49(4):465-475.

Related Articles 15

[1]	XU Jinlong, WANG Gengwu, HAN Lin, NIE Kai, LI Haoran, CHEN Mengyao, LIU Haohao. Research on Parallel Scheduling Strategy Optimization Technology Based on Sunway Compiler [J]. Computer Science, 2025, 52(9): 137-143.
[2]	CHEN Jintao, LIN Bing, LIN Song, CHEN Jing, CHEN Xing. Dynamic Pricing and Energy Scheduling Strategy for Photovoltaic Storage Charging Stations Based on Multi-agent Deep Reinforcement Learning [J]. Computer Science, 2025, 52(9): 337-345.
[3]	ZHANG Yongliang, LI Ziwen, XU Jiahao, JIANG Yuchen, CUI Ying. Congestion-aware and Cached Communication for Multi-agent Pathfinding [J]. Computer Science, 2025, 52(8): 317-325.
[4]	HUO Dan, YU Fuping, SHEN Di, HAN Xueyan. Research on Multi-machine Conflict Resolution Based on Deep Reinforcement Learning [J]. Computer Science, 2025, 52(7): 271-278.
[5]	WU Zongming, CAO Jijun, TANG Qiang. Online Parallel SDN Routing Optimization Algorithm Based on Deep Reinforcement Learning [J]. Computer Science, 2025, 52(6A): 240900018-9.
[6]	WANG Chenyuan, ZHANG Yanmei, YUAN Guan. Class Integration Test Order Generation Approach Fused with Deep Reinforcement Learning andGraph Convolutional Neural Network [J]. Computer Science, 2025, 52(6): 58-65.
[7]	ZHAO Xuejian, YE Hao, LI Hao, SUN Zhixin. Multi-AGV Path Planning Algorithm Based on Improved DDPG [J]. Computer Science, 2025, 52(6): 306-315.
[8]	ZHOU Kai, WANG Kai, ZHU Yuhang, PU Liming, LIU Shuxin, ZHOU Deqiang. Customized Container Scheduling Strategy Based on GMM [J]. Computer Science, 2025, 52(6): 346-354.
[9]	HUANG Chenxi, LI Jiahui, YAN Hui, ZHONG Ying, LU Yutong. Investigation on Load Balancing Strategies for Lattice Boltzmann Method with Local Grid Refinement [J]. Computer Science, 2025, 52(5): 101-108.
[10]	LI Yuanbo, HU Hongchao, YANG Xiaohan, GUO Wei, LIU Wenyan. Intrusion Tolerance Scheduling Algorithm for Microservice Workflow Based on Deep Reinforcement Learning [J]. Computer Science, 2025, 52(5): 375-383.
[11]	ZHENG Longhai, XIAO Bohuai, YAO Zewei, CHEN Xing, MO Yuchang. Graph Reinforcement Learning Based Multi-edge Cooperative Load Balancing Method [J]. Computer Science, 2025, 52(3): 338-348.
[12]	DU Likuan, LIU Chen, WANG Junlu, SONG Baoyan. Self-learning Star Chain Space Adaptive Allocation Method [J]. Computer Science, 2025, 52(3): 359-365.
[13]	HUO Xingpeng, SHA Letian, LIU Jianwen, WU Shang, SU Ziyue. Windows Domain Penetration Testing Attack Path Generation Based on Deep Reinforcement Learning [J]. Computer Science, 2025, 52(3): 400-406.
[14]	WANG Yijie, GAO Guoju, SUN Yu'e, HUANG He. Flow Cardinality Estimation Method Based on Distributed Sketch in SDN [J]. Computer Science, 2025, 52(2): 268-278.
[15]	XU Donghong, LI Bin, QI Yong. Task Scheduling Strategy Based on Improved A2C Algorithm for Cloud Data Center [J]. Computer Science, 2025, 52(2): 310-322.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Multipath Routing Algorithm for Satellite Networks Based on Convolutional Twin Delay Deep Deterministic Policy Gradient

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0