基于强化学习的无线可充电传感网移动充电路径优化

doi:10.11896/jsjkx.200400075

计算机科学 ›› 2020, Vol. 47 ›› Issue (11): 316-321.doi: 10.11896/jsjkx.200400075

基于强化学习的无线可充电传感网移动充电路径优化

张昊, 管昕洁, 白光伟

南京工业大学计算机科学与技术学院南京 211816

收稿日期:2020-04-17 修回日期:2020-07-13 出版日期:2020-11-15 发布日期:2020-11-05
通讯作者: 管昕洁(xjguan@njtech.edu.cn)
作者简介:Becky951219@126.com
基金资助:
国家自然科学基金项目(61802176)

Optimization of Mobile Charging Path of Wireless Rechargeable Sensor Networks Based on Reinforcement Learning

ZHANG Hao, GUAN Xin-jie, BAI Guang-wei

Department of Computer Science and Technology,Nanjing University of Technology,Nanjing 211816,China

Received:2020-04-17 Revised:2020-07-13 Online:2020-11-15 Published:2020-11-05
About author:ZHANG Hao,born in 1995,postgra-duate.Her main research interests include reinforcement learning,artificial intelligence and wireless sensor network.
GUAN Xin-jie,born in 1984,Ph.D,master instructor.Her main research inte-rests include network optimization,edge computing and software defined network.
Supported by:
This work was supported by the National Natural Science Foundation of China (61802176).

摘要/Abstract

摘要： 无线传感器网络在环境感知、目标跟踪等方面占据了重要地位。为了能够及时地为传感器节点补充能量,提出了一种基于强化学习的低功耗、高能效的移动路径充电算法。无线传感器网络采用移动充电车对传感器节点进行充电,将Q-Learning算法与epsilon-greedy算法相结合,以最短路径依次完成所有传感器节点的充电。现有的相关研究通常忽略了传感器节点自身所能承受电量的最大值,容易导致传感器节点因充电过程中电量超出最大值而暂停工作,因此限制了移动充电车的充电时间。结果表明,所提移动充电策略的效用更高,与传统的Q-Learning算法和贪心算法相比,训练周期大幅度下降且实现了能量利用率最大化。

关键词: 路径, 能量利用率, 强化学习, 无线可充电传感网, 移动充电

Abstract: Wireless sensor networks occupy an important position in environmental perception and target tracking.In order to recharge sensor nodes in time,this paper proposes a low power consumption and high energy efficientcy mobile path charging algorithm based on reinforcement learning.Wireless sensor network uses a mobile charger to charge the sensor nodes.The Q-Lear-ning algorithm and the epsilon-greedy algorithm are combined to complete the charging of all sensor nodes in turn in the shortest path.Existing related researches usually ignore the maximum amount of power that the sensor node itself can withstand,which easily causes the power to exceed the maximum threshold during charging and suspend work,so the charging time of the mobile charger is limited.The result shows that the proposed mobile charging strategy has a higher utility.Compared with the traditional Q-Learning algorithm and the greedy algorithm,the training cycle is greatly reduced and the energy utilization rate is maximized.

Key words: Energy utilization, Mobile charging, Path, Reinforcement learning, Wireless rechargeable sensor network

中图分类号:

TP393

张昊, 管昕洁, 白光伟. 基于强化学习的无线可充电传感网移动充电路径优化[J]. 计算机科学, 2020, 47(11): 316-321. https://doi.org/10.11896/jsjkx.200400075

ZHANG Hao, GUAN Xin-jie, BAI Guang-wei. Optimization of Mobile Charging Path of Wireless Rechargeable Sensor Networks Based on Reinforcement Learning[J]. Computer Science, 2020, 47(11): 316-321. https://doi.org/10.11896/jsjkx.200400075

参考文献

[1] LÓPEZ RIQUELME J A,SOTO F,SUARDÍAZ J,et al.Wireless Sensor Networks for precision horticulture in Southern Spain[J].Computers & Electronics in Agriculture,2009,68(1):25-35.
[2] GIUSEPPE A,MARCO C,MARIO D F,et al.Energy conservation in wireless sensor networks:A survey[J].Ad Hoc Networks,2009,7(3):537-568.
[3] FAFOUTIS X,VUCKOVIC D,DI M A,DRAGON N,et al.Energy-Harvesting wireless sensor networks[C]//Proc.of the 9th European Conf.on Wireless Sensor Networks(EWSN).Trento:University of Trento,2012:84-85.
[4] KURS A,KARALIS A,MOFFATT R,et al.Wireless power transfer via strongly coupled magnetic resonances[J].Science,2007,317(5834):83-86.
[5] KURS A,MOFFATT R,SOLJACIC M.Simultaneous midrange power transfer to multiple devices[J].Applied Physics Letters,2010,96(4):34.
[6] DAI H P,CHEN G H,XU L J,et al.Effective Algorithm for Placement of Directional Wireless Chargers[J].Ruan Jian Xue Bao,2015,26(7):1711-1729.
[7] SHI Y,XIE L HOU Y T,et al.On Renewable Sensor Networks with Wireless Energy Transfer[C]//INFOCOM,2011 Procee-dings IEEE.IEEE,2012:1350-1358.
[8] XIE L,SHI Y,HOU Y T,et al.Multi-node wireless energy charging in sensor networks[J].IEEE/ACM Transactions on Networking,2015,23(2):437-450.
[9] HE L,KONG L,GU Y,et al.Evaluating the On-Demand Mobile Charging in Wireless Sensor Networks[J].IEEE Transactions on Mobile Computing,2015,14(9):1861-1875.
[10] JIANG F C,HE S B,CHENG P,et al.On optimal scheduling in wireless rechargeable sensor networks for stochastic event capture[C]//IEEE International Conference on Mobile Adhoc & Sensor Systems.IEEE,2011.
[11] SU Z Z.Research and Improvement on Routing Protocols ofWireless Sensor Networks Based on Clustering[D].Changchun:Jilin University,2016.
[12] XIE L,SHI Y,HOU Y T,et al.Multi-Node Wireless Energy Charging in Sensor Networks[J].IEEE/ACM Transactions on Networking,2015,23(2):437-450.
[13] HE S B,CHEN J M,JIANG F C,et al.Energy provisioning in wireless rechargeable sensor networks[J].IEEE Transactions on Mobile Computing,2013,12(10):1931-1942.
[14] YU L C,LV H F,HE L,et al.Optimization of Charging Path for Wireless Rechargeable Sensor Networks[J].Journal of Shanghai DianJi University,2018(4):25-30.
[15] FU L K,CHENG P,GU Y,et al.Minimizing charging delay in wireless rechargeable sensor networks[C]//2013 Proceedings IEEE INFOCOM.IEEE,2013:2922-2930.
[16] VARTIAINEN E M,INO Y,SHIMANO R,et al.Numericalphase correction method for terahertz time-domain reflection spectroscopy[J].Journal of Applied Physics,2004,96(8):4171-4176.
[17] GU S,LILLICRAP T,SUTSKEVER I,et al.Continuous deep q-learning with model-based acceleration[C]//International Conference on Machine Learning.2016:2829-2838.
[18] SONG Y C,ZHANG Y Y,MENG H D.Research Based on Euclid Distance with Weights of Clustering Method[J].Computer Engineering and Applications,2007,43(4):179-180,226.

相关文章 15

[1]	刘兴光, 周力, 刘琰, 张晓瀛, 谭翔, 魏急波. 基于边缘智能的频谱地图构建与分发方法 Construction and Distribution Method of REM Based on Edge Intelligence 计算机科学, 2022, 49(9): 236-241. https://doi.org/10.11896/jsjkx.220400148
[2]	黄丽, 朱焱, 李春平. 基于异构网络表征学习的作者学术行为预测 Author’s Academic Behavior Prediction Based on Heterogeneous Network Representation Learning 计算机科学, 2022, 49(9): 76-82. https://doi.org/10.11896/jsjkx.210900078
[3]	吕晓锋, 赵书良, 高恒达, 武永亮, 张宝奇. 基于异质信息网的短文本特征扩充方法 Short Texts Feautre Enrichment Method Based on Heterogeneous Information Network 计算机科学, 2022, 49(9): 92-100. https://doi.org/10.11896/jsjkx.210700241
[4]	熊丽琴, 曹雷, 赖俊, 陈希亮. 基于值分解的多智能体深度强化学习综述 Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization 计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112
[5]	史殿习, 赵琛然, 张耀文, 杨绍武, 张拥军. 基于多智能体强化学习的端到端合作的自适应奖励方法 Adaptive Reward Method for End-to-End Cooperation Based on Multi-agent Reinforcement Learning 计算机科学, 2022, 49(8): 247-256. https://doi.org/10.11896/jsjkx.210700100
[6]	袁唯淋, 罗俊仁, 陆丽娜, 陈佳星, 张万鹏, 陈璟. 智能博弈对抗方法:博弈论与强化学习综合视角对比分析 Methods in Adversarial Intelligent Game:A Holistic Comparative Analysis from Perspective of Game Theory and Reinforcement Learning 计算机科学, 2022, 49(8): 191-204. https://doi.org/10.11896/jsjkx.220200174
[7]	王兵, 吴洪亮, 牛新征. 基于改进势场法的机器人路径规划 Robot Path Planning Based on Improved Potential Field Method 计算机科学, 2022, 49(7): 196-203. https://doi.org/10.11896/jsjkx.210500020
[8]	于滨, 李学华, 潘春雨, 李娜. 基于深度强化学习的边云协同资源分配算法 Edge-Cloud Collaborative Resource Allocation Algorithm Based on Deep Reinforcement Learning 计算机科学, 2022, 49(7): 248-253. https://doi.org/10.11896/jsjkx.210400219
[9]	李梦菲, 毛莺池, 屠子健, 王瑄, 徐淑芳. 基于深度确定性策略梯度的服务器可靠性任务卸载策略 Server-reliability Task Offloading Strategy Based on Deep Deterministic Policy Gradient 计算机科学, 2022, 49(7): 271-279. https://doi.org/10.11896/jsjkx.210600040
[10]	高文龙, 周天阳, 朱俊虎, 赵子恒. 基于双向蚁群算法的网络攻击路径发现方法 Network Attack Path Discovery Method Based on Bidirectional Ant Colony Algorithm 计算机科学, 2022, 49(6A): 516-522. https://doi.org/10.11896/jsjkx.210500072
[11]	谭任深, 徐龙博, 周冰, 荆朝霞, 黄向生. 海上风电场通用运维路径规划模型优化及仿真 Optimization and Simulation of General Operation and Maintenance Path Planning Model for Offshore Wind Farms 计算机科学, 2022, 49(6A): 795-801. https://doi.org/10.11896/jsjkx.210400300
[12]	杨浩雄, 高晶, 邵恩露. 考虑一单多品的外卖订单配送时间的带时间窗的车辆路径问题 Vehicle Routing Problem with Time Window of Takeaway Food ConsideringOne-order-multi-product Order Delivery 计算机科学, 2022, 49(6A): 191-198. https://doi.org/10.11896/jsjkx.210400005
[13]	王永, 崔源. 基于四边形最优圈内最短路径的旅行商问题割边方法 Cutting Edge Method for Traveling Salesman Problem Based on the Shortest Paths in Optimal Cycles of Quadrilaterals 计算机科学, 2022, 49(6A): 199-205. https://doi.org/10.11896/jsjkx.210400065
[14]	陈钧吾, 余华山. 面向无尺度图的Δ-stepping算法改进策略 Strategies for Improving Δ-stepping Algorithm on Scale-free Graphs 计算机科学, 2022, 49(6A): 594-600. https://doi.org/10.11896/jsjkx.210400062
[15]	谢万城, 李斌, 代玥玥. 空中智能反射面辅助边缘计算中基于PPO的任务卸载方案 PPO Based Task Offloading Scheme in Aerial Reconfigurable Intelligent Surface-assisted Edge Computing 计算机科学, 2022, 49(6): 3-11. https://doi.org/10.11896/jsjkx.220100249

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

基于强化学习的无线可充电传感网移动充电路径优化

Optimization of Mobile Charging Path of Wireless Rechargeable Sensor Networks Based on Reinforcement Learning

PDF (PC)

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

Metrics

本文评价

推荐阅读 0