Computer Science ›› 2022, Vol. 49 ›› Issue (3): 46-51.doi: 10.11896/jsjkx.210700010

• Novel Distributed Computing Technology and System • Previous Articles     Next Articles

DRL-based Vehicle Control Strategy for Signal-free Intersections

OUYANG Zhuo1, ZHOU Si-yuan1,2, LYU Yong1, TAN Guo-ping1,2, ZHANG Yue1, XIANG Liang-liang1   

  1. 1 School of Computer and Information,Hohai University,Nanjing 211100,China
    2 Jiangsu Intelligent Transportation and Intelligent Driving Research Institute,Nanjing 210019,China
  • Received:2021-07-01 Revised:2021-08-28 Online:2022-03-15 Published:2022-03-15
  • About author:OUYANG Zhuo,born in 1995,postgra-duate.His main research interests include wireless communication theory and cooperative communications.
    TAN Guo -ping,born in 1975,Ph.D,professor,Ph.D supervisor.His main research interests include Internet of vehicles,mobile edge computing,and wireless distributed machine learning.
  • Supported by:
    National Natural Science Foundation of China(61701168,61832005),China Postdoctoral Science Funded Project(2019M651546) and Major Technological Projects of Jiangsu Province Transportations Department(2019Z07).

Abstract: Using deep learning technology to control vehicles at intersections is a research hotspot in the field of intelligent transportation.Previous studies suffer from the inability to adapt to dynamic changes in the number of self-driving vehicles,slow convergence of training,and locally optimal training results.This work focuses on how autonomous vehicles can use distributed deep reinforcement methods to improve the efficiency of intersections at unsignalized intersections.First,an efficient reward function is proposed to apply the distributed reinforcement learning algorithm to the unsignalized intersection scenario,which can effectively improve the efficiency of intersection passage by relying on only local information even if the vehicle cannot obtain the whole intersection state information.Then,to address the problem of inefficient training of reinforcement learning methods in open intersection scenarios,a transfer learning approach is used to improve the training efficiency by using the trained strategy in the closed figure-of-eight scenario as a warm start and continuing the training in the unsignalized intersection scenario.Finally,this paper proposes a strategy that can be adapted to all proportions of autonomous vehicles,and this strategy can improve intersection access efficiency in scenarios with any proportion of autonomous vehicles.The algorithm is validated on the simulation platform Flow,and the experimental results show that the proposed smart body model converges quickly in training,can adapt to dynamic changes in the proportion of self-driving vehicles,and can effectively improve the efficiency of intersections.

Key words: Autonomous vehicles, Deep reinforcement learning, Signal-free intersections, V2X

CLC Number: 

  • TP391
[1]MA M,LI Z.A time-independent trajectory optimization ap-proach for connected and auto-nomous vehicles under reservation-based inte-rsection control[J].Transportation Research Interdisciplinary Perspectives,2021,9(5):100312.
[2]LV P,HE Y B,XU J.An Improved Trust Evaluation Model Based on Bayesian for WSNs[J].Acta Electronica Sinica,2021,49(5):912-919.
[3]RIOS -TORRES J,MALIKOPOULOS A A.Automated andCooperative Vehicle Merging at Highway On-Ramps[J].IEEE Transactions on Intelligent Transportation Systems,2016,18(4):1-10.
[4]WANG Z,KIM B G,KOBAYASHI H,et al.Agent-Based Mo-deling and Simulation of Connected and Automated Vehicles Using Game Engine:A Cooperative On-Ramp Merging Study[J].arXiv:1810.09952,2018.
[5]MAITLAND A,MCPHEE J.Quasi-translations for fast hybrid nonlinear model predictive control[J].Control Engineering Practice,2020,97(4):104352.1-104352.9.
[6]DING J,LI L,PENG H,et al.A Rule-Based Cooperative Merging Strategy for Connected and Automated Vehicles[J].IEEE Transactions on Intelligent Transportation Systems,2020,21(8):3436-3446.
[7]XIONG L,KANG Y C,ZHANG P Z,et al.Research on beha-vior decision-making system for unmanned vehicle[J].Automobile Technology,2018,515(8):1-9.
[8]KAMRAN D,LOPEZ C,LAUER M,et al.Risk-aware high-level decisions for automated driving at occluded intersections with reinfor-cement learning[J].arXiv:2004.04450,2020.
[9]ISELE D,RAHIMI R,COSGUN A,et al.Navigating occluded intersections with autonomous vehicles using deep reinforcement learning[C]//2018 IEEE ICRA.Brisbane:IEEE,2018:2034-2039.
[10]XU G Y,ZONG X P,YU G Z,et al.A research on intelligent obstacle avoidance of unmanned vehicle based on DDPG algorithm[J].Automotive Engineering,2019,41(2):206-212.
[11]ZHANG B,HE M,CHEN X L,et al.Self-driving via improved DDPG algorithm[J].Computer Engineering and Applications,2019,55(10):264-270.
[12]DAI S S,LIU Q.Action Constrained Deep ReinforcementLearning Based Safe Automatic Driving Method[J].Computer Science,2021,48(9):235-243.
[13]SUN C Y,MU C X.Important scientific probems of multi-agent deep reinforcement learning[J].Acta Automatica Sinica,2020,46(7):1301-1312.
[14]SUN H,CHEN C L,LIU Q,et al.Constrained Deep Reinforcement Learning Based Safe A-utomatic Driving Method[J].Computer Science,2020,47(2):169-174.
[15]WEI H,LIU X,MASHAYEKHY L,et al.Mixed-AutonomyTraffic Control with Proximal Policy Optimization[C]//2019 IEEE Vehicular Networking Conference (VNC).IEEE,2019.
[16]VINITSKY E,LICHTLE N,PARVATE K,et al.OptimizingMixed Autonomy Traffic Flow With Decentralized Autonomous Vehicles and Multi-Agent RL[J].arXiv:2011.00120,2020.
[17]CHEN D,LI Z J,WANG Y Q,et al.Deep Multi-agent Rein-forcement Learning for High-way On-Ramp Merging in Mixed Traffic[J].arXiv:2105.05701v1,2021.
[18]TRAN D Q,BAE S H.Proximal Policy Optimization Through a Deep Reinforcement Learning Framework for Multiple Autonomous Vehicles at a Non-Signalized Intersection[J].Applied Sciences,2020,10(16):5722.
[19]TREIBER M,HENNECKE A,HELBING D.Congested traffic states in empirical observations and microscopic simulations[J].Physical Review E,2000,62(2):1805.
[20]CUI J,MACKE W,YEDIDSION H,et al.Scalable MultiagentDriving Policies For Reducing Traffic Congestion[J].arXiv:2103.00058,2021.
[21]WU C,KREIDIEH A,PARVATE K,et al.Flow:A Modular Learning Framework for Autonomy in Traffic[J].arXiv:1710.05465v2,2007.
[22]LIANG E,LIAW R,NISHIHARA R,et al.Ray RLLib:A Composable and Scalable Reinforcement Learning Library[J].arXiv:1712.09381,2017.
[23]KRAJZEWICZ D,ERDMANN J,BEHRISCH M,et al.Recent Development and Applications of SUMO Simulation of Urban MObility[J].International Journal on Advances in Systems and Measurements,2012,12(3/4/5):128-138.
[1] YU Bin, LI Xue-hua, PAN Chun-yu, LI Na. Edge-Cloud Collaborative Resource Allocation Algorithm Based on Deep Reinforcement Learning [J]. Computer Science, 2022, 49(7): 248-253.
[2] LI Meng-fei, MAO Ying-chi, TU Zi-jian, WANG Xuan, XU Shu-fang. Server-reliability Task Offloading Strategy Based on Deep Deterministic Policy Gradient [J]. Computer Science, 2022, 49(7): 271-279.
[3] XIE Wan-cheng, LI Bin, DAI Yue-yue. PPO Based Task Offloading Scheme in Aerial Reconfigurable Intelligent Surface-assisted Edge Computing [J]. Computer Science, 2022, 49(6): 3-11.
[4] HONG Zhi-li, LAI Jun, CAO Lei, CHEN Xi-liang, XU Zhi-xiong. Study on Intelligent Recommendation Method of Dueling Network Reinforcement Learning Based on Regret Exploration [J]. Computer Science, 2022, 49(6): 149-157.
[5] LI Peng, YI Xiu-wen, QI De-kang, DUAN Zhe-wen, LI Tian-rui. Heating Strategy Optimization Method Based on Deep Learning [J]. Computer Science, 2022, 49(4): 263-268.
[6] DAI Shan-shan, LIU Quan. Action Constrained Deep Reinforcement Learning Based Safe Automatic Driving Method [J]. Computer Science, 2021, 48(9): 235-243.
[7] CHENG Zhao-wei, SHEN Hang, WANG Yue, WANG Min, BAI Guang-wei. Deep Reinforcement Learning Based UAV Assisted SVC Video Multicast [J]. Computer Science, 2021, 48(9): 271-277.
[8] LIANG Jun-bin, ZHANG Hai-han, JIANG Chan, WANG Tian-shu. Research Progress of Task Offloading Based on Deep Reinforcement Learning in Mobile Edge Computing [J]. Computer Science, 2021, 48(7): 316-323.
[9] WANG Ying-kai, WANG Qing-shan. Reinforcement Learning Based Energy Allocation Strategy for Multi-access Wireless Communications with Energy Harvesting [J]. Computer Science, 2021, 48(7): 333-339.
[10] ZHOU Shi-cheng, LIU Jing-ju, ZHONG Xiao-feng, LU Can-ju. Intelligent Penetration Testing Path Discovery Based on Deep Reinforcement Learning [J]. Computer Science, 2021, 48(7): 40-46.
[11] LI Bei-bei, SONG Jia-rui, DU Qing-yun, HE Jun-jiang. DRL-IDS:Deep Reinforcement Learning Based Intrusion Detection System for Industrial Internet of Things [J]. Computer Science, 2021, 48(7): 47-54.
[12] FAN Yan-fang, YUAN Shuang, CAI Ying, CHEN Ruo-yu. Deep Reinforcement Learning-based Collaborative Computation Offloading Scheme in VehicularEdge Computing [J]. Computer Science, 2021, 48(5): 270-276.
[13] FAN Jia-kuan, WANG Hao-yue, ZHAO Sheng-yu, ZHOU Tian-yi, WANG Wei. Data-driven Methods for Quantitative Assessment and Enhancement of Open Source Contributions [J]. Computer Science, 2021, 48(5): 45-50.
[14] HUANG Zhi-yong, WU Hao-lin, WANG Zhuang, LI Hui. DQN Algorithm Based on Averaged Neural Network Parameters [J]. Computer Science, 2021, 48(4): 223-228.
[15] LI Li, ZHENG Jia-li, LUO Wen-cong, QUAN Yi-xuan. RFID Indoor Positioning Algorithm Based on Proximal Policy Optimization [J]. Computer Science, 2021, 48(4): 274-281.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!