Computer Science ›› 2019, Vol. 46 ›› Issue (11A): 117-121.

• Intelligent Computing • Previous Articles     Next Articles

Traffic Signal Control Based on Double Deep Q-learning Network with Dueling Architecture

LAI Jian-hui   

  1. (College of Computer Science and Technology,Zhejiang University of Technology,Hangzhou 310023,China)
  • Online:2019-11-10 Published:2019-11-20

Abstract: The intersection is the core and hub of the urban road network.Reasonable optimization of the signal control at the intersection can greatly improve the operational efficiency of the urban transportation system.Using real-time traffic information as input and dynamically adjusting the phase time of the traffic signal becomes the important direction of current research.This paper proposed a traffic signal control method based on double deep Q-learning network with Dueling Architecture (D3QN).The deep learning network is combined with the traffic signal control machine to form an intelligent agent for adjusting the signal control strategy of the intersection.Then the DTSE (Discrete Traffic State Coding) method is used to transform the traffic state of the intersection into a two-dimensional matrix composed of the position and velocity information of the vehicle.Then high-level features are captured by deep neural network,which makes accurate perception of traffic state come true.On this basis,an adaptive traffic signal control strategy is realized through reinforcement learning.Finally,the traffic micro-simulator (SUMO) is used for simulation experiments,the timing control and induction control methods are used as control experiments.The results show that the proposed method achieves better control effect and is therefore feasible and effective.

Key words: Deep learning, Deep reinforcement learning, Intelligent transportation, Reinforcement learning, Traffic signal control

CLC Number: 

  • TP391
[1]LI L,WEN D,YAO D Y.A survey of traffic control with vehicu-lar communications[J].IEEE Transactions on Intelligent Transportation Systems,2014,15(1):425-432.
[2]FADLULLAH Z,TANG F,MAO B,et al.State-of-the-ArtDeep Learning:Evolving Machine Intelligence Toward Tomorrow’s Intelligent Network Traffic Control Systems[J].IEEE Communications Surveys & Tutorials,2017,PP(99):1-1.
[3]NGUYEN,THUY T T,ARMITAGE G J.A survey of tech-niques for internet traffic classification using machine learning[J].IEEE Communications Surveys & Tutorials,2008,10(3):56-76.
[4]CHIN Y K,KOW W Y,KHONG W L,et al.Q-Learning Traffic Signal Optimization within Multiple Intersections Traffic Network[C]∥2012 Sixth UKSim/AMSS European Symposium on Computer Modeling and Simulation (EMS).IEEE,2012.
[5]CHIN Y K,LEE L K,BOLONG N,et al.Exploring Q-Learning Optimization in Traffic Signal Timing Plan Management[C]∥Third International Conference on Computational Intelligence.IEEE,2011.
[6]ARAGHI S,KHOSRAVI A,CREIGHTON D C,et al.Optimal fuzzy traffic signal controller for an isolated intersection[C]∥IEEE International Conference on Systems.IEEE,2014.
[7]CHEN Y H,CHANG C J,HUANG C Y.Fuzzy Q-Learning Admission Control for WCDMA/WLAN Heterogeneous Networks with Multimedia Traffic[J].IEEE Transactions on Mobile Computing,2009,8(11):1469-1479.
[8]CHIU S,CHAND S.Adaptive traffic signal control using fuzzy logic[C]∥IEEE International Conference on Fuzzy Systems.IEEE,1992.
[9]B BINGHAM E.Reinforcement learning in neurofuzzy trafficsignal control[J].European Journal of Operational Research,2001,131(2):232-241.
[10]LA P,BHATNAGAR S.Reinforcement Learning With Func-tion Approximation for Traffic Signal Control[J].IEEE Tran-sactions on Intelligent Transportation Systems,2011,12(2):412-421.
[11]EL-TANTAWY S,ABDULHAI B,ABDELGAWAD H.Mul-tiagent reinforcement learning for integrated network of adaptive trafFIc signal controllers (MARLIN-ATSC):methodology and large-scale application on downtown Toronto[J].IEEE Transactions on Intelligent Transportation Systems,2013,14(3):1140-1150.
[12]OZAN C,BASKAN O,HALDENBILEN S,et al.A modified reinforcement learning algorithm for solving coordinated signalized networks[J].Transportation Research Part C:Emerging Technologies,2015,54:40-55.
[13]ELTANTAWY S,ABDULHAI B,ABDELGAWAD H.Design of Reinforcement Learning Parameters for Seamless Application of Adaptive Traffic Signal Control[J].Journal of Intelligent Transportation Systems,2014,18(3):227-245.
[14]ABDOOS M,MOZAYANI N,BAZZAN A L C.Holonic multi-agent system for traffic signals control[J].Engineering Applications of Artificial Intelligence,2013,26(5/6):1575-1587.
[15]ABDULHAI B,PRINGLE R,KARAKOULAS G J.Reinforcement learning for true adaptive traffic signal control[J].Journal of Transportation Engineering,2003,129(3):278-285.
[16]AREL I,LIU C,URBANIK T,et al.Reinforcement learning-based multi-agent system for network traffic signal control[J].IET Intelligent Transport Systems,2010,4(2):128.
[17]BALAJI P G,GERMAN X,SRINIVASAN D.Urban traffic signal control using reinforcement learning agents[J].IET Intelligent Transport Systems,2010,4(3).
[18]GENDERS W,RAZAVI S.Using a Deep Reinforcement Lear-ning Agent for Traffic Signal Control[J].arXiv:1611.01142v1,2016.
[19]SUTTON R,BARTO A.Reinforcement Learning:An Introduction[M].Cambridge,MA:MIT Press,1998.
[20]HINTON G E,SALAKHUTDINOV R R.Reducing the dimensionality of data with neural networks[J].Science,2006,313(5786):504-507.
[21]BENGIO Y.Learning deep architectures for AI[J].Foundations and Trendsin Machine Learning,2009,2(1):1-127.
[22]LANGE S,RIEDMILLER M.Deep auto-encoder neural net-works in reinforcement learning[C]∥Proceedings of the 2010 International Joint Conference on Neural Networks (IJCNN).Barcelona:IEEE,2010:1-8.
[23]MNIH V,KAVUKCUOGLU K,SILVER D,et al.Human-level control through deep reinforcement learning[J].Nature,2015,518(7540):529-533.
[24]LI L,LV Y,WANG F Y.Traffic Signal Timing via Deep Reinforcement Learning[J].IEEE/CAA Journal of Automatica Sinica,2016,3(3):247-254.
[25]MNIH V,KAVUKCUOGLU K,SILVER D,et al.Playing Atari with Deep Reinforcement Learning[J].arXiv:1312.5602,2013.
[1] RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[2] LIU Xing-guang, ZHOU Li, LIU Yan, ZHANG Xiao-ying, TAN Xiang, WEI Ji-bo. Construction and Distribution Method of REM Based on Edge Intelligence [J]. Computer Science, 2022, 49(9): 236-241.
[3] TANG Ling-tao, WANG Di, ZHANG Lu-fei, LIU Sheng-yun. Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy [J]. Computer Science, 2022, 49(9): 297-305.
[4] XU Yong-xin, ZHAO Jun-feng, WANG Ya-sha, XIE Bing, YANG Kai. Temporal Knowledge Graph Representation Learning [J]. Computer Science, 2022, 49(9): 162-171.
[5] SHI Dian-xi, ZHAO Chen-ran, ZHANG Yao-wen, YANG Shao-wu, ZHANG Yong-jun. Adaptive Reward Method for End-to-End Cooperation Based on Multi-agent Reinforcement Learning [J]. Computer Science, 2022, 49(8): 247-256.
[6] WANG Jian, PENG Yu-qi, ZHAO Yu-fei, YANG Jian. Survey of Social Network Public Opinion Information Extraction Based on Deep Learning [J]. Computer Science, 2022, 49(8): 279-293.
[7] HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[8] JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[9] SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[10] YUAN Wei-lin, LUO Jun-ren, LU Li-na, CHEN Jia-xing, ZHANG Wan-peng, CHEN Jing. Methods in Adversarial Intelligent Game:A Holistic Comparative Analysis from Perspective of Game Theory and Reinforcement Learning [J]. Computer Science, 2022, 49(8): 191-204.
[11] HOU Yu-tao, ABULIZI Abudukelimu, ABUDUKELIMU Halidanmu. Advances in Chinese Pre-training Models [J]. Computer Science, 2022, 49(7): 148-163.
[12] ZHOU Hui, SHI Hao-chen, TU Yao-feng, HUANG Sheng-jun. Robust Deep Neural Network Learning Based on Active Sampling [J]. Computer Science, 2022, 49(7): 164-169.
[13] SU Dan-ning, CAO Gui-tao, WANG Yan-nan, WANG Hong, REN He. Survey of Deep Learning for Radar Emitter Identification Based on Small Sample [J]. Computer Science, 2022, 49(7): 226-235.
[14] YU Bin, LI Xue-hua, PAN Chun-yu, LI Na. Edge-Cloud Collaborative Resource Allocation Algorithm Based on Deep Reinforcement Learning [J]. Computer Science, 2022, 49(7): 248-253.
[15] LI Meng-fei, MAO Ying-chi, TU Zi-jian, WANG Xuan, XU Shu-fang. Server-reliability Task Offloading Strategy Based on Deep Deterministic Policy Gradient [J]. Computer Science, 2022, 49(7): 271-279.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!