计算机科学 ›› 2021, Vol. 48 ›› Issue (6A): 334-341.doi: 10.11896/jsjkx.200800066

• 智能计算 • 上一篇    下一篇

时空图注意力网络在交叉口车辆轨迹预测的应用

曾伟良, 陈漪皓, 姚若愚, 廖睿翔, 孙为军   

  1. 广东工业大学自动化学院 广州510006
  • 出版日期:2021-06-10 发布日期:2021-06-17
  • 通讯作者: 孙为军(14341569@qq.com)
  • 作者简介:weiliangzeng@gdut.edu.cn
  • 基金资助:
    国家自然科学基金(61803100,U1911401);广东省智能交通系统重点实验室开放基金(202005003);广东省科技计划(2019B010121001,2019B010118001,2019B01019001);工信部工业互联网创新发展工程(TC190A3X9-2-2);国家重点研发计划(2018YFB1802400)

Application of Spatial-Temporal Graph Attention Networks in Trajectory Prediction for Vehicles at Intersections

ZENG Wei-liang, CHEN Yi-hao, YAO Ruo-yu, LIAO Rui-xiang, SUN Wei-jun   

  1. School of Automation,Guangdong University of Technology,Guangzhou 510006,China
  • Online:2021-06-10 Published:2021-06-17
  • About author:ZENG Wei-liang,born in 1986,Ph.D associate professor.His main research interests include routing problem in complex network,traffic simulation,and big data visualization for smart city.
    SUN Wei-jun,born in 1975,Ph.D,is a member of CCF YOCSEF AC.His main research interests include internet of thing and machine learning.
  • Supported by:
    National Natural Science Foundation of China(61803100,U1911401),Guangdong Provincial Key Laboratory of Intelligent Transportation System(202005003),Science and Technology Planning Project of Guangdong Province,China(2019B010121001,2019B010118001,2019B01019001),Industrial Internet Innovation and Development Project of MIIT(TC190A3X9-2-2) and National Key Research and Development Project(2018YFB1802400).

摘要: 随着人工智能和大数据技术的快速发展,以深度学习为代表的自动驾驶轨迹预测是未来的热点研究方向。在混合交通场景下,如何准确地预测机动车与非机动车的轨迹,是实现自动驾驶技术中安全行驶和高效轨迹规划等问题的前提。针对交叉路口中不同运动对象之间发生交互时的轨迹预测问题,提出了基于图注意力网络的建模方案。所采用的模型结合了时间与空间上研究对象之间的相互作用,对机动车与非机动车的未来轨迹做出了更准确的预测,可应用于自动驾驶的轨迹规划方案,确保在复杂交通场景下,机动车与非机动车能够安全且高效地通过交叉路口。该模型在简单交互情况下,可取得较小的轨迹平均位移误差和最终位移误差,而在复杂交互情况下,可提供更为合理的规划路径。

关键词: 轨迹预测, 深度学习, 图注意力网络, 自动驾驶

Abstract: With the rapid development of artificial intelligence and big data technology,the application of deep learning to trajectory prediction on autonomous driving has become a hot topic in recent years.The premise of keeping the safety of navigation and efficient path planning in autonomous driving is to make trajectory predictions of motor and non-motor vehicles accurate,especially in mixed traffic scenes.Aiming at tackling the problem related to path planning when interactions happen among different research objects at an intersection,a modelling scheme based on graph attention networks is proposed.The model applied combines spatial and temporal interactions among traffic-agents to improve the accuracy of trajectory predictions for motor and non-motor vehicles.In addition,the proposed model can be applied to path planning of autonomous driving,ensuring that motor and non-motor vehicles are capable of passing the interactions safely and efficiently in complex traffic scenes.In the case of simple interactions,the average displacement error and final displacement error of the trajectories derived from the model reach relatively small.And in the case of complex interactions,the future paths provided by the model are more reasonable than the future ground truths.

Key words: Autonomous driving, Deep learning, Graph Attention Networks, Trajectory prediction

中图分类号: 

  • TP391
[1] WU Y,SCHUSTER M,CHEN Z,et al.Google's neural machine translation system:bridging the gap between human and machine translation[J].arXiv:1609.08144,2016.
[2] BAHDANAU D,CHO K,BENGIO Y.Neural machine translation by jointly learning to align and translate[J].arXiv:1409.0473,2014.
[3] GRAVES A,JAITLY N.Towards end-to-end speech recogni-tion with recurrent neural networks[C]//International Conference on Machine Learning.2014:1764-1772.
[4] CHOROWSKI J,BAHDANAU D,CHO K,et al.End-to-endcontinuous speech recognition using attention-based recurrent nn:First results[J].arXiv:1412.1602,2014.
[5] XU K,BAJ,KIROS R,et al.Show,attend and tell:Neural image caption generation with visual attention[C]//International Conference on Machine Learning.2015:2048-2057.
[6] LAROCA R,SEVERO E,ZANLORENSI L A,et al.A robustreal-time automatic license plate recognition based on the yolodetector[C]//International JointConference on Neural Networks.2018:1-10.
[7] BERGER C.From a competition for self-driving miniature cars to a standardized experimental platform:concept,models,architecture,and evaluation[J].arXiv:1406.7768,2014.
[8] HELBING D,MOLNAR P.Social force model for pedestriandynamics [J].Physical Review E Statistical Physics Plasmas Fluids & Related Interdisciplinary Topics,1998,51(5):4282.
[9] TAY C,LAUGIER C.Modelling smooth paths using gaussian processes[C]//Springer Tracts in Advanced Robotics,2007,42:381-390.
[10] CHAN Y T,HU A G C,PLANT J B.A kalmanfilter basedtracking scheme with input estimation[J].IEEE Transactions on Aerospace and Electronic Systems,1979,AES-15(2):237-244.
[11] ALAHI A,GOEL K,RAMANATHAN V,et al.Social lstm:human trajectory prediction in crowded spaces[C]//IEEE Conference on Computer Vision and Pattern Recognition.2016.
[12] ILYA S,ORIOL V,QUOC V L.Sequence tosequence learning with neural networks [C]//Advances in Neural Information Processing Systems.2014:3104-3112.
[13] AGRIM G,JUSTIN J,LI F,et al.Social gan:socially acceptable trajectories with generative adversarial networks[C]//IEEE Conference on Computer Vision and Pattern Recognition.2018.
[14] JAVAD A,JEAN-BERNARD H,JULIAN P.Social ways:learning multi-modal distributions of pedestrian trajectories with gans[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.2019.
[15] AMIR S,VINEET K,ALI S,et al.SoPhie:an attentive gan for predicting paths compliant to social and physical constraints[C]//IEEE Conference on Computer Vision and Pattern Recognition.2018.
[16] ZHAO T,XU Y,MONFORT M,et al.Multi-agent tensor fusion for contextual trajectory prediction[C]//IEEE Conference on Computer Vision and Pattern Recognition.2019.
[17] ASHISH V,NOAM S,NIKI P,et al.Attention is all you need[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems.2017,30.
[18] VEMULA A,MUELLING K,OH J.Social attention:modeling attention in human crowds[C]//2018 IEEE international Conference on Robotics and Automation (ICRA).IEEE,2018.
[19] XU K,HU W,LESKOVEC J,et al.How powerful are graphneural networks?[J].arXiv:1810.00826.
[20] SIJIE Y,YUANJUN X,DAHUA L.Spatial temporal graphconvolutional networks for skeleton-based action recognition[C]//Thirty-Second AAAI Conference on ArtificialIntelligence.2018.
[21] YAN S J,XIONG Y J,LIN D H.Spatial temporal graph convolutional networks for skeleton-based action recognition[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2018(1).
[22] CHENYANG S,WENTAO C,WEI W,et al.An attention enhanced graph convolutional lstm network for skeleton-basedaction recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2019:1227-1236.
[23] VELIKOVI P,CUCURULL G,CASANOVA A,et al.Graph attention networks[J].arXiv:1710.10903.
[24] HUANG Y,BI H,LI Z,et al.STGAT:modeling spatial-temporal interactions for human trajectory prediction[C]//IEEE International Conference on Computer Vision.2019:6272-6281.
[25] CHO K,VAN M B,GULCEHRE C,et al.Learning phrase representations usingrnnencoder-decoder for statistical machine translation[J].arXiv:1406.1078.
[26] HASIM S,ANDREW S,FRANCOISE B.Long short-termmemory based recurrent neural network architectures for large vocabulary speech recognition[J].arXiv:1402.1128.
[27] SUZUKI K,NAKAMURA H.Traffic Analyzer-the integratedvideo image processing system for traffic flow analysis[C]//Proceedings of the 13th World Congress on Intelligent Transportation Systems.2006.
[28] LI X,CHEN S,HU X,YANG J.Understanding the disharmony between dropout and batch normalization by variance shift[C]//IEEE Conference on Computer Vision and Pattern Recognition.2019:2677-2685.
[29] NWANKPA C,IJOMAH W,GACHAGAN A,et al.Activation functions:comparison of trends in practice and research for deep learning[J].arXiv:1811.03378.
[30] BELLO I,ZOPH B,VASUDEVAN V,et al.Neural optimizer search with reinforcement Learning[C]//International Conference on Machine Learning.2016:459-468.
[31] HOJJATS,SHARN S,JOSEPH B.Recent advances in recurrent neural networks[J].arXiv:1801.01078,2017.
[32] WOJCIECH Z,ILYA S,ORIOL V.Recurrent neural network regularization[J].arXiv:1409.2329.
[33] SEPP H,JURGEN S.Long short-term memory[J].NeuralComputation,1997,9(8):1735-1780.
[1] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[2] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[3] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[4] 史殿习, 赵琛然, 张耀文, 杨绍武, 张拥军.
基于多智能体强化学习的端到端合作的自适应奖励方法
Adaptive Reward Method for End-to-End Cooperation Based on Multi-agent Reinforcement Learning
计算机科学, 2022, 49(8): 247-256. https://doi.org/10.11896/jsjkx.210700100
[5] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[6] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[7] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[8] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[9] 檀莹莹, 王俊丽, 张超波.
基于图卷积神经网络的文本分类方法研究综述
Review of Text Classification Methods Based on Graph Convolutional Network
计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064
[10] 胡艳羽, 赵龙, 董祥军.
一种用于癌症分类的两阶段深度特征选择提取算法
Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification
计算机科学, 2022, 49(7): 73-78. https://doi.org/10.11896/jsjkx.210500092
[11] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[12] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[13] 周慧, 施皓晨, 屠要峰, 黄圣君.
基于主动采样的深度鲁棒神经网络学习
Robust Deep Neural Network Learning Based on Active Sampling
计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044
[14] 苏丹宁, 曹桂涛, 王燕楠, 王宏, 任赫.
小样本雷达辐射源识别的深度学习方法综述
Survey of Deep Learning for Radar Emitter Identification Based on Small Sample
计算机科学, 2022, 49(7): 226-235. https://doi.org/10.11896/jsjkx.210600138
[15] 王君锋, 刘凡, 杨赛, 吕坦悦, 陈峙宇, 许峰.
基于多源迁移学习的大坝裂缝检测
Dam Crack Detection Based on Multi-source Transfer Learning
计算机科学, 2022, 49(6A): 319-324. https://doi.org/10.11896/jsjkx.210500124
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!