Computer Science ›› 2023, Vol. 50 ›› Issue (10): 135-145.doi: 10.11896/jsjkx.230700127

• Artificial Intelligence • Previous Articles     Next Articles

Co-Forecasting for Multi-modal Traffic Flow Based on Graph Contrastive Learning

XIAO Yang1, QIN Jianyang1, LI Kenli2, WANG Ge3, LI Rui4, LIAO Qing1,5   

  1. 1 School of Computer Science and Technology,Harbin Institute of Technology(Shenzhen),Shenzhen,Guangdong 518055,China
    2 College of Computer Science and Electronic Engineering,Hunan University,Changsha 410082,China
    3 School of Computer Science and Technology,Xi'an Jiaotong University,Xi'an 710049,China
    4 School of Computer Science and Technology,Xidian University,Xi'an 710071,China
    5 Pengcheng Laboratory,Shenzhen,Guangdong 518000,China
  • Received:2023-07-18 Revised:2023-09-18 Online:2023-10-10 Published:2023-10-10
  • About author:XIAO Yang,born in 2001,postgra-duate.Her main research interests include artificial intelligence and spatio-temporal data mining.LIAO Qing,born in 1988,Ph.D,professor.Her research interests include artificial intelligence and data mining.
  • Supported by:
    National Key Research and Development Program of China(2020YFB2104003).

Abstract: An accurate traffic flow prediction in urban areas is of important significance to provide guidance for urban vehicle scheduling and public transportation system optimization.So far,most existing traffic flow prediction methods only consider a single type of traffic flow prediction in a regular grid area,ignoring the spatial irregularity and heterogeneity in the traffic network and the interactivity among different kinds of traffic flow.To address these problems,this paper proposes a co-forecasting method for multi-modal traffic flow based on graph contrastive learning,named CoF-MGCL,so as to reveal the effect of the interaction among various traffic flows on the traffic demand in irregular and heterogeneous areas.Specifically,this paper collects multi-modeltraffic data,including the individual and total traffic flow of various travel types(e.g.,the traffic flow of bike and taxi);then,constructs a heterogeneous graph with multiple relations,including geographical proximity and functional similarity relations,for irregular areas.By using a heterogeneous graph coding module,this paper can fuse multiple relations in a heterogeneous graph to learn high-quality representations for various traffic flows in different areas.The learned representations of each individual traffic flow are integrated via an attention mechanism,which is compared with the representation of total traffic flow via a graph con-trastive learning,so as to capture the interactive correlation among different traffic flows.Finally,this paper introduces a mutual information regularization for multi-modal traffic flow co-forecasting,maximizing multi-modal information learning.To achieve multi-modal traffic flow forecasting in irregular areas,two new multi-modal traffic flow datasets for the Manhattan Borough of New York and Chicago have been constructed and used for experiments.Experimental results demonstrate that the proposed method can be combined with existing uni-modal traffic flow forecasting methods to obtain 0.60%~12.13% performance gains in terms of root-mean-square error(RMSE) and mean-absolute error(MAE),verifying the effectiveness of the proposed method.

Key words: Traffic flow forecasting, Multi-modal, Heterogeneous graph representation, Graph contrastive learning, Mutual information

CLC Number: 

  • TP183
[1]FU R,ZHANG Z,LI L.Using LSTM and GRU neural network methods for traffic flow prediction[C]//2016 31st Youth Academic Annual Conference of Chinese Association of Automation.2016:324-328.
[2]ZHANG J,ZHENG Y,QI D.Deep spatio-temporal residual networks for citywide crowd flows prediction[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2017:1655-1661.
[3]LIANG Y,OUYANG K,SUN J,et al.Fine-grained urban flow prediction[C]//Proceedings of the Web Conference 2021.2021:1833-1845.
[4]OKUTANI I,STEPHANEDES Y J.Dynamic prediction of traffic volume through Kalman filtering theory[J].Transportation Research Part B:Methodological,1984,18(1):1-11.
[5]WU C H,HO J M,LEE D T.Travel-time prediction with sup-port vector regression[J].IEEE transactions on Intelligent Transportation Systems,2004,5(4):276-281.
[6]VAN LINT J W C,VAN HINSBERGEN C.Short-term traffic and travel time prediction models[J].Artificial Intelligence Applications to Critical Transportation Issues,2012,22(1):22-41.
[7]ZHAO W,GAO Y,JI T,et al.Deep temporal convolutional networks for short-term traffic flow forecasting[J].IEEE Access,2019,7:114496-114507.
[8]ZHANG C,PATRAS P.Long-term mobile traffic forecastingusing deep spatio-temporal neural networks[C]//Proceedings of the Eighteenth ACM International Symposium on Mobile Ad Hoc Networking and Computing.2018:231-240.
[9]YAO H,TANG X,WEI H,et al.Revisiting spatial-temporalsimilarity:A deep learning framework for traffic prediction[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:5668-5675.
[10]WU Z,PAN S,LONG G,et al.Graph wavenet for deep spatial-temporal graph modeling[J].arXiv:1906.00121,2019.
[11]YU B,YIN H,ZHU Z.Spatio-temporal graph convolutional networks:A deep learning framework for traffic forecasting[C]//Proceedings of the International Joint Conference on Artificial Intelligence.2018:3634-3640.
[12]GUO K,HU Y,SUN Y,et al.Hierarchical graph convolutionnetwork for traffic forecasting[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021,35(1):151-159.
[13]GUO S,LIN Y,FENG N,et al.Attention based spatial-temporal graph convolutional networks for traffic flow forecasting[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:922-929.
[14]XING D,ZHAO C,WANG G.A Spatial-Temporal AttentionMulti-Graph Convolution Network for Ride-Hailing Demand Prediction Based on Periodicity with Offset[J].arXiv:2203.12505,2022.
[15]GENG X,LI Y,WANG L,et al.Spatiotemporal multi-graphconvolution network for ride-hailing demand forecasting[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:3656-3663.
[16]SUTSKEVER I,VINYALS O,LE Q V.Sequence to sequence learning with neural networks[C]//Proceedings of the Advances in Neural Information Processing Systems.2014.
[17]SONG C,LIN Y,GUO S,et al.Spatial-temporal synchronous graph convolutional networks:A new framework for spatial-temporal network data forecasting[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2020:914-921.
[18]ZHENG C,FAN X,WANG C,et al.Gman:A graph multi-attention network for traffic prediction[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2020:1234-1241.
[19]WU Z,PAN S,LONG G,et al.Connecting the dots:Multiva-riate time series forecasting with graph neural networks[C]//Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery \& Data Mining.2020:753-763.
[20]CHEN C,LIU Y,CHEN L,et al.Bidirectional spatial-temporal adaptive transformer for Urban traffic flow forecasting[J/OL].IEEE Transactions on Neural Networks and Learning Systems,2022:1-13. https://ieeexplore.ieee.org/document/9810964.
[21]LI F,FENG J,YAN H,et al.Dynamic graph convolutional recurrent network for traffic prediction:Benchmark and solution[J].ACM Transactions on Knowledge Discovery from Data,2023,17(1):1-21.
[1] ZHU Yuying, GUO Yan, WAN Yizhao, TIAN Kai. New Word Detection Based on Branch Entropy-Segmentation Probability Model [J]. Computer Science, 2023, 50(7): 221-228.
[2] CUI Bingjing, ZHANG Yipu, WANG Biao. Multimodal Data Fusion Algorithm Based on Hypergraph Regularization [J]. Computer Science, 2023, 50(6): 167-174.
[3] ZOU Yunzhu, DU Shengdong, TENG Fei, LI Tianrui. Visual Question Answering Model Based on Multi-modal Deep Feature Fusion [J]. Computer Science, 2023, 50(2): 123-129.
[4] HUANG Zenan, LIU Xiaojie, ZHAO Chenhui, DENG Yabin, GUO Donghui. Spiking Neural Network Model for Brain-like Computing and Progress of Its Learning Algorithm [J]. Computer Science, 2023, 50(1): 229-242.
[5] ZHOU Xu, QIAN Sheng-sheng, LI Zhang-ming, FANG Quan, XU Chang-sheng. Dual Variational Multi-modal Attention Network for Incomplete Social Event Classification [J]. Computer Science, 2022, 49(9): 132-138.
[6] LIU Chuang, XIONG De-yi. Survey of Multilingual Question Answering [J]. Computer Science, 2022, 49(1): 65-72.
[7] ZHOU Xin-min, HU Yi-gui, LIU Wen-jie, SUN Rong-jun. Research on Urban Function Recognition Based on Multi-modal and Multi-level Data Fusion Method [J]. Computer Science, 2021, 48(9): 50-58.
[8] WU A-ming, JIANG Pin, HAN Ya-hong. Survey of Cross-media Question Answering and Reasoning Based on Vision and Language [J]. Computer Science, 2021, 48(3): 71-78.
[9] WANG Shu-hui, YAN Xu, HUANG Qing-ming. Overview of Research on Cross-media Analysis and Reasoning Technology [J]. Computer Science, 2021, 48(3): 79-86.
[10] LYU Ming-qi, HONG Zhao-xiong, CHEN Tie-ming. Traffic Flow Forecasting Method Combining Spatio-Temporal Correlations and Social Events [J]. Computer Science, 2021, 48(2): 264-270.
[11] WANG Li-fang, WANG Rui-fang, LIN Su-zhen, QIN Pin-le, GAO Yuan, ZHANG Jin. Multimodal Medical Image Fusion Based on Dual Residual Hyper Densely Networks [J]. Computer Science, 2021, 48(2): 160-166.
[12] LI Hao, WANG Fei, XIE Si-yu, KOU Yong-qi, ZHANG Lan, YANG Bing, KANG Yan. Dual Autoregressive Components Traffic Prediction Based on Improved Graph WaveNet [J]. Computer Science, 2021, 48(11A): 159-165.
[13] LEI Yang, JIANG Ying. Anomaly Judgment of Directly Associated Nodes Under Cloud Computing Environment [J]. Computer Science, 2021, 48(1): 295-300.
[14] CHEN Jie-ting, WANG Wei-ying, JIN Qin. Multi-label Video Classification Assisted by Danmaku [J]. Computer Science, 2021, 48(1): 167-174.
[15] DENG Yi-jiao, ZHANG Feng-li, CHEN Xue-qin, AI Qing, YU Su-zhe. Collaborative Attention Network Model for Cross-modal Retrieval [J]. Computer Science, 2020, 47(4): 54-59.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!