计算机科学 ›› 2018, Vol. 45 ›› Issue (5): 185-189.doi: 10.11896/j.issn.1002-137X.2018.05.031

• 人工智能 • 上一篇    下一篇

基于生存分析的GPS轨迹缺失规律挖掘

郑剑炜,顾晶晶,庄毅   

  1. 南京航空航天大学计算机科学与技术学院 南京211100,南京航空航天大学计算机科学与技术学院 南京211100,南京航空航天大学计算机科学与技术学院 南京211100
  • 出版日期:2018-05-15 发布日期:2018-07-25
  • 基金资助:
    本文受国家自然科学基金面上项目(61572253),航空基金项目(2016ZC52030)资助

Pattern Mining of Missing GPS Trajectory Based on Survival Analysis

ZHENG Jian-wei, GU Jing-jing and ZHUANG Yi   

  • Online:2018-05-15 Published:2018-07-25

摘要: 近年来,智能交通系统(Intelligent Transportation Systems,ITS)已成为提高交通系统性能和增强出行安全性的有效方式。但随着系统数据量的增加,数据缺失问题日益严重,其中由于车载GPS信号丢失导致的轨迹数据缺失是主要的研究问题之一。引起GPS轨迹缺失的原因的多样性造成数据补全工作困难,且至今很少有关于轨迹缺失规律的研究。针对GPS信号丢失原因多样化的问题,基于大量真实数据,首次将生存分析应用于数据缺失领域,提出了基于生存分析的GPS轨迹缺失规律挖掘模型(Survival Analysis-Missing Trajectory Pattern Mining,SA-MTPM)。首先通过生存函数描述信号丢失时长与丢失原因的关系,然后利用Cox回归模型分析信号丢失的关键因素。使用上海市强生出租车公司一个月内13666辆车的数据进行实验,结果表明GPS轨迹缺失存在一定规律,据此可以方便地对信号丢失事件进行识别分类,为进一步对大数据进行研究提供了参考。

关键词: 轨迹缺失,信号丢失,生存分析,规律挖掘

Abstract: In recent years,intelligent transportation systems(ITS) has been an effective way to improve the traffic performance of transportation system and enhance the safety of travels.However,with the increase of data size in intelligent transportation system,the problem of data loss becomes increasingly serious.The trajectory data missing caused by vehicle-mounted GPS signal loss is one of the main research subjects.The reasons of GPS data missing are various,and they make the data completion difficult.However,there are few studies on the pattern of missing GPS trajectories.In this paper,based on large amounts of real data on diversification of GPS signal loss,the survival analysis was first applied into data missing field,and a survival analysis-missing trajectory pattern mining(SA-MTPM) model was proposed.The relationship between the length of signal loss and the regression causes of loss was described in the survival function,and the Cox model was used to analyze the key factors of signal loss.This paper performed experiments based on the GPS data of 13666 vehicles in Shanghai Qiangsheng Taxi Company for a month.The experimental results show that these signal loss events can be classified,which provides a further study for big data.

Key words: Track missing,Signal loss,Survival analysis,Pattern mining

[1] LV Y,DUAN Y,KANG W,et al.Traffic Flow Prediction With Big Data:A Deep Learning Approach[J].IEEE Transactions on Intelligent Transportation Systems,2015,16(2):865-873.
[2] FANASWALA M,KRISHNAMURTHY V.Detection of Ano-malous Trajectory Patterns in Target Tracking via Stochastic Context-Free Grammars and Reciprocal Process Models[J].IEEE Journal of Selected Topics in Signal Processing,2013,7(1):76-90.
[3] RUBIN D B.Inference and missing data[J].Biometrika,1976,63(3):581-592.
[4] SCHAFER J L,GRAHAM J W.Missing data:our view of the state of the art[J].Psychological Methods,2002,7(2):147-177.
[5] OMMERET D.Testing the mechanism of missing data[EB/OL].[2017-03-23].https://hal.archives-ouvertes.fr/hal-00669339.
[6] SUN J,JIN Y J,DAI M F.Discussion on the Test Method of Data Deletion Mechanism [J].Mathematics in Practice and Theo-ry,2013,43(12):166-173.(in Chinese) 孙婕,金勇进,戴明锋.关于数据缺失机制的检验方法探讨[J].数学的实践与认识,2013,43(12):166-173.
[7] SHAN M,WORRALL S,NEBOT E.Probabilistic Long-Term Vehicle Motion Prediction and Tracking in Large Environments[J].IEEE Transactions on Intelligent Transportation Systems,2013,14(2):539-552.
[8] LI L,LI Y,LI Z.Efficient missing data imputing for traffic flow by considering temporal and spatial dependence[J].Transportation Research Part C Emerging Technologies,2013,34(9):108-120.
[9] ASIF M T,DAUWELS J,GOH C Y,et al.Spatiotemporal Patterns in Large-Scale Traffic Speed Prediction[J].IEEE Transactions on Intelligent Transportation Systems,2014,15(2):794-804.
[10] ASIF M T,MITROVIC N,GARG L,et al.Low-dimensionalmodels for missing data imputation in road networks [C]∥International Conference on Acoustics,Speech and Signal Proces-sing.IEEE,2013:3527-3531.
[11] SCHNFELDER S,AXHAUSEN K.Analysing the rhythms of travel using survival analysis[C]∥Transport Research Board (TRB) 2001 Annual Meeting.2000.
[12] MAY M,KRNER C,HECKER D,et al.Handling missing va-lues in GPS surveys using survival analysis:a GPS case study of outdoor advertising[C]∥International Workshop on Data Mi-ning & Audience Intelligence for Advertising.ACM,2009.
[13] WANG G N.Spatial-Temporal Data Mining Based on GPS Trajectory and Geo-Tagged Photo Trajectory [D].Changsha:Central South University,2013.(in Chinese) 王冠男.基于GPS轨迹和照片轨迹的时空数据挖掘[D].长沙:中南大学,2013.
[14] HUAN M,YANG X B,JIA B.Red-Light Running Behavior of Non-Motor Vehicles Based on Survival Analysis [J].Transactions of Beijing Institute of Technology,2013,33(8):815-819.(in Chinese) 环梅,杨小宝,贾斌.基于生存分析方法的非机动车闯红灯行为研究[J].北京理工大学学报,2013,33(8):815-819.
[15] SUN J,ZHANG J.Survival Analyses of Traffic Flow Break-down at Urban Expressway Bottlenecks [J].Journal of Tongji University(Natural Science),2013,41(4):530-535.(in Chinese) 孙剑,张娟.城市快速路瓶颈交通流失效生存分析[J].同济大学学报自然科学版,2013,41(4):530-535.
[16] CHEUNG T T,POON R T,YUEN W K,et al.Long-term survival analysis of pure laparoscopic versus open hepatectomy for hepatocellular carcinoma in patients with cirrhosis:a single-center experience[J].Annals of Surgery,2013,257(3):506.
[17] HAMMOUCHE S,CLARK S,WONG A H,et al.Long-term survival analysis of atypical meningiomas:survival rates,prognostic factors,operative and radiotherapy treatment [J].Acta Neurochirurgica,2014,156(8):1475-1481.
[18] MUSCI R J,FAIRMAN B,MASYN K E,et al.Polygenic Score×Intervention Moderation:an Application of Discrete-Time Survival Analysis to Model the Timing of First Marijuana Use Among Urban Youth[J].Prevention Science,2015,7(1):1-9.
[19] XIE K,NING X,WANG X,et al.Recover Corrupted Data in Sensor Networks:a Matrix Completion Solution[J].IEEE Transactions on Mobile Computing,2017,PP(99):1.
[20] 彭非,王伟.生存分析[M].北京:中国人民大学出版社,2004.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!