计算机科学 ›› 2016, Vol. 43 ›› Issue (10): 214-219.doi: 10.11896/j.issn.1002-137X.2016.10.041

• 人工智能 • 上一篇    下一篇

基于微博的时空事件识别研究

郑喆君,金蓓弘,崔艳玲   

  1. 中国科学院软件研究所 北京100190 中国科学院大学 北京100190,中国科学院软件研究所 北京100190 中国科学院大学 北京100190,中国科学院软件研究所 北京100190 中国科学院大学 北京100190
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受国家自然科学基金(61472408,61372182)资助

Study on Recognition of Spatial-Temporal Events Based on Microblogs

ZHENG Zhe-jun, JIN Bei-hong and CUI Yan-ling   

  • Online:2018-12-01 Published:2018-12-01

摘要: 微博是一种社交网络服务,它主要基于用户的关注关系进行信息分享和传播,具有时效性强、传播迅速等特点。将微博看成是反映城市动态的一类感知器,从识别微博的主题入手,检测微博中反映的时空事件。为此,首先提出了一种用于分析微博主题的主题模型ST-LDA,并应用该模型将具有语义相似性、时空聚集性的微博归属于同一主题下;然后给出了从主题中检测时空事件的方法。基于真实的新浪微博数据进行实验,结果表明此方法比基于LDA的方法、基于TimeLDA的方法在事件识别上有更高的查全率和查准率。

关键词: 微博,时空事件,主题模型

Abstract: As a kind of social networking service,microblog service can share and broadcast information mainly through the microbloggers’ followers and features strong timeliness of topics and rapid spread.This paper viewed the microblogs as a kind of event sensor which can perceive the dynamic behaviors in the city,and started with identifying the to-pics in microblogs,and then detecting the spatial-temporal events in microblogs.This paper presented a topic model named ST-LDA for analyzing the topics in microblogs.Applying this model,the microblogs with similar semantics and close spatial-temporal nature can be classified into a same topic.Then,this paper gave a method of discovering the spatial-temporal events from topics. Experimental results based on the real data from weibo.com show that our method has a higher recall and precision ratio than LDA-based and TimeLDA-based methods.

Key words: Microblogs,Spatial-temporal events,Topic model

[1] Zhang W,Qi G,Pan G,et al.City-scale Social Event Detection and Evaluation with Taxi Traces[J].ACM TIST,2015,6(3):1-20
[2] Du R,Yu Z,Mei T,et al.Predicting activity attendance in event-based social networks:content,context and social influence[C]∥UbiComp.2014:425-434
[3] Lee R,Wakamiya S,Sumiya K.Discovery of unusual regional social activities using geo-tagged microblogs[J].World Wide Web,2011,14(4):321-349
[4] Sakaki T,Okazaki M,Matsuo Y.Earthquake shakes Twitterusers:real-time event detection by social sensors[C]∥Procee-dings of the 19th International Conference on World Wide Web.ACM,2010:851-860
[5] Sankaranarayanan J,Samet H,Teitler B E,et al.Twitterstand:news in tweets[C]∥Proceedings of the 17th ACM Sigspatial International Conference on Advances in Geographic Information Systems.ACM,2009:42-51
[6] Becker H,Naaman M,Gravano L.Learning similarity metricsfor event identification in social media[C]∥Proceedings of the Third ACM International Conference on Web Search and Data Mining.ACM,2010:291-300
[7] Becker H,Naaman M,Gravano L.Beyond Trending Topics:Real-World Event Identification on Twitter[J].ICWSM,2011,11:438-441
[8] Blei D M,Ng A Y,Jordan M I,et al.Latent Dirichlet allocation[J].Journal of Machine Learning Research,2003,3:993-1022
[9] Blei D M.Introduction to Probabilistic Topic Models[J].Signal Processing Magazine IEEE,2011,27(6):55-65
[10] Ramage D,Dumais S T,Liebling D J.Characterizing Microblogs with Topic Models[J].ICWSM,2010,5(4):130-137
[11] Wang X,McCallum A.Topics over time:a non-Markov continuous-time model of topical trends[C]∥Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Disco-very and Data Mining.ACM,2006:424-433
[12] Diao Q,Jiang J,Zhu F,et al.Finding bursty topics from microblogs[C]∥Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics:Long Papers-Volume 1,Association for Computational Linguistics.2012:536-544
[13] Heinrich G.Parameter estimation for text analysis[R].Technical Report,2004
[14] Xu Ge,Wang Hou-feng.The Development of Topic Models in Natural Language Processing[J].Chinese Journal of Computer,2011,34(8):1423-1436(in Chinese) 徐戈,王厚峰.自然语言处理中主题模型的发展[J].计算机学报,2011,34(8):1423-1436
[15] Shi Jing,Fan Meng,Li Wan-long.Topic Analysis Based on LDA Model[J].Acta Automatica Sinica,2009,35(12):1586-1592(in Chinese) 石晶,范猛,李万龙.基于LDA模型的主题分析[J].自动化学报,2009,35(12):1586-1592
[16] Duan Lian,Guo Wei,Zhu Xin-yan,et al.Constructing Spatio-Temporal Topic Model for Microblog Topic Retrieving[J].Geomatics and Information Science of Wuhan University,2014,39(2):210-213(in Chinese) 段炼,呙维,朱欣焰,等.基于时空主题模型的微博主题提取[J].武汉大学学报(信息科学版),2014,39(2):210-213
[17] Chen Wen-tao,Zhang Xiao-ming,Li Zhou-jun.Analysis of Topic Models on Modeling MicroBlog User Interestringness[J].Computer Science,2013,0(4):127-130(in Chinese) 陈文涛,张小明,李舟军.构建微博用户兴趣模型的主题模型的分析[J].计算机科学,2013,40(4):127-130

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!