计算机科学 ›› 2015, Vol. 42 ›› Issue (5): 42-46.doi: 10.11896/j.issn.1002-137X.2015.05.008

• 2014' 数据挖掘会议 • 上一篇    下一篇

基于领域划分的微博用户影响力分析

刘金,吴 斌,陈 震,沈崇玮   

  1. 北京邮电大学计算机学院 北京100876,北京邮电大学计算机学院 北京100876,北京邮电大学计算机学院 北京100876,北京邮电大学计算机学院 北京100876
  • 出版日期:2018-11-14 发布日期:2018-11-14
  • 基金资助:
    本文受国家重点基础研究发展计划(973)(2013CB329606),国家自然科学基金项目(71231002)资助

Research on Influence of Micro Blogging Based on Field Division

LIU Jin-long, WU Bin, CHEN Zhen and SHEN Chong-wei   

  • Online:2018-11-14 Published:2018-11-14

摘要: 近年来微博作为一种新兴的社交网络逐渐被广大用户使用。微博信息简短、更新迅速、包含信息量大,给微博用户获取信息带来了诸多不便,因此,利用影响力分析的手段找到具有较大影响力的微博用户具有重大意义。微博内容较传统的媒体信息具有较强的时效性和权威性,同时微博用语也极其不规范,这给微博用户影响力的分析带来了极大的困难。首先对获取的微博用户信息进行领域的划分,采用基于微博内容和用户关注的方式将用户归类到其所属的领域。其中,采用新词发现以及特征扩展的方法来提高划分结果的准确性。然后,对各个领域的用户进行影响力分析,提出3种影响力传播模型,用户最终的影响力大小根据3种模型的结果进行加权计算。最后对实验结果进行分析、比较,证明了计算用户影响力的方法能取得较优的结果。

关键词: 新浪微博,领域划分,影响力,文本分析,新词发现

Abstract: In recent years,as an emerging social network,Microblog is being used by a lot of users.The micro blogging platform contains a large amount of information and the update speed of the information is fast so that it often makes users can not find the information they need.Then,it is important to help users find the information which is sent by people who have a great influence.The content that the micro blogging platform publishes and updates is very fast,and the content is not standardized,so that the timeliness and authoritativeness tend to be more important.The purpose of research on influence of micro blogging users in this paper is to identify the influence of different users in different areas.This paper divided the users into different areas by using two different features which are the content of the micro blogging and the concerned relationship of the users.During this,we used the parallel new word recognition algorithm in the micro blogging content and semantic extensions on some important text feature to improve accuracy of the dividing result.Then we computed the influence of the users in all the classification using three models,and combined the result in different weight.At last,we tested the accuracy of the result and compared it with other ways.

Key words: Sina microblog,Field division,Influence,Text analysis,New word recognition

[1] Wu Shao-mei,Hofman J M.Who says what to whom on Twitter[C]∥Proc.of the 20th Int.Conf.on World Wide Web (2011).Hyderabad,India,2011:705-714
[2] Page L,Brin S,Motwani R,et al.The PageRank citation ranking:bringing order to the Web.http://ilpubs.stanford.edu:8090/4221
[3] He X,Li Y,Fan C.Web-based links and authoritative content Pagerank Improvement[C]∥2010 International Conference on E-Business and E-Government (ICEE).IEEE,2010:5016-5019
[4] Zhang J,Ackerman M,Adamic L.Expertise networks in online communities:structure and algorithms[C]∥Proc.of the 16th Int.Conf.on World Wide Web (2007).2007:221-230
[5] Tang J,Sun J,Wang C,et al.Social Influence Analysis in Large-scale Networks[C]∥Proc.of the 15th Int.Conf.on Knowledge Discovery and Data Mining (SIGKDD 2009).Paris,France,2009:807-816
[6] 田军伟.基于社会网络的用户兴趣模型研究[D].成都:电子科技大学,2010
[7] Sharifi B,Hutton M-A,Kalita J K.Experiments in Microblog Summarization[C]∥IEEE Second International Conference on Social Computing.2010:49-56
[8] Yang Y.Expert network:Effective and efficient learning from human decisions in text categorization and retrieval[C]∥Proceedings of the 17th Annual International ACM SIGIR Confe-rence on Research and Development in Information Retrieval (1994).1994:13-22
[9] Cortes C,Vapnik V.Support vector networks[J].MachineLearning,1995,20:273-297
[10] Feng Hao-di,Chen Kang,Deng Xiao-tie,et al.Accessor Variety Criteria for Chinese Word Extraction[J].Computational Linguistics,2004,30(1):75-93
[11] 沈崇玮.基于微博数据的用户影响力分析研究[D].北京:北京邮电大学,2013

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!