Computer Science ›› 2016, Vol. 43 ›› Issue (9): 66-70. doi: 10.11896/j.issn.1002-137X.2016.09.012

• The 3rd CCF Big Data Academic Conference, 2015 •

Research on a Novel Ranking Algorithm for Microblog User Influence Based on MapReduce

XU Wen-tao, LIU Feng and ZHU Er-zhou

  1. School of Computer Science and Technology, Anhui University, Hefei 230601
  • Online: 2018-12-01 Published: 2018-12-01
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61300169)

Abstract: With its instant publishing, real-time propagation, and ease of use, microblogging has gradually become the most mainstream self-media platform. Evaluating user influence is a basic yet important problem in microblog social networks, and it matters for optimizing and promoting the spread of information in society. Taking Sina Weibo as the experimental subject, this paper considers both the characteristics of the microblog user relationship network and user behavior and, building on the MapReduce programming model, proposes QRank, a new MapReduce-based algorithm for ranking user influence. Experimental results on the Hadoop platform show that QRank scales well and effectively combines the user relationship network with behavioral features, so that it reflects users' actual influence more faithfully and more fully.

Key words: PageRank algorithm, MapReduce, user influence, Hadoop platform
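
The abstract does not give the QRank formula, so the following is only a minimal sketch of the baseline it builds on: plain PageRank over a follower graph, written as a map phase and a reduce phase and run in memory rather than on Hadoop. The function names, the damping factor 0.85, and the sample graph are illustrative assumptions, not taken from the paper. User behavior, which QRank is said to incorporate, is not modeled here.

```python
from collections import defaultdict

DAMPING = 0.85  # conventional PageRank damping factor (assumption, not from the paper)


def map_phase(graph, ranks):
    """Map step: each user splits its current rank evenly over the users it follows."""
    for user, followees in graph.items():
        if not followees:
            continue  # users who follow nobody emit nothing (their rank mass is dropped)
        share = ranks[user] / len(followees)
        for followee in followees:
            yield followee, share


def reduce_phase(pairs, users):
    """Reduce step: sum the contributions per user, then apply the damping formula."""
    totals = defaultdict(float)
    for user, value in pairs:
        totals[user] += value
    n = len(users)
    return {u: (1 - DAMPING) / n + DAMPING * totals[u] for u in users}


def pagerank(graph, iterations=20):
    """Run a fixed number of map/reduce rounds, starting from a uniform rank."""
    users = sorted(graph)
    ranks = {u: 1.0 / len(users) for u in users}
    for _ in range(iterations):
        ranks = reduce_phase(map_phase(graph, ranks), users)
    return ranks


# Hypothetical follower graph: key follows every user in its list.
example_graph = {"a": ["c"], "b": ["c"], "c": ["a"]}
```

Calling `pagerank(example_graph)` ranks `c` highest, since both other users follow it, and `b` lowest, since nobody follows it. On a real cluster each phase would be a distributed job and the rank table would be written out between iterations.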

