计算机科学 ›› 2012, Vol. 39 ›› Issue (Z6): 257-260.

• • 上一篇    下一篇

基于K近邻的新话题热度预测算法

聂恩伦,陈黎,王亚强,秦湘清,金宇,于中华   

  1. (四川大学计算机学院 成都 610065)
  • 出版日期:2018-11-16 发布日期:2018-11-16

Algorithm for Prediction of New Topic's Hotness Using the K-nearest Neighbors

  • Online:2018-11-16 Published:2018-11-16

摘要: 随着互联网的快速发展,网络舆情成为政府部门和企业以及社会大众关注的焦点,对网络奥情进行有效监管和正确引导是当前巫待解决的问题,话题热度预测是典情监管和引导的基础。针对现有算法无法对新话题的热度进行有效预测的缺点,提出了一种基于K近部的新话题热度预测算法。该算法利用与新话题相似的历史话题的点击数时间序列来对新话题的热度进行预测。实验结果表明,在允许相对误差分别低于1000,20%和30%的情况下,算法预测的前3天点击数的平均正确率分别为47.2600,61%和67. 7"0,点击数变化趋势平均正确率达到73. 73 0 o,这也说明了相似的话题在话题出现的初期具有近似的热度变化趋势。

关键词: 热度预测,新话题,K-近邻算法,话题相似性,网络奥情

Abstract: With the rapid development of the Internet, the government, enterprises and public have paid more and more attentions on net mediated public sentiment. How to effectively monitor and aright guide the public sentiment on the Internet has become an issue that should be coped urgently with. As a basis to solving the issue, it is necessary to have ability of predicting topic's hotness appearing on the Internet As traditional algorithms could not predict aright new topie's hotness,a novel algorithm based on K-nearest neighbors(K-NN) was proposed in this paper. The algorithm prediets the hotness of new topics by using hotness times series of their historical similar topics. The experimental results show that the average accuracies of the hotness prediction during the first 3 days arc 47. 26 0 0,61 0 0 and 67. 7 0 0 rcspcclively with the corresponding relative errors being less than 10 0 o,20"o and 30"0,and the average accuracy of the hotness trends within the first 3 days could be up to 73. 73 0 o. Meanwhile, the results also demonstrate that similar topics approximately have same hotness trends in their early developing stages.

Key words: Hotness prediction, New topic, KNN, Topic similarity, Net mediated public sentiment

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!