计算机科学 ›› 2015, Vol. 42 ›› Issue (11): 248-250.doi: 10.11896/j.issn.1002-137X.2015.11.050

• 人工智能 • 上一篇    下一篇

一种改进的无偏节点标签预测方法研究

俞刚,张泉方   

  1. 浙江大学软件学院 杭州310027,浙江大学计算机科学与技术学院 杭州310027
  • 出版日期:2018-11-14 发布日期:2018-11-14
  • 基金资助:
    本文受国家自然科学基金(61170306),浙江省卫生厅项目(2012KYA123)资助

Improved Unbiased Node Label Prediction Algorithm

YU Gang and ZHANG Quan-fang   

  • Online:2018-11-14 Published:2018-11-14

摘要: 在社会网络中,用户的位置和属性以及图片的标签预测等都具有广泛的应用前景。为了提高标签预测的性能,提出了一种改进的无偏节点标签预测算法。首先,对社会网络中的标签预测问题进行了形式化描述。其次,基于所有观察数据的训练目标的联合概率最大化与以这些数据为条件的单变量边缘预测值的不匹配现象,提出了一种改进的图模型训练方法。最后,通过对置信度的无偏估计,基于子图方法提出一种不包含额外标签数据的无偏算法用于模型的训练。在Twitter和Pokec数据集上的实验表明,提出的算法与相关的标签预测算法相比,其准确性和运行效率都得到了明显的提升。

关键词: 社会网络,标签预测,无偏估计,图模型

Abstract: In social networks,predictions of attributes and locations of users and labels of images are extensively applied in many fields.In order to improve the performance of label prediction,this paper proposed an improved unbiased node label prediction algorithm.Firstly,we formalized the label prediction problem in social networks.Secondly,based on the mismatch of the maximization of joint likelihood of training objective under all observed labels and the single variable marginal prediction scores conditioned by the observed labels,we proposed an improved graphical model training algorithm.Finally,according to the unbiased estimation of confidence,we proposed a training model not including additional labels based on sub-graph method.Experiments on the Twitter and Pokec datasets show that,compared with related works,the proposed algorithm has better accuracy and execution efficiency while predicting labels.

Key words: Social networks,Labeling prediction,Unbiased estimation,Graphical model

[1] Sadilek A,Kautz H,Bigham J P.Finding your friends and following them to where you are[C]∥Proceedings of the fifth ACM international conference on Web search and data mining.ACM,2012:723-732
[2] 张子柯.社会化标签系统的结构,演化和功能[J].上海理工大学学报,2011,33(5):445-451 Zhang Zi-ke.Social tagging systems:structure,dynamics and function[J].Journal of University of Shanghai For Science and Technology,2011,3(5):445-451
[3] Weinsberg U,Bhagat S,Ioannidis S,et al.BlurMe:Inferring and obfuscating user gender based on ratings[C]∥Proceedings of the Sixth ACM Conference on Recommender Systems.ACM,2012:195-202
[4] Hong L,Ahmed A,Gurumurthy S,et al.Discovering geogra-phical topics in the twitter stream[C]∥Proceedings of the 21st International Conference on World Wide Web.ACM,2012:769-778
[5] Mahmud J,Nichols J,Drews C.Where Is This Tweet From? Inferring Home Locations of Twitter Users[C]∥ICWSM.2012:511-514
[6] 魏建良,朱庆华.基于社会化标注的个性化推荐研究进展 [J].情报学报,2010,29(4):625-633 Wei Jian-liang,Zhu Qing-hua.Advances in Personalized Information Recommendation Based on Social Tagging[J].Journal of The China Society for Scientific and Technical Information,2010,9(4):625-633
[7] 吴超,周波.基于复杂网络的社会化标签分析[J].浙江大学学报(工学版),2010,44(11):2194-2197 Wu Chao,Zhou Bo.Complex network analysis of tag as a social network[J].Journal of Zhejiang University(Engineering Science),2010,4(11):2194-2197
[8] Sen P,Namata G,Bilgic M,et al.Collective classification in network data[J].AI magazine,2008,29(3):93-106
[9] Azran A.The rendezvous algorithm:Multiclass semi-supervised learning with markov random walks[C]∥Proceedings of the 24th International Conference on Machine Learning.ACM,2007:49-56
[10] 廖志芳,李玲,刘丽敏,等.三部图张量分解标签推荐算法[J].计算机学报,2012,35(12):2625-2632 Liao Zhi-fang,Li Ling,Liu Li-min,et al.A Tripartite Decomposition of Tensor for Social Tagging[J].Chinese Journal of Computers,2012,5(12):2625-2632
[11] Talukdar P P,Reisinger J,Pas,ca M,et al.Weakly-supervised acquisition of labeled class instances using graph random walks[C]∥Proceedings of the Conference on Empirical Methods in Natural Language Processing.Association for Computational Linguistics,2008:582-590
[12] 袁柳,张龙波.基于概率主题模型的标签预测[J].计算机科学,2011,38(7):175-180 Yuan Liu,Zhang Long-bo.Social Tag Predication Based on Probabilistic Topic Model[J].Computer Science,2011,8(7):175-180
[13] Bhagat S,Cormode G,Muthukrishnan S.Node classification insocial networks[M]∥Social network data analytics.Springer US,2011:115-148
[14] Chaudhari G.High confidence predictions in social networks[D].IIT Bombay,2013
[15] Java A,Song X,Finin T,et al.Why we twitter:understanding microblogging usage and communities[C]∥Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 Workshop on Web Mi-ning and Social Network Analysis.ACM,2007:56-65
[16] Bae S H,Halperin D,West J,et al.Scalable Flow-Based Community Detection for Large-Scale Network Analysis[C]∥2013 IEEE 13th International Conference on Data Mining Workshops (ICDMW).IEEE,2013:303-310

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!