Computer Science ›› 2019, Vol. 46 ›› Issue (4): 77-82.doi: 10.11896/j.issn.1002-137X.2019.04.012

• Big Data & Data Science • Previous Articles     Next Articles

Associated Users Mining Algorithm Based on Multi-information Fusion Representation Learning

HAN Zhong-ming1,2, ZHENG Chen-ye1, DUAN Da-gao1, DONG Jian3   

  1. School of Computer and Information Engineering,Beijing Technology and Business University,Beijing 100048,China1
    Beijing Key Laboratory of Food Safety Big Data Technology,Beijing 100048,China2
    The Third Research Institute of The Ministry of Public Security,The Ministry of Public Security Key Laboratory of Information Network Security,Shanghai 200031,China3
  • Received:2018-11-09 Online:2019-04-15 Published:2019-04-23

Abstract: With rapid development and popularization of Internet technologies,more and more users have begun to share and exchange various information through social networks.The same user in the network may apply for multiple diffe-rent accounts to distribute information,and these accounts constitute the associated users in the network.Effectively mining associated users in social networks can suppress false information and illegal behaviors in the network,and thus ensure the security and fairness of the network environment.Existing associated user mining methods only consider user attributes or user relationship information without merging multiple types of information contained in the network comprehensively.In addition,most methods draw lessons from the methods in other fields,such as de-anonymization,and they can’t accurately solve the problem of associated user mining.In light of this,this paper proposed an associated user mining algorithm based on multi-information fusion representation learning(AUMA-MRL).In this algorithm,the idea of network representation learning is utilized to learn various dimensional information in the networks,such as user attributes,network topology,etc.Then the learned multi-information is effectively fused to obtain multi-information node embedding,which can accurately characterize multiple types of information in networks,and mine associated users in networks through similarity vectors between node embedding.The proposed algorithm was validated on three real networks namely protein network PPI and social network Flickr,Facebook.In the experiment,the accuracy and recall rate is selected as the performance evaluation indexes.The results show that the recall rate of proposed algorithm is increased by 17.5% on average compared with the existing classical algorithms,and it can effectively mine associated users in networks.

Key words: Associated users, Node embedding, Representation learning, Social networks security

CLC Number: 

  • TP311
[1]ZHOU X P,LIANG X,ZHAO J C,et al.A Survey of Related User Mining Methods for Social Network[J].Journal of Software,2017,28(6):1565-1583.(in Chinese) 周小平,梁循,赵吉超,等.面向社会网络融合的关联用户挖掘方法综述[J].软件学报,2017,28(6):1565-1583.
[2]CAI J,STRUBE M.End-to-end coreference resolution via hy- pergraph partitioning[C]∥Proceedings of the 23rd InternationalConference on Computational Linguistics.Association for Computational Linguistics,2010:143-151.
[3]WANG J,LI G,YU J X,et al.Entity matching:How similar is similar[J] Proceedings of the VLDB Endowment,2011,4(10):622-633.
[4]KALASHNIKOVD V,CHEN Z Q,MEHROTRA S,et al.Web People Search via Connection Analysis[J].IEEE Transactions on Knowledge and Data Engineering,2008,20(11):1550-1565.
[5]QIAN Y,HU Y,CUI J,et al.Combining machine learning and human judgment in author disambiguation[C]∥Proceedings of the 20th ACM International Conference on Information and Knowledge Management.ACM,2011:1241-1246.
[6]TANG J,FONG A C M,WANG B,et al.A Unified Probabilistic Framework for Name Disambiguation in Digital Library[J].IEEE Transactions on Knowledge and Data Engineering,2012,24(6):975-987.
[7]LIU J,ZHANG F,SONG X,et al.What’s in a name? an unsupervised approach to link users across communities[C]∥ACM International Conference on Web Search and Data Mining.ACM,2013:495-504.
[8]ZAFARANI R,LIU H.Connecting users across social media sites:a behavioral-modeling approach[C]∥ACM SIGKDD International Conference on Knowledge Discovery and Data Mi-ning.ACM,2013:41-49.
[9]ZHANG H,KAN M Y,LIU Y,et al.Online Social Network Profile Linkage[M]∥Information Retrieval Technology.Springer International Publishing,2014:197-208.
[10]NARAYANAN A,SHMATIKOV V.De-anonymizing Social Networks[C]∥Security and Privacy IEEE Symposium.IEEE,2009:173-187.
[11]ZHOU X,LIANG X,ZHANG H,et al.Cross-Platform Identification of Anonymous Identical Users in Multiple Social Media Networks[J].IEEE Transactions on Knowledge and Data Engineering,2015,28(2):411-424.
[12]FU H,ZHANG A,XIE X.Effective social graph deanonymization based on graph structure and descriptive information[C]∥ACM Transactions on Intelligent Systems and Technology (TIST),2015,6(4):1-29.
[13]SINGH R,XU J B,BERGER B.Global alignment of multiple protein interaction networks with application to functional orthology detection[J].Proceedings of the National Academy of Sciences of the United States of America,2008,105(35):12763-12768.
[14]CAI H Y,ZHENG V W,CHANG K.A Comprehensive Survey of Graph Embedding:Problems,Techniques and Applications[J].IEEE Transactions on Knowledge and Data Engineering,2018,30(9):1616-1637.
[15]WANG D,CUI P,ZHU W.Structural Deep Network Embed- ding[C]∥Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.ACM,2016:1225-1234.
[16]KIPF T N,WELLING M.Semi-Supervised Classification with Graph Convolutional Networks[J].arXiv preprint arXiv:1609.02907,2016.
[17]HAMILTON W,YING Z,LESKOVEC J.Inductive representation learning on large graphs[M]∥Advances in Neural Information Processing Systems.Bertin:Springer,2017:1024-1034.
[18]ANDREW G,ARORA R,BILMES J,et al.Deep canonical correlation analysis[C]∥International Conference on Machine Learning.JMLR.org,2013:1247-1255.
[19]BURGES C J C.A Tutorial on Support Vector Machines for Pattern Recognition[J].Data Mining and Knowledge Discovery,1998,2(2):121-167.
[1] SONG Jie, LIANG Mei-yu, XUE Zhe, DU Jun-ping, KOU Fei-fei. Scientific Paper Heterogeneous Graph Node Representation Learning Method Based onUnsupervised Clustering Level [J]. Computer Science, 2022, 49(9): 64-69.
[2] HUANG Li, ZHU Yan, LI Chun-ping. Author’s Academic Behavior Prediction Based on Heterogeneous Network Representation Learning [J]. Computer Science, 2022, 49(9): 76-82.
[3] XU Yong-xin, ZHAO Jun-feng, WANG Ya-sha, XIE Bing, YANG Kai. Temporal Knowledge Graph Representation Learning [J]. Computer Science, 2022, 49(9): 162-171.
[4] LI Zong-min, ZHANG Yu-peng, LIU Yu-jie, LI Hua. Deformable Graph Convolutional Networks Based Point Cloud Representation Learning [J]. Computer Science, 2022, 49(8): 273-278.
[5] HUANG Pu, DU Xu-ran, SHEN Yang-yang, YANG Zhang-jing. Face Recognition Based on Locality Regularized Double Linear Reconstruction Representation [J]. Computer Science, 2022, 49(6A): 407-411.
[6] JIANG Zong-li, FAN Ke, ZHANG Jin-li. Generative Adversarial Network and Meta-path Based Heterogeneous Network Representation Learning [J]. Computer Science, 2022, 49(1): 133-139.
[7] WANG Ying-li, JIANG Cong-cong, FENG Xiao-nian, QIAN Tie-yun. Time Aware Point-of-interest Recommendation [J]. Computer Science, 2021, 48(9): 43-49.
[8] ZHAO Jin-long, ZHAO Zhong-ying. Recommendation Algorithm Based on Heterogeneous Information Network Embedding and Attention Neural Network [J]. Computer Science, 2021, 48(8): 72-79.
[9] YE Hong-liang, ZHU Wan-ning, HONG Lei. Music Style Transfer Method with Human Voice Based on CQT and Mel-spectrum [J]. Computer Science, 2021, 48(6A): 326-330.
[10] YANG Ru-han, DAI Yi-ru, WANG Jian, DONG Jin. Humans-Cyber-Physical Ontology Fusion of Industry Based on Representation Learning [J]. Computer Science, 2021, 48(5): 190-196.
[11] QIAN Sheng-sheng, ZHANG Tian-zhu, XU Chang-sheng. Survey of Multimedia Social Events Analysis [J]. Computer Science, 2021, 48(3): 97-112.
[12] WANG Xing , KANG Zhao. Smooth Representation-based Semi-supervised Classification [J]. Computer Science, 2021, 48(3): 124-129.
[13] HE Miao-miao, GUO Wei-bin. Inductive Learning Algorithm of Graph Node Embedding Based on KNN and Matrix Transform [J]. Computer Science, 2021, 48(3): 201-205.
[14] WANG Xue-cen, ZHANG Yu, LIU Ying-jie, YU Ge. Evaluation of Quality of Interaction in Online Learning Based on Representation Learning [J]. Computer Science, 2021, 48(2): 207-211.
[15] LI Xin-chao, LI Pei-feng, ZHU Qiao-ming. Directed Network Representation Method Based on Hierarchical Structure Information [J]. Computer Science, 2021, 48(2): 100-104.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!