Computer Science ›› 2020, Vol. 47 ›› Issue (8): 302-312.doi: 10.11896/jsjkx.190700136

Previous Articles     Next Articles

Review of Link Prediction Methods Based on Feature Classification

WANG Hui1, 2, LE Zi-chun3, GONG Xuan1, WU Yu-kun1, ZUO Hao1   

  1. 1 College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China
    2 College of Applied Science, Jiangxi University of Science and Technology, Ganzhou, Jiangxi 341000, China
    3 College of Science, Zhejiang University of Technology, Hangzhou 310023, China
  • Online:2020-08-15 Published:2020-08-10
  • About author:WANG Hui, born in 1983, Ph.D student, lecturer, is a member of China Computer Federation.Her main research interests include link prediction, deep learning, AI and big data.
  • Supported by:
    This work was supported by the Special Funding of “the Belt and Road” International Cooperation of Zhejiang Province (2015C04005).

Abstract: As an important research direction of network science research, link prediction of complex network has been more and more attention from experts from various disciplines, and it can make use of existing network information, such as the features of the nodes and edges, to predict possible future relationships, missing information in the network, and new or disappearing information, identify the false interactions, evaluate network evolution mechanism and conduct network reconfiguration, etc.Current articles on link prediction mainly come from the expert in engineering, computer science and physics.They are independent and lack cooperation, and there are few review articles combining multidisciplinary link prediction.The goal of this article is from the perspective of computer science and physics comprehensive to review, analyze, and discusse the research progress of link prediction algorithm based on feature classification, introduces a variety of feature extraction techniques in this field.This paper firstly introduces the idea of layering into the classification of link prediction algorithm, which divides the classification model into three layers, the metadata layer, features classification layer and feature extraction layer.The classification model includes two parts and seven aspects, that is, the prediction algorithm is divided into two parts, features extraction method and features learning method.The seven aspects are the similarity based methods, probabilistic methods, likelihood based methods, matrix factorization based method, random walk based method, neural network based method and custom loss function based method.This classification method covers many classical and latest link prediction techniques in various disciplines, including GNN, GCN, RNN and RL, which are the most popular link prediction techniques in graph neural networks.The differences of these algorithms in model complexity and prediction performance are studied, and the challenges of current link prediction technology in the future are discussed.

Key words: Complex network, Feature classification, Graph neural network, Link prediction, Machine learning

CLC Number: 

  • TP391
[1]RAMESH R S.Link prediction and path analysis using markov chains[J].Computer Networks, 2000, 60(33):377-386.
[2]LIBEN-NOWEL D, KLEINBERGN, JON K.The link-prediction problem forsocial networks[J].Journal of The American Society For InformationScience and Technology, 2007, 58(7):1019-1031.
[3]LV L L.Link prediction on complex networks[J].Journal ofUniversity of Electronic Science and Technology of China, 2010, 39(5):651-661.
[4]HASAN M A, ZAKI M.A survey of link prediction in socialnetworks[J].Social Network Data Analytics, 2011, 40(5):243-275.
[5]LV L L, REN X L, ZHOU T.Network link prediction:concepts and frontiers [J].Communication of China Computer Society, 2016, 12(4):12-21.
[6]VALVERDE R J, LOPES A A.Exploiting behaviors of communities of twitter users for link prediction[J].Soc. Netw. Anal. Min., 2013, 20(3):1063-1074.
[7]LIU H, HU Z, HADDADI H.Hidden link prediction based on node centrality and weak ties[J].Europhys Lett., 2013, 101(1):65-75.
[8]YANG S H, LONG B, SMOLA A, et al.Like like alike:jointfriendship and interest propagation in social networks[C]∥Proceedings of the 20th International Conference on World Wide Web.ACM, 2011:537-546.
[9]CHO E, MYERS S A, LESKOVEC J.Friendship and mobility:user movement in location-based social networks[C]∥Procee-dings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.ACM.2011:1082-1090.
[10]LEO K.A new status index derived from sociometric analysis[J].Psychometrika, 1953, 50(18):39-43.
[11]ELIZABETH A L, PETTER H, MARK E N.Vertex similarityin networks[J].Physical Review E, 2006, 73(2):121-130.
[12]JEH G, WIDOM J.SimRank:a measure of structural-contextsimilarity[C]∥Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’02).ACM, 2002:538-543.
[13]KLOUMANN I M, UGANDER J, KLEINBERG J.Block mo-dels and personalized PageRank[J].The National Academy of Sciences, 2017, 114(1):33-38.
[14]SHANG M S, L L, ZENG W, et al.Relevance is more significant than correlation:Information filtering on sparse data[J].Europhys Lett., 2009, 88(6):68008p1-68008p4.
[15]NEWMAN M E J.Clustering and preferential attachment ingrowing networks[J].Phys. Rev. E., 2001, 64(10):251-264.
[16]SALTON G, MCGILLM.Introduction to Modern InformationRetrieval[J].McGraw-Hill, 1983, 32(6):528-536.
[17]ADAMIC L A, ADAR E.Friend and neighbors on the web[J].Soc Networks, 2003, 25(6):211-230.
[18]LIU Y Y, ZHAO C L, WANG X J.The degree-related clustering coefficient and its application to link prediction[J].Physica A, 2016, 454(15):24-33.
[19]ZHOU T, L L, ZHANG Y C.Predicting missing links via local information[J].Eur. Phys. J. B., 2009, 80(71):623-630.
[20]RAVASZ E, SOMERA A L, MONGRU D, et al.Hierarchicalorganization of modularity in metabolic networks[J].Science, 2002, 297(12):1551-1558.
[21]LV L, JIN C H, ZHOU T.Similarity index based on local paths for link prediction of complex network[J].Phys. Reve., 2009, 80(4):211-223.
[22]WANG P, XU B W, WU Y R.Link predictionin social net-works:the state-of-the-art[J].Science China Information Scien-ces, 2015, 58(1):1-38.
[23]HU R J, TANG J X, YUAN Y N.Link prediction in complex networks based on the interactions among paths[J].Physical A, 2019, 510(4):52-67.
[24]ALEXIS P, PANAGIOTIS S, YANNIS M.Fast and accuratelink prediction in social networking systems[J].Systems and Software, 2012, 60(9):2119-2132.
[25]ZHU X, TIAN H, CAI S, et al.Predicting missing links via significant paths[J].EPL, 2014, 108(4):18008-18014.
[26]AARON C, CRISTOPHER M, MARK E J.Hierarchicalstructure and theprediction of missing links in networks[J].Nature, 2008, 453(71):98-101.
[27]GUIMER R, SALES P M.Missing and spurious interactionsand the reconstruction of complex networks[J].PNAS, 2009, 106(52):22073-22078.
[28]BEN T, PIETER A, WONG M F.Relationalmarkov networks[J].Introduction to Statistical Relational Learning, 2007, 40(13):175-200.
[29]LV L Y.ZHOU T.Link prediction in complex networks:A survey[J].Physica A:Statistical Mechanics and Its Applications, 2011, 390(6):1150-1170.
[30]MENON A K, ELKAN C.Link prediction via matrix factorization in Machine Learning and Knowledge Discovery in Databases[C]∥Springer.2011:437-452.
[31]ADITYA K M, CHARLES E.Link Prediction via Matrix Factorization[C]∥Proceedings of the 2011 European Conference on Machine Learning and Knowledge Discovery in Databases.ACM, 2011:437-452.
[32]KUNEGIS J, LOMMATZSCH A.Learning spectral graphtransformations for link prediction[C]∥ Proceedings of the 26th Annual International Conference on Machine Learning.ACM, 2009:561-568.
[33]CHEN Z, ZHANG W.A Marginalized Denoising Method forLink Prediction in Relational Data[C]∥ Proceedings of the 2014 SIAM International Conference on Data Mining.ACM, 2015:64-72.
[34]DAI L Y, KONG X Z, YUAN S S, et al.Integrative graph regularized matrix factorization for drug-pathway associations analysis[J].Computational Biology and Chemistry, 2019, 78(7):474-480.
[35]REN Y W, ZHOU J L, WANG J.Quality-Relevant Fault Monitoring Based on Locally Linear Embedding Orthogonal Projection to Latent Structure[J].Industrial & Engineering Chemistry Research, 2019, 58(3):1262-1272.
[36]OU M D, CUI P, PEI J, et al.Asymmetrictransitivity preserving graph embedding[C]∥Proceedings of the 22nd ACMSIGKDD International Conference on Knowledge Discovery and Data Mining.ACM, 2017:1105-1114.
[37]BELKIN M, NIYOGI P.Laplacian eigenmaps and spectral techniques for embedding and clustering[C]∥Advances in Neural Information Processing Systems.2002:585-591.
[38]CAO S, LU W, XU Q.Grarep:Learning graph representations with global structural information[C]∥Proceedings of the 24th ACM International on Conference on Information and Know-ledge Management.ACM, 2015:891-900.
[39]WANG H W, WANG J L, XIE X, et al.GraphGAN:Graph representation learningwith generative adversarial nets[J].IEEE Transaction on Knowledge and Data Engineering, 2017, 30(22):11-19.
[40]CAI L J, XU Y B, HE T Q, et al.A New Algorithm of DeepWalk Based On Probability[J].Journal of Physics:Conference Series, 2019, 1069(1):130-135.
[41]GROVER A, LESKOVEC J.Node2Vec:Scalable Feature learning for Network[J].HHS Public Access, 2016, 14(8):855-864.
[42]WANG D, CUI P, ZHU W.Structural deep network embedding[C]∥Proceedings of the 22nd International Conference on Knowledge Discovery and Data Mining.ACM, 2016:1225-1234.
[43]FRENSNO D, CARLOS W L, SARAI M C, et al.DNGR-1 in dendritic cells limits tissue damage by dampening neutrophil recruitment[J].Science, 2019, 6412(362):351-356.
[44]TOMAS N K, MAX W, Variational Graph Auto-Encoders[J].Springer, 2016, 28(3):61-63.
[45]LUCA F, MATHIAS N, MASSIMILIANO P.Learning Discrete
Structures for Graph Neural Networks[J].ICML.2019, 30(11):960-973.
[46]YOU J X, REX Y, XIANG R.Graphrnn:Generating realisticgraphs with deepauto-regressive models[C]∥International Conference on Machine Learning.ACM, 2018:5694-5703.
[47]MONTI F, BRONSTEIN M, BRESSON X.Geometric matrixcompletionwith recurrent multi-graph neural networks[J].Advances in NeuralInformation Processing Systems, 2017, 30(4):3697-3707.
[48]BUI L Q, PHAN A V, NGUYEN Y L H.DGCNN:A convolutional neural network over large-scale labeled graphs[J].Neural Networks:The Official Journal of the International Neural Network Society, 2017, 108(11):533-543.
[49]CAO D N, KIPF T.Molgan:An implicit generative model forsmall molecular graphs[J].ICML 2018 Workshop on Theoretical Foundations and Applications of Deep Generative Models, 2018, 97(30):181-192.
[50]TANG J, QU M, WANG M, et al.Line:Large-scale information network embedding[C]∥Ternational Conference on Word Wide Web.ACM, 2015:1067-1077.
[51]KIM Y H, YOO J, KIM S J, et al.Comparison of Scattering Cross-Sections by MCNP5 and TRANSX/TWODANT Codes in Sodium-Cooled Fast Reactor[J].Transactions of the American Nuclear Society, 2018, 106(1):719-722.
[52]WANG H, LE Z C, GONG X, et al.Link predicton of complex network is analyzed from the perspective of informatics[J].Journal of Chinese Computer Systems, 2020, 41(2):316-326.
[1] LENG Dian-dian, DU Peng, CHEN Jian-ting, XIANG Yang. Automated Container Terminal Oriented Travel Time Estimation of AGV [J]. Computer Science, 2022, 49(9): 208-214.
[2] NING Han-yang, MA Miao, YANG Bo, LIU Shi-chang. Research Progress and Analysis on Intelligent Cryptology [J]. Computer Science, 2022, 49(9): 288-296.
[3] ZHOU Fang-quan, CHENG Wei-qing. Sequence Recommendation Based on Global Enhanced Graph Neural Network [J]. Computer Science, 2022, 49(9): 55-63.
[4] SONG Jie, LIANG Mei-yu, XUE Zhe, DU Jun-ping, KOU Fei-fei. Scientific Paper Heterogeneous Graph Node Representation Learning Method Based onUnsupervised Clustering Level [J]. Computer Science, 2022, 49(9): 64-69.
[5] HUANG Li, ZHU Yan, LI Chun-ping. Author’s Academic Behavior Prediction Based on Heterogeneous Network Representation Learning [J]. Computer Science, 2022, 49(9): 76-82.
[6] ZHENG Wen-ping, LIU Mei-lin, YANG Gui. Community Detection Algorithm Based on Node Stability and Neighbor Similarity [J]. Computer Science, 2022, 49(9): 83-91.
[7] LI Yao, LI Tao, LI Qi-fan, LIANG Jia-rui, Ibegbu Nnamdi JULIAN, CHEN Jun-jie, GUO Hao. Construction and Multi-feature Fusion Classification Research Based on Multi-scale Sparse Brain Functional Hyper-network [J]. Computer Science, 2022, 49(8): 257-266.
[8] ZHANG Guang-hua, GAO Tian-jiao, CHEN Zhen-guo, YU Nai-wen. Study on Malware Classification Based on N-Gram Static Analysis Technology [J]. Computer Science, 2022, 49(8): 336-343.
[9] YAN Jia-dan, JIA Cai-yan. Text Classification Method Based on Information Fusion of Dual-graph Neural Network [J]. Computer Science, 2022, 49(8): 230-236.
[10] HE Qiang, YIN Zhen-yu, HUANG Min, WANG Xing-wei, WANG Yuan-tian, CUI Shuo, ZHAO Yong. Survey of Influence Analysis of Evolutionary Network Based on Big Data [J]. Computer Science, 2022, 49(8): 1-11.
[11] CHEN Ming-xin, ZHANG Jun-bo, LI Tian-rui. Survey on Attacks and Defenses in Federated Learning [J]. Computer Science, 2022, 49(7): 310-323.
[12] QI Xiu-xiu, WANG Jia-hao, LI Wen-xiong, ZHOU Fan. Fusion Algorithm for Matrix Completion Prediction Based on Probabilistic Meta-learning [J]. Computer Science, 2022, 49(7): 18-24.
[13] YANG Bing-xin, GUO Yan-rong, HAO Shi-jie, Hong Ri-chang. Application of Graph Neural Network Based on Data Augmentation and Model Ensemble in Depression Recognition [J]. Computer Science, 2022, 49(7): 57-63.
[14] LI Ya-ru, ZHANG Yu-lai, WANG Jia-chen. Survey on Bayesian Optimization Methods for Hyper-parameter Tuning [J]. Computer Science, 2022, 49(6A): 86-92.
[15] ZHAO Lu, YUAN Li-ming, HAO Kun. Review of Multi-instance Learning Algorithms [J]. Computer Science, 2022, 49(6A): 93-99.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!