Computer Science ›› 2021, Vol. 48 ›› Issue (12): 226-230.doi: 10.11896/jsjkx.200800026

• Database & Big Data & Data Science • Previous Articles     Next Articles

Complex Network Link Prediction Method Based on Topology Similarity and XGBoost

GONG Zhui-fei, WEI Chuan-jia   

  1. College of Computer Science and Technology,Zhejiang University of Technology,Hangzhou 310023,China
  • Received:2020-08-04 Revised:2020-09-21 Online:2021-12-15 Published:2021-11-26
  • About author:GONG Zhui-fei,born in 1977,postgra-duate,Ph.D,lecturer,senior engineer.Her main research interests include complex network and link prediction.
  • Supported by:
    National Natural Science Foundation of China(61773348) and Natural Science Foundation of Zhejiang Province,China(LY17F030016).

Abstract: In order to improve the performance of complex network link prediction,topology similarity and XGBoost algorithm are used to complete link prediction in complex network.According to the topological structure of complex network,the adjacency matrix is established to solve the common neighbor set.Then the similarity score function of complex network is calculated according to the topological similarity theory.The score function and weight parameters of each time window are taken as input,and XGBoost algorithm is used to realize the link prediction of complex network.By setting two regularization coefficients of XGBoost algorithm through differentiation,the influence on link prediction accuracy is tested,and the optimal regularization coefficient is obtained,thus a stable XGBoost link prediction model is obtained.The experimental results show that,compared with the common network link prediction algorithms,the prediction accuracy based on topology similarity and XGBoost algorithm has obvious advantages,and the prediction time performance is smaller than other algorithms,especially suitable for large-scale complex network link prediction.

Key words: Complex network, Link prediction, Regularization, Time window, Topology similarity, XGBoost algorithm

CLC Number: 

  • TP391
[1]LI H,MA X P,SHI J,et al.Research on trust transfer based recommendation model in complex network environment[J].Acta Automatica Sinica,2018,44(2):363-376.
[2]XU X K,XU S,ZHU Y X,et al.Link predictability in complex networks[J].Complex Systems and Complexity Science,2014,11(1):41-47.
[3]WANG K,LIU S X,YU H T,et al.Complex network link prediction algorithm based on common neighbor validity[J].Journal of University of Electronic Science and Technology of China,2019,48(3):114-121.
[4]CHEN B,ZHU W,LIU Y.Algorithm for complex network diameter based on distance matrix[J].Journal of Systems Engineering & Electronics,2018,29(2):118-124.
[5]YANG X H,WANG C.Community Detection Algorithm in Complex Network Based on Network Embedding and Local Resultant Force[J].Computer Science,2021,48(4):229-236.
[6]WANG H,LE Z C,GONG X,et al.Review of Link Prediction Methods Based on Feature Classification[J].Computer Science,2020,47(8):302-312.
[7]WU Z F,LIANG Q,LIU Q,et al.link prediction optimization algorithm based on AdaBoost[J].Journal of Communications,2014(3):116-123.
[8]MO H M.Application of Belief Function to Identification of Node Influence in Complex Networks[J].Journal of Chongqing Technology and Business University(Natural Science Edition),2018,35(1):71-78.
[9]WANG X,CHEN X,QIAN F L,et al.Node similarity link prediction algorithm based on common neighbor contribution[J].Data Acquisition and Processing,2018,33(5):900-910.
[10]GUO W Y,LIU H Y,SUN Q,et al.Topological similarity measurement of contour lines using tree edit distance[J].Journal of Surveying and Mapping Science and Technology,2019(1):79-85.
[11]FU L D,WEI H,LI D,et al.Community dividing algorithm based on similarity of common neighbor nodes[J].Journal of Computer Applications,2019,39(7):2024-2029.
[12]GÓMEZ-RÍOS A, LUENGO J,HERRERA F.A Study on the Noise Label Influence in Boosting Algorithms:AdaBoost,GBM and XGBoost[C]//International Conference on Hybrid Artificial Intelligence Systems.2017.
[13]ZHANG Y,YAO Y G.Research on network intrusion detection based on xgboost algorithm[J].Information Network Security,2018(9):102-105.
[14]LI Z S,LIU Z G.Feature selection algorithm based on xgboost[J].Journal of Communications,2019(10):101-108.
[15]ZHANG X Y,TANG K.Traffic identification method based on xgboost algorithm and domain name information screening[J].Electronic Design Engineering,2019(6):177-182,187.
[16]CUI Y P,SHI K X,HU J W.Research of Webshell Detection Method Based on XGBoost Algorithm[J].Computer Science,2018,45(0z1):375-379.
[17]SU B J,ZHOU Y P,LIANG X G.Emotion recognition model of e-commerce review text based on xgboost algorithm[J].Internet of Things Technology,2018,8(1):54-57.
[18]ZHANG Y X,FENG Y X.Overview of link prediction methods and development[J].TT & C Technology,2019,38(2):8-12.
[1] SONG Jie, LIANG Mei-yu, XUE Zhe, DU Jun-ping, KOU Fei-fei. Scientific Paper Heterogeneous Graph Node Representation Learning Method Based onUnsupervised Clustering Level [J]. Computer Science, 2022, 49(9): 64-69.
[2] HUANG Li, ZHU Yan, LI Chun-ping. Author’s Academic Behavior Prediction Based on Heterogeneous Network Representation Learning [J]. Computer Science, 2022, 49(9): 76-82.
[3] ZHENG Wen-ping, LIU Mei-lin, YANG Gui. Community Detection Algorithm Based on Node Stability and Neighbor Similarity [J]. Computer Science, 2022, 49(9): 83-91.
[4] HE Xi, HE Ke-tai, WANG Jin-shan, LIN Shen-wen, YANG Jing-lin, FENG Yu-chao. Analysis of Bitcoin Entity Transaction Patterns [J]. Computer Science, 2022, 49(6A): 502-507.
[5] YANG Hao-xiong, GAO Jing, SHAO En-lu. Vehicle Routing Problem with Time Window of Takeaway Food ConsideringOne-order-multi-product Order Delivery [J]. Computer Science, 2022, 49(6A): 191-198.
[6] YANG Bo, LI Yuan-biao. Complex Network Analysis on Curriculum System of Data Science and Big Data Technology [J]. Computer Science, 2022, 49(6A): 680-685.
[7] ZHANG Wen-xuan, WU Qin. Fine-grained Image Classification Based on Multi-branch Attention-augmentation [J]. Computer Science, 2022, 49(5): 105-112.
[8] WANG Ben-yu, GU Yi-jun, PENG Shu-fan, ZHENG Di-wen. Community Detection Algorithm Based on Dynamic Distance and Stochastic Competitive Learning [J]. Computer Science, 2022, 49(5): 170-178.
[9] LI Yong, WU Jing-peng, ZHANG Zhong-ying, ZHANG Qiang. Link Prediction for Node Featureless Networks Based on Faster Attention Mechanism [J]. Computer Science, 2022, 49(4): 43-48.
[10] ZHAO Liang, ZHANG Jie, CHEN Zhi-kui. Adaptive Multimodal Robust Feature Learning Based on Dual Graph-regularization [J]. Computer Science, 2022, 49(4): 124-133.
[11] CHEN Shi-cong, YUAN De-yu, HUANG Shu-hua, YANG Ming. Node Label Classification Algorithm Based on Structural Depth Network Embedding Model [J]. Computer Science, 2022, 49(3): 105-112.
[12] ZHAO Xue-lei, JI Xin-sheng, LIU Shu-xin, LI Ying-le, LI Hai-tao. Link Prediction Method for Directed Networks Based on Path Connection Strength [J]. Computer Science, 2022, 49(2): 216-222.
[13] LI Jia-wen, GUO Bing-hui, YANG Xiao-bo, ZHENG Zhi-ming. Disease Genes Recognition Based on Information Propagation [J]. Computer Science, 2022, 49(1): 264-270.
[14] ZHENG Jian-wei, HUANG Juan-juan, QIN Meng-jie, XU Hong-hui, LIU Zhi. Hyperspectral Image Denoising Based on Non-local Similarity and Weighted-truncated NuclearNorm [J]. Computer Science, 2021, 48(9): 160-167.
[15] YANG Lei, JIANG Ai-lian, QIANG Yan. Structure Preserving Unsupervised Feature Selection Based on Autoencoder and Manifold Regularization [J]. Computer Science, 2021, 48(8): 53-59.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!