Computer Science ›› 2021, Vol. 48 ›› Issue (3): 214-219.doi: 10.11896/jsjkx.191200103

• Artificial Intelligence • Previous Articles     Next Articles

Label Propagation Algorithm Based on Weighted Samples and Consensus-rate

CHU Jie, ZHANG Zheng-jun, TANG Xin-yao, HUANG Zhen-sheng   

  1. School of Science,Nanjing University of Science and Technology,Nanjing 210094,China
  • Received:2019-12-16 Revised:2020-04-29 Online:2021-03-15 Published:2021-03-05
  • About author:CHU Jie,born in 1996,postgraduate.His main research interests include data mining and machine learning.
    ZHANG Zheng-jun,born in 1965,Ph.D,associate professor.His main research interests include data mining and gra-phic image.
  • Supported by:
    National Statistical Science Research Major Program of China(2018LD01).

Abstract: Label Propagation is one of the most widely used semi-supervised classification methods.Consensus rate-based label propagation(CRLP) algorithmconstructs the graph by summarizing multiple clustering solutions to incorporate various properties of the data.Like most graph-based semi-supervised classification method,CRLP focuses on optimizing the graph to improve the performance.In fact,samples are not always evenly distributed.The importance of different samples in the algorithm is diffe-rent.CRLP algorithm is easily affected by the numbers of clustering and the clustering methods,and it is not adaptable to low-dimensional data.To deal with these problems,a label propagation algorithm based on weighted samples and consensus-rate (WSCRLP) is proposed.WSCRLP firstly clusters the dataset multiple times to explore the structure of sample and combines the consensus-rate and the local information of the sample to construct a graph.Secondly,different weights are assigned to labeled samples with different distributions.Finally,semi-supervised classification is performed based on constructed graph and weighted samples.Experiments on real datasets show that the WSCRLP of weighting and constructing graphs on labeled samples can significantly improve classification accuracy,and is superior to other compared methods in 84% of the experiments.Compared with CRLP,WSCRLP not only has better performance,but also is robust to input parameters.

Key words: Consensus-rate, Label propagation, Semi-supervised classification, Weighted samples

CLC Number: 

  • TP301.6
[1]BLUM A,CHAWLA S.Learning from labeled and unlabeleddata using graph mincuts[C]//Processdings of 18th Internatio-nal Conference on Machine Learning.San Francisco:Morgan Kaufman Publishers Inc,2001:19-26.
[2]LI J N,ZHU Q S.Semi-Supervised self-training method based on an optimum-path forest[J].IEEE Access,2019,7:36388-36399.
[3]GAO Y,MA J,ALAN L,et al.Semi-Supervised sparse representation based classification for face recognition with insufficient labeled samples[J].IEEE Transactions Image Processing,2017,26(5):2545-2560.
[4]BELKIN M,NIYOGI P,SINDHWANI V.Manifold Regularization:A geometric framework for learning from labeled and unlabeled examples[J].The Journal of Machine Learning Research,2006,7:2399-2434.
[5]ZHU X,GHAHRAMANI Z,LAFFERTYJ D.Semi-supervised learning using Gaussian fields and harmonic functions[C]//Processdings of the Twentieth International Conference on Machine Learning.Washington:AAAI Press,2003:912-919.
[6]TAO G H,HUA L Z,WU W,et al.Safety-aware graph-based semi-supervised learning[J].Expert Systems with Applications,2018,107:243-254.
[7]WANG J,YAO G J,YU Z W.Semi-supervised classification by discriminative regularization[J].Applied Soft Computing,2017,58:245-255.
[8]NIGAM K,MCCALLLUM A K,et al.Text classification from labeled and unlabeled documents using EM[J].Machine Lear-ning,2000,39:103-134.
[9]WANG S,WU L,JIAO L,et al.Improve the performance of co-training by committee with refinement of class probability estimations[J].Neurocomputing,2014,136:30-40.
[10]HONG Y,ZHU W.Spatial co-training for semi-supervised ima-ge classification[J].Pattern Recognition Letters,2015,63:59-65.
[11]LI Y C,WANG Y L,BI C,et al.Revisiting transductive support vector machines with margin distribution embedding[J].Know-ledge-based Systems,2018,152:200-214.
[12]JURIC L,CECI M,KOCEV D,et al.Self-training for multi-target regression with tree ensembles[J].Knowledge-based Systems,2017,123:41-60.
[13]ZHOU D,BOUSQUET O,LAL T N,et al.Learning with local and global consistency[C]//Proceedings of the Sixteenth Advance in Neural Information Processing Systems.Whistler:MIT Press,2003:321-328.
[14]WANG F,ZHANG C.Label propagation through linear neighborhoods[J].IEEE Transactions on Knowledge and Data Engineering,2008,20(1):55-67.
[15]ZHAO M,CHOW T W S,ZHANG Z,et al.Automatic image annotation via compact graph based semi-supervised learning[J].Knowledge-Based Systems,2015,76:148-165.
[16]WU F,WANG W,YANG Y,et al.Classification by semi-supervised discriminative regularization[J].Neurcomputing,2010,73(10):1641-1651.
[17]YU J,SB K.Consensus rate-based label propagation for semi-supervised classification[J].Information Sciences,2018,465:265-284.
[18]DAS S,MOORE T,WONG W K,et al.End-user feature labeling:Supervised and semi-supervised approaches based on locally-weighted logistic regression[J].Artificial Intelligence,2013,204:56-74.
[19]REN Y,DOMENICONI C,ZHANG G,et al.Weighted-object ensemble clustering[C]//IEEE International Conference on Data Mining.Dallas:IEEE Press,2013:627-636.
[20]CHEN X,YU G X,TAN Q Y,et al.Weighted samples based semi-supervised classification[J].Applied Soft Computing,2019,79:46-58.
[1] HE Xi, HE Ke-tai, WANG Jin-shan, LIN Shen-wen, YANG Jing-lin, FENG Yu-chao. Analysis of Bitcoin Entity Transaction Patterns [J]. Computer Science, 2022, 49(6A): 502-507.
[2] WANG Xing , KANG Zhao. Smooth Representation-based Semi-supervised Classification [J]. Computer Science, 2021, 48(3): 124-129.
[3] DAI Cai-yan, CHEN Ling, HU Kong-fa. Algorithm for Mining Bipartite Network Based on Incremental Modularity [J]. Computer Science, 2018, 45(6A): 442-446.
[4] LIU Su-qi, BAI Guang-wei and SHEN Hang. Taxonomy Construction Based on User Self-describing Tags [J]. Computer Science, 2016, 43(7): 224-229.
[5] XU Wei, LIN Bo-gang, LIN Si-juan and YANG Yang. Assessment of User Influence in Social Networks Based on Multi-label Propagation [J]. Computer Science, 2016, 43(10): 135-140.
[6] MA Jie-liang, HAN Lu, PAN Zhen-zhen and SONG Yan. Label Propagation Algorithm Based on Community Core for Community Detection [J]. Computer Science, 2015, 42(1): 119-121.
[7] YANG Ge-lan,JIN Hui-xia,MENG Ling-zhong and ZHU Xing-hui. Graph-based Semi-supervised Dimensionality Reduction Algorithm [J]. Computer Science, 2014, 41(4): 280-282.
[8] . Study on Label Propagation Based Community Detection Algorithm for Social Semantic Network [J]. Computer Science, 2013, 40(2): 53-57.
[9] . Research on Sentiment Classification of Collaborative Learning Based on Emotion Words and Sentiment Words [J]. Computer Science, 2012, 39(12): 245-248.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!