Computer Science ›› 2022, Vol. 49 ›› Issue (6A): 113-118.doi: 10.11896/jsjkx.210500034

• Intelligent Computing • Previous Articles     Next Articles

Active Metric Learning Based on Support Vector Machines

HOU Xia-ye1, CHEN Hai-yan1,3, ZHANG Bing1, YUAN Li-gang2, JIA Yi-zhen1   

  1. 1 College of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,China
    2 College of Civil Aviation,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,China
    3 Collaborative Innovation Center of Novel Software Technology and Industrialization,Nanjing 210023,China
  • Online:2022-06-10 Published:2022-06-08
  • About author:HOU Xia-ye,born in 1996,postgra-duate.His main research interests include machine learning and artificial intelligence.
    CHEN Hai-yan,born in 1979,Ph.D,lecturer,is a member of China ComputerFederation.Her main research interests include machine learning and artificial intelligence.
  • Supported by:
    National Natural Science Foundation of China(61501229) and Fundamental Research Funds for the Central Universities(NS2019054,NS2020045).

Abstract: Metric learning is an important issue in machine learning.The measuring results will significantly affect the perfor-mance of machine learning algorithms.Current researches on metric learning mainly focus on supervised learning problems.How-ever,in real world applications,there is a large amount of data that has no label or needs to pay a high price to get labels.To handle this problem,this paper proposes an active metric learning algorithm based on support vector machines(ASVM2L),which can be used for semi-supervised learning.Firstly,a small size of samples randomly selected from the unlabeled dataset are labeled by oracles,and then these samples are used to train the support vector machine metric learner(SVM2L).According to the output measuring result,the rest unlabeled samples are classified by K-NN classifiers with different values of K,and the sample with the largest voting differences is selected and submitted to the oracle to get a label.Then,the sample is added to the training set to retrain the ASVM2L model.Repeating the above steps until the termination condition is met,then the best metric matrix can be obtained from the limited labeled samples.Comparative experiments on the standard datasets verify that the proposed ASVM2L algorithm can obtain more information with the least labeled samples without affecting the classification accuracy,and therefore has better measuring performance.

Key words: Active learning, Metric Learning, Sampling strategy, Semi-supervised learning, Support vector machine metric learning

CLC Number: 

  • TP391
[1] KULIS B.Metric learning:A survey[J].Foundations andTrends in Machine Learning,2012,5(4):287-364.
[2] XING E P,NG A Y,JORDAN M I,et al.Distance metric lear-ning with application to clustering with side-information[J].NIPS,2002,15(12):505-512
[3] LIU W,WEN Y,YU Z,et al.Sphereface:Deep hypersphere embedding for face recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:212-220.
[4] PLÖTZ T,ROTH S.Neural nearest neighbors networks[J].Advances in Neural Information Processing Systems,2018,31:1087-1098.
[5] CHOI J,MIN C,LEE B.Mathematical Analysis on Information-Theoretic Metric Learning With Application to Supervised Learning[J].IEEE Access,2019,7:121998-122005.
[6] WU W,TAO D,LI H,et al.Deep features for person re-identification on metric learning[J].Pattern Recognition,2021,110:107424.
[7] WANG F,ZUO W,ZHANG L,et al.A kernel classificationframework for metric learning[J].IEEE Transactions on Neural Networks and Learning Systems,2014,26(9):1950-1962.
[8] PASOLLI E,YANG H L,CRAWFORD M M.Active-metriclearning for classification of remotely sensed hyperspectral images[J].IEEE Transactions on Geoscience and Remote Sensing,2015,54(4):1925-1939.
[9] KUMARI P,GORU R,CHAUDHURI S,et al.Batch Decorrelation for Active Metric Learning[C]//IJCAI.2020.
[10] ZHOU Z H.A brief introduction to weakly supervised learning[J].National science review,2018,5(1):44-53.
[11] AGGARWAL C C,KONG X,GU Q,et al.Active learning:Asurvey[M]//Data Classification:Algorithms and Applications.CRC Press,2014:571-605.
[12] SHARMA M,BILGIC M.Evidence-based uncertainty sampling for active learning[J].Data Mining and Knowledge Discovery,2017,31(1):164-202.
[13] YE Y,LI T,ADJEROH D,et al.A survey on malware detection using data mining techniques[J].ACM Computing Surveys(CSUR),2017,50(3):1-40.
[14] GRAVES A,BELLEMARE M G,MENICK J,et al.Automated Curriculum Learning for Neural Networks[C]//International Conference on Machine Learning.PMLR,2017:1311-1320.
[15] YANG Y,LOOG M.A benchmark and comparison of activelearning for logistic regression[J].Pattern Recognition,2018,83:401-415.
[16] KAO C C,LEE T Y,SEN P,et al.Localization-aware activelearning for object detection[C]//Asian Conference on Compu-ter Vision.Cham:Springer,2018:506-522.
[1] WU Hong-xin, HAN Meng, CHEN Zhi-qiang, ZHANG Xi-long, LI Mu-hang. Survey of Multi-label Classification Based on Supervised and Semi-supervised Learning [J]. Computer Science, 2022, 49(8): 12-25.
[2] ZHOU Hui, SHI Hao-chen, TU Yao-feng, HUANG Sheng-jun. Robust Deep Neural Network Learning Based on Active Sampling [J]. Computer Science, 2022, 49(7): 164-169.
[3] WANG Yu-fei, CHEN Wen. Tri-training Algorithm Based on DECORATE Ensemble Learning and Credibility Assessment [J]. Computer Science, 2022, 49(6): 127-133.
[4] XU Hua-jie, CHEN Yu, YANG Yang, QIN Yuan-zhuo. Semi-supervised Learning Method Based on Automated Mixed Sample Data Augmentation Techniques [J]. Computer Science, 2022, 49(3): 288-293.
[5] ZHANG Ren-zhi, ZHU Yan. Malicious User Detection Method for Social Network Based on Active Learning [J]. Computer Science, 2021, 48(6): 332-337.
[6] WANG Ti-shuang, LI Pei-feng, ZHU Qiao-ming. Chinese Implicit Discourse Relation Recognition Based on Data Augmentation [J]. Computer Science, 2021, 48(10): 85-90.
[7] DONG Xin-yue, FAN Rui-dong, HOU Chen-ping. Active Label Distribution Learning Based on Marginal Probability Distribution Matching [J]. Computer Science, 2020, 47(9): 190-197.
[8] QIN Yue, DING Shi-fei. Survey of Semi-supervised Clustering [J]. Computer Science, 2019, 46(9): 15-21.
[9] WU Zhen-yu, LI Yun-lei, WU Fan. Semi-supervised Support Tensor Based on Tucker Decomposition [J]. Computer Science, 2019, 46(9): 195-200.
[10] LI Yi-hong, LIU Fang-zheng, DU Zhen-yu. Malware Detection Algorithm for Improving Active Learning [J]. Computer Science, 2019, 46(5): 92-99.
[11] SHEN Hong, LIU Jun-fa, CHEN Yi-qiang, JIANG Xin-long, HUANG Zheng-yu. Semi-supervised Scene Recognition Method Based on Multi-mode Fusion [J]. Computer Science, 2019, 46(12): 306-312.
[12] YU Cheng, ZHU Wan-ning, YOU Kun, ZHU Jin-fu. Prediction Model of E-sports Behavior Pattern Based on Attention Mechanism and LRUA Module [J]. Computer Science, 2019, 46(11A): 76-79.
[13] ZHAO Hai-yan, WANG Jing, CHEN Qing-kui, CAO Jian. Application of Active Learning in Recommendation System [J]. Computer Science, 2019, 46(11A): 153-158.
[14] BAO Zong-ming, GONG Sheng-rong, ZHONG Shan, YAN Ran, DAI Xing-hua. Person Re-identification Algorithm Based on Bidirectional KNN Ranking Optimization [J]. Computer Science, 2019, 46(11): 267-271.
[15] YU Ying, CHEN Ke, SHOU Li-dan, CHEN Gang, WU Xiao-fan. Sentiment Analysis of User Comments Based on Extraction of Key Words and Key Sentences [J]. Computer Science, 2019, 46(10): 19-26.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!