计算机科学 ›› 2022, Vol. 49 ›› Issue (5): 1-9.doi: 10.11896/jsjkx.210500128

• 计算机图形学&多媒体* 上一篇    下一篇

面向图像分类的小样本学习算法综述

彭云聪1,3, 秦小林1,2,3, 张力戈1,3, 顾勇翔1,3   

  1. 1 中国科学院成都计算机应用研究所 成都610041
    2 南昌理工学院 南昌330044
    3 中国科学院大学计算机科学与技术学院 北京100049
  • 收稿日期:2021-05-18 修回日期:2021-10-22 出版日期:2022-05-15 发布日期:2022-05-06
  • 通讯作者: 秦小林(qinxl2001@126.com)
  • 作者简介:(pengyuncong19@mails.ucas.ac.cn)
  • 基金资助:
    国家自然科学基金(61402537);四川省科技计划资助项目(2019ZDZX0005,2019ZDZX0006,2020YFQ0056,2021YFG0034);四川省委组织部人才专项资助;全国科学院联盟合作项目(中国科学院成都分院-重庆科学技术研究院)

Survey on Few-shot Learning Algorithms for Image Classification

PENG Yun-cong1,3, QIN Xiao-lin1,2,3, ZHANG Li-ge1,3, GU Yong-xiang1,3   

  1. 1 Chengdu Institute of Computer Applications,Chinese Academy of Sciences,Chengdu 610041,China
    2 Nanchang Institute of Technology,Nanchang 330044,China
    3 School of Computer Science and Technology,University of Chinese Academy of Sciences,Beijing 100049,China
  • Received:2021-05-18 Revised:2021-10-22 Online:2022-05-15 Published:2022-05-06
  • About author:PENG Yun-cong,born in 1998,postgraduate.His main research interests include few-shot learning and theory of statistical machine learning.
    QIN Xiao-lin,born in 1980,Ph.D,professor,Ph.D supervisor.His main research interests include automatic reasoning and swarm intelligence.
  • Supported by:
    National Natural Science Foundation of China(61402537),Sichuan Science and Technology Program(2019ZDZX0005,2019ZDZX0006,2020YFQ0056,2021YFG0034),Talents by Sichuan Provincial Party Committee Organization Department and National Academy of Science Alliance Collaborative Program(Chengdu Branch of Chinese Academy of Sciences-Chongqing Academy of Science and Technology).

摘要: 目前,以深度学习为代表的人工智能算法凭借超大规模数据集以及强大的计算资源,在图像分类、生物特征识别、医疗辅助诊断等领域取得了优秀的成果并成功落地。然而,在许多实际的应用场景中,因诸多限制,研究人员无法获取到大量样本或者获取样本的代价过高,因此研究图像分类任务在小样本情形下的学习算法成为了推动智能化进程的核心动力,同时也成为了当下的研究热点。小样本学习指在监督信息数量有限的情况下进行学习并解决问题的算法。首先,从机器学习理论的角度描述了小样本学习困难的原因;其次,根据小样本学习算法的设计动机将现有算法归为表征学习、数据扩充、学习策略三大类,并分析其优缺点;然后,总结了常用的小样本学习评价方法以及现有模型在公用数据集上的表现;最后,讨论了小样本图像分类技术的难点及未来的研究趋势,为今后的研究提供参考。

关键词: 表征学习, 迁移学习, 数据扩充, 图像分类, 小样本学习

Abstract: Presently,artificial intelligence algorithms represented by deep learning have achieved advanced results and been successfully used in fields such as image classification,biometric recognition and medical assisted diagnosis by virtue of ultra-large-scale data sets and powerful computing resources.However,due to many restrictions in the actual environment,it is impossible to obtain a large number of samples or the cost of obtaining samples is too high.Therefore,studying the learning algorithm in the case of small samples is the core driving force to promote the intelligent process,and it has also become a current research hot-spot.Few-shot learning is the algorithm to learn and solve the problem under the condition of limited supervision information.Firstly,it describes the reasons why few-shot learning is difficult to generalize from the perspective of machine learning theory.Secondly,according to the design motivation of the few-shot learning algorithm,existing algorithms are classified into three categories:representation learning,data expansion and learning strategy,and their advantages and disadvantages are analyzed.Thirdly,we summarize the commonly used few-shot learning evaluation methods and the performance of existing models in public data sets.Finally,we discuss the difficulties and future research trends of small sample image classification technology to provide re-ferences for future research.

Key words: Data expansion, Few-shot learning, Image classification, Learning representation, Transfer learning

中图分类号: 

  • TP181
[1]REN S,HE K,GIRSHICK R B,et al.Faster R-CNN:Towards Real-Time Object Detection with Region Proposal Networks[C]//International Conference on Neural Information Proces-sing Systems.2015.
[2]KRIZHEVSKY A,SUTSKEVER I,HINTON G E.ImagenetClassificationwith Deep Convolutional Neural Networks[J].Advances in Neural Information Processing Systems,2012,25:1097-1105.
[3]YAN L,ZHENG Y,CAO J.Few-shot Learning for Short Text Classification[J].Multimedia Tools Applications,2018,77(22):29799-29810.
[4]LI F F,FERGUS R,PERONA P.One-shot Learning of Object Categories[J].IEEE Transactions on Pattern Analysis Machine Intelligence,2006,28(4):594-611.
[5]LAKE B M,SALAKHUTDINOV R,TENENBAUM J B.One-shot Learning by Inverting a Compositional Causal Process[C]//Proceedings of the 26th International Conference on Neural Information Processing Systems.2013:2526-2534.
[6]TANG K D,TAPPEN M F,SUKTHANKAR R,et al.Optimizing One-shot Recognition with Micro-set Learning[C]//2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.2010:3027-3034.
[7]MUNKHDALAI T,YUAN X,MEHRI S,et al.Rapid Adaptation with Conditionally Shifted Neurons[C]//International Conference on Machine Learning.2018:3664-3673.
[8]FINK M.Object Classification From a Single Example Utilizing Class Relevance Metrics[C]//International Conference on Neural Information Processing Systems.2004:449-456.
[9]XU Z,ZHU L,YANG Y.Few-shot Object Recognition fromMachine-labeled Web Images[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:1164-1172.
[10]SIMON C,KONIUSZ P,NOCK R,et al.Adaptive Subspaces for Few-shot Learning[C]//Proceedings of the IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition.2020:4136-4145.
[11]CHEN Z,FU Y,ZHANG Y,et al.Multi-level Semantic Feature Augmentation for One-shot Learning[J].IEEE Transactions on Image Processing,2019,28(9):4594-4605.
[12]DOERSCH C,GUPTA A,EFROS A A.Unsupervised Visual Representation Learning by Context Prediction[C]//Procee-dings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:1422-1430.
[13]PATHAK D,KRAHENBUHL P,DONAHUE J,et al.Context Encoders:Feature Learning by Inpainting[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:2536-2544.
[14]ZHANG R,ISOLA P,EFROS A A.Colorful Image Colorization[C]//European Conference on Computer Vision.2016:649-666.
[15]GIDARIS S,SINGH P,KOMODAKIS N.Unsupervised Representation Learning by Predicting Image Rotations[C]//International Conference on Learning Representations.2018.
[16]RITCHIE D,WANG K,LIN Y A.Fast and Flexible IndoorScene Synthesis via Deep Convolutional Generative Models[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2019.
[17]CHEN M,FANG Y,WANG X,et al.Diversity Transfer Network for Few-shot Learning[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2020:10559-10566.
[18]SANTORO A,BARTUNOV S,BOTVINICK M,et al.Meta-learning with Memory-augmented Neural Networks[C]//International Conference on Machine Learning.2016:1842-1850.
[19]WANG Y X,HEBERT M.Learning from Small Sample Sets by Combining Unsupervised Meta-training with CNNs[C]//International Conference on Neural Information Processing Systems.2016:244-252.
[20]JAMAL M A,QI G J.Task Agnostic Meta-learning for Few-shot Learning[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:11719-11727.
[21]MISHRA N,ROHANINEJAD M,CHEN X,et al.A SimpleNeural Attentive Meta-Learner[C]//International Conference on Learning Representations.2018.
[22]RUSU A A,RAO D,SYGNOWSKI J,et al.Meta-Learning with Latent Embedding Optimization[C]//International Conference on Learning Representations.2018.
[23]ZHOU F,WU B,LI Z.Deep Meta-learning:Learning to Learn in The Concept Space[J].arXiv:1802.03596,2018.
[24]BERTINETTO L,HENRIQUES J,TORR P,et al.Meta-lear-ning with Differentiable Closed-form Solvers[C]//International Conference on Learning Representations.2019.
[25]ZHAO K L,JIN X L,WANG Y Z.Survey on Few-shot Lear-ning[J].Journal of Software,2021,32(2):349-369.
[26]LIU Y,LEI Y B,FAN J L,et al.Survey on Image Classification Technology Based on Small Sample Learning[J].Acta Automatica Sinica,2021,47(2):297-315.
[27]BOTTOU L,CURTIS F E,NOCEDAL J.Optimization Methods for Large-scale Machine Learning[J].Siam Review,2018,60(2):223-311.
[28]BOTTOU L,BOUSQUET O.The Tradeoffs of Large ScaleLearning[C]//International Conference on Neural Information Processing Systems.2007:161-168.
[29]WANG Y,YAO Q,KWOK J T,et al.Generalizing from A Few Examples:A Survey on Few-shot Learning[J].ACM Computing Surveys,2020,53(3):1-34.
[30]QI H,BROWN M,LOWE D G.Low-shot Learning with Imprinted Weights[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:5822-5830.
[31]KOCH G.Siamese Neural Networks for One-shot Image Recog-nition[C]//Internation Conference on Machine Learning Deep Learning Workshop.2015.
[32]CHEN W Y,LIU Y C,KIRA Z,et al.A Closer Look at Few-shot Classification[C]//International Conference on Learning Representations.2019.
[33]GIDARIS S,BURSUC A,KOMODAKIS N,et al.Boosting Few-shot Visual Learning with Self-supervision[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:8059-8068.
[34]SU J C,MAJI S,HARIHARAN B.When does Self-supervision Improve Few-shot Learning?[C]//European Conference on Computer Vision.2020:645-666.
[35]MISRA I,MAATEN L V D.Self-supervised Learning of Pretext-invariant Representations[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:6707-6717.
[36]HJELM R D,FEDOROV A,LAVOIE-MARCHILDON S,et al.Learning Deep Representations by Mutual Information Estimation and Maximization[C]//International Conference on Lear-ning Representations.2018.
[37]BACHMAN P,HJELM R D,BUCHWALTER W.LearningRepresentations by Maximizing Mutual Information Across Views[J].arXiv:1906.00910,2019.
[38]CHEN D,CHEN Y,LI Y,et al.Self-supervised Learning for Few-shot Image Classification[J].arXiv:1911.06045,2019.
[39]TSAI Y H,HUANG L K,SALAKHUTDINOV R.LearningRobust Visual-Semantic Embeddings[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:3571-3580.
[40]MANGLA P,KUMARI N,SINHA A,et al.Charting the Right Manifold:Manifold Mixup for Few-shot Learning[C]//Procee-dings of the IEEE/CVF Winter Conference on Applications of Computer Vision.2020:2218-2227.
[41]GUO H,MAO Y,ZHANG R.Mixup as Locally Linear Out-of-manifold Regularization[C]//Proceedings of the AAAI Confe-rence on Artificial Intelligence.2019:3714-3722.
[42]ZHANG H,CISSE M,DAUPHIN Y N,et al.Mixup:Beyond Empirical Risk Minimization[J].arXiv:1710.09412,2017.
[43]WANG Y X,GIRSHICK R,HEBERT M,et al.Low-shotLearning from Imaginary Data[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7278-7286.
[44]AZURI I,WEINSHALL A D.Generative Latent Implicit Conditional Optimization when Learning from Small Sample[C]//International Conference on Pattern Recognition.2020.
[45]SCHWARTZ E,KARLINSKY L,SHTOK J,et al.Delta-En-coder:An Effective Sample Synthesis Method for Few-shot Object Recognition[C]//International Conference on Neural Information Processing Systems.2018:2850-2860.
[46]DHILLON G S,CHAUDHARI P,RAVICHANDRAN A,et al.A Baseline for Few-Shot Image Classification[C]//International Conference on Learning Representations.2019.
[47]WANG Y X,HEBERT M.Learning to Learn:Model Regression Networks for Easy Small Sample Learning[C]//European Conference on Computer Vision.2016:616-634.
[48]HU Y,GRIPON V,PATEUX S.Leveraging the Feature Distribution in Transfer-based Few-shot Learning[J].arXiv:2006.03806,2020.
[49]VINYALS O,BLUNDELL C,LILLICRAP T,et al.MatchingNetworks for One shot Learning[C]//International Conference on Neural Information Processing Systems.2016:3637-3645.
[50]SUNG F,YANG Y,ZHANG L,et al.Learning to Compare:Relation Network for Few-shot Learning[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:1199-1208.
[51]GARCIA V,BRUNA J.Few-Shot Learning with Graph Neural Networks[C]//International Conference on Learning Representations.2018.
[52]KIM J,KIM T,KIM S,et al.Edge-labeling Graph Neural Network for Few-shot Learning[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:11-20.
[53]PHAM H,DAI Z,XIE Q,et al.Meta Pseudo Labels[J].arXiv:2003.10580,2020.
[54]LI J,WONG Y,ZHAO Q,et al.Learning to Learn from Noisy Labeled Data[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:5051-5059.
[55]FINN C,ABBEEL P,LEVINE S.Model-agnostic Meta-learning for Fast Adaptation of Deep Networks[C]//International Conference on Machine Learning.2017:1126-1135.
[56]NICHOL A,SCHULMAN J.Reptile:A Scalable Metalearning Algorithm[J].arXiv:1803.02999,2018.
[57]RAVI S,LAROCHELLE H.Optimization as a Model for Few-shot Learning[C]//International Conference on Learning Representations.2017.
[58]SUN Q,LIU Y,CHUA T S,et al.Meta-transfer Learning for Few-shot Learning[C]//Proceedings of the IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition.2019:403-412.
[59]LI Z,ZHOU F,CHEN F,et al.Meta-SGD:Learning to Learn Quickly for Few-shot Learning[J].arXiv:1707.09835,2017.
[60]ANTONIOU A,EDWARDS H,STORKEY A.How to Train your MAML[J].arXiv:1810.09502,2018.
[61]ZIKO I,DOLZ J,GRANGER E,et al.Laplacian Regularized Few-shot Learning[C]//International Conference on Machine Learning.2020:11660-11670.
[62]SHYAM P,GUPTA S,DUKKIPATI A.Attentive RecurrentComparators[C]//International Conference on Machine Lear-ning.2017:3173-3181.
[63]SNELL J,SWERSKY K,ZEMEL R.Prototypical Networks for Few-shot Learning[C]//International Conference on Neural Information Processing Systems.2017:4080-4090.
[64]TRIANTAFILLOU E,ZEMEL R S,URTASUN R.Few-Shot Learning Through an Information Retrieval Lens[C]//International Conference on Neural Information Processing Systems.2017.
[65]LU Y Q,MIN W Q,DUAN H,et al.Few-shot Food Recognition Combining Triplet Convolutional Neural Network with Relational Network[J].Computer Science,2020,47 (1):136-143.
[66]KANG B,LIU Z,WANG X,et al.Few-shot Object Detection via Feature Reweighting[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:8420-8429.
[67]ROSTAMI M,KOLOURI S,EATON E,et al.Deep Transfer Learning for Few-shot SAR Image classification[J].Remote Sensing.2019,11(11):1374.
[68]WANG L,BAI X,ZHOU F.Few-Shot SAR ATR Based onConv-BiLSTM Prototypical Networks[C]//2019 6th Asia-Pacific Conference on Synthetic Aperture Radar (APSAR).2019:1-5.
[69]FAN Q,ZHUO W,TANG C K,et al.Few-shot Object Detection with Attention-RPN and Multi-relation Detector[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:4013-4022.
[70]LIU W,ZHANG C,LIN G,et al.Crnet:Cross-reference Net-works for Few-shot Segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:4165-4173.
[71]LI Y,YANG J.Meta-learning Baselines and Database for Few-shot Classification in Agriculture[J].Computers and Electro-nics in Agriculture,2021,182:106055.
[1] 黄丽, 朱焱, 李春平.
基于异构网络表征学习的作者学术行为预测
Author’s Academic Behavior Prediction Based on Heterogeneous Network Representation Learning
计算机科学, 2022, 49(9): 76-82. https://doi.org/10.11896/jsjkx.210900078
[2] 李宗民, 张玉鹏, 刘玉杰, 李华.
基于可变形图卷积的点云表征学习
Deformable Graph Convolutional Networks Based Point Cloud Representation Learning
计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023
[3] 方义秋, 张震坤, 葛君伟.
基于自注意力机制和迁移学习的跨领域推荐算法
Cross-domain Recommendation Algorithm Based on Self-attention Mechanism and Transfer Learning
计算机科学, 2022, 49(8): 70-77. https://doi.org/10.11896/jsjkx.210600011
[4] 武红鑫, 韩萌, 陈志强, 张喜龙, 李慕航.
监督和半监督学习下的多标签分类综述
Survey of Multi-label Classification Based on Supervised and Semi-supervised Learning
计算机科学, 2022, 49(8): 12-25. https://doi.org/10.11896/jsjkx.210700111
[5] 杜丽君, 唐玺璐, 周娇, 陈玉兰, 程建.
基于注意力机制和多任务学习的阿尔茨海默症分类
Alzheimer's Disease Classification Method Based on Attention Mechanism and Multi-task Learning
计算机科学, 2022, 49(6A): 60-65. https://doi.org/10.11896/jsjkx.201200072
[6] 王君锋, 刘凡, 杨赛, 吕坦悦, 陈峙宇, 许峰.
基于多源迁移学习的大坝裂缝检测
Dam Crack Detection Based on Multi-source Transfer Learning
计算机科学, 2022, 49(6A): 319-324. https://doi.org/10.11896/jsjkx.210500124
[7] 杨健楠, 张帆.
一种结合双注意力机制和层次网络结构的细碎农作物分类方法
Classification Method for Small Crops Combining Dual Attention Mechanisms and Hierarchical Network Structure
计算机科学, 2022, 49(6A): 353-357. https://doi.org/10.11896/jsjkx.210200169
[8] 朱旭东, 熊贇.
基于样本分布损失的图像多标签分类研究
Study on Multi-label Image Classification Based on Sample Distribution Loss
计算机科学, 2022, 49(6): 210-216. https://doi.org/10.11896/jsjkx.210300267
[9] 张文轩, 吴秦.
基于多分支注意力增强的细粒度图像分类
Fine-grained Image Classification Based on Multi-branch Attention-augmentation
计算机科学, 2022, 49(5): 105-112. https://doi.org/10.11896/jsjkx.210100108
[10] 谭珍琼, 姜文君, 任演纳, 张吉, 任德盛, 李晓鸿.
基于二分图的个性化学习任务分配
Personalized Learning Task Assignment Based on Bipartite Graph
计算机科学, 2022, 49(4): 269-281. https://doi.org/10.11896/jsjkx.210500125
[11] 左杰格, 柳晓鸣, 蔡兵.
基于图像分块与特征融合的户外图像天气识别
Outdoor Image Weather Recognition Based on Image Blocks and Feature Fusion
计算机科学, 2022, 49(3): 197-203. https://doi.org/10.11896/jsjkx.201200263
[12] 张舒萌, 余增, 李天瑞.
跨领域文本的可迁移情绪分析方法
Transferable Emotion Analysis Method for Cross-domain Text
计算机科学, 2022, 49(3): 218-224. https://doi.org/10.11896/jsjkx.210400034
[13] 许华杰, 陈育, 杨洋, 秦远卓.
基于混合样本自动数据增强技术的半监督学习方法
Semi-supervised Learning Method Based on Automated Mixed Sample Data Augmentation Techniques
计算机科学, 2022, 49(3): 288-293. https://doi.org/10.11896/jsjkx.210100156
[14] 李星燃, 张立言, 姚树婧.
结合特征融合和注意力机制的微表情识别方法
Micro-expression Recognition Method Combining Feature Fusion and Attention Mechanism
计算机科学, 2022, 49(2): 4-11. https://doi.org/10.11896/jsjkx.210900028
[15] 董琳, 黄丽清, 叶锋, 黄添强, 翁彬, 徐超.
人脸伪造检测泛化性方法综述
Survey on Generalization Methods of Face Forgery Detection
计算机科学, 2022, 49(2): 12-30. https://doi.org/10.11896/jsjkx.210900146
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!