计算机科学 ›› 2022, Vol. 49 ›› Issue (5): 1-9.doi: 10.11896/jsjkx.210500128
彭云聪1,3, 秦小林1,2,3, 张力戈1,3, 顾勇翔1,3
PENG Yun-cong1,3, QIN Xiao-lin1,2,3, ZHANG Li-ge1,3, GU Yong-xiang1,3
摘要: 目前,以深度学习为代表的人工智能算法凭借超大规模数据集以及强大的计算资源,在图像分类、生物特征识别、医疗辅助诊断等领域取得了优秀的成果并成功落地。然而,在许多实际的应用场景中,因诸多限制,研究人员无法获取到大量样本或者获取样本的代价过高,因此研究图像分类任务在小样本情形下的学习算法成为了推动智能化进程的核心动力,同时也成为了当下的研究热点。小样本学习指在监督信息数量有限的情况下进行学习并解决问题的算法。首先,从机器学习理论的角度描述了小样本学习困难的原因;其次,根据小样本学习算法的设计动机将现有算法归为表征学习、数据扩充、学习策略三大类,并分析其优缺点;然后,总结了常用的小样本学习评价方法以及现有模型在公用数据集上的表现;最后,讨论了小样本图像分类技术的难点及未来的研究趋势,为今后的研究提供参考。
中图分类号:
[1]REN S,HE K,GIRSHICK R B,et al.Faster R-CNN:Towards Real-Time Object Detection with Region Proposal Networks[C]//International Conference on Neural Information Proces-sing Systems.2015. [2]KRIZHEVSKY A,SUTSKEVER I,HINTON G E.ImagenetClassificationwith Deep Convolutional Neural Networks[J].Advances in Neural Information Processing Systems,2012,25:1097-1105. [3]YAN L,ZHENG Y,CAO J.Few-shot Learning for Short Text Classification[J].Multimedia Tools Applications,2018,77(22):29799-29810. [4]LI F F,FERGUS R,PERONA P.One-shot Learning of Object Categories[J].IEEE Transactions on Pattern Analysis Machine Intelligence,2006,28(4):594-611. [5]LAKE B M,SALAKHUTDINOV R,TENENBAUM J B.One-shot Learning by Inverting a Compositional Causal Process[C]//Proceedings of the 26th International Conference on Neural Information Processing Systems.2013:2526-2534. [6]TANG K D,TAPPEN M F,SUKTHANKAR R,et al.Optimizing One-shot Recognition with Micro-set Learning[C]//2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.2010:3027-3034. [7]MUNKHDALAI T,YUAN X,MEHRI S,et al.Rapid Adaptation with Conditionally Shifted Neurons[C]//International Conference on Machine Learning.2018:3664-3673. [8]FINK M.Object Classification From a Single Example Utilizing Class Relevance Metrics[C]//International Conference on Neural Information Processing Systems.2004:449-456. [9]XU Z,ZHU L,YANG Y.Few-shot Object Recognition fromMachine-labeled Web Images[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:1164-1172. [10]SIMON C,KONIUSZ P,NOCK R,et al.Adaptive Subspaces for Few-shot Learning[C]//Proceedings of the IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition.2020:4136-4145. [11]CHEN Z,FU Y,ZHANG Y,et al.Multi-level Semantic Feature Augmentation for One-shot Learning[J].IEEE Transactions on Image Processing,2019,28(9):4594-4605. [12]DOERSCH C,GUPTA A,EFROS A A.Unsupervised Visual Representation Learning by Context Prediction[C]//Procee-dings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:1422-1430. [13]PATHAK D,KRAHENBUHL P,DONAHUE J,et al.Context Encoders:Feature Learning by Inpainting[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:2536-2544. [14]ZHANG R,ISOLA P,EFROS A A.Colorful Image Colorization[C]//European Conference on Computer Vision.2016:649-666. [15]GIDARIS S,SINGH P,KOMODAKIS N.Unsupervised Representation Learning by Predicting Image Rotations[C]//International Conference on Learning Representations.2018. [16]RITCHIE D,WANG K,LIN Y A.Fast and Flexible IndoorScene Synthesis via Deep Convolutional Generative Models[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2019. [17]CHEN M,FANG Y,WANG X,et al.Diversity Transfer Network for Few-shot Learning[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2020:10559-10566. [18]SANTORO A,BARTUNOV S,BOTVINICK M,et al.Meta-learning with Memory-augmented Neural Networks[C]//International Conference on Machine Learning.2016:1842-1850. [19]WANG Y X,HEBERT M.Learning from Small Sample Sets by Combining Unsupervised Meta-training with CNNs[C]//International Conference on Neural Information Processing Systems.2016:244-252. [20]JAMAL M A,QI G J.Task Agnostic Meta-learning for Few-shot Learning[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:11719-11727. [21]MISHRA N,ROHANINEJAD M,CHEN X,et al.A SimpleNeural Attentive Meta-Learner[C]//International Conference on Learning Representations.2018. [22]RUSU A A,RAO D,SYGNOWSKI J,et al.Meta-Learning with Latent Embedding Optimization[C]//International Conference on Learning Representations.2018. [23]ZHOU F,WU B,LI Z.Deep Meta-learning:Learning to Learn in The Concept Space[J].arXiv:1802.03596,2018. [24]BERTINETTO L,HENRIQUES J,TORR P,et al.Meta-lear-ning with Differentiable Closed-form Solvers[C]//International Conference on Learning Representations.2019. [25]ZHAO K L,JIN X L,WANG Y Z.Survey on Few-shot Lear-ning[J].Journal of Software,2021,32(2):349-369. [26]LIU Y,LEI Y B,FAN J L,et al.Survey on Image Classification Technology Based on Small Sample Learning[J].Acta Automatica Sinica,2021,47(2):297-315. [27]BOTTOU L,CURTIS F E,NOCEDAL J.Optimization Methods for Large-scale Machine Learning[J].Siam Review,2018,60(2):223-311. [28]BOTTOU L,BOUSQUET O.The Tradeoffs of Large ScaleLearning[C]//International Conference on Neural Information Processing Systems.2007:161-168. [29]WANG Y,YAO Q,KWOK J T,et al.Generalizing from A Few Examples:A Survey on Few-shot Learning[J].ACM Computing Surveys,2020,53(3):1-34. [30]QI H,BROWN M,LOWE D G.Low-shot Learning with Imprinted Weights[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:5822-5830. [31]KOCH G.Siamese Neural Networks for One-shot Image Recog-nition[C]//Internation Conference on Machine Learning Deep Learning Workshop.2015. [32]CHEN W Y,LIU Y C,KIRA Z,et al.A Closer Look at Few-shot Classification[C]//International Conference on Learning Representations.2019. [33]GIDARIS S,BURSUC A,KOMODAKIS N,et al.Boosting Few-shot Visual Learning with Self-supervision[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:8059-8068. [34]SU J C,MAJI S,HARIHARAN B.When does Self-supervision Improve Few-shot Learning?[C]//European Conference on Computer Vision.2020:645-666. [35]MISRA I,MAATEN L V D.Self-supervised Learning of Pretext-invariant Representations[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:6707-6717. [36]HJELM R D,FEDOROV A,LAVOIE-MARCHILDON S,et al.Learning Deep Representations by Mutual Information Estimation and Maximization[C]//International Conference on Lear-ning Representations.2018. [37]BACHMAN P,HJELM R D,BUCHWALTER W.LearningRepresentations by Maximizing Mutual Information Across Views[J].arXiv:1906.00910,2019. [38]CHEN D,CHEN Y,LI Y,et al.Self-supervised Learning for Few-shot Image Classification[J].arXiv:1911.06045,2019. [39]TSAI Y H,HUANG L K,SALAKHUTDINOV R.LearningRobust Visual-Semantic Embeddings[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:3571-3580. [40]MANGLA P,KUMARI N,SINHA A,et al.Charting the Right Manifold:Manifold Mixup for Few-shot Learning[C]//Procee-dings of the IEEE/CVF Winter Conference on Applications of Computer Vision.2020:2218-2227. [41]GUO H,MAO Y,ZHANG R.Mixup as Locally Linear Out-of-manifold Regularization[C]//Proceedings of the AAAI Confe-rence on Artificial Intelligence.2019:3714-3722. [42]ZHANG H,CISSE M,DAUPHIN Y N,et al.Mixup:Beyond Empirical Risk Minimization[J].arXiv:1710.09412,2017. [43]WANG Y X,GIRSHICK R,HEBERT M,et al.Low-shotLearning from Imaginary Data[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7278-7286. [44]AZURI I,WEINSHALL A D.Generative Latent Implicit Conditional Optimization when Learning from Small Sample[C]//International Conference on Pattern Recognition.2020. [45]SCHWARTZ E,KARLINSKY L,SHTOK J,et al.Delta-En-coder:An Effective Sample Synthesis Method for Few-shot Object Recognition[C]//International Conference on Neural Information Processing Systems.2018:2850-2860. [46]DHILLON G S,CHAUDHARI P,RAVICHANDRAN A,et al.A Baseline for Few-Shot Image Classification[C]//International Conference on Learning Representations.2019. [47]WANG Y X,HEBERT M.Learning to Learn:Model Regression Networks for Easy Small Sample Learning[C]//European Conference on Computer Vision.2016:616-634. [48]HU Y,GRIPON V,PATEUX S.Leveraging the Feature Distribution in Transfer-based Few-shot Learning[J].arXiv:2006.03806,2020. [49]VINYALS O,BLUNDELL C,LILLICRAP T,et al.MatchingNetworks for One shot Learning[C]//International Conference on Neural Information Processing Systems.2016:3637-3645. [50]SUNG F,YANG Y,ZHANG L,et al.Learning to Compare:Relation Network for Few-shot Learning[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:1199-1208. [51]GARCIA V,BRUNA J.Few-Shot Learning with Graph Neural Networks[C]//International Conference on Learning Representations.2018. [52]KIM J,KIM T,KIM S,et al.Edge-labeling Graph Neural Network for Few-shot Learning[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:11-20. [53]PHAM H,DAI Z,XIE Q,et al.Meta Pseudo Labels[J].arXiv:2003.10580,2020. [54]LI J,WONG Y,ZHAO Q,et al.Learning to Learn from Noisy Labeled Data[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:5051-5059. [55]FINN C,ABBEEL P,LEVINE S.Model-agnostic Meta-learning for Fast Adaptation of Deep Networks[C]//International Conference on Machine Learning.2017:1126-1135. [56]NICHOL A,SCHULMAN J.Reptile:A Scalable Metalearning Algorithm[J].arXiv:1803.02999,2018. [57]RAVI S,LAROCHELLE H.Optimization as a Model for Few-shot Learning[C]//International Conference on Learning Representations.2017. [58]SUN Q,LIU Y,CHUA T S,et al.Meta-transfer Learning for Few-shot Learning[C]//Proceedings of the IEEE/CVF Confe-rence on Computer Vision and Pattern Recognition.2019:403-412. [59]LI Z,ZHOU F,CHEN F,et al.Meta-SGD:Learning to Learn Quickly for Few-shot Learning[J].arXiv:1707.09835,2017. [60]ANTONIOU A,EDWARDS H,STORKEY A.How to Train your MAML[J].arXiv:1810.09502,2018. [61]ZIKO I,DOLZ J,GRANGER E,et al.Laplacian Regularized Few-shot Learning[C]//International Conference on Machine Learning.2020:11660-11670. [62]SHYAM P,GUPTA S,DUKKIPATI A.Attentive RecurrentComparators[C]//International Conference on Machine Lear-ning.2017:3173-3181. [63]SNELL J,SWERSKY K,ZEMEL R.Prototypical Networks for Few-shot Learning[C]//International Conference on Neural Information Processing Systems.2017:4080-4090. [64]TRIANTAFILLOU E,ZEMEL R S,URTASUN R.Few-Shot Learning Through an Information Retrieval Lens[C]//International Conference on Neural Information Processing Systems.2017. [65]LU Y Q,MIN W Q,DUAN H,et al.Few-shot Food Recognition Combining Triplet Convolutional Neural Network with Relational Network[J].Computer Science,2020,47 (1):136-143. [66]KANG B,LIU Z,WANG X,et al.Few-shot Object Detection via Feature Reweighting[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:8420-8429. [67]ROSTAMI M,KOLOURI S,EATON E,et al.Deep Transfer Learning for Few-shot SAR Image classification[J].Remote Sensing.2019,11(11):1374. [68]WANG L,BAI X,ZHOU F.Few-Shot SAR ATR Based onConv-BiLSTM Prototypical Networks[C]//2019 6th Asia-Pacific Conference on Synthetic Aperture Radar (APSAR).2019:1-5. [69]FAN Q,ZHUO W,TANG C K,et al.Few-shot Object Detection with Attention-RPN and Multi-relation Detector[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:4013-4022. [70]LIU W,ZHANG C,LIN G,et al.Crnet:Cross-reference Net-works for Few-shot Segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:4165-4173. [71]LI Y,YANG J.Meta-learning Baselines and Database for Few-shot Classification in Agriculture[J].Computers and Electro-nics in Agriculture,2021,182:106055. |
[1] | 黄丽, 朱焱, 李春平. 基于异构网络表征学习的作者学术行为预测 Author’s Academic Behavior Prediction Based on Heterogeneous Network Representation Learning 计算机科学, 2022, 49(9): 76-82. https://doi.org/10.11896/jsjkx.210900078 |
[2] | 李宗民, 张玉鹏, 刘玉杰, 李华. 基于可变形图卷积的点云表征学习 Deformable Graph Convolutional Networks Based Point Cloud Representation Learning 计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023 |
[3] | 方义秋, 张震坤, 葛君伟. 基于自注意力机制和迁移学习的跨领域推荐算法 Cross-domain Recommendation Algorithm Based on Self-attention Mechanism and Transfer Learning 计算机科学, 2022, 49(8): 70-77. https://doi.org/10.11896/jsjkx.210600011 |
[4] | 武红鑫, 韩萌, 陈志强, 张喜龙, 李慕航. 监督和半监督学习下的多标签分类综述 Survey of Multi-label Classification Based on Supervised and Semi-supervised Learning 计算机科学, 2022, 49(8): 12-25. https://doi.org/10.11896/jsjkx.210700111 |
[5] | 杜丽君, 唐玺璐, 周娇, 陈玉兰, 程建. 基于注意力机制和多任务学习的阿尔茨海默症分类 Alzheimer's Disease Classification Method Based on Attention Mechanism and Multi-task Learning 计算机科学, 2022, 49(6A): 60-65. https://doi.org/10.11896/jsjkx.201200072 |
[6] | 王君锋, 刘凡, 杨赛, 吕坦悦, 陈峙宇, 许峰. 基于多源迁移学习的大坝裂缝检测 Dam Crack Detection Based on Multi-source Transfer Learning 计算机科学, 2022, 49(6A): 319-324. https://doi.org/10.11896/jsjkx.210500124 |
[7] | 杨健楠, 张帆. 一种结合双注意力机制和层次网络结构的细碎农作物分类方法 Classification Method for Small Crops Combining Dual Attention Mechanisms and Hierarchical Network Structure 计算机科学, 2022, 49(6A): 353-357. https://doi.org/10.11896/jsjkx.210200169 |
[8] | 朱旭东, 熊贇. 基于样本分布损失的图像多标签分类研究 Study on Multi-label Image Classification Based on Sample Distribution Loss 计算机科学, 2022, 49(6): 210-216. https://doi.org/10.11896/jsjkx.210300267 |
[9] | 张文轩, 吴秦. 基于多分支注意力增强的细粒度图像分类 Fine-grained Image Classification Based on Multi-branch Attention-augmentation 计算机科学, 2022, 49(5): 105-112. https://doi.org/10.11896/jsjkx.210100108 |
[10] | 谭珍琼, 姜文君, 任演纳, 张吉, 任德盛, 李晓鸿. 基于二分图的个性化学习任务分配 Personalized Learning Task Assignment Based on Bipartite Graph 计算机科学, 2022, 49(4): 269-281. https://doi.org/10.11896/jsjkx.210500125 |
[11] | 左杰格, 柳晓鸣, 蔡兵. 基于图像分块与特征融合的户外图像天气识别 Outdoor Image Weather Recognition Based on Image Blocks and Feature Fusion 计算机科学, 2022, 49(3): 197-203. https://doi.org/10.11896/jsjkx.201200263 |
[12] | 张舒萌, 余增, 李天瑞. 跨领域文本的可迁移情绪分析方法 Transferable Emotion Analysis Method for Cross-domain Text 计算机科学, 2022, 49(3): 218-224. https://doi.org/10.11896/jsjkx.210400034 |
[13] | 许华杰, 陈育, 杨洋, 秦远卓. 基于混合样本自动数据增强技术的半监督学习方法 Semi-supervised Learning Method Based on Automated Mixed Sample Data Augmentation Techniques 计算机科学, 2022, 49(3): 288-293. https://doi.org/10.11896/jsjkx.210100156 |
[14] | 李星燃, 张立言, 姚树婧. 结合特征融合和注意力机制的微表情识别方法 Micro-expression Recognition Method Combining Feature Fusion and Attention Mechanism 计算机科学, 2022, 49(2): 4-11. https://doi.org/10.11896/jsjkx.210900028 |
[15] | 董琳, 黄丽清, 叶锋, 黄添强, 翁彬, 徐超. 人脸伪造检测泛化性方法综述 Survey on Generalization Methods of Face Forgery Detection 计算机科学, 2022, 49(2): 12-30. https://doi.org/10.11896/jsjkx.210900146 |
|