Computer Science ›› 2022, Vol. 49 ›› Issue (6A): 93-99.doi: 10.11896/jsjkx.210500047

• Intelligent Computing • Previous Articles     Next Articles

Review of Multi-instance Learning Algorithms

ZHAO Lu1, YUAN Li-ming2, HAO Kun1   

  1. 1 School of Computer and Information Engineering,Tianjin Chengjian University,Tianjin 300384,China
    2 School of Computer Science and Engineering,Tianjin University of Technology,Tianjin 300384,China
  • Online:2022-06-10 Published:2022-06-08
  • About author:ZHAO Lu,born in 1983,Ph.D,lecturer,is a member of China Computer Federation.Her main research interests include multi-instance learning and deep learning.
    YUAN Li-ming,born in 1982,Ph.D,associate professor,postgraduate supervisor,is a member of China Computer Federation.His main research interests include multi-instance learning and deep learning.
  • Supported by:
    Special Foundation for Technology Innovation of Tianjin(21YDTPJC00250),National Natural Science Foundation of China(61902273),Major Social Science Program of Tianjin Municipal Education Commission(2019JWZD02),New-Generation AI Science and Technology Major Project of Tianjin(18ZXZNGX00150) and Open Foundation of Key Laboratory of Computer Vision and Systems of Ministry of Education(TJUT-CVS20170001).

Abstract: Multi-instance learning(MIL) is a typical weakly supervised learning framework,where every training example,called bag,is a set of instances.Since the learning process of an MIL algorithm depends on only the labels of bags rather than those of any individual instances,MIL can fit well with applications in which instance labels are difficult to get.Recently,deep multi-instance learning methods attract widespread attention,so deep MIL has become a major research focus.This paper reviews some research progress of MIL.Firstly,MIL algorithms are divided into shallow and deep models according to their hierarchical structure.Secondly,various algorithms are reviewed and summarized in these two categories,and then different pooling methods of deep MIL models are analyzed.Moreover,the fundamental theorem of symmetric functions for models with set-type data as training samples and its application in deep MIL are expounded.Finally,the performance of different algorithms is compared and analyzed through experiments,and their interpretability is analyzed thoroughly.After that,problems to be further investigated are discussed.

Key words: Deep learning, Fundamental theorem of symmetric functions, Interpretability, Machine learning, Multi-instance learning, Multi-instance pooling

CLC Number: 

  • TP181
[1] SCHMIDHUBER J.Deep learning in neural networks:An overview[J].Neural Networks,2015,61:85-117.
[2] LECUN Y,BENGIO Y,HINTON G.Deep learning[J].Nature,2015,521(7553):436-444.
[3] LIU W,WANG Z,LIU X,et al.A survey of deep neural net-work architectures and their applications[J].Neurocomputing,2017,234:11-26.
[4] LITJENS G,KOOI T,BEJNORDI B E,et al.A survey on deep learning in medical image analysis[J].Medical Image Analysis,2017,42:60-88.
[5] ZHOU Z H.A brief introduction to weakly supervised learning[J].National Science Review,2018,5(1):44-53.
[6] ZHOU Z H.Multi-instance Learning[R].Report of State Key Laboratory for Novel Software Technology at Nanjing Univer-sity,Nanjing,2006:1-13.
[7] XU,Y,MO,T,FENG,Q,et al.Deep learning of feature representation with multiple instance learning for medical image analysis[C]//Proceedings of 2014 IEEE International Conference on Acoustics,Speech and Signal Processing.Italy,2014:1626-1630.
[8] DIETTERICH T G,LATHROP R H,LOZANO-PÉREZ T.Solving the multiple instance problem with axis-parallel rectangles[J].Artificial Intelligence,1997,89(1/2):31-71.
[9] AMORES J.Multiple instance classification:Review,taxonomyand comparative study[J].Artificial Intelligence,2013,201:81-105.
[10] MARON O,LOZANO-PÉREZ T.A framework for multiple-instance learning[C]//Proceedings of the 1997 Conference on Advances in Neural Information Processing Systems 10(NIPS '97).USA,1998:570-576.
[11] ZHANG Q,GOLDMAN S A.EM-DD:An improved multiple-instance learning technique[C]//Proceedings of the 14th International Conference on Neural Information Processing Systems:Natural and Synthetic(NIPS'01).USA,2001:1073-1080.
[12] ANDREWS S,TSOCHANTARIDIS I,HOFMANN T.Support vector machines for multiple-instance learning[C]//Advances in Neural Information Processing Systems.Canada,2002:577-584.
[13] ZHOU Z H,ZHANG M L.Neural networks for multi-instance learning[R].Report of Computer Science Technology Department at Nanjing University,Nanjing,2002:1-14.
[14] WANG J,ZUCKER J D.Solving multiple-instance problem:Alazy learning approach[C]//Proceedings of 17th International Conference on Machine Learning.USA,2000:1119-1126.
[15] ZHANG M L,ZHOU Z H.Adapting RBF neural networks to multi-instance learning[J].Neural Processing Letters,2006,23(1):1-26.
[16] GÄRTNER T,FLACH P A,KOWALCZYK A,et al.Multi-instance kernels[C]//Proceedings of International Conference on Machine Learning.Australia,2002:179-186.
[17] ZHOU Z H,SUN Y Y,LI Y F.Multi-instance learning by treating instances as non-i.i.d.samples[C]//Proceedings of International Conference on Machine Learning.Canada,2009:1249-1256.
[18] CHEN Y,WANG J Z.Image categorization by learning and reasoning with regions[J].Journal of Machine Learning Research,2004,5:913-939.
[19] CHEN Y,BI J,WANG J Z.MILES:Multiple-instance learning via embedded instance selection[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2006,28(12):1931-1947.
[20] ZHOU Z H,ZHANG M L.Solving multi-instance problemswith classifier ensemble based on constructive clustering[J].Knowledge and Information Systems,2007,11(2):155-170.
[21] ZHANG M L,ZHOU Z H.Multi-instance clustering with applications to multi-instance prediction[J].Applied Intelligence,2009,31(1):47-68.
[22] CHEPLYGINA V,TAX D M,LOOG M.Multiple instancelearning with bag dissimilarities[J].Pattern Recognition,2015,48(1):264-275.
[23] RAMON J,DE RAEDT L.Multi instance neural networks[C]//Proceedings of ICML-2000 Workshop on Attribute-Value and Relational Learning.USA,2000:53-60.
[24] ZHANG M L,ZHOU Z H.Improve multi-instance neural networks through feature selection[J].Neural Processing Letters,2004,19(1):1-10.
[25] WU J,YU Y,HUANG C,et al.Deep multiple instance learning for image classification and auto-annotation[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.USA,2015:3460-3469.
[26] WANG X,YAN Y,TANG,P,et al.Revisiting multiple instance neural networks[J].Pattern Recognition,2018,74:15-24.
[27] ILSE M,TOMCZAK J M,WELLING M.Attention-based deep multiple instance learning[C]//Proceedings of International Conference on Machine Learning.Sweden,2018:2132-2141.
[28] ZAHEER M,KOTTUR S,RAVANBAKHSH S,et al.Deepsets[C]//Advances in Neural Information Processing Systems.USA,2017:3391-3401.
[29] SHI X S,XING F Y,XIE Y P,et al.Loss-based attention for deep multiple instance learning[C]//Proceedings of the AAAI Conference on Artificial Intelligence.USA,2020:5742-5749.
[30] YAN Y,WANG X,GUO X,et al.Deep multi-instance learning with dynamic pooling[C]//Asian Conference on Machine Lear-ning.China,2018:662-677.
[31] TU M,HUANG J,HE X,et al.Multiple instance learning with graph neural networks[J].arXiv:1906.04881,2019.
[32] WANG X,YAN Y,TANG P,et al.Bag similarity network for deep multi-instance learning[J].Information Sciences,2019,504:578-588.
[33] ZHOU Z H,FENG J.Deep forest:Towards an alternative todeep neural networks[C]//Proceedings of the 26th InternationalJoint Conference on Artificial Intelligence.USA,2017:3553-3559.
[34] REN J,HOU B J,JIANG Y.Deep forest for multiple instance learning[J].Journal of Computer Research and Development,2019,56(8):1670-1676.
[35] LAROCHELLE H,MURRAY I.The neural autoregressivedistribution estimator[C]//International Conference on Artificial Intelligence and Statistics.USA,2011:29-37.
[36] VINYALS O,BENGIO S,KUDLUR M.Order matters:Se-quence to sequence for sets[J].arXiv:1511.06391,2015.
[37] GENS R,DOMINGOS P M.Deep symmetry networks[C]//Advances in Neural Information Processing Systems.Canada,2014:2537-2545.
[38] COHEN T,WELLING M.Group equivariant convolutional networks[C]//Proceedings of International Conference on Machine Learning.USA,2016:2990-2999.
[39] RAVANBAKHSH S,SCHNEIDER J,POCZOS B.Equivariance through parameter-sharing[C]//International Conference on Machine Learning.Australia,2017:2892-2901.
[40] FANG H,GUPTA S,IANDOLA F,et al.From captions to vi-sual concepts and back[C]//IEEE Conference on Computer Vision and Pattern Recognition.USA,2015:1473-1482.
[41] DIBA A,SHARMA V,PAZANDEH A,et al.Weakly supervised cascaded convolutional networks[C]//IEEE Conference on Computer Vision and Pattern Recognition.USA,2017:5131-5139.
[42] TANG P,WANG X,BAI X,et al.Multiple instance detection network with online instance classifier refinement[C]//IEEE Conference on Computer Vision and Pattern Recognition.USA,2017:3059-3067.
[43] ANGELIDIS S,LAPATA M.Multiple instance learning net-works for fine-grained sentiment analysis[J].Transactions of the Association for Computational Linguistics,2018,6:17-31.
[44] LIN Y,SHEN S,LIU Z,et al.Neural relation extraction with selective attention over instances[C]//Annual Meeting of the Association for Computational Linguistics.Association for Computational Linguistics,Germany,2016:2124-2133.
[45] E H H,ZHANG W J,XIAO S Q,et al.Survey of entity relationship extraction based on deep learning[J].Ruan Jian Xue Bao/Journal of Software,2019,30(6):1793-1818.
[46] LITJENS G,KOOI T,BEJNORDI B E,et al.A survey on deep learning in medical image analysis[J].Medical Image Analysis,2017,42:60-88.
[47] XU Y,MO T,FENG Q,et al.Deep learning of feature representation with multiple instance learning for medical image analysis[C]//International Conference on Acoustics,Speech and Signal Processing.Italy,2014:1626-1630.
[48] KRAUS O Z,BA J L,FREY B J.Classifying and segmenting microscopy images with deep multiple instance learning[J].Bioinformatics,2016,32(12):52-59.
[49] YAN Z,ZHAN Y,PENG Z,et al.Multi-instance deep learning:Discover discriminative local anatomies for bodypart recognition[J].IEEE Transactions on Medical Imaging,2016,35(5):1332-1343.
[50] ZHU W,LOU Q,VANG Y S,et al.Deep multi-instance networks with sparse label assignment for whole mammogram classification[C]//International Conference on Medical Image Computing and Computer-Assisted Intervention.Canada,2017:603-611.
[51] LIU M,ZHANG J,ADELI E,et al.Landmark-based deep multi-instance learning for brain disease diagnosis[J].Medical Image Analysis,2018,43:157-168.
[52] SONG W,REN D,YU J,et al.A new special video detection algorithm based on 3D convolution CoHOG and MIL[J].Chinese Journal of Computers,2019,42(1):149-163.
[53] TSENG S Y,LI J,WANG Y,et al.Multiple Instance DeepLearning for Weakly Supervised Small-Footprint Audio Event Detection[J].arXiv:1712.09673,2017.
[54] KONG Q,XU Y,WANG W,et al.Audio set classification with attention model:A probabilistic perspective[C]//International Conference on Acoustics,Speech and Signal Processing.Canada,2018:316-320.
[55] YU C,BARSIM K S,KONG Q,et al.Multi-level attentionmodel for weakly supervised audio classification[J].arXiv:1803.02353v1,2018.
[56] KONG Q,YU C,XU Y,et al.Weakly labelled audioset tagging with attention neural networks[J].IEEE/ACM Transactions on Audio,Speech,and Language Processing,2019,27(11):1791-1802.
[57] WANG Y,LI J,METZE F.A comparison of five multiple in-stance learning pooling functions for sound event detection with weak labeling[C]//International Conference on Acoustics,Speech and Signal Processing.UK,2019:31-35.
[58] LIU X,JIAO L,ZHAO J,et al.Deep multiple instance learning-based spatial-spectral classification for PAN and MS imagery[J].IEEE Transactions on Geoscience and Remote Sensing,2017,56(1):461-473.
[59] URIOT T.Learning with sets in multiple instance regressionapplied to remote sensing[J].arXiv:1903.07745,2019.
[60] OLIVA J B,DUBEY A,ZAHEER,M,et al.Transformation autoregressive networks[C]//International Conference on Machine Learning.Sweden,2018:3898-3907.
[1] WANG Ming, WU Wen-fang, WANG Da-ling, FENG Shi, ZHANG Yi-fei. Generative Link Tree:A Counterfactual Explanation Generation Approach with High Data Fidelity [J]. Computer Science, 2022, 49(9): 33-40.
[2] XU Yong-xin, ZHAO Jun-feng, WANG Ya-sha, XIE Bing, YANG Kai. Temporal Knowledge Graph Representation Learning [J]. Computer Science, 2022, 49(9): 162-171.
[3] RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[4] LENG Dian-dian, DU Peng, CHEN Jian-ting, XIANG Yang. Automated Container Terminal Oriented Travel Time Estimation of AGV [J]. Computer Science, 2022, 49(9): 208-214.
[5] NING Han-yang, MA Miao, YANG Bo, LIU Shi-chang. Research Progress and Analysis on Intelligent Cryptology [J]. Computer Science, 2022, 49(9): 288-296.
[6] TANG Ling-tao, WANG Di, ZHANG Lu-fei, LIU Sheng-yun. Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy [J]. Computer Science, 2022, 49(9): 297-305.
[7] HE Qiang, YIN Zhen-yu, HUANG Min, WANG Xing-wei, WANG Yuan-tian, CUI Shuo, ZHAO Yong. Survey of Influence Analysis of Evolutionary Network Based on Big Data [J]. Computer Science, 2022, 49(8): 1-11.
[8] LI Yao, LI Tao, LI Qi-fan, LIANG Jia-rui, Ibegbu Nnamdi JULIAN, CHEN Jun-jie, GUO Hao. Construction and Multi-feature Fusion Classification Research Based on Multi-scale Sparse Brain Functional Hyper-network [J]. Computer Science, 2022, 49(8): 257-266.
[9] WANG Jian, PENG Yu-qi, ZHAO Yu-fei, YANG Jian. Survey of Social Network Public Opinion Information Extraction Based on Deep Learning [J]. Computer Science, 2022, 49(8): 279-293.
[10] HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[11] JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[12] ZHANG Guang-hua, GAO Tian-jiao, CHEN Zhen-guo, YU Nai-wen. Study on Malware Classification Based on N-Gram Static Analysis Technology [J]. Computer Science, 2022, 49(8): 336-343.
[13] SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[14] HU Yan-yu, ZHAO Long, DONG Xiang-jun. Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification [J]. Computer Science, 2022, 49(7): 73-78.
[15] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!