Computer Science ›› 2013, Vol. 40 ›› Issue (4): 14-21.

Previous Articles     Next Articles

Multi-label Data Mining:A Survey

LI Si-nan,LI Ning and LI Zhan-huai   

  • Online:2018-11-16 Published:2018-11-16

Abstract: In traditional single-label data mining,each sample belongs to only one label.However,an real-world object usually associates with more than one attribute.Multi-label data mining is motivated by increasing requirements of mo-dern applications and is widely applied in many fields,such as semantic annotation of images and video,functional geno-mics,music categorization into emotions and directed marketing,which has attracted significant attention from a lot of researchers.This paper systematically introduced technology of multi-label data mining from two aspects:methods and evaluation metrics.Finally,we summarized some problems and challenges in current study and gave prospects of the tendency in this area.

Key words: Multi-label,Data mining,Classification,Ranking,Metrics

[1] Matthew R B,Luo Jie-bo,Shen Xi-peng,et al.Learning multi-label scene classification[J].Pattern Recognition,2004(37):1757-1771
[2] Zhang Min-ling,Zhou Zhi-hua.Ml-kNN:A lazy learning ap-proach to multi-label learning[J].Pattern Recognition,2007(40):2038-2048
[3] Xu Xin-shun,Jiang Yuan,Peng Liang,et al.Ensemble approach based on conditional random field for multi-label image and videoannotation[C]∥Proceedings of the 19th ACM international conference on Multimedia.Scottsdale,Arizona,USA,2011:1377-1380
[4] Wang Jing-dong,Zhao Ying-hai,Wu Xiu-qing,et al.A transductive multi-label learning approach for video concept detection[J].Pattern Recognition,2011(44):10-11
[5] Snoek C,Worring M,Gemert J V,et al.The challenge problem for automated detection of 101semantic concepts in multimedia[C]∥Proceedings of the ACM Multimedia.Santa Barbara,CA,USA,2006:421-430
[6] Clare A,King R.Knowledge discovery in multi-label phenotype data[C]∥Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery (PKDD).Freiburg,Germany,2001:42-53
[7] Elisseeff A,Weston J.A kernel method for multi-labelled classification[J].Advances in Neural Information Processing Systems,2001(14):681-687
[8] Blockeel H,Schietgat L,Struyf J,et al.Decision trees for hierarchical multilabel classification:A case study in functional genomics[C]∥Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 4213LNAI.2006:18-29
[9] Cesa-Bianchi N,Gentile C,Zaniboni L.Hierarchical classifica-tion:combining bayes with svm[C]∥Proceedings of the 23rd International Conference on Machine learning.2006:177-184
[10] Li Guo-zheng,You Ming-yu,Ge Lei,et al.Feature selection for semi-supervised multi-label learning with application to gene function analysis[C]∥Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology.Niagara Falls,New York,2010:354-357
[11] Li Tao, Ogihara M.Detecting emotion in music[C]∥Procee-dings of the International Symposium on Music Information Retrieval.Washington D.C.,USA,2003:239-240
[12] Sanden C,Zhang J Z.Enhancing multi-label music genre classification through ensemble techniques[C]∥Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’11).New York,USA,2011:705-714
[13] Wieczorkowska A,Synak P,Ras Z.Multi-label classification of emotions in music[C]∥Proceedings of the 2006International Conference on Intelligent Information Processing and Web Mi-ning (IIPWM).2006:307-315
[14] Trohidis K,Tsoumakas G,Kalliris G,et al.Multilabel classification of music into emotions[C]∥Proceeding of 9th International Conference on Music Information Retrieval (ISMIR).Philadelphia,PA,USA,2008
[15] Zhang Yi,Burer S,Street W N.Ensemble pruning via semi-defi-nite programming[J].Journal of Machine Learning Research,2006(7):1315-1338
[16] Tsoumakas G,Katakis I,Vlahavas I.Mining multi-label data(2nd edition)[M].Oded Maimon and Lior Rokach,eds.Data Mining and Knowledge Discovery Handbook,chapter 34,Springer,2010:667-685
[17] Lewis D D,Yang Yi-ming,Rose T G.RCV1:A new benchmark collection for text categorization research[J].The Journal of Machine Learning Research,2004(5):361-397
[18] Veloso A,Weira M J ,Zaki M.Calibrated lazy associative classification[C]∥Proceedings of the 23rd Brazilian Symposium on Databases.Sao Paulo,Brazil,2008:135-149
[19] Vembu S,Gartner T.Label ranking algorithms:A survey[J].Preference Learning,2009
[20] Brinker K,Furnkranz J,Hullermeier E.A unified model for multilabel classification and ranking[C]∥Proceedings of the 17th European Conference on Artificial Intelligence (ECAI ’06).Riva del Garda,Italy,2006:489-493
[21] Tsoumakas G,Katakis I.Multi-label classification:An overview[J].International Journal of Data Warehousing and Mining,2007(3):1-13
[22] Read J,Pfahringer B,Holmes G,et al.Classifier Chains forMulti-label Classification[J].Machine Learning,2011,85(3):333-359
[23] Hullermeier E,Furnkranz J,Cheng W,et al.Label ranking by learning pairwise preferences[J].Artificial Intelligence,2008,172(16/17):1897-1916
[24] Mencia E L,Furnkranz J.Pairwise learning of multilabel classifications with perceptrons[C]∥ IEEE International Joint Conference on Neural Networks (08’ IJCNN).Hong Kong,2008:2900-2907
[25] Furnkranz J,Hullermeier E,Mencia E L,et al.Multi-label classification via calibrated label ranking [J].Machine Learning,2008,73(2):133-152
[26] Read J.A pruned problem transformation method for multi-label classification[C]∥ Proceeding of New Zealand Computer Science Research Student Conference (08’ NZCSRS).2008:143-150
[27] Tsoumakas G,Vlahavas I.Random k-labelsets:An ensemblemethod for multilabel classification[C]∥ Proceedings of the 18th European Conference on Machine Learning (07’ ECML).Warsaw,Poland,2007:406-417
[28] Schapire R E,Singer Y.Boostexter:a boosting-based system for text categorization[J].Machine Learning,2000(39):135-168
[29] de Comite F,Gilleron R,Tommasi M.Learning multi-label alternating decision trees from texts and data[C]∥ Proceedings of the 3rd International Conference on Machine Learning and Data Mining in Pattern Recognition (MLDM).Leipzig,Germany,2003:35-49
[30] Zhang Min-ling,Zhou Zhi-hua.Multi-label neural networks with applications to functional genomics and text categorization[J].IEEE Transactions on Knowledge and Data Engineering,2006(18):1338-1351
[31] Crammer K,Singer Y.A family of additive online algorithms for category ranking[J].Journal of Machine Learning Research,2003(3):1025-1058
[32] Godbole S,Sarawagi S.Discriminative methods for multi-labeled classification[C]∥ Proceedings of the 8th Pacific-Asia Confe-rence on Knowledge Discovery and Data Mining (PAKDD).2004:22-30
[33] Luo Xiao,Zincir-Heywood A N.Evaluation of two systems on multi-class multi-label document classification[C]∥ Procee-dings of the 15th International Symposium on Methodologies for Intelligent Systems(ISMIS).Saratoga Springs,NY,2005:161-169
[34] Xu Jian-hua.Multi-label weighted k-nearest neighbor classifierwith adaptive weight estimation[C]∥Proceedings of the 18th international conference on Neural Information Processing Volume Part II (ICONIP’11).Berlin,Heidelberg,2011:79-88
[35] Xu Jian-hua.An efficient multi-label support vector machinewith a zero label[J].Expert Systems with Applications,2012(39):4796-4804
[36] Thabtah F,Cowling P,Peng Y.MMAC:A new multi-class,multi-label associative classification approach[C]∥Proceedings of the 4th IEEE International Conference on Data Mining(ICDM).2004:217-224
[37] Veloso A,Wagner M J,Goncalves M,et al.Multi-label lazy associative classification[C]∥Proceedings of the 11th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD).Warsaw,Poland,Springer,2007:605-612
[38] McCallum A.Multi-label text classification with a mixture model trained by EM[C]∥Proceedings of the AAAI’ 99Workshop on Text Learning.1999
[39] Ueda N,Saito K.Parametric mixture models for multi-labeled text[J].Advances in Neural Information Processing Systems,2003(15):721-728
[40] Streich A P,Buhmann J M.Classification of multi-labeled data:A generative approach[C]∥12th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD).Antwerp,Belgium,2008:390-405
[41] Ghamrawi N,McCallum A.Collective multi-label classification[C]∥Proceedings of the 2005ACM Conference on Information and Knowledge Management (CIKM).Bremen,Germany,2005:195-200
[42] 郑伟,王朝坤,刘璋,等.一种基于随机游走模型的多标签分类算法[J].计算机学报,2010,3(8):1418-1426
[43] Zhu Sheng-huo,Ji Xiang,Xu Wei.Multi-labelled classification using maximum entropy method[C]∥ Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.2005:274-281
[44] Yang Yi-ming.An evaluation of statistical approaches to text categorization[J].Journal of Information Retrieval,1999(1):67-88
[45] 牛罡,罗爱宝,商琳.半监督文本分类综述[J].计算机科学与探索,2011(4):313-323
[46] Yi Liu,Rong Jin,Liu Yang.Semi-supervised Multi-label Learning by Constrained Non-Negative Matrix Factorization[C]∥Proceedings of the 21st National Conference on Artificial Intelligence (AAAI).2006:421-426
[47] Zha Zheng-jun,Mei Tao,Wang Jing-dong,et al.Graph-BasedSemi-supervised Learning with Multi-label[C]∥ International Conference on Multimedia and Expo.2008:1321-1324
[48] Chen Gang,Song Yang-qiu,Wang Fei.Semi-supervised Multi-label Learning by Solving a Sylvester Equation[C]∥SDM.2008:410-419
[49] Chen W,Yan J,Zhang B,et al.Document transformationfor multi-label feature selection in text categorization[C]∥7th IEEE International Conference on Data Mining (ICDM’07).2007:451-456
[50] Cesa-Bianchi N,Gentile C,Zaniboni L.Incremental algorithmsfor hierarchical classification[J].Journal of Machine Learning Research,2006(7):31-54
[51] Esuli A,Fagni T,Sebastiani F.Boosting multi-label hierarchical text categorization[J].Information Retrieval,2008(11):287-313
[52] Tsoumakas G,Katakis I,Vlahavas I.Effective and efficient multilabel classification in domains with large number of labels[C]∥Proceeding of ECML/PKDD 2008Workshop on Mining Multidimensional Data (08’ MMD).2008:30-44
[53] 苏金树,张博峰,徐昕.基于机器学习的文本分类进展[J].软件学报,2006,7(9):1848-1859
[54] Chawla N V,Sylvester J C.Exploiting diversity in ensembles.Improving the performance on unbalanced datasets[C]∥ Proceedings of the 7th International Conference on Multiple Classifier Systems.2007:397-406
[55] Tahir M A,Kittler J,Ahmed,et al.Multilabel classification using heterogeneous ensemble of multi-label classifiers[J].Pattern Recognition Letters,2012,33(5):513-523
[56] Shi Chuan,Kong Xiang-nan,Yu P S,et al.Multi-label ensemble learning[C]∥Proceedings of the 2011European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD’11).Heidelberg,2011(3):223-239

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!