Computer Science ›› 2020, Vol. 47 ›› Issue (5): 103-109.doi: 10.11896/jsjkx.180601099

• Databωe & Big Data & Data Science • Previous Articles     Next Articles

Imbalance Data Classification Based on Model of Multi-class Neighbourhood Three-way Decision

XIANG Wei1, WANG Xin-wei2   

  1. 1 School of Intelligent Manufacturing,Sichuan University of Arts and Science,Dazhou,Sichuan 635000,China
    2 College of Computer Science,Sichuan University,Chengdu 610065,China
  • Received:2018-06-19 Online:2020-05-15 Published:2020-05-19
  • About author:XIANG Wei,born in 1976,associate professor.His main research interests include computer-based information proces-sing & intelligent algorithm.
  • Supported by:
    This work was supported by the Major Project of Sichuan Education Department (16ZB0360).

Abstract: Imbalance data classification is an important data classification problem,traditional classification algorithm does not have better classification effect for smaller class in imbalance data.Therefore,this paper proposed an algorithm of imbalance data classification based on multi-class neighbourhood three-way decision.In the case of mixed data and multiple classes,traditional three-way decision is firstly generalized,and the multi-class neighbourhood three-way decision model of mixed data is presented.Then,a setting method of self-adaption cost function is given in the model,and based on this method,the algorithm of imbalance data classification of multi-class neighbourhood three-way decision model is proposed.Simulation experiment results show that the proposed classification algorithm has better classification performance for imbalance data.

Key words: Classification, Cost function, Imbalance data, Self-adaption, Three-way decision

CLC Number: 

  • TP18
[1]ZHANG S,SADAOUI S,MOUHOUB M.An empirical analysis of imbalanced data classification[J].Computer & Information Science,2015,8(1):151-162.
[2]HE H B,GARCIA E.Learning from imbalanced data[J].IEEE Transactions on Knowledge & Data Engineering,2009,21(9):1263-1284.
[3]HE H L,ZHANG W Y,ZHANG S.A novel ensemble method for credit scoring:Adaption of different imbalance ratios[J].Expert Systems with Applications,2018,98(15):105-117.
[4]RIVERA W A.Noise reduction a priori synthetic over-sampling for class imbalanced data sets[J].Information Sciences,2017,408:146-161.
[5]DOUZAS G,BACAO F,LAST F.Improving imbalanced lear-ning through a heuristic oversampling method based on k-means and SMOTE[J].Information Sciences,2018,465:1-20.
[6]CORDÓN I,GARCÍA S,FERNÁNDEZ A,et al.Imbalance:Oversampling algorithms for imbalanced classification in R[J].Knowledge-Based Systems,2018,161:329-341.
[7]ZHU Y J,WANG Z,GAO D Q.Gravitational fixed radius nearest neighbor for imbalanced problem[J].Knowledge-Based Systems,2015,90:224-238.
[8]WU G,CHANG E.KBA:Kernel boundary alignment conside-ring imbalanced data distribution[J].IEEE Transactions on Knowledge & Data Engineering,2005,17(6):786-795.
[9]GUPTA D,RICHHARIYA B,BORAH P.A fuzzy twin support vector machine based on information entropy for class imbalance learning[J].Neural Computing & Applications,2018(3):1-12.
[10]DÍEZ-PASTOR J F,RODRÍGUEZ J J,GARCÍA-OSORIO C,et al.Random Balance:Ensembles of variable priors classifiers for imbalanced data[J].Knowledge-Based Systems,2015,85(2/3):96-111.
[11]KHAN S H,HAYAT M,BENNAMOUN M,et al.Cost-sensitive learning of deep feature representations from imbalanced data[J].IEEE Transactions on Neural Networks & Learning Systems,2018,29(8):3573-3587.
[12]CAO C J,WANG Z.IMCStacking:Cost-sensitive stacking lear-ning with feature inverse mapping for imbalanced problems[J].Knowledge-Based Systems,2018,150(15):27-37.
[13]YAO Y Y.Three-way decisions with probabilistic rough sets[J].Information Sciences,2010,180(3):341-353.
[14]ZHOU B.Multi-class decision-theoretic rough sets[J].International Journal of Approximate Reasoning,2014,55(1):211-224.
[15]LIANG D C,LIU D,KOBINA A.Three-way group decisions with decision-theoretic rough sets[J].Information Sciences,2016,345:46-64.
[16]CHEN Y F,YUE X D,FUJITA H,et al.Three-way decision support for diagnosis on focal liver lesions[J].Knowledge-Based Systems,2017,127:85-99.
[17]LIU D,LI T R,LI H X.A multiple-category classification approach with decision-theoretic rough sets[J].Fundamenta Informaticae,2012,115(2/3):173-188.
[18]LI W W,HUANG Z Q,JIA X Y,et al.Neighborhood based decision-theoretic rough set models[J].International Journal of Approximate Reasoning,2016,69:1-17.
[19]HU Q H,YU D R,LIU J F,et al.Neighborhood rough set based heterogeneous feature subset selection[J].Information Sciences,2008,178(18):3577-3594.
[20]HU Q H,YU D R,XIE Z X.Neighborhood classifiers[J].Expert Systems with Applications,2008,34(2):866-876.
[21]KUBAT M,HOLTE R,MATWIN S.Learning when negative examples abound[C]//European Conference on Machine Lear-ning.Springer Berlin Heidelberg,1997:146-153.
[22]DAVIS J,GOADRICH M.The relationship between Precision-Recall and ROC curves[C]//Proceedings of the,International Conference on Machine Learning(ICML 2006).New York,USA:ACM Press,2006:233-240.
[23]FAWCETT T.An introduction to ROC analysis[J].PatternRecognition Letters,2006,27(8):861-874.
[24]JIANG S Y,XIE Z Q,YU W.Classification of naive Bayes imbalanced data based on cost sensitive[J].Journal of Computer Research and Development,2011,48(S1):387-390.
[25]PATEL H,THAKUR G S.A hybrid weighted nearest neighbor approach to mine imbalanced data[C]//International Conference on Data Mining.Las Vegas:IEEE,2016:106-112.
[1] CHEN Zhi-qiang, HAN Meng, LI Mu-hang, WU Hong-xin, ZHANG Xi-long. Survey of Concept Drift Handling Methods in Data Streams [J]. Computer Science, 2022, 49(9): 14-32.
[2] ZHOU Xu, QIAN Sheng-sheng, LI Zhang-ming, FANG Quan, XU Chang-sheng. Dual Variational Multi-modal Attention Network for Incomplete Social Event Classification [J]. Computer Science, 2022, 49(9): 132-138.
[3] HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[4] TAN Ying-ying, WANG Jun-li, ZHANG Chao-bo. Review of Text Classification Methods Based on Graph Convolutional Network [J]. Computer Science, 2022, 49(8): 205-216.
[5] YAN Jia-dan, JIA Cai-yan. Text Classification Method Based on Information Fusion of Dual-graph Neural Network [J]. Computer Science, 2022, 49(8): 230-236.
[6] WU Hong-xin, HAN Meng, CHEN Zhi-qiang, ZHANG Xi-long, LI Mu-hang. Survey of Multi-label Classification Based on Supervised and Semi-supervised Learning [J]. Computer Science, 2022, 49(8): 12-25.
[7] GAO Zhen-zhuo, WANG Zhi-hai, LIU Hai-yang. Random Shapelet Forest Algorithm Embedded with Canonical Time Series Features [J]. Computer Science, 2022, 49(7): 40-49.
[8] YANG Bing-xin, GUO Yan-rong, HAO Shi-jie, Hong Ri-chang. Application of Graph Neural Network Based on Data Augmentation and Model Ensemble in Depression Recognition [J]. Computer Science, 2022, 49(7): 57-63.
[9] ZHANG Hong-bo, DONG Li-jia, PAN Yu-biao, HSIAO Tsung-chih, ZHANG Hui-zhen, DU Ji-xiang. Survey on Action Quality Assessment Methods in Video Understanding [J]. Computer Science, 2022, 49(7): 79-88.
[10] SHAO Xin-xin. TI-FastText Automatic Goods Classification Algorithm [J]. Computer Science, 2022, 49(6A): 206-210.
[11] CHEN Jing-nian. Acceleration of SVM for Multi-class Classification [J]. Computer Science, 2022, 49(6A): 297-300.
[12] YANG Jian-nan, ZHANG Fan. Classification Method for Small Crops Combining Dual Attention Mechanisms and Hierarchical Network Structure [J]. Computer Science, 2022, 49(6A): 353-357.
[13] YANG Han, WAN You, CAI Jie-xuan, FANG Ming-yu, WU Zhuo-chao, JIN Yang, QIAN Wei-xing. Pedestrian Navigation Method Based on Virtual Inertial Measurement Unit Assisted by GaitClassification [J]. Computer Science, 2022, 49(6A): 759-763.
[14] PANG Xing-long, ZHU Guo-sheng. Survey of Network Traffic Analysis Based on Semi Supervised Learning [J]. Computer Science, 2022, 49(6A): 544-554.
[15] WANG Shan, XU Chu-yi, SHI Chun-xiang, ZHANG Ying. Study on Cloud Classification Method of Satellite Cloud Images Based on CNN-LSTM [J]. Computer Science, 2022, 49(6A): 675-679.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!