Computer Science ›› 2016, Vol. 43 ›› Issue (3): 62-67.doi: 10.11896/j.issn.1002-137X.2016.03.012

Previous Articles     Next Articles

Imbalanced Online Sequential Extreme Learning Machine Based on Principal Curve

WANG Jin-wan, MAO Wen-tao, WANG Li-yun and HE Ling   

  • Online:2018-12-01 Published:2018-12-01

Abstract: Many traditional machine learning methods tend to get biased classifier which leads to lower classification precision for minor class in sequential imbalanced data.To improve the classification accuracy of minor class,a new imbalanced online sequential extreme learning machine based on principal curve was proposed.The core idea of the method is to get balanced samples based on the distribution features of online sequential data,reducing the blindness in the process of synthetic minority,which contains two stages.In offline stage,the principal curve is introduced to establish the distribution model of two kinds of samples.Over-sampling is done by using SMOTE for minor class.Then the membership degree of each sample is set according to the projection distance respectively,and the majority and virtual minor samples are deleted according to the under interval.Then the initial model is established.In online stage,over-sampling is done by using SMOTE for online sequential minor samples,getting the balanced samples according to the under interval.Then network weight is updated dynamically.The proposed algorithm has upper bound of the loss of information through the theoretical proof.The experiment was taken on three UCI datasets and the real-world air pollutant forecasting dataset,which shows that the proposed method outperforms the traditional methods in terms of prediction accuracy and numerical stability.

Key words: Online sequential extreme learning machine,Imbalanced data,Principal curve,Synthetic minority over-sampling

[1] Yang Zhi-ming,Qiao Li-yan,Peng Xi-yuan.Research on Data-ming Method for Imbalanced Dataset Based on Improved SMOTE[J].Acta Electronica Sinica,2007,5(12A):22-26(in Chinese) 杨智明,乔立言,彭喜元.基于改进SMOTE的不平衡数据挖掘方法研究[J].电子学报,2007,5(12A):22-26
[2] Fu Zhong-liang.Cost-sensitive Ensemble Learning Algorithm forMulti-label Classification Problems[J].Acta Automatica Sinica,2014(6):1075-1085(in Chinese) 付忠良.多标签代价敏感分类集成学习算法[J].自动化学报,2014(6):1075-1085
[3] Zeng Hui.Research on Improved Weighted Support Vector Machine and Application In Fault Diagnosis Method[D].Guangzhou:South China University of Technology,2010(in Chinese) 曾辉.改进加权支持向量机的研究及在故障诊断中的应用[D].广州:华南理工大学,2010
[4] Zhang Chun-xia,Zhang Jiang-she.A Survey of Selective Ensemble learning Algorithm[J].Chinese J ournal of Computers,2011,4(8):1399-1410(in Chinese) 张春霞,张讲社.选择性集成学习算法综述[J].计算机学报,2011,4(8):1399-1410
[5] Rok B,Lara L.SMOTE for high-dimensional class-imbalanceddata[J].BMC Bioinformatics,2013,4(1):1-16
[6] Zeng Zhi-qiang,Wu Qun,Liao Bei-shui,et al.A Classification Method For Imbalance Data Set Based on Kernel SMOTE[J].Acta Electronica Sinica,2009,7(11):2489-2495(in Chinese) 曾志强,吴群,廖备水,等.一种基于核SMOTE的非平衡数据集分类方法[J].电子学报,2009,7(11):2489-2495
[7] Jeatrakul P,Wong KW,Fung C C.Classification of Imbalanced Data by Combining the Complementary Neural Network and SMOTE Algorithm[M]∥Neural Information Processing.Mo-dels and Applications,2010:152-159
[8] Zhai Y,Ma N,Ruan D.An effective over-sampling method for imbalanced data sets classification[J].Chinese Journal of Electronics, 2011,0(3):489-494
[9] Huang G-B,Zhou H,Ding X,et al.Extreme Learning Machine for Regression and Multiclass[J].IEEE Transactions on Systems,Man,and Cybernetics-Part B:Cybernetics,2012,42(2):513-529
[10] Liang N Y,Huang G B.A fast and accurate online sequential learning algorithm for feedforward networks[J].IEEE Trans Neural Networks,2006,7:1411-1423
[11] Yuan P,Ma H,Fu H.Hotspot-entropy based data forwarding in opportunistic social networks[J].Pervasive and Mobile Computing,2015(1),16(A):136-154
[12] Li Hao.Soft Sensing and its Applied Research Based on Principal Curves[D].Hangzhou:Zhejiang University,2013(in Chinese) 李浩.基于主曲线的软测量及其应用研究[D].杭州:浙江大学,2013
[13] Nele V,Enislay R,Chris Cornelis,et al.Preprocessing noisy imbalanced datasets using SMOTE enhanced with fuzzy rough prototype selection[J].Applied Soft Computing Journal,2014,22:511-517
[14] SMG.E-publication Download Page.http://www.smg.gov.mo/www/ccaa/pdf/e_pdf_download.php
[15] Newman D J,Hettich S,Blake C L,et al.UCI Repository of machine learning databases.Irvine,CA:University of California,Department of Information and Computer Science.http://www.ics.uci.edu/mlearn/ML Repository.html

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!