计算机科学 ›› 2014, Vol. 41 ›› Issue (12): 211-215.doi: 10.11896/j.issn.1002-137X.2014.12.046

• 人工智能 • 上一篇    下一篇

嵌入代价敏感的极限学习机相异性集成的基因表达数据分类

安春霖,陆慧娟,魏莎莎,杨小兵   

  1. 中国计量学院信息工程学院 杭州310018;中国计量学院信息工程学院 杭州310018;中国计量学院信息工程学院 杭州310018;中国计量学院信息工程学院 杭州310018
  • 出版日期:2018-11-14 发布日期:2018-11-14
  • 基金资助:
    本文受国家自然科学基金(61272315,60842009),浙江省自然科学基金(Y1110342),浙江省科技厅国际合作项目(2012C24030)资助

Dissimilarity Based Ensemble of Extreme Learning Machine with Cost-sensitive for Gene Expression Data Classification

AN Chun-lin,LU Hui-juan,WEI Sha-sha and YANG Xiao-bing   

  • Online:2018-11-14 Published:2018-11-14

摘要: 极限学习机的相异性集成算法(Dissimilarity Based Ensemble of Extreme Learning Machine,D-ELM)在基因表达数据分类中能够得到较稳定的分类效果,然而这种分类算法是基于分类精度的,当所给样本的误分类代价不相等时,不能直接实现代价敏感分类过程中的最小平均误分类代价的要求。通过在分类过程中引入概率估计以及误分类代价和拒识代价重新构造分类结果,提出了基于相异性集成极限学习机的代价敏感算法(CS-D-ELM)。该算法被运用到基因表达数据集上,得到了较好的分类效果。

关键词: 极限学习机,相异性集成,代价敏感,基因表达数据,分类

Abstract: Dissimilarity based ensemble of Extreme Learning Machine (D-ELM) gets stable classification results on gene expression data classification.While this algorithm is based on the classification accuracy,it cannot meet the requirement to get the minimum misclassification of cost-sensitive classification,when the misclassification costs are not equal.This paper used probability estimate and misclassification cost to reconstruct the classification results.Then we proposed the algorithm of Cost-sensitive Dissimilarity based ensemble of Extreme Learning Machine (CS-D-ELM).This algorithm is applied on the data of gene expression and the experiment demonstrates that it can get better result.

Key words: Extreme learning machine,Dissimilarity based ensemble,Cost-sensitive,Gene expression data,Classification

[1] Han J,Kamber M.Data mining:concepts and techniques[M].San Francisco CA:Morgan Kanfnann,2001:29-31
[2] 陆慧娟.基于基因表达数据的肿瘤分类算法研究[D].徐州:中国矿业大学,2012
[3] Huang G B,Zhu Q Y,Siew C K.Extreme learning machine:theo-ry and applications[J].Neurocomputing,2006(70):489-501
[4] Huang G B,Zhu Q Y,Siew C K.Extreme learning machine:anew learning scheme of feedforward neural networks[C]∥Proceedings of International Joint Conference on Neural Networks.2004:25-29
[5] Huang G B,Wang D H,Lan Y.Extreme Learning Machines:A Survey[J].International Journal of Machine Learning and Cybernetics,2011,2(2):107-122
[6] Huang G B,Ding X J,Zhou H M.Optimization method based extreme learning machine for classification[J].Neurocompu-ting,2010,4:155-163
[7] Lan Yuan,Yeng C-S,Huang Guang-bin.Ensemble of online sequential extreme learning machine[J].Neurocomputing,2009,73 (13-15):3391-3395
[8] Tian Hui-xin,Meng Bo.A new modeling method based on bagging ELM for day-ahead electricity price prediction[C]∥2010 IEEE Fifth International Conference Bio-Inspired Computing:Theories and Applications (BIC-TA).Changsha,China,2010:1076-1079
[9] Tian Hui-xin,Mao Zhi-zhong.An Ensemble ELM Based onModified AdaBoost.RT Algorithm for Predicting the Temperature of Molten Steel in Ladle Furnace[J].Automation Science and Engineering,2009,7(1):73-80
[10] Cao J W,Lin Z P,Huang G B,et al.Voting based extreme learning machine[J].Information Sciences,2012,185(1):66-77
[11] 陆慧娟,安春霖,马小平,等.基于输出不一致测度的极限学习机集成的基因表达数据分类[J].计算机学报,2013,6:341-348
[12] Foggia P,Sansone C,Torella F,et al.Mulit-classification:reject criteria for the bayesian combiner[J].Pattern Recognition,1999 (32):1435-1447
[13] 郑恩辉,李平,宋执环.代价敏感支持向量机[J].控制与决策,2006,1(4):473-476
[14] 邹超,郑恩辉,任玉玲,等.嵌入误分类代价和拒识代价的二元分类算法[J].广西师范大学学报:自然科学版,2010,8(3):201-208
[15] 付忠良.多分类问题代价敏感AdaBoost算法[J].自动化学报,2012,7(8):973-983
[16] 史小伍,陶红,阚今中,等.组合代价敏感支持向量机及其应用[J].计算机技术与发展,2012,2(5):71-78
[17] 万建武,杨明,陈银娟.代价敏感的半监督Laplacian支持向量机[J].电子学报,2012,0(7):1410-1415

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!