计算机科学 ›› 2010, Vol. 37 ›› Issue (10): 217-220.

• 人工智能 • 上一篇    下一篇

基于相容关系的基因选择方法

焦娜,苗夺谦   

  1. (同济大学计算机科学与技术系 上海201804) (华东政法大学信息科学与技术系 上海201620)
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受国家自然科学基金(60475019,60775036),教育部博士学科点专项科研基金(20060247039)资助.

Gene Selection with Tolerance Rough Set Theory from Gene Expression Data

JIAO Na,MIAO Duo-qian   

  • Online:2018-12-01 Published:2018-12-01

摘要: 有效的基因选择是对基因表达数据进行分析的重要内容。粗糙集作为一种软计算方法能够保持在数据集分类能力不变的基础上,对属性进行约简。由于基因表达数据的连续性,为了避免运用粗糙集方法所必需的离散化过程带来的信息丢失,将相容粗糙集应用于基因的特征选取,提出了基于相容关系的基因选择方法。首先,通过i检验对基因表达数据进行排列,选择评分靠前的若干基因;然后,通过相容粗糙集对这些基因进一步约简。在两个标准的基因表达数据上进行了实验,结果表明该方法是可行性和有效性的。

关键词: 基因表达数据,基因选择,属性约简,粗糙集,相容关系

Abstract: Efficient gene selection is a key procedure of the discriminant analysis of microarray data. Rough set theory is an efficient tool for further reducing redundancy. One limitation of rough set theory is the lack of effective methods for processing real-valued data. However, gene expression data sets are always continuous. Discretization methods can result in information loss. hhis paper investigated an approach combining feature ranking together with features selection based on tolerance rough set theory. To evaluate the performance of the proposed approach, we applied it to two benchmark gene expression data sets and compared our results with those obtained by conventional method. Experimental resups illustrate that our algorithm is more effective for selecting high discriminative genes in cancer classification task.

Key words: Gene expression data,Gene selection, Attribute reduction, Rough set theory, Tolerance relation

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!