计算机科学 ›› 2012, Vol. 39 ›› Issue (10): 193-197.

• 人工智能 • 上一篇    下一篇

多类别肿瘤基因表达谱的自动特征选择方法

高娟,王国胤,胡峰   

  1. (重庆邮电大学计算机科学与技术学院 重庆400065)
  • 出版日期:2018-11-16 发布日期:2018-11-16

Auto-selection of Informative Gene for Multi-class Tumor Gene Expression Profiles

  • Online:2018-11-16 Published:2018-11-16

摘要: 从信息学角度出发寻找肿瘤相关基因、发现肿瘤基因表达特征对肿瘤的诊断和治疗具有重要的生物学意义,而肿瘤与正常组织的分类是其中一个重要应用。根据多类别肿瘤基因表达谱,提出了一种自动特征选择方法。首先,结合非参数方法和filter思想,利用决策序列的随机性度量基因的权值并排序;然后,采用相关信息嫡进行冗余性排除,自动地选择出具有高分辨能力、低冗余度的特征基因子集。实验结果表明,提出的方法能从多类别肿瘤基因表达谱数据中自动选出30个具有良好分类能力的特征基因,且具有较高的正确识别率。

关键词: 肿瘤基因表达谱,特征选择,随机序列,相关信息墒

Abstract: In microarray analysis, the selection of informative gene is an essential issue for tissue classification and successful treatment because of its ability to improve the accuracy and decrease computational complexity. The ability of successfully distinguishing tumor from normal tissues using gene expression data is an important aspect of this novel approach for cancer classification. In this paper, a non-parameter method for autonomous selection of informative gene was proposed for processing multi-class tumor gene expression profile,which contained 218 tumor samples spanning 14common tumor types, as well as 90 normal tissue samples, to find a small subset of genes for distinguishing tumor from normal tissues. At First, the randomness of a decision sequence was defined to measure gene importance based on the non-parameter method and filter algorithm. I}hcn correlation information entropy was used to eliminate redundant genes and selected informative feature genes. As a result, 30 informative genes are selected as markers for making distinctions between different tumor tissues and their normal counterparts. Simulation experiment results show that the selected genes arc very efficient for distinguishing tumor from normal tissues. In the end, several methods for informative gene selection were also analyzed and compared to validate the feasibility and efficiency of the proposed method for dealing with tumor gene expression profiles.

Key words: Rumor gene expression, Feature selection, Random sequence, Correlation information entropy

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!