计算机科学 ›› 2009, Vol. 36 ›› Issue (10): 189-191.

• 人工智能 • 上一篇    下一篇

基于类标号扩展的半监督特征选择算法

王博,贾焰,田李   

  1. (国防科技大学计算机学院 长沙 410073); (94326部队 济南 250023)
  • 出版日期:2018-11-16 发布日期:2018-11-16
  • 基金资助:
    本文受863国家重点基金项目(2006AA01Z451l,2007AA01Z474,2007AA010502)资助。

Semi-supervised Feature Selection Algorithm Based on Extension of Label

WANG Bo, JIA Yan, TIAN Li   

  • Online:2018-11-16 Published:2018-11-16

摘要: 特征选择是数据挖掘、机器学习等领域的重要内容,在缺乏已标记样本的情况下,如何有效选择特征是一个非常值得研究的问题。基于集合间相关度与自相关度的定义,提出了一种新颖的半监督特征选择方法,从原始、少量、且已标记的训练样本出发,通过扩展类标号得到最终的聚类效果,采用复合的评价方法作为衡量特征子集的标准。大量实验结果表明,该算法是有效的。

关键词: 特征选择,半监督,集合相关度,集合自相关度

Abstract: Feature selection is an important step during data mining and machine learning. With the lack of labeled instances, the problem of effective selection is worthy of consideration. This paper proposed a novel semi-supervised fealure selection algorithm based on the definition of inter-set and infra-set correlation,which starts from the original and small labeled samples and gains the final clusters by extension of labels. A complex evaluation was utilized as criterion to find optimal feature subset. Finally, the experimental results demonstrate the efficacy of the algorithm.

Key words: Feature selection, Semi-supervised, Inter-set correlation, Intra-set correlation

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!