计算机科学 ›› 2012, Vol. 39 ›› Issue (1): 175-177.

• 人工智能 • 上一篇    下一篇

一种基于邻域距离的聚类特征选择方法

秦奇伟,梁吉业,钱宇华   

  1. (计算智能与中文信息处理教育部重点实验室 太原030006); (山西大学计算机与信息技术学院 太原030006)
  • 出版日期:2018-11-16 发布日期:2018-11-16

Clustering Feature Selection Method Based on Neighborhood Distance

  • Online:2018-11-16 Published:2018-11-16

摘要: 针对高维复杂的符号数据集在聚类中的聚类效果差和计算耗时过大的问题,首先提出了一种基于部域距离的无监督特征选择算法,然后在选择到的特征子集上进行重新聚类,从而有效提高了聚类结果的精度,降低了聚类计算的计算耗时。实验结果表明,该算法可以找到有效的特征子集,提高数据集的聚类精度,降低面对高维复杂数据集聚类的计算耗时。

关键词: 特征选择,聚类计算,部域距离,属性重要度

Abstract: To overcome the limitation of bad results on clustering and time-consuming of existing clustering algorithm to high-dimensional data, we provided an unsupervised feature selection algorithm based on neighborhood distance, then we clustered again on the selected feature subset The use of the selected feature subset can improve clustering accuracy. The results of the experiment show that the method can find the valid features, and also improve the timcconsuming problems in clustering on high-dimensional data.

Key words: Feature selection, Clustering compute, Neighborhood distance, Attribute significance

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!