计算机科学 ›› 2013, Vol. 40 ›› Issue (Z11): 157-159.

• 智能控制与优化 • 上一篇    下一篇

基于属性值相关距离的KNN算法的改进研究

肖辉辉,段艳明   

  1. 河池学院 宜州546300;河池学院 宜州546300
  • 出版日期:2018-11-16 发布日期:2018-11-16
  • 基金资助:
    本文受广西教育厅科研基金项目(201106LX577,201106LX604),国家自然科学基金项目(40971234),河池学院青年科研项目(2012B-N005,2012B-N007)资助

Improved the KNN Algorithm Based on Related to the Distance of Attribute Value

XIAO Hui-hui and DUAN Yan-ming   

  • Online:2018-11-16 Published:2018-11-16

摘要: 样本距离机制的定义直接影响到KNN算法的准确性和效率。针对传统KNN算法在距离的定义及类别决定上的不足,提出了利用属性值对类别的重要性进行改进的KNN算法(FCD-KNN)。首先定义两个样本间的距离为属性值的相关距离,此距离有效度量了样本间的相似度。再根据此距离选取与待测试样本距离最小的K个近邻,最后根据各类近邻样本点的平均距离及个数判断待测试样本的类别。理论分析及仿真实验结果表明,FCD-KNN算法较传统KNN及距离加权-KNN的分类准确性要高。

关键词: KNN算法,相关距离,属性值,样本距离机制

Abstract: Definition of the samples will directly impact on the accuracy and the efficiency of KNN.In view of disadvantages to the traditional KNN algorithm on the distance the definition and categories of decision,proposed the use of attribute importance to category to improve KNN algorithm (FCD-KNN).At first,a distance of the two samples is defined as the correlation distance of the same attribute values.The distance can effectively measure the similarity degree of the two sample.Secondly,According to this distance selects the k nearest neighbors.Finally,the category of the test sample is decided by the average distance and the numbers on the respective category.The theoretical analysis and the simulation experiment show that compared with KNN and-KNN,raised the rate of accuracy enormously in classification.

Key words: KNN algorithm,Correlation distances,Attribute,Sample distance mechanism

[1] 王增民,王开珏.基于熵权的K最临近算法改进[J].计算机工程与应用,2009,45(30):129-131
[2] 周靖,刘晋胜.特征联合熵的一种改进k近邻分类算法[J].计算机应用,2011,7(7):1787-1792
[3] 陆微微,刘晶.一种提高k-近邻算法效率的新算法[J].计算机工程与应用,2008,44(4):163-165
[4] 周靖,刘晋胜.一种采用类相关度优化距离的KNN算法[J].微计算机应用,2010,31(11):7-12
[5] 杨立,左春,王裕国.基于语义距离的K-最近邻分类方法[J].软件学报,2005,16(12):2054-2062
[6] Wu Xin-dong,Kumar V,Quinlan J R,et al.Top 10Algorithms in Data Mining[J].Knowledge and Information Systems,2008,14(1):1-37
[7] 童先群,周忠眉.基于属性值信息熵的KNN改进算法[J].计算机工程与应用,2010,46(3):114-117
[8] 周靖,刘晋胜.基于特征熵相关度差异的KNN算法[J].计算机工程,2011,7(17):146-148

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!