计算机科学 ›› 2022, Vol. 49 ›› Issue (5): 152-158.doi: 10.11896/jsjkx.210300302

• 数据库&大数据&数据科学 • 上一篇    下一篇

基于三角不等式判定和局部策略的高效邻域覆盖模型

陈于思, 艾志华, 张清华   

  1. 重庆邮电大学计算智能重庆市重点实验室 重庆400065
  • 收稿日期:2021-03-31 修回日期:2021-10-24 出版日期:2022-05-15 发布日期:2022-05-06
  • 通讯作者: 陈于思(576131318@qq.com)
  • 基金资助:
    国家自然科学基金(61876201)

Efficient Neighborhood Covering Model Based on Triangle Inequality Checkand Local Strategy

CHEN Yu-si, AI Zhi-hua, ZHANG Qing-hua   

  1. Chongqing Key Laboratory of Computational Intelligence,Chongqing University of Posts and Telecommunications,Chongqing 400065,China
  • Received:2021-03-31 Revised:2021-10-24 Online:2022-05-15 Published:2022-05-06
  • About author:CHEN Yu-si,born in 1994,postgra-duate.His main research interests include rough sets,machine learning and uncertain information processing.
  • Supported by:
    National Natural Science Foundation of China(61876201).

摘要: 邻域覆盖模型由于其原理简单以及对复杂数据具有较好的处理能力,在分类任务中得到了广泛应用。然而,邻域覆盖模型普遍存在运行效率较低的问题,且缺乏相关研究工作。为解决此问题,在传统邻域覆盖模型中引入距离间的三角不等式关系以提升构建邻域的效率,同时引入局部策略,定义了局部邻域覆盖以提升构建邻域覆盖的效率。为提升运行效率,从两个角度对传统邻域覆盖模型进行了改进,提出了基于三角不等式判定和局部策略的邻域覆盖模型(Neighborhood Covering Model based on Triangle Inequality Check and Local Strategy,TI-LNC)。此外,当前基于邻域覆盖模型的分类算法通常仅根据邻域中心以及邻域半径对样本进行分类,缺乏对邻域内样本信息的使用,从而影响了分类精度。为提高邻域覆盖模型的分类精度,增加了对邻域内样本信息的考虑,并基于TI-LNC设计了新的分类算法。在10个UCI数据集上的实验结果表明,所提模型能达到较高的运行效率以及较好的分类精度,具有一定的合理性及有效性。

关键词: 局部邻域覆盖, 邻域粗糙集, 邻域覆盖模型, 三角不等式判定

Abstract: Neighborhood covering model is widely used in classification tasks for its simple mechanism and ability to handle complex data.However,the neighborhood covering model has the problem of low efficiency and lack of related research work.To solve this problem,triangle inequality between distances is introduced to improve the efficiency of constructing neighborhood.Meanwhile,local neighborhood covering is defined.The local strategy is used to improve the efficiency of constructing neighborhood covering.In summary,to improve the efficiency,traditional neighborhood covering model is improved from two perspectives,and a neighborhood covering model based on triangle inequality check and local strategy (TI-LNC) is proposed.In addition,current classification algorithms based on neighborhood covering models only classify samples based on neighborhood centers and neighborhood radius,and ignore the sample information in neighborhoods,which affects classification accuracy.To improve the classification accuracy of the neighborhood covering model,the consideration of sample information in the neighborhood is added,and a new classification algorithm based on TI-LNC is designed.The experimental results on 10 UCI data sets show that the proposed model which is reasonable and effective can achieve higher efficiency and better classification accuracy.

Key words: Local neighborhood covering, Neighborhood covering model, Neighborhood rough set, Triangle inequality check

中图分类号: 

  • TP391.9
[1]PAWLAK Z.Rough Set[J].International Journal of Computer and Information Sciences,1982,11(5):341-356.
[2]HU Q H,YU D R,XIE Z X.Neighborhood Classifiers[J].Ex-pert Systems with Applications,2008,34(2):866-876.
[3]LIN T Y.Granular Computing on Binary Relations I:Data mi-ning and Neighborhood Systems[J].Rough Sets in Knowledge Discovery,1998,1:107-121.
[4]WANG Q,QIAN Y H,LIANG X Y,et al.Local Neighborhood Rough Set[J].Knowledge-Based Systems,2018,153:53-64.
[5]HU M,TSANG E C C,GUO Y T,et al.A Novel Approach to Attribute Reduction Based on Weighted Neighborhood Rough Sets[J].Knowledge-Based Systems,2021,220(5):106908.
[6]CHEN Y M,XUE Y,MA Y,et al.Measures of Uncertainty for Neighborhood Rough Sets[J].Knowledge-Based Systems,2017,120:226-235.
[7]XU S P,YANG X B,YU H L,et al.Neighborhood Collaborative Representation Based Classification Method[J].Computer Science,2017,44(9):234-238.
[8]HU Q H,YU D R,LIU J F,et al.Neighborhood Rough SetBased Heterogeneous Feature Subset Selection[J].Information Sciences,2008,178(18):3577-3594.
[9]HU Q H,PEDRYCZ W,YU D R,et al.Selecting Discrete and Continuous Features Based on Neighborhood Decision Error Minimization[J].IEEE Transactions on Systems,Man,and Cybernetics,2009,40(1):137-150.
[10]YAO P,LU Y H.Neighborhood Rough Set and SVM BasedHybrid Credit Scoring Classifier[J].Expert Systems with Applications,2011,38(9):11300-11304.
[11]CHEN H M,LI T R,CAI Y,et al.Parallel Attribute Reduction in Dominance-based Neighborhood Rough Set[J].Information Sciences,2016,373:351-368.
[12]XIA S Y,ZHANG H,LI W H,et al.GBNRS:A Novel Rough Set Algorithm for Fast Adaptive Attribute Reduction in Classification[J/OL].IEEE Transactions on Knowledge and Data Engineering.https://ieeexplore.ieee.org/document/9099413.
[13]JIANG Z H,WANG Y B,XU G,et al.Multi-scale Based Acce-lerator for Attribute Reduction[J].Computer Science,2019,46(12):250-256.
[14]DU Y,HU Q H,ZHU P F,et al.Rule Learning for Classification based on Neighborhood Covering Reduction[J].Information Sciences,2011,181(24):5457-5467.
[15]ZHU P F,HU Q H,YU D R.Ensemble Learning Based on Randomized Attribute Selection and Neighborhood Covering Reduction[J].Acta Electronica Sinica,2012,40(2):273-279.
[16]ZHANG B W,MIN F,CIUCCI D.Representative-based Classification Through Covering-based Neighborhood Rough Sets[J].Applied Intelligence,2015,43(4):840-854.
[17]YUE X D,CHEN Y F,MIAO D Q,et al.Tri-partition Neighborhood Covering Reduction for Robust Classification[J].International Journal of Approximate Reasoning,2017,83:371-384.
[18]YUE X D,CHEN Y F,MIAO D Q,et al.Fuzzy Neighborhood Covering for Three-way Classification[J].Information Sciences,2020,507:795-808.
[19]YUE X D,ZHOU J,YAO Y Y,et al.Shadowed Neighborhoods Based on Fuzzy Rough Transformation for Three-way Classification[J].IEEE Transactions on Fuzzy Systems,2020,28(5):978-991.
[20]PAN Y W,PAN Z B,WANG Y K,et al.A New Fast Search Algorithm for Exact K-nearest Neighbors Based on Optimal Triangle-inequality-based Check Strategy[J].Knowledge-Based Systems,2019,189:105088.
[21]WANG X Y.A Fast Exact K-nearest Neighbors Algorithm for High Dimensional Search Using K-means Clustering and Triangle Inequality[C]//The 2011 International Joint Conference on Neural Networks.IEEE,2011:1293-1299.
[22]CHANG J Y,HE C X.K-means Algorithm Based on Triangle Inequality[J].Computer Engineering and Design,2007,28(21):5094-5096.
[23]WILSON D R,MARTINEZ T R.Improved Heterogeneous Dis-tance Functions[J].Journal of Artificial Intelligence Research,1997,6:1-34.
[24]ZHANG Z L,CAO Z Y,LI Y T.Research Based on Euclid Distance with Weights of K_means Algorithm[J].Journal of Zhengzhou University(Engineering Science),2010,31(1):89-92.
[1] 孙林, 黄苗苗, 徐久成.
基于邻域粗糙集和Relief的弱标记特征选择方法
Weak Label Feature Selection Method Based on Neighborhood Rough Sets and Relief
计算机科学, 2022, 49(4): 152-160. https://doi.org/10.11896/jsjkx.210300094
[2] 杨洁,王国胤,李帅.
基于边界域的邻域知识距离度量模型
Neighborhood Knowledge Distance Measure Model Based on Boundary Regions
计算机科学, 2020, 47(3): 61-66. https://doi.org/10.11896/jsjkx.190500174
[3] 饶梦,苗夺谦,罗晟.
一种粗糙不确定的图像分割方法
Rough Uncertain Image Segmentation Method
计算机科学, 2020, 47(2): 72-75. https://doi.org/10.11896/jsjkx.190500177
[4] 樊鑫,陈红梅.
基于差别矩阵和mRMR的分步优化特征选择算法
Stepwise Optimized Feature Selection Algorithm Based on Discernibility Matrix and mRMR
计算机科学, 2020, 47(1): 87-95. https://doi.org/10.11896/jsjkx.181202320
[5] 姜泽华, 王怡博, 徐刚, 杨习贝, 王平心.
面向多尺度的属性约简加速器
Multi-scale Based Accelerator for Attribute Reduction
计算机科学, 2019, 46(12): 250-256. https://doi.org/10.11896/jsjkx.181102031
[6] 孙林,潘俊方,张霄雨,王伟,徐久成.
一种基于邻域粗糙集的多标记专属特征选择方法
Multi-label-specific Feature Selection Method Based on Neighborhood Rough Set
计算机科学, 2018, 45(1): 173-178. https://doi.org/10.11896/j.issn.1002-137X.2018.01.030
[7] 惠景丽,潘巍,吴康康,周晓英.
基于非对称变邻域粗糙集模型的属性约简
Attribute Reduction Based on Asymmetric Variable Neighborhood Rough Set
计算机科学, 2015, 42(6): 282-287. https://doi.org/10.11896/j.issn.1002-137X.2015.06.059
[8] 陈涛,洪增林,邓方安.
基于优化的邻域粗糙集的混合基因选择算法
Hybrid Gene Selection Algorithm Based on Optimized Neighborhood Rough Set
计算机科学, 2014, 41(10): 291-294. https://doi.org/10.11896/j.issn.1002-137X.2014.10.061
[9] 胡清华,朱鹏飞,左明.
基于间隔分布集成优化的齿轮箱故障诊断
Gear Fault Diagnosis Based on Margin Distribution Ensemble Optimization
计算机科学, 2013, 40(4): 204-208.
[10] 韩虎,党建武,任恩恩.
基于邻域粗糙集的支持向量机分类方法研究
Research of Support Vector Classifier Based on Neighborhood Rough Set
计算机科学, 2010, 37(2): 229-231.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!