计算机科学 ›› 2025, Vol. 52 ›› Issue (2): 165-172.doi: 10.11896/jsjkx.231100202

• 数据库&大数据&数据科学 • 上一篇    下一篇

基于模糊邻域相对决策熵的属性约简算法

徐久成, 张杉, 白晴, 马妙贤   

  1. 河南师范大学计算机与信息工程学院 河南 新乡 453007
    河南师范大学智慧商务与物联网技术河南省工程实验室 河南 新乡 453007
  • 收稿日期:2023-11-30 修回日期:2024-06-07 出版日期:2025-02-15 发布日期:2025-02-17
  • 通讯作者: 张杉(2208183014@stu.htu.edu.cn)
  • 作者简介:(xjc@htu.edu.cn)
  • 基金资助:
    国家自然科学基金(61976082,62076089,62002103)

Attribute Reduction Algorithm Based on Fuzzy Neighborhood Relative Decision Entropy

XU Jiucheng, ZHANG Shan, BAI Qing, MA Miaoxian   

  1. College of Computer and Information Engineering,Henan Normal University,Xinxiang,Henan 453007,China
    Engineering Lab of Intelligence Business & Internet of Things,Henan Province,Henan Normal University,Xinxiang,Henan 453007,China
  • Received:2023-11-30 Revised:2024-06-07 Online:2025-02-15 Published:2025-02-17
  • About author:XU Jiucheng,born in 1964,Ph.D,professor,Ph.D supervisor.His main research interests include granular computing,data mining and bioinformatics.
    ZHANG Shan,born in 1998,postgra-duate.Her main research interests include data mining and fuzzy rough set.
  • Supported by:
    National Natural Science Foundation of China(61976082,62076089,62002103).

摘要: 针对模糊邻域粗糙集对数据分布敏感且无法有效评估密度差异较大数据集的分类不确定性这一问题,提出了一种基于模糊邻域相对决策熵的属性约简算法。首先,采用相对距离定义样本的分类不确定度,重塑模糊邻域相似关系,并结合变精度模糊邻域粗糙近似,减少样本被归入错误类别的可能性;其次,在信息观下将模糊邻域可信度和覆盖度引入信息熵,并与基于代数观构造的模糊邻域相对依赖度相结合,提出模糊邻域相对决策熵;最后,设计一种基于模糊邻域相对决策熵的属性约简算法,从信息观和代数观两个角度来评估属性的重要度。在8个公共数据集上将其与现有的6种属性约简算法进行对比实验,结果表明,所提算法能有效地测量不同数据分布下样本的不确定度,提高数据的分类性能。

关键词: 属性约简, 模糊邻域粗糙集, 分类不确定度, 信息熵

Abstract: Aiming at the problem that fuzzy neighborhood rough set is sensitive to data distribution and cannot effectively evaluate the classification uncertainty of datasets with large density differences,this paper proposes an attribute reduction algorithm based on fuzzy neighborhood relative decision entropy.Firstly,the classification uncertainty of the sample is defined by using the relative distance,thus remodeling the fuzzy neighborhood similarity relationship.Combined with the variable precision fuzzy neighborhood rough approximation,the possibility of the sample being classified into the wrong category is reduced.Secondly,the information entropy is augmented with the fuzzy neighborhood credibility and coverage under the information view,and this is integrated with the fuzzy neighborhood relative dependence constructed based on the algebraic view to introduce the fuzzy neighborhood relative decision entropy.Finally,an attribute reduction algorithm based on the fuzzy neighborhood relative decision entropy is designed to evaluate the importance of attributes from both the information and algebraic viewpoints.Comparative experiments with six existing attribute reduction algorithms on eight public datasets show that the proposed algorithm can effectively measure the uncertainty of samples under different data distributions and improve the classification performance of data.

Key words: Attribute reduction, Fuzzy neighborhood rough set, Classification uncertainty, Information entropy

中图分类号: 

  • TP181
[1]PAWLAK Z.Rough Sets[J].International Journal of Computer and Information Science,1982,11:341-356.
[2]HU S D,MIAO D Q,YAO Y Y.Three-way Label Propagation Based Semi-supervised Attribute Reduction[J].Chinese Journal of Computers,2021,44(11):2332-2343.
[3]CAO D T,SHU W H,QIAN J.Feature Selection AlgorithmBased on Rough Set and Density Peak Clustering[J].Computer Science,2023,50(10):37-47.
[4]HU Q H,YU R D,XIE Z X.Numerical Attribute Reduction Based on Neighborhood Granulation and Rough Approximation[J].Journal of Software,2008(3):640-649.
[5]DUBOIS D,PRADE H.Rough Fuzzy Setsand Fuzzy Rough Sets [J].International Journal of General Systems,1990,17(2/3):191-209.
[6]WANG C Z,SHAO M W,HE Q,et al.Feature Subset Selection Based on Fuzzy Neighborhood Rough Sets[J].Knowledge-Based Systems,2016,111:173-179.
[7]HU M,GUO Y T,CHEN D G,et al.Attribute Reduction Based on Neighborhood Constrained Fuzzy Rough Sets[J].Know-ledge-Based Systems,2023,274:110632.
[8]ZIARKO W.Variable Precision Rough Set Model[J].Journal of Computer and System Sciences,1993,46(1):39-59.
[9]HU Q H,AN S,YU D.Soft Fuzzy Rough Sets for Robust Feature Evaluation and Selection[J].Information Sciences,2010,180(22):4384-4400.
[10]HAMIDZADEH J,REZAEENIK E,MORADI M.PredictingUsers' Preferences by Fuzzy Rough Set Quarter-Sphere Support Vector Machine[J].Applied Soft Computing,2021,112:107740.
[11]AN S,ZHAO E H,WANG C Z,et al.Relative Fuzzy Rough Approximations for Feature Selection and Classification[J].IEEE Transactions on Cybernetics,2023,53(4):2200-2210.
[12]SUN L,WANG L Y,DING W P,et al.Neighborhood Multi-granulation Rough Sets-based Attribute Reduction Using Lebesgue and Entropy Measures in Incomplete Neighborhood Decision Systems[J].Knowledge-Based Systems,2020,192:105373.
[13]WANG G Y.Rough Reduction in Algebra View and Information View[J].International Journal of Intelligent Systems,2003,18(6):679-688.
[14]QU K L,XU J C,HAN Z Q,et al.Maximum Relevance Minimum Redundancy-based Feature Selection Using Rough Mutual Information in Adaptive Neighborhood Rough Sets[J].Applied Intelligence,2023,53(14):17727-17746.
[15]XU J C,SUN Y H,QU K L,et al.Online Group Streaming Feature Selection Using Entropy-based Uncertainty Measures for Fuzzy Neighborhood Rough Sets[J].Complex & Intelligent Systems,2022,8(6):5309-5328.
[16]SUN L,ZHANG X Y,QIAN Y H,et al.Feature SelectionUsing Neighborhood Entropy-based Uncertainty Measures for Gene Expression Data Classification[J].Information Sciences,2019,502:18-41.
[17]TSUMOTO S.Rough Sets and Current Trends in Computing[M].Berlin:Springer Berlin Heidelberg,2002:373-380.
[18]YUAN Z,CHEN H M,XIE P,et al.Attribute Reduction Me-thods in Fuzzy Rough Set Theory:An Overview,Comparative Experiments,And New Directions[J].Applied Soft Computing,2021,107:107353.
[19]XU J C,MENG X R,QU K L,et al.Feature Selection Method Based on Fuzzy Neighborhood Relative Dependency Mutual Information[J].Fuzzy Systems and Mathematics,2023,37(1):121-135.
[20]AN S,ZHANG M R,WANG C Z,et al.Robust Fuzzy Rough Approximations with KNN Granules for Semi-supervised Feature Selection[J].Fuzzy Sets and Systems,2023,461:108476.
[21]MIAO D Q,HU G R.A Heuristic Algorithm for Reduction of Knowledge[J].Journal of Computer Research & Development,1999(6):42-45.
[22]WANG G Y,YU H,YANG D C.Decision Table ReductionBased on Conditional Information Entropy[J].Chinese Journal of Computers,2002(7):759-766.
[23]HU Q H,XIE Z X,YU D R.Hybrid Attribute Reduction Based on A novel Fuzzy-rough Model and Information Granulation[J].Pattern Recognition,2007,40(12):3509-3521.
[24]WANG C Z,QI Y L,SHAO M W,et al.A Fitting Model for Feature Selection with Fuzzy Rough Sets[J].IEEE Transactions on Fuzzy Systems,2017,25(4):741-753.
[25]TAN A H,WU W Z,QIAN Y H,et al.Intuitionistic FuzzyRough Set-based Granular Structures and Attribute Subset Selection[J].IEEE Transactions on Fuzzy Systems,2019,27(3):527-539.
[26]SUN L,WANG L Y,QIAN Y H,et al.Feature Selection Using Lebesgue and Entropy Measures for Incomplete Neighborhood Decision Systems[J].Knowledge-Based Systems,2019,186:104942.
[27]XU J C,QU K L,MENG X R,et al.Feature Selection Based on Multiview Entropy Measures in Multiperspective Rough Set[J].International Journal of Intelligent Systems,2022,37(10):7200-7234.
[28]XU J C,MENG X R,QU K L,et al.Feature Selection UsingRelative Dependency Complement Mutual Information in Fitting Fuzzy Rough Set Model[J].Applied Intelligence,2023,53(15):18239-18262.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!