计算机科学 ›› 2020, Vol. 47 ›› Issue (3): 67-72.doi: 10.11896/jsjkx.190100196

• 数据库&大数据&数据科学 • 上一篇    下一篇

基于距离比值尺度的模糊粗糙集属性约简

陈毅宁1,陈红梅2   

  1. (西南交通大学信息科学与技术学院 成都611756)1;
    (西南交通大学云计算与智能技术高校重点实验室 成都611756)2
  • 收稿日期:2019-01-23 出版日期:2020-03-15 发布日期:2020-03-30
  • 通讯作者: 陈红梅(hmchen@swjtu.edu.cn)
  • 基金资助:
    国家自然科学基金(61572406)

Attribute Reduction of Fuzzy Rough Set Based on Distance Ratio Scale

CHEN Yi-ning1,CHEN Hong-mei2   

  1. (School of Information Science and Technology, Southwest Jiaotong University, Chengdu 611756, China)1;
    (Key Laboratory of Cloud Computing and Intelligent Technology, Southwest Jiaotong University, Chengdu 611756, China)2
  • Received:2019-01-23 Online:2020-03-15 Published:2020-03-30
  • About author:CHEN Yi-ning,born in 1995,postgraduate.His main research interests include areas of rough set and so on. CHEN Hong-mei,born in 1971,Ph.D,professor,Ph.D supervisor,is member of China Computer Federation (CCF).Her main research interests include rough set,granular computing,and intelligent information processing.
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61572406).

摘要: 属性约简能有效地去除不必要属性,提高分类器的性能。模糊粗糙集是处理不确定信息的重要范式,能有效地应用于属性约简。在模糊粗糙集中,样本分布的不确定性会影响对象的近似集,进而影响有效属性约简的获取。为有效地定义近似集,文中提出了基于距离比值尺度的模糊粗糙集,该模型引入了基于距离比值尺度的样本集的定义,通过对距离比值尺度的控制,避免了样本分布不确定性对近似集的影响;给出了该模型的基本性质,定义了新的依赖度函数,进而设计了属性约简算法;以SVM,NaiveBayes和J48作为测试分类器,在UCI数据集上评测所提算法的性能。实验结果表明,所提出的属性约简算法能够有效获取约简并提高分类的精度。

关键词: 距离比值尺度, 模糊粗糙集, 属性约简

Abstract: Attribute reduction can effectively remove the unnecessary attributes in order to improve the performance of the classifiers.Fuzzy rough set theory is an important formal of processing the uncertain information.In the fuzzy rough set model,the approximations of an object may be affected by uncertain distribution of samples.Consequently,acquiring effective attribute reduction may be influenced.In order to effectively define approximations,this paper proposed a novel fuzzy rough set model named distance ratio scale based fuzzy rough set.The definition of samples based on distance ratio scale is introduced.The influence of uncertain distribution of samples to approximations is avoided by controlling the distance ratio scale.The basic properties of this fuzzy rough set model are presented and the new dependent function is defined.Furthermore,the algorithm for attribute reduction is designed.SVM,NaiveBayes,and J48 were used as test classifier executed on UCI data sets to verify the performance of the proposed algorithm.The experimental results show that attribute reduction can be effectively obtained by the proposed attribute reduction algorithm and the classification precisions of classifiers are improved.

Key words: Attribute reduction, Distance ratio scale, Fuzzy rough set

中图分类号: 

  • TP301.6
[1]HONG R C,PAN J X,HAO S J,et al.Image quality assessment based on matching pursuit[J].Information Sciences,2014,273:196-211.
[2]HONG R C,WANG M,GAO Y,et al.Image annotation by mul- tiple-instance learning with discriminative feature mapping and selection[J].IEEE Transactions on Cybernetics,2014,44(5):669-680.
[3]LU J J,ZHAO T Z,ZHANG Y F.Feature selection based-on genetic algorithm for image annotation[J].Knowledge-Based Systems,2008,21(8):887-891.
[4]PAWLAK Z.Rough set[J].International Journal of Computer & Information Sciences,1982,11(5):341-356.
[5]CHEN J K,LI J J,LIN Y J.Computing connected components of simple undirected graphs based on generalized rough sets[J].Knowledge-Based Systems,2013,37:80-85.
[6]CHEN H M,LI T R,LUO C,et al.A decision-theoretic rough set approach for dynamic data mining[J].IEEE Transactions on Fuzzy Systems,2015,23(6):1958-1970.
[7]CHEN J K,LIN Y,LIN G,et al.The relationship between attribute reducts in rough sets and minimal vertex covers of graphs[J].Information Sciences,2015,325:87-97.
[8]LI J H,REN Y,MEI C L,et al.A comparative study of multigranulation rough sets and concept lattices via rule acquisition[J].Knowledge-Based Systems,2016,91:152-164.
[9]DUBOIS D,PRADE H.Rough fuzzy sets and fuzzy rough sets[J].International Journal of General Systems,1990,17(2/3):191-209.
[10]HU Q H,YU D R,LIU J F,et al.Neighborhood rough set based heterogeneous feature subset selection[J].Information Sciences,2008,178(18):3577-3594.
[11]JENSEN R,SHEN Q.Fuzzy-rough attribute reduction with application to web categorization[J].Fuzzy Sets and Systems,2004,141(3):469-485.
[12]SHEN Q,JENSEN R.Selecting informative features with fuzzy-rough sets and its application for complex systems monitoring[J].Pattern Recognition,2004,37(7):1351-1363.
[13]HU Q H,YU D R,XIE Z X.Information-preserving hybrid data reduction based on fuzzy-rough techniques[J].Pattern Recognition Letters,2006,27(5):414-423.
[14]WANG C Z,QI Y L,HE Q.Attribute reduction using distance-based fuzzy rough sets[C]∥International Conference on Machine Learning and Cybernetics.IEEE,2015:860-865.
[15]PAWLAK Z.Rough Sets:Theoretical Aspects of Reasoning about Data[M].Kluwer Academic Publishers,1992.
[16]ZHANG W X.Rough set theory and method[M].Beijing:Science Press,2001.
[17]YEUNG D S,CHEN D G,TSANG E C C,et al.On the generalization of fuzzy rough sets[J].IEEE Transactions on Fuzzy Systems,2005,13(3):343-361.
[18]MORSI N N,YAKOUT M M.Axiomatics for fuzzy rough sets[J].Fuzzy Sets and Systems,1998,100(1/2/3):327-342.
[19]CORTES C,VAPNIK V.Support-vector networks[J].Machine Learning,1995,20(3):273-297.
[20]HU Q H,ZHANG L,CHEN D G,et al.Gaussian kernel based fuzzy rough sets:Model,uncertainty measures and applications[J].International Journal of Approximate Reasoning,2010,51(4):453-471.
[1] 许思雨, 秦克云.
基于剩余格的模糊粗糙集的拓扑性质
Topological Properties of Fuzzy Rough Sets Based on Residuated Lattices
计算机科学, 2022, 49(6A): 140-143. https://doi.org/10.11896/jsjkx.210200123
[2] 王子茵, 李磊军, 米据生, 李美争, 解滨.
基于误分代价的变精度模糊粗糙集属性约简
Attribute Reduction of Variable Precision Fuzzy Rough Set Based on Misclassification Cost
计算机科学, 2022, 49(4): 161-167. https://doi.org/10.11896/jsjkx.210500211
[3] 王志成, 高灿, 邢金明.
一种基于正域的三支近似约简
Three-way Approximate Reduction Based on Positive Region
计算机科学, 2022, 49(4): 168-173. https://doi.org/10.11896/jsjkx.210500067
[4] 李艳, 范斌, 郭劼, 林梓源, 赵曌.
基于k-原型聚类和粗糙集的属性约简方法
Attribute Reduction Method Based on k-prototypes Clustering and Rough Sets
计算机科学, 2021, 48(6A): 342-348. https://doi.org/10.11896/jsjkx.201000053
[5] 曾惠坤, 米据生, 李仲玲.
形式背景中概念及约简的动态更新方法
Dynamic Updating Method of Concepts and Reduction in Formal Context
计算机科学, 2021, 48(1): 131-135. https://doi.org/10.11896/jsjkx.200800018
[6] 桑彬彬, 杨留中, 陈红梅, 王生武.
优势关系粗糙集增量属性约简算法
Incremental Attribute Reduction Algorithm in Dominance-based Rough Set
计算机科学, 2020, 47(8): 137-143. https://doi.org/10.11896/jsjkx.190700188
[7] 岳晓威, 彭莎, 秦克云.
基于面向对象(属性)概念格的形式背景属性约简方法
Attribute Reduction Methods of Formal Context Based on ObJect (Attribute) Oriented Concept Lattice
计算机科学, 2020, 47(6A): 436-439. https://doi.org/10.11896/JsJkx.191100011
[8] 徐怡,唐静昕.
基于优化可辨识矩阵和改进差别信息树的属性约简算法
Attribute Reduction Algorithm Based on Optimized Discernibility Matrix and Improving Discernibility Information Tree
计算机科学, 2020, 47(3): 73-78. https://doi.org/10.11896/jsjkx.190500125
[9] 侯成军,米据生,梁美社.
基于局部可调节多粒度粗糙集的属性约简
Attribute Reduction Based on Local Adjustable Multi-granulation Rough Set
计算机科学, 2020, 47(3): 87-91. https://doi.org/10.11896/jsjkx.190500162
[10] 郭庆春,马建敏.
对偶区间集概念格上区间集协调集的判定方法
Judgment Methods of Interval-set Consistent Sets of Dual Interval-set Concept Lattices
计算机科学, 2020, 47(3): 98-102. https://doi.org/10.11896/jsjkx.190500098
[11] 龙柄翰, 徐伟华, 张晓燕.
不协调目标信息系统中基于改进差别信息树的分布属性约简
Distribution Attribute Reduction Based on Improved Discernibility Information Tree in Inconsistent System
计算机科学, 2019, 46(6A): 115-119.
[12] 李艳, 张丽, 陈俊芬.
动态信息系统中基于序贯三支决策的属性约简方法
Attribute Reduction Method Based on Sequential Three-way Decisions in Dynamic Information Systems
计算机科学, 2019, 46(6A): 120-123.
[13] 李艳, 张丽, 王雪静, 陈俊芬.
优势-等价关系下序贯三支决策的属性约简
Attribute Reduction for Sequential Three-way Decisions Under Dominance-Equivalence Relations
计算机科学, 2019, 46(2): 242-148. https://doi.org/10.11896/j.issn.1002-137X.2019.02.037
[14] 姜泽华, 王怡博, 徐刚, 杨习贝, 王平心.
面向多尺度的属性约简加速器
Multi-scale Based Accelerator for Attribute Reduction
计算机科学, 2019, 46(12): 250-256. https://doi.org/10.11896/jsjkx.181102031
[15] 李仲玲, 米据生, 解滨.
不协调决策形式背景的属性约简
Attribute Reduction in Inconsistent Decision Formal Contexts
计算机科学, 2019, 46(12): 257-260. https://doi.org/10.11896/jsjkx.181102137
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!