计算机科学 ›› 2015, Vol. 42 ›› Issue (Z6): 94-97.

• 智能计算 • 上一篇    下一篇

基于相对细化量的粗糙集属性约简算法

徐婕,郭明   

  1. 湖北大学计算机与信息工程学院 武汉430062,湖北大学计算机与信息工程学院 武汉430062
  • 出版日期:2018-11-14 发布日期:2018-11-14
  • 基金资助:
    本文受国家自然科学基金青年项目(61403132)资助

Attribute Reduction Algorithm Based on Relative Refinement Capacity

XU Jie and GUO Ming   

  • Online:2018-11-14 Published:2018-11-14

摘要: 属性约简是指将信息表中不影响决策或者分类的多余属性去掉,是粗糙集理论研究中的一个核心内容。现已证明寻找信息表的最小约简是一个NP-hard问题。目前提出的启发式算法一般没有同时考虑算法完备性和数据噪音这两个方面。在分析了属性的重要性是与其细化能力相关的基础上,提出使用相对细化量作为启发式信息的属性约简算法REDA。该算法能解决数据中的噪音问题,且从理论上和实例中证明是完备和有效的,而且求得最小约简的可能性要高于其它基于属性重要性的算法。实验也表明,REDA是一个高效的属性约简算法,能够有效地降低后继工作的时间和空间复杂度。

Abstract: Attribute reduction is one of the key problems in the research field of rough set theory,which can eliminate unnecessary attributes of the information table.However,the number of possible subsets is always very large when the number of attributes N is large,because there are 2N subsets for N attributes.Hence exhaustively examining all subsets of attributes for finding the minimal reduction is a NP-hard problem.Many heuristic algorithms are proposed to solve this difficulty.But most of them have not considered two problems at the same time which are the completeness and the data noise.An efficient algorithm named REDA that can process data noises was proposed in this paper.Firstly,the feature that the significance of attribute is related to the property of its refinement was analyzed.Then the relative refinement capacity was employed as the heuristic information.The completeness and the correctness of REDA were proved from the theoretic analysis.The examples show the minimal attribute reduction can be found by REDA in some cases with a higher probability than that gained by other approaches.Finally,the experiment indicates REDA is an efficient and complete attribute reduction algorithm,which can reduce the temporal and spatial complexity for the next work.

Key words: Rough set,Attribute reduction,Heuristic,Complete,Relative refinement capacity

[1] Pawlak Z.Rough set[J].International Journal of Computer and Information Science,1982,11(4):341-356
[2] Wong S K M,Ziarko W.On optimal decision rules in decision tables[J].Bulletin of Polish Academy of Sciences,1985,33(11/12):693-696
[3] 徐燕,怀进鹏,王兆其.基于区分能力大小的启发式约简算法及其应用[J].计算机学报,2003,6(1):97-103
[4] 王莎莎,江峰,王文鹏.基于相对决策熵与加权相似性的粗糙集数据补齐方法[J].计算机科学,2014,1(2):245-248
[5] 刘少辉等.Rough集高效算法的研究[J].计算机学报,2003,6(5):524-529
[6] 张迎春,王宇新,郭禾.基于有序差别集和属性重要性的属性约简[J].计算机科学,2011,8(10):243-247
[7] 叶东毅.Jelonek 属性约简算法的一个改进[J].电子学报,2000,8(12):81-82
[8] 张颖淳,苏伯洪,曹娟.基于粗糙集的属性约简在数据挖掘中的应用研究[J].计算机科学,2013,8(40):223-226
[9] 周建华,徐章艳,章晨光.改进的差别矩阵的快速属性约简算法[J].小型微型计算机系统,2014,35(4):831-834
[10] 张长胜.基于决策表的区分矩阵增量属性约简算法[J].计算机工程与应用,2012,8(35):110-113
[11] 马希骜,王国胤,于洪.决策域分布保持的启发式属性约简方法[J].软件学报,2014,5(8):1761-1780
[12] 焦娜.基于相容粗糙集的改进的基因特征选择方法[J].计算机科学,2013,0(Z6):125-128,0
[13] 张文修,等.Rough 集理论与方法[M].北京:科学出版社,2001
[14] 王国胤.Rough集理论和知识获取[M].西安:西安交通大学出版社,2001
[15] 徐章艳,刘作鹏,杨炳儒,等.一个复杂度为max(O(|C||U|),O(|C|2|U/C|))的快速属性约简算法[J].计算机学报,2006,29(3):391-399
[16] Newman D J,Hettich S,Blake C L,et al.UCI repository of machine learning databases.http://www.ics.uci.edu/~mlearn/MLRepository.html

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!