Computer Science ›› 2020, Vol. 47 ›› Issue (3): 73-78.doi: 10.11896/jsjkx.190500125

• Database & Big Data & Data Science • Previous Articles     Next Articles

Attribute Reduction Algorithm Based on Optimized Discernibility Matrix and Improving Discernibility Information Tree

XU Yi1,2,TANG Jing-xin2   

  1. (Key Laboratory of Intelligent Computing and Signal Processing and Ministry of Education, Anhui University, Hefei 230039, China)1;
    (College of Computer Science and Technology, Anhui University, Hefei 230601, China)2
  • Received:2019-05-22 Online:2020-03-15 Published:2020-03-30
  • About author:XU Yi,born in 1981,Ph.D, associate professor, is member of ChinaComputer Federation.Her main research interests include intelligent information processing and rough set.
  • Supported by:
    This work was supported by the National Natural Science Foundation of China under Grant (61402005), Natural Science Foundation of Anhui Province under Grant (1308085QF114) and provincial Natural Science Foundation of Anhui higher education institute under Grant (KJ2013A015).

Abstract: Discernibility matrix expresses the distinguishing information of all objects in the information system with matrix elements,which provides a new idea for attribute reduction.However,the traditional discernibility matrix uses the core attributes to eliminate redundant element items after the construction is finished,ignoring the role of the core attributes in the matrix construction process.In response to this problem,the following research is done.Firstly,the definition of the discernibility matrix is optimized.Before calculating the distinguishing information of any two objects,it is first determined whether the values on the core attributes are equal.If not,the corresponding element items are directly recorded as ø,and the judgment of other attributes is ignored.Secondly,the concept of attribute weighted importance is proposed.The ratio of each condition attribute to the non-empty element term in the discernibility matrix (called macro importance) and the contribution of each attribute to the distinguishing object (called micro Importance) are comprehensively considered,and the rationality of the measurement method is illustrated by an example.Thirdly,aiming at the disadvantages that there are a lot of redundant elements and empty sets in the optimized discernibility matrix,by combining the concept of discernibility information tree,discernibility information tree based on optimized discernibility matrix and attribute weighted importance is proposed.All non-empty element items in the optimized discernibility matrix are sorted according to attribute weighted importance,so that attributes with high importance are shared by more nodes.Element items that do not contain core attributes are mapped to a path in the tree during the build process,while element items that contain core attributes are ignored.Finally,a reduction algorithm HSDI-tree based on optimized discernibility matrix and improving discernibility information tree is proposed.This paper compared the reduction results and the number of nodes of the HSDI-tree algorithm,CDI-tree,DI-tree and IDI-tree algorithms on the five data sets of UCI.The experimental results show that the HSDI-tree algorithm can effectively find the minimum attribute reduction and has better space compression ability.

Key words: Attribute importance, Attribute reduction, Discernibility information tree, Discernibility matrix, Rough set

CLC Number: 

  • TP181
[1]PAWLAK Z.Rough set[J].International Journal of Information and Computer Science,1982,11(5):341-356.
[2]SHEN Q,JENSEN R.Selecting informative features with fuzzy-rough sets and its application for complex systems monitoring[J].Pattern Recognition,2004,37(7):1351-1363.
[3]WEN S D,BAO Q H.A fast heuristic attribute reduction approach to ordered decision systems[J].European Journal ofOpe-rational Research,2018,264:440-452.
[4]ZHENG J G,YAN R X.Attribute Reduction Based on Cross Entropy in Rough Set Theory[J].Journal of Information & ComputationalScience,2012,9(3):745-750.
[5]WEN S D,BAO Q H.A fast heuristic attribute reduction approach to ordered decision systems[J].European Journal of Operational Research,2018,264:440-452.
[6]MAJI P,PAUL S.Rough Set Based Maximum Relevance-Maximum Significance Criterion and Gene Selection from Microarray Data[J].International Journal of Approximate Reasoning,2011,152:408-420.
[7]ELEYAN D.Ant Colony Optimization based Feature Selection in Rough Set Theory[J].International Journal of Computer Science and Electronics Engineering,2013,1(2):244-247.
[8]SKOWRON A,RAUSZER C.The discernibility matrices and functions in information systems [M]∥Intelligent Decision Support-handbook of Applicationsand Advances of the Rough Sets Theory.Dordrecht:Kluwer Academic Publisher,1991:331-362.
[9]YAO Y Y,ZHAO Y.Discernibility matrix simplification for constructing attribute reducts[J].Information Sciences,2009,179(5):867-882.
[10]PARTHALÁIN,NEIL M,SHEN Q,et al. distance measure approach to exploring the rough set boundary region for attribute reduction[J].IEEE Transactions on Knowledge and Data Engi- neering,2010,22(3):305-317.
[11]YAMAGUCHI D.Atribute dependency functions considering data Efficiency[J].International Journal of Approximate Reasoning,2009,51(1):89-98.
[12]FELIX R,USHIO T.Rough sets-based machine learning using a binary discernibility matrix[C]∥Proceeding of 2nd InternationalConference on Inteligent Procesing and Manufacturing of Materials.Ha-wai,1999:299-305.
[13]QIAN W B,XU Z Y,HUANG L Y,et al.Atribution reduction algorithm based on binary discernibility matrix of information entropy[J].Computer Enginering and Applications,2010,46(6):120-123.
[14]ZHI T Y,MIAO D Q.The Binary Discernibility Matrix’s Transformation and High Efficiency Attributes Reduction Algorith m’s Conformation[J].Computer Science,2002,29(2):140-143.
[15]JIANG Y.Attribute reduction with rough set based on discerni- bility information tree[J].Control & Decision,2015,30(8):1531-1536.
[16]PAWLAK Z,SKOWRON A.Rudiments of rough sets[J].Information Sciences,2007,177:3-27.
[17]PAWLAK Z.Rough Sets:Theoretical Aspects of Reasoning About Data[M].Boston:Kluwer Academic Publishers,1991.
[18]JIANG Y.Attribute reduction with rough set based on Improving discernibility information tree[J].Control & Decision,2019,34(6):135-140.
[19]YANG L,ZHANG X Y,XU W H.Attribute reduction of discernibility information tree in interval-valued ordered information system[J].Journal of Frontiers of Computer Science & Technology,2019(6):1062-1069.
[20]JIANG Y,YU Y.Minimal attribute reduction with rough set based on compactness discernibility information tree[J].Soft Computing,2016,20:2233-2243.
[1] CHENG Fu-hao, XU Tai-hua, CHEN Jian-jun, SONG Jing-jing, YANG Xi-bei. Strongly Connected Components Mining Algorithm Based on k-step Search of Vertex Granule and Rough Set Theory [J]. Computer Science, 2022, 49(8): 97-107.
[2] XU Si-yu, QIN Ke-yun. Topological Properties of Fuzzy Rough Sets Based on Residuated Lattices [J]. Computer Science, 2022, 49(6A): 140-143.
[3] FANG Lian-hua, LIN Yu-mei, WU Wei-zhi. Optimal Scale Selection in Random Multi-scale Ordered Decision Systems [J]. Computer Science, 2022, 49(6): 172-179.
[4] CHEN Yu-si, AI Zhi-hua, ZHANG Qing-hua. Efficient Neighborhood Covering Model Based on Triangle Inequality Checkand Local Strategy [J]. Computer Science, 2022, 49(5): 152-158.
[5] SUN Lin, HUANG Miao-miao, XU Jiu-cheng. Weak Label Feature Selection Method Based on Neighborhood Rough Sets and Relief [J]. Computer Science, 2022, 49(4): 152-160.
[6] WANG Zi-yin, LI Lei-jun, MI Ju-sheng, LI Mei-zheng, XIE Bin. Attribute Reduction of Variable Precision Fuzzy Rough Set Based on Misclassification Cost [J]. Computer Science, 2022, 49(4): 161-167.
[7] WANG Zhi-cheng, GAO Can, XING Jin-ming. Three-way Approximate Reduction Based on Positive Region [J]. Computer Science, 2022, 49(4): 168-173.
[8] XUE Zhan-ao, HOU Hao-dong, SUN Bing-xin, YAO Shou-qian. Label-based Approach for Dynamic Updating Approximations in Incomplete Fuzzy Probabilistic Rough Sets over Two Universes [J]. Computer Science, 2022, 49(3): 255-262.
[9] LI Yan, FAN Bin, GUO Jie, LIN Zi-yuan, ZHAO Zhao. Attribute Reduction Method Based on k-prototypes Clustering and Rough Sets [J]. Computer Science, 2021, 48(6A): 342-348.
[10] XUE Zhan-ao, SUN Bing-xin, HOU Hao-dong, JING Meng-meng. Optimal Granulation Selection Method Based on Multi-granulation Rough Intuitionistic Hesitant Fuzzy Sets [J]. Computer Science, 2021, 48(10): 98-106.
[11] WANG Xia, PENG Zhi-hua, LI Jun-yu, WU Wei-zhi. Method of Concept Reduction Based on Concept Discernibility Matrix [J]. Computer Science, 2021, 48(1): 125-130.
[12] ZENG Hui-kun, MI Ju-sheng, LI Zhong-ling. Dynamic Updating Method of Concepts and Reduction in Formal Context [J]. Computer Science, 2021, 48(1): 131-135.
[13] XUE Zhan-ao, ZHANG Min, ZHAO Li-ping, LI Yong-xiang. Variable Three-way Decision Model of Multi-granulation Decision Rough Sets Under Set-pair Dominance Relation [J]. Computer Science, 2021, 48(1): 157-166.
[14] SANG Bin-bin, YANG Liu-zhong, CHEN Hong-mei, WANG Sheng-wu. Incremental Attribute Reduction Algorithm in Dominance-based Rough Set [J]. Computer Science, 2020, 47(8): 137-143.
[15] CHEN Yu-jin, XU Ji-hui, SHI Jia-hui, LIU Yu. Three-way Decision Models Based on Intuitionistic Hesitant Fuzzy Sets and Its Applications [J]. Computer Science, 2020, 47(8): 144-150.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!