Computer Science ›› 2018, Vol. 45 ›› Issue (10): 217-224.doi: 10.11896/j.issn.1002-137X.2018.10.040

• Artificial Intelligence • Previous Articles     Next Articles

Attribute Reduction Algorithm Using Information Gain and Inconsistency to Fill

LI Hong-li, MENG Zu-qiang   

  1. College of Computer and Electronic Information,Guangxi University,Nanning 530004,China
  • Received:2017-08-10 Online:2018-11-05 Published:2018-11-05

Abstract: The attribute reduction of incomplete and inconsistent data is a major content of data mining.Combining information gain and inconsistent degree of data,this paper proposed an attribute reduction algorithm for incomplete and inconsistent data.First,the information gain is introduced,and the concept and algorithm formula of inconsistent degree are defined.Besides,the method of data filling based on information gain and inconsistent degree is given.Then,based on this data filling method,the attribute reduction algorithm is provided with the information gain under the condition of taking the maximum inconsistent degree as the weight and inconsistent degree as heuristic information.Finally,the experimental results demonstrate the effectiveness of the proposed algorithm.

Key words: Attribute reduction, Filling, Incomplete, Inconsistent, Information gain

CLC Number: 

  • TP181
[1]PAWLAK Z.Rough Sets:Theoretical Aspects of Reasoning about Data[M].Kluwer Academic Publishers,1991,9:24-26.
[2]STEFANOWSKI J,TSOUKIS A.Incomplete Information Tables and Rough Classification[J].Computational Intelligence,2001,17(3):545-566.
[3]LIU P,QIU T R,XIONG X X,et al.An Incomplete Data Filling Approach Based on a New Valued Tolerance Relation[J].Open Automation & Control Systems Journal,2014,6(1):1456-1462.
[4]JIN C M,E X,MU H J,et al.Data Filling Method Based on New Relationship Matrix[J].Computer Engineering,2011,37(19):28-31.(in Chinese)
金成美,鄂旭,穆海军,等.一种基于新型关系矩阵的数据填补方法[J].计算机工程,2011,37(19):28-31.
[5]WU K K,PAN W.Attribute significance based imputation method[J].Computer Engineering and Design,2016,37(3):725-730.(in Chinese)
吴康康,潘巍.基于属性重要度的数据补齐方法[J].计算机工程与设计,2016,37(3):725-730.
[6]KIRAN P M,RAO A P,RATNAMALA B.An Efficient Approach for Filling Incomplete Data[C]∥National Conference on Advances in Computer Science and Applications with International Journal of Computer Applications(NCACSA 2012).2012:23-27.
[7]YANG X P.Completing incomplete data based on maximum similarity in Rough sets[J].Computer Engineering and Applications,2012,48(36):164-166.(in Chinese)
杨小平.粗集中最大相似度的不完备数据补齐[J].计算机工程与应用,2012,48(36):164-166.
[8]WU S,FENG X D,SHAN Z G.Missing Data Imputation Approach Based on Incomplete Data Clustering[J].Chinese Journal of Computers,2012,35(8):1726-1738.(in Chinese)
武森,冯小东,单志广.基于不完备数据聚类的缺失数据填补方法[J].计算机学报,2012,35(8):1726-1738.
[9]YANG T,LUO J W,WANG Y,et al.Missing value estimation for gene expression data based on Mahalanobis distance[J].Computer Applications,2005,25(12):2868-2871.(in Chinese)
杨涛,骆嘉伟,王艳,等.基于马氏距离的缺失值填充算法[J].计算机应用,2005,25(12):2868-2871.
[10]KIM K Y,KIM B J,YI G S.Reuse of imputed data in microarray analysis increases imputation efficiency[J].Bmc Bioinformatics,2004,5(1):160.
[11]CHEN Z K,YANG Y D,ZHANG Q C,et al.Novel algorithm for filling incomplete data of internet of things based on attri-bute reduction[J].Computer Engineering and Design,2013,34(2):418-422.(in Chinese)
陈志奎,杨英达,张清辰,等.基于属性约简的物联网不完全数据填充算法[J].计算机工程与设计,2013,34(2):418-422.
[12]ZHANG H X.Missing data imputation:Information gain based on approach[J].Computer Engineering and Design,2006,27(24):4810-4812.(in Chinese)
张红霞.缺失值填充:基于信息增益的方法[J].计算机工程与设计,2006,27(24):4810-4812.
[13]QIN Z.Information Gain based Algorithm for Filling Missing Data[J].Microcomputer Information,2007,23(12):180-181.(in Chinese)
覃泽.基于信息增益的数据库缺失值填充算法[J].微计算机信息,2007,23(12):180-181.
[14]KRYSZKIEWICZ M.Rough Set Approach to Incomplete Information System[J].Information Sciences,1998,112(1-4):39-49.
[15]WANG G Y.Extension of Rough Set Under Incomplete Information systems[J].Journal of Computer Research and Development,2002,39(10):1238-1243.(in Chinese)
王国胤.Rough 集理论在不完备信息系统中的扩充[J].计算机研究与发展,2002,39(10):1238-1243.
[16]FU A,WANG G Y,HU J.Information entropy based attribute reduction algorithm in incomplete information systems[J].Journal of Chongqing University of Posts and Telecommunications(Natural Science Edition),2008,20(5):586-592.(in Chinese)
付昂,王国胤,胡军.基于信息熵的不完备信息系统属性约简算法[J].重庆邮电大学学报(自然科学版),2008,20(5):586-592.
[17]TAO Z,LIU Q Z,LI W M.Attribute reduction based on GA under incomplete information system[J].Systems Engineering and Electronics,2007,29(9):1484-1487.(in Chinese)
陶志,刘庆拯,李卫民.基于遗传算法的不完备信息系统属性约简方法[J].系统工程与电子技术,2007,29(9):1484-1487.
[18]KRYSZKIEWICZ M.Rules in incomplete information systems[J].Information Sciences,1999,113(3-4):271-292.
[19]XIE H,CHENG H Z,NIU D X.Discretization of Continuous Attributes in Rough Set Theory Based on Information Entropy[J].Chinese Journal of Computers,2005,28(9):1570-1574.(in Chinese)
谢宏,程浩忠,牛东晓.基于信息熵的粗糙集连续属性离散化算法[J].计算机学报,2005,28(9):1570-1574.
[20]蒋盛益,李霞,郑琪.数据挖掘原理与实践[M].北京:电子工业出版社,2011:48-58.
[21]FU M L,ZENG H L.Oprimization Selection and Rules Extraction in Inconsistent and Incomplete Information System[J].Computer Science,2007,34(10):208-211.(in Chinese)
伏明兰,曾黄麟.一种不一致不完备信息系统的最优选择及规则约简方法研究[J].计算机科学,2007,34(10):208-211.
[22]HE W,LIU C Y,ZHAO J,et al.An Algorithm of Attributes Reduction in Incomplete Information System[J].ComputerScien-ce,2004,31(2):117-119.(in Chinese)
何伟,刘春亚,赵军,等.不完备信息系统下的属性约简算法[J].计算机科学,2004,31(2):117-119.
[23]MENG Z Q,XU K,ZHOU S Q.Maximum distribution reduction and computation methods for incomplete inconsistent decision systems[J].Journal of Guangxi Normal University(Natural Science Edition),2011,29(3):89-93.(in Chinese)
蒙祖强,许珂,周石泉.不完备不一致决策系统的最大分布约简及计算方法[J].广西师范大学学报(自然科学版),2011,29(3):89-93.
[24]MENG Z Q,SHI Z Z.A fast approach to attribute reduction in incomplete decision systems with tolerance relation—based rough sets[J].Information Sciences,2009,179(16):2774-2793.
[25]MA F M,LIU T T,XU A P.Data completion with rough sets based on fuzzy weighted similarity measure [J].Computer Engineering and Applications,2016,52(9):62-66.(in Chinese)
马福民,刘涛涛,徐安平.基于模糊加权相似度量的粗糙集数据补齐方法[J].计算机工程与应用,2016,52(9):62-66.
[26]YANG C Q.The attribute reduction algorithms based on rough sets[J].Journal of Northwest University(Natural Science Edition),2012,42(2):223-225.(in Chinese)
杨常清.基于粗糙集的属性约简算法[J].西北大学学报(自然科学版),2012,42(2):223-225.
[27]YE D Y.An Improvement to Jelonek′s Attribute Reduction Algorithm[J].Acta Electronica Sinca,2000,28(12):81-82.(in Chinese)
叶东毅.Jelonek属性约简算法的一个改进[J].电子学报,2000,28(12):81-82.
[1] WANG Ming, WU Wen-fang, WANG Da-ling, FENG Shi, ZHANG Yi-fei. Generative Link Tree:A Counterfactual Explanation Generation Approach with High Data Fidelity [J]. Computer Science, 2022, 49(9): 33-40.
[2] ZHOU Xu, QIAN Sheng-sheng, LI Zhang-ming, FANG Quan, XU Chang-sheng. Dual Variational Multi-modal Attention Network for Incomplete Social Event Classification [J]. Computer Science, 2022, 49(9): 132-138.
[3] WANG Zi-yin, LI Lei-jun, MI Ju-sheng, LI Mei-zheng, XIE Bin. Attribute Reduction of Variable Precision Fuzzy Rough Set Based on Misclassification Cost [J]. Computer Science, 2022, 49(4): 161-167.
[4] XUE Zhan-ao, HOU Hao-dong, SUN Bing-xin, YAO Shou-qian. Label-based Approach for Dynamic Updating Approximations in Incomplete Fuzzy Probabilistic Rough Sets over Two Universes [J]. Computer Science, 2022, 49(3): 255-262.
[5] ZHENG Su-su, GUAN Dong-hai, YUAN Wei-wei. Heterogeneous Information Network Embedding with Incomplete Multi-view Fusion [J]. Computer Science, 2021, 48(9): 68-76.
[6] LI Shao-hui, ZHANG Guo-min, SONG Li-hua, WANG Xiu-lei. Incomplete Information Game Theoretic Analysis to Defend Fingerprinting [J]. Computer Science, 2021, 48(8): 291-299.
[7] LI Yan, FAN Bin, GUO Jie, LIN Zi-yuan, ZHAO Zhao. Attribute Reduction Method Based on k-prototypes Clustering and Rough Sets [J]. Computer Science, 2021, 48(6A): 342-348.
[8] JIANG Yan, MA Yu, LIANG Yuan-zhe, WANG Yuan, LI Guang-hao, MA Ding. Lung Tissue Segmentation Algorithm:Fractional Order Sparrow Search Optimization for OTSU [J]. Computer Science, 2021, 48(6A): 28-32.
[9] ZHAO Zhi-qiang, YI Xiu-shuang, LI Jie, WANG Xing-wei. Research on DoS Intrusion Detection Technology of IPv6 Network Based on GR-AD-KNN Algorithm [J]. Computer Science, 2021, 48(6A): 524-528.
[10] FU Kun, ZHAO Xiao-meng, FU Zi-tong, GAO Jin-hui, MA Hao-ran. Deep Network Representation Learning Method on Incomplete Information Networks [J]. Computer Science, 2021, 48(12): 212-218.
[11] ZENG Hui-kun, MI Ju-sheng, LI Zhong-ling. Dynamic Updating Method of Concepts and Reduction in Formal Context [J]. Computer Science, 2021, 48(1): 131-135.
[12] XUE Zhan-ao, ZHANG Min, ZHAO Li-ping, LI Yong-xiang. Variable Three-way Decision Model of Multi-granulation Decision Rough Sets Under Set-pair Dominance Relation [J]. Computer Science, 2021, 48(1): 157-166.
[13] ZHANG Yu-shuai, ZHAO Huan, LI Bo. Semantic Slot Filling Based on BERT and BiLSTM [J]. Computer Science, 2021, 48(1): 247-252.
[14] SANG Bin-bin, YANG Liu-zhong, CHEN Hong-mei, WANG Sheng-wu. Incremental Attribute Reduction Algorithm in Dominance-based Rough Set [J]. Computer Science, 2020, 47(8): 137-143.
[15] YUE Xiao-wei, PENG Sha and QIN Ke-yun. Attribute Reduction Methods of Formal Context Based on ObJect (Attribute) Oriented Concept Lattice [J]. Computer Science, 2020, 47(6A): 436-439.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!