Computer Science ›› 2017, Vol. 44 ›› Issue (9): 222-226. doi: 10.11896/j.issn.1002-137X.2017.09.041

• Artificial Intelligence •

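Algorithm of Continuous Attribute Discretization Based on Binary Ant Colony and Rough Sets

CAO Feng, TANG Chao, ZHANG Jing

  1. School of Computer and Information Technology, Shanxi University, Taiyuan 030006; Department of Computer Science and Technology, Hefei University, Hefei 230601; Department of Mathematics, Taiyuan University, Taiyuan 030006
  • Online: 2018-11-13 Published: 2018-11-13
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (41401521, 61403238, 61502288), the Shanxi Province Youth Science and Technology Research Fund (2015021101), the Open Project Fund of the Shanxi Key Laboratory of Intelligent Information Processing (2004001, 2016001), the Natural Science Research Project of Anhui Universities (KJ2015A206), and the Talent Research Fund of Hefei University (15RC07).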

Abstract: Discretization is an important data preprocessing step that is widely used in rule extraction, knowledge discovery, and classification. This paper proposes a continuous attribute discretization algorithm that combines a binary ant colony with rough sets. The algorithm builds a binary ant colony network over the space of candidate cut points generated from multidimensional continuous attributes, and uses the rough set accuracy of approximation classification as the fitness function of the ant colony algorithm to search for a globally optimal set of discretization cut points. The algorithm is validated on seven UCI data sets, and the experimental results indicate that it achieves relatively good discretization performance.

Key words: Discretization, Binary ant colony algorithm, Rough sets
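
The abstract describes the approach only at a high level, so the following Python sketch is an illustration under stated assumptions, not the authors' implementation. It assumes that candidate cut points are midpoints between consecutive distinct attribute values, that each candidate cut is one bit with two pheromone values (skip or select), and that the fitness is the rough set accuracy of approximation classification minus a small per-cut penalty. All function names, the penalty term, and the parameters `ants`, `iterations`, and `rho` are hypothetical.

```python
import random
from bisect import bisect_right
from collections import defaultdict

def candidate_cuts(column):
    """Assumed candidate set: midpoints between consecutive distinct values."""
    values = sorted(set(column))
    return [(a + b) / 2 for a, b in zip(values, values[1:])]

def discretize(rows, cuts_per_attr):
    """Replace each value by the index of the interval it falls into."""
    return [tuple(bisect_right(cuts, v) for v, cuts in zip(row, cuts_per_attr))
            for row in rows]

def approximation_accuracy(coded_rows, labels):
    """Sum of lower-approximation sizes over sum of upper-approximation sizes."""
    seen_labels = defaultdict(set)
    sizes = defaultdict(int)
    for code, label in zip(coded_rows, labels):
        seen_labels[code].add(label)
        sizes[code] += 1
    # A label-pure equivalence class lies in exactly one lower approximation.
    lower = sum(n for code, n in sizes.items() if len(seen_labels[code]) == 1)
    # Every class counts once in the upper approximation of each label it meets.
    upper = sum(n * len(seen_labels[code]) for code, n in sizes.items())
    return lower / upper

def fitness(bits, all_cuts, rows, labels, penalty=0.01):
    """Assumed fitness: accuracy minus a small cost per selected cut point."""
    it = iter(bits)
    chosen = [[c for c in cuts if next(it)] for cuts in all_cuts]
    accuracy = approximation_accuracy(discretize(rows, chosen), labels)
    return accuracy - penalty * sum(bits), chosen

def binary_ant_colony(rows, labels, ants=20, iterations=50, rho=0.1, seed=0):
    rng = random.Random(seed)
    all_cuts = [candidate_cuts(col) for col in zip(*rows)]
    n_bits = sum(len(cuts) for cuts in all_cuts)
    tau = [[1.0, 1.0] for _ in range(n_bits)]  # pheromone for bit = 0 and bit = 1
    best_bits = [1] * n_bits                   # start with every candidate cut selected
    best_score, best_cuts = fitness(best_bits, all_cuts, rows, labels)
    for _ in range(iterations):
        for _ in range(ants):
            # Each ant walks the binary network, choosing 0 or 1 at every bit.
            bits = [1 if rng.random() < t[1] / (t[0] + t[1]) else 0 for t in tau]
            score, cuts = fitness(bits, all_cuts, rows, labels)
            if score > best_score:
                best_bits, best_score, best_cuts = bits, score, cuts
        # Evaporate all pheromone, then reinforce the best-so-far path.
        for i, b in enumerate(best_bits):
            tau[i][0] *= 1 - rho
            tau[i][1] *= 1 - rho
            tau[i][b] += rho
    return best_cuts, best_score
```

Calling `binary_ant_colony(rows, labels)` with `rows` as a list of numeric tuples and `labels` as the matching decision values returns the selected cut points per attribute together with the fitness score. Because the search starts from the full candidate set and only accepts improvements, the returned cut set never scores lower than keeping every candidate cut.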

