Computer Science ›› 2014, Vol. 41 ›› Issue (2): 245-248.

Previous Articles     Next Articles

Rough Set Approach to Data Completion Based on Relative Decision Entropy and Weighted Similarity

WANG Sha-sha,JIANG Feng and WANG Wen-peng   

  • Online:2018-11-14 Published:2018-11-14

Abstract: The current data completion methods based on rough sets do not consider the differences between different condition attributes when calculating the similarities between any two objects.To solve this problem,this paper introduced a new notion of weighted similarity,and proposed a rough set data completion algorithm called RDNAWS based on relative decision entropy and weighted similarity.RDNAWS algorithm adopts the concept of relative decision entropy to measure the significance of each condition attribute.Through calculating the significance of each condition attribute and the dependence of the set of decision attributes on it,RDNAWS provides a weight for each condition attribute,which can efficiently distinguish various condition attributes.The experimental results on real data sets demonstrate that our algorithm can obtain better classification performance than the current algorithms.

Key words: Incomplete data,Rough set,Data completion,Relative decision entropy,Weighted similarity

[1] 王国胤.Rough集理论与知识获取[M].西安:西安交通大学出版社,2001
[2] 江峰,王春平,曾惠芬.基于相对决策熵的决策树算法及其在入侵检测中的应用[J].计算机科学,2012,39(4):223-226
[3] 焦娜,苗夺谦,张红云.多决策表缺失属性补齐算法的研究[J].计算机科学,2009,36(1):142-145
[4] 潘巍,王阳生,杨宏戟.粗糙集理论中新的针对不完备信息系统的处理方法研究[J].计算机科学,2007,134(16):158-161
[5] Pawlak Z.Rough Sets[J].International Journal of Computerand Information Sciences,1982,11:341-356
[6] 孟军,刘永超,莫海波.基于粗糙集理论的不完备数据填补方法[J].计算机工程与应用,2008,44(6):175-177
[7] 王国胤.Rough集理论在不完备信息系统中的扩充[J].计算机研究与发展,2002,39(10):1238-1243
[8] Kryszkiewicz M.Rough set approach to incomplete information system [J].Information Sciences,1998,112(14):39-49
[9] 徐章艳,刘作鹏,杨炳儒,等.一个复杂度为max(O(|C||U|),O(|C|2|U/C|)) 的快速属性约简算法[J].计算机学报,2006,29(3):391-399
[10] Bay S D.The UCI KDD repository.http://kdd.ics.uci.edu,1999
[11] hrn A.Rosetta Technical Reference Manual[R].http://www.idi.ntnu.no/aleks/rosetta,1999
[12] Hall M,Frank E,Pfahringer H G,et al.The WEKA data mining software:an update[J].SIGKDD Explor.News.,2009,11(1):10-18
[13] 赵洪波,江峰,曾惠芬,等.一种基于加权相似性的粗糙集数据补齐方法[J].计算机科学,2011,38(11):167-170
[14] 田树新,吴晓平,王红霞.一种基于改进的ROUSTIDA算法的数据补齐方法[J].海军工程大学学报,2011,23(5):11-15
[15] 李萍,吴祈宗.基于概率相似度的不完备信息系统数据补齐算法[J].计算机应用研究,2009,26(3):881-883

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!