计算机科学 (Computer Science) ›› 2018, Vol. 45 ›› Issue (4): 169-172. doi: 10.11896/j.issn.1002-137X.2018.04.028
LIU Qin (刘琴)
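Abstract: To uncover leads and improve data quality, this paper proposes a constraint-based data repair algorithm for the field of computer forensics. First, equivalence classes are used to initialize the data with respect to each kind of constraint. Next, the problematic data found during initialization is corrected, with the repair value chosen according to the type of constraint involved. Finally, the set of problem cells is regenerated from the repaired cells according to the functional dependency set and the other constraint sets; if problem cells remain, repair continues until none are left. Experimental data demonstrate the effectiveness and efficiency of the proposed method.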
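The detect, repair, and re-detect loop described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: it handles functional dependencies only, and the names `find_problem_cells` and `repair`, the majority-vote repair rule, and the `max_rounds` cap are all assumptions made for illustration, since the abstract does not say how repair values are chosen.

```python
from collections import Counter, defaultdict


def find_problem_cells(rows, fds):
    """Return {(row_index, rhs_attr): suggested_value} for cells violating an FD.

    rows: list of dicts; fds: list of (lhs_attrs_tuple, rhs_attr) pairs.
    """
    problems = {}
    for lhs, rhs in fds:
        # Equivalence classes: rows that agree on every left-hand-side attribute.
        classes = defaultdict(list)
        for i, row in enumerate(rows):
            classes[tuple(row[a] for a in lhs)].append(i)
        for members in classes.values():
            counts = Counter(rows[i][rhs] for i in members)
            if len(counts) > 1:
                # Assumed repair rule (not from the paper): majority value in the class.
                target = counts.most_common(1)[0][0]
                for i in members:
                    if rows[i][rhs] != target:
                        problems[(i, rhs)] = target
    return problems


def repair(rows, fds, max_rounds=10):
    """Repair until no problem cells remain, or until max_rounds (an assumed cap)."""
    rows = [dict(r) for r in rows]  # work on a copy, leave the input untouched
    for _ in range(max_rounds):
        problems = find_problem_cells(rows, fds)
        if not problems:
            break
        for (i, attr), value in problems.items():
            rows[i][attr] = value
    return rows


rows = [
    {"zip": "10001", "city": "New York", "state": "NY"},
    {"zip": "10001", "city": "New York", "state": "NY"},
    {"zip": "10001", "city": "Newark", "state": "NJ"},
    {"zip": "60601", "city": "Chicago", "state": "IL"},
]
fds = [(("zip",), "city"), (("city",), "state")]
repaired = repair(rows, fds)
```

In this made-up data, fixing the city in the third row creates a new violation of `city -> state`, which only the second detection pass finds. That is the reason for regenerating the problem-cell set after each round of repairs.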