计算机科学 ›› 2015, Vol. 42 ›› Issue (5): 230-233.doi: 10.11896/j.issn.1002-137X.2015.05.046
洪 沙,林佳丽,张月良
HONG Sha, LIN Jia-li and ZHANG Yue-liang
摘要: 针对不确定数据集进行离群点检测,设计了基于密度的不确定数据的局部离群因子(Uncertain Local Outlier Factor,ULOF)算法。通过建立不确定数据的可能世界模型来确定不确定对象在可能世界中的概率。结合传统的LOF算法推导出ULOF算法,根据ULOF值判断不确定对象的局部离群程度;然后对ULOF算法的效率性和准确性进行了详细分析,提出了基于网格的剪枝策略、k最近邻查询优化来减少数据的候选集;最后通过实验证明了ULOF算法对不确定数据检测的可行性和效率性,优化后的方法有效地提高了异常检测准确率,降低了时间复杂度,改善了不确定数据的异常检测性能。
[1] Garces H,Sbarbaro D.Outliers Detection in Environmental Monitoring Databases [J].Engineering Applications of Artificial Intelligence,2011,24(2):341-349 [2] Jampani R,Xu F,Wu M.A Monte Carlo Approach to Managing Uncertain Data [C]∥Proc.SIGMOD,2008:687-700 [3] Widom J.Trio:A System for Integrated Management of Data,Accuracy,and Lineage [C]∥Proc.of the Second Biennial Conference on Innovative Data Systems Research.Asilomar,2005:262-276 [4] Li F F,Yi K,Jestes J.Ranking Distributed Probabilistic Data[C]∥Proc.SIGMOD Conference.ACM New York,NY,USA 2009:361-374 [5] 张晓峰,王丽珍,陆叶.一种基于属性加权的不确定K-means聚类算法[J].计算机研究与发展,2009,46(10):504-508 [6] Tsang S,Kao B,Yip K Y.Decision Trees for Uncertain Data[C]∥The 25th International Conference on Data Engineering New Jersey :IEEE Press,2009:441-444 [7] Kriegel H P,Pfeifle M.Density-based Clustering of UncertainData[C]∥ACM Knowledge Discovery and Data Mining.ACM Press,2005:672-677 [8] Aggarwal C C.Managing and Mining Uncertain Data[J].Advances in Database Systems,2009(35):75-89 [9] Ngai W K,Kao B,Chui C K,et al.Efficient Clustering of Uncertain Data[C]∥ICDM,IEEE Computer Society,2006:436-445 [10] Qin B,Xia Y,Li F.A Bayesian Classifier for Uncertain Data[C]∥SAC,ACM,2010:1010-1014 [11] 于浩,王斌,肖刚,等.基于距离的不确定离群点检测[C]∥NDBC2009(第26届中国数据库学术会议论文集(A集,2009))2009:15-18,143-150 [12] Charu C,Aggarwal,Philip S Y.Outlier Detection with Uncertain Data [R].IBM T.J,Watson Research Center.2008 [13] Wang B,Xiao G,Yu H,et al.Distance-based Outlier Detectionon Uncertain Data[C]∥CIT (1).IEEE Computer Society,2009:293-298 [14] Liu B,Yin J,Xiao Y,et al.Exploiting Local Data Uncertainty to Boost Global Outlier Detection[C]∥ICDM.IEEE Computer Society,2010:304-313 [15] Jiang B,Pei J.Outlier Detection on Uncertain Data:Objects,Instances,and Inferences[C]∥ICDE.IEEE Computer Society,2011:422-433 [16] Liu Jing,Deng Hui-fang.Outlier Detection on Uncertain Data Based on Local Information [J].Knowledge-based System,2013,7(51):60-71 [17] 李健,阎保平,李俊.基于记忆效应的局部异常检测算法[J].计算机工程,2008,4(12):4-6 |
No related articles found! |
|