计算机科学 ›› 2010, Vol. 37 ›› Issue (5): 26-29.

• 综述 • 上一篇    下一篇

领域无关数据清洗研究综述

曹建军,刁兴春,汪挺,王芳潇   

  1. (总参第63研究所 南京210007)
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受江苏省博士后科研资助计划(0907074B)和国家自然科学基金(50705097)资助。

Research on Domain-independent Data Cleaning: A Survey

CAO Jian-jun,DIAO Xing-chun,WANG Ting,WANG Fang-xiao   

  • Online:2018-12-01 Published:2018-12-01

摘要: 对领域无关数据清洗的研究进行了综述。首先阐明了全面数据质量管理、数据集成和数据清洗之间的关系,着重说明了领域无关数据清洗的特点。将领域无关数据清洗方法分为基于特征相似度的方法、基于上下文的方法和基于关系的方法分别介绍。最后对领域无关数据清洗的研究方向进行了展望。

关键词: 数据质量,数据清洗,数据集成,领域无关数据清洗

Abstract: Research on domain-independent data cleaning was surveyed. First, relationships among total data quality management, data integration and data cleaning were clarified, and characteristics of domain-independent data cleaning were emphasized. hhen, domain-independent data cleaning was classified as fcaturcbased similarity methods, context based methods and relationship-based methods. They were introduced respectively. At last, the future research direclions of domain-independent data cleaning were discussed.

Key words: Data quality, Data cleaning, Data integration, Domain-independent data cleaning

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!