计算机科学 ›› 2010, Vol. 37 ›› Issue (3): 178-181.

• 软件工程与数据库技术 • 上一篇    下一篇

基于语义的通用数据抽取方法

张建英,孙永洁,王秀坤   

  1. (大连理工大学计算机科学与工程系 大连116024)
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受国家自然科学基金(60873054)资助。

Generic&Semantic-based Data Extraction Approach

ZHANG Jian-ying,SUN Yong-jie,WANG Xiu-kun   

  • Online:2018-12-01 Published:2018-12-01

摘要: 关系数据库可以看作是元组以及外键关系构成的有向图。为便于数据复制以及共享,在进行数据抽取时,往往既要使语义上相关的数据一起抽取,又要使得抽取的数据尽量逻辑上独立。将多根树作为语义上相关、逻辑独立的数据集,给出了关系数据抽取方法并进行了实现。在Oracle中,使用TPC-C数据库结构对该方法进行了测试与分析,从而验证了算法的有效性和通用性。

关键词: 多根树,语义相关,数据抽取

Abstract: A relational database can be viewed as a directed graph constructed by tuples and foreign key references. To facilitate data replication and sharing,semanticrclativity and logically independence should be satisfied when relational data is extracted. Multi-tree structures are employed as clusters of such data extracted from a relational database in this paper. Then the corresponding data extraction approach was proposed and implemented. We evaluated the extraction algorithm on a TPC-C database in Oracle, demonstrating the effectiveness and generalization of the approach.

Key words: Multi tree, Semantic-related, Data extraction

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!