计算机科学 ›› 2017, Vol. 44 ›› Issue (2): 112-116.doi: 10.11896/j.issn.1002-137X.2017.02.016

• 2016 第十三届全国Web 信息系统及其应用学术会议 • 上一篇    下一篇

异构信息空间中时间感知的实体集成框架

杨丹,陈默,申德荣   

  1. 辽宁科技大学软件学院 鞍山114051,东北大学计算机科学与工程学院 沈阳110004,东北大学计算机科学与工程学院 沈阳110004
  • 出版日期:2018-11-13 发布日期:2018-11-13
  • 基金资助:
    本文受国家自然科学基金项目(61402213,61402093),教育部中央高校基本科研业务费(N141604001),辽宁省自然科学基金(2015020018)资助

Time-aware Entity Integration Framework in Heterogeneous Information Spaces

YANG Dan, CHEN Mo and SHEN De-rong   

  • Online:2018-11-13 Published:2018-11-13

摘要: 异构信息空间中的实体和关联关系普遍具有时间信息、多种时间版本的实体数据共存,而传统的实体集成忽略了时间信息,不支持时间维度上的集成。提出一种异构信息空间中时间感知的实体集成框架T-EI,从大量异构实体数据中聚集事实形成干净的、完整的、具有时间信息的实体概貌,进而支持时间感知的实体搜索。T-EI利用实体及关联关系所具有的时间信息提出时间感知的实体识别算法,并通过考虑数据时效性提出时间感知的数据融合算法。在真实数据集上的实验结果表明了T-EI的可行性和有效性。

关键词: 异构信息空间,时间感知,实体集成,实体概貌

Abstract: In heterogeneous information spaces,entities and associations generally have time information,entities of multiple time versions coexist.While traditional entity integration (EI) ignores time information,does not support the integration on the time dimension.In this paper,a time-aware EI framework in heterogeneous information spaces T-EI was proposed,which can aggregate large collections of heterogeneous entities into a set of clean and complete entity profiles with time information to support time-aware entity search.T-EI adopts time-aware entity resolution algorithm leveraging time information of entities and associations,and adopts time-aware data fusion algorithm considering data currency.Experimental results on the real data sets demonstrate the feasibility and effectiveness of T-EI.

Key words: Heterogeneous information spaces,Time-aware,Entity integration,Entity profile

[1] lOANNOU E,NIEDEREE C,WOLFGAN N.Probabilistic entitylinkage for heterogeneous information spaces[C]∥Proc of CAISE.2008.
[2] LORENZO,HACID,PAIK,et al.Data integration in mashups[J].SIGMOD Record,2009,38(1):59-66.
[3] EENDRULLIS S,THOR A,RAHM E.WETSUIT:an efficient mashup tool for searching and fusing web entities[J].PVLDB,2012,5(12):1970-1973.
[4] THOR A,RAHM E.CloudFuice:A flexible cloud-based data integration System[C]∥Proc of 10th Intl.Conference on Web Engineering (ICWE).2011.
[5] HERNADEZ M,KOUTRIKA G,KRISHNAMURTHY R.HIL:a high-level scripting language for entity integration[C]∥Proc of EDBT.2013.
[6] LI P,DONG X.Linking temporal records[C]∥Proc.of VLDB.2011.
[7] CHIANG Y H ,DOAN A H,NAUGHTON J F.Modeling Entity evolution for temporal record matching[C]∥Proc.of SIGMOD.2014:1175-1186.
[8] CHIANG Y H ,DOAN A H,NAUGHTON J F.Tracking entities in the dynamic world:a fast algorithm for matching temporal records[J].PVLDB,2014,7(6):469-480.
[9] LI F R,LI M,HSU W,et al.Linking temporal records for profiling entities[C]∥Proc.of SIGMOD.2015:593-605.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!