计算机科学 ›› 2022, Vol. 49 ›› Issue (11): 90-97.doi: 10.11896/jsjkx.211100065

• 数据库&大数据&数据科学 • 上一篇    下一篇

基于关系数据库的时态RDF建模

韩啸, 章哲庆, 严丽   

  1. 南京航空航天大学计算机科学与技术学院 南京 211106
  • 收稿日期:2021-11-05 修回日期:2022-02-22 出版日期:2022-11-15 发布日期:2022-11-03
  • 通讯作者: 严丽(yanli@nuaa.edu.cn)
  • 作者简介:(han_xiao1996@163.com)
  • 基金资助:
    江苏省基础研究计划(BK20191274)

Temporal RDF Modeling Based on Relational Database

HAN Xiao, ZHANG Zhe-qing, YAN Li   

  1. College of Computer Science and Technology,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,China
  • Received:2021-11-05 Revised:2022-02-22 Online:2022-11-15 Published:2022-11-03
  • About author:HAN Xiao,born in 1996,postgraduate.His main research interests include semantic web and temporal RDF data.
    YAN Li,born in 1964,Ph.D,professor,Ph.D supervisor,is a member of China Computer Federation.Her main research interests include big data,know-ledge graph,spatiotemporal information processing and NoSQL database.
  • Supported by:
    Basic Research Program of Jiangsu Province,China(BK20191274).

摘要: 随着时态数据的不断增加,时态知识图谱的概念得到了普及,如何高效地表示时态知识图谱已成为一个重要的研究方向。RDF(Resource Description Framework)虽然在传统知识图谱建模中被广泛运用,但其只能表示静态语义,缺乏表示时态知识图谱的能力,因此已有几种针对时态知识图谱的时态RDF模型被提出。但这些模型都只是将时态信息简单地附加在谓语或整个三元组上,缺少对时态信息所属对象的准确定位。为了更好地表示时态知识图谱,文中提出了一个新的时态RDF表示模型-tRDF。该模型首先根据宾语的不同类型,选择性地将时态信息附加在宾语或谓语上;其次,结合时态数据库的概念,给出了一种基于关系数据库PostgreSQL的tRDF数据存储方法;最后,从数据存储的时间和空间两个方面对所提出的tRDF数据存储方法进行了验证。实验结果表明,所提方案能有效地表示时态知识图谱。

关键词: RDF, 时态扩展, 时态RDF, 时态知识图谱, 时态数据库

Abstract: With the increase of temporal data,the concept of temporal knowledge graph is popularized,and how to represent temporal knowledge graph efficiently has become an important research direction.Although resource description framework(RDF) is widely used in traditional knowledge graph modeling,it can only represent static semantics and lacks the ability to represent temporal knowledge graph.Therefore,several temporal RDF models have been proposed for temporal knowledge graph,but all these models simply attach temporal information to the predicate of RDF or the whole triple,and lack the accurate positioning of the object to which the temporal information belongs.In order to better represent temporal knowledge graph,firstly,this paper proposes a new temporal RDF representation model called tRDF,which attaches temporal information to the object or predicate according to the type of object.Secondly,by combining the concept of temporal database,this paper presents a tRDF data storage method based on the relational database,PostgreSQL.Finally,the proposed tRDF data storage method is verified from two aspects,the time of storing and the size of space.Experimental results show that the proposed scheme can effectively represent temporal knowledge graph.

Key words: RDF, Temporal expansion, Temporal RDF, Temporal knowledge graph, Temporal database

中图分类号: 

  • TP399
[1]JUPP S,MALONE J,BOLLEMAN J,et al.The EBI RDF platform:linked open data for the life sciences[J].Bioinformatics,2014,30(9):1338-1339.
[2]RANZINGER R,AOKI-KINOSHITA K F,CAMPBELL M P,et al.GlycoRDF:an ontology to standardize glycomics data in RDF[J].Bioinformatics,2015,31(6):919-925.
[3]REESE J T,UNNI D R,CALLAHAN T J,et al.KG-COVID-19:a framework to produce customized knowledge graphs for COVID-19 response[J].ScienceDirect,2020,2(1):100155.
[4]STADLER C,LEHMANN J,HöFFNER K,et al.Linkedgeodata:A core for a web of spatial open data[J].Semantic Web,2012,3(4):333-354.
[5]NEUMANN T,WEIKUM G.RDF-3X:a RISC-style engine for RDF[J].Proceedings of the VLDB Endowment,2008,1(1):647-659.
[6]HARRIS S,LAMB N,SHADBOLT N.4store:The design and implementation of a clustered RDF store[C]//Proceedings of the 5th International Workshop on Scalable Semantic Web Knowledge Base Systems.Washington DC:CEUR,2009:94-109.
[7]SALAS P E,MARX E,MERA A,et al.RDB2RDF plugin:relational databases to RDF plugin for eclipse[C]//Proceedings of the 1st Workshop on Developing Tools as Plug-ins.South Paci-fic:ACM,2011:28-31.
[8]BORNEA M A,DOLBY J,KEMENTSIETSI-DIS A,et al.Building an efficient RDF store over a relational database[C]//Proceedings of the 2013 ACM SIGMOD International Confe-rence on Management of Data.New York:ACM,2013:121-132.
[9]OEZSU M T.A survey of RDF data management systems[J].Frontiers of Computer Science,2016,10(3):418-432.
[10]LU J W,YAN L.Mapping Method from Object-relational Database to RDF(S)[J].Chinese Computer Science,2021,48(10):145-151.
[11]MA Z M,CAPRETZ M,YAN L.Storing massive Resource Description Framework(RDF) data:A survey[J].The Knowledge Engineering Review,2016,31(4):391-413.
[12]SUN J L,JIN Q.Scalable RDF store based on HBase and Map-Reduce[C]//Proceedings of the 3rd International Conference on Advanced Computer Theory and Engineering.Chengdu:IEEE,2010:633-636.
[13]SHAO B,WANG H X,LI Y T.Trinity:A Distributed GraphEngine on a Memory Cloud[C]//Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data.New York:ACM,2013:505-516.
[14]HOFFART J,SUCHANEK F M,BERBERIC K,et al.YAGO2:A Spatially and Temporally Enhanced Knowledge Base from Wikipedia[J].Artificial Intelligence,2013,194(JAN.):28-61.
[15]GOAL R,KAZEMI S,BRUBAKER M,et al.Diachronic Embedding for Temporal Knowledge Graph Completion[C]//Proceedings of the AAAI Conference on Artificial Intelligence.New York:AAAI,2020:3988-3995.
[16]WANG J Y,DI X F,LIU J M,et al.A Constraint Framework for Uncertain Spatiotemporal Data in RDF Graphs[C]//Proceedings of the 15th International Conference on Natural Computation,Fuzzy Systems and Knowledge Discovery.Kunming:Springer,2019:727-375.
[17]BAI L Y,WANG J Y,DI X F,et al.Fixing the inconsistencies in fuzzy spatiotemporal RDF graph[J].Information Sciences,2021,578(2021):166-180.
[18]CLAUDIO G,HURTADO C A,VAISMAN A A.TemporalRDF[C]//Proceedings of the Second European Conference on The Semantic Web:Research and Applications.Berlin:Sprin-ger,2005:93-107.
[19]CLAUDIO G,HURTADO C A,VAISMAN A A,et al.Introducing time into RDF[J].IEEE Transactions on Knowledge and Data Engineering,2007,19(2):207-218.
[20]PUGLIESE A,UDREA O,SUBREHMAN-IAN V S.ScalingRDF with time[C]//Proceedings of the 17th International Conference on World Wide Web.New York:ACM,2008:605-614.
[21]KOUBARAKIS M,KYZIRAKOS K.Modeling and QueryingMetadata in the Semantic Sensor Web:The Model stRDF and the Query Language stSPARQL[C]//Proceedings of the Semantic Web:Research and Applications,7th Extended Semantic Web Conference.Berlin:Springer,2010:425-439.
[22]ZHANG F,WANG K,LI Z,et al.Temporal Data Representation and Querying Based on RDF[J].IEEE Access,2019,7:85000-85023.
[23]CHEN Y Y,YAN L,ZHANG Z Q,et al.Temporal RDF Model and Index Method Based on Neighborhood Structure[J].Chinese Computer Science,2021,48(10):167-176.
[24]BRANDT S,ELEM G K,RYZHIKOV V,et al.A Framework for Temporal Ontology-Based Data Access:A Proposal[C]//Proceedings of the European Conference on Advances in Databases and Information Systems.Nicosia:Springer,2017:161-173.
[25]ELEM G K,XIAO G,RYZHIKOV V,et al.Ontop-temporal:A Tool for Ontology-based Query Answering over Temporal Data[C]//Proceedings of the 27th ACM International Conference.Indiana:ACM,2018:1927-1930.
[26]YAN L,ZHAO P,MA Z M.Indexing temporal RDF graph[J].Computing,2019,101(10):1457-1488.
[27]ZHAO P,YAN L.A methodology for indexing temporal RDF data[J].Journal of Information Science and Engineering,2019,35(4):923-934.
[28]FAN T Y,YAN L,MA Z M.Mapping fuzzy RDF(S) into fuzzy object-oriented databases[J].International Journal of Intelligent Systems,2019,34(10):2607-2632.
[29]FAN T Y,YAN L,MA Z M.Storing and querying fuzzy RDF(S) in HBase databases[J].International Journal of Intelligent Systems,2020,35(4):751-780.
[30]O’CONNOR M J,DAS A.A Lightweight Model for Representing and Reasoning with Temporal Information in Biomedical Ontologies [C]//Proceedings of the 3rd International Confe-rence on Health Informatics.Barcelona:DBLP,2010:90-97.
[31]KULKARNI K,MICHELS J.Temporal features in SQL:2011[J].ACM SIGMOD Record,2012,41(3):34-43.
[32]GAO Q,LEE M L,DOBBIE G,et al.A Semantic Framework for Designing Temporal SQL Databases[C]//Proceedings of the 37th International Conference on Conceptual Modeling.Xi’an:Springer,2018:382-396.
[33]LU W,ZHAO Z H,WANG X Y.A lightweight and efficient temporal database management system in TDSQL[C]//Proceedings of the 45th International Conference on Very Large Data Bases.Los Angeles:VLDB,2019:2035-2046.
[34]ANSELMA L,PIOVESAN L,TERENZIANI P.Dealing with temporal indeterminacy in relational databases:An AI metho-dology[J].AI Communications,2019,32(3):1-15.
[35]Al-FEDAGHI S.Conceptual Temporal Modeling Applied to Databases[J].International Journal of Advanced Computer Science and Applications,2021,12(1):524-534.
[36]RDF 1.1 Primer[EB/OL].(2014-02-25) [2021-11-01].http://www.w3.org/TR/2014/NOTE-rdf11-primer-20140225/.
[1] 王引娣, 章哲庆, 严丽.
基于双时态RDF模型的索引方法
Indexing Bi-temporal RDF Model
计算机科学, 2021, 48(4): 63-69. https://doi.org/10.11896/jsjkx.200600084
[2] 鲁佳文, 严丽.
对象关系数据库到RDF(S)的映射方法
Mapping Method from Object-relational Database to RDF(S)
计算机科学, 2021, 48(10): 145-151. https://doi.org/10.11896/jsjkx.200800006
[3] 陈圆圆, 严丽, 章哲庆, 马宗民.
基于邻域结构的时态RDF模型及索引方法
Temporal RDF Model and Index Method Based on Neighborhood Structure
计算机科学, 2021, 48(10): 167-176. https://doi.org/10.11896/jsjkx.200900114
[4] 卢海川, 符海东, 刘宇.
基于CAN的地理语义数据存储与检索机制
Geo-semantic Data Storage and Retrieval Mechanism Based on CAN
计算机科学, 2019, 46(2): 171-177. https://doi.org/10.11896/j.issn.1002-137X.2019.02.027
[5] 宫法明,李翛然.
基于Neo4j的海量石油领域本体数据存储研究
Research on Ontology Data Storage of Massive Oil Field Based on Neo4j
计算机科学, 2018, 45(6A): 549-554.
[6] 郑翠春,汪璟玢.
RDF数据分布式并行语义编码算法
Distributed Parallel Semantic Coding Algorithm for RDF Data
计算机科学, 2016, 43(9): 197-202. https://doi.org/10.11896/j.issn.1002-137X.2016.09.039
[7] 袁柳,张龙波.
模式级链接关联数据集上的关联规则挖掘研究
Association Rules Mining on Schema-level Interconnected Associated Data
计算机科学, 2016, 43(9): 91-98. https://doi.org/10.11896/j.issn.1002-137X.2016.09.017
[8] 董书暕,汪璟玢,陈远.
HMSST+:基于分布式内存数据库的HMSST算法优化
HMSST+:HMSST Algorithm Optimization Based on Distributed Memory Database
计算机科学, 2016, 43(3): 220-224. https://doi.org/10.11896/j.issn.1002-137X.2016.03.040
[9] 郑志蕴,王振涛,张行进,王振飞.
基于二分图的RDF关键词扩展查询方法
Keyword Expansion Query Approach over RDF Data Based on Bipartite Graph
计算机科学, 2016, 43(11): 272-279. https://doi.org/10.11896/j.issn.1002-137X.2016.11.053
[10] 郑志蕴 刘 博 李 伦 王振飞.
基于关键词的RDF数据图查询模型研究
Research of Keyword Search Model over RDF Data Graph
计算机科学, 2015, 42(7): 234-239. https://doi.org/10.11896/j.issn.1002-137X.2015.07.050
[11] 柯叶青,马志柔,伍海江,刘 杰.
一种简历语义搜索系统的实现方法
SmartHR:A Resume Query and Management System Based on Semantic Web
计算机科学, 2015, 42(12): 56-59.
[12] 袁柳,张龙波.
一种基于聚类模式的RDF数据聚类方法
Cluster Pattern Based RDF Data Clustering Method
计算机科学, 2015, 42(10): 266-270.
[13] 董书暕,汪璟玢.
HMSST:一种高效的SPARQL查询优化算法
HMSST:An Efficient Algorithm for SPARQL Query
计算机科学, 2014, 41(Z11): 323-326.
[14] 汪璟玢,方知立,张燕琴.
面向分布式的SPARQL查询优化算法
Distributed Optimized Query Algorithm Based on SPARQL
计算机科学, 2014, 41(7): 227-231. https://doi.org/10.11896/j.issn.1002-137X.2014.07.047
[15] 汪璟玢,方知立.
基于索引的分布式RDF查询优化算法
Distributed Optimized Query Algorithm Based on Index
计算机科学, 2014, 41(11): 233-238. https://doi.org/10.11896/j.issn.1002-137X.2014.11.045
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!