计算机科学 ›› 2018, Vol. 45 ›› Issue (6A): 549-554.

• 综合、交叉与应用 • 上一篇    下一篇

基于Neo4j的海量石油领域本体数据存储研究

宫法明,李翛然   

  1. 中国石油大学华东计算机与通信工程学院 山东 青岛266580
  • 出版日期:2018-06-20 发布日期:2018-08-03
  • 作者简介:宫法明(1970-),硕士,副教授,主要研究方向为自然语言处理、大数据、深度学习、计算机视觉,E-mail:gfaming@163.com;李翛然(1993-),男,硕士,主要研究方向为自然语言处理,E-mail:lixiaoran93@163.com(通信作者)。
  • 基金资助:
    科技部创新方法工作专项基金资助

Research on Ontology Data Storage of Massive Oil Field Based on Neo4j

GONG Fa-ming,LI Xiao-ran   

  1. College of Computer & Communication Engineering,China University of Petroleum,Qingdao,Shandong 266580,China
  • Online:2018-06-20 Published:2018-08-03

摘要: 语义网技术的发展促进了石油领域中多学科本体之间的整合技术的发展。随着数据的规模的增大,传统的基于关系型数据库的数据存储和信息检索等存在较多问题。对此,提出了一个基于Neo4j数据库的领域本体构建过程,专注于改进数据存储和信息检索两个方面。首先,提出了一种基于图形数据库Neo4j的大规模本体数据存储问题的解决方案,通过设计一种基于Neo4j的存储模型配合分布式存储机制,实现存储空间的高效利用。其次,在Neo4j数据模型的基础上,设计了一种两层索引结构的检索算法。实验评估表明,提出的方法与基于关系数据库的方法相比,在数据存储方面可以节省10%以上的存储空间,在信息检索方面将检索效率提高了30多倍。

关键词: Neo4j数据库, RDF数据, 大数据存储, 石油领域本体, 运动目标判断

Abstract: The development of semantic web technology has promoted the development of integrated technology between multidisciplinary ontology in the oil field.As the scale of data increases,the traditional data storage and information retrieval based on relational database have encountered a lot of problems.In view of this problem,this paper proposed a domain ontology construction process based on Neo4j database to improve data storage and information retrie-val.Firstly,this paper proposed a solution of large-scale ontology data storage problem based on Neo4j graphics database.By designing a distributed storage mechanism based on Neo4j storage model,the efficient use of storage space was realized .Secondly,based on the Neo4j data model,this paper designed a two-tier index architecture retrieval algorithm.In the light of experimental evaluation,compared with the method based on the relational database,the method proposed in this paper can save more than 10% storage space,and improve the search efficiency by more than 30 times.

Key words: Big data storage, Moving object judgment, Neo4j database, Ontology in oil field, RDF data

中图分类号: 

  • TP391
[1]ISOTANI S,IBERT BITTENCOURT I,BARBOSA E F,et al.Ontology Driven Software Engineering:A Review of Challenges and Opportunities[J].IEEE Latin America Transactions,2015,13(3):863-869.
[2]JONES M V,COVIELLO N,TANG Y K.International Entrepreneurship research (1989-2009):A domain ontology and thematic analysis[J].Journal of Business Venturing,2011,26(6):632-659.
[3]SEQUEDA J F,ARENAS M,MIRANKER D P.On Directly Mapping Relational Databases to RDF and OWL[C]∥International Conference on World Wide Web.ACM,2012:649-658.
[4]LIU B,HUANG K,LI J,et al.An incremental and distributed inference method for large-scale ontologies based on MapReduce paradigm[J].IEEE Transactions on Cybernetics,2015,45(1):53.
[5]段炼.基于文本分析的石油领域本体自动构建方法的研究[D].大庆:东北石油大学,2015.
[6]文必龙,段炼,汪志群,等.基于语料库和规则库的石油本体自动构建研究[J].计算机技术与发展,2015(9):209-212.
[7]王月.基于本体的油田开发知识库构建研究[D].大庆:东北石油大学,2016.
[8]WANG E,YONG S K,KIM H S,et al.Ontology Modeling and Storage System for Robot Context Understanding[M]∥Know-ledge-Based Intelligent Information and Engineering Systems.Springer Berlin Heidelberg,2005:922-929.
[9]KHALID A,SHAH S A H,QADIR M A.OntRel:An Ontology Indexer to store OWL-DL Ontologies and its Instances[C]∥Soft Computing and Pattern Recognition.2009:478-483.
[10]THAKER R,GOEL A.Domain Specific Ontology based Query processing System for Urdu Language[J].International Journal of Computer Applications,2015,121(3):20-23.
[11]陶皖,姚红燕. OWL本体关系数据库存储模式设计.计算机技术与发展,2007,17(2):111-114.
[12]PINKEL C,BINNIG C,JIM NEZRUIZ E,et al.RODI:A Ben- chmark for Automatic Mapping Generation in Relational-to-Ontology Data Integration[M]∥The Semantic Web.Latest Advances and New Domains.Springer International Publishing,2015:21-37.
[13]ELBATTAH M,ROUSHDY M,AREF M,et al.Large-scale ontology storage and query using graph database-oriented approach:The case of Freebase[C]∥IEEE Seventh International Conference on Intelligent Computing and Information Systems.IEEE,2016:39-43.
[14]HUANG J,ABADI D J,REN K.Scalable SPARQL querying of large RDF graphs[J].Proceedings of the Vldb Endowment,2011,4:1123-1134.
[15]MALEWICZ G,AUSTERN M H,BIK A J C,et al.Pregel:a system for large-scale graph processing[C]∥ACM SIGMOD International Conference on Management of Data.ACM,2010:135-146.
[16]KHAN L,LUO F.Ontology construction for information selection[C]∥IEEE International Conference on TOOLS with Artificial Intelligence.IEEE Xplore,2002:122-127.
[17]DOMBAYCI C,FARRERES J,RODRíGUEZ H,et al.On the Process of Building a Process Systems Engineering Ontology Using a Semi-Automatic Construction Approach[J].Computer Aided Chemical Engineering,2016,37:941-946.
[18]张晓冉,舒昝.基于关系数据库的油田领域数据质量本体构建[J].微型电脑应用,2016,32(7):71-73.
[19]HARTIG O.Reconciliation of RDF* and Property Graphs[R].University of Waterloo,2014.
[20]张前进.基于Neo4j的智能学习系统语义链接图式存储研究[J].佳木斯大学学报(自然科学版),2017,35(2):299-301.
[21]彭安琪.分布式溯源信息存储系统的研究与实现[D].成都:电子科技大学,2016.
[22]康杰华,罗章璇.基于图形数据库Neo4j的RDF数据存储研究[J].信息技术,2015(6):115-117.
[23]HOLZSCHUHER F.Performance of graph query languages: comparison of cypher,gremlin and native access in Neo4j[C]∥Joint EDBT/ICDT 2013 Workshop GraphQ.2013:195-204.
[24]KU R F.Apache Solr 4 Cookbook[M].Packt Publishing,2013.
[25]ASSOCIATION I R M.International journal on Semantic Web and information systems[J].Journal of Polymer Science Polymer Chemistry Edition,2013,22(10):2625-2640.
[26]RAJITH A,NISHIMURA S,YOKOTA H.JARS:Join-Aware Distributed RDF Storage[C]∥International Database Enginee-ring & Applications Symposium.ACM,2016:264-271.
[1] 郑志蕴 刘 博 李 伦 王振飞.
基于关键词的RDF数据图查询模型研究
Research of Keyword Search Model over RDF Data Graph
计算机科学, 2015, 42(7): 234-239. https://doi.org/10.11896/j.issn.1002-137X.2015.07.050
[2] 李慧颖,瞿裕忠.
基于关键词的语义网数据查询研究综述
Keyword-based Search on Semantic Web Data: The State of the Art
计算机科学, 2011, 38(7): 18-23.
[3] 吴鸿汉 瞿裕忠.
RDF数据浏览的研究综述

计算机科学, 2009, 36(2): 5-10.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!