计算机科学 ›› 2019, Vol. 46 ›› Issue (2): 171-177.doi: 10.11896/j.issn.1002-137X.2019.02.027

所属专题: 数据库技术

• 软件与数据库技术 • 上一篇    下一篇

基于CAN的地理语义数据存储与检索机制

卢海川, 符海东, 刘宇   

  1. 武汉科技大学计算机科学与技术学院 武汉430065
    智能信息处理与实时工业系统湖北省重点实验室 武汉430065
  • 收稿日期:2018-08-16 出版日期:2019-02-25 发布日期:2019-02-25
  • 通讯作者: 刘 宇(1980-),男,博士,副教授,主要研究方向为语义网、自然语言处理和分布式计算,E-mail:liuyu@wust.edu.cn。
  • 作者简介:卢海川(1993-),男,硕士生,主要研究方向为语义网、分布式计算和信息检索,E-mail:lhch35@126.com;符海东(1971-),男,博士,教授,主要研究方向为软件工程、数据挖掘
  • 基金资助:
    本文受国家自然科学基金(61673304,61272110,61502359),国家社会科学基金重大计划(11&ZD189)资助。

Geo-semantic Data Storage and Retrieval Mechanism Based on CAN

LU Hai-chuan, FU Hai-dong, LIU Yu   

  1. College of Computer Science and Technology,Wuhan University of Science and Technology,Wuhan 430065,China
    Hubei Province Key Laboratory of Intelligent Information Processing and Real Time Industrial System,Wuhan 430065,China
  • Received:2018-08-16 Online:2019-02-25 Published:2019-02-25

摘要: 语义技术能够更智能、更精确地检索信息,辅助工作人员进行科学决策,已被应用于地理信息处理,并形成了基于RDF(Resource Description Framework)数据的地理查询语言GeoSPARQL。然而,基于地理语义信息处理的应用平台多采用中心化的存储和检索服务,使得这些平台存在单节点失效、扩展性差等缺陷。尽管已有研究人员提出了多种方法,试图利用对等网络技术来解决语义数据的分布式处理,从而提升应用系统的可靠性和扩展性,但这些方法并没有考虑地理语义数据自身的特征。针对上述问题,文中利用地理语义数据的特征在对等网络上对其进行存储,提出基于CAN(Content Addressable Network)的地理语义存储和检索方案,根据位置信息将地理语义数据映射到对等网络中,从而提高了语义数据的检索效率。实验结果表明,所提方案不仅具有良好的扩展性,而且地理信息的拓扑关系查询效率优于现有方案。

关键词: GeoSPARQL, RDF存储查询, 地理语义信息, 内容寻址网络

Abstract: Semantic technology can search information more intelligently and accurately,and assist researchers to make scientific decisions.Therefore,this technology has been introduced into geographic information processing and formed a geo-query language GeoSPARQL based on RDF (Resource Description Framework).However,the existing application platforms based on geographic semantic information processing adopt centralized storage and retrieval services,which will cause the disadvantages of single node failure and poor scalability.Although researchers have proposed a variety of methods to use peer-to-peer network to improve the reliability and scalability of application systems,these methods do not consider the characteristics of geographic semantic data.In view of the above problems,this paper considered the feature of geographical semantic data and optimized the storage of semantic data on the peer-to-peer network.This paper proposed a storage and retrieval scheme based on content addressed network,and also improved the retrieval efficiency of semantic data by mapping the triple to the network according to its position.The experimental results show that the proposed scheme has good expansibility,and the query efficiency of topology relation is superior to the existing schemes.

Key words: Content addressable network, Geo-semantic information, GeoSPARQL, RDF storage and query

中图分类号: 

  • TP311
[1]BATTLE R,KOLAS D.Enabling the geospatial semantic web with parliament and geosparql[J].Semantic Web,2012,3(4):355-370.
[2]BALLATORE A,WILSON D C,BERTOLOTTO M.A survey of volunteered open geo-knowledge bases in the semantic web[M]∥Berlin:Quality Issues in the Management of Web Information.Springer,2013:93-120.
[3]GERLA M,LEE E K,PAU G,et al.Internet of vehicles:From intelligent grid to autonomous cars and vehicular clouds[C]∥2014 IEEE World Forum on Internet of Things (WF-IoT).IEEE,2014:241-246.
[4]RATNASAMY S,FRANCIS P,HANDLEY M,et al.A scalable content-addressable network[C]∥ Conference on Applications,Technologies,Architectures,and Protocols for Computer Communications.ACM,2001:161-172.
[5]ZHANG C,ZHAO T,LI W.Automatic search of geospatial features for disaster and emergency management[J].International Journal of Applied Earth Observation and Geoinformation,2010,12(6):409-418.
[6]ZHANG C,ZHAO T,LI W.The framework of a geospatial semantic web-based spatial decision support system for digital earth[J].International Journal of Digital Earth,2010,3(2):111-134.
[7]CRUZ I F,GANESH V R,MIRREZAEI S I.Semantic extraction of geographic data from web tables for big data integration[C]∥Proceedings of the 7th Workshop on Geographic Information Retrieval.ACM,2013:19-26.
[8]YU L,LU F,ZHANG X,et al.Context Enhanced Keyword Extraction for Sparse Geo-Entity Relation from Web Texts[C]∥Asia-Pacific Web Conference.Springer,Cham,2016:253-264.
[9]DUAN H W,MENG L K,HUANG C Q,et al.A Method for Geo Semantic Spatial Index on SPARQL Query[J].Acta Geodaetica et Cartographica Sinica,2014,43(2):193-199.(in Chinese)
段红伟,孟令奎,黄长青,等.面向 SPARQL 查询的地理语义空间索引构建方法[J].测绘学报,2014,43(2):193-199.
[10]LI W,GOODCHILD M F,RASKIN R.Towards geospatial semantic search:exploiting latent semantic relations in geospatial data[J].International Journal of Digital Earth,2014,7(1):17-37.
[11]ZHANG C,ZHAO T,LI W.Towards an interoperable online volunteered geographic information system for disaster response[J].Journal of Spatial Science,2015,60(2):257-275.
[12]ZHAO T,ZHANG C,ANSELIN L,et al.A parallel approach for improving Geo-SPARQL query performance[J].Internatio-nal Journal of Digital Earth,2015,8(5):383-402.
[13]ZHANG C,ZHAO T,ANSELIN L,et al.A Map-Reduce based parallel approach for improving query performance in a geospatial semantic web for disaster response[J].Earth Science Informatics,2015,8(3):499-509.
[14]DEAN J,GHEMAWAT S.MapReduce:simplified data proces- sing on large clusters[J].Communications of the ACM,2008,51(1):107-113.
[15]ZHOU J,HALL W,DE R D.Building a distributed infrastructure for scalable triple stores[J].Journal of Computer Science and Technology,2009,24(3):447-462.
[16]GIUNCHIGLIA F,KHARKEVICH U,HUME A,et al.Semantic flooding:search over semantic links[C]∥2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW).IEEE,2010:191-196.
[17]SAROIU S,GUMMADI P K,GRIBBLE S D.Measurement study of peer-to-peer file sharing systems[C]∥Multimedia Computing and Networking 2002.International Society for Optics and Photonics,2001,4673:156-171.
[18]SEN S,WANG J.Analyzing peer-to-peer traffic across large networks[C]∥Proceedings of the 2nd ACM SIGCOMM Workshop on Internet measurement.ACM,2002:137-150.
[19]STOICA I,MORRIS R,LIBEN-N D,et al.Chord:a scalable peer-to-peer lookup protocol for internet applications[J].IEEE/ACM Transactions on Networking (TON),2003,11(1):17-32.
[20]CAI M,FRANK M,YAN B,et al.A subscribable peer-to-peer RDF repository for distributed metadata management[J].Web Semantics:Science,Services and Agents on the World Wide Web,2004,2(2):109-130.
[21]MATONO A,PAHLEVI S M,KOJIMA I.RDFCube:A P2P-based three-dimensional index for structural joins on distributed triple stores[M]∥Databases,Information Systems,and Peer-to-Peer Computing.Springer Berlin Heidelberg,2007:323-330.
[22]LIAROU E,IDREOS S,KOUBAEAKIS M.Publish/subscribe with RDF data over large structured overlay networks[M]∥Databases,Information Systems,and Peer-to-Peer Computing.Springer Berlin Heidelberg,2007:135-146.
[23]PELLEGRINO L,HUET F,BAUDE F,et al.A distributed publish/subscribe system for RDF data[C]∥International Conference on Data Management in Cloud,Grid and P2P Systems.Springer Berlin Heidelberg,2013:39-50.
[24]ALI L,JANSON T,SCHINDELHAUER C.Towards Load Ba- lancing and Parallelizing of RDF Query Processing in P2P Based Distributed RDF Data Stores[C]∥Euromicro International Conference on Parallel.2014.
[25]NIKOLAOU C,KOUBARAKIS M.Fast consistency checking of very large real-world RCC-8 constraint networks using graph partitioning[C]∥Twenty-eighth Aaai Conference on Artificial Intelligence.2014.
[26]RIPEANU M.Peer-to-peer architecture case study:Gnutella network[C]∥Proceedings First International Conference on Peer-to-Peer Computing.2001:99-100.
[1] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[2] 王子凯, 朱健, 张伯钧, 胡凯.
区块链与智能合约并行方法研究与实现
Research and Implementation of Parallel Method in Blockchain and Smart Contract
计算机科学, 2022, 49(9): 312-317. https://doi.org/10.11896/jsjkx.210800102
[3] 曾志贤, 曹建军, 翁年凤, 蒋国权, 徐滨.
基于注意力机制的细粒度语义关联视频-文本跨模态实体分辨
Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism
计算机科学, 2022, 49(7): 106-112. https://doi.org/10.11896/jsjkx.210500224
[4] 熊罗庚, 郑尚, 邹海涛, 于化龙, 高尚.
融合双向门控循环单元和注意力机制的软件自承认技术债识别方法
Software Self-admitted Technical Debt Identification with Bidirectional Gate Recurrent Unit and Attention Mechanism
计算机科学, 2022, 49(7): 212-219. https://doi.org/10.11896/jsjkx.210500075
[5] 潘志勇, 程宝雷, 樊建席, 卞庆荣.
数据中心网络BCDC上的顶点独立生成树构造算法
Algorithm to Construct Node-independent Spanning Trees in Data Center Network BCDC
计算机科学, 2022, 49(7): 287-296. https://doi.org/10.11896/jsjkx.210500170
[6] 李瑭, 秦小麟, 迟贺宇, 费珂.
面向多无人系统的安全协同模型
Secure Coordination Model for Multiple Unmanned Systems
计算机科学, 2022, 49(7): 332-339. https://doi.org/10.11896/jsjkx.210600107
[7] 黄觉, 周春来.
基于本地化差分隐私的频率特征提取
Frequency Feature Extraction Based on Localized Differential Privacy
计算机科学, 2022, 49(7): 350-356. https://doi.org/10.11896/jsjkx.210900229
[8] 叶跃进, 李芳, 陈德训, 郭恒, 陈鑫.
基于国产众核架构的非结构网格分区块重构预处理算法研究
Study on Preprocessing Algorithm for Partition Reconnection of Unstructured-grid Based on Domestic Many-core Architecture
计算机科学, 2022, 49(6): 73-80. https://doi.org/10.11896/jsjkx.210900045
[9] 赵静文, 付岩, 吴艳霞, 陈俊文, 冯云, 董继斌, 刘嘉琪.
多线程数据竞争检测技术研究综述
Survey on Multithreaded Data Race Detection Techniques
计算机科学, 2022, 49(6): 89-98. https://doi.org/10.11896/jsjkx.210700187
[10] 陈鑫, 李芳, 丁海昕, 孙唯哲, 刘鑫, 陈德训, 叶跃进, 何香.
面向国产异构众核架构的CFD非结构网格计算并行优化方法
Parallel Optimization Method of Unstructured-grid Computing in CFD for DomesticHeterogeneous Many-core Architecture
计算机科学, 2022, 49(6): 99-107. https://doi.org/10.11896/jsjkx.210400157
[11] 王毅, 李政浩, 陈星.
基于用户场景的Android 应用服务推荐方法
Recommendation of Android Application Services via User Scenarios
计算机科学, 2022, 49(6A): 267-271. https://doi.org/10.11896/jsjkx.210700123
[12] 傅丽玉, 陆歌皓, 吴义明, 罗娅玲.
区块链技术的研究及其发展综述
Overview of Research and Development of Blockchain Technology
计算机科学, 2022, 49(6A): 447-461. https://doi.org/10.11896/jsjkx.210600214
[13] 蒋成满, 华保健, 樊淇梁, 朱洪军, 徐波, 潘志中.
Python虚拟机本地代码的安全性实证研究
Empirical Security Study of Native Code in Python Virtual Machines
计算机科学, 2022, 49(6A): 474-479. https://doi.org/10.11896/jsjkx.210600200
[14] 袁昊男, 王瑞锦, 郑博文, 吴邦彦.
基于Fabric的电子病历跨链可信共享系统设计与实现
Design and Implementation of Cross-chain Trusted EMR Sharing System Based on Fabric
计算机科学, 2022, 49(6A): 490-495. https://doi.org/10.11896/jsjkx.210500063
[15] 陈钧吾, 余华山.
面向无尺度图的Δ-stepping算法改进策略
Strategies for Improving Δ-stepping Algorithm on Scale-free Graphs
计算机科学, 2022, 49(6A): 594-600. https://doi.org/10.11896/jsjkx.210400062
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!