Computer Science (计算机科学), 2011, Vol. 38, Issue (6): 187-190.

• Database and Data Mining •

Construction of a Scalable Distributed Inverted Index Based on Cassandra

唐李洋,倪志伟,李应   

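  1. (Institute of Intelligent Management, School of Management, Hefei University of Technology, Hefei 230009)
  • Publication date: 2018-11-16 Release date: 2018-11-16
  • Funding:
    This work was supported by the National Natural Science Foundation of China (70871033), the National High Technology Research and Development Program of China (863 Program) (2007AA04Z116), and the National Social Science Foundation of China (10CGL024).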

Scalable Distributed Inverted Index Built on Cassandra

TANG Li-yang, NI Zhi-wei, LI Ying

  • Online: 2018-11-16 Published: 2018-11-16

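Abstract (translated from the Chinese): With the arrival of the cloud computing era and the continued growth of large-scale Web applications, the volume of data keeps increasing, and centralized data retrieval can no longer meet demand. How to process data retrieval efficiently in a distributed environment has become a pressing problem. Traditional relational data storage also cannot fully adapt to cloud environments. NoSQL (Not only SQL) has emerged as a form of cloud storage, and Cassandra is among its more widely used systems. Against the background of index construction on a distributed multi-node architecture, this paper proposes a distributed inverted index (DII, Distributed Inverted Index) built on top of Cassandra, a distributed and scalable data store, analyzes its data model and query processing flow, and finally presents a performance test of Cassandra.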

Keywords (translated from the Chinese): cloud storage, distributed index, inverted index, Cassandra

Abstract: In the age of cloud computing, giant Web-scale applications and massive data pose major challenges to centralized data retrieval. Processing data retrieval efficiently in a distributed environment has become a difficult problem. Traditional relational databases are no longer fully suited to the cloud. NoSQL (Not only SQL) has emerged as a novel form of cloud storage, and Cassandra is widely used among NoSQL stores. Taking index building under a distributed multi-node architecture as the application setting, a scalable distributed inverted index built on Cassandra is put forward to solve the problem. In addition, an analysis of the data model and the processing flow, as well as a performance test of Cassandra, are presented.

Key words: Cloud storage, Distributed index, Inverted index, Cassandra
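
The abstract names a distributed inverted index but does not spell out its data model, so the following is only a minimal sketch of the general idea, not the authors' design. It assumes a term-partitioned layout in which each term acts as a row key and its posting list is stored as columns (document ID mapped to term frequency), loosely in the spirit of a wide-row column-family store. The class `ToyDistributedInvertedIndex`, its methods, and the in-memory "nodes" are all hypothetical stand-ins; no Cassandra client is used.

```python
from collections import defaultdict
import hashlib
import re


class ToyDistributedInvertedIndex:
    """Term-partitioned inverted index over in-memory 'nodes' (illustrative only)."""

    def __init__(self, num_nodes=3):
        # Each node maps a row key (the term) to its columns
        # (doc_id -> term frequency). This layout is an assumption.
        self.nodes = [defaultdict(dict) for _ in range(num_nodes)]

    def _node_for(self, term):
        # Hash the row key to pick the owning node; a simple stand-in
        # for a real partitioner.
        digest = hashlib.md5(term.encode("utf-8")).hexdigest()
        return self.nodes[int(digest, 16) % len(self.nodes)]

    @staticmethod
    def _tokenize(text):
        # Lowercase and split on anything that is not a letter or digit.
        return re.findall(r"[a-z0-9]+", text.lower())

    def add_document(self, doc_id, text):
        # Add one column per (term, document) pair, counting occurrences.
        for term in self._tokenize(text):
            row = self._node_for(term)[term]
            row[doc_id] = row.get(doc_id, 0) + 1

    def postings(self, term):
        # .get avoids creating an empty row for unknown terms.
        term = term.lower()
        return dict(self._node_for(term).get(term, {}))

    def search(self, query):
        """Return sorted ids of documents containing every query term."""
        terms = self._tokenize(query)
        if not terms:
            return []
        result = set(self.postings(terms[0]))
        for term in terms[1:]:
            result &= set(self.postings(term))
        return sorted(result)


index = ToyDistributedInvertedIndex(num_nodes=3)
index.add_document("d1", "Cassandra is a distributed data store")
index.add_document("d2", "An inverted index maps terms to documents")
index.add_document("d3", "A distributed inverted index on Cassandra")
print(index.search("distributed Cassandra"))
```

In this sketch a multi-term query fetches one posting list per term, possibly from different nodes, and intersects them on the caller's side. A real deployment would also have to consider replication, consistency levels, and very long posting lists, none of which are modeled here.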
