计算机科学 ›› 2011, Vol. 38 ›› Issue (5): 20-23.

• 计算机网络与信息安全 • 上一篇    下一篇

内容感知存储系统中的两阶段检索策略

刘科,秦磊华,周敬利,聂雪军,曾东   

  1. (华中科技大学计算机科学与技术学院 武汉430074)
  • 出版日期:2018-11-16 发布日期:2018-11-16
  • 基金资助:
    本文受国家自然科学基金(60673001),部委基金“基于服务定制的智能存储系统研究”资助。

Two-phrase Retrieval Strategy in Content Aware Network Storage System

LIU Ke,QIN Lei-hua,ZHOU Jing-li,NIE Xue-jun,ZENG Dong   

  • Online:2018-11-16 Published:2018-11-16

摘要: 随着存储系统规模的不断扩大,如何有效组织、管理和查询存储系统中的资源,成为了研究者必须应对的一个问题。目前存储系统中的查询需求主要来自系统管理员对元数据的查询以及普通用户对关键字内容的查询等两个方面。而内容感知存储系统自身所具备的重复数据删除和块相似性检测能力并没有被用于优化上述查询过程。为了充分利用存储系统感知到的上层语义和底层重复数据块信息,为使用者提供高效、便捷的查询服务,提出了内容感知网络存储系统中的两阶段检索策略。该策略将上层基于元数据和关键字的查询与底层存储系统的块相似性查询相结合,利用两次查询相关度的加权平均值作为相似度评价指标。最终的实验结果表明了该策略在降低失效性、提高查全率等方面的有效性。

关键词: 元数据,数据迁移,内容寻址存储,两阶段检索,内容感知

Abstract: As the storage capacity approach Exabytes, how to efficiently organize, find and manage data is becoming increasingly difficult for us. The query requests in storage system are coming from two aspects, the first one is metadata retrieval delivered by administrator and the second one is user's common keyword query. But the functions of de-duplication and block similarity detection in content aware storage system are not utilized to enhance the above query processing. In order to take advantage of the upper semantic information and the lower storage system's duplicate block information to deliver efficient query service for users, a two-phrase retrieval strategy was introduced. It combined metadata/keyword query with block similarity query and utilized ranking coefficient to evaluate similarity among query resups. The experiments indicate that the retrieval strategy has efficiently enhanced the retrieval recall.

Key words: Metadata, Data migration, Content addressable storage, Two-phrase retrieval, Content aware

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!