Computer Science ›› 2009, Vol. 36 ›› Issue (11): 123-126.

• Software Engineering and Database Technology •

Research on Data Organization in a High-Availability Object Storage System

詹玲,张强善,万继光   

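  1. (Key Laboratory of Information Storage Systems, Ministry of Education, School of Computer Science, Huazhong University of Science and Technology; Wuhan National Laboratory for Optoelectronics, Wuhan 430074); (School of Computer Science, Wuhan University, Wuhan 430072); (Network Center, Xinyang Vocational and Technical College, Xinyang 464000)
  • Publication date: 2018-11-16 Release date: 2018-11-16
  • Funding:
    This work was supported by the Key Program of the National Natural Science Foundation of China (No. 60933002) and a Major Project of the National 863 Program (No. 2009AA01A402).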

Data Organization Research of the High Availability Object Storage System

ZHAN Ling, ZHANG Qiang-shan, WAN Ji-guang

  • Online: 2018-11-16 Published: 2018-11-16

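Abstract: Based on a careful analysis of the fault tolerance of existing storage systems, this paper proposes a new high-availability object storage system architecture, HAOSS (High Availability Object Storage System). HAOSS has two layers. The upper layer achieves high reliability by replicating each object across multiple devices; the replicas can serve requests at the same time, which keeps system performance high, but disk utilization is relatively low. The lower layer uses fault-tolerant codes such as RAID5, RAID6, and RAID-Blaum to provide multi-disk fault tolerance; disk utilization is higher, but as the codes grow more complex they require a large amount of computation, and performance suffers considerably. In terms of data organization, new objects and hot objects are placed in the upper layer so that most requests hit there, which preserves system performance. The lower layer mainly holds infrequently used data and has relatively high disk utilization. HAOSS was tested over 1000 Mbps Ethernet. The results show good sequential read and write performance, with a maximum of 104 MB/s, which is close to the theoretical maximum physical bandwidth of 1000 Mb Ethernet.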

Keywords: Data organization, High availability, Object storage, Fault tolerance

Abstract: Based on a thorough analysis of the fault tolerance of various existing storage systems, we propose a new hierarchical, highly reliable, multi-disk fault-tolerant storage system architecture: the High Availability Object Storage System (HAOSS). HAOSS is composed of two layers, an upper layer and a lower layer. The upper layer achieves high availability by storing multiple replicas of each storage object across a set of storage devices. The individual replicas can service I/O requests in parallel to obtain high performance, but the effective disk space utilization of the upper layer is relatively low. The lower layer deploys RAID5, RAID6, or RAID-Blaum coding schemes to tolerate multi-disk failures. The disk utilization of these coding schemes is higher than that of multiple replicas. This advantage comes at the price of more complicated fault-tolerant coding schemes, which involve a large amount of computation for encoding and adversely affect I/O performance. HAOSS places new objects and hot objects in its upper layer so that the majority of requests are absorbed there, which preserves system I/O performance. The main purpose of the lower layer is to provide a reservoir for cold data. In a 1000 Mb Ethernet environment, with a request block size of 1024 kB, the sequential read performance of a HAOSS server reaches 104 MB/s, which is very close to the theoretical maximum effective bandwidth of the network.

Key words: Data organization, High availability, Object storage system, Fault-tolerant
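
The placement policy described in the abstract (new and hot objects in the replicated upper layer, cold objects in the coded lower layer) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the class `TwoTierStore`, the read-count hotness threshold, the evict-the-coldest rule, and the two utilization helpers are hypothetical, since the abstract does not say how HAOSS measures hotness or migrates objects between layers.

```python
from dataclasses import dataclass, field


@dataclass
class TwoTierStore:
    """Toy model: replicated upper layer for new/hot objects, coded lower layer for cold ones."""
    upper_capacity: int          # max objects held in the upper layer (assumed, must be >= 1)
    hot_threshold: int = 3       # lower-layer reads before promotion (assumed)
    upper: dict = field(default_factory=dict)   # object id -> read count
    lower: dict = field(default_factory=dict)   # object id -> read count

    def write(self, oid):
        # New objects always land in the upper (replicated) layer.
        self.lower.pop(oid, None)
        self.upper[oid] = 0
        self._demote_if_full(keep=oid)

    def read(self, oid):
        # Return the name of the layer that served the request.
        if oid in self.upper:
            self.upper[oid] += 1
            return "upper"
        if oid not in self.lower:
            raise KeyError(oid)
        self.lower[oid] += 1
        if self.lower[oid] >= self.hot_threshold:
            # The object has become hot: promote it for subsequent requests.
            self.upper[oid] = self.lower.pop(oid)
            self._demote_if_full(keep=oid)
        return "lower"

    def _demote_if_full(self, keep):
        # Move the least-read objects down until the upper layer fits again.
        while len(self.upper) > self.upper_capacity:
            coldest = min((o for o in self.upper if o != keep), key=self.upper.get)
            del self.upper[coldest]
            self.lower[coldest] = 0


def replication_utilization(replicas):
    # Fraction of raw capacity holding user data with n-way replication.
    return 1 / replicas


def parity_utilization(data_disks, parity_disks):
    # Fraction of raw capacity holding user data in a parity-coded array.
    return data_disks / (data_disks + parity_disks)
```

Under these assumptions, three-way replication stores user data in one third of the raw capacity, while a parity-coded array with four data disks and two parity disks stores it in two thirds, which is the utilization gap between the two layers that the abstract refers to.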
