Computer Science ›› 2010, Vol. 37 ›› Issue (12): 1-7.

• Survey •

Survey and Review on Key Technologies of Column Oriented Database Systems

LI Chao, ZHANG Ming-bo, XING Chun-xiao, HU Jin-song

  1. (Research Institute of Information Technology, Tsinghua University, Beijing 100084)
  • Online: 2018-12-01  Published: 2018-12-01
  • Supported by:
    This work was supported by the National 863 Program (No. 2009AA01Z143) and the Ministry of Railways-Tsinghua University Science and Technology Research Fund (No. J2008X009).

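Abstract (from the Chinese): With the development of Internet technology, continual hardware upgrades, and the deepening adoption of information systems in enterprises and government, applications have become increasingly complex. This is driving data storage technology toward massive data, analytical data, and intelligent data, so as to provide efficient, real-time support for data warehousing and online analysis. Row-store database technology faces new problems and has reached a technical bottleneck. In recent years a new approach to data storage has emerged: the column-store relational database (hereafter, column database). Column databases have developed rapidly mainly because of their high efficiency on complex queries, fewer disk reads, and smaller storage footprint, together with the resulting technical, management, and application advantages. This paper introduces and analyzes the current state of column database technology, its key supporting technologies, and its application advantages.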

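Keywords (from the Chinese): column database, column store, data compression, late materialization, block iteration, invisible join, data warehouse, business intelligence, TPCH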

Abstract: A column-oriented database is a new kind of database storage technology that stores data by column rather than, as is traditional, by row. Database pioneers such as Dr. Michael Stonebraker are advocating and exploring new theory and technology for column-oriented databases. Their main features are good query efficiency, less disk access, less storage, and a significant improvement in database performance. A column-oriented database is a natively ideal architecture for data warehouses, and thus shows good potential for supporting highly efficient business intelligence applications. The technology is promising in both academia and business, and has therefore attracted many high-tech corporations and research institutes. This paper introduces and analyzes the main features, key technologies, and current R&D status of column-oriented databases.

Key words: Column-oriented database, Compression, Block iteration, Late materialization, Invisible join, Data warehouse, Business intelligence, TPCH
