计算机科学 ›› 2016, Vol. 43 ›› Issue (11): 265-271.doi: 10.11896/j.issn.1002-137X.2016.11.052

• 软件与数据库技术 • 上一篇    下一篇

集成CPU-GPU架构上的列存储连接优化技术研究

丁祥武,李子通   

  1. 东华大学计算机科学与技术学院 上海201620,东华大学计算机科学与技术学院 上海201620
  • 出版日期:2018-12-01 发布日期:2018-12-01

Column-stores Join Optimization on Coupled CPU-GPU Architecture

DING Xiang-wu and LI Zi-tong   

  • Online:2018-12-01 Published:2018-12-01

摘要: 集成多核CPU-GPU架构已经成为计算机处理器芯片的发展方向。利用这种架构的并行计算能力进行数据处理已经成为了数据库领域的研究热点。为了提高列存储系统的查询性能,首先改进了已有协处理机制中的负载分配策略,通过监测数据库系统CPU占用率,动态地为处理器提供合理的数据划分;然后,针对集成多核CPU-GPU架构上的数据预取机制,提出了一种确定预取数据大小的模型,同时,针对GPU访存的特点,进行了GPU访存优化;最后,使用OpenCL作为编程语言,实现了一种集成多核CPU-GPU架构上的列存储排序归并连接算法,并采用提出的方法对连接处理进行优化。实验证明,所提优化策略可以使列存储系统排序归并连接性能提升33%。

关键词: 异构芯片,数据预取,查询优化,排序归并连接,OpenCL

Abstract: Heterogeneous architecture is the new trend of the development of computer system central processor unit(CPU).Taking advantage of its powerful computer power has been a new research hotspot in database system field.First,in order to enhance the query performance of column-oriented database,we proposed a data partition model which is environmentally sensitive.The data partition model provides optimal data division for every processing unit dynamically by monitoring the CPU occupancy rate.Then,for GPU memory access optimization,we proposed a DFAT estimate model for prefetching.At the same time,we optimized GPU memory access based on coalesced access.We implemented a sort-merge join algorithm on a PC with an integrated CPU-GPU chip,which adopts the out data partition model and our cost model in prefetching.Our strategy is able to distribute data to different processing units automatically,and can make sort-merge join achieve a performance improvement of 33% on coupled CPU-GPU architecture.

Key words: Heterogeneous chip,Data pre-fetching,Query optimization,Sort-merge join,OpenCL

[1] He J,Lu M,He B.Revisiting co-processing for hash joins on the coupled CPU-GPU Architecture[J].Proceedings of the VLDB Endowment,2013,6(10):889-900
[2] Lu Feng-shun,Song Jun-qiang,Yin Fu-kang,et al.Survey ofCPU/GPU Synergetic Parallel Computing[J].Computer Science 2011,38(3):5-9(in Chinese) 卢风顺,宋君强,银福康,等.CPU/GPU协同并行计算研究综述[J].计算机科学,2011,38(3):5-9
[3] He J,Zhang S,He B.In-Cache Query Co-Processing on Coupled CPU-GPU Architectures[J].Proceedings of the VLDB Endowment,2014,8(4):329-340
[4] Albutiu M C,Kemper A,Neumann T.Massively parallel sort-merge joins in main memory multi-core database systems[J].Proceedings of the VLDB Endowment,2012,5(10):1064-1075
[5] Balkesen C,Alonso G,Teubner J,et al.Multi-core,main-memory joins:Sort vs.hash revisited[J].Proceedings of the VLDB Endowment,2013,7(1):85-96
[6] He B,Lu M,Yang K,et al.Relational query coprocessing ongraphics processors[J].ACM Transactions on Database Systems(TODS),2009,34(4):2939-2965
[7] Fang R,He B,Lu M,et al.GPUQP:query co-processing using graphics processors[C]∥Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data.ACM,2007:1061-1063
[8] Heimel M,Saecker M,Pirk H,et al.Hardware-oblivious paral-lelism for in-memory column-stores[J].Proceedings of the VLDB Endowment,2013,6(9):709-720
[9] Chen S,Ailamaki A,Gibbons P B,et al.Improving hash joinperformance through prefetching[J].ACM Transactions on Database Systems(TODS),2007,32(3):116-127
[10] Zhou J,Cieslewicz J,Ross K A,et al.Improving database performance on simultaneous multithreading processors[C]∥Proceedings of the 31st international conference on Very large data bases.VLDB Endowment,2005:49-60
[11] Peters H,Schulz-Hildebrandt O,Luttenberger N.Parallel Processing and Applied Mathematics[M].Springer Berlin Heidelberg,2010:403-410
[12] Hardavellas N,Pandis I,Johnson R,et al.Database servers onchip multiprocessors:Limitations and opportunities[C]∥Proceedings of the Biennial Conference on Innovative Data Systems Research.2007:79-87
[13] He B,Yang K,Fang R,et al.Relational joins on graphics proces-sors[C]∥Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data.ACM,2008:511-524
[14] Rentzsch J.Data alignment:Straighten up and fly right[R/OL].(2005-02-08).https://www.Ibm .com/developerworks/library/pa-dalign/pa-dalign-pdf.pdf
[15] Chen Hu,Chen Si-tong,Li Guan-zhao,et al.Column-Storage database parallel query on heterogeneous computing platforms[J].Journal of Computer Research and Development,2012,9(Suppl.):65-71(in Chinese) 陈虎,陈思桐,李观钊,等.异构计算平台上列存储数据库的并行查询技术研究[J].计算机研究与发展,2012,9(Suppl.):65-71
[16] Gaster B R,Howes L,Kaeli D R,et al.OpenCL异构计算[M].张云泉,张先轶,龙国平,等译.北京:清华大学出版社,2012:42-44

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!