计算机科学 ›› 2012, Vol. 39 ›› Issue (5): 271-277.

• 体系结构 • 上一篇    下一篇

基于OpenCL的拉普拉斯图像增强算法优化研究

贾海鹏,张云泉,龙国平,徐建良,李炎   

  1. (中国科学院软件研究所并行软件与计算科学实验室北京100190);(中国海洋大学信息科学与工程学院青岛266100);(中国科学院软件研究所计算机科学国家重点实验室北京100190);(中国科学院研究生院北京100190)4
  • 出版日期:2018-11-16 发布日期:2018-11-16

Research on Laplace Image Enhancement Algorithm Optimization Based on OpenCL

  • Online:2018-11-16 Published:2018-11-16

摘要: OpcnCL是面向异构计算平台的通用编程框架,然而由于硬件体系结构的差异,如何在平台间功能移植的基 础上实现性能移植仍是有待研究的问题。当前已有算法优化研究一般只针对单一硬件平台,它们很难实现在不同平 台上的高效运行。在分析了不同GPU平台底层硬件架构的基础上,从Global Memory的访存效率、CPU计算资源的 有效利用率及其硬件资源的限制等多个角度考察了不同优化方法在不同GPU硬件平台上对性能的影响;并在此基 础上实现了基于OpenCL的拉普拉斯图像增强算法。实验结果表明,优化后的算法在不考虑数据传输时间的前提下, 在AMI)和NVIDIA CPU上都取得了3. 7-136. 1倍、平均56. 7倍的性能加速,优化后的kernel比NVIDIA NPP库 中相应函数也取得了12.3%-346.7%、平均143. 1%的性能提升,验证了提出的优化方法的有效性和性能可移植性。

关键词: OpenCL,通用计算,拉普拉斯算法,跨平台

Abstract: OOpenCL is a general-purpose programming framework forheterogeneous computing platforms, however, due to the differences in hardware architecture,how to achieve performance portability on different platforms based on the function portability is still to be studied. Currently most of the researches on algorithm optimization are aimed at a sin- gle hardware platform, and difficult to achieve the efficient running on different platforms. This paper analysed the differences between the underlying hardware architectures of GPU, and studied the effects of different GPU platforms using different optimization methods on performance from the access efficiency of global memory, full use of the(}PU compute resource, the constraints with hardware resource and other aspects. Based on this, the Laplace image enhance- ment algorithm based on OpenCL was implemented. Experimental results show that optimized algorithm gets 3. 7一136. 1 times and 56. 7 times on average speedup(without calculate the data transfer time) on both AMD and NVIDIA GPU, and the performance of the optimized kernel increases 12. 3%一346. 7 0 o and 143. 1 0 o on average than the CUDA ver- sion in NVIDIA NPP library,which verifies the effectiveness and cross-platform ability of optimization methods.

Key words: OpenCL, General-purpose computing, Laplace, Across platform

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!