计算机科学 ›› 2011, Vol. 38 ›› Issue (7): 298-301.

• 体系结构 • 上一篇    下一篇

SIMD技术与向量数学库研究

解庆春,张云泉,王可,李焱,许亚武   

  1. (中国科学院软件研究所并行软件与计算科学实验室 北京100190)(中国科学院计算机科学国家重点实验室 北京100190)(中国科学院研究生院 北京1001900)(广州大学网络与现代教育技术中心 广州510006)
  • 出版日期:2018-11-16 发布日期:2018-11-16
  • 基金资助:
    本文受国家863项目(2006AA01A125, 2009AA01A129, 2009AA01A134) ,国家自然科学基金 项目(60303032) , 国家自然基金重点项目(60533020)资助。

Research of the SIMD and Vector Math Library

XIE Qing-chun,ZHANG Yun-quan,WANG Ke,LI Yan,XU Ya-wu   

  • Online:2018-11-16 Published:2018-11-16

摘要: 首先,结合Intel, AMD和IBM处理器,介绍了单指令流多数据流(SIMD)向量化技术及其各自的特点。其次,在3种平台上对各自开发的函数库中的部分向量数学函数进行了测试。结果表明,相对传统的标量计算,向量化技术带来的加速比较高,特别是Celll SDK函数,因其独特的体系结构,多个向量处理单元带来的平均加速比为10。最后,通过测试结果的对比,发现不同数学库中的向量函数之间在性能方面也存在着差异,并对差异原因进行了分析,得出性能差异主要是处理器架构和向量计算单元个数和访存等因素造成的。

关键词: 向量化,SSE, MMX, 3DNow!, SIMD

Abstract: Firstly,we introduced the single Instruction Multiple Data(SIMD) vectorization technology and the features separately, based on the processors of Intel AMD and IBM Cell. Secondly, some vectorization functions were tested in these three platforms, which were deve-loped by the three vendors separately. Our test results show that we achieve high performance with the technology of the vectorization, compared to the traditional methods of the scalar calculation.Especially, the speedup of the Cell SDK functions is 10 on average, which were achieved by the help of many processing elements and the special processor structure. Lastly,we also found that there are some differences between the vectorial functions,which are in different vector math libraries. We analyzed that there are some reasons caused the difference betwecn the math functions in different platforms, such as processor structure, the number of the processing elements, memery accessing and so on.

Key words: Vectorization, SSE, MMX, 3DNow! , SIMD

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!