Computer Science ›› 2011, Vol. 38 ›› Issue (5): 290-294.
CHEN Xiang, SHEN Li, LI Jia-wen
Abstract: A growing number of general-purpose microprocessors provide enhanced SIMD instruction-set extensions to exploit data-level parallelism. However, inherent characteristics of applications and algorithms, such as misaligned memory addresses, non-contiguous memory accesses, and control flow, force compilers or programmers to use permutation instructions to reorganize vector elements so that SIMD instructions receive the correct operands. These redundant permutation instructions have become a performance bottleneck in exploiting data-level parallelism. This paper proposes an algorithm for automatic data permutation generation and optimization that effectively reduces the performance loss caused by permutation instructions. The algorithm is based on a new intermediate representation that carries sufficient address information about each operand. With this representation, data permutation generation and optimization can be solved by identifying and eliminating all conflict edges in the data flow graph at minimal cost. Tests on a group of typical multimedia programs show that the algorithm achieves an average speedup of up to 7%.
Key words: Data permutation, Intermediate representation, Conflict edge
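To illustrate why misaligned memory addresses call for permutation instructions, the following minimal sketch emulates a misaligned vector load with two aligned loads and one permutation. It is not the algorithm from the paper; the vector width of 4 and the helper names `aligned_load`, `permute`, and `misaligned_load` are assumptions made purely for illustration.

```python
VECTOR_WIDTH = 4  # assumed number of elements per SIMD register


def aligned_load(memory, index):
    """Load one full vector; the start index must sit on a vector boundary."""
    assert index % VECTOR_WIDTH == 0, "aligned loads must start on a vector boundary"
    return memory[index:index + VECTOR_WIDTH]


def permute(low, high, pattern):
    """Select elements from the concatenation of two vectors (a two-source shuffle)."""
    source = low + high
    return [source[i] for i in pattern]


def misaligned_load(memory, index):
    """Emulate a load at an arbitrary index using only aligned loads.

    Assumes memory is padded so that the second aligned load stays in bounds.
    """
    offset = index % VECTOR_WIDTH
    base = index - offset
    low = aligned_load(memory, base)
    if offset == 0:
        return low  # already aligned, no permutation needed
    high = aligned_load(memory, base + VECTOR_WIDTH)
    # The permutation pattern shifts the window by `offset` elements.
    pattern = [offset + i for i in range(VECTOR_WIDTH)]
    return permute(low, high, pattern)


memory = list(range(100, 116))
result = misaligned_load(memory, 5)  # elements at indices 5..8
```

In this sketch every misaligned access costs one extra load and one permutation, which is the kind of overhead the abstract describes as a bottleneck.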
CHEN Xiang, SHEN Li, LI Jia-wen. Automatic Data Permutation Generation and Optimization for SIMD Devices[J]. Computer Science, 2011, 38(5): 290-294.
URL: https://www.jsjkx.com/EN/
https://www.jsjkx.com/EN/Y2011/V38/I5/290