Computer Science ›› 2025, Vol. 52 ›› Issue (6A): 240300156-7.doi: 10.11896/jsjkx.240300156

• Computer Software & Architecture • Previous Articles     Next Articles

Research on Efficient Code Generation Techniques for Array Computation for Vector DSPs

LIAO Zeming, LIU Guikai, HU Yonghua, XIE Anxing   

  1. School of Computer Science and Engineering,Hunan University of Science and Technology,Xiangtan,Hunan 411100,China
  • Online:2025-06-16 Published:2025-06-12
  • About author:LIAO Zeming,born in 1998,postgra-duate.His main research interests include high performance computing and code generation.
    HU Yonghua,born in 1981,Ph.D,professor,Ph.D supervisor,is a member of CCF(No.E200034324M).His main research interests include compilation technology,code optimization for parallel computing,etc.
  • Supported by:
    Natural Science Foundation of Hunan Province,China(2023JJ50019,2024JJ7172).

Abstract: With the continuous development of large-scale integrated circuits technology,vector DSPs incorporating SIMD,VLIW and other instruction parallel processing technologies have gained more and more attention and applications in the field of high-performance computing.Adapting different kinds of algorithm function libraries becomes one of the key challenges for vector DSPs.Only by reducing the input of repetitive work in programming and concentrating more on code optimization based on vector DSP architecture and hardware resources can the application development efficiency be effectively improved.Taking into account the amount of data involved in the computation in vector DSP codes,we proposed an automatic generation method for efficient code generation based on template-based array computation,which implements automated dynamic cache allocation,data rearrangement for discontinuous data accesses,and optimization of scalar instructions,so that the generated code could use the dedicated vector resources of the processor.Experimental results show that using the technique to generate code substantially improves the efficiency of obtaining relevant function code,and that the average performance of the generated vector computation assembly code reaches about 75% of the average performance of handwritten assembly code,and has an average speedup ratio of 8.7 times compared to the performance of scalar assembly code on average.

Key words: High performance computing, Code generation, Automatic vectorization, Vector DSP

CLC Number: 

  • TP314
[1]GUO J,GUO Z D,YANG Z Q,et al.Research on pseudo-color system for long-wave infrared polarization image based on DSP[J].Optics and Photonics Technology,2022,20(02):126-133.
[2]YANG B,HAN J,SUN L Y.CANFD communication realization based on DSP and FPGA[J].Navigation and Control,2021,20(6):53-59.
[3]GU C Y,CHEN Y Q,CHEN H M,et al.A RISC-V digital signal processing processor for narrowband communication and voice processing[J].China Integrated Circuits,2021,30(12):42-47.
[4]ZHU C Q,WU X Y.Analysis of DSP development history and future development trend[J].Industry and Technology Forum,2013,12(11):122-123.
[5]GAO W,ZHAO R C,HAN L,et al.Overview of SIMD automatic vectorized compilation optimization[J].Journal of Software,2015,26(6):1265-1284.
[6]WANG J,SOHL J,KRAIGHER O,et al.ePUMA:Embedded Parallel DSP Processor Architecture with Unique Memory Access[C]//International Conference on Information and Communication Security.IEEE,2011.
[7]BORKAR S,CHIENA A.The future of microprocessors[J].Communications of the ACM,2011,54(5):67-77.
[8]YU A.The future of microprocessors[J].IEEE Micro,1978,16(6):46-53.
[9]LARSENS,AMARASINGHE S.Exploiting superword levelparallelism with multimedia instruction sets[J].ACM SIGPLAN Notices,2000,35(5):145-165.
[10]LEUPERS R.Code selection for media processors with SIMD instructions[C]//Design,Automation and Test in Europe Conference and Exhibition 2000.IEEE,2000.
[11]SRERAMAN N,GOVINDARAJAN R.A Vectorizing Compiler for Multimedia Extensions[J].International Journal of Parallel Programming,2000,28(4):363-400.
[12]SRERAMAN N,GOVINDARAJAN R.A Vectorizing Compiler for Multimedia Extensions[J].International Journal of Parallel Programming,2000,28(4):363-400.
[13]REICHE O,KOBYLKO C,HANNIGF,et al.Auto-vectorization for image processing DSLs[J].ACM SIGPLAN Notices,2017,52(5):21-30.
[14]YAO J Y,ZHAO R C,WANG Q,et al.Loop-nest Auto-vectoriz-ation Method Based on Benefit Analysis[C]//Proceedings of 2018 the 2nd International Conference on Advances in Image Processing(ICAIP 2018).ACM,2018.
[15]WEI S.Research of SIMD vectorization algorithm and regroup technology [D].Zhengzhou:Information Engineering University,2012.
[16]XIAR J.Research and realization of key technology of automatic vectorization based on FT-Matrix2 [D].Changsha:National University of Defense Technology,2017.
[17]LI W.Research on automatic vector optimization method based on improved VEGEN [D].Harbin:Harbin Engineering University,2024.
[18]LI J N,HAN L,CHAI E D.Automatic vectorized porting and optimization of LLVM for domestic platforms[J].Computer Engineering,2022,48(1):142-148.
[19]LI P Y,ZHAO R C,GAO W.et al.A vectorized code generation method supporting cross-amplitude access[J].Computer Science,2015,42(5):194-199,203.
[20]CHEN L,LENG L,YANG Z,et al.Enhanced Multitask Learning for Hash Code Generation of Palmprint Biometrics[J].International Journal of Neural Systems,2024,34(4):2450020.
[21]YEO S,MA Y,KIM C S,et al.Framework for evaluating code generation ability of large language models[J].ETRI Journal,2024,46(1):106-117.
[22]GUANG Y,YU Z,XIANG C,et al.A syntax-guided multi-task learning approach for Turducken-style code generation[J].ar-Xiv:2303.05061,2023:
[23]YING J,WENJUN Y,YANG Y.The Metric for AutomaticCode Generation Based on Dynamic Abstract Syntax Tree[J].International Journal of Digital Crime and Forensics(IJDCF),2023,15(1):1-20.
[24]YUAN R T,XIAO C,LIU M J,et al.CAN message unpack and pack based on simulink automatic code generation technology[C]//2021 International Conference on Control Theory and Application.2021.
[25]HU K,DUAN Z,WANG J,et al.Template-based AADL automatic code generation[J].Frontiers of Computer Science,2019,13(4):698-714.
[26]LU M L,HUANG Z M.Design and realization of code automatic generation system based on Spring Boot[J].Popular Science and Technology,2023,25(4):11-16.
[1] ZUO Xianyu, ZHOU Xiaohu, ZHOU Liming, XIE Yi, LIU Cheng. Efficient Remote Sensing Common Product Production Algorithm Based on Product Reuse Model [J]. Computer Science, 2025, 52(6): 316-323.
[2] TAN Zhengyuan, ZHONG Jiaqing, CHEN Juan. AI+HPC:An Overview of Supercomputing System Software and Application Technology Development Driven by “AI+” [J]. Computer Science, 2025, 52(5): 1-10.
[3] LIAO Qiucheng, ZHOU Yang, LIN Xinhua. Metrics and Tools for Evaluating the Deviation in Parallel Timing [J]. Computer Science, 2025, 52(5): 41-49.
[4] LIU Lili, SHAN Zheng, LI Yingying, WU Wenhao, LIU Wenbo. Research on Function Vectorization Technology Based on Directive Statements [J]. Computer Science, 2025, 52(5): 76-82.
[5] YAN Xiaoting, WANG Xiaoning, DONG Sheng, ZHAO Yining, XIAO Haili. Review on the Development and Application of Checkpointing Technology in High-performanceComputing [J]. Computer Science, 2024, 51(9): 1-14.
[6] CHEN Yiyang, WANG Xiaoning, YAN Xiaoting, LI Guanlong ZHAO Yining, LU Shasha, XIAO Haili. Study on High Performance Computing Container Checkpoint Technology Based on CRIU [J]. Computer Science, 2024, 51(9): 40-50.
[7] LING Shixiang, YANG Zhibin, ZHOU Yong. Integrated Avionics Software Code Automatic Generation Method for ARINC653 Operating System [J]. Computer Science, 2024, 51(7): 10-21.
[8] XU Yiran, ZHOU Yu. Prompt Learning Based Parameter-efficient Code Generation [J]. Computer Science, 2024, 51(6): 61-67.
[9] HE Haotian, ZHOU Bei, GUO Shaozhong, ZHANG Zuoyan, HAO Jiangwei, XU Jinchen. Optimisation of Automatic Matrix Multiplication Mixing Accuracy Based on Polyhedral Models [J]. Computer Science, 2024, 51(12): 110-119.
[10] HE Haotian, ZHOU Bei, GUO Shaozhong, ZHANG Zuoyan, HAO Jiangwei, JI Liguang, XU Jinchen. Automatic Mixing Precision Optimization for Matrix Multiplication Calculation [J]. Computer Science, 2024, 51(11A): 240300057-10.
[11] PEI Xue, WEI Shuai, SHAO Yangxue, YU Hong, GE Chenyang. Compilation Optimization and Implementation of High-order Cryptographic Operators on FPGA [J]. Computer Science, 2024, 51(11A): 231200184-11.
[12] CHEN Yiyang, WANG Xiaoning, LU Shasha, XIAO Haili. Survey of Container Technology for High-performance Computing System [J]. Computer Science, 2023, 50(2): 353-363.
[13] ZHU Jian, HU Kai, WANG Jun, LI Jie, YE Yafei, SHI Xiyan. Reliable Smart Contract Automatic Generation Based on Event-B [J]. Computer Science, 2023, 50(10): 343-349.
[14] LU Hao-song, HU Yong-hua, WANG Shu-ying, ZHOU Xin-lian, LI Hui-xiang. Study on Hybrid Resource Heuristic Loop Unrolling Factor Selection Method Based on Vector DSP [J]. Computer Science, 2022, 49(6A): 777-783.
[15] GAO Xiu-wu, HUANG Liang-ming, JIANG Jun. Optimization Method of Streaming Storage Based on GCC Compiler [J]. Computer Science, 2022, 49(11): 76-82.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!