Computer Science ›› 2020, Vol. 47 ›› Issue (8): 105-111.doi: 10.11896/jsjkx.190700036

Previous Articles     Next Articles

Real-time SIFT Algorithm Based on GPU

WANG Liang, ZHOU Xin-zhi, YNA Hua   

  1. School of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China
  • Online:2020-08-15 Published:2020-08-10
  • About author:WANG Liang, born in 1993, postgra-duate.His main research interests include computer vision and parallel computing.
    YAN Hua, born in 1971, Ph.D, professor.His main research interests include intelligent algorithm, storage system and path planning.
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61403265).

Abstract: Aiming at the complex and low real-time defects of the SIFT feature extraction algorithm, a real-time SIFT algorithm based on GPU is proposed, called CoSift (CUDA Optimized SIFT).Firstly, the algorithm uses the CUDA stream concurrency mechanism to construct the SIFT scale space.In this process, the high-speed memory in the CUDA memory model is fully utilized to improve data access speed, and the two-dimensional Gaussian convolution kernel is optimized to reduce the amount of computation.Then, the warp-based histogram policy is designed to rebalance the workload during the characterization process.Compared with the traditional algorithm of the CPU version and the improved algorithm of the GPU version, the proposed algorithm greatly improves the real-time performance of the SIFT algorithm without reducing the accuracy of feature extraction, and has a relatively higher optimization effect on large-size images.CoSift can extract features within 7.7~8.8ms (116.28~129.87fps) on the GTX1080Ti.The algorithm effectively reduces the complexity of the traditional SIFT algorithm process, improves the real-time performance, and is convenient to be applied in scenarios where the real-time requirement of SIFT algorithm is higher.

Key words: Scale invariant feature transformation, Feature extraction, Real-time, CUDA, Parallel acceleration

CLC Number: 

  • TP391.4
[1] LOWE D G.Distinctive Image Features from Scale-InvariantKeypoints[J].International Journal of Computer Vision, 2004, 60(2):91-110.
[2] KE N Y, SUKTHANKAR R.PCA-SIFT:A More Distinctive Representation for Local Image Descriptors[C]∥Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.IEEE Computer Society, 2004.
[3] BAY H, TUYTELAARS T, GOOL L V.SURF:Speeded UpRobust Features[C]∥European Conference on Computer Vision.2006.
[4] DU C, YUAN J, DONG J, et al.GPU based Parallel Optimization for Real Time Panoramic Video Stitching[J].Pattern Recognition Letters, 2018, 133(5):62-69.
[5] ACHARYA K A, BABU R V, VADHIYAR S S.A Real-Time Implementation of SIFT Using GPU[J].Journal of Real-Time Image Processing, 2014, 14(8):267-277.
[6] ZHOU Y, MEI K, XIANG J, et al.Parallelization and Optimization of SIFT on GPU Using CUDA[C]∥IEEE International Conference on High Performance Computing & Communications & IEEE International Conference on Embedded & Ubiquitous Computing.2014.
[7] LI Z, JIA H, ZHANG Y.HartSift:A High-Accuracy and Real-Time SIFT Based on GPU[C]∥2017 IEEE 23rd International Conference on Parallel and Distributed Systems (ICPADS).IEEE Computer Society, 2017.
[8] NVIDIA Corporation.CUDA Programming Guide 9.0[OL].
[9] NVIDIA Corporation.CUDA Toolkit Documentation v9.0[OL].
[10] TIAN W, XU F, WANG H Y, et al. Fast Scale Invariant Feature Transform Algorithm Based on CUDA.Computer Engineering, 2010, 36(8):219-221.
[11] YAN J H, HANG Y Q, XU J F, et al.Quick Realization of CUDA-Based Registration of High-Resolution Digital Video Images[J].Chinese Journal of Scientific Instrument, 2014, 35(2):380-386.
[12] RAGHU R P K, SURESH M, JOHN M.An Approach to Parallelization of SIFT Algorithm on GPUs for Real-Time Applications[J].Journal of Computer and Communications, 2016, 4(17):18-50.
[13] JIANG C, GENG Z X, LOU B, et al.Parallel Processing Re-search on SIFT Feature Matching Algorithm Based on GPU[J].Computer Science, 2013, 40(12):295-297, 307.
[14] WU C.SiftGPU :A GPU Implementation of Scale InvariantFeature Transform (SIFT)[OL].
[15] BJRKMAN M, BERGSTRM N, KRAGIC D.Detecting, Segmenting and Tracking Unknown Objects Using Multi-label MRF Inference[J].Computer Vision and Image Understanding, 2014, 118:111-127.
[16] ZHANG K, YANG H Y, SHI L Y.Panorama Generation ofSIFT and Stitch Line Based on CUDA[J].Computer Technology and Development, 2015(9):22-26.
[17] ZHI X, YAN J, HANG Y, et al.Realization of CUDA-BasedReal-Time Registration and Target Localization for High-Resolution Video Images[J].Journal of Real-Time Image Proces-sing, 2016, 16:1025-1036.
[18] The Oxford Buildings Dataset[OL].
[1] LIU Yang, JIN Zhong. Fine-grained Image Recognition Method Combining with Non-local and Multi-region Attention Mechanism [J]. Computer Science, 2021, 48(1): 197-203.
[2] MA Meng-yu, WU Ye, CHEN Luo, WU Jiang-jiang, LI Jun, JING Ning. Display-oriented Data Visualization Technique for Large-scale Geographic Vector Data [J]. Computer Science, 2020, 47(9): 117-122.
[3] LUO Xiang-yu, XU Hang-na, ZENG Hao-cheng, CHEN Zu-xi, YANG Fan. Symbolic Model Checking for Discrete Real-time Linear Dynamic Logic [J]. Computer Science, 2020, 47(9): 204-212.
[4] BAO Yu-xuan, LU Tian-liang, DU Yan-hui. Overview of Deepfake Video Detection Technology [J]. Computer Science, 2020, 47(9): 283-292.
[5] LIANG Zheng-you, HE Jing-lin, SUN Yu. Three-dimensional Convolutional Neural Network Evolution Method for Facial Micro-expression Auto-recognition [J]. Computer Science, 2020, 47(8): 227-232.
[6] YANG Wei-chao, GUO Yuan-bo, LI Tao, ZHU Ben-quan. Method Based on Traffic Fingerprint for IoT Device Identification and IoT Security Model [J]. Computer Science, 2020, 47(7): 299-306.
[7] LAN Zhang-li, SHEN De-xing, CAO Juan and ZHANG Yu-xin. Content-independent Method for Basis Image Extraction and Image Reconstruction [J]. Computer Science, 2020, 47(6A): 226-229.
[8] ZHOU Li-peng, MENG Li-min, ZHOU Lei, JIANG Wei and DONG Jian-ping. Fall Detection Algorithm Based on BP Neural Network [J]. Computer Science, 2020, 47(6A): 242-246.
[9] YUAN De-yu, ZHANG Yi-fan, GAO Jian and SUN Hai-chun. Abnormal User Detection Method in Sina Weibo Based on User Feature Extraction [J]. Computer Science, 2020, 47(6A): 364-368.
[10] ZUO Xian-yu, ZHANG Zhe, SU Yue-han, LIU Yang, GE Qiang, TIAN Jun-feng. Extraction Algorithm of NDVI Based on GPU Multi-stream Parallel Model [J]. Computer Science, 2020, 47(4): 25-29.
[11] DENG Yi-jiao, ZHANG Feng-li, CHEN Xue-qin, AI Qing, YU Su-zhe. Collaborative Attention Network Model for Cross-modal Retrieval [J]. Computer Science, 2020, 47(4): 54-59.
[12] ZHAO Nan, PI Wen-chao, XU Chang-qiao. Video Recommendation Algorithm for Multidimensional Feature Analysis and Filtering [J]. Computer Science, 2020, 47(4): 103-107.
[13] WANG Kun-lun, LIU Wen-can, HE Xiao-hai, QING Lin-bo, WU Xiao-hong. Motion Feature Descriptor for Abnormal Behavior Detection [J]. Computer Science, 2020, 47(4): 119-124.
[14] CHEN Li-fu,LIU Yan-zhi,ZHANG Peng,YUAN Zhi-hui,XING Xue-min. Road Extraction Algorithm of Multi-feature High-resolution SAR Image Based on Multi-Path RefineNet [J]. Computer Science, 2020, 47(3): 156-161.
[15] HE Chao-lei,BI Xiu-li,XIAO Bin. Zernike Moment Based Approach for Local Feature Detection [J]. Computer Science, 2020, 47(2): 135-142.
Full text



[1] LEI Li-hui and WANG Jing. Parallelization of LTL Model Checking Based on Possibility Measure[J]. Computer Science, 2018, 45(4): 71 -75 .
[2] SUN Qi, JIN Yan, HE Kun and XU Ling-xuan. Hybrid Evolutionary Algorithm for Solving Mixed Capacitated General Routing Problem[J]. Computer Science, 2018, 45(4): 76 -82 .
[3] ZHANG Jia-nan and XIAO Ming-yu. Approximation Algorithm for Weighted Mixed Domination Problem[J]. Computer Science, 2018, 45(4): 83 -88 .
[4] WU Jian-hui, HUANG Zhong-xiang, LI Wu, WU Jian-hui, PENG Xin and ZHANG Sheng. Robustness Optimization of Sequence Decision in Urban Road Construction[J]. Computer Science, 2018, 45(4): 89 -93 .
[5] SHI Wen-jun, WU Ji-gang and LUO Yu-chun. Fast and Efficient Scheduling Algorithms for Mobile Cloud Offloading[J]. Computer Science, 2018, 45(4): 94 -99 .
[6] ZHOU Yan-ping and YE Qiao-lin. L1-norm Distance Based Least Squares Twin Support Vector Machine[J]. Computer Science, 2018, 45(4): 100 -105 .
[7] CUI Qiong, LI Jian-hua, WANG Hong and NAN Ming-li. Resilience Analysis Model of Networked Command Information System Based on Node Repairability[J]. Computer Science, 2018, 45(4): 117 -121 .
[8] YANG Yu-qi, ZHANG Guo-an and JIN Xi-long. Dual-cluster-head Routing Protocol Based on Vehicle Density in VANETs[J]. Computer Science, 2018, 45(4): 126 -130 .
[9] HAN Kui-kui, XIE Zai-peng and LV Xin. Fog Computing Task Scheduling Strategy Based on Improved Genetic Algorithm[J]. Computer Science, 2018, 45(4): 137 -142 .
[10] PANG Bo, JIN Qian-kun, HENIGULI·Wu Mai Er and QI Xing-bin. Routing Scheme Based on Network Slicing and ILP Model in SDN[J]. Computer Science, 2018, 45(4): 143 -147 .