Computer Science ›› 2024, Vol. 51 ›› Issue (4): 229-235.doi: 10.11896/jsjkx.230100137

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Algorithm of Stereo Matching Based on GAANET

SONG Hao, MAO Kuanmin, ZHU Zhou   

  1. School of Mechanical Science & Engineering,Huazhong University of Science and Technology,Wuhan 430074,China
  • Received:2023-01-31 Revised:2023-04-20 Online:2024-04-15 Published:2024-04-10
  • Supported by:
    Key R & D Program of Ningxia Hui Autonomous Region,China(2021BFH02001).

Abstract: End-to-end stereo matching algorithms have become increasingly popular in stereo matching tasks due to their advantages in computational time and matching accuracy.However,feature extraction in such algorithms can result in redundant features,information loss,and insufficient multi-scale feature fusion,thereby increasing computational complexity and decreasing matching accuracy.To address these challenges,an improved ghost adaptive aggregation network(GAANET) is proposed based on the adaptive aggregation network(AANET),and its feature extraction module is improved to make it more suitable for stereo matching tasks.Multi-scale features are extracted in the G-Ghost phase,and partial features are generated through low-cost ope-rations to reduce feature redundancy and preserve shallow features.An efficient channel attention mechanism is implemented to allocate weights to each channel,and an improved feature pyramid structure is introduced to mitigate channel information loss in traditional pyramids and optimize feature fusion,thus enhancing information supplement for features across scales.The proposed GAANET model is trained and evaluated on the SceneFlow,KITTI2015,and KITTI2012 datasets.Experimental resultsdemons-trate that GAANET outperforms the baseline method,with accuracy improvements of 0.92%,0.25%,and 0.20%,respectively,while reducing parameter volume by 13.75% and computational complexity by 4.8%.

Key words: Stereo matching, Feature extraction, End-to-end stereo matching network, Attention module, Deep learning

CLC Number: 

  • TP391
[1]LIN X,WANG J,LIN C.Research on 3D Reconstruction in Bi-nocular Stereo Vision Based on Feature Point Matching Method[C]//2020 IEEE 3rd International Conference on Information Systems and Computer Aided Education(ICISCAE).Dalian:IEEE,2020:551-556.
[2]CHEN Y,ZHAO L W,ZHAN H C,et al.Study on Reconstruction of Indoor 3D Scene Based on Binocular Vision[J].Computer Science,2020,47(11A):175-177.
[3]CHANG Z T,SHI Y Q,WANG J,et al.Vehicle Speed Measure-ment Method Based on Binocular Vision[J].Computer Science,2021,48(9):135-139.
[4]DE SANTANA J R,CAMBUIM L F S,BARROS E.Bi-Window Based Stereo Matching Using Combined Siamese Convolutional Neural Network[C]//13th International Conference on Digital Image Processing(ICDIP).Electr Network:SPIE,2021:118781Z.
[5]SEKI A,POLLEFEYS M.SGM-Nets:Semi-global matchingwith neural networks[C]//30th IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Honolulu:IEEE,2017:6640-6649.
[6]JIE Z Q,WANG P F,LING Y G,et al.Left-Right Comparative Recurrent Model for Stereo Matching[C]//31st IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Salt Lake City:IEEE,2018:3838-3846.
[7]WANG Y,LAI Z H,HUANG G,et al.Anytime Stereo Image Depth Estimation on Mobile Devices[C]//IEEE International Conference on Robotics and Automation(ICRA).Montreal:IEEE,2019:5893-5900.
[8]KHAMIS S,FANELLO S,RHEMANN C,et al.Stereonet:Guided hierarchical refinement for real-time edge-aware depth prediction[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:573-590.
[9]LIU S G,ZHANG T,YANG J G,et al.Progressive dialtion residual network for deep binocular stereo matching[J].Journal of Xidian University,2022,49(5):175-180.
[10]LI T,MA W,XU S B,et al.Task-Adaptive End-to-End Net-works for Stereo Matching[J].Journal of Computer Research and Development,2020,57(7):1531-1538.
[11]HAN K,WANG Y,XU C,et al.GhostNets on Heterogeneous Devices via Cheap Operations[J].International Journal of Computer Vision,2022,130(4):1050-1069.
[12]WANG Q,WU B,ZHU P,et al.ECA-Net:Efficient Channel Attention for Deep Convolutional Neural Networks[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Seattle:IEEE,2020:11534-11542.
[13]LUO Y,CAO X,ZHANG J,et al.CE-FPN:enhancing channel information for object detection[J].Multimedia Tools and Applications,2022,81(21):30685-30704.
[14]XU H F,ZHANG J Y.AANet:Adaptive Aggregation Network for Efficient Stereo Matching[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Electr Network:IEEE,2020:1956-1965.
[15]CHABRA R,STRAUB J,SWEENEY C,et al.StereoDRNet:Dilated Residual StereoNet[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Long Beach:IEEE,2019:11778-11787.
[16]DAI J,QI H,XIONG Y,et al.Deformable Convolutional Networks[C]//2017 IEEE International Conference on Computer Vision(ICCV).Venice:IEEE,2017:764-773.
[17]CHENG X,ZHONG Y,HARANDI M,et al.Hierarchical neural architecture search for deep stereo matching[J].Advances in Neural Information Processing Systems,2020,33:22158-22169.
[18]SHEN Z,DAI Y C,SONG X B,et al.PCW-Net:Pyramid Combination and Warping Cost Volume for Stereo Matching[C]//17th European Conference on Computer Vision(ECCV).Tel Aviv:Springer,2022:280-297.
[19]SHEN Z L,DAI Y C,RAO Z B,et al.CFNet:Cascade and Fused Cost Volume for Robust Stereo Matching[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Electr Network:IEEE,2021:13901-13910.
[20]WANG Q,SHI S H,ZHENG S Z,et al.FADNet:A Fast and Accurate Network for Disparity Estimation[C]//IEEE International Conference on Robotics and Automation(ICRA).Electr Network:IEEE,2020:101-107.
[1] CHEN Runhuan, DAI Hua, ZHENG Guineng, LI Hui , YANG Geng. Urban Electricity Load Forecasting Method Based on Discrepancy Compensation and Short-termSampling Contrastive Loss [J]. Computer Science, 2024, 51(4): 158-164.
[2] LIN Binwei, YU Zhiyong, HUANG Fangwan, GUO Xianwei. Data Completion and Prediction of Street Parking Spaces Based on Transformer [J]. Computer Science, 2024, 51(4): 165-173.
[3] XUE Jinqiang, WU Qin. Progressive Multi-stage Image Denoising Algorithm Combining Convolutional Neural Network and
Multi-layer Perceptron
[J]. Computer Science, 2024, 51(4): 243-253.
[4] CHEN Jinyin, LI Xiao, JIN Haibo, CHEN Ruoxi, ZHENG Haibin, LI Hu. CheatKD:Knowledge Distillation Backdoor Attack Method Based on Poisoned Neuronal Assimilation [J]. Computer Science, 2024, 51(3): 351-359.
[5] HUANG Kun, SUN Weiwei. Traffic Speed Forecasting Algorithm Based on Missing Data [J]. Computer Science, 2024, 51(3): 72-80.
[6] XU Bangwu, WU Qin, ZHOU Haojie. Appearance Fusion Based Motion-aware Architecture for Moving Object Segmentation [J]. Computer Science, 2024, 51(3): 155-164.
[7] ZHENG Cheng, SHI Jingwei, WEI Suhua, CHENG Jiaming. Dual Feature Adaptive Fusion Network Based on Dependency Type Pruning for Aspect-basedSentiment Analysis [J]. Computer Science, 2024, 51(3): 205-213.
[8] WANG Wenmiao. Prediction of Lower Limb Joint Angle Based on VMD-ELMAN Electromyographic Signals [J]. Computer Science, 2024, 51(3): 257-264.
[9] HUANG Wenke, TENG Fei, WANG Zidan, FENG Li. Image Segmentation Based on Deep Learning:A Survey [J]. Computer Science, 2024, 51(2): 107-116.
[10] CAI Jiacheng, DONG Fangmin, SUN Shuifa, TANG Yongheng. Unsupervised Learning of Monocular Depth Estimation:A Survey [J]. Computer Science, 2024, 51(2): 117-134.
[11] ZHAO Jiangfeng, HE Hongjie, CHEN Fan, YANG Shubin. Two-stage Visible Watermark Removal Model Based on Global and Local Features for Document Images [J]. Computer Science, 2024, 51(2): 172-181.
[12] ZHANG Feng, HUANG Shixin, HUA Qiang, DONG Chunru. Novel Image Classification Model Based on Depth-wise Convolution Neural Network andVisual Transformer [J]. Computer Science, 2024, 51(2): 196-204.
[13] WANG Yangmin, HU Chengyu, YAN Xuesong, ZENG Deze. Study on Deep Reinforcement Learning for Energy-aware Virtual Machine Scheduling [J]. Computer Science, 2024, 51(2): 293-299.
[14] HUANG Changxi, ZHAO Chengxin, JIANG Xiaoteng, LING Hefei, LIU Hui. Screen-shooting Resilient DCT Domain Watermarking Method Based on Deep Learning [J]. Computer Science, 2024, 51(2): 343-351.
[15] HOU Jing, DENG Xiaomei, HAN Pengwu. Survey on Domain Limited Relation Extraction [J]. Computer Science, 2024, 51(1): 252-265.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!