计算机科学 ›› 2020, Vol. 47 ›› Issue (6A): 161-165.doi: 10.11896/JsJkx.191200127

• 计算机图形学 & 多媒体 • 上一篇    下一篇

全局双边网络的语义分割算法

任天赐1, 黄向生2, 丁伟利1, 安重阳1, 翟鹏博3   

  1. 1 燕山大学电气工程学院 河北 秦皇岛 066000;
    2 中国科学院自动化研究所 北京 100190;
    3 中国科学院微电子所 北京 100029
  • 发布日期:2020-07-07
  • 通讯作者: 丁伟利(weiye51@ysu.edu.cn)
  • 作者简介:tianci.ren@stumail.ysu.edu.cn
  • 基金资助:
    国家重点研发计划(2018YFB1308302,2018YFB1308300);复杂物体表面纹理获取和三维重建的关键技术研究项目(61573356)

Global Bilateral Segmentation Network for Segmantic Segmentation

REN Tian-ci1, HUANG Xiang-sheng2, DING Wei-li1, AN Chong-yang1 and ZHAI Peng-bo3   

  1. 1 Institute of Electrical Engineering,Yanshan University,Qinhuangdao,Hebei 066000,China
    2 Institute of Automation,Chinese Academy of Sciences,BeiJing 100190,China
    3 Institute of Microelectronics,Chinese Academy of Sciences,BeiJing 100029,China
  • Published:2020-07-07
  • About author:REN Tian-ci, born in 1995, postgradua-te.His main research interests include computer vision and pattern recognition.
    DING Wei-li, born in 1979, Ph.D, professor, Ph.D supervisor, is a member of China Computer Federation.Her main research interests include computer vision, pattern recognition and human-computer interaction.
  • Supported by:
    This work was supported by the National Key R&D Program of China (2018YFB1308302,2018YFB1308300) and Research on Key Technologies of texture acquisition and 3D reconstruction of complex obJect surface (61573356).

摘要: 语义分割任务是对图像中的物体按照类别进行像素级别的预测,其难点在于在保留足够空间信息的同时获取足够的上下文信息。为解决这一问题,文中提出了全局双边网络语义分割算法。该算法将大尺度卷积核融入BiSeNet网络中,在BiSeNet网络原有的空间路径和上下文路径两条分支的基础上增加全局路径分支,使网络能够捕获更多的上下文信息,同时提出将BiSeNet网络中的注意力优化模块和特征融合模块中的全局池化模块替换为全局卷积模块,进一步提高了网络获取上下文信息的能力,从而使预测结果更加准确。实验结果表明,该算法在Cityscapes数据集上将交并比(MIoU)指标提高了0.84%,获得了优于BiSeNet网络的表现。

关键词: 全局卷积网络, 双边分割网络, 语义分割

Abstract: The task of semantic segmentation is to predict the obJects according to the category at the pixel level.The difficulty lies in retaining enough spatial information and obtaining enough context information.In order to solve this problem,this paper proposes a global bilateral network semantic segmentation algorithm.In this algorithm,the large-scale convolution kernel is integrated into the BiSeNet Network,and the global path branches are added to the original spatial path and context path of the BiSeNet Network,so that the network can capture more context information.At the same time,the global pooling module in the attention optimization module and feature fusion module is replaced by the global convolution module to further improve the network acquisition.The experimental results show that the algorithm improves the MIoU index by 0.84% on Cityscaps dataset,and achieves better performance than BiSeNet Network.

Key words: Bilateral segmentation network, Global convolutional network, Semantic segmentation

中图分类号: 

  • TP389.1
[1] LONG J,SHELHAMER E,DARRELL T.Fully Convolutional Networks for Semantic Segmentation.IEEE Transactions on Pattern Analysis & Machine Intelligence,2014,39(4):640-651.
[2] BADRINARAYANAN V,HANDA A,CIPOLLA R.SegNet:A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(12):2481-2495.
[3] NOH H,HONG S,HAN B.Learning deconvolution network for semantic segmentation//Proceedings of the IEEE International Conference on Computer Vision.2015:1520-1528.
[4] RONNEBERGER O,FISCHER P,BROX T.U-net:Convolutional networks for biomedical image segmentation//International Conference on Medical image computing and computer-assisted intervention.Springer,Cham,2015:234-241.
[5] Lin G,Milan A,Shen C,et al.RefineNet:Multi-path refinement networks with identity mappings for high-resolution semantic segmentation.arXiv:1611.06612.
[6] CHEN L C,PAPANDREOU G,KOKKINOS I,et al.DeepLab:Semantic Image Segmentation with Deep Convolutional Nets,Atrous Convolution,and Fully Connected CRFs.IEEE Transactions on Pattern Analysis & Machine Intelligence,2016,40(4):834-848.
[7] CHEN L C,PAPANDREOU G,SCHROFF F,et al.Rethinking atrous convolution for semantic image segmentation.arXiv:1706.05587,2017.
[8] CHEN L C,ZHU Y,PAPANDREOU G,et al.Encoder-decoder with atrous separable convolution for semantic image segmentation//Proceedings of the European Conference on Computer vision (ECCV).2018:801-818.
[9] ZHAO H S,SHI J P,QI X J,et al.Pyramid Scene Parsing Network.arXiv:1612.01105v2,2017.
[10] ZHANG R,TANG S,ZHANG Y,et al.Scale-Adaptive Convolutions for Scene Parsing//2017 IEEE International Conference on Computer Vision (ICCV).IEEE,2017.
[11] YU C,WANG J,PENG C,et al.BiSeNet:Bilateral Segmentation Network for Real-Time Semantic Segmentation//Computer Vision-ECCV 2018.Lecture Notes in Computer Science, vol 11217.
[12] PENG C,ZHANG X,YU G,et al.Large Kernel Matters-Improve Semantic Segmentation by Global Convolutional Network//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE,2017.
[13] SIMONYAN K, ZISSERMAN A.Very Deep ConvolutionalNetworks for Large-Scale Image Recognition.Computer Science,2014.
[14] SZEGEDY C,LIU W,JIA Y,et al.Going Deeper with Convolutions//2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Boston,MA,2015:1-9.
[15] CHEN L C,PAPANDREOU G,KOKKINOS I,et al.Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs.Computer Science,2014(4):357-361.
[16] HU J,SHEN L,ALBANIE S,et al.Squeeze-and-Excitation Net-works.arXiv:1709.01507v4,2019.
[17] CORDTS M,OMRAN M,RAMOS S,et al.The Cityscapes Dataset for Semantic Urban Scene Understanding.arXiv:1604.01685,2016.
[18] BROSTOW G J,SHOTTON J,FAUQUEUR J,et al.Segmentation and Recognition Using Structure from Motion Point Clouds//European Conference on Computer Vision.Springer,Berlin,Heidelberg,2008.
[19] HE K,ZHANG X,REN S,et al.Deep Residual Learning for Image Recognition.arXiv:1512.03385,2015.
[1] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[2] 胡伏原, 万新军, 沈鸣飞, 徐江浪, 姚睿, 陶重犇.
深度卷积神经网络图像实例分割方法研究进展
Survey Progress on Image Instance Segmentation Methods of Deep Convolutional Neural Network
计算机科学, 2022, 49(5): 10-24. https://doi.org/10.11896/jsjkx.210200038
[3] 王施云, 杨帆.
基于U-Net特征融合优化策略的遥感影像语义分割方法
Remote Sensing Image Semantic Segmentation Method Based on U-Net Feature Fusion Optimization Strategy
计算机科学, 2021, 48(8): 162-168. https://doi.org/10.11896/jsjkx.200700182
[4] 詹瑞, 雷印杰, 陈训敏, 叶书函.
基于多重差异特征网络的街景变化检测
Street Scene Change Detection Based on Multiple Difference Features Network
计算机科学, 2021, 48(2): 142-147. https://doi.org/10.11896/jsjkx.200500158
[5] 王鑫, 张昊宇, 凌诚.
基于U-Net优化的SAR遥感图像语义分割
Semantic Segmentation of SAR Remote Sensing Image Based on U-Net Optimization
计算机科学, 2021, 48(11A): 376-381. https://doi.org/10.11896/jsjkx.210300260
[6] 朱戎, 叶宽, 杨博, 谢欢, 赵蕾.
基于改进DeeplabV3+的地物分类方法研究
Feature Classification Method Based on Improved DeeplabV3+
计算机科学, 2021, 48(11A): 382-385. https://doi.org/10.11896/jsjkx.201100184
[7] 刘彬, 刘宏哲.
基于改进Enet网络的车道线检测算法
Lane Detection Algorithm Based on Improved Enet Network
计算机科学, 2020, 47(4): 142-149. https://doi.org/10.11896/jsjkx.190500021
[8] 周鹏程,龚声蓉,钟珊,包宗铭,戴兴华.
基于深度特征融合的图像语义分割
Image Semantic Segmentation Based on Deep Feature Fusion
计算机科学, 2020, 47(2): 126-134. https://doi.org/10.11896/jsjkx.190100119
[9] 王赛男, 郑雄风.
基于边缘计算的图像语义分割应用与研究
Application and Research of Image Semantic Segmentation Based on Edge Computing
计算机科学, 2020, 47(11A): 276-280. https://doi.org/10.11896/jsjkx.200900046
[10] 杨培健, 吴晓富, 张索非, 周全.
基于空洞卷积鉴别器的语义分割迁移算法
Semantic Segmentation Transfer Algorithm Based on Atrous Convolution Discriminator
计算机科学, 2020, 47(11): 174-178. https://doi.org/10.11896/jsjkx.191100014
[11] 缪永伟, 李高怡, 鲍陈, 张旭东, 彭思龙.
基于卷积神经网络的图像局部风格迁移
Image Localized Style Transfer Based on Convolutional Neural Network
计算机科学, 2019, 46(9): 259-264. https://doi.org/10.11896/j.issn.1002-137X.2019.09.039
[12] 王嫣然, 陈清亮, 吴俊君.
面向复杂环境的图像语义分割方法综述
Research on Image Semantic Segmentation for Complex Environments
计算机科学, 2019, 46(9): 36-46. https://doi.org/10.11896/j.issn.1002-137X.2019.09.005
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!