计算机科学 ›› 2023, Vol. 50 ›› Issue (2): 231-236.doi: 10.11896/jsjkx.211200290

• 计算机图形学&多媒体 • 上一篇    下一篇

基于注意力机制和轻量级空洞卷积的混凝土路面裂缝检测

瞿中, 王彩云   

  1. 重庆邮电大学计算机科学与技术学院 重庆 400065
  • 收稿日期:2021-12-27 修回日期:2022-07-02 出版日期:2023-02-15 发布日期:2023-02-22
  • 通讯作者: 瞿中(quzhong@cqupt.edu.cn)
  • 基金资助:
    国家自然科学基金(62176034)

Crack Detection of Concrete Pavement Based on Attention Mechanism and Lightweight DilatedConvolution

QU Zhong, WANG Caiyun   

  1. College of Computer Science and Technology,Chongqing University of Posts and Telecommunications,Chongqing 400065,China
  • Received:2021-12-27 Revised:2022-07-02 Online:2023-02-15 Published:2023-02-22
  • Supported by:
    National Natural Science Foundation of China(62176034)

摘要: 混凝土路面上的裂缝会影响结构的安全性、适用性和耐久性,裂缝检测是一个充满挑战的研究热点。文中提出了由改进的全卷积网络和深监督网络组成的裂缝检测模型,以改进的VGG-16作为主干网络,首先将低层卷积特征聚合,通过空间注意力机制再次融合到主干网络;其次,将中高层卷积特征通过轻量级空洞卷积融合模块进行多尺度融合得到具有清晰边缘且分辨率较高的特征图像,所有的侧边特征图像相加产生最终的预测图像;最后,深监督网络为每个阶段的检测结果提供直接监督。该网络选择焦点损失函数作为评价函数,经过训练的网络模型能够在光照不均、背景复杂等各种条件下从输入的原始图像中高效地识别出裂缝位置。为验证所提方法的有效性和鲁棒性,在DeepCrack,CFD,Crack500这3个数据集上与6种方法进行了比较,所提算法表现出卓越的性能,F-score值达到了87.12%。

关键词: 裂缝检测, 注意力机制, 空洞卷积, 多尺度融合, 全卷积, 深度监督网络

Abstract: Cracks in the concrete pavement will affect the safety,applicability,and durability of the structure,and crack detection is a challenging research hotspot.This paper proposes a crack detection model composed of an improved full convolutional network and a deep supervision network,which uses the improved VGG-16 as the backbone network.Firstly,the low-level convolutional feature aggregation is fused to the backbone network again through the spatial attention mechanism.Secondly,the middle and high-level convolutional features are fused through the lightweight dilated convolution fusion module for multi-feature fusion to get the clear edge and high-resolution feature maps,all side feature maps are added to produce the final prediction map.Finally,the deep supervision network provides direct supervision for the detection results of each stage.In this paper,the focus loss function is selected as the evaluation function,and the trained network model can efficiently identify the crack location from the input original image under various conditions such as uneven illumination and complex background.To verify the effectiveness and robustness of the proposed method,it is compared with six methods on three datasets,DeepCrack,CFD,and Crack500,and the results show that it has excellent performance,and the F-score value reaches 87.12%.

Key words: Crack detection, Attention mechanism, Dilated convolution, Multiscale fusion, Fully convolutional network, Deep supervision network

中图分类号: 

  • TP391.41
[1]CHENG H,SHI X,GLAZIER C.Real-Time Image Thresholding Based on Sample Space Reduction and Interpolation Approach [J].Journal of Computing in Civil Engineering,2003,17(4):264-272.
[2]NEJAD F M,ZAKERI H.An Optimum Feature ExtractionMethod based on Wavelet-Radon Transform and Dynamic Neural Network for Pavement Distress Classification [J].Expert Systems with Applications,2011,38(8):9442-9460.
[3]AMHAZ R,CHAMBON S,IDIER J,et al.Automatic Crack Detection on Two-Dimensional Pavement Images:An Algorithm Based on Minimal Path Selection [J].IEEE Transactions on Intelligence Transport System,2016,17(10):2718-2729.
[4]YAMAGUCHI T,HASHIMOTO S.Fast crack detection me-thod for large-size concrete surface images using percolation-based image processing[J].Machine Vision and Applications,2010,21(5):797-809.
[5]FAN Z,LI C,CHEN Y,et al.Automatic Crack Detection on Road Pavements Using Encoder-Decoder Architecture [J].Materials,2020,13(13):2960.
[6]QU Z,CAO C,LIU L,et al.A Deeply Supervised Convolutional Neural Network for Pavement Crack Detection with Multiscale Feature Fusion[J].IEEE Transactions on Neural Networks and Learning Systems,2021,33(9):4890-4899.
[7]XU G Z,LIAO C G,C J J,et al.Extraction of Apparent Crack Information of Concrete Based on HU-ResNet[J].Computer Engineering,2020,46(11):279-285.
[8]HU J,LI S,GANG S,et al.Squeeze-and-Excitation Networks [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,42(8):2011 -2023.
[9]WOO S,PARK J,LEE J Y,et al.CBAM:Convolutional Block Attention Module [C]//European Conference on Computer Vision.Berlin:Springer,2018:3-19.
[10]CHEN Y,DAI X,LIU M,et al.Dynamic Convolution:Attention over convolution kernels [C]//IEEE Computer Vision and Pattern Recognition.2020:11030-11039.
[11]ZHAO T,WU X Q.Pyramid Feature Attention Network for Saliency detection [C]//IEEE Computer Vision and Pattern Re-cognition.2019:3080-3089.
[12]CHEN L C,PAPANDREOU G,SCHROFF F,et al.Rethinking Atrous Convolution for Semantic Image Segmentation [J].ar-Xiv:1706.05587,2017.
[13]CHOLLET F.Xception:Deep Learning with Depthwise Separable Convolutions [C]//IEEE Computer Vision and Pattern Re-cognition.IEEE,2017:1063-6919.
[14]LIU Y H,YAO J,LU X H,et al.DeepCrack:A Deep Hierarchical Feature Learning Architecture for Crack Segmentation [J].Neurocomputing,2019,338:139-153.
[15]SHI Y,CUI L,QI Z,et al.Automatic Road Crack Detectionusing Random Structured Forests [J].IEEE Transactions Intelligence Transport System,2016,17(12):3434-3445.
[16]YANG F,ZHANG L,YU S J,et al.Feature Pyramid and Hie-rarchical Boosting Network for Pavement Crack Detection [J].IEEE Transactions on Intelligent Transportation Systems,2020,21(4):1525-1535.
[17]REN Y P,HUANG J S,HONG Z Y,et al.Image-based Concrete Crack Detection in Tunnels Using Deep Fully Convolu-tional Networks [J].Construction and Building Materials,2020,234(20):117367.
[18]GU Z W,CHENG J,FU H Z,et al.CE-Net:Context Encoder Network for 2D Medical Image Segmentation [J].IEEE Transac-tions on Medical Imaging.2019(10):2281-2292.
[1] 冯程程, 刘派, 姜琳颖, 梅笑寒, 郭贵冰.
文档增强型知识库问答
Document-enhanced Question Answering over Knowledge-Bases
计算机科学, 2023, 50(3): 266-275. https://doi.org/10.11896/jsjkx.220300022
[2] 李帅, 徐彬, 韩祎珂, 廖同鑫.
SS-GCN:情感增强和句法增强的方面级情感分析模型
SS-GCN:Aspect-based Sentiment Analysis Model with Affective Enhancement and Syntactic Enhancement
计算机科学, 2023, 50(3): 3-11. https://doi.org/10.11896/jsjkx.220700238
[3] 陈富强, 寇嘉敏, 苏利敏, 李克.
基于图神经网络的多信息优化实体对齐模型
Multi-information Optimized Entity Alignment Model Based on Graph Neural Network
计算机科学, 2023, 50(3): 34-41. https://doi.org/10.11896/jsjkx.220700242
[4] 周明强, 代开浪, 吴全旺, 朱庆生.
异构信息网络的注意力感知多通道图卷积评分预测模型
Attention-aware Multi-channel Graph Convolutional Rating Prediction Model for Heterogeneous Information Networks
计算机科学, 2023, 50(3): 129-138. https://doi.org/10.11896/jsjkx.220300004
[5] 邹芸竹, 杜圣东, 滕飞, 李天瑞.
一种基于多模态深度特征融合的视觉问答模型
Visual Question Answering Model Based on Multi-modal Deep Feature Fusion
计算机科学, 2023, 50(2): 123-129. https://doi.org/10.11896/jsjkx.211200303
[6] 王鹏宇, 台文鑫, 刘芳, 钟婷, 罗绪成, 周帆.
基于数据增强的自监督飞行航迹预测
Self-supervised Flight Trajectory Prediction Based on Data Augmentation
计算机科学, 2023, 50(2): 130-137. https://doi.org/10.11896/jsjkx.211200016
[7] 刘露平, 周欣, 程军军, 何小海, 卿粼波, 王美玲.
基于会话式机器阅读理解模型的事件抽取方法
Event Extraction Method Based on Conversational Machine Reading Comprehension Model
计算机科学, 2023, 50(2): 275-284. https://doi.org/10.11896/jsjkx.220400271
[8] 李雪辉, 张拥军, 史殿习, 徐化池, 史燕燕.
融合注意力特征的无锚框视觉目标跟踪方法
AFTM:Anchor-free Object Tracking Method with Attention Features
计算机科学, 2023, 50(1): 138-146. https://doi.org/10.11896/jsjkx.211000083
[9] 赵倩, 周冬明, 杨浩, 王长城.
残差注意力与多特征融合的图像去模糊
Image Deblurring Based on Residual Attention and Multi-feature Fusion
计算机科学, 2023, 50(1): 147-155. https://doi.org/10.11896/jsjkx.211100161
[10] 郑诚, 梅亮, 赵伊研, 张苏航.
基于双向注意力机制和门控图卷积网络的文本分类方法
Text Classification Method Based on Bidirectional Attention and Gated Graph Convolutional Networks
计算机科学, 2023, 50(1): 221-228. https://doi.org/10.11896/jsjkx.211100095
[11] 蔡肖, 陈志华, 盛斌.
基于移位窗口金字塔Transformer的遥感图像目标检测
SPT:Swin Pyramid Transformer for Object Detection of Remote Sensing
计算机科学, 2023, 50(1): 105-113. https://doi.org/10.11896/jsjkx.211100208
[12] 张婧媛, 王宏霞, 何沛松.
基于Transformer的多任务图像拼接篡改检测算法
Multitask Transformer-based Network for Image Splicing Manipulation Detection
计算机科学, 2023, 50(1): 114-122. https://doi.org/10.11896/jsjkx.211100269
[13] 周芳泉, 成卫青.
基于全局增强图神经网络的序列推荐
Sequence Recommendation Based on Global Enhanced Graph Neural Network
计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[14] 戴禹, 许林峰.
基于文本行匹配的跨图文本阅读方法
Cross-image Text Reading Method Based on Text Line Matching
计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032
[15] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!