计算机科学 ›› 2025, Vol. 52 ›› Issue (12): 166-174.doi: 10.11896/jsjkx.241000130
黄坤, 何朗, 王展青
HUANG Kun, HE Lang, WANG Zhanqing
摘要: 铁轨扣件病害是影响铁路交通安全的重要因素。利用深度学习图像识别方法对铁轨扣件检测机器人所采集的图像进行分割,可以有效提高扣件病害检测的效率。针对目前缺乏公开可用的铁轨扣件数据集,以及扣件数据量大但背景环境复杂导致分割难度大、耗时长等问题,人工制作了RFS铁轨扣件数据集并提出基于Sc-DeepLabV3+模型的铁轨扣件分割方法。在原始DeepLabV3+模型的基础上,替换其主干网络为轻量MobileNetV4网络以加快运算速度,提出改进的S-ASPP模块,使网络能够获得更密集的像素采样,从而增强网络提取细节特征的能力。此外,加入CSWin注意力机制并行地计算横向和纵向的注意力,减少复杂背景环境的干扰。实验部分,提出了RailAugment数据增强技术,有效增加了数据集的多样性和覆盖度,最终获得的扣件数据集共有6 832张图像,其中训练集4 782张,验证集1 366张,测试集684张。实验结果表明,mIoU和mPA分别达到了95.17%和97.14%,相较于原模型提高了2.19个百分点和0.3个百分点。尽管性能提升幅度较小,但在细节特征提取和背景干扰处理上有明显改善。在公共DeepGlobe数据集上验证了Sc-DeepLabV3+模型的鲁棒性和泛化能力,其推理速度较主流Swin-UNet模型和Segmenter模型快51.4 ms和66.5 ms,展现了良好的效率与实时性。因此,该模型在铁路维护等领域具有广泛应用潜力,能够有效降低人力和算力成本,提高检测效率。
中图分类号:
| [1]GAO S G,WANG A B,XIAO J,et al.Research on Damage and Structural Optimization Analysis of Spring Bars in High Speed Railway Fasteners[J].Noise and Vibration Control,2018,38(6):190-193. [2]LU Z W.Overview of high-speed railway track technology[J].Journal of Railway Engineering,2007(1):41-54. [3]WEI X K,SUO D,WEI D H,et al.Overview of the Application of Machine Vision in State Detection of Rail Transit Systems [J].Control and Decision Making,2021,36(2):257-282. [4]HSIEH H Y,CHEN N,LIAO C L.Visual recognition system ofelastic rail clips for mass rapid transit systems [C]//IEEE Joint Rail Conference & Internal Combustion Engine Division Spring Technical Conference.IEEE,2007:234-237. [5]LI J L,MA H F,ZHANG W Z,et al.Research on edge detection of rail fasteners based on machine vision [J].Journal of Northwest Normal University:Natural Science Edition,2017,53(5):45-48. [6]LONG J,SHELHAMER E,DARRELL T.Fully convolutional networks for semantic segmentation [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:3431-3440. [7]NASH W,DRUMMOND T,BIRBILIS N.Quantity beats quali-ty for semantic segmentation of corrosion in images[J].arXiv:1807.03138,2018. [8]KATSAMENIS I,DOULAMIS N,DOULAMIS A,et al.Simultaneous precise localization and classification of metal rust defects for robotic-driven maintenance and prefabrication using residual attention U-Net [J].Automation in Construction,2022,137:104182. [9]HAN Q,ZHAO N,XU J.Recognition and location of steelstructure surface corrosion based on unmanned aerial vehicle images [J].Journal of Civil Structural Health Monitoring,2021,11(5):1375-1392. [10]QIAN C.Evaluation of deep learning-based semantic segmentation approaches for autonomous corrosion detection on metallic surfaces[D].West Lafayette:Purdue University,2019. [11]LIN G,MILAN A,SHEN C,et al.Refinenet:Multipath refinement networks for high-resolution semantic segmentation [C]//Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2017.5168-5177. [12]ZHAO H,SHI J,QI X,et al.Pyramid Scene parsing network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2881-2890. [13]RAHMAN A,WU Z Y,KALFARISI R.Semantic deep learning integrated with RGBfeature-based rule optimization for facility surface corrosion detection and evaluation [J].Journal of Computing in Civil Engineering,2021,35(6):04021018. [14] ZHANG H,LIAO D,LI C.Rail Surface Defect Detection Model Based on Attention Module and Hybrid-supervised Learning[EB/OL].https://www.jsjkx.com/CN/article/openArticlePDF.jsp?id=21188. [15]QIN D F,LEICHNER C,DELAKIS M,et al.MobileNetV4-Universal Models for the Mobile[C]//Computer Vision-ECCV 2024:18th European Conference.2024:78-96. [16]DONG X Y,BAO J M,CHEN D D,et al.CSWin Transformer:A General Vision Transformer Backbone with Cross-Shaped Windows[J].arXiv:2107.00652,2021. [17]WANG Q,HE L,WANG Z Q,et al.Road Extraction Algorithm for Remote Sensing Images Based on Improved DeepLabv3+[J].Computer Science,2024,51(8):168-175. [18]CAO Z,YU L,XU D,et al.Swin-Unet:Unifying self-attention and convolutional networks for medical image segmentation[J].IEEE Transactions on Medical Imaging,2021,41(5):1382-1392. [19]STRUDEL R S,CARION X,CORD I,et al.Segmenter:Transformer-based Semantic Segmentation[J].Theoretical Computer Science,2021,730(19):1-20. |
|
||