计算机科学 ›› 2025, Vol. 52 ›› Issue (12): 166-174.doi: 10.11896/jsjkx.241000130

• 计算机图形学&多媒体 • 上一篇    下一篇

基于Sc-DeepLabV3+模型的铁轨扣件分割方法

黄坤, 何朗, 王展青   

  1. 武汉理工大学理学院 武汉 430070
  • 收稿日期:2024-10-22 修回日期:2025-02-08 出版日期:2025-12-15 发布日期:2025-12-09
  • 通讯作者: 何朗(helang@whut.edu.cn)
  • 作者简介:(417136363@qq.com)
  • 基金资助:
    国家自然科学基金青年科学基金(12201475)

Railway Fastener Segmentation Method Based on Sc-DeepLabV3+ Model

HUANG Kun, HE Lang, WANG Zhanqing   

  1. School of Science, Wuhan University of Technology, Wuhan 430070, China
  • Received:2024-10-22 Revised:2025-02-08 Published:2025-12-15 Online:2025-12-09
  • About author:HUANG Kun,born in 2000,postgra-duate.His main research interest is image processing.
    HE Lang,born in 1974,professor,Ph.D.His main research interests include intelligent calculation and image processing.
  • Supported by:
    This work was supported by the Young Scientists Fund of the National Natural Science Foundation of China(12201475).

摘要: 铁轨扣件病害是影响铁路交通安全的重要因素。利用深度学习图像识别方法对铁轨扣件检测机器人所采集的图像进行分割,可以有效提高扣件病害检测的效率。针对目前缺乏公开可用的铁轨扣件数据集,以及扣件数据量大但背景环境复杂导致分割难度大、耗时长等问题,人工制作了RFS铁轨扣件数据集并提出基于Sc-DeepLabV3+模型的铁轨扣件分割方法。在原始DeepLabV3+模型的基础上,替换其主干网络为轻量MobileNetV4网络以加快运算速度,提出改进的S-ASPP模块,使网络能够获得更密集的像素采样,从而增强网络提取细节特征的能力。此外,加入CSWin注意力机制并行地计算横向和纵向的注意力,减少复杂背景环境的干扰。实验部分,提出了RailAugment数据增强技术,有效增加了数据集的多样性和覆盖度,最终获得的扣件数据集共有6 832张图像,其中训练集4 782张,验证集1 366张,测试集684张。实验结果表明,mIoU和mPA分别达到了95.17%和97.14%,相较于原模型提高了2.19个百分点和0.3个百分点。尽管性能提升幅度较小,但在细节特征提取和背景干扰处理上有明显改善。在公共DeepGlobe数据集上验证了Sc-DeepLabV3+模型的鲁棒性和泛化能力,其推理速度较主流Swin-UNet模型和Segmenter模型快51.4 ms和66.5 ms,展现了良好的效率与实时性。因此,该模型在铁路维护等领域具有广泛应用潜力,能够有效降低人力和算力成本,提高检测效率。

关键词: 深度学习, 图像语义分割, DeepLabV3+, 铁轨扣件, 数据增强

Abstract: The deterioration of track fasteners is a critical factor affecting railway traffic safety.Utilizing deep learning image re-cognition methods for segmenting images collected by track fastener detection robots can significantly improve the efficiency of fastener defect detection.This paper addresses the current lack of publicly available datasets for track fasteners and the challenges posed by complex backgrounds that increase segmentation difficulty and processing time.This paper manually creats the RFS(Rail Fastener Segmentation) track fastener dataset and proposes a segmentation method based on the Sc-DeepLabV3+ model.By replacing the backbone network of the original DeepLabV3+ model with the lightweight MobileNetV4,it accelerates computation speed and introduces an improved S-ASPP(Switchable Atrous Spatial Pyramid Pooling) module to enable the network to achieve denser pixel sampling,enhancing its ability to extract detailed features.Additionally,it incorporates the CSWin(Cross-Shaped Window Self-Attention) attention mechanism to compute horizontal and vertical attention in parallel,reducing interference from complex backgrounds.In the experimental section,this paper proposes the RailAugment data augmentation technique to effectively increase the diversity and coverage of the dataset,ultimately resulting in a total of 6 832 images,including 4 782 for training,1 366 for validation,and 684 for testing.Experimental results show that the mIoU and mPA reach 95.17% and 97.14%,respectively,which represent improvements of 2.19 percentage point and 0.36 percentage point compared to the original model.Although the performance improvement is relatively small,significant improvements are observed in detailed feature extraction and background interference handling.Furthermore,the Sc-DeepLabV3+ model is validated on the DeepGlobe dataset,demonstrating its robustness and generalization ability.Its inference speed is 51.4 ms and 66.5 ms faster than the mainstream Swin-UNet and Segmenter models,respectively,showing good efficiency and real-time performance.Therefore,this model has broad application potential in railway maintenance and other fields,effectively reducing labor and computational costs while improving detection efficiency.

Key words: Deep learning, Image semantic segmentation, DeepLabV3+, Railway fasteners, Data augmentation

中图分类号: 

  • TP319
[1]GAO S G,WANG A B,XIAO J,et al.Research on Damage and Structural Optimization Analysis of Spring Bars in High Speed Railway Fasteners[J].Noise and Vibration Control,2018,38(6):190-193.
[2]LU Z W.Overview of high-speed railway track technology[J].Journal of Railway Engineering,2007(1):41-54.
[3]WEI X K,SUO D,WEI D H,et al.Overview of the Application of Machine Vision in State Detection of Rail Transit Systems [J].Control and Decision Making,2021,36(2):257-282.
[4]HSIEH H Y,CHEN N,LIAO C L.Visual recognition system ofelastic rail clips for mass rapid transit systems [C]//IEEE Joint Rail Conference & Internal Combustion Engine Division Spring Technical Conference.IEEE,2007:234-237.
[5]LI J L,MA H F,ZHANG W Z,et al.Research on edge detection of rail fasteners based on machine vision [J].Journal of Northwest Normal University:Natural Science Edition,2017,53(5):45-48.
[6]LONG J,SHELHAMER E,DARRELL T.Fully convolutional networks for semantic segmentation [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:3431-3440.
[7]NASH W,DRUMMOND T,BIRBILIS N.Quantity beats quali-ty for semantic segmentation of corrosion in images[J].arXiv:1807.03138,2018.
[8]KATSAMENIS I,DOULAMIS N,DOULAMIS A,et al.Simultaneous precise localization and classification of metal rust defects for robotic-driven maintenance and prefabrication using residual attention U-Net [J].Automation in Construction,2022,137:104182.
[9]HAN Q,ZHAO N,XU J.Recognition and location of steelstructure surface corrosion based on unmanned aerial vehicle images [J].Journal of Civil Structural Health Monitoring,2021,11(5):1375-1392.
[10]QIAN C.Evaluation of deep learning-based semantic segmentation approaches for autonomous corrosion detection on metallic surfaces[D].West Lafayette:Purdue University,2019.
[11]LIN G,MILAN A,SHEN C,et al.Refinenet:Multipath refinement networks for high-resolution semantic segmentation [C]//Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2017.5168-5177.
[12]ZHAO H,SHI J,QI X,et al.Pyramid Scene parsing network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2881-2890.
[13]RAHMAN A,WU Z Y,KALFARISI R.Semantic deep learning integrated with RGBfeature-based rule optimization for facility surface corrosion detection and evaluation [J].Journal of Computing in Civil Engineering,2021,35(6):04021018.
[14] ZHANG H,LIAO D,LI C.Rail Surface Defect Detection Model Based on Attention Module and Hybrid-supervised Learning[EB/OL].https://www.jsjkx.com/CN/article/openArticlePDF.jsp?id=21188.
[15]QIN D F,LEICHNER C,DELAKIS M,et al.MobileNetV4-Universal Models for the Mobile[C]//Computer Vision-ECCV 2024:18th European Conference.2024:78-96.
[16]DONG X Y,BAO J M,CHEN D D,et al.CSWin Transformer:A General Vision Transformer Backbone with Cross-Shaped Windows[J].arXiv:2107.00652,2021.
[17]WANG Q,HE L,WANG Z Q,et al.Road Extraction Algorithm for Remote Sensing Images Based on Improved DeepLabv3+[J].Computer Science,2024,51(8):168-175.
[18]CAO Z,YU L,XU D,et al.Swin-Unet:Unifying self-attention and convolutional networks for medical image segmentation[J].IEEE Transactions on Medical Imaging,2021,41(5):1382-1392.
[19]STRUDEL R S,CARION X,CORD I,et al.Segmenter:Transformer-based Semantic Segmentation[J].Theoretical Computer Science,2021,730(19):1-20.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!