Computer Science ›› 2025, Vol. 52 ›› Issue (12): 166-174.doi: 10.11896/jsjkx.241000130

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Railway Fastener Segmentation Method Based on Sc-DeepLabV3+ Model

HUANG Kun, HE Lang, WANG Zhanqing   

  1. School of Science, Wuhan University of Technology, Wuhan 430070, China
  • Received:2024-10-22 Revised:2025-02-08 Online:2025-12-15 Published:2025-12-09
  • About author:HUANG Kun,born in 2000,postgra-duate.His main research interest is image processing.
    HE Lang,born in 1974,professor,Ph.D.His main research interests include intelligent calculation and image processing.
  • Supported by:
    This work was supported by the Young Scientists Fund of the National Natural Science Foundation of China(12201475).

Abstract: The deterioration of track fasteners is a critical factor affecting railway traffic safety.Utilizing deep learning image re-cognition methods for segmenting images collected by track fastener detection robots can significantly improve the efficiency of fastener defect detection.This paper addresses the current lack of publicly available datasets for track fasteners and the challenges posed by complex backgrounds that increase segmentation difficulty and processing time.This paper manually creats the RFS(Rail Fastener Segmentation) track fastener dataset and proposes a segmentation method based on the Sc-DeepLabV3+ model.By replacing the backbone network of the original DeepLabV3+ model with the lightweight MobileNetV4,it accelerates computation speed and introduces an improved S-ASPP(Switchable Atrous Spatial Pyramid Pooling) module to enable the network to achieve denser pixel sampling,enhancing its ability to extract detailed features.Additionally,it incorporates the CSWin(Cross-Shaped Window Self-Attention) attention mechanism to compute horizontal and vertical attention in parallel,reducing interference from complex backgrounds.In the experimental section,this paper proposes the RailAugment data augmentation technique to effectively increase the diversity and coverage of the dataset,ultimately resulting in a total of 6 832 images,including 4 782 for training,1 366 for validation,and 684 for testing.Experimental results show that the mIoU and mPA reach 95.17% and 97.14%,respectively,which represent improvements of 2.19 percentage point and 0.36 percentage point compared to the original model.Although the performance improvement is relatively small,significant improvements are observed in detailed feature extraction and background interference handling.Furthermore,the Sc-DeepLabV3+ model is validated on the DeepGlobe dataset,demonstrating its robustness and generalization ability.Its inference speed is 51.4 ms and 66.5 ms faster than the mainstream Swin-UNet and Segmenter models,respectively,showing good efficiency and real-time performance.Therefore,this model has broad application potential in railway maintenance and other fields,effectively reducing labor and computational costs while improving detection efficiency.

Key words: Deep learning, Image semantic segmentation, DeepLabV3+, Railway fasteners, Data augmentation

CLC Number: 

  • TP319
[1]GAO S G,WANG A B,XIAO J,et al.Research on Damage and Structural Optimization Analysis of Spring Bars in High Speed Railway Fasteners[J].Noise and Vibration Control,2018,38(6):190-193.
[2]LU Z W.Overview of high-speed railway track technology[J].Journal of Railway Engineering,2007(1):41-54.
[3]WEI X K,SUO D,WEI D H,et al.Overview of the Application of Machine Vision in State Detection of Rail Transit Systems [J].Control and Decision Making,2021,36(2):257-282.
[4]HSIEH H Y,CHEN N,LIAO C L.Visual recognition system ofelastic rail clips for mass rapid transit systems [C]//IEEE Joint Rail Conference & Internal Combustion Engine Division Spring Technical Conference.IEEE,2007:234-237.
[5]LI J L,MA H F,ZHANG W Z,et al.Research on edge detection of rail fasteners based on machine vision [J].Journal of Northwest Normal University:Natural Science Edition,2017,53(5):45-48.
[6]LONG J,SHELHAMER E,DARRELL T.Fully convolutional networks for semantic segmentation [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:3431-3440.
[7]NASH W,DRUMMOND T,BIRBILIS N.Quantity beats quali-ty for semantic segmentation of corrosion in images[J].arXiv:1807.03138,2018.
[8]KATSAMENIS I,DOULAMIS N,DOULAMIS A,et al.Simultaneous precise localization and classification of metal rust defects for robotic-driven maintenance and prefabrication using residual attention U-Net [J].Automation in Construction,2022,137:104182.
[9]HAN Q,ZHAO N,XU J.Recognition and location of steelstructure surface corrosion based on unmanned aerial vehicle images [J].Journal of Civil Structural Health Monitoring,2021,11(5):1375-1392.
[10]QIAN C.Evaluation of deep learning-based semantic segmentation approaches for autonomous corrosion detection on metallic surfaces[D].West Lafayette:Purdue University,2019.
[11]LIN G,MILAN A,SHEN C,et al.Refinenet:Multipath refinement networks for high-resolution semantic segmentation [C]//Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2017.5168-5177.
[12]ZHAO H,SHI J,QI X,et al.Pyramid Scene parsing network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2881-2890.
[13]RAHMAN A,WU Z Y,KALFARISI R.Semantic deep learning integrated with RGBfeature-based rule optimization for facility surface corrosion detection and evaluation [J].Journal of Computing in Civil Engineering,2021,35(6):04021018.
[14] ZHANG H,LIAO D,LI C.Rail Surface Defect Detection Model Based on Attention Module and Hybrid-supervised Learning[EB/OL].https://www.jsjkx.com/CN/article/openArticlePDF.jsp?id=21188.
[15]QIN D F,LEICHNER C,DELAKIS M,et al.MobileNetV4-Universal Models for the Mobile[C]//Computer Vision-ECCV 2024:18th European Conference.2024:78-96.
[16]DONG X Y,BAO J M,CHEN D D,et al.CSWin Transformer:A General Vision Transformer Backbone with Cross-Shaped Windows[J].arXiv:2107.00652,2021.
[17]WANG Q,HE L,WANG Z Q,et al.Road Extraction Algorithm for Remote Sensing Images Based on Improved DeepLabv3+[J].Computer Science,2024,51(8):168-175.
[18]CAO Z,YU L,XU D,et al.Swin-Unet:Unifying self-attention and convolutional networks for medical image segmentation[J].IEEE Transactions on Medical Imaging,2021,41(5):1382-1392.
[19]STRUDEL R S,CARION X,CORD I,et al.Segmenter:Transformer-based Semantic Segmentation[J].Theoretical Computer Science,2021,730(19):1-20.
[1] YIN Shi, SHI Zhenyang, WU Menglin, CAI Jinyan, YU De. Deep Learning-based Kidney Segmentation in Ultrasound Imaging:Current Trends and Challenges [J]. Computer Science, 2025, 52(9): 16-24.
[2] ZENG Lili, XIA Jianan, LI Shaowen, JING Maike, ZHAO Huihui, ZHOU Xuezhong. M2T-Net:Cross-task Transfer Learning Tongue Diagnosis Method Based on Multi-source Data [J]. Computer Science, 2025, 52(9): 47-53.
[3] LI Yaru, WANG Qianqian, CHE Chao, ZHU Deheng. Graph-based Compound-Protein Interaction Prediction with Drug Substructures and Protein 3D Information [J]. Computer Science, 2025, 52(9): 71-79.
[4] LUO Chi, LU Lingyun, LIU Fei. Partial Differential Equation Solving Method Based on Locally Enhanced Fourier NeuralOperators [J]. Computer Science, 2025, 52(9): 144-151.
[5] LIU Leyuan, CHEN Gege, WU Wei, WANG Yong, ZHOU Fan. Survey of Data Classification and Grading Studies [J]. Computer Science, 2025, 52(9): 195-211.
[6] LIU Wei, XU Yong, FANG Juan, LI Cheng, ZHU Yujun, FANG Qun, HE Xin. Multimodal Air-writing Gesture Recognition Based on Radar-Vision Fusion [J]. Computer Science, 2025, 52(9): 259-268.
[7] LIU Zhengyu, ZHANG Fan, QI Xiaofeng, GAO Yanzhao, SONG Yijing, FAN Wang. Review of Research on Deep Learning Compiler [J]. Computer Science, 2025, 52(8): 29-44.
[8] TANG Boyuan, LI Qi. Review on Application of Spatial-Temporal Graph Neural Network in PM2.5 ConcentrationForecasting [J]. Computer Science, 2025, 52(8): 71-85.
[9] YANG Jian, SUN Liu, ZHANG Lifang. Survey on Data Processing and Data Augmentation in Low-resource Language Automatic Speech Recognition [J]. Computer Science, 2025, 52(8): 86-99.
[10] LI Mengxi, GAO Xindan, LI Xue. Two-way Feature Augmentation Graph Convolution Networks Algorithm [J]. Computer Science, 2025, 52(7): 127-134.
[11] ZHENG Cheng, YANG Nan. Aspect-based Sentiment Analysis Based on Syntax,Semantics and Affective Knowledge [J]. Computer Science, 2025, 52(7): 218-225.
[12] ZHOU Lei, SHI Huaifeng, YANG Kai, WANG Rui, LIU Chaofan. Intelligent Prediction of Network Traffic Based on Large Language Model [J]. Computer Science, 2025, 52(6A): 241100058-7.
[13] GUAN Xin, YANG Xueyong, YANG Xiaolin, MENG Xiangfu. Tumor Mutation Prediction Model of Lung Adenocarcinoma Based on Pathological [J]. Computer Science, 2025, 52(6A): 240700010-8.
[14] TAN Jiahui, WEN Chenyan, HUANG Wei, HU Kai. CT Image Segmentation of Intracranial Hemorrhage Based on ESC-TransUNet Network [J]. Computer Science, 2025, 52(6A): 240700030-9.
[15] RAN Qin, RUAN Xiaoli, XU Jing, LI Shaobo, HU Bingqi. Function Prediction of Therapeutic Peptides with Multi-coded Neural Networks Based on Projected Gradient Descent [J]. Computer Science, 2025, 52(6A): 240800024-6.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!