Computer Science ›› 2025, Vol. 52 ›› Issue (11A): 241000155-9.doi: 10.11896/jsjkx.241000155

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

DEFM-YOLOv8-based Detection Algorithm for High-speed Rail Contact Network Wire State

GAO Yuli, WANG Baohui   

  1. School of Software,Beihang University,Beijing 100191,China
  • Online:2025-11-15 Published:2025-11-10

Abstract: The high-speed rail contact network is a critical conductor in the electrified railway system,and ensuring the proper functioning of its wires is crucial for maintaining the stable operation of the railway.Traditional manual inspection methods are inefficient and prone to oversight.With the rapid development of deep learning technologies,the use of computer vision techniques for automated detection has become an urgent necessity.In response to the challenges of detecting the state of wires in high-speed rail contact networks under various complex outdoor backgrounds and diverse environmental conditions(such as night and day),this paper proposes a wire state detection algorithm based on the combination of a Detail Enhancement Fusion Module(DEFM) and YOLOv8.By incorporating spatial and channel attention mechanisms,the algorithm fuses infrared and visible light images,introducing multimodal fusion and the Shuffle Attention mechanism.Experiments conducted on a real dataset demonstrate the mo-del’s significant improvement in performance metrics such as detection accuracy and recall rate.The results show that the improved algorithm increases the recall rate by 0.94% and mAP by 2.09% compared to the original algorithms.Practical tests indicate that the DEFM-YOLOv8-based detection model performs effectively in detecting wires in the high-speed rail contact network,regardless of whether the environment is nighttime or daytime,even under complex backgrounds.

Key words: High-speed rail contact network, YOLOv8, DEFM, Infrared-visible light image fusion, Attention mechanism, Object detection

CLC Number: 

  • TP302
[1]ZHANG T.Thoughts on Enhancing the Emergency Repair Capacity for Large-scale Faults in High-Speed Railway Catenary [J].Electrified Railway2023,34(S1):114-117.
[2]LIU H D.Software Design for Defect Image Recognition ofHigh-Speed Railway Catenary Components Based on Deep Learning Calculate [D].Shijiazhuang:Shijiazhuang Tiedao University,2023.
[3]REN S,HE K,GIRSHICK R,et al.Towards real-time object detection with region proposal networks,Adv[J].Neural Inf.Process,2015,28.
[4]HE K,GKIOXARI G,DOLLÁR P,et al.Mask r-cnn[C]//Proceedings of the IEEE international conference on computer vision.2017:2961-2969.
[5]LIU W,ANGUELOV D,ERHAN D,et al.Ssd:Single shotmultibox detector[C]//Computer Vision-ECCV 2016:14th European Conference,Amsterdam,The Netherlands,Part I 14.Springer International Publishing,2016:21-37.
[6]JIANG P,ERGU D,LIU F,et al.A Review of Yolo algorithm developments[J].Procedia Computer Science,2022,199:1066-1073.
[7]WANG Y,WANG C,ZHANG H,et al.Automatic ship detection based on RetinaNet using multi-resolution Gaofen-3 imagery[J].Remote Sensing,2019,11(5):531.
[8]HALTMEIER M,NEUMANN L,RABANSER S.Single-stagereconstruction algorithm for quantitative photoacoustic tomography[J].Inverse Problems,2015,31(6):065005.
[9]LI Y,LIANG S,BAI B,et al.Detecting and tracking dim small targets in infrared image sequences under complex backgrounds[J].Multimedia Tools And applications,2014,71:1179-1199.
[10]ZHANG Q L,YANG Y B.Sa-net:Shuffle attention for deep convolutional neural networks[C]//2021 IEEE International Conference on Acoustics,Speech and Signal Processing(ICASSP 2021).IEEE,2021:2235-2239.
[11]HUSSAIN M.YOLO-v1 to YOLO-v8,the rise of YOLO and its complementary nature toward digital manufacturing and industrial defect detection[J].Machines,2023,11(7):677.
[12]MAO Q C,SUN H M,LIU Y B,et al.Mini-YOLOv3:real-time object detector for embedded applications[J].IEEE Access,2019,7:133529-133538.
[13]JOCHER G,CHAURASIA A,STOKEN A,et al.ultralytics/yolov5:v6.2-yolov5 classification models,apple m1,reproducibility,clearml and deci.ai integrations[J].Zenodo,2022.
[14]MA W,WANG K,LI J,et al.Infrared and visible image fusion technology and application:A review[J].Sensors,2023,23(2):599.
[15]DESHMUKH M,BHOSALE U.Image fusion and image quality assessment of fused images[J].International Journal of Image Processing(IJIP),2010,4(5):484.
[16]ZHANG X,LI L,DI D,et al.SERNet:Squeeze and excitation residual network for semantic segmentation of high-resolution remote sensing images[J].Remote Sensing,2022,14(19):4770.
[17]WOO S,PARK J,LEE J Y,et al.Cbam:Convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:3-19.
[1] PENG Jiao, HE Yue, SHANG Xiaoran, HU Saier, ZHANG Bo, CHANG Yongjuan, OU Zhonghong, LU Yanyan, JIANG dan, LIU Yaduo. Text-Dynamic Image Cross-modal Retrieval Algorithm Based on Progressive Prototype Matching [J]. Computer Science, 2025, 52(9): 276-281.
[2] GAO Long, LI Yang, WANG Suge. Sentiment Classification Method Based on Stepwise Cooperative Fusion Representation [J]. Computer Science, 2025, 52(9): 313-319.
[3] SHEN Tao, ZHANG Xiuzai, XU Dai. Improved RT-DETR Algorithm for Small Object Detection in Remote Sensing Images [J]. Computer Science, 2025, 52(8): 214-221.
[4] LIU Jian, YAO Renyuan, GAO Nan, LIANG Ronghua, CHEN Peng. VSRI:Visual Semantic Relational Interactor for Image Caption [J]. Computer Science, 2025, 52(8): 222-231.
[5] LIU Yajun, JI Qingge. Pedestrian Trajectory Prediction Based on Motion Patterns and Time-Frequency Domain Fusion [J]. Computer Science, 2025, 52(7): 92-102.
[6] LIU Chengzhuang, ZHAI Sulan, LIU Haiqing, WANG Kunpeng. Weakly-aligned RGBT Salient Object Detection Based on Multi-modal Feature Alignment [J]. Computer Science, 2025, 52(7): 142-150.
[7] ZHUANG Jianjun, WAN Li. SCF U2-Net:Lightweight U2-Net Improved Method for Breast Ultrasound Lesion SegmentationCombined with Fuzzy Logic [J]. Computer Science, 2025, 52(7): 161-169.
[8] XU Yongwei, REN Haopan, WANG Pengfei. Object Detection Algorithm Based on YOLOv8 Enhancement and Its Application Norms [J]. Computer Science, 2025, 52(7): 189-200.
[9] ZHENG Cheng, YANG Nan. Aspect-based Sentiment Analysis Based on Syntax,Semantics and Affective Knowledge [J]. Computer Science, 2025, 52(7): 218-225.
[10] WANG Youkang, CHENG Chunling. Multimodal Sentiment Analysis Model Based on Cross-modal Unidirectional Weighting [J]. Computer Science, 2025, 52(7): 226-232.
[11] KONG Yinling, WANG Zhongqing, WANG Hongling. Study on Opinion Summarization Incorporating Evaluation Object Information [J]. Computer Science, 2025, 52(7): 233-240.
[12] LI Daicheng, LI Han, LIU Zheyu, GONG Shiheng. MacBERT Based Chinese Named Entity Recognition Fusion with Dependent Syntactic Information and Multi-view Lexical Information [J]. Computer Science, 2025, 52(6A): 240600121-8.
[13] HUANG Bocheng, WANG Xiaolong, AN Guocheng, ZHANG Tao. Transmission Line Fault Identification Method Based on Transfer Learning and Improved YOLOv8s [J]. Computer Science, 2025, 52(6A): 240800044-8.
[14] WU Zhihua, CHENG Jianghua, LIU Tong, CAI Yahui, CHENG Bang, PAN Lehao. Human Target Detection Algorithm for Low-quality Laser Through-window Imaging [J]. Computer Science, 2025, 52(6A): 240600069-6.
[15] GUAN Xin, YANG Xueyong, YANG Xiaolin, MENG Xiangfu. Tumor Mutation Prediction Model of Lung Adenocarcinoma Based on Pathological [J]. Computer Science, 2025, 52(6A): 240700010-8.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!