Computer Science ›› 2025, Vol. 52 ›› Issue (11A): 241000155-9.doi: 10.11896/jsjkx.241000155

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

DEFM-YOLOv8-based Detection Algorithm for High-speed Rail Contact Network Wire State

GAO Yuli, WANG Baohui   

  1. School of Software,Beihang University,Beijing 100191,China
  • Online:2025-11-15 Published:2025-11-10
  • About author:GAO Yuli,born in 1997,postgraduate.His main research interests include graph neural networks and big data.
    WANG Baohui,born in 1973,senior engineer,master supervisor.His main research interests include software architecture,big data,artificial intelligence,etc.

Abstract: The high-speed rail contact network is a critical conductor in the electrified railway system,and ensuring the proper functioning of its wires is crucial for maintaining the stable operation of the railway.Traditional manual inspection methods are inefficient and prone to oversight.With the rapid development of deep learning technologies,the use of computer vision techniques for automated detection has become an urgent necessity.In response to the challenges of detecting the state of wires in high-speed rail contact networks under various complex outdoor backgrounds and diverse environmental conditions(such as night and day),this paper proposes a wire state detection algorithm based on the combination of a Detail Enhancement Fusion Module(DEFM) and YOLOv8.By incorporating spatial and channel attention mechanisms,the algorithm fuses infrared and visible light images,introducing multimodal fusion and the Shuffle Attention mechanism.Experiments conducted on a real dataset demonstrate the mo-del’s significant improvement in performance metrics such as detection accuracy and recall rate.The results show that the improved algorithm increases the recall rate by 0.94% and mAP by 2.09% compared to the original algorithms.Practical tests indicate that the DEFM-YOLOv8-based detection model performs effectively in detecting wires in the high-speed rail contact network,regardless of whether the environment is nighttime or daytime,even under complex backgrounds.

Key words: High-speed rail contact network, YOLOv8, DEFM, Infrared-visible light image fusion, Attention mechanism, Object detection

CLC Number: 

  • TP302
[1]ZHANG T.Thoughts on Enhancing the Emergency Repair Capacity for Large-scale Faults in High-Speed Railway Catenary [J].Electrified Railway2023,34(S1):114-117.
[2]LIU H D.Software Design for Defect Image Recognition ofHigh-Speed Railway Catenary Components Based on Deep Learning Calculate [D].Shijiazhuang:Shijiazhuang Tiedao University,2023.
[3]REN S,HE K,GIRSHICK R,et al.Towards real-time object detection with region proposal networks,Adv[J].Neural Inf.Process,2015,28.
[4]HE K,GKIOXARI G,DOLLÁR P,et al.Mask r-cnn[C]//Proceedings of the IEEE international conference on computer vision.2017:2961-2969.
[5]LIU W,ANGUELOV D,ERHAN D,et al.Ssd:Single shotmultibox detector[C]//Computer Vision-ECCV 2016:14th European Conference,Amsterdam,The Netherlands,Part I 14.Springer International Publishing,2016:21-37.
[6]JIANG P,ERGU D,LIU F,et al.A Review of Yolo algorithm developments[J].Procedia Computer Science,2022,199:1066-1073.
[7]WANG Y,WANG C,ZHANG H,et al.Automatic ship detection based on RetinaNet using multi-resolution Gaofen-3 imagery[J].Remote Sensing,2019,11(5):531.
[8]HALTMEIER M,NEUMANN L,RABANSER S.Single-stagereconstruction algorithm for quantitative photoacoustic tomography[J].Inverse Problems,2015,31(6):065005.
[9]LI Y,LIANG S,BAI B,et al.Detecting and tracking dim small targets in infrared image sequences under complex backgrounds[J].Multimedia Tools And applications,2014,71:1179-1199.
[10]ZHANG Q L,YANG Y B.Sa-net:Shuffle attention for deep convolutional neural networks[C]//2021 IEEE International Conference on Acoustics,Speech and Signal Processing(ICASSP 2021).IEEE,2021:2235-2239.
[11]HUSSAIN M.YOLO-v1 to YOLO-v8,the rise of YOLO and its complementary nature toward digital manufacturing and industrial defect detection[J].Machines,2023,11(7):677.
[12]MAO Q C,SUN H M,LIU Y B,et al.Mini-YOLOv3:real-time object detector for embedded applications[J].IEEE Access,2019,7:133529-133538.
[13]JOCHER G,CHAURASIA A,STOKEN A,et al.ultralytics/yolov5:v6.2-yolov5 classification models,apple m1,reproducibility,clearml and deci.ai integrations[J].Zenodo,2022.
[14]MA W,WANG K,LI J,et al.Infrared and visible image fusion technology and application:A review[J].Sensors,2023,23(2):599.
[15]DESHMUKH M,BHOSALE U.Image fusion and image quality assessment of fused images[J].International Journal of Image Processing(IJIP),2010,4(5):484.
[16]ZHANG X,LI L,DI D,et al.SERNet:Squeeze and excitation residual network for semantic segmentation of high-resolution remote sensing images[J].Remote Sensing,2022,14(19):4770.
[17]WOO S,PARK J,LEE J Y,et al.Cbam:Convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:3-19.
[1] QIAN Qing, CHEN Huicheng, CUI Yunhe, TANG Ruixue, FU Jinmei. Joint Entity and Relation Extraction Method with Multi-scale Collaborative Aggregation and Axial-semantic Guidance [J]. Computer Science, 2026, 53(3): 97-106.
[2] GE Zeqing, HUANG Shengjun. Semi-supervised Learning Method for Multi-label Tabular Data [J]. Computer Science, 2026, 53(3): 151-157.
[3] ZHAO Binbei, ZHU Li, ZHAO Hongli, LI Yutong. Computer Vision Applications in Rail Transit Systems [J]. Computer Science, 2026, 53(3): 214-224.
[4] WANG Xinyu, GAO Donghuai, NING Yuwen, XU Hao, QI Haonan. Student Behavior Detection Method Based on Improved YOLO Algorithm [J]. Computer Science, 2026, 53(3): 246-256.
[5] TANG Xinliang, PAN Xiaorun, WANG Jianchao, SU He. Integrate ByteTrack’s EAP-YOLOv8 UAV Marker Point Detection and Tracking [J]. Computer Science, 2026, 53(3): 266-276.
[6] CHANG Xuanwei, DUAN Liguo, CHEN Jiahao, CUI Juanjuan, LI Aiping. Method for Span-level Sentiment Triplet Extraction by Deeply Integrating Syntactic and Semantic
Features
[J]. Computer Science, 2026, 53(2): 322-330.
[7] ZHANG Jing, PAN Jinghao, JIANG Wenchao. Background Structure-aware Few-shot Knowledge Graph Completion [J]. Computer Science, 2026, 53(2): 331-341.
[8] ZHUO Tienong, YING Di, ZHAO Hui. Research on Student Classroom Concentration Integrating Cross-modal Attention and Role
Interaction
[J]. Computer Science, 2026, 53(2): 67-77.
[9] XU Jingtao, YANG Yan, JIANG Yongquan. Time-Frequency Attention Based Model for Time Series Anomaly Detection [J]. Computer Science, 2026, 53(2): 161-169.
[10] HUANG Jing, WANG Teng, LIU Jian, HU Kai, PENG Xin, HUANG Yamin, WEN Yuanqiao. Multimodal Visual Detection for Underwater Sonar Target Images [J]. Computer Science, 2026, 53(2): 227-235.
[11] HAN Lei, SHANG Haoyu, QIAN Xiaoyan, GU Yan, LIU Qingsong, WANG Chuang. Constrained Multi-loss Video Anomaly Detection with Dual-branch Feature Fusion [J]. Computer Science, 2026, 53(2): 236-244.
[12] GUO Xingxing, XIAO Yannan, WEN Peizhi, XU Zhi, HUANG Wenming. Attention-based Audio-driven Digital Face Video Generation Method [J]. Computer Science, 2026, 53(2): 245-252.
[13] JI Sai, QIAO Liwei, SUN Yajie. Semantic-guided Hybrid Cross-feature Fusion Method for Infrared and Visible Light Images [J]. Computer Science, 2026, 53(2): 253-263.
[14] ZHOU Bingquan, JIANG Jie, CHEN Jiangmin, ZHAN Lixin. EvR-DETR:Event-RGB Fusion for Lightweight End-to-End Object Detection [J]. Computer Science, 2026, 53(1): 153-162.
[15] LI Fangfang, KONG Yuqiu, LIU Yang , LI Pengyue. Co-salient Object Detection Guided by Category Labels [J]. Computer Science, 2026, 53(1): 163-172.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!