Computer Science ›› 2025, Vol. 52 ›› Issue (11A): 241200131-8.doi: 10.11896/jsjkx.241200131

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Target Recognition Algorithm in Urban Traffic Field of View Based on Improved YOLOv8

CHEN Junjie, ZHAO Hong, LUO Yong, DING Xiaoyun   

  1. College of Mechanical and Electrical Engineering,Qingdao University,Qingdao,Shangdong 266071,China
  • Online:2025-11-15 Published:2025-11-10
  • About author:CHEN Junjie,born in 2000,postgra-duate.His main research interests include deep learning and object detection.
    ZHAO Hong,born in 1973,Ph.D,asso-ciate professor.Her main research interests include emission reductionand new energy technology.
  • Supported by:
    Qingdao Science and Technology Benefiting the People Demonstration Project(24-1-8-cspz-18-nsh).

Abstract: To reduce the issues of false detection and missed detection in target detection algorithms within urban environments,the YOLOv8 target detection model is used as the foundation,and a small-object detection layer is introduced to enable the network to better capture and recognize small-sized objects in the field of view,thereby improving its focus on target recognition.A novel remote sensing target detection model is integrated to reconstruct the C2f module,enhancing its perception of rich gradient flow information and its ability to dynamically adjust the receptive field.By applying topological optimization concepts to improve the CBAM attention mechanism,the GSAM attention mechanism is proposed and embedded at appropriate positions in the network to enhance the utilization of semantic information.To address the problem of missed detections,the performance of multiple IoU methods is compared,and the optimal EIoU is selected to accelerate the convergence speed of the algorithm and improve regression accuracy.Testing and ablation experiments conducted on the Cityscapes public dataset show that,compared to the baseline algorithm,the improved algorithm achieves increases of 2.5,5.8,and 6.1 percentage points in precision,recall,and mean average precision(mAP),respectively.These results effectively enhance the accuracy of vehicle target detection in urban traffic scenarios,providing reliable support for applications such as road video surveillance.

Key words: Urban traffic, Target recognition, YOLOv8, Attention mechanism, IOU

CLC Number: 

  • U495
[1]WU J,ZHAO Y Q,QlU X Y.Vehicle Steering Angle ldentification based on GRNN[J].Journal of Basic Science and Engineering,2014,22(1):170-178.
[2]GAO T,LIU Z G,QIU X Y.Traffic Vehicle Contour Tracking and lts Engineering Application[J].Journal of Basic Science and Engineering,2010,18(2):343-351.
[3]LIU W,ANGUELOV D,ERHAN D,et al.Ssd:Single shotmultibox detector[C]//Computer Vision-ECCV 2016:14th European Conference,Amsterdam,The Netherlands,Part I 14.Springer International Publishing,2016:21-37.
[4]REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once:Unified,real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:779-788.
[5]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2980-2988.
[6]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:580-587.
[7]REN S,HE K,GIRSHICK R,et al.Faster R-CNN:Towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2016,39(6):1137-1149.
[8]HE K,ZHANG X,REN S,et al.Spatial pyramid pooling in deep convolutional networks for visual recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(9):1904-1916.
[9]SONG C L,CHAI W Q,ZHANG X S.Road small target detection based on improved YOLO v5 algorithm [J].Systems Engineering and Electronics,2024,46(10):3271-3278.
[10]MU L,ZHAO H,LI Y,et al.Traffic Flow Statistics MethodBased on Deep Learning and Multi-Feature Fusion[J].CMES-Computer Modeling in Engineering & Sciences,2021,129(2):465-483.
[11]CHEN Z Y,WANG X L,HE D,et al.Lightweight Vehicle Detection NetworkBased on Improved YOLOv8 [J].Computer Engineering,2025,51(5):314-325.
[12]ZHENG Q M,WANG LL,WANG F H.Small Object Detection in Traffic Scene Based on Improved Convolutional Neural Network[J].Computer Engineering,2020,46(6):26-33.
[13]LI G J,HU J,AI J Y.Vehicle Detection Based on Improved SSD Algorithm[J].Computer Engineering,2022,48(1):266-274.
[14]CAI Y,LUAN T,GAO H,et al.YOLOv4-5D:An effective and efficient object detector for autonomous driving[J].IEEE Transactions on Instrumentation and Measurement,2021,70:1-13.
[15]SONG H J,ZHOU L.Vehicle detection and recognition algo-rithm based on function improvement of YOLOv3[J].Chinese Journal of Intelligent Science and Technology,2023,5(4):535-542.
[16]PI J,LIU Y H,LI J H.Research on lightweight forest fire detection algorithm based on YOLOv5s[J].Journal of Graphics,2023,44(1):26-32.
[17]LI G,ZHAO W,LIU P,et al.Smooth-IoU Loss for BoundingBox Regression in Visual Tracking[J].Acta Automatica Sinica,2023,49(2):288-306.
[18]GUO Y Y,HU W C,DAI S,et al.Lightweight Vehicle Detection Model for Roadside Traffic Monitoring Scenarios[J].Computer Engineering and Applications,2022,58(6):192-199.
[19]MIAO Y Z,ZHANG Z W,WANG H S,et al.Detection method of fallen leaves on road based on AC-YOLO[J].Control and Decision,2023,38(7):1878-1886.
[20]WEN B J,ZHANG C T.Lightweight mask wearing detection algorithm based on YOLOv3[J].Electronic Measurement Technology,2021(44):105-110.
[21]MA N,ZHANG X,ZHENG H T,et al.Shufflenet v2:Practical guidelines for efficient cnn architecture design[C]//15th European Conference on Computer Vision(ECCV).Munich:SPRINGER-VERLAG BERLIN,2018:116-131.
[22]LI Y X,HOU Q B,ZHENG Z H,et al.Large selective kernel network for remote sensing object detection[J].arXiv:2303.09030,2023.
[23]ZHANG X,ZENG H,GUO S,et al.:Efficient long-range attention network for image super-resolution[J].arXiv:220306697.
[24]WOO S,PARK J,LEE J Y,et al.CBAM:convolutional block attention module[C]//ECCV 2018.LNCS,Springer,2018:3-19.
[25]WANG Q,WU B,ZHU P,et al.ECA-net:Efficient channel attention for deep convolutional neural networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:11534-11542.
[26]RUAND,WANG D,ZHENG Y,et al.Gaussian Context Transformer[C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).Nashville,TN,USA.IEEE.2021:15124-15133.
[27]HU J,SHEN L,SUN G.Squeeze-and-Excitation Networks[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition,Salt Lake City,UT,USA,.IEEE.2018:7132-7141.
[1] WANG Xinyu, GAO Donghuai, NING Yuwen, XU Hao, QI Haonan. Student Behavior Detection Method Based on Improved YOLO Algorithm [J]. Computer Science, 2026, 53(3): 246-256.
[2] TANG Xinliang, PAN Xiaorun, WANG Jianchao, SU He. Integrate ByteTrack’s EAP-YOLOv8 UAV Marker Point Detection and Tracking [J]. Computer Science, 2026, 53(3): 266-276.
[3] QIAN Qing, CHEN Huicheng, CUI Yunhe, TANG Ruixue, FU Jinmei. Joint Entity and Relation Extraction Method with Multi-scale Collaborative Aggregation and Axial-semantic Guidance [J]. Computer Science, 2026, 53(3): 97-106.
[4] GE Zeqing, HUANG Shengjun. Semi-supervised Learning Method for Multi-label Tabular Data [J]. Computer Science, 2026, 53(3): 151-157.
[5] CHANG Xuanwei, DUAN Liguo, CHEN Jiahao, CUI Juanjuan, LI Aiping. Method for Span-level Sentiment Triplet Extraction by Deeply Integrating Syntactic and Semantic
Features
[J]. Computer Science, 2026, 53(2): 322-330.
[6] ZHANG Jing, PAN Jinghao, JIANG Wenchao. Background Structure-aware Few-shot Knowledge Graph Completion [J]. Computer Science, 2026, 53(2): 331-341.
[7] ZHUO Tienong, YING Di, ZHAO Hui. Research on Student Classroom Concentration Integrating Cross-modal Attention and Role
Interaction
[J]. Computer Science, 2026, 53(2): 67-77.
[8] XU Jingtao, YANG Yan, JIANG Yongquan. Time-Frequency Attention Based Model for Time Series Anomaly Detection [J]. Computer Science, 2026, 53(2): 161-169.
[9] HAN Lei, SHANG Haoyu, QIAN Xiaoyan, GU Yan, LIU Qingsong, WANG Chuang. Constrained Multi-loss Video Anomaly Detection with Dual-branch Feature Fusion [J]. Computer Science, 2026, 53(2): 236-244.
[10] GUO Xingxing, XIAO Yannan, WEN Peizhi, XU Zhi, HUANG Wenming. Attention-based Audio-driven Digital Face Video Generation Method [J]. Computer Science, 2026, 53(2): 245-252.
[11] JI Sai, QIAO Liwei, SUN Yajie. Semantic-guided Hybrid Cross-feature Fusion Method for Infrared and Visible Light Images [J]. Computer Science, 2026, 53(2): 253-263.
[12] LYU Jinggang, GAO Shuo, LI Yuzhi, ZHOU Jin. Facial Expression Recognition with Channel Attention Guided Global-Local Semantic Cooperation [J]. Computer Science, 2026, 53(1): 195-205.
[13] FAN Jiabin, WANG Baohui, CHEN Jixuan. Method for Symbol Detection in Substation Layout Diagrams Based on Text-Image MultimodalFusion [J]. Computer Science, 2026, 53(1): 206-215.
[14] WANG Haoyan, LI Chongshou, LI Tianrui. Reinforcement Learning Method for Solving Flexible Job Shop Scheduling Problem Based onDouble Layer Attention Network [J]. Computer Science, 2026, 53(1): 231-240.
[15] CHEN Qian, CHENG Kaixuan, GUO Xin, ZHANG Xiaoxia, WANG Suge, LI Yanhong. Bidirectional Prompt-Tuning for Event Argument Extraction with Topic and Entity Embeddings [J]. Computer Science, 2026, 53(1): 278-284.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!