Computer Science ›› 2024, Vol. 51 ›› Issue (6A): 230500176-7.doi: 10.11896/jsjkx.230500176

• Image Processing & Multimedia Technolog • Previous Articles     Next Articles

Small Object Detection for Fish Based on SPD-Conv and NAM Attention Module

CHEN Yuzhang, WANG Shiqi, ZHOU Wen, ZHOU Wanting   

  1. School of Computer Science and Information Engineering,Hubei University,Wuhan 430062,China
  • Published:2024-06-06
  • About author:CHEN Yuzhang,born in 1984,Ph.D,associate professor.His main researchinterests include photoelectric detection and image processing.
    WANG Shiqi,born in 1999,posgra-duate.Her main research interests include deep learning and neural networks.
  • Supported by:
    Industry-University Cooperation and Education Program of the Ministry of Education(202101142041).

Abstract: In order to solve the problem of low image resolution due to the degradation of underwater imaging environment and low detection accuracy caused by small fish targets,an improved YOLOv7 detection algorithm combining SPD-Conv structure and NAM attention mechanism is proposed.Firstly,the space-to-fepth(SPD) structure is used to improve the head network,which replaces the original straddle convolution structure in the network,retains more fine-grained information,improves the efficiency of feature learning,and improves the detection effect of the network on low-resolution images.Then,the normalization-based attention module(NAM) attention mechanism is introduced into the network,and the module integration method of CBAM is adopted,and the BN scaling factor is used to calculate the attention weight,which suppresses the insignificant features and improves the accuracy of small target detection.Finally,for underwater imaging degradation,the detection image is deconvolved and preprocessed,which reduces the impact of underwater imaging degradation factors on detection.Experimental results show that in the WildFish dataset,the overall accuracy of the model reaches 97.2%,which is 7.6% higher than that of the YOLOv7 algorithm,the accuracy rate is increased by 8.5%,and the recall rate is increased by 9.8%,compared with the Efficientdet,SSD,YOLOv5 and YOLOv8 algorithms,the accuracy of the proposed model is improved by 12.6%,17.8%,4% and 2.9%,respectively.The overall accuracy of the model reaches 80.5%,which is 18.4%,11.6%,6.9%,2.0% and 2.7% higher than that of Efficientdet,SSD,YOLOv5,YOLOv7 and YOLOv8,respectively,which can meet the needs of underwater fish identification.

Key words: Space-to-Depth Conv(SPD Conv), Normalization-based attention module(NAM), YOLOv7, Fish detection, Object detection

CLC Number: 

  • TP391.41
[1]ID L,MIAO Z,PENG F,et al.Automatic counting methods in aquaculture:a review[J].Journal of the World Aquaculture Society,2021,52(2):269-283.
[2]FAN L Z,LIU Y.Automate fry counting using computer vision and multi-class least squares support vector machine[J].Aquaculture,2013,380/381/382/383:91-98.
[3]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//2014 IEEE Conference on Computer Vision and Pattern Recognition.Columbus,OH,USA.IEEE,2014:580-587.
[4]LIU W,ANGUELOV D,ERHAND,et al.Ssd:Single shotmultibox detector[C]//European Conference on Computer Vision.Cham:Springer,2016:21-37.
[5]REDMON J,DIVVALA S,GIRSHICK R,et al.You only look once:Unified,real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:779-788.
[6]REDMON J,FARHADIA.YOLO9000:better,faster,stronger[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:7263-7271.
[7]REDMON J,FARHADI A.Yolov3:An incremental improve-ment[J].arXiv:1804.02767,2018.
[8]TSENG C H,KUO C H.Detecting and counting harvested fish and identifying fish types in electronic monitoring system videos using deep convolutional neural networks[J].ICES Journal of Marine Science,2020,77(4):1367-1378.
[9]ZENG L C,SUN B,ZHU D Q.Underwater target detectionbased on Faster R-CNN and adversarial occlusion network[J].Engineering Applications of Artificial Intelligence,2021,100:104190.
[10]ZHAO D,YANG B,DOU Y,et al.Underwater fish detection in sonar image based on an improved Faster RCNN[C]//2022 9th International Forum on Electrical Engineering and Automation(IFEEA).Zhuhai,China,2022:358-363.
[11]SHEN J Y,LI L Y,DAI Y L,et al.A fish stock detection method based on feature fusion SSD[J].Computer Simulation,2020,37(11):422-426,469.
[12]ZHANG L,HUANG L,LI B B,et al.Fish counting method based on multi-scale fusion and anchorless YOLO v3[J].Transactions of the Chinese Society for Agricultural Machinery,2021,52(S1):237-244.
[13]ABDULLAH A M,FAKHRUL H,MD F H B E,et al.Fahad Hasan Bhuiyan EMON,et al.YOLO-Fish:A robust fish detection model to detect fish in realistic underwater environment[J].Ecological Informatics,2022,72:101847.
[14]ZHAO S L,ZHANG S,LU J M,et al.A lightweight dead fish detection method based on deformable convolution and YOLOV4[J].Computers and Electronics in Agriculture,2022,198:107098.
[15]ZHANG Y S,XU W X,YANG S S,et al.Improved YOLOX detection algorithm for contraband in X-ray images[J].Applied Optics,2022,61:6297-6310.
[16]VIJIYAKUMAR K,GOVINDASAMY V,AKILA G.Hybridi-zation of Deep Convolutional Neural Network for Underwater Object Detection and Tracking Model[J].Microprocessors and Microsystems,2022,94:104628.
[17]WANG C Y,BOCHKOVSKIY Al,LIAO H Y.YOLOv7:Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[J].arXiv:2207.02696,2022.
[18]SUNKARA R,LUO T.No More Strided Convolutions or Pooling:A New CNN Building Block for Low-Resolution Images and Small Objects[J].arXiv:2208.03641,2022.
[19]LIU Y,SHAO Z,TENG Y,et al.NAM:Normalization-basedAttention Module[J].arXiv,abs/2111.12419,2021.
[20]MIAO Y.Underwater image adaptive restoration and analysisby turbulence model[C]//2012 World Congress on Information and Communication Technologies.IEEE,2012:1182-1187.
[21]ZHUANG P,WANG Y,QIAO Y Y.Wildfish:A large bench-mark for fish recognition in the wild[C]//Proceedings of the 26th ACM international conference on Multimedia.2018:1301-1309.
[22]Roboflow.Aquarium Combined Image Dataset[EB/OL].https://roboflow.com
[1] ZHENG Shenhai, GAO Xi, LIU Pengwei, LI Weisheng. Occluded Video Instance Segmentation Method Based on Feature Fusion of Tracking and Detection in Time Sequence [J]. Computer Science, 2024, 51(6A): 230600186-6.
[2] LIU Hongli, WANG Yulin, SHAO Lei, LI Ji. Study on Monocular Vision Vehicle Ranging Based on Lower Edge of Detection Frame [J]. Computer Science, 2024, 51(6A): 231000077-6.
[3] QUE Yue, GAN Menghan, LIU Zhiwei. Object Detection with Receptive Field Expansion and Multi-branch Aggregation [J]. Computer Science, 2024, 51(6A): 230600151-6.
[4] HUANG Haixin, WU Di. Steel Defect Detection Based on Improved YOLOv7 [J]. Computer Science, 2024, 51(6A): 230800018-5.
[5] JIAO Ruodan, GAO Donghui, HUANG Yanhua, LIU Shuo, DUAN Xuanfei, WANG Rui, LIU Weidong. Study and Verification on Few-shot Evaluation Methods for AI-based Quality Inspection in Production Lines [J]. Computer Science, 2024, 51(6A): 230700086-8.
[6] ZHAO Junjie, ZHOU Xiaojing, LI Jiaxing. Improved YOLOV7 for Fall Detection [J]. Computer Science, 2024, 51(6A): 230800039-6.
[7] LIU Jiasen, HUANG Jun. Center Point Target Detection Algorithm Based on Improved Swin Transformer [J]. Computer Science, 2024, 51(6): 264-271.
[8] LI Yuehao, WANG Dengjiang, JIAN Haifang, WANG Hongchang, CHENG Qinghua. LiDAR-Radar Fusion Object Detection Algorithm Based on BEV Occupancy Prediction [J]. Computer Science, 2024, 51(6): 215-222.
[9] LIAO Junshuang, TAN Qinhong. DETR with Multi-granularity Spatial Attention and Spatial Prior Supervision [J]. Computer Science, 2024, 51(6): 239-246.
[10] BAI Xuefei, SHEN Wucheng, WANG Wenjian. Salient Object Detection Based on Feature Attention Purification [J]. Computer Science, 2024, 51(5): 125-133.
[11] WU Xiaoqin, ZHOU Wenjun, ZUO Chenglin, WANG Yifan, PENG Bo. Salient Object Detection Method Based on Multi-scale Visual Perception Feature Fusion [J]. Computer Science, 2024, 51(5): 143-150.
[12] JIAN Yingjie, YANG Wenxia, FANG Xi, HAN Huan. 3D Object Detection Based on Edge Convolution and Bottleneck Attention Module for Point Cloud [J]. Computer Science, 2024, 51(5): 162-171.
[13] XU Hao, LI Fengrun, LU Lu. Metal Surface Defect Detection Method Based on Dual-stream YOLOv4 [J]. Computer Science, 2024, 51(4): 209-216.
[14] LIU Zeyu, LIU Jianwei. Video and Image Salient Object Detection Based on Multi-task Learning [J]. Computer Science, 2024, 51(4): 217-228.
[15] HAO Ran, WANG Hongjun, LI Tianrui. Deep Neural Network Model for Transmission Line Defect Detection Based on Dual-branch Sequential Mixed Attention [J]. Computer Science, 2024, 51(3): 135-140.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!