Computer Science ›› 2025, Vol. 52 ›› Issue (11A): 250200043-7.doi: 10.11896/jsjkx.250200043

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Cotton Disease Detection Based on Feature Enhancement and Group Mix Attention

WANG Hongqiang, ZHAO Hui, JIA Zhenhong   

  1. College of Computer Science and Technology,Xinjiang University,Urumqi 830049,China
  • Online:2025-11-15 Published:2025-11-10
  • About author:WANG Hongqiang,born in 1997,postgraduate.His main research interest is image detection based on deep learning.
    ZHAO Hui,born in 1972,Ph.D,professor,is member of CCF(No.25440S).Her main research interests include artificial inteligence,affective computing,speech and digital image processing.
  • Supported by:
    Key R&D Projects in the Autonomous Region(2023B01032) and National Science and Technology Major Project of the Ministry of Science and Technology of China(2022ZD0115802).

Abstract: In order to achieve rapid and accurate detection of cotton diseases in real field environments,this paper proposes a cotton disease target detection model based on feature enhancement and attention mechanism.To ensure the accuracy of the model’sdetection in real field environments,an improved feature enhancement module is used in the Neck module to weight the feature maps and reduce the interference of background or other objects on the targets in the image.After the feature enhancement mo-dule,Group Mix Attention is used to connect contextual information and enrich the feature map information.The proposed model can effectively improve the detection accuracy of models in real field environments,effectively reducing the occurrence of model false positives and false negatives using SIoU loss function.The experimental results show that the proposed model performs well on the self built real field environment cotton disease target detection dataset,effectively improving the detection accuracy of the model in real field environments.Compared with the baseline model,the mAP and Precision have increased by 2 percentage points and 4.5 percentage points.

Key words: Cotton diseases and pests, Small goals, Feature enhancement, Attention mechanism, Loss function

CLC Number: 

  • TP391
[1]REDMON J.You only look once:Unified,real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016.
[2]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single shotmultibox detector[C]//Computer Vision-ECCV 2016:14th European Conference.Amsterdam,The Netherlands,.Springer,2016:21-37.
[3]LIN T.Focal loss for dense object detection[J].arXiv:1708.02002,2017.
[4]TAN M,PANG R,LE Q V.EfficientDet:Scalable and efficient object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:10781-10790.
[5]TIAN Z,SHEN C,CHEN H,et al.FCOS:Fully convolutionalone-stage object detection[J].arXiv:1904.01355,2019.
[6]LAW H,DENG J.CornerNet:Detecting objects as paired keypoints[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:734-750.
[7]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:580-587.
[8]CAI Z,VASCONCELOS N.Cascade R-CNN:Delving into high quality object detection[C]//Proceedings of the IEEE Confe-rence on Computer Vision and Pattern Recognition.2018:6154-6162.
[9]GIRSHICK R.Fast R-CNN[J].arXiv:1504.08083,2015.
[10]REN S.Faster R-CNN:Towards real-time object detection with region proposal networks[J].arXiv:1506.01497,2015.
[11]HAQUE M A,MARWAHA S,DEB C K,et al.Deep learning-based approach for identification of diseases of maize crop[J].Scientific Reports,2022,12(1):6334.
[12]ZHAO Y,SUN C,XU X,et al.RIC-Net:A plant disease classification model based on the fusion of inception and residual structure and embedded attention mechanism[J].Computers and Electronics in Agriculture,2022,193:106644.
[13]FAN X,SUN T,CHAI X,et al.YOLO-WDNet:A lightweight and accurate model for weeds detection in cotton field[J].Computers and Electronics in Agriculture,2024,225:109317.
[14]WANG B,HUA J,XIA L,et al.A defect detection method for Akidzuki pears based on computer vision and deep learning[J].Postharvest Biology and Technology,2024,218:113157.
[15]SYED-AB-RAHMAN S F,HESAMIAN M H,PRASAD M.Citrus disease detection and classification using end-to-end anchor-based deep learning model[J].Applied Intelligence,2022,52(1):927-938.
[16]JOHNSON J,SHARMA G,SRINIVASAN S,et al.Enhancedfield-based detection of potato blight in complex backgrounds using deep learning[J].Plant Phenomics,2021,2021:9835724.
[17]BAO W,ZHU Z,HU G,et al.UAV remote sensing detection of tea leaf blight based on DDMA-YOLO[J].Computers and Electronics in Agriculture,2023,205:107637.
[18]TIAN Y,WANG S,LI E,et al.MD-YOLO:Multi-scale dense YOLO for small target pest detection[J].Computers and Electronics in Agriculture,2023,213:108233.
[19]FAN Z,LIU Q.Adaptive region-aware feature enhancement for object detection[J].Pattern Recognition,2022,124:108437.
[20]QI J,LIU X,LIU K,et al.An improved YOLOv5 model based on visual attention mechanism:Application to recognition of tomato virus disease[J].Computers and Electronics in Agriculture,2022,194:106780.
[21]ZHANG Y,MA B,HU Y,et al.Accurate cotton diseases and pests detection in complex background based on an improved YOLOX model[J].Computers and Electronics in Agriculture,2022,203:107484.
[22]GE C,DING X,TONG Z,et al.Advancing vision transformers with group-mix attention[J].arXiv:2311.15157,2023,11:113027-113041.
[23]DENG C,WANG M,LIU L,et al.Extended feature pyramid network for small object detection[J].IEEE Transactions on Multimedia,2021,24:1968-1979.
[24]LI S,LIU C,TANG K,et al.Improved YOLOv5s algorithm for small target detection in UAV aerial photography[J].IEEE Access,2024.
[25]JING R,ZHANG W,LIU Y,et al.An effective method for small object detection in low-resolution images[J].Engineering Applications of Artificial Intelligence,2024,127:107206.
[26]GAO J,GENG X,ZHANG Y,et al.Augmented weighted bidirectional feature pyramid network for marine object detection[J].Expert Systems with Applications,2024,237:121688.
[27]MA J,LU A,CHEN C,et al.YOLOv5-Lotus:An efficient object detection method for lotus seedpod in a natural environment[J].Computers and Electronics in Agriculture,2023,206:107635.
[28]SONG G,DU H,ZHANG X,et al.Small object detection in unmanned aerial vehicle images using multi-scale hybrid attention[J].Engineering Applications of Artificial Intelligence,2024,128:107455.
[29]HUANG M,YAN W,DAI W,et al.EST-YOLOv5s:SAR image aircraft target detection model based on improved YOLOv5s[J].IEEE Access,2023,11:113027-113041.
[30]VARGHESE R,SAMBATH M.YOLOv8:A novel object detection algorithm with enhanced performance and robustness[C]//2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems(ADICS).IEEE,2024:1-6.
[31]GEVORGYAN Z.SIoU loss:More powerful learning for bounding box regression[J].arXiv:2205.12740,2022.
[32]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141.
[33]HOU Q,ZHOU D,FENG J.Coordinate attention for efficient mobile network design[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:13713-13722.
[34]WANG Q,WU B,ZHU P,et al.ECA-Net:Efficient channel attention for deep convolutional neural networks[C]//Procee-dings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:11534-11542.
[35]WOO S,PARK J,LEE J Y,et al.CBAM:Convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:3-19.
[36]ZHENG Z,WANG P,LIU W,et al.Distance-IoU loss:Fasterand better learning for bounding box regression[C]//Procee-dings of the AAAI Conference on Artificial Intelligence.2020:12993-13000.
[37]ZHANG Y F,REN W,ZHANG Z,et al.Focal and efficient IoU loss for accurate bounding box regression[J].Neurocomputing,2022,506:146-157.
[38]TONG Z,CHEN Y,XU Z,et al.Wise-IoU:Bounding box regression loss with dynamic focusing mechanism[J].arXiv:2301.10051,2023.
[1] WANG Xinyu, GAO Donghuai, NING Yuwen, XU Hao, QI Haonan. Student Behavior Detection Method Based on Improved YOLO Algorithm [J]. Computer Science, 2026, 53(3): 246-256.
[2] QIAN Qing, CHEN Huicheng, CUI Yunhe, TANG Ruixue, FU Jinmei. Joint Entity and Relation Extraction Method with Multi-scale Collaborative Aggregation and Axial-semantic Guidance [J]. Computer Science, 2026, 53(3): 97-106.
[3] GE Zeqing, HUANG Shengjun. Semi-supervised Learning Method for Multi-label Tabular Data [J]. Computer Science, 2026, 53(3): 151-157.
[4] CHANG Xuanwei, DUAN Liguo, CHEN Jiahao, CUI Juanjuan, LI Aiping. Method for Span-level Sentiment Triplet Extraction by Deeply Integrating Syntactic and Semantic
Features
[J]. Computer Science, 2026, 53(2): 322-330.
[5] ZHANG Jing, PAN Jinghao, JIANG Wenchao. Background Structure-aware Few-shot Knowledge Graph Completion [J]. Computer Science, 2026, 53(2): 331-341.
[6] ZHUO Tienong, YING Di, ZHAO Hui. Research on Student Classroom Concentration Integrating Cross-modal Attention and Role
Interaction
[J]. Computer Science, 2026, 53(2): 67-77.
[7] XU Jingtao, YANG Yan, JIANG Yongquan. Time-Frequency Attention Based Model for Time Series Anomaly Detection [J]. Computer Science, 2026, 53(2): 161-169.
[8] HAN Lei, SHANG Haoyu, QIAN Xiaoyan, GU Yan, LIU Qingsong, WANG Chuang. Constrained Multi-loss Video Anomaly Detection with Dual-branch Feature Fusion [J]. Computer Science, 2026, 53(2): 236-244.
[9] GUO Xingxing, XIAO Yannan, WEN Peizhi, XU Zhi, HUANG Wenming. Attention-based Audio-driven Digital Face Video Generation Method [J]. Computer Science, 2026, 53(2): 245-252.
[10] JI Sai, QIAO Liwei, SUN Yajie. Semantic-guided Hybrid Cross-feature Fusion Method for Infrared and Visible Light Images [J]. Computer Science, 2026, 53(2): 253-263.
[11] LYU Jinggang, GAO Shuo, LI Yuzhi, ZHOU Jin. Facial Expression Recognition with Channel Attention Guided Global-Local Semantic Cooperation [J]. Computer Science, 2026, 53(1): 195-205.
[12] FAN Jiabin, WANG Baohui, CHEN Jixuan. Method for Symbol Detection in Substation Layout Diagrams Based on Text-Image MultimodalFusion [J]. Computer Science, 2026, 53(1): 206-215.
[13] WANG Haoyan, LI Chongshou, LI Tianrui. Reinforcement Learning Method for Solving Flexible Job Shop Scheduling Problem Based onDouble Layer Attention Network [J]. Computer Science, 2026, 53(1): 231-240.
[14] CHEN Qian, CHENG Kaixuan, GUO Xin, ZHANG Xiaoxia, WANG Suge, LI Yanhong. Bidirectional Prompt-Tuning for Event Argument Extraction with Topic and Entity Embeddings [J]. Computer Science, 2026, 53(1): 278-284.
[15] PENG Jiao, HE Yue, SHANG Xiaoran, HU Saier, ZHANG Bo, CHANG Yongjuan, OU Zhonghong, LU Yanyan, JIANG dan, LIU Yaduo. Text-Dynamic Image Cross-modal Retrieval Algorithm Based on Progressive Prototype Matching [J]. Computer Science, 2025, 52(9): 276-281.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!