Computer Science ›› 2026, Vol. 53 ›› Issue (6): 252-262.doi: 10.11896/jsjkx.250400032

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Power Object Detection Based on Spatial Interaction and Split Attention in Few-shots

WU Man1,2, WANG Gaocai3, LU Yuting1, WEN Lili2,3   

  1. 1 School of Electrical Engineering,Guangxi University,Nanning 530004,China
    2 Research Center for Carbon Sink and Low-Carbon Engineering in the Beibu Gulf of Guangxi,Guangxi Academy of Sciences,
    Nanning 530007,China
    3 School of Computer,Electronics and Information,Guangxi University,Nanning 530004,China
  • Received:2025-04-08 Revised:2025-07-04 Online:2026-06-15 Published:2026-06-09
  • About author:WU Man,born in 1985,Ph.D,professor-level senior engineer,master's supervisor.His main research interests include power system automation and machine vision algorithms.
    WANG Gaocai,born in 1976,Ph.D,professor,Ph.D supervisor.His main research interests include computer networks,system performance evaluation and machine vision.
  • Supported by:
    Natural Science Foundation of Guangxi(2025GXNSFAA069673,2025GXNSFBA069129) and Science and Technology Major Project of Guangxi(Guike-AA22068072).

Abstract: Power line inspection is a core task for preventing power outages,ensuring the safe and stable operation of power grids,and promoting economic development.In practical applications,challenges such as limited defect sample numbers,diverse equipment shapes,target occlusion/adhesion,and sample imbalance are often encountered.To address these issues,this paper proposes a novel two-stage object detection network,AKS2-Net,which integrates attention and metric learning improvements.This network combines multiple attention mechanisms for feature extraction and fusion,and uses metric learning for secondary scree-ning of candidate targets,enhancing the extraction and fusion of feature information for irregular,small/fuzzy,and occluded targets,and reducing the impact of few samples and sample imbalance on network performance.Specifically,1)It designs a group convolution feature extraction network AKS2 block(AKConv,Spatial-shift and Split-attention Block)based on variable convolution and spatial shift,making spatial information interaction between image features and relationship learning between feature channels possible,thereby enhancing the network's ability to mine irregular and multi-scale feature information.2)It proposes a novel multi-branch attention multi-scale feature fusion(MAFF)module,which fuses multi-channel and multi-level image detail features and spatial information through atrous spatial pyramid pooling(ASPP)and mixed skip connections(MSC),thereby improving segmentation accuracy and boundary localization ability in complex scenes.3)It proposes a feature similarity calculation method based on metric learning,which re-screens candidate negative samples by calculating the similarity between negative sample features selected by the region proposal network and all support category features,and combines a threshold to correct positive samples misjudged as negative samples,reducing interference with network training and lowering the missed detection rate of small and fuzzy targets.4)It introduces the FocalLoss function in the classification loss calculation to mitigate the impact of sample imbalance on detection results.5)Based on AKS2-Net as the backbone,a two-stage fine-tuning-based object detection network suitable for small sample and imbalanced sample conditions is constructed,providing a new option for small sample object detection through the fine-tuning mechanism.A large number of experimental results show that the proposed method performs well on the power line object detection dataset,especially enhancing the network's detection ability for far small/fuzzy and occluded objects,and has significant practical value.Additionally,under similar experimental conditions using various datasets including small sample datasets,the proposed model demonstrates stronger competitiveness compared to existing object detection networks such as ResNet 50,ResNet 101,Inception ResNet,ResNeXt 101,and ResNeSt 101.

Key words: Few-shot, Metric learning, AKConv, Spatial-shift, Split-Attention, Feature fusion

CLC Number: 

  • TM755
[1]LI S,ZHOU H,WANG G,et al.Cracked Insulator Detection Based on R-FCN[C]//3rd Annual International Conference on Information System and Artificial Intelligence(ISAI2018).2018.
[2]LEI X,SUI Z.Intelligent fault detection of high voltage linebased on the Faster R-CNN[J].Measurement,2019,138:379-385.
[3]LIU X,LI Y,SHUANG F,et al.ISSD:Improved SSD for Insulator and Spacer Online Detection Based on UAV System[J].Sensors,2020,20(23):6961.
[4]YANG Y,WANG L.Insulator recognition based on convolution neural network[C]//MATEC Web of Conferences.EDP Sciences,2017.
[5]LIU X,JIANG H,CHEN J,et al.Insulator detection in aerial images based on faster regions with convolutional neural network[C]//2018 IEEE 14th International Conference on Control and Automation(ICCA).IEEE,2018:1082-1086.
[6]ZHAO Z N,LIU Q F,CAO W M,et al.Self-guided Information for Few-shot Classification[J].Pattern Recognition,2022,131:108880.
[7]DAI J,LI Y,HE K,et al.R-fcn:Object detection via region-based fully convolutional networks[C]//Advances in Neural Information Processing Systems.2016:379-387.
[8]BAI Y F,WANG L B,GAO W D,et al.Multi-modal hierarchicalclassification for power equipment defect detection.Journal of Image and Graphics,29(7):2011-2023.
[9]WU Y,LIU L,XIAO Y,et al.Power Device Image Recognition Based on Improved Attention Mechanism[J].Proceedings of the CSEE,2025,45(3):870-883.
[10]LI L F,MA W F,LI L,et al.Research on detection algorithm for bridge cracks based on deep learning[J].Acta Automatica Sinica,2019,45(9):1727-1742.
[11]ZHAO Z,TANG P,ZHAO L,et al.Few-Shot Object Detection of Remote Sensing Images via Two-Stage Fine-Tuning[J].IEEE Geoscience and Remote Sensing Letters,2022,19:1-5.
[12]DAI J,QI H,XIONG Y,et al Deformable convolutional net-works[C]//Proceedings of the 2017 IEEE International Conference on Computer Vision.IEEE,2017:764-773.
[13]WU A,HAN Y,ZHU L,et al.Universal-prototype enhancing for few-shot object detection[C]//Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2021:9567-9576.
[14]HU H,BAI S,LI A,et al.Dense relation distillation with context-aware aggregation for few-shot object detection[C]//Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2021:10185-10194.
[15]KANG B,LIU Z,WANG X,et al.Few-shot object detection via feature reweighting[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2019:8420-8429.
[16]WANG X,HUANG T E,DARRELL T,et al.Frustratinglysimple few-shot object detection [C]//Proceedings of the 2020 International Conference on Machine Learning.New York:PMLR,2020:9919-9928.
[17]YAN X,CHEN Z,XU A,et al.Meta r-cnn:Towards general solver for instance-level low-shot learning[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2019:9577-9586.
[18]YAN X,CHEN Z,XU A,et al.Meta r-cnn:Towards general solver for instance-level low-shot learning[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2019:9577-9586.
[19]HAN G,HE Y,HUANG S,et al.Query adaptive few-shot object detection with heterogeneous graph convolutional networks[C]//Proceedings of the 2021 IEEE/CVF International Confe-rence on Computer Vision.Piscataway,NJ:IEEE,2021:3263-3272.
[20]XIAO Y,LEPETIT V,MARLET R.Few-shot object detection and viewpoint estimation for objects in the wild [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2022,45(3):3090-3106.
[21]KANG B,LIU Z,WANG X,et al.Few-shot object detection via feature reweighting[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2019:8420-8429.
[22]WANG Y X,RAMANAN D,HEBERT M.Meta-learning to detect rare objects[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2019:9925-9934.
[23]WANG X,HUANG T E,DARRELL T,et al.Frustratinglysimple few-shot object detection [C]//Proceedings of the 2020 International Conference on Machine Learning.New York:PMLR,2020:9919-9928.
[24]ZHU C,CHEN F,AHMED U,et al.Semantic relation reasoning for shot-stable few-shot object detection[C]//Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2021:8782-8791.
[25]SUN B,LI B,CAI S,et al.Fsce:Few-shot object detection via contrastive proposal encoding[C]//Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2021:7352-7362.
[26]FAN Z,MA Y,LI Z,et al.Generalized few-shot object detection without forgetting[C]//Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Pisca-taway,NJ:IEEE,2021:4527-4536.
[27]WU A,HAN Y,ZHU L,et al.Universal-prototype enhancing for few-shot object detection[C]//Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2021:9567-9576.
[28]HU H,BAI S,LI A,et al.Dense relation distillation with context-aware aggregation for few-shot object detection[C]//Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2021:10185-10194.
[1] LIU Jikang, HUANG Lei, ZHANG Ke, NIE Jie, WEI Zhiqiang. Object Detection Method Based on Dynamic Feature Fusion [J]. Computer Science, 2026, 53(6): 263-269.
[2] LI Wenli, FENG Xiaonian, QIAN Tieyun. Few-shot Continuous Toxicity Detection Based on Large Language Model Augmentation [J]. Computer Science, 2026, 53(3): 321-330.
[3] SONG Jianhua, HE Jiawei, ZHANG Yan. Dual-channel Source Code Vulnerability Detection Model Based on Contrastive Learning [J]. Computer Science, 2026, 53(3): 424-432.
[4] HUANG Jing, WANG Teng, LIU Jian, HU Kai, PENG Xin, HUANG Yamin, WEN Yuanqiao. Multimodal Visual Detection for Underwater Sonar Target Images [J]. Computer Science, 2026, 53(2): 227-235.
[5] LIU Chenhong, LI Fenglian, YANG Jia, WANG Suzhe, CHEN Guijun. Boundary-focused Multi-scale Feature Fusion Network for Stroke Lesion Segmentation [J]. Computer Science, 2026, 53(2): 264-272.
[6] CHEN Lin, MA Longxuan, ZHANG Yongbing, HUANG Yuxin, GAO Shengxiang, YU Zhengtao. Industrial Text Classification for Chinese and Vietnamese Based on Prompt Learning and AdaptiveLoss Weighting [J]. Computer Science, 2026, 53(2): 312-321.
[7] ZHANG Jing, PAN Jinghao, JIANG Wenchao. Background Structure-aware Few-shot Knowledge Graph Completion [J]. Computer Science, 2026, 53(2): 331-341.
[8] FAN Jiabin, WANG Baohui, CHEN Jixuan. Method for Symbol Detection in Substation Layout Diagrams Based on Text-Image MultimodalFusion [J]. Computer Science, 2026, 53(1): 206-215.
[9] DUAN Pengting, WEN Chao, WANG Baoping, WANG Zhenni. Collaborative Semantics Fusion for Multi-agent Behavior Decision-making [J]. Computer Science, 2026, 53(1): 252-261.
[10] ZHANG Xiaomin, ZHAO Junzhi, HE Hongjie. Screen-shooting Resilient Watermarking Method for Document Image Based on Attention Mechanism [J]. Computer Science, 2026, 53(1): 413-422.
[11] LUO Chi, LU Lingyun, LIU Fei. Partial Differential Equation Solving Method Based on Locally Enhanced Fourier NeuralOperators [J]. Computer Science, 2025, 52(9): 144-151.
[12] GUO Husheng, ZHANG Xufei, SUN Yujie, WANG Wenjian. Continuously Evolution Streaming Graph Neural Network [J]. Computer Science, 2025, 52(8): 118-126.
[13] SHEN Tao, ZHANG Xiuzai, XU Dai. Improved RT-DETR Algorithm for Small Object Detection in Remote Sensing Images [J]. Computer Science, 2025, 52(8): 214-221.
[14] WANG Jia, XIA Ying, FENG Jiangfan. Few-shot Video Action Recognition Based on Two-stage Spatio-Temporal Alignment [J]. Computer Science, 2025, 52(8): 251-258.
[15] LUO Xuyang, TAN Zhiyi. Knowledge-aware Graph Refinement Network for Recommendation [J]. Computer Science, 2025, 52(7): 103-109.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!