小样本下基于空间交互和分割注意力的电力目标检测

doi:10.11896/jsjkx.250400032

Abstract

Abstract: Power line inspection is a core task for preventing power outages,ensuring the safe and stable operation of power grids,and promoting economic development.In practical applications,challenges such as limited defect sample numbers,diverse equipment shapes,target occlusion/adhesion,and sample imbalance are often encountered.To address these issues,this paper proposes a novel two-stage object detection network,AKS²-Net,which integrates attention and metric learning improvements.This network combines multiple attention mechanisms for feature extraction and fusion,and uses metric learning for secondary scree-ning of candidate targets,enhancing the extraction and fusion of feature information for irregular,small/fuzzy,and occluded targets,and reducing the impact of few samples and sample imbalance on network performance.Specifically,1)It designs a group convolution feature extraction network AKS² block(AKConv,Spatial-shift and Split-attention Block)based on variable convolution and spatial shift,making spatial information interaction between image features and relationship learning between feature channels possible,thereby enhancing the network's ability to mine irregular and multi-scale feature information.2)It proposes a novel multi-branch attention multi-scale feature fusion(MAFF)module,which fuses multi-channel and multi-level image detail features and spatial information through atrous spatial pyramid pooling(ASPP)and mixed skip connections(MSC),thereby improving segmentation accuracy and boundary localization ability in complex scenes.3)It proposes a feature similarity calculation method based on metric learning,which re-screens candidate negative samples by calculating the similarity between negative sample features selected by the region proposal network and all support category features,and combines a threshold to correct positive samples misjudged as negative samples,reducing interference with network training and lowering the missed detection rate of small and fuzzy targets.4)It introduces the FocalLoss function in the classification loss calculation to mitigate the impact of sample imbalance on detection results.5)Based on AKS²-Net as the backbone,a two-stage fine-tuning-based object detection network suitable for small sample and imbalanced sample conditions is constructed,providing a new option for small sample object detection through the fine-tuning mechanism.A large number of experimental results show that the proposed method performs well on the power line object detection dataset,especially enhancing the network's detection ability for far small/fuzzy and occluded objects,and has significant practical value.Additionally,under similar experimental conditions using various datasets including small sample datasets,the proposed model demonstrates stronger competitiveness compared to existing object detection networks such as ResNet 50,ResNet 101,Inception ResNet,ResNeXt 101,and ResNeSt 101.

Key words: Few-shot, Metric learning, AKConv, Spatial-shift, Split-Attention, Feature fusion

CLC Number:

TM755

WU Man, WANG Gaocai, LU Yuting, WEN Lili. Power Object Detection Based on Spatial Interaction and Split Attention in Few-shots[J].Computer Science, 2026, 53(6): 252-262.

References

[1]LI S,ZHOU H,WANG G,et al.Cracked Insulator Detection Based on R-FCN[C]//3rd Annual International Conference on Information System and Artificial Intelligence(ISAI2018).2018.
[2]LEI X,SUI Z.Intelligent fault detection of high voltage linebased on the Faster R-CNN[J].Measurement,2019,138:379-385.
[3]LIU X,LI Y,SHUANG F,et al.ISSD:Improved SSD for Insulator and Spacer Online Detection Based on UAV System[J].Sensors,2020,20(23):6961.
[4]YANG Y,WANG L.Insulator recognition based on convolution neural network[C]//MATEC Web of Conferences.EDP Sciences,2017.
[5]LIU X,JIANG H,CHEN J,et al.Insulator detection in aerial images based on faster regions with convolutional neural network[C]//2018 IEEE 14th International Conference on Control and Automation(ICCA).IEEE,2018:1082-1086.
[6]ZHAO Z N,LIU Q F,CAO W M,et al.Self-guided Information for Few-shot Classification[J].Pattern Recognition,2022,131:108880.
[7]DAI J,LI Y,HE K,et al.R-fcn:Object detection via region-based fully convolutional networks[C]//Advances in Neural Information Processing Systems.2016:379-387.
[8]BAI Y F,WANG L B,GAO W D,et al.Multi-modal hierarchicalclassification for power equipment defect detection.Journal of Image and Graphics,29(7):2011-2023.
[9]WU Y,LIU L,XIAO Y,et al.Power Device Image Recognition Based on Improved Attention Mechanism[J].Proceedings of the CSEE,2025,45(3):870-883.
[10]LI L F,MA W F,LI L,et al.Research on detection algorithm for bridge cracks based on deep learning[J].Acta Automatica Sinica,2019,45(9):1727-1742.
[11]ZHAO Z,TANG P,ZHAO L,et al.Few-Shot Object Detection of Remote Sensing Images via Two-Stage Fine-Tuning[J].IEEE Geoscience and Remote Sensing Letters,2022,19:1-5.
[12]DAI J,QI H,XIONG Y,et al Deformable convolutional net-works[C]//Proceedings of the 2017 IEEE International Conference on Computer Vision.IEEE,2017:764-773.
[13]WU A,HAN Y,ZHU L,et al.Universal-prototype enhancing for few-shot object detection[C]//Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2021:9567-9576.
[14]HU H,BAI S,LI A,et al.Dense relation distillation with context-aware aggregation for few-shot object detection[C]//Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2021:10185-10194.
[15]KANG B,LIU Z,WANG X,et al.Few-shot object detection via feature reweighting[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2019:8420-8429.
[16]WANG X,HUANG T E,DARRELL T,et al.Frustratinglysimple few-shot object detection [C]//Proceedings of the 2020 International Conference on Machine Learning.New York:PMLR,2020:9919-9928.
[17]YAN X,CHEN Z,XU A,et al.Meta r-cnn:Towards general solver for instance-level low-shot learning[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2019:9577-9586.
[18]YAN X,CHEN Z,XU A,et al.Meta r-cnn:Towards general solver for instance-level low-shot learning[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2019:9577-9586.
[19]HAN G,HE Y,HUANG S,et al.Query adaptive few-shot object detection with heterogeneous graph convolutional networks[C]//Proceedings of the 2021 IEEE/CVF International Confe-rence on Computer Vision.Piscataway,NJ:IEEE,2021:3263-3272.
[20]XIAO Y,LEPETIT V,MARLET R.Few-shot object detection and viewpoint estimation for objects in the wild [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2022,45(3):3090-3106.
[21]KANG B,LIU Z,WANG X,et al.Few-shot object detection via feature reweighting[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2019:8420-8429.
[22]WANG Y X,RAMANAN D,HEBERT M.Meta-learning to detect rare objects[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2019:9925-9934.
[23]WANG X,HUANG T E,DARRELL T,et al.Frustratinglysimple few-shot object detection [C]//Proceedings of the 2020 International Conference on Machine Learning.New York:PMLR,2020:9919-9928.
[24]ZHU C,CHEN F,AHMED U,et al.Semantic relation reasoning for shot-stable few-shot object detection[C]//Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2021:8782-8791.
[25]SUN B,LI B,CAI S,et al.Fsce:Few-shot object detection via contrastive proposal encoding[C]//Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2021:7352-7362.
[26]FAN Z,MA Y,LI Z,et al.Generalized few-shot object detection without forgetting[C]//Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Pisca-taway,NJ:IEEE,2021:4527-4536.
[27]WU A,HAN Y,ZHU L,et al.Universal-prototype enhancing for few-shot object detection[C]//Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision.Piscataway,NJ:IEEE,2021:9567-9576.
[28]HU H,BAI S,LI A,et al.Dense relation distillation with context-aware aggregation for few-shot object detection[C]//Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2021:10185-10194.

Related Articles 15

[1]	LIU Jikang, HUANG Lei, ZHANG Ke, NIE Jie, WEI Zhiqiang. Object Detection Method Based on Dynamic Feature Fusion [J]. Computer Science, 2026, 53(6): 263-269.
[2]	LI Wenli, FENG Xiaonian, QIAN Tieyun. Few-shot Continuous Toxicity Detection Based on Large Language Model Augmentation [J]. Computer Science, 2026, 53(3): 321-330.
[3]	SONG Jianhua, HE Jiawei, ZHANG Yan. Dual-channel Source Code Vulnerability Detection Model Based on Contrastive Learning [J]. Computer Science, 2026, 53(3): 424-432.
[4]	HUANG Jing, WANG Teng, LIU Jian, HU Kai, PENG Xin, HUANG Yamin, WEN Yuanqiao. Multimodal Visual Detection for Underwater Sonar Target Images [J]. Computer Science, 2026, 53(2): 227-235.
[5]	LIU Chenhong, LI Fenglian, YANG Jia, WANG Suzhe, CHEN Guijun. Boundary-focused Multi-scale Feature Fusion Network for Stroke Lesion Segmentation [J]. Computer Science, 2026, 53(2): 264-272.
[6]	CHEN Lin, MA Longxuan, ZHANG Yongbing, HUANG Yuxin, GAO Shengxiang, YU Zhengtao. Industrial Text Classification for Chinese and Vietnamese Based on Prompt Learning and AdaptiveLoss Weighting [J]. Computer Science, 2026, 53(2): 312-321.
[7]	ZHANG Jing, PAN Jinghao, JIANG Wenchao. Background Structure-aware Few-shot Knowledge Graph Completion [J]. Computer Science, 2026, 53(2): 331-341.
[8]	FAN Jiabin, WANG Baohui, CHEN Jixuan. Method for Symbol Detection in Substation Layout Diagrams Based on Text-Image MultimodalFusion [J]. Computer Science, 2026, 53(1): 206-215.
[9]	DUAN Pengting, WEN Chao, WANG Baoping, WANG Zhenni. Collaborative Semantics Fusion for Multi-agent Behavior Decision-making [J]. Computer Science, 2026, 53(1): 252-261.
[10]	ZHANG Xiaomin, ZHAO Junzhi, HE Hongjie. Screen-shooting Resilient Watermarking Method for Document Image Based on Attention Mechanism [J]. Computer Science, 2026, 53(1): 413-422.
[11]	LUO Chi, LU Lingyun, LIU Fei. Partial Differential Equation Solving Method Based on Locally Enhanced Fourier NeuralOperators [J]. Computer Science, 2025, 52(9): 144-151.
[12]	GUO Husheng, ZHANG Xufei, SUN Yujie, WANG Wenjian. Continuously Evolution Streaming Graph Neural Network [J]. Computer Science, 2025, 52(8): 118-126.
[13]	SHEN Tao, ZHANG Xiuzai, XU Dai. Improved RT-DETR Algorithm for Small Object Detection in Remote Sensing Images [J]. Computer Science, 2025, 52(8): 214-221.
[14]	WANG Jia, XIA Ying, FENG Jiangfan. Few-shot Video Action Recognition Based on Two-stage Spatio-Temporal Alignment [J]. Computer Science, 2025, 52(8): 251-258.
[15]	LUO Xuyang, TAN Zhiyi. Knowledge-aware Graph Refinement Network for Recommendation [J]. Computer Science, 2025, 52(7): 103-109.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Power Object Detection Based on Spatial Interaction and Split Attention in Few-shots

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0