Computer Science ›› 2023, Vol. 50 ›› Issue (8): 93-98.doi: 10.11896/jsjkx.220600258

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Adaptive Object Counting Model for Aerial Imagery

WEI Chang, GUAN Jihong, ZHANG Yichao, LI Wengen   

  1. School of Electronic and Information Engineering,Tongji University,Shanghai 201800,China
  • Received:2022-06-18 Revised:2022-11-14 Online:2023-08-15 Published:2023-08-02
  • About author:WEI Chang,born in 1999,master,is a student member of China Computer Federation.His main research interests include target counting,spatiotemporal data management and analysis.
    GUAN Jihong,professor,doctoral supervisor,is a member of China Computer Federation.Her main research interests include artificial intelligence,spatio-temporal data management and analysis,network modeling,game and colla-boration,application-oriented big data analysis and application services.
  • Supported by:
    Joint Funds of the National Natural Science Foundation of China(U1936205).

Abstract: Object counting aims to obtain the number of specific types of objects such as vehicles,buildings,people contained in a given image,which is of great significance to urban planning,emergency response,national security,etc.The current object coun-ting task mainly relies on the images taken by low-altitude cameras,and there are obvious problems such as the object being easily occluded and the small counting space range.Widespread use of high-definition aerial remote sensing imagery makes it possible to count objects in large areas.However,the object counting task for aerial images has challenges such as large differences in object scales,dense distribution,and uncertain orientation.Existing object detection counting models and regression counting models based on low-altitude images are not suitable for object counting in aerial images.To solve this problem,this paper proposes an adaptive object counting model for aerial images.Firstly,the geometric adaptive Gaussian convolution method is used to solve the problem of object scale variation.Then,the image loss judgment method based on structural similarity is used to solve the pro-blem of poor counting stability of object dense regions.Experimental analysis shows that the proposed model can achieve better object count accuracy than the benchmark model.

Key words: Object counting, Aerial imagery, Regression counting, Gaussian convolution, Structural similarity

CLC Number: 

  • TP391
[1]ZHANG Y,ZHOU D,CHEN S,et al.Single-image crowdcounting via multi-column convolutional neural network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:589-597.
[2]WANG Y,ZOU Y.Fast visual object counting via example-based density estimation[C]//2016 IEEE International Confe-rence on Image Processing(ICIP).IEEE,2016:3653-3657.
[3]FRENCH G,FISHER M,MACKIEWICZ M,et al.Convolu-tional neural networks for counting fish in fisheries surveillance video[C]//Workshop on Machine Vision of Animals and their Behaviour.BMVA Press,2015:7.1-7.10.
[4]DOLLAR P,WOJEK C,SCHIELE B,et al.Pedestrian detec-tion:a benchmark[C]//2009 IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2009:304-311.
[5]UIJLINGS J R,VAN D S K E A,GEVERS T,et al.Selective search for object recognition[J].International Journal of Computer Vision,2013,104(2):154-171.
[6]LAZEBNIK S,SCHMID C,PONCE J.Beyond Bags of Fea-tures:Spatial Pyramid Matching for Recognizing Natural Scene Categories[C]//Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.Paris:IEEE,2006:2169-2178.
[7]YU Y,ZHANG J,HUANG Y,et al.Object detection by context and boosted HOG-LBP[C]//ECCV Workshop on PASCAL VOC.2010.
[8]GIRSHICK R.Fast R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition.Santiago:IEEE,2015:1440-1448.
[9]REN S,HE K,GIRSHICK R,et al.Faster R-CNN:TowardsReal-Time Object Detection with Region Proposal Networks[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149.
[10]HE K,GKIOXARI G,DOLLR P,et al.Mask R-CNN[C]//Pro-ceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Venice:IEEE,2017:2961-2969.
[11]REDMON J,FARHADI A.YOLO9000:Better,Faster,Stronger[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE,2017:7263-7271.
[12]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single ShotMulti Box Detector[C]//European Conference on Computer Vision.Cham:Springer,2016:21-37.
[13]XIE E,DING J,WANG W,et al.Detco:Unsupervised contrastive learning for object detection[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:8392-8401.
[14]XU M,ZHANG Z,HU H,et al.End-to-end semi-supervised object detection with soft teacher[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:3060-3069.
[15]DAI X,CHEN Y,YANG J,et al.Dynamic detr:End-to-end object detection with dynamic attention[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:2988-2997.
[16]DAI X,CHEN Y,XIAO B,et al.Dynamic head:Unifying object detection heads with attentions[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:7373-7382.
[17]WANG C,ZHANG H,YANG L,et al.Deep people counting in extremely dense crowds[C]//Proceedings of the 23rd ACM International Conference on Multimedia.New York:ACM,2015:1299-1302.
[18]ZHANG C,LI H S,WANG X G,et al.Cross-scene crowdcounting via deep convolutional neural networks [C] //Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway,NJ:IEEE,2015:833-841.
[19]LIANG D,CHEN X,XU W,et al.TransCrowd:weakly-supervised crowd counting with transformers[J].Science China Information Sciences,2022,65(6):1-14.
[20]SUN G,LIU Y,PROBST T,et al.Boosting crowd counting with transformers[J].arXiv:2105.10926,2021.
[21]GAO G,LIU Q,WANG Y.Counting dense objects in remotesensing images[C]//ICASSP 2020-2020 IEEE International Conference on Acoustics,Speech and Signal Processing(IC-ASSP).IEEE,2020:4137-4141.
[22]DING G,CUI M,YANG D,et al.Object Counting for Remote-
Sensing Images via Adaptive Density Map-Assisted Learning[J].IEEE Transactions on Geoscience and Remote Sensing,2022,60:1-11.
[23]LIU Z,LIN Y,CAO Y,et al.Swin transformer:Hierarchical vision transformer using shifted windows[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:10012-10022.
[24]CAO X,WANG Z,ZHAO Y,et al.Scale aggregation network for accurate and efficient crowd counting[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:734-750.
[25]MUNDHENK T N,KONJEVOD G,SAKLA W A,et al.Alarge contextual dataset for classification,detection and counting of cars with deep learning[C]//European Conference on Computer Vision.Cham:Springer,2016:785-800.
[26]HSIEH M R,LIN Y L,HSU W H.Drone-based object counting by spatially regularized regional proposal network[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:4145-4153.
[27]XIA G S,BAI X,DING J,et al.DOTA:A large-scale dataset for object detection in aerial images[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:3974-3983.
[1] HE Tao, ZHAO Ting, XU He. Novel Algorithm of Single Image Dehazing Based on Dark Channel Prior [J]. Computer Science, 2021, 48(7): 219-224.
[2] HUANG Tao, HUANG Ren and ZHANG Kun. Improved Collaborative Filtering Recommendation Algorithm [J]. Computer Science, 2016, 43(Z6): 400-403.
[3] GU Zhi-peng and HE Xin-guang. Remote Sensing Images Fusion Method Coupling Contourlet Transform with Particle Swarm Optimization [J]. Computer Science, 2016, 43(Z11): 223-228.
[4] ZHAO Chuan,MA Xi-rong,MA Ling and ZHANG Tong. Adaptive Denoising Method Based on Anisotropic Diffusion Equation [J]. Computer Science, 2014, 41(Z11): 133-135.
[5] SANG Qing-bing,LIANG Di-lin,WU Xiao-jun and LI Chao-feng. Gradient Structural Similarity Image Assessment Index Based on Dilation [J]. Computer Science, 2014, 41(6): 287-290.
[6] ZHANG Rui-xue,SONG Ming-qiu,GONG Yan-lei. Parsing DOM Tree Reversely and Extracting Web Main Page Information [J]. Computer Science, 2011, 38(4): 213-215.
[7] XIANG You-jun, LEI Na, YU Wei-yu, XIE Sheng-li. Research of Block Matching Criterion for Motion Estimation [J]. Computer Science, 2009, 36(9): 278-280.
[8] JI Guo-li , NI Xiao-ming. Image Quality Assessment Algorithm Based on High Frequency Band of Haar Wavelet Transform [J]. Computer Science, 2009, 36(10): 262-264.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!