Small Object Detection in 3D Urban Scenes

doi:10.11896/jsjkx.210400174

Computer Science, 2022, Vol. 49, Issue (6): 238-244.

• Computer Graphics & Multimedia •

CHEN Jia-zhou¹, ZHAO Yi-bo¹, XU Yang-hui¹, MA Ji¹, JIN Ling-feng¹,², QIN Xu-jia¹

1 College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310012, China
2 Digital Space Technology R&D Center, Southeast Digital Economic Development Institute, Quzhou, Zhejiang 324000, China

Received: 2021-04-17  Revised: 2021-08-09  Online: 2022-06-15  Published: 2022-06-08
Corresponding author: MA Ji (maji@zjut.edu.cn)
About author: CHEN Jia-zhou (cjz@zjut.edu.cn), born in 1984, Ph.D, associate professor, master supervisor, is a member of China Computer Federation. His main research interests include computer graphics and visual analysis.
MA Ji, born in 1985, Ph.D, lecturer, is a member of China Computer Federation. His main research interests include data visualization.
Supported by:
Science and Technology Protection Project of Cultural Relics in Zhejiang Province (2020014), National Natural Science Foundation of China (61902350) and Science and Technology Project of Quzhou (2019K38).

Abstract

3D object detection is a key step in the semantic analysis of 3D urban scenes. Existing object detection methods, however, mainly focus on large objects such as buildings and roads, and their accuracy on small objects such as street lamps and manhole covers is low. To address this, a multi-view small object detection method for 3D urban scenes is proposed. It builds on oblique photography and combines it with precise 3D localization to improve the detection accuracy of small objects in 3D urban scenes. First, small urban objects are detected in the original UAV images using a deep learning method. Then, these image detection results are back-projected onto the 3D urban model. Finally, the back-projected results are clustered to obtain the final 3D detection results. Experimental results show that the proposed method can automatically detect small urban objects such as manhole covers and windows on large-scale 3D urban models reconstructed by oblique photogrammetry, is not affected by line-of-sight occlusion, and achieves higher accuracy and stability than object detection on orthophoto maps.

Key words: 3D urban model, Clustering, Multi-view, Object detection, Small objects
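The back-projection and clustering steps summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not specify the camera model, the ray-to-model intersection, or the clustering algorithm, so the code below assumes a pinhole camera, replaces the 3D urban model with a flat plane `z = ground_z`, and uses simple single-linkage grouping with a distance threshold. The names `back_project`, `cluster_points`, `eps`, and `min_views` are hypothetical.

```python
from math import dist


def back_project(pixel, intrinsics, rotation, center, ground_z=0.0):
    """Cast a ray through `pixel` and intersect it with the plane z = ground_z.

    Assumption: the plane is a stand-in for the 3D urban model.
    intrinsics = (fx, fy, cx, cy); rotation is a camera-to-world 3x3 matrix
    (list of rows); center is the camera position in world coordinates.
    Returns None if the ray is parallel to the plane or points away from it.
    """
    u, v = pixel
    fx, fy, cx, cy = intrinsics
    ray_cam = ((u - cx) / fx, (v - cy) / fy, 1.0)
    # Rotate the viewing ray from camera coordinates into world coordinates.
    ray = tuple(sum(rotation[i][j] * ray_cam[j] for j in range(3)) for i in range(3))
    if abs(ray[2]) < 1e-12:
        return None
    t = (ground_z - center[2]) / ray[2]
    if t <= 0:
        return None
    return tuple(center[i] + t * ray[i] for i in range(3))


def cluster_points(points, eps=0.5, min_views=2):
    """Group points closer than `eps` and return one centroid per group.

    Groups with fewer than `min_views` points are discarded, which drops
    detections that are supported by a single image only.
    """
    parent = list(range(len(points)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if dist(points[i], points[j]) <= eps:
                parent[find(i)] = find(j)

    groups = {}
    for i, p in enumerate(points):
        groups.setdefault(find(i), []).append(p)
    return [
        tuple(sum(c) / len(g) for c in zip(*g))
        for g in groups.values()
        if len(g) >= min_views
    ]


# Two downward-looking cameras 100 m above the plane see the same object;
# a third detection appears in one image only and is filtered out.
nadir = [[1, 0, 0], [0, -1, 0], [0, 0, -1]]
K = (1000.0, 1000.0, 500.0, 500.0)
hits = [
    back_project((600, 500), K, nadir, (0.0, 0.0, 100.0)),
    back_project((400, 500), K, nadir, (20.0, 0.0, 100.0)),
    back_project((900, 500), K, nadir, (0.0, 0.0, 100.0)),
]
print(cluster_points([h for h in hits if h is not None]))
```

Requiring support from at least `min_views` images is one plausible way to suppress single-view false positives. In a real pipeline the plane intersection would be replaced by a ray cast against the reconstructed mesh, and the thresholds would need tuning to the scene scale.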

CLC Number:

TP391

CHEN Jia-zhou, ZHAO Yi-bo, XU Yang-hui, MA Ji, JIN Ling-feng, QIN Xu-jia. Small Object Detection in 3D Urban Scenes[J]. Computer Science, 2022, 49(6): 238-244. https://doi.org/10.11896/jsjkx.210400174
