Computer Science ›› 2020, Vol. 47 ›› Issue (7): 192-198.doi: 10.11896/jsjkx.190700180

• Artificial Intelligence • Previous Articles     Next Articles

Point Cloud Deep Learning Network Based on Dynamic Graph Convolution and Spatial Pyramid Pooling

ZHU Wei1,2, SHENG Rong-jin1, TANG Ru1, HE De-feng1,2   

  1. 1 College of Information Engineering,Zhejiang University of Technology,Hangzhou 310023,China
    2 United Key Laboratory of Embedded System of Zhejiang Province,Hangzhou 310023,China
  • Received:2019-07-26 Online:2020-07-15 Published:2020-07-16
  • About author:ZHU Wei,born in 1982,Ph.D,associate professor.His main research interests include video processing,machine learning and intelligent robot.
  • Supported by:
    This work was supported by the Natural Science Foundation of Zhejiang Province (LY17F010013) and National Natural Science Foundation of China (61401398)

Abstract: The classification and semantic segmentation of point cloud data have important applications in automatic driving,intelligent robot and holographic projection.While using the traditional method of manually extracting point cloud features or the feature learning method of firstly transforming three-dimensional point cloud data into data forms of multi-view and volumetric grid,there exist problems such as many processing links and great loss of three-dimensional features,resulting in low accuracy of classification and segmentation.The existing deep neural network PointNet,which can directly process point cloud data,ignoresthe local fine-grained features of point cloud and is weak in processing complex point cloud scenarios.To solve the above problems,this paper proposes a point cloud deep learning network based on dynamic graph convolution and spatial pyramid pooling.On the basis of PointNet,the dynamic graph convolution module GraphConv is used to replace the feature learning module in PointNet,which enhances the network’s ability to learn local topological structure information.At the same time,a point-based spatial py-ramid pooling structure PSPP is designed to capture multi-scale local features.Compared with the multi-scale sampling point cloud of PointNet++ and the repeated grouping method for multi-scale local features learning,it is simpler and more efficient.Experimental results show that,on the three benchmark data sets of point cloud classification and semantic segmentation task,the proposed network has higher classification and segmentation accuracy than the existing network.

Key words: Dynamic graph convolution, Local features, Point cloud, PointNet, Spatial pyramid pooling

CLC Number: 

  • TP391
[1]LIU J,WU Z K,ZHOU M Q.Overview of point cloud modelsegmentation and application technology[J].Computer Science,2011,38(4):21-24.
[2]QI C R,SU H,KAICHUN M,et al.PointNet:deep learning on point sets for 3D classification and segmentation[C]//Procee-dings of the IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE Computer Society Press,2017:77-85.
[3]SU H,MAJI S,KALOGERAKIS E,et al.Multi-view convolutional neural networks for 3D shape recognition [C]//2015 IEEE International Conference on Computer Vision.New York:IEEE Press,2015:945-953.
[4]KLOKOV R,LEMPITSKY V.Escape from cells:Deep kd-networks for the recognition of 3d point cloud models[C]//Proceedings of the IEEE International Conference on Computer Vision.Honolulu:IEEE Computer Society Press,2017:863-872.
[5]XU Y,FAN T,XU M,et al.Spidercnn:Deep learning on point sets with parameterized convolutional filters[C]//Proceedings of the European Conference on Computer Vision (ECCV).Munich:IEEE Press,2018:87-102.
[6]CHEN C,LUCA Z F,ANTONIOS T.GAPNet:Graph Attention based Point Neural Network for Exploiting Local Feature of Point Cloud[J].arXiv:1905.08705.
[7]BAI J,SI Q L,QIN F Y.LightPointNet,a lightweight real-time point cloud classification network[J].Journal of Computer-Aided Design and Graphics,2019,31(4):612-621.
[8]QI C R,SU H,NIEβNER M,et al.Volumetric and multi-view cnns for object classification on 3d data[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE Computer Society Press,2016:5648-5656.
[9]MATURANA D,SCHERER S.Voxnet:A 3d convolutionalneural network for real-time object recognition[C]//2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).New York:IEEE Press,2015:922-928.
[10]WU Z,SONG S,KHOSLA A,et al.3d shapenets:A deep representation for volumetric shapes[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Boston:IEEE Computer Society Press,2015:1912-1920.
[11]JADERBERG M,SIMONYAN K,ZISSERMAN A.Spatialtransformer networks[C]//The 24th Annual Conference on Neural Information Processing Systems.Cambridge:MIT Press,2015:2017-2025.
[12]QI C R,YI L,SU H,et al.PointNet++:Deep hierarchical feature learning on point sets in a metric space[C]//The 24th Annual Conference on Neural Information Processing Systems.Cambridge:MIT Press,2017:5105-5114.
[13]DEFFERRARD M,BRESSON X,VANDERGHEYNST P.Convolutional neural networks on graphs with fast localized spectral filtering[C]//Advances in Neural Information Processing Systems.New York:IEEE Press,2016:3844-3852.
[14]SIMONOVSKY M,KOMODAKIS N.Dynamic edge-conditioned filters in convolutional neural networks on graphs[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE Computer Society Press,2017:3693-3702.
[15]WANG Y,SUN Y,LIU Z,et al.Dynamic Graph CNN forLearning on Point Clouds[J].arXiv:1801.07829.
[16]HE K,ZHANG X,REN S,et al.Spatial pyramid pooling in deep convolutional networks for visual recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(9):1904-1916.
[17]YI L,GUIBAS L,KIM V G,et al.A scalable active framework for region annotation in 3D shape collections[J].ACM Transactions on Graphics,2016,35(6):1-12.
[18]ARMENI I,SENER O,ZAMIR A R,et al.3d semantic parsing of large-scale indoor spaces[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE Computer Society Press,2016:1534-1543.
[1] LI Zong-min, ZHANG Yu-peng, LIU Yu-jie, LI Hua. Deformable Graph Convolutional Networks Based Point Cloud Representation Learning [J]. Computer Science, 2022, 49(8): 273-278.
[2] YANG Wen-kun, YUAN Xiao-pei, CHEN Xiao-feng, GUO Rui. Spatial Multi-feature Segmentation of 3D Lidar Point Cloud [J]. Computer Science, 2022, 49(8): 143-149.
[3] MENG Yue-bo, MU Si-rong, LIU Guang-hui, XU Sheng-jun, HAN Jiu-qiang. Person Re-identification Method Based on GoogLeNet-GMP Based on Vector Attention Mechanism [J]. Computer Science, 2022, 49(7): 142-147.
[4] FENG Lei, ZHU Deng-ming, LI Zhao-xin, WANG Zhao-qi. Sparse Point Cloud Filtering Algorithm Based on Mask [J]. Computer Science, 2022, 49(5): 25-32.
[5] XU Tao, CHEN Yi-ren, LYU Zong-lei. Study on Reflective Vest Detection for Apron Workers Based on Improved YOLOv3 Algorithm [J]. Computer Science, 2022, 49(4): 239-246.
[6] ZHAO Xin-can, CHANG Han-xing, JIN Ren-biao. 3D Point Cloud Shape Completion GAN [J]. Computer Science, 2021, 48(4): 192-196.
[7] YAO Nan, ZHANG Zheng. Scar Area Calculation Based on 3D Image [J]. Computer Science, 2021, 48(11A): 308-313.
[8] XU Hua-jie, YANG Yang, LI Gui-lan. Material Recognition Method Based on Attention Mechanism and Deep Convolutional Neural Network [J]. Computer Science, 2021, 48(10): 220-225.
[9] ZENG Jun-fei,YANG Hai-qing,WU Hao. Adaptive Levenberg-Marquardt Cloud Registration Method for 3D Reconstruction [J]. Computer Science, 2020, 47(3): 137-142.
[10] SHI Wen-kai, ZHANG Zhao-chen, YU Meng-juan, WU Rui, NIE Jian-hui. Point Cloud Coarse Alignment Algorithm Based on Feature Detection and Depth FeatureDescription [J]. Computer Science, 2020, 47(12): 252-257.
[11] LI Jian, YANG Xiang-ru, HE Bin. Geometric Features Matching with Deep Learning [J]. Computer Science, 2019, 46(7): 274-279.
[12] WU Fei, ZHAO Xin-can, ZHAN Peng-lei, GUAN Ling. FPFH Feature Extraction Algorithm Based on Adaptive Neighborhood Selection [J]. Computer Science, 2019, 46(2): 266-270.
[13] LIU Zhen-yu, GUAN Tong. Head Posture Detection Based on RGB-D Image [J]. Computer Science, 2019, 46(11A): 334-340.
[14] ZHAO Er-ping, MENG Xiao-feng. Spatial Index of 3D Point Cloud Data Based on Spark [J]. Computer Science, 2018, 45(9): 213-219.
[15] LIU Xue-jun, WEI Yu-chen, YUAN Bi-xian, LU Hao, DAI Bo, LI Cui-qing. Study on Adaptive Hierarchical Clustering De-noising Algorithm of Laser Ranging in Storage of Dangerous Chemicals [J]. Computer Science, 2018, 45(11A): 208-211.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!