基于多种强调机制的深度点云网络改进研究

doi:10.11896/jsjkx.220400164

Abstract

Abstract: Machine vision is a key technology for robots to identify working objects from complex spatial environments.Kinect depth cameras or laser scanning sensors commonly used in robotic systems are capable of acquiring three-dimensional information about the target,which makes it possible for robots to perform more complex work tasks such as assembly,disassembly,and grasping.However,this also places higher demands on the robot system’s ability to process 3D information such as 3D localization,work object size measurement,and estimation.We analyze the main feature emphasis mechanisms of soft threshold squeeze-and-excitation,channel-wise gated,and attention mechanisms based on PointNet networks,and improve PointNet networks by using soft threshold squeeze-and-excitation,channel-wise gated,and attention networks,respectively,and experimentally validate them on the publicly available ShapeNet dataset from Stanford University.Experimental results show that the improvement of original network by the three emphasis mechanisms improves segmentation accuracy(mean intersection and merge ratio) of 3D point clouds by 0.24%,0.68%,and 0.93%,respectively,in comparison with original PointNet network.The improved method lays foundation for the subsequent solution of accurate estimation for the size of working objects in tasks such as assembly,disassembly and grasping by robots.

Key words: Machine vision, 3D point cloud, Squeeze-and-excitation, Channel-wise gated, Attention module

CLC Number:

TP391

LIU Hui, TIAN Shuaihua. Study on Improvement of Deep Point Cloud Network Based on Multiple Emphasis Mechanisms[J].Computer Science, 2023, 50(6A): 220400164-7.

References

[1]XU X,MCGORRY R W.The validity of the first and secondgeneration Microsoft Kinect for identifying joint center locations during static postures[J].Appl. Ergon., 2015,49:47-54.
[2]ZHOU Y,YU Z,XU X D,et al.Practice research of classroom teaching system based on Kinect[C]//15th Int.Conf.Comput.Sci.Educ(ICCSE 2020).2020:572-575.
[3]CUNHA A,PÁDUA L,COSTA L,et al.Evaluation of MS Kinect for Elderly Meal Intake Monitoring[C]//Procedia Tech-nol.2014:1383-1390.
[4]CARUSO L,RUSSO R,SAVINO S.Microsoft Kinect V2 vision system in a manufacturing application[J].Robot.Comput.Integr.Manuf.,2017,48:174-181.
[5]BIERMANN H,PHILIPSEN R,BRELL T,et al.Users’ Expectations,Fears,and Attributions Regarding Autonomous Driving-A Comparison of Traffic Scenarios[M].Springer International Publishing,2021.
[6]JAWAID I,QURESHI J K.Advancements in medical imagingthrough Kinect:A review[C]//2017 Int.Symp.Wirel.Syst.Networks(ISWSN 2017).2017:1-5.
[7]FERNANDES A O,MOREIRA L F E,MATA J M.Machine vision applications and development aspects[C]//IEEE Int.Conf.Control Autom(ICCA).2011:1274-1278.
[8]ALOIMONOS J,WEISS I,BANDYOPADHYAY A.Active vision[J].Int.J.Comput.Vis.,1988,1(4):333-356.
[9]KIM P,CHEN J,CHO Y K.SLAM-driven robotic mapping and registration of 3D point clouds[J]. Autom.Constr.,2018,89:38-48.
[10]DÖNMEZ E,KOCAMAZ A F,DIRIK M.A Vision-Based Real-Time Mobile Robot Controller Design Based on Gaussian Function for Indoor Environment[J].Arab.J.Sci.Eng.,2018,43(12):7127-7142.
[11]KHAIRUDIN M,CHEN G D,WU M C,et al.Control of a movable robot head using vision-based object tracking[J].Int.J.Electr.Comput.Eng.,2019,9(4):2503-2512.
[12]KUZNETSOVA A,MALEVA T,SOLOVIEV V.UsingYOLOv3 algorithm with pre-And post-processing for apple detection in fruit-harvesting robot[J].Agronomy,2020,10(7).
[13]ZHENG F,FANG F,MA X.Trajectory Sampling and Fitting Restoration Based on Machine Vision for Robot Fast Teaching[C]//Proc.15th IEEE Conf.Ind.Electron.Appl.(ICIEA 2020).2020:604-609.
[14]TANG B,JIANG L.Binocular stereovision omnidirectional motion handling robot[J].Int.J.Adv.Robot.Syst.,2020,17(3):1-11.
[15]LI Y,LIU Y.Vision-based Obstacle Avoidance Algorithm forMobile Robot[C]//Proc.-2020 Chinese Autom.Congr.(CAC 2020).2020:1273-1278.
[16]CHAUDHURY A.Machine Vision System for 3D Plant Phenotyping[J].IEEE/ACM Trans.Comput.Biol.Bioinforma.,2018,16(6):2009-2022.
[17]CHERAGHIAN A,RAHMAN S,PETERSSON L.Zero-shot learning of 3d point cloud objects[C]//Proc.16th Int.Conf.Mach.Vis.Appl.(MVA 2019).2019.
[18]MAHDAOUI A.3D point cloud simplification based on the clustering algorithm and introducing the Shannon’s entropy[C]//Thirteenth International Conference on Machine Vision.SPIE,2021,11605:174-182.
[19]LIANG J G,CHEN M L,MA H.Registration of Terrestrial Laser Scanning Data Based on Projection Distribution Entropy[J].Laser & Optoelectronics Progress,2019,56(13):131501.
[20]LAN W H,LI N,TONG Q.Improved3-D Point Cloud Registration Algorithm with Oriented Bounding Box[J].Computer Engineering and Applications,2022,58(14):177-184.
[21]CHANG A X,FUNKHOUSER T,GUIBAS L,et al.Shapenet:An information-rich 3d model repository[J].arXiv:1512.03012,2015.
[22]GUO Y,WANG H,HU Q,et al.Deep learning for 3d point clouds:A survey[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,43(12):4338-4364.
[23]QI C R,SU H,MO K,et al.PointNet:Deep learning on point sets for 3D classification and segmentation[C]//Proc.30th IEEE Conf.Comput.Vis.Pattern Recognition(CVPR 2017).2017:77-85.
[24]HU J.Squeeze-and-Excitation_Networks_CVPR_2018_paper.pdf[C]//CVPR.2018:7132-7141.
[25]WOO S,PARK J,LEE J,et al.CBAM:Convolutional Block Attention Module[C]//ECCV.2018:3-19.
[26]LI X,WU X,LU H,et al.Channel-wise gated res2net:Towards robust detection of synthetic speech attacks[J].arXiv:2107.08803,2021.
[27]TOLSTIKHIN I O,HOULSBY N,KOLESNIKOV A,et al.Mlp-mixer:An all-mlp architecture for vision[J].Advances in Neural Information Processing Systems,2021,34:24261-24272.
[28]LIU W,WEN Y,YU Z,et al.Large-margin softmax loss for convolutional neural networks[J].arXiv:1612.02295, 2016.
[29]ZHAO M,ZHONG S,FU X,et al.Deep Residual ShrinkageNetworks for Fault Diagnosis[J].IEEE Trans.Ind.Informati-cs,2020,16(7):4681-4690.
[30]PENG Y H.De-noising by modified soft-thresholding[J].IEEE Asia-Pacific Conf.Circuits Syst.,2000,41(3):760-762.
[31]LIN M,CHEN Q,YAN S.Network in network(2nd)[C]//Int.Conf.Learn.Represent.ICLR 2014-Conf.Track Proc.2014:1-10.
[32]SALTZER J H,REED D P,CLARK D D.End-to-end arguments in system design[J].ACM Trans.Comput.Syst.,1984,2(4):277-288.

Related Articles 15

[1]	LONG Tao, DONG Anguo, LIU Laijun. Pavement Crack Detection Based on Attention Mechanism and Deformable Convolution [J]. Computer Science, 2023, 50(6A): 220300214-6.
[2]	WANG Wei, BAI Long, MA Huanchang, LIU Yanheng. Study on Safety Warning Method of Driver’s Blind Area Based on Machine Vision [J]. Computer Science, 2023, 50(6A): 220700141-7.
[3]	WEI Kai-xuan, FU Ying. Re-parameterized Multi-scale Fusion Network for Efficient Extreme Low-light Raw Denoising [J]. Computer Science, 2022, 49(8): 120-126.
[4]	LIU Dong-mei, XU Yang, WU Ze-bin, LIU Qian, SONG Bin, WEI Zhi-hui. Incremental Object Detection Method Based on Border Distance Measurement [J]. Computer Science, 2022, 49(8): 136-142.
[5]	YANG Wen-kun, YUAN Xiao-pei, CHEN Xiao-feng, GUO Rui. Spatial Multi-feature Segmentation of 3D Lidar Point Cloud [J]. Computer Science, 2022, 49(8): 143-149.
[6]	WU Lin, SUN Jing-yu. Multi-branch RA Capsule Network and Its Application in Image Classification [J]. Computer Science, 2022, 49(6): 224-230.
[7]	XU Hua-jie, QIN Yuan-zhuo, YANG Yang. Scene Recognition Method Based on Multi-level Feature Fusion and Attention Module [J]. Computer Science, 2022, 49(4): 209-214.
[8]	ZHAO Yue, YU Zhi-bin, LI Yong-chun. Cross-attention Guided Siamese Network Object Tracking Algorithm [J]. Computer Science, 2022, 49(3): 163-169.
[9]	LI Zi-dong, YAO Yi-fei, WANG Wei-wei, ZHAO Rui-lian. Web Application Page Element Recognition and Visual Script Generation Based on Machine Vision [J]. Computer Science, 2022, 49(11): 65-75.
[10]	ZHOU Wen-hui, SHI Min, ZHU Deng-ming, ZHOU Jun. Seismic Data Super-resolution Method Based on Residual Attention Network [J]. Computer Science, 2021, 48(8): 24-31.
[11]	QING Lai-yun, ZHANG Jian-gong, MIAO Jun. Temporal Modeling for Online Anomaly Detection [J]. Computer Science, 2021, 48(7): 206-212.
[12]	WANG Dong, ZHOU Da-ke, HUANG You-da , YANG Xin. Multi-scale Multi-granularity Feature for Pedestrian Re-identification [J]. Computer Science, 2021, 48(7): 238-244.
[13]	ZHAO Xin-can, CHANG Han-xing, JIN Ren-biao. 3D Point Cloud Shape Completion GAN [J]. Computer Science, 2021, 48(4): 192-196.
[14]	HAN Ke-kun, HU Gui-chuan, REN Jing, HE Hong-yu, LIU Jia-yin. Application of Image Processing in Feature Size Detection of Wind Turbine Blade’s Flange Face [J]. Computer Science, 2019, 46(6A): 562-565.
[15]	ZHAO Er-ping, MENG Xiao-feng. Spatial Index of 3D Point Cloud Data Based on Spark [J]. Computer Science, 2018, 45(9): 213-219.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Study on Improvement of Deep Point Cloud Network Based on Multiple Emphasis Mechanisms

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0