计算机科学 ›› 2019, Vol. 46 ›› Issue (3): 148-153.doi: 10.11896/j.issn.1002-137X.2019.03.022

• 2018 中国多媒体大会 • 上一篇    下一篇

基于残差网络的三维模型检索算法

李荫民1,2,薛凯心1,2,高赞1,2,3,薛彦兵1,2,徐光平1,2,张桦1,2   

  1. (天津理工大学计算机视觉与系统教育部重点实验室 天津 300384)1
    (天津理工大学天津市智能计算及软件新技术重点实验室 天津 300384)2
    (齐鲁工业大学(山东省科学院)山东省人工智能研究院 济南 250014)3
  • 收稿日期:2018-07-11 修回日期:2018-09-21 出版日期:2019-03-15 发布日期:2019-03-22
  • 通讯作者: 高赞(1980-),男,博士,教授,博士生导师,主要研究方向为图像/视频分析、信息检索和机器学习,E-mail:gaozan114@126.com(通信作者)
  • 作者简介:李荫民(1994-),男,硕士生,CCF会员,主要研究方向为计算机视觉和模式识别,E-mail:18300601870@163.com;薛凯心(1993-),女,硕士生,CCF会员,主要研究方向为计算机视觉和模式识别;薛彦兵(1979-),男,硕士,副研究员,硕士生导师,主要研究方向为计算机视觉和模式识别;徐光平(1977-),男,博士,副教授,硕士生导师,主要研究方向为计算机视觉和存储编码;张桦(1962-),女,博士,教授,博士生导师,主要研究方向为视频分析、虚拟现实。
  • 基金资助:
    国家自然科学基金(61872270,61572357),天津市应用基础与前沿技术研究计划(14JCZDJC31700),天津市自然科学基金(13JCQNJC0040)资助

3-D Model Retrieval Algorithm Based on Residual Network

LI Yin-min1,2,XUE Kai-xin1,2,GAO Zan1,2,3,XUE Yan-bin1,2,XU Guang-ping1,2,ZHANG Hua1,2   

  1. (Key Laboratory of Computer Vision and System of Ministry of Education,Tianjin University of Technology,Tianjin 300384,China)1
    (Tianjin Key Laboratory of Intelligence Computing and Novel Software Technology,Tianjin University of Technology,Tianjin 300384,China)2
    (Institute of AI,Shandong Computer Center,Qilu University of Technology (Shandong Academy of Science),Jinan 250014,China)3
  • Received:2018-07-11 Revised:2018-09-21 Online:2019-03-15 Published:2019-03-22

摘要: 近年来,基于视图的3D模型检索已经成为计算机视觉领域的重点研究方向。3D模型检索算法包括特征提取和检索算法两个部分,且鲁棒的特征对于检索算法起着决定性的作用。目前,研究者们已经提出了许多人工设计特征和深度学习特征,但是很少有人比较它们的异同。因此,文中对不同的人工设计特征和深度学习特征的性能进行了评估分析,基于充分对比的前提,采用了多个数据集、多样的评价标准和不同的检索算法进行了实验,并进一步比较了深度网络不同层特征对性能的影响,从而提出了基于残差网络的三维模型检索算法。在多个公开数据集上的实验表明:1)残差网络所提取的深度特征相较于传统特征,综合性能提升了1%~20%;2)与VGG网络所提取的深度特征相比,残差网络的综合性能提升了1%~5%;3)VGG网络中不同层特征的性能也有差异,深层特征与浅层特征相比,综合性能提升了1%~6%;4)随着网络深度的增加,残差网络所提取的特征的综合性能得到了有限提高,且比其他对比特征均更加鲁棒。

关键词: 3D模型检索, 残差网络, 人工特征, 深度特征, 特征提取

Abstract: In recent years,view-based 3D model retrieval has become a key research direction in the field of computer vision.The 3D model retrieval algorithm includes feature extraction and model retrieval where robust features play a decisive rolein retrieval algorithm.Up to now,the traditional hand-crafted features and deep learning features were proposed,but very few people systematically compare them.Therefore,in this work,the performance of different artificial design features and deep learning features was evaluated and analyzed.Based on the premise of full comparison,multiple data sets,multiple evaluation criteria,and different search algorithms were used to conduct experiments.The effects of different layers of deep network on performance were further compared,and a 3D model retrieval algorithm based on residual network was proposed.Several conclusions could be obtained from the experimental results on multiple public datasets.1)When comparing the deep learning features of VGG network and residual network with traditional hand-crafted features,the improvement of comprehensive performance can reaches 3% to 20%.2)Compared with the deep features extracted by VGG network,the comprehensive performance of the residual network is increased by 1% to 5%.3)The performance of different layer features in the VGG network is also different,and the comprehensive performance of the deep and shallow features is increased by 1% to 6%.4)As the depth of the network increase,the overall perfor-mance of the extracted features of the residual network has limited improvement,and is more robust than other contrasting features.

Key words:

第3期李荫民, Deep features, Feature extraction, Hand-crafted features, Residual network, 等:基于残差网络的三维模型检索算法
3D model retrieval

中图分类号: 

  • TP391.4
[1]OHBUCHI R,FURUYA T.Accelerating bag-of-features sift algorithm for 3d model retrieval∥Proc. SAMT 2008 Workshop on Semantic 3D Media.2008:23-30.
[2]GAO Y,DAI Q,ZHANG N Y.3D model comparison using spatial structure circular descriptor[J].Pattern Recognition,2010,43(3):1142-1151.
[3]LI B,JOHAN H.3D model retrieval using hybrid features and class information[M].Kluwer Academic Publishers,2013.
[4]VRANIC D V,SAUPE D.3D model retrieval.Leipzig:University of Leipzig,2004.
[5]DARAS P,AXENOPOULOS A.A 3D Shape Retrieval Framework Supporting Multimodal Queries[J].International Journal of Computer Vision,2010,89(2-3):229-247.
[6]SHIH J L,LEE C H,WANG J T.A new 3D model retrieval approach based on the elevation descriptor[J].Pattern Recognition,2007,40(1):283-295.
[7]LU K,HE N,XUE J,et al.Learning View-Model Joint Relevance for 3D Object Retrieval[J].IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society,2015,24(5):1449-1459.
[8]OHBUCHI R,FURUYA T.Scale-weighted dense bag of visual features for 3D model retrieval from a partial view 3D model[C]∥IEEE,International Conference on Computer Vision Workshops.IEEE,2009:63-70.
[9]DUBUISSON M P,JAIN A K.A modified Hausdorff distance for object matching[C]∥IEEE Conference on Computer Vision and Image Processing.1994:566-568.
[10]GAO Y,DAI Q.View-based 3-D object retrieval.Holland:Morgan Kaufmann,2014.
[11]ANSARY T F,DAOUDI M,VANDEBORRE J P.A Bayesian 3-D Search Engine Using Adaptive Views Clustering[J].IEEE Transactions on Multimedia,2006,9(1):78-88.
[12]ANSARY T F,DAOUDI M,VANDEBORRE J P.A Bayesian 3-D Search Engine Using Adaptive Views Clustering[J].IEEE Transactions on Multimedia,2006,9(1):78-88.
[13]GAO Y,TANG J,HONG R,et al.Camera Constraint-Free
View-Based 3-D Object Retrieval[J].IEEE Transactions on Ima-ge Processing,2012,21(4):2269-2281.
[14]LIU X,WANG M,YIN B C,et al.Event-Based Media Enrichment Using an Adaptive Probabilistic Hypergraph Model.[J].IEEE Transactions on Cybernetics,2015,45(11):2461.
[15]ZHU L,SHEN J,JIN H,et al.Content-Based Visual Landmark Search via Multimodal Hypergraph Learning[J].IEEE Transactions on Cybernetics,2015,45(12):2756.
[16]LIU A A,NIE W Z,GAO Y,et al.Multi-Modal Clique-Graph Matching for View-Based 3D Model Retrieval[J].IEEE Tran-sactions on Image Processing,2016,25(5):2103-2116.
[17]LU K,HE N,XUE J,et al.Learning View-Model Joint Rele-
vance for 3D Object Retrieval[J].IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society,2015,24(5):1449-1459.
[18]GAO Y,DAI Q,WANG M,et al.3D model retrieval using
weighted bipartite graph matching[J].Signal Processing Image Communication,2011,26(1):39-47.
[19]LEORDEANU M,HEBERT M.A Spectral Technique for Correspondence Problems Using Pairwise Constraints[C]∥Tenth IEEE International Conference on Computer Vision.IEEE Computer Society,2005:1482-1489.
[20]CHO M,LEE J,LEE K M.Reweighted random walks for graph matching[C]∥European Conference on Computer Vision.Springer,Berlin,Heidelberg,2010:492-505.
[21]LIU A,WANG Z,NIE W,et al.Graph-based characteristic view set extraction and matching for 3D model retrieval[J].Information Sciences,2015,320:429-442.
[22]KHOTANZAD A,HONG Y H.Invariant Image Recognition by Zernike Moments[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,1990,12(5):489-497.
[23]DALAL N,TRIGGS B.Histograms of oriented gradients for
human detection[C]∥IEEE Computer Society Conference on Computer Vision and Pattern Recognition,2005(CVPR 2005).IEEE,2005:886-893.
[24]PERRONNIN F,MENSINK T.Improving the fisher kernel for large-scale image classification[C]∥European Conference on Computer Vision.Springer-Verlag,2010:143-156.
[25]DONAHUE J,JIA Y,VINYALS O,et al.DeCAF:A Deep Convolutional Activation Feature for Generic Visual Recognition∥International Conference on Machine Learning.2014:647-655.
[26]NIE W,CAO Q,LIU A,et al.Convolutional deep learning for 3D object retrieval.Multimedia Systems,2017,23(3):325-332.
[27]LEIBE B,SCHIELE B.Analyzing appearance and contour based methods for object categorization∥2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.IEEE,2003,2:II-409.
[28]CHEN D Y.On visual similarity based 3d model retrieval[C]∥Computer Graphics Forum.2003:223-232.
[29]GAO Y,WANG M,JI R,et al.3-D Object Retrieval With Haus-
dorff Distance Learning[J].IEEE Transactions on Industrial Electronics,2013,61(4):2088-2098.
[30]GAO Y,WANG M,ZHA Z J,et al.Less is More:Efficient 3-D Object Retrieval With Query View Selection[J].IEEE Transactions on Multimedia,2011,13(5):1007-1018.
[31]LLER H,LLER W,SQUIRE D M,et al.Performance evaluation in content-based image retrieval:overview and proposals[J].Pattern Recognition Letters,2001,22(5):593-601.
[32]NIE W Z,LIU A A,SU Y T.3D object retrieval based on sparse coding in weak supervision[J].Journal of Visual Communication &Image Representation,2016,37(C):40-45.
[1] 王馨彤, 王璇, 孙知信.
基于多尺度记忆残差网络的网络流量异常检测模型
Network Traffic Anomaly Detection Method Based on Multi-scale Memory Residual Network
计算机科学, 2022, 49(8): 314-322. https://doi.org/10.11896/jsjkx.220200011
[2] 张源, 康乐, 宫朝辉, 张志鸿.
基于Bi-LSTM的期货市场关联交易行为检测方法
Related Transaction Behavior Detection in Futures Market Based on Bi-LSTM
计算机科学, 2022, 49(7): 31-39. https://doi.org/10.11896/jsjkx.210400304
[3] 曾志贤, 曹建军, 翁年凤, 蒋国权, 徐滨.
基于注意力机制的细粒度语义关联视频-文本跨模态实体分辨
Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism
计算机科学, 2022, 49(7): 106-112. https://doi.org/10.11896/jsjkx.210500224
[4] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[5] 刘伟业, 鲁慧民, 李玉鹏, 马宁.
指静脉识别技术研究综述
Survey on Finger Vein Recognition Research
计算机科学, 2022, 49(6A): 1-11. https://doi.org/10.11896/jsjkx.210400056
[6] 高荣华, 白强, 王荣, 吴华瑞, 孙想.
改进注意力机制的多叉树网络多作物早期病害识别方法
Multi-tree Network Multi-crop Early Disease Recognition Method Based on Improved Attention Mechanism
计算机科学, 2022, 49(6A): 363-369. https://doi.org/10.11896/jsjkx.210500044
[7] 韩红旗, 冉亚鑫, 张运良, 桂婕, 高雄, 易梦琳.
基于共同子空间分类学习的跨媒体检索研究
Study on Cross-media Information Retrieval Based on Common Subspace Classification Learning
计算机科学, 2022, 49(5): 33-42. https://doi.org/10.11896/jsjkx.210200157
[8] 高元浩, 罗晓清, 张战成.
基于特征分离的红外与可见光图像融合算法
Infrared and Visible Image Fusion Based on Feature Separation
计算机科学, 2022, 49(5): 58-63. https://doi.org/10.11896/jsjkx.210200148
[9] 赵人行, 徐频捷, 刘瑶.
基于深度卷积残差网络的心电单导联房颤检测方法
ECG-based Atrial Fibrillation Detection Based on Deep Convolutional Residual Neural Network
计算机科学, 2022, 49(5): 186-193. https://doi.org/10.11896/jsjkx.220200002
[10] 高心悦, 田汉民.
基于改进U-Net网络的液滴分割方法
Droplet Segmentation Method Based on Improved U-Net Network
计算机科学, 2022, 49(4): 227-232. https://doi.org/10.11896/jsjkx.210300193
[11] 张红民, 李萍萍, 房晓冰, 刘宏.
改进YOLOv3网络模型的人体异常行为检测方法
Human Abnormal Behavior Detection Method Based on Improved YOLOv3 Network Model
计算机科学, 2022, 49(4): 233-238. https://doi.org/10.11896/jsjkx.210300251
[12] 瞿中, 陈雯.
基于空洞卷积和多特征融合的混凝土路面裂缝检测
Concrete Pavement Crack Detection Based on Dilated Convolution and Multi-features Fusion
计算机科学, 2022, 49(3): 192-196. https://doi.org/10.11896/jsjkx.210100164
[13] 左杰格, 柳晓鸣, 蔡兵.
基于图像分块与特征融合的户外图像天气识别
Outdoor Image Weather Recognition Based on Image Blocks and Feature Fusion
计算机科学, 2022, 49(3): 197-203. https://doi.org/10.11896/jsjkx.201200263
[14] 任首朋, 李劲, 王静茹, 岳昆.
基于集成回归决策树的lncRNA-疾病关联预测方法
Ensemble Regression Decision Trees-based lncRNA-disease Association Prediction
计算机科学, 2022, 49(2): 265-271. https://doi.org/10.11896/jsjkx.201100132
[15] 张师鹏, 李永忠.
基于降噪自编码器和三支决策的入侵检测方法
Intrusion Detection Method Based on Denoising Autoencoder and Three-way Decisions
计算机科学, 2021, 48(9): 345-351. https://doi.org/10.11896/jsjkx.200500059
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!