计算机科学 ›› 2021, Vol. 48 ›› Issue (6A): 51-56.doi: 10.11896/jsjkx.200500122

• 图像处理&多媒体技术 • 上一篇    下一篇

基于卷积神经网络的建筑构件图像识别

熊朝阳, 王婷   

  1. 南昌航空大学土木建筑学院 南昌330063
  • 出版日期:2021-06-10 发布日期:2021-06-17
  • 通讯作者: 王婷(1070180557@qq.com)
  • 作者简介:shanyunfengmail@163.com
  • 基金资助:
    国家自然科学基金(51968051)

Image Recognition for Building Components Based on Convolutional Neural Network

XIONG Zhao-yang, WANG Ting   

  1. School of Civil Engineering,Nanchang Hangkong University,Nanchang 330063,China
  • Online:2021-06-10 Published:2021-06-17
  • About author:XIONG Zhao-yang,born in 1996,master.His main research interests include deep learning and intelligent construction.
    WANG Ting,born in 1975,Ph.D,associate professor.Her main research interests include BIM and intelligent construction.
  • Supported by:
    National Natural Science Foundation of China (51968051).

摘要: 对于现存的大量既有建筑,利用三维激光扫描所得到的点云数据生成BIM模型,需要将点云数据转换成建筑RGB-D图像,并对图像进行分类处理。传统图像识别技术无论是识别准确度还是面对复杂场景的模型泛化能力等,都难以满足现在的需求。文中基于深度学习算法,针对室内建筑门窗构件图像的分类问题,提出了一种运用卷积神经网络模型进行建筑构件图像识别的方法。该方法首先将收集的数据集进行数据增强处理以增加数据丰富度,并使用在ImageNet上已经训练好权重的VGG16作为识别网络,随后对网络进行优化,包括增加Dropout层、L2正则化以及采用Fine-tune操作来提升网络的识别精度。实验结果表明,进行了Fine-tune等优化后的模型的平均识别准确率达到95.4%,相比于未经过优化的模型的准确率提高了大约5.1%。

关键词: 建筑构件, 卷积神经网络, 迁移学习, 图像识别

Abstract: It is necessary to convert the point cloud data into theRGB-D images of building and classify the images,when using the point cloud data obtained by 3D laser scanner to generate BIM model for a large number of existing buildings.In this paper,based on the deep learning algorithm,a method of building components image recognition employing the convolution neural network is proposed by using the transfer learning theory to dealing with the classification problem of interior building components image such as doors and windows.First of all,the VGG16 with weight parameters trained in Imagenet is used as the image recognition neural network.In addition,the network is optimized by adding Dropout layer,L2 regularization and using Fine-tune operation to improve the recognition accuracy of the network.The experimental results show that the average recognition accuracy of the model optimized by Fine-tune is 95.4%,which is about 5.1% higher than that of the model without optimization.

Key words: Building components, Convolutional neural network, Image recognition, Transfer learning

中图分类号: 

  • TP391
[1] BRILAKIS I,LOURAKIS R,SACKS R,et al.Toward automated generation of parametric BIMs based on hybrid video and laser scanning data [J].Advanced Engineering Informatics,2010,24(4):456-465.
[2] CORTES C,VAPNIK V N.Support Vector Networks [J].Machine Learning,1995,20(3):273-297.
[3] WANG C F,XIAO S B.Study of Image Classification Based on SVM[J].Computer and Digital Engineering,2006,34(8):74-76,81.
[4] CHEN P H,SHEN H K,LEI C Y,et al.Support-vector-machine-based method for automated steel bridge rust assessment [J].Automation in Construction,2012,23:9-19.
[5] LOWE D G.Object Recognition from Local Scale-Invariant Fea-tures[C]//International Conference on Computer Vision.1999.
[6] YU K,JIA L,CHEN Y Q,et al.Deep Learning:Yesterday,Today,Tomorrow[J].Journal of Computer Researchand Development,2013,50(9):1799-1804.
[7] HINTON E G.Reducing the Dimensionality of Data with Neural Networks [J].Science,2006,313(5786):504-507.
[8] KRIZHEVSKY A,SUTSKEVER I,HINTON G.ImageNetClassification with Deep Convolutional Neural Networks [J].Advances in Neural Information Processing Systems,2012,25(2).
[9] CHA Y J,CHOI W,BÜYÜKZTÜRK O.Deep Learning-Based Crack Damage Detection Using Convolutional Neural Networks [J].Computer-aided Civil & Infrastructure Engineering,2017,32(5):361-378.
[10] HUANG H W,LI Q T.Image recognition for water leakage in shield tunnel based on deep learning [J].Chinese Journal of Rock Mechanicsand Engineering,2017,32(5):361-378.
[11] LIU W T,LI S H,QIN Y C.Automatic Building Roof Extraction with Fully Convolutional Neural Network [J].Journal of Geo-Information Science,2018,20(11):1562-1570.
[12] YI Y K,ZHANG Y,MYUNG J.House style recognition using deep convolutional neural network [J].Automation in Construction,2020,118:103-307.
[13] SIMONYAN K,ZISSERMAN A.Very Deep Convolutional Networks for Large-Scale Image Recognition [J].arXiv:1409.1556.
[14] DENG J,DONG W,SOCHER R,et al.ImageNet:a Large-Scale Hierarchical Image Database[C]//2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009).Miami,Florida,USA,2009:20-25.
[15] ATKINSON G A,ZHANG W,HANSEN M F,et al.Image segmentation of underfloor scenes using a mask regions convolutional neural network with two-stage transfer learning [J].Automation in Construction,2020,113:103-118.
[16] GAO Y,MOSALAM K M.Deep Transfer Learning for Image-Based Structural Damage Recognition [J].Computer-Aided Civil and Infrastructure Engineering,2018,33(9):748-768.
[17] GLOROT X,BORDES A,BENGIO Y.Deep Sparse RectifierNeural Networks [C]//14th International Conference on Artificial Intelligence and Statistics (AISTATS).Fort Lauderdale,FL,USA,2011.
[18] SAINATH T N,MOHAMED A R,KINGSBURY B,et al.Deep convolutional neural networks for LVCSR[C]//2013 IEEE International Conference on Acoustics,Speech and Signal Proces-sing (ICASSP).IEEE,2013.
[19] SRIVASTAVA N,HINTON G,KRIZHEVSKY A,et al.Dropout:a simple way to prevent neural networks from overfitting [J].J Mach Learn Res,2014,15(1):1929-1958.
[20] KINGMA D P,BA J.Adam A Method for Stochastic Optimization [J].arXiv:abs1412.6980.
[21] BAKHODA A,YUAN G L,FUNG W W L,et al.Analyzing CUDA workloads using a detailed GPU simulator[C]//2009 ISPASS 2009 IEEE International Symposium on Performance Analysis of Systems and Software.2009.
[22] ABADI M,AGARWAL A,BARHAM P,et al.TensorFlow:Large-Scale Machine Learning on Heterogeneous Distributed Systems [J].arXiv:1603.04467,2016.
[23] SONG S,LICHTENBERG S P,XIAO J.SUN RGB-D:A RGB-D scene understanding benchmark suite[C]//IEEE Conference on Computer Vision & Pattern Recognition(CVPR).IEEE,2015:567-576.
[24] GUPTA S,GIRSHICK R,ARBELÁEZ P,et al.Learning Rich Features from RGB-D Images for Object Detection and Segmentation [C]//ECCV 2014.2014:345-360.
[25] LIU C,XU X J.RGB-D image classification based on convolutional neural network[J].Electronic Design Engineering,2018,26(24):159-163.
[1] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[2] 李宗民, 张玉鹏, 刘玉杰, 李华.
基于可变形图卷积的点云表征学习
Deformable Graph Convolutional Networks Based Point Cloud Representation Learning
计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023
[3] 方义秋, 张震坤, 葛君伟.
基于自注意力机制和迁移学习的跨领域推荐算法
Cross-domain Recommendation Algorithm Based on Self-attention Mechanism and Transfer Learning
计算机科学, 2022, 49(8): 70-77. https://doi.org/10.11896/jsjkx.210600011
[4] 陈泳全, 姜瑛.
基于卷积神经网络的APP用户行为分析方法
Analysis Method of APP User Behavior Based on Convolutional Neural Network
计算机科学, 2022, 49(8): 78-85. https://doi.org/10.11896/jsjkx.210700121
[5] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[6] 檀莹莹, 王俊丽, 张超波.
基于图卷积神经网络的文本分类方法研究综述
Review of Text Classification Methods Based on Graph Convolutional Network
计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064
[7] 张颖涛, 张杰, 张睿, 张文强.
全局信息引导的真实图像风格迁移
Photorealistic Style Transfer Guided by Global Information
计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036
[8] 戴朝霞, 李锦欣, 张向东, 徐旭, 梅林, 张亮.
基于DNGAN的磁共振图像超分辨率重建算法
Super-resolution Reconstruction of MRI Based on DNGAN
计算机科学, 2022, 49(7): 113-119. https://doi.org/10.11896/jsjkx.210600105
[9] 刘月红, 牛少华, 神显豪.
基于卷积神经网络的虚拟现实视频帧内预测编码
Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network
计算机科学, 2022, 49(7): 127-131. https://doi.org/10.11896/jsjkx.211100179
[10] 徐鸣珂, 张帆.
Head Fusion:一种提高语音情绪识别的准确性和鲁棒性的方法
Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition
计算机科学, 2022, 49(7): 132-141. https://doi.org/10.11896/jsjkx.210100085
[11] 金方焱, 王秀利.
融合RACNN和BiLSTM的金融领域事件隐式因果关系抽取
Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM
计算机科学, 2022, 49(7): 179-186. https://doi.org/10.11896/jsjkx.210500190
[12] 张嘉淏, 刘峰, 齐佳音.
一种基于Bottleneck Transformer的轻量级微表情识别架构
Lightweight Micro-expression Recognition Architecture Based on Bottleneck Transformer
计算机科学, 2022, 49(6A): 370-377. https://doi.org/10.11896/jsjkx.210500023
[13] 王建明, 陈响育, 杨自忠, 史晨阳, 张宇航, 钱正坤.
不同数据增强方法对模型识别精度的影响
Influence of Different Data Augmentation Methods on Model Recognition Accuracy
计算机科学, 2022, 49(6A): 418-423. https://doi.org/10.11896/jsjkx.210700210
[14] 孙洁琪, 李亚峰, 张文博, 刘鹏辉.
基于离散小波变换的双域特征融合深度卷积神经网络
Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation
计算机科学, 2022, 49(6A): 434-440. https://doi.org/10.11896/jsjkx.210900199
[15] 孙福权, 崔志清, 邹彭, 张琨.
基于多尺度特征的脑肿瘤分割算法
Brain Tumor Segmentation Algorithm Based on Multi-scale Features
计算机科学, 2022, 49(6A): 12-16. https://doi.org/10.11896/jsjkx.210700217
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!