Computer Science ›› 2019, Vol. 46 ›› Issue (9): 265-270. doi: 10.11896/j.issn.1002-137X.2019.09.040

• Graphics, Image and Pattern Recognition •

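Multi-spectral Scene Recognition Method Based on Multi-way Convolution Neural Network

JIANG Ze-tao1,2, QIN Jia-qi1, HU Shuo3

  1. Key Laboratory of Image and Graphic Intelligent Processing of Higher Education in Guangxi, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China
  2. Guangxi Key Laboratory of Dependable Software, Guilin University of Electronic Technology, Guilin, Guangxi 541004, China
  3. Nanchang Hangkong University, Nanchang 330063, China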
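  • Received: 2018-08-03  Online: 2019-09-15  Published: 2019-09-02
  • Corresponding author: QIN Jia-qi (born 1993), male, master's student; main research interest: computer vision. E-mail: 1445155606@qq.com
  • About the authors: JIANG Ze-tao (born 1961), male, Ph.D., professor, CCF member; main research interest: computer vision. HU Shuo (born 1983), Ph.D. candidate, lecturer; main research interest: intelligent computing.
  • Funding:
    National Natural Science Foundation of China (61572147), Guangxi Science and Technology Plan Project (AC16380108), Project of the Guangxi Key Laboratory of Image and Graphic Intelligent Processing (GIIP201701), Innovation Project of Guangxi Graduate Education (2018YJCX46), Natural Science Foundation of Jiangxi Province (20171BAB212015)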


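Abstract: Existing scene recognition algorithms based on convolutional neural networks cannot handle target scenes captured as multi-spectral images, and their recognition rate is low when little training data is available. To address these problems, this paper proposes a multi-spectral scene recognition method based on a multi-way convolutional neural network. The network takes a three-channel visible-light color image (RGB image) and a single-channel near-infrared image (NIR image) as input, four channels in total. The method extracts visible-light image features, infrared image features, and the correlation features between the visible and infrared images, and fuses them in the fully connected layer, thereby exploiting the correlated information across the spectral images. It is further combined with pre-training to improve recognition accuracy. Experiments on the NIR_RGB dataset show that the network achieves a higher average recognition rate than AlexNet, InceptionNet, ResNet, and hand-crafted feature descriptor methods. Moreover, with slight modification, the network can be extended to other multi-spectral image classification tasks.

Keywords: Scene recognition, Multi-spectral, Convolutional neural network, Image classification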


CLC Number:

  • TP391.4
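
The abstract describes a network that processes the RGB and NIR inputs along several paths and fuses the resulting features in the fully connected layer. The following is a minimal pure-Python sketch of that idea, not the authors' implementation. It assumes three branches (RGB only, NIR only, and a joint four-channel branch standing in for the visible/infrared correlation features), and it replaces the real convolutional stacks with a single 1x1 convolution, ReLU and global average pooling. The names `conv1x1_relu_gap`, `init_params` and `multiway_forward`, the filter count, and the class count are all hypothetical, and the weights are random rather than pre-trained.

```python
import random


def conv1x1_relu_gap(image, filters):
    """1x1 convolution + ReLU, followed by global average pooling.

    image:   C x H x W nested lists (one 2-D list per channel)
    filters: list of filters, each holding C per-channel weights
    returns: one pooled activation per filter
    """
    n_channels = len(image)
    height, width = len(image[0]), len(image[0][0])
    pooled = []
    for weights in filters:
        total = 0.0
        for y in range(height):
            for x in range(width):
                act = sum(weights[c] * image[c][y][x] for c in range(n_channels))
                total += max(act, 0.0)  # ReLU
        pooled.append(total / (height * width))
    return pooled


def init_params(rng, n_filters, n_classes):
    """Random weights for three assumed branches and one fully connected layer."""
    def make_filters(n_channels):
        return [[rng.uniform(-1.0, 1.0) for _ in range(n_channels)]
                for _ in range(n_filters)]

    n_fused = 3 * n_filters  # the three branch outputs are concatenated
    return {
        "rgb": make_filters(3),    # visible-light branch (assumption)
        "nir": make_filters(1),    # near-infrared branch (assumption)
        "joint": make_filters(4),  # RGB+NIR branch for cross-spectral features (assumption)
        "fc_w": [[rng.uniform(-1.0, 1.0) for _ in range(n_fused)]
                 for _ in range(n_classes)],
        "fc_b": [0.0] * n_classes,
    }


def multiway_forward(rgb, nir, params):
    """Return (fused feature vector, class scores) for one RGB/NIR image pair."""
    f_rgb = conv1x1_relu_gap(rgb, params["rgb"])
    f_nir = conv1x1_relu_gap(nir, params["nir"])
    f_joint = conv1x1_relu_gap(rgb + nir, params["joint"])  # stack into 4 channels
    fused = f_rgb + f_nir + f_joint  # fusion by concatenation before the FC layer
    scores = [sum(w * f for w, f in zip(row, fused)) + b
              for row, b in zip(params["fc_w"], params["fc_b"])]
    return fused, scores


if __name__ == "__main__":
    rng = random.Random(0)
    params = init_params(rng, n_filters=3, n_classes=4)  # arbitrary sizes
    rgb = [[[rng.random() for _ in range(8)] for _ in range(8)] for _ in range(3)]
    nir = [[[rng.random() for _ in range(8)] for _ in range(8)]]
    fused, scores = multiway_forward(rgb, nir, params)
    print(len(fused), "fused features ->", len(scores), "class scores")
```

Concatenation is used here only because it is the simplest way to combine branch outputs ahead of a fully connected layer. The abstract does not say how the fusion is actually carried out, so other schemes would fit its description equally well.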