计算机科学 ›› 2019, Vol. 46 ›› Issue (9): 265-270.doi: 10.11896/j.issn.1002-137X.2019.09.040
江泽涛1,2, 秦嘉奇1, 胡硕3
JIANG Ze-tao1,2, QIN Jia-qi1, HU Shuo3
摘要: 现有的基于卷积神经网络的场景识别算法无法处理目标场景图形是多光谱图像的情况,在数据量较小的情况下,该算法的识别率不高。针对以上问题,提出一种基于多路卷积神经网络的多光谱场景识别方法。多路卷积神经网络接受三通道可见光彩色图像(RGB图像)以及单通道的近红外图像(NIR图像)共四通道输入。所提方法能够有效提取可见光图像特征、红外光图像特征以及可见光和红外光图像之间的关联特征,并将特征在全连接层进行融合,合理利用了各个光谱图像之间的相关信息,并通过结合预训练的方法来提高识别精度。在NIR_RGB数据集上的实验表明,与AlexNet、InceptionNet、ResNet以及人工设计特征描述子方法相比,该网络的平均识别率较高。并且,对此网络稍加改动,就能将其推广到其他多光谱图像分类任务中。
中图分类号:
[1]HINTON G E,OSINDERO S,TEH Y W.A Fast Learning Algorithm For Deep Belief Nets[J].Neural Computation,2006,18(7):1527-1554. [2]KRIZHEVSKY A,SUTSKEVER I,HINTON G E.ImageNet classification with deep convolutional neural networks[C]//International Conference on Neural Information Processing Systems.Curran Associates Inc.,2012:1097-1105. [3]SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[C]//Proceedings of the International Conference on Learning Representations.Oxford,USA,2015. [4]SZEGEDY C,LIU W,JIA Y,et al.Going deeper with convolutions[C]//IEEE Conference on Computer Vision and Pattern Recognition.IEEE Computer Society,2015:1-9. [5]HE K M,ZHANG X Y,REN S Q,et al.Deep Residual Learning for Image Recognition[C]//IEEE Conference on Computer Vision and Pattern Recognition.IEEE Computer Society,2015:770-778. [6]GAO H,ZHUANG L,MAATEN L V D,et al.Densely Connected Convolutional Networks[C]//IEEE Conference on Computer Vision & Pattern Recognition.IEEE Computer Society,2017. [7]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[J/OL].https://arxiv.org/abs/1709.01507. [8]ROUTRAY S,RAY A K,MISHRA C,et al.Efficient hybrid image denoising scheme based on SVM classification[J].Optik,2018,157:503-511. [9]LEE S J,CHEN T,YU L,et al.Image Classification Based on the Boost Convolutional Neural Network[J].IEEE Access,2018,6(99):12755-12768. [10]SHI L,WANG Y M,CAO Y J,et al.Vehicle Type Recognition Based on Deep Convolution Neural Network[J].Computer Science,2018,45(5):280-284. [11]GONG A,FEI F,ZHENG J.Multi-person Behavior Recognition Method Based on Convolutional Neural Networks[J].Computer Science,2018,45(2):306-311,321.(in Chinese)龚安,费凡,郑君.基于卷积神经网络的多人行为识别方法[J].计算机科学,2018,45(2):306-311,321. [12]ZHOU F Y,JIN L P,DONG J.Review of convolutional neuralnetwork[J].Chinese Journal of Computers,2017,40(6):1229-1251.(in Chinese)周飞燕,金林鹏,董军.卷积神经网络研究综述[J].计算机学报,2017,40(6):1229-1251. [13]BROWN M,SÜSSTRUNK S.Multi-spectral SIFT for scenecategory recognition[C]//IEEE Conference on Computer Vision and Pattern Recognition.IEEE Computer Society,2011:177-184. [14]XIAO Y,WU J,YUAN J.mCENTRIST:A Multi-Channel Feature Generation Mechanism for Scene Categorization[J].IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society,2014,23(2):823-836. [15]VAPNIK V N.An overview of statistical learning theory[J].IEEE Transactions on Neural Networks,1999,10(5):988-999. [16]SALAMATI N,LARLUS D,CSURKA G.Combining Visibleand Near-Infrared Cues for Image Categorisation[C]//British Machine Vision Conference.BMVA Press,2011. [17]ZHANG M M,CHOI J,DANIILIDIS K,et al.VAIS:A dataset for recognizing maritime imagery in the visible and infrared spectrums[C]//Computer Vision and Pattern Recognition Workshops.IEEE,2015:10-16. [18]LIU F,SHEN T S,MA X X,et al.Ship recognition based on multi-band neural network[J].Optics and Precision Enginee-ring,2017,25(11):2939-2946.(in Chinese)刘峰,沈同圣,马新星,等.基于多波段深度神经网络的舰船目标识别[J].光学精密工程,2017,25(11):2939-2946. [19]LIU F,SHEN T S,MA X X.Convolutional Neural NetworkBased Multi-Band Ship Target Recognition with Feature Fusion[J].Acta Optica Sinica,2017(10):240-248.(in Chinese)刘峰,沈同圣,马新星.特征融合的卷积神经网络多波段舰船目标识别[J].光学学报,2017(10):240-248. [20]HAJHASSAN H,CHADDAD A,HARKOUSS Y,et al.Classifications of Multispectral Colorectal Cancer Tissues Using Convolution Neural Network[J].Journal of Pathology Informatics,2017,8(1):1-7. [21]WANG J,LUO C,HUANG H,et al.Transferring Pre-Trained Deep CNNs for Remote Scene Classification with General Features Learned from Linear PCA Network[J].Remote Sensing,2017,9(3):225. [22]LOWE D G.Distinctive image features from scale-invariant keypoint[J].International Journal of Computer Vision,2004,60(2):91-110. [23]LEUTENEGGER S,CHLI M,SIEGWART R Y.BRISK:Binary Robust invariant scalable keypoints[C]//IEEE International Conference on Computer Vision.IEEE,2012:2548-2555. |
[1] | 周乐员, 张剑华, 袁甜甜, 陈胜勇. 多层注意力机制融合的序列到序列中国连续手语识别和翻译 Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion 计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026 |
[2] | 武红鑫, 韩萌, 陈志强, 张喜龙, 李慕航. 监督和半监督学习下的多标签分类综述 Survey of Multi-label Classification Based on Supervised and Semi-supervised Learning 计算机科学, 2022, 49(8): 12-25. https://doi.org/10.11896/jsjkx.210700111 |
[3] | 李宗民, 张玉鹏, 刘玉杰, 李华. 基于可变形图卷积的点云表征学习 Deformable Graph Convolutional Networks Based Point Cloud Representation Learning 计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023 |
[4] | 陈泳全, 姜瑛. 基于卷积神经网络的APP用户行为分析方法 Analysis Method of APP User Behavior Based on Convolutional Neural Network 计算机科学, 2022, 49(8): 78-85. https://doi.org/10.11896/jsjkx.210700121 |
[5] | 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥. 基于注意力机制的医学影像深度哈希检索算法 Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism 计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153 |
[6] | 檀莹莹, 王俊丽, 张超波. 基于图卷积神经网络的文本分类方法研究综述 Review of Text Classification Methods Based on Graph Convolutional Network 计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064 |
[7] | 金方焱, 王秀利. 融合RACNN和BiLSTM的金融领域事件隐式因果关系抽取 Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM 计算机科学, 2022, 49(7): 179-186. https://doi.org/10.11896/jsjkx.210500190 |
[8] | 张颖涛, 张杰, 张睿, 张文强. 全局信息引导的真实图像风格迁移 Photorealistic Style Transfer Guided by Global Information 计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036 |
[9] | 戴朝霞, 李锦欣, 张向东, 徐旭, 梅林, 张亮. 基于DNGAN的磁共振图像超分辨率重建算法 Super-resolution Reconstruction of MRI Based on DNGAN 计算机科学, 2022, 49(7): 113-119. https://doi.org/10.11896/jsjkx.210600105 |
[10] | 刘月红, 牛少华, 神显豪. 基于卷积神经网络的虚拟现实视频帧内预测编码 Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network 计算机科学, 2022, 49(7): 127-131. https://doi.org/10.11896/jsjkx.211100179 |
[11] | 徐鸣珂, 张帆. Head Fusion:一种提高语音情绪识别的准确性和鲁棒性的方法 Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition 计算机科学, 2022, 49(7): 132-141. https://doi.org/10.11896/jsjkx.210100085 |
[12] | 张嘉淏, 刘峰, 齐佳音. 一种基于Bottleneck Transformer的轻量级微表情识别架构 Lightweight Micro-expression Recognition Architecture Based on Bottleneck Transformer 计算机科学, 2022, 49(6A): 370-377. https://doi.org/10.11896/jsjkx.210500023 |
[13] | 王建明, 陈响育, 杨自忠, 史晨阳, 张宇航, 钱正坤. 不同数据增强方法对模型识别精度的影响 Influence of Different Data Augmentation Methods on Model Recognition Accuracy 计算机科学, 2022, 49(6A): 418-423. https://doi.org/10.11896/jsjkx.210700210 |
[14] | 孙洁琪, 李亚峰, 张文博, 刘鹏辉. 基于离散小波变换的双域特征融合深度卷积神经网络 Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation 计算机科学, 2022, 49(6A): 434-440. https://doi.org/10.11896/jsjkx.210900199 |
[15] | 杨涵, 万游, 蔡洁萱, 方铭宇, 吴卓超, 金扬, 钱伟行. 基于步态分类辅助的虚拟IMU的行人导航方法 Pedestrian Navigation Method Based on Virtual Inertial Measurement Unit Assisted by GaitClassification 计算机科学, 2022, 49(6A): 759-763. https://doi.org/10.11896/jsjkx.211200148 |
|