Computer Science (计算机科学), 2018, Vol. 45, Issue 11A: 238-243.
顾哲彬, 曹飞龙
GU Zhe-bin, CAO Fei-long
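Abstract: Traditional artificial neural networks take vectors as input, whereas images are represented as matrices. When such a network is used for image processing, the image must therefore be converted to a vector before it is fed to the network, which destroys the structural information of the image and degrades the processing results. To improve the network's ability to process images, this paper draws on the ideas and methods of deep learning and introduces a multilayer feedforward neural network with matrix inputs. The network is trained with the traditional back-propagation (BP) algorithm; the training process and the training algorithm are given, and numerical experiments are carried out on the USPS handwritten digit dataset. The experimental results show that the proposed multilayer network achieves better classification performance than the single-hidden-layer matrix-input feedforward neural network (2D-BP). In addition, for the problem of color image classification, an effective and feasible method is given based on the proposed 2D-BP network.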
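The abstract does not spell out how a matrix is fed to the network, so the following is only a minimal sketch of one common formulation, assumed here rather than taken from the paper: each hidden unit k holds a left vector u_k and a right vector v_k and computes sigmoid(u_k^T X v_k + b_k), so the image X is never flattened. All function names, sizes, and the random initialisation are hypothetical, and only the single-hidden-layer forward pass is shown; the multilayer construction and the BP updates are not reproduced.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def bilinear(u, X, v):
    """Compute u^T X v for an m-by-n matrix X without flattening it."""
    return sum(u[i] * X[i][j] * v[j]
               for i in range(len(u)) for j in range(len(v)))


def matrix_hidden_layer(X, U, V, b):
    """Assumed matrix-input hidden layer: h_k = sigmoid(u_k^T X v_k + b_k)."""
    return [sigmoid(bilinear(U[k], X, V[k]) + b[k]) for k in range(len(b))]


def output_layer(h, W, c):
    """Ordinary dense output layer acting on the hidden vector h."""
    return [sigmoid(sum(W[o][k] * h[k] for k in range(len(h))) + c[o])
            for o in range(len(c))]


random.seed(0)
# USPS digits are 16x16 grayscale images in 10 classes; the hidden size is arbitrary.
m, n, hidden, classes = 16, 16, 8, 10
X = [[random.random() for _ in range(n)] for _ in range(m)]  # stand-in image
U = [[random.uniform(-0.1, 0.1) for _ in range(m)] for _ in range(hidden)]
V = [[random.uniform(-0.1, 0.1) for _ in range(n)] for _ in range(hidden)]
b = [0.0] * hidden
W = [[random.uniform(-0.1, 0.1) for _ in range(hidden)] for _ in range(classes)]
c = [0.0] * classes

h = matrix_hidden_layer(X, U, V, b)
y = output_layer(h, W, c)
```

Under this assumed formulation, the hidden layer uses hidden × (m + n + 1) = 264 parameters, compared with hidden × (m·n + 1) = 2056 for a dense layer on the flattened 256-dimensional vector.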