Algorithm of Multi-layer Forward Artificial Neural Network for Image Classification

Abstract

Traditional artificial neural networks take vectors as input, whereas images are represented as matrices. When such a network is used for image processing, each image must therefore be converted to a vector before it is fed in, which destroys the image's structural information and degrades the results. To improve the network's ability to process images, this paper draws on the ideas and methods of deep learning and introduces a multilayer feedforward neural network with matrix inputs. The network is trained with the traditional back-propagation (BP) algorithm, and the training process and training algorithm are given. A network structure with good performance was determined through extensive experiments, and numerical experiments were carried out on the USPS handwritten digit dataset. The results show that the proposed multilayer network classifies better than the single-hidden-layer feedforward neural network with matrix input (2D-BP). In addition, an effective and feasible method for color image classification is given, based on the proposed 2D-BP network.

Key words: Gradient optimization, Image classification, Learning algorithm, Multilayer feedforward neural networks
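
The contrast the abstract draws between vector input and matrix input can be illustrated with a minimal NumPy sketch. It rests on assumptions that the abstract does not confirm: each hidden unit is taken to compute a bilinear form `u^T X v + b` followed by a sigmoid, which is one common way to build a matrix-input layer and may differ from the paper's exact formulation. The names `matrix_input_layer`, `vector_input_layer`, `U`, `V`, and `W` are hypothetical. The 16x16 image size matches the USPS digits.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def matrix_input_layer(X, U, V, b):
    """Hidden layer that consumes the image matrix X directly (assumed form).

    X: (m, n) image, U: (h, m) left weights, V: (h, n) right weights, b: (h,).
    Hidden unit j computes sigmoid(U[j] @ X @ V[j] + b[j]), so rows and
    columns of the image are weighted separately and X is never flattened.
    """
    pre = np.einsum("hm,mn,hn->h", U, X, V) + b
    return sigmoid(pre)

def vector_input_layer(X, W, b):
    """Conventional layer: the image is flattened to a vector first."""
    return sigmoid(W @ X.reshape(-1) + b)

rng = np.random.default_rng(0)
m, n, h = 16, 16, 8                      # 16x16 image, 8 hidden units
X = rng.random((m, n))                   # stand-in for one grayscale digit
U = rng.normal(scale=0.1, size=(h, m))
V = rng.normal(scale=0.1, size=(h, n))
W = rng.normal(scale=0.1, size=(h, m * n))
b = np.zeros(h)

hidden_matrix = matrix_input_layer(X, U, V, b)
hidden_vector = vector_input_layer(X, W, b)

# Parameter counts for the two layers under these assumptions.
n_params_matrix = U.size + V.size + b.size   # h * (m + n + 1)
n_params_vector = W.size + b.size            # h * (m * n + 1)
```

Under this assumed form, the matrix-input layer needs `h * (m + n + 1)` parameters instead of `h * (m * n + 1)`, and every weight is tied to a whole row or column of the image rather than to a single pixel position.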

CLC number: TN911.71

GU Zhe-bin (顾哲彬), CAO Fei-long (曹飞龙). Algorithm of Multi-layer Forward Artificial Neural Network for Image Classification[J]. Computer Science, 2018, 45(11A): 238-243. https://doi.org/

