计算机科学 ›› 2016, Vol. 43 ›› Issue (9): 223-226.doi: 10.11896/j.issn.1002-137X.2016.09.044

• 人工智能 • 上一篇    下一篇

基于多分类器协同学习的卷积神经网络训练算法

陈文,张恩阳,赵勇   

  1. 四川大学计算机学院 成都610065,长虹技术中心基础技术研究所 成都610094,长虹技术中心基础技术研究所 成都610094
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受国家自然科学基金:基于云的人工免疫检测器生成算法及其中网络安全中的应用研究(61402308)资助

CNN Training Algorithm Based on Co-studying of Multiple Classifiers

CHEN Wen, ZHANG En-yang and ZHAO Yong   

  • Online:2018-12-01 Published:2018-12-01

摘要: 卷积神经网络(CNN)是一类重要的深度神经网络,然而其训练过程需要大量的已标记样本,从而限制了其实际应用。针对这一问题,分析了CNN分类器的协同学习过程,给出了基于迭代进化的分类器协同训练算法CAMC。该算法结合了CNN和多分类器协同训练的优势,首先采用不同的卷积核提取出多种样本特征以产生不同的CNN分类器;然后利用少量的已标记样本和大量的未标记样本对多个分类器进行协同训练,以持续提高分类性能。在人脸表情标准数据集上的实验结果表明,相对于传统的表情特征识别法LBP和Gabor,CAMC能够在分类过程中利用未标记样本持续实现性能提升,从而具有更高的分类准确率。

关键词: 机器学习,卷积神经网络,协同训练,图像识别,多分类器

Abstract: Convolutional neural network (CNN) is a kind of important deep neural network.However,amounts of labeled samples are needed in the training process of CNN,which largely limits its applications.For the problem,the co-training process of CNN was analyzed,and a co-training algorithm CAMC based on the iterative evolution of classifiers was given.The advantages of CNN and co-training of multiple classifiers are combined in CAMC.Firstly,multiple features are drawn using different convolutional kernels to build up different CNN classifiers.Then to continually improve the classification performance,some labeled samples together with many unlabeled samples are employed to co-train the multi classifiers.The experimental results based on the standard data sets of human expression demonstrates that,compared with traditional expression recognition methods LBP and Gabor,the performance of CAMC can be continually improved,thus it has higher classification accuracy.

Key words: Machine learning,Convolutional neural network,Co-training,Image recognition,Multiple classifiers

[1] Liu Cong,Xu Wei-sheng,Wu Qi-di.Spatiotemporal convolutio-nal neural networks and its application in action recognition [J].Computer Science,2015,42(7):245-249(in Chinese) 刘琮,许维胜,吴启迪.时空域深度卷积神经网络及其在行为识别上的应用[J].计算机科学,2015,2(7):245-249
[2] Evgeny A S,Denis M T,Serge N A.Comparison of Regularization Methods for ImageNet Classification with Deep Convolutional Neural Networks [J].Aasri Procedia,2014,6(1):89-94
[3] Dong Zhen,Wu Yu-wei,Pei Ming-tao,et al.Vehicle Type Classification Using a Semisupervised Convolutional Neural Network [J].IEEE Transactions on Intelligent Transportation Systems,2015,16(4):2247-2256
[4] Wu Hai-bing,Gu Xiao-dong.Towards dropout training for con-volutional neural networks [J].Neural Networks,2015,71:1-10
[5] Shi He-huan,Xu Yue-lei,Ma Shi-ping,et al.Convolutional neural networks recognition algorithm based on PCA [J].Journal of Xidian University,2016,43(3):175-180(in Chinese) 史鹤欢,许悦雷,马时平,等.PCA预训练的卷积神经网络目标识别算法[J].西安电子科技大学学报,2016,43(3):175-180
[6] Liu Fa-yao,Lin Guo-sheng,Shen Chun-hua.CRF learning with CNN features for image segmentation [J].Pattern Recognition,2015,48(10):2983-2992
[7] Deng Chao,Guo Mao-Zu.Tri-Training and Data Editing Based Semi-Supervised Clustering Algorithm [J].Journal of Software,2008,19(3):663-673
[8] Blum A,Mitchell T.Combining labeled and unlabeled data with co-training [C]∥Conference on Computational Learning theory.ACM Press,1998:92-100
[9] Goldman S,Zhou Y.Enhancing supervised learning with unlabeled data [C]∥The 17th International Conference on Machine Learning.Morgan Kaufmann Publishers,2000:327-334
[10] Zhou Z H,Li M.Tri-Training:Exploiting unlabeled data using three classifiers [J].IEEE Trans.on Knowledge and Data Engineering,2005,17(11):1529-1541
[11] Li Hai-feng,Li Chuan-guo.Note on deep architecture and deep learning algorithms [J].Journal of Heibei University(Natural Science Edition),2012,32(5):538-544(in Chinese) 李海峰,李纯果.深度学习结构和算法比较分析[J].河北大学学报(自然科学版),2012,2(5):538-544
[12] Hinton G E,Salakhutdinov R R.Reducing the dimensionality of data with neural networks [J].Science,2006,313 (5786):504-507
[13] Angluin D,Laird P.Learning from noisy examples[J].Machine Learning,1988,2(4):343-370
[14] CMU face images data set .http://archive.ics.uci.edu/ml/ datasets
[15] Kanade T,Cohn J,Tian Y.Comprehensive Database for Facial Expression Analysis[C]∥Proceedings of the Fourth IEEE International Conference on Automatic Face and Gesture Recognition.EE Computer Society,2000:46-53
[16] Bashyal S,Venayagamoorthy G K.Recognition of facial expressions using Gabor wavelets and learning vector quantization [J].Engineering Applications of Artificial Intelligence,2008,1(7):1056-1064
[17] Ojala T,Pietikainan M.Multiresolution gray-scale and rotation invariant texture classification with local binary patterns [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24(7):971-987
[18] Bashyal S,Venayagamoorthy G K.Recognition of Facial Expressions Using Gabor Wavelets and Learning Vector Quantization [J].Engineering Applications of Artificial Intelligence,2008,21(7):1056-1064

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!