计算机科学 ›› 2018, Vol. 45 ›› Issue (11A): 278-281.

• 模式识别与图像处理 • 上一篇    下一篇

基于伪PCA的手写数字识别算法

韩旭, 刘强, 许瑾, 谌海云   

  1. 西南石油大学电气信息学院 四川 南充637001
  • 出版日期:2019-02-26 发布日期:2019-02-26
  • 作者简介:韩 旭(1991-),男,硕士生,主要研究方向为机器学习、模式识别,E-mail:201621000193@stu.swpu.edu.cn;刘 强(1992-),男,硕士生,主要研究方向为图像识别、模式识别;许 瑾(1980-),女,硕士,讲师,主要研究方向为智能控制;谌海云(1967-),男,硕士,教授,主要研究方向为机器人自动化控制。
  • 基金资助:
    本文受南充市校科技战略合作专项项目(NC17SY4011)资助。

Handwritten Numeral Recognition Algorithm Based on Similar Principal Component Analysis

HAN Xu, LIU Qiang, XU Jin, CHEN Hai-yun   

  1. School of Electrical Information Engineering,Southwest Petroleum University,Nanchong,Sichuan 637001,China
  • Online:2019-02-26 Published:2019-02-26

摘要: PCA(Principal Component Analysis)是最重要的数据降维算法之一,针对降维过程出现的信息丢失问题,学术界说法不一。基于此,文中提出了一种新的改进算法(Similar Principal Component Analysis,SPCA),新算法在处理过程中保留了部分细节信息。以手写数字(MNIST)数据库为例,将原始向量组进行临近特征筛选,得出多维复合非正交特征向量组;将训练库所得的向量组与测试集的向量组进行比对,识别出所测试的手写数字。结果表明,该算法能够以较少量的训练样本实现对测试样本的较为完全的识别。

关键词: 非正交特征, 临近特征, 手写数字识别, 主成分分析

Abstract: Principal component analysis (PCA) is one of the most important data reduction algorithms,there is much-maligned views in the process of handling data.A novel improved similar principal component analysis (SPCA) algorithm which is based on principal component analysis (PCA) algorithm was proposed in this paper.This algorithm can keep some detail information in the process.Taking the MNIST handwritten numeral database as an example, the near feature vector is chosen in original vectors to get the groups of non-orthogonal feature vectors.Then,the vectors of trai-ning library is compared with the vectors of testing library,and the recognition rate is calculated.Recognition results indicate that the algorithm can make high identification of the testing samples through a small number of training samples.

Key words: Handwritten numeral recognition, Near feature vector, Non-orthogonal feature vectors, Principal component analysis(PCA)

中图分类号: 

  • TP391.41
[1]PEARSON K.On lines and planes of closest fit[J].Philosophical Magazine,1901,6:559-572.
[2]KIRBY M,SIROVICH L.Application of the Karhunen-Loeve procedure for the characterization of human faces[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,1990,12(1):103-108.
[3]TURK M,PENTLAND A.Eigenfaces for recognition[J].Journal of Cognitive Neuroscience,1991,3(1):71-86.
[4]TURK M A,PENTLAND A P.Face recognition using eigenfa-ces[J].Journal of Cognitive Neurosience,1991,7:71-86.
[5]LU H,PLATANIOTIS K N,VENETSANOPOULOS A N. MPCA:Multilinear Principal Component Analysis of Tensor Objects[J].IEEE Transactions on Neural Networks,2008,19(1):18.
[6]YANG J,FRANGI A F,YANG J Y,et al.KPCA plus LDA:a complete kernel Fisher discriminant framework for feature extraction and recognition[J].IEEE Trans. Pattern Anal. Mach. Intell.,2005,27(2):230-244.
[7]CAO L J,CHUA K S,CHONG W K,et al.A comparison of PCA,KPCA and ICA for dimensionality reduction in support vector machine[J].Neurocomputing,2003,55(1):321-336.
[8]任佳佳.稀疏主成分分析的相关研究[D].西安:西安工程大学,2017.
[9]谭亚芳,刘娟,王才华,等.一种稀疏可控的主成分分析方法[J].计算机科学,2017,44(1):243-246.
[10]闫荣华,彭进业,汶德胜.PCA与2DPCA的关系[J].计算机科学,2017,44(S2):202-206,216.
[11]HASTIE T,FRIEDMAN J,TIBSHIRANI R.The Elements of Statistical Learning[J].Technometrics,2001,45(3):267-268.
[12]http://yann.lecun.com/exdb/mnist.
[13]HSU C W,LIN C J.A comparison of methods for multiclass support vector machines[J].IEEE Transactions on Neural Networks,2002,13(2):415-425.
[14]CHANG C C,LIN C J.LIBSVM:A Library for Support Vector Machines[J].Acm Transactions on Intelligent Systems and Technology,2011,2(3):1-27.
[15]张慧,王坤峰,王飞跃.深度学习在目标视觉检测中的应用进展与展望[J].自动化学报,2017,48(8):1289-1305.
[16]唐西林,刘燕娜.几类张量空间的乘积基的结构[J].佛山科学技术学院学报(自然科学版),2018,36(2):1-6.
[17]https://en.wikipedia.org/wiki/Correlation_and_dependence.
[18]LEE K C,HO J,KRIEGMAN D J.Acquiring Linear Subspaces for Face Recognition under Variable Lighting[J].IEEE Tran-sactions on Pattern Analysis & Machine Intelligence,2005,27(5):684-698.
[19]田绍兴,陈劲杰.基于KNN的手写数字的识别[J].农业装备与车辆工程,2017,55(10):96-97.
[20]刘荣荣.基于卷积神经网络的手写数字识别软件的设计与实现[D].包头:内蒙古大学,2015.
[1] 李其烨, 邢红杰.
基于最大相关熵的KPCA异常检测方法
KPCA Based Novelty Detection Method Using Maximum Correntropy Criterion
计算机科学, 2022, 49(8): 267-272. https://doi.org/10.11896/jsjkx.210700175
[2] 阙华坤, 冯小峰, 刘盼龙, 郭文翀, 李健, 曾伟良, 范竞敏.
Grassberger熵随机森林在窃电行为检测的应用
Application of Grassberger Entropy Random Forest to Power-stealing Behavior Detection
计算机科学, 2022, 49(6A): 790-794. https://doi.org/10.11896/jsjkx.210800032
[3] 吴善杰, 王新.
基于AGA-DBSCAN优化的RBF神经网络构造煤厚度预测方法
Prediction of Tectonic Coal Thickness Based on AGA-DBSCAN Optimized RBF Neural Networks
计算机科学, 2021, 48(7): 308-315. https://doi.org/10.11896/jsjkx.200800110
[4] 胡昕彤, 沙朝锋, 刘艳君.
基于随机投影和主成分分析的网络嵌入后处理算法
Post-processing Network Embedding Algorithm with Random Projection and Principal Component Analysis
计算机科学, 2021, 48(5): 124-129. https://doi.org/10.11896/jsjkx.200500058
[5] 王艺皓, 丁洪伟, 李波, 保利勇, 张颖婕.
基于聚类与特征融合的蛋白质亚细胞定位预测
Prediction of Protein Subcellular Localization Based on Clustering and Feature Fusion
计算机科学, 2021, 48(3): 206-213. https://doi.org/10.11896/jsjkx.200200081
[6] 冯安然, 王旭仁, 汪秋云, 熊梦博.
基于PCA和随机树的数据库异常访问检测
Database Anomaly Access Detection Based on Principal Component Analysis and Random Tree
计算机科学, 2020, 47(9): 94-98. https://doi.org/10.11896/jsjkx.190800056
[7] 史燕燕, 白静.
融合CFCC和Teager能量算子倒谱参数的语音识别
Speech Recognition Combining CFCC and Teager Energy Operators Cepstral Coefficients
计算机科学, 2019, 46(5): 286-289. https://doi.org/10.11896/j.issn.1002-137X.2019.05.044
[8] 张明月, 王静.
基于深度学习的交互似然目标跟踪算法
Interactive Likelihood Target Tracking Algorithm Based on Deep Learning
计算机科学, 2019, 46(2): 279-285. https://doi.org/10.11896/j.issn.1002-137X.2019.02.043
[9] 高忠石, 苏旸, 柳玉东.
基于PCA-LSTM的入侵检测研究
Study on Intrusion Detection Based on PCA-LSTM
计算机科学, 2019, 46(11A): 473-476.
[10] 王鹏飞, 张杭.
欠定条件下基于主成分的亚采样信号重构
Sub-sampling Signal Reconstruction Based on Principal Component Under Underdetermined Conditions
计算机科学, 2019, 46(10): 103-108. https://doi.org/10.11896/jsjkx.190700195
[11] 徐少平,曾小霞,姜尹楠,林官喜,唐祎玲.
基于最小特征值非线性修正的快速噪声水平估计算法
Fast Noise Level Estimation Algorithm Based on Nonlinear Rectification of Smallest Eigenvalue
计算机科学, 2018, 45(7): 219-225. https://doi.org/10.11896/j.issn.1002-137X.2018.07.038
[12] 李小薪, 周元申, 周旋, 李晶晶, 刘志勇.
基于奇异值分解的Gabor遮挡字典学习
Gabor Occlusion Dictionary Learning via Singular Value Decomposition
计算机科学, 2018, 45(6): 275-283. https://doi.org/10.11896/j.issn.1002-137X.2018.06.049
[13] 李姗姗,陈莉,张永新,袁娅婷.
基于RPCA的图像模糊边缘检测算法
Fuzzy Edge Detection Algorithm Based on RPCA
计算机科学, 2018, 45(5): 273-279. https://doi.org/10.11896/j.issn.1002-137X.2018.05.047
[14] 钟菲,杨斌.
基于主成分分析网络的车牌检测方法
License Plate Detection Based on Principal Component Analysis Network
计算机科学, 2018, 45(3): 268-273. https://doi.org/10.11896/j.issn.1002-137X.2018.03.043
[15] 高放,黄樟钦.
基于异构多核并行加速的嵌入式神经网络人脸识别方法
Embedded Neural Network Face Recognition Method Based on Heterogeneous Multicore Parallel Acceleration
计算机科学, 2018, 45(3): 288-293. https://doi.org/10.11896/j.issn.1002-137X.2018.03.047
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!