计算机科学 ›› 2018, Vol. 45 ›› Issue (11A): 278-281.
韩旭, 刘强, 许瑾, 谌海云
HAN Xu, LIU Qiang, XU Jin, CHEN Hai-yun
摘要: PCA(Principal Component Analysis)是最重要的数据降维算法之一,针对降维过程出现的信息丢失问题,学术界说法不一。基于此,文中提出了一种新的改进算法(Similar Principal Component Analysis,SPCA),新算法在处理过程中保留了部分细节信息。以手写数字(MNIST)数据库为例,将原始向量组进行临近特征筛选,得出多维复合非正交特征向量组;将训练库所得的向量组与测试集的向量组进行比对,识别出所测试的手写数字。结果表明,该算法能够以较少量的训练样本实现对测试样本的较为完全的识别。
中图分类号:
[1]PEARSON K.On lines and planes of closest fit[J].Philosophical Magazine,1901,6:559-572. [2]KIRBY M,SIROVICH L.Application of the Karhunen-Loeve procedure for the characterization of human faces[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,1990,12(1):103-108. [3]TURK M,PENTLAND A.Eigenfaces for recognition[J].Journal of Cognitive Neuroscience,1991,3(1):71-86. [4]TURK M A,PENTLAND A P.Face recognition using eigenfa-ces[J].Journal of Cognitive Neurosience,1991,7:71-86. [5]LU H,PLATANIOTIS K N,VENETSANOPOULOS A N. MPCA:Multilinear Principal Component Analysis of Tensor Objects[J].IEEE Transactions on Neural Networks,2008,19(1):18. [6]YANG J,FRANGI A F,YANG J Y,et al.KPCA plus LDA:a complete kernel Fisher discriminant framework for feature extraction and recognition[J].IEEE Trans. Pattern Anal. Mach. Intell.,2005,27(2):230-244. [7]CAO L J,CHUA K S,CHONG W K,et al.A comparison of PCA,KPCA and ICA for dimensionality reduction in support vector machine[J].Neurocomputing,2003,55(1):321-336. [8]任佳佳.稀疏主成分分析的相关研究[D].西安:西安工程大学,2017. [9]谭亚芳,刘娟,王才华,等.一种稀疏可控的主成分分析方法[J].计算机科学,2017,44(1):243-246. [10]闫荣华,彭进业,汶德胜.PCA与2DPCA的关系[J].计算机科学,2017,44(S2):202-206,216. [11]HASTIE T,FRIEDMAN J,TIBSHIRANI R.The Elements of Statistical Learning[J].Technometrics,2001,45(3):267-268. [12]http://yann.lecun.com/exdb/mnist. [13]HSU C W,LIN C J.A comparison of methods for multiclass support vector machines[J].IEEE Transactions on Neural Networks,2002,13(2):415-425. [14]CHANG C C,LIN C J.LIBSVM:A Library for Support Vector Machines[J].Acm Transactions on Intelligent Systems and Technology,2011,2(3):1-27. [15]张慧,王坤峰,王飞跃.深度学习在目标视觉检测中的应用进展与展望[J].自动化学报,2017,48(8):1289-1305. [16]唐西林,刘燕娜.几类张量空间的乘积基的结构[J].佛山科学技术学院学报(自然科学版),2018,36(2):1-6. [17]https://en.wikipedia.org/wiki/Correlation_and_dependence. [18]LEE K C,HO J,KRIEGMAN D J.Acquiring Linear Subspaces for Face Recognition under Variable Lighting[J].IEEE Tran-sactions on Pattern Analysis & Machine Intelligence,2005,27(5):684-698. [19]田绍兴,陈劲杰.基于KNN的手写数字的识别[J].农业装备与车辆工程,2017,55(10):96-97. [20]刘荣荣.基于卷积神经网络的手写数字识别软件的设计与实现[D].包头:内蒙古大学,2015. |
[1] | 李其烨, 邢红杰. 基于最大相关熵的KPCA异常检测方法 KPCA Based Novelty Detection Method Using Maximum Correntropy Criterion 计算机科学, 2022, 49(8): 267-272. https://doi.org/10.11896/jsjkx.210700175 |
[2] | 阙华坤, 冯小峰, 刘盼龙, 郭文翀, 李健, 曾伟良, 范竞敏. Grassberger熵随机森林在窃电行为检测的应用 Application of Grassberger Entropy Random Forest to Power-stealing Behavior Detection 计算机科学, 2022, 49(6A): 790-794. https://doi.org/10.11896/jsjkx.210800032 |
[3] | 吴善杰, 王新. 基于AGA-DBSCAN优化的RBF神经网络构造煤厚度预测方法 Prediction of Tectonic Coal Thickness Based on AGA-DBSCAN Optimized RBF Neural Networks 计算机科学, 2021, 48(7): 308-315. https://doi.org/10.11896/jsjkx.200800110 |
[4] | 胡昕彤, 沙朝锋, 刘艳君. 基于随机投影和主成分分析的网络嵌入后处理算法 Post-processing Network Embedding Algorithm with Random Projection and Principal Component Analysis 计算机科学, 2021, 48(5): 124-129. https://doi.org/10.11896/jsjkx.200500058 |
[5] | 王艺皓, 丁洪伟, 李波, 保利勇, 张颖婕. 基于聚类与特征融合的蛋白质亚细胞定位预测 Prediction of Protein Subcellular Localization Based on Clustering and Feature Fusion 计算机科学, 2021, 48(3): 206-213. https://doi.org/10.11896/jsjkx.200200081 |
[6] | 冯安然, 王旭仁, 汪秋云, 熊梦博. 基于PCA和随机树的数据库异常访问检测 Database Anomaly Access Detection Based on Principal Component Analysis and Random Tree 计算机科学, 2020, 47(9): 94-98. https://doi.org/10.11896/jsjkx.190800056 |
[7] | 史燕燕, 白静. 融合CFCC和Teager能量算子倒谱参数的语音识别 Speech Recognition Combining CFCC and Teager Energy Operators Cepstral Coefficients 计算机科学, 2019, 46(5): 286-289. https://doi.org/10.11896/j.issn.1002-137X.2019.05.044 |
[8] | 张明月, 王静. 基于深度学习的交互似然目标跟踪算法 Interactive Likelihood Target Tracking Algorithm Based on Deep Learning 计算机科学, 2019, 46(2): 279-285. https://doi.org/10.11896/j.issn.1002-137X.2019.02.043 |
[9] | 高忠石, 苏旸, 柳玉东. 基于PCA-LSTM的入侵检测研究 Study on Intrusion Detection Based on PCA-LSTM 计算机科学, 2019, 46(11A): 473-476. |
[10] | 王鹏飞, 张杭. 欠定条件下基于主成分的亚采样信号重构 Sub-sampling Signal Reconstruction Based on Principal Component Under Underdetermined Conditions 计算机科学, 2019, 46(10): 103-108. https://doi.org/10.11896/jsjkx.190700195 |
[11] | 徐少平,曾小霞,姜尹楠,林官喜,唐祎玲. 基于最小特征值非线性修正的快速噪声水平估计算法 Fast Noise Level Estimation Algorithm Based on Nonlinear Rectification of Smallest Eigenvalue 计算机科学, 2018, 45(7): 219-225. https://doi.org/10.11896/j.issn.1002-137X.2018.07.038 |
[12] | 李小薪, 周元申, 周旋, 李晶晶, 刘志勇. 基于奇异值分解的Gabor遮挡字典学习 Gabor Occlusion Dictionary Learning via Singular Value Decomposition 计算机科学, 2018, 45(6): 275-283. https://doi.org/10.11896/j.issn.1002-137X.2018.06.049 |
[13] | 李姗姗,陈莉,张永新,袁娅婷. 基于RPCA的图像模糊边缘检测算法 Fuzzy Edge Detection Algorithm Based on RPCA 计算机科学, 2018, 45(5): 273-279. https://doi.org/10.11896/j.issn.1002-137X.2018.05.047 |
[14] | 钟菲,杨斌. 基于主成分分析网络的车牌检测方法 License Plate Detection Based on Principal Component Analysis Network 计算机科学, 2018, 45(3): 268-273. https://doi.org/10.11896/j.issn.1002-137X.2018.03.043 |
[15] | 高放,黄樟钦. 基于异构多核并行加速的嵌入式神经网络人脸识别方法 Embedded Neural Network Face Recognition Method Based on Heterogeneous Multicore Parallel Acceleration 计算机科学, 2018, 45(3): 288-293. https://doi.org/10.11896/j.issn.1002-137X.2018.03.047 |
|