Computer Science (计算机科学), 2021, Vol. 48, Issue (1): 233-240. doi: 10.11896/jsjkx.200800211
ZHANG Fan (张帆)1,2,3, HE Wen-qi (贺文琪)1,3, JI Hong-bing (姬红兵)3, LI Dan-ping (李丹萍)4, WANG Lei (王磊)1,2,3
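Abstract: Dictionary learning is an efficient feature learning technique that is widely used in multi-view classification. Most existing multi-view dictionary learning methods exploit only part of the information in multi-view data and learn only one type of dictionary. In fact, the correlation information and the diversity information of multi-view data are equally important, and an algorithm that learns only a synthesis dictionary or only an analysis dictionary cannot simultaneously meet the requirements of processing speed, interpretability, and range of application. To address these problems, a block-diagonal representation based multi-view dictionary-pair learning method (Block-Diagonal Representation based Multi-View Dictionary-Pair Learning, BDR-MVDPL) is proposed. The method introduces a dictionary-pair learning model to obtain representation coefficients that carry more information useful for classification, and imposes an explicit constraint that gives these coefficients a block-diagonal structure, which ensures that the coding coefficient matrix is discriminative. Feature fusion is then used to concatenate the coding coefficients of all views, and the concatenated coefficients are regressed onto the corresponding label vectors, so that both the diversity information and the correlation of the multi-view data are exploited. Finally, the algorithm integrates dictionary learning and classifier learning into a single framework and solves it iteratively, alternately updating the dictionary pairs and the classifier, so that the proposed method performs classification automatically. Experimental results on three multi-feature datasets show that, compared with mainstream multi-view dictionary learning algorithms, the proposed algorithm achieves higher classification accuracy while keeping complexity low.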
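The pipeline outlined in the abstract (per-view dictionary pairs, block-diagonal codes, concatenation of codes across views, regression onto labels) can be illustrated with a small NumPy sketch. This is not the BDR-MVDPL algorithm itself: the abstract does not give the objective function or the update rules, so the sketch makes several simplifying assumptions. The block-diagonal structure is approximated by regressing each view onto a fixed 0/1 target code matrix `Q`, every sub-problem is solved once by closed-form ridge regression instead of alternating updates, and all names (`block_diagonal_target`, `ridge`, `fit`, `predict`, `lam`, `atoms_per_class`) and the synthetic data are hypothetical.

```python
import numpy as np


def block_diagonal_target(labels, n_classes, atoms_per_class):
    """Target code matrix Q (atoms x samples): each sample activates
    only the atoms assigned to its own class, so Q is block-diagonal
    when samples are sorted by class. (Assumed stand-in for the
    paper's explicit block-diagonal constraint.)"""
    Q = np.zeros((n_classes * atoms_per_class, labels.size))
    for j, c in enumerate(labels):
        Q[c * atoms_per_class:(c + 1) * atoms_per_class, j] = 1.0
    return Q


def ridge(A, B, lam):
    """Closed-form minimiser of ||B - M A||_F^2 + lam * ||M||_F^2,
    i.e. M = B A^T (A A^T + lam I)^(-1)."""
    d = A.shape[0]
    return np.linalg.solve(A @ A.T + lam * np.eye(d), A @ B.T).T


def fit(views, labels, n_classes, atoms_per_class=2, lam=1e-2):
    """views: list of (features x samples) arrays, one per view."""
    Q = block_diagonal_target(labels, n_classes, atoms_per_class)
    Y = np.eye(n_classes)[:, labels]      # one-hot labels, classes x samples
    P, D, codes = [], [], []
    for X in views:
        Pv = ridge(X, Q, lam)             # analysis dictionary:  Pv @ X  ~ Q
        Av = Pv @ X                       # coding coefficients of this view
        Dv = ridge(Av, X, lam)            # synthesis dictionary: Dv @ Av ~ X
        P.append(Pv)
        D.append(Dv)
        codes.append(Av)
    Z = np.vstack(codes)                  # feature fusion by concatenation
    W = ridge(Z, Y, lam)                  # linear classifier:    W @ Z   ~ Y
    return P, D, W


def predict(P, W, views):
    Z = np.vstack([Pv @ X for Pv, X in zip(P, views)])
    return np.argmax(W @ Z, axis=0)


def make_views(rng, means_per_view, labels, noise=0.1):
    """Synthetic data: each view is a class mean plus Gaussian noise."""
    return [m[labels].T + noise * rng.normal(size=(m.shape[1], labels.size))
            for m in means_per_view]


rng = np.random.default_rng(0)
n_classes = 3
means = [5.0 * rng.normal(size=(n_classes, 10)),   # view 1: 10 features
         5.0 * rng.normal(size=(n_classes, 8))]    # view 2: 8 features
train_labels = np.repeat(np.arange(n_classes), 20)
test_labels = np.repeat(np.arange(n_classes), 10)

P, D, W = fit(make_views(rng, means, train_labels), train_labels, n_classes)
pred = predict(P, W, make_views(rng, means, test_labels))
acc = float(np.mean(pred == test_labels))
print("held-out accuracy on synthetic data:", acc)
```

At prediction time only the analysis dictionaries `P` and the classifier `W` are needed, so classifying a sample costs a few matrix products. This is the usual reason dictionary-pair models are considered fast compared with synthesis-only sparse coding. The synthesis dictionaries `D` are kept here only to show the pair structure.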