计算机科学 ›› 2021, Vol. 48 ›› Issue (1): 233-240.doi: 10.11896/jsjkx.200800211

• 人工智能 • 上一篇    下一篇

基于块对角化表示的多视角字典对学习

张帆1,2,3, 贺文琪1,3, 姬红兵3, 李丹萍4, 王磊1,2,3   

  1. 1 西安电子科技大学青岛计算技术研究院 山东 青岛 266000
    2 上海交通大学海洋智能装备与系统教育部重点实验室 上海 200240
    3 西安电子科技大学电子工程学院 西安 710071
    4 西安电子科技大学通信工程学院 西安 710071
  • 收稿日期:2020-08-31 修回日期:2020-10-05 出版日期:2021-01-15 发布日期:2021-01-15
  • 通讯作者: 王磊(leiwang@mail.xidian.edu.cn)
  • 作者简介:1135302752@qq.com
  • 基金资助:
    国家重点研发计划资助项目(2016YFE0207000);国家自然科学基金资助项目(61203137,61401328);陕西省自然科学基础研究计划资助项目(2014JQ8306,2015JM6279)

Multi-view Dictionary-pair Learning Based on Block-diagonal Representation

ZHANG Fan1,2,3, HE Wen-qi1,3, JI Hong-bing3, LI Dan-ping4, WANG Lei1,2,3   

  1. 1 Xidian University Qingdao Institute of Computing Technology,Qingdao,Shandong 266000,China
    2 Key Laboratory of Marine Intelligent Equipment and System of Ministry of Education,Shanghai Jiao Tong University,Shanghai 200240,China
    3 School of Electronic Engineering,Xidian University,Xi'an 710071,China
    4 School of Telecommunications Engineering,Xidian University,Xi'an 710071,China
  • Received:2020-08-31 Revised:2020-10-05 Online:2021-01-15 Published:2021-01-15
  • About author:ZHANG Fan,born in 1995,Ph.D student,is a member of China Compu-ter Federation.His main research interests include dictionary learning and multi-view learning.
    WANG Lei,born in 1979,associate professor,is a member of China Computer Federation.His main research interests include pattern recognition,signal & information processing,machine learning and computer vision.
  • Supported by:
    National Key Research and Development Program of China(2016YFE0207000),National Natural Science Foundation of China (61203137,61401328) and Natural Science Basic Research Program of Shaanxi Province of China(2014JQ8306, 2015JM6279).

摘要: 字典学习作为一种高效的特征学习技术被广泛应用于多视角分类中。现有的多视角字典学习方法大多只利用多视角数据的部分信息,且只学习一种类型的字典。实际上,多视角数据的相关性信息和多样性信息同样重要,且仅考虑一种合成型字典或解析型字典的学习算法不能同时满足处理速度、可解释性以及应用范围的要求。针对上述问题,提出了一种基于块对角化表示的多视角字典对学习方法(Block-Diagonal Representation based Multi-View Dictionary-Pair Learning,BDR-MVDPL),该方法通过引入字典对学习模型获得包含更多对分类有用的信息的表示系数,并通过显式约束使其具有块对角化结构,保证了编码系数矩阵的判别性;然后采用特征融合的方式将所有视角的编码系数进行串联,并将串联后的编码系数回归到对应的标签向量上,使多视角数据的多样性信息和数据相关性能够同时被利用;最后,该算法将字典学习与分类器学习整合到一个框架中,采用迭代求解的方式,交替更新字典对和分类器,使所提方法能够自动完成分类。3个多特征数据集上的实验结果表明,与主流的多视角字典学习算法相比,所提算法在保持低复杂度的同时具有更高的分类准确率。

关键词: 多视角学习, 特征融合, 字典对学习, 字典学习

Abstract: Dictionary learning is widely used in multi-view classification as an efficient feature learning technology.Most multi-view dictionary learning methods either only use part of the information of multi-view data,or only learn one type of dictionary in their frameworks.However,in practice,both the diversity information of multi-view data and the correlation of multi-view data are equally important.A single synthetic dictionary learning scheme or a single analytic dictionary learning scheme cannot meet the requirements of the processing speed,interpretability and application feasibility at the same time.To solve these issues,a novel block-diagonal representation based multi-view dictionary-pair learning framework (BDR-MVDPL) is proposed in this paper.This algorithm obtains the representation coefficients that contain more useful information for classification by introducing dictionary-pair learning model.Firstly,in order to ensure the discriminant ability of the coding coefficient matrix,the proposed me-thod directly enforces a block-diagonal constraint on the coding coefficients with explicit formulation.Then,it adopts a feature fusion strategy to concatenate the coding coefficients of different views,and regresses the concatenated coding coefficients to thecorresponding label vectors.In this way,both the diversity information of multi-view data and the correlation of multi-view data are considered.Finally,it integrates dictionary learning and classifier learning into a unified framework,so that the dictionary-pair and classifier can update alternately in an iterative manner and the whole classification task can be realized automatically.Experiments on several multi-feature datasets show that,compared to other multi-view dictionary learning algorithms,the proposed method achieves competitive performance in terms of classification accuracy,while enjoying a low computational complexity.

Key words: Dictionary learning, Dictionary-pair learning, Feature fusion, Multi-view learning

中图分类号: 

  • TP391
[1] DONG X,ZHU L,SONG X,et al.Adaptive CollaborativeSimilarity Learning for Unsupervised Multi-view Feature Selection[C]//The International Joint Conference on Artificial Intelligence.2018:2064-2070.
[2] LIU D,QIN R,CHEN X,et al.Generation of Three-dimention Vehicle Panorama[J].Computer Science,2017,44(4):302-305.
[3] WANG J H,YAN D Q,LIU D S H,et al.Algorithm with Discriminative Analysis Dictionary Learning by Fusing Extreme Learning Machine[J].Computer Science,2020,47(5):137-143.
[4] LI Y,GUO Y Q,GUO J,et al.CRF with locality-consistent dictionary learning for semantic segmentation[C]//2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR).Kuala Lumpur,2015:509-513.
[5] AHARON M,ELAD M,BRUCKSTEIN A.K-SVD:An algo-rithm for designing overcomplete dictionaries for sparse representation[J].IEEE Transactions on Signal Processing,2006,54(11):4311-4322.
[6] ZHANG Q,LI B X.Discriminative K-SVD for dictionary lear-ning in face recognition[C]//2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR).San Francisco,CA.IEEE,2010:2691-2698.
[7] JIANG Z L,LIN Z,DAVIS L S.Label Consistent K-SVD:Learning a Discriminative Dictionary for Recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2013,35(11):2651-2664.
[8] DONG J,WANG W W,DAI W.Analysis SimCO:A new algorithm for analysis dictionary learning[C]//2014 IEEE International Conference on Acoustics,Speech and Signal Processing (ICASSP),Florence.IEEE,2014:7193-7197.
[9] SHEKHAR S,PATEL V M,CHELLAPPA R.Analysis sparse coding models for image-based classification[C]//2014 IEEE International Conference on Image Processing (ICIP),Paris.IEEE,2014:5207-5211.
[10] GU S H,ZHANG L,ZUO W M,et al.Projective dictionary pair learning for pattern classification[C]//Proc.Conf.Neural Information Processing Systems.Montreal,QC,Canada,2014.
[11] SHI Y,GAO Y,YANG Y,et al.Multimodal Sparse Representa-tion-Based Classification for Lung Needle Biopsy Images[J].IEEE Transactions on Biomedical Engineering,2013,60(10):2675-2685.
[12] JING X Y,HU R M,WU F,et al.Uncorrelated multi-view discrimination dictionary learning for recognition[C]//Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence (AAAI).2014:2787-2795.
[13] JIA Y,SALZMANN M,DARRELL T.Factorized latent spaces with structured sparsity[C]//Advances in neural information processing systems (NIPS).2010:982-990.
[14] ZHENG J J,JIANG Z L,et al.Cross-View Action Recognition via Transferable Dictionary Learning[J].IEEE Transactions on Image Processing,2016,25(6):2542-2556.
[15] ZHUANG Y,WANG Y,WU F,et al.Supervised coupled dictionary learning with group structures for multi-modal retrieval[C]//AAAI Conference on Artificial Intelligence (AAAI).2013:1070-1076.
[16] ZHENG J,JIANG Z.Learning View-Invariant Sparse Representations for Cross-View Action Recognition[C]// IEEE International Conference on Computer Vision.IEEE,2013:3176-3183.
[17] ZHANG H C,NASRABADI N M,ZHANG Y N,et al.Multi-observation visual recognition via joint dynamic sparse representation[C]//2011 International Conference on Computer Vision(ICCV).Barcelona:IEEE,2011:595-602.
[18] JING X Y,HU R M,WU F,et al.Uncorrelated multi-view discrimination dictionary learning for recognition[C]//Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence (AAAI).2014:2787-2795.
[19] GU S,ZHANG L,ZUO W,et al.Projective dictionary pairlearning for pattern classification[C]//Neural Information Processing Systems.2014:793-801.
[20] CHEN B,LI J,MA B,et al.Discriminative dictionary pair lear-ning based on differentiable support vector function for visual reco-gnition[J].Neurocomputing,2017,272(10):306-313.
[21] SUN Y,ZHANG Z,JIANG W,et al.Robust Discriminative Projective Dictionary Pair Learning by Adaptive Representations[C]//International Conference on Pattern Recognition.2018:621-626.
[22] ZHU X,JING X,WU F,et al.Multi-Kernel Low-Rank Dictionary Pair Learning for Multiple Features Based Image Classification[C]//National Conference on Artificial Intelligence.2017:2970-2976.
[23] WANG Q Y,GUO Y Q,WANG J J,et al.Multi-View Analysis Dictionary Learning for Image Classification[J].IEEE Access,2018,6:20174-20183.
[24] LI Z M,LAI Z H,XU Y,et al.A Locality-Constrained and Label Embedding Dictionary Learning Algorithm for Image Classification[J].IEEE Transactions on Neural Networks and Lear-ning Systems,2017,28(2):278-293.
[25] LI Z M,ZHANG Z,QIN J,et al.Discriminative Fisher Embedding Dictionary Learning Algorithm for Object Recognition[J].IEEE Transactions on Neural Networks and Learning Systems,2020,31(3):786-800.
[26] SHEKHAR S,PATEL V M,NASRABADI N M,et al.JointSparse Representation for Robust Multimodal Biometrics Recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2014,36(1):113-126.
[27] BAHRAMPOUR S,NASRABADI N M,RAY A,et al.Multimodal Task-Driven Dictionary Learning for Image Classification[J].IEEE Transactions on Image Processing,2016,25(1):24-38.
[28] HAGHIGHAT M,ABDEL-MOTTALEB M,ALHALABI W.Discriminant Correlation Analysis:Real-Time Feature Level Fusion for Multimodal Biometric Recognition[J].IEEE Transactions on Information Forensics and Security,2016,11(9):1984-1996.
[29] LI Y,NIE F,HUANG H,et al.Large-scale multi-view spectral clustering via bipartite graph[C]//Twenty-ninth Aaai Confe-rence on Artificial Intelligence.AAAI Press,2015.
[30] FEI-FEI L,FERGUS R,PERONA P.Learning generative visual models from few training examples:An incremental bayesian approach tested on 101 object categories[J].Computer Vision and Image Understanding,2007,106(1):59-70.
[31] WANG L,LI M,JI H,et al.When collaborative representation meets subspace projection:A novel supervised framework of graph construction augmented by anti-collaborative representation[J].Neurocomputing,2019,328:157-170.
[1] 张颖涛, 张杰, 张睿, 张文强.
全局信息引导的真实图像风格迁移
Photorealistic Style Transfer Guided by Global Information
计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036
[2] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[3] 郁舒昊, 周辉, 叶春杨, 王太正.
SDFA:基于多特征融合的船舶轨迹聚类方法研究
SDFA:Study on Ship Trajectory Clustering Method Based on Multi-feature Fusion
计算机科学, 2022, 49(6A): 256-260. https://doi.org/10.11896/jsjkx.211100253
[4] 杨玥, 冯涛, 梁虹, 杨扬.
融合交叉注意力机制的图像任意风格迁移
Image Arbitrary Style Transfer via Criss-cross Attention
计算机科学, 2022, 49(6A): 345-352. https://doi.org/10.11896/jsjkx.210700236
[5] 陈永平, 朱建清, 谢懿, 吴含笑, 曾焕强.
基于外接圆半径差损失的实时安全帽检测算法
Real-time Helmet Detection Algorithm Based on Circumcircle Radius Difference Loss
计算机科学, 2022, 49(6A): 424-428. https://doi.org/10.11896/jsjkx.220100252
[6] 孙洁琪, 李亚峰, 张文博, 刘鹏辉.
基于离散小波变换的双域特征融合深度卷积神经网络
Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation
计算机科学, 2022, 49(6A): 434-440. https://doi.org/10.11896/jsjkx.210900199
[7] 蓝凌翔, 池明旻.
基于特征注意力融合网络的遥感变化检测研究
Remote Sensing Change Detection Based on Feature Fusion and Attention Network
计算机科学, 2022, 49(6): 193-198. https://doi.org/10.11896/jsjkx.210500058
[8] 李发光, 伊力哈木·亚尔买买提.
基于改进CenterNet的航拍绝缘子缺陷实时检测模型
Real-time Detection Model of Insulator Defect Based on Improved CenterNet
计算机科学, 2022, 49(5): 84-91. https://doi.org/10.11896/jsjkx.210400142
[9] 董奇达, 王喆, 吴松洋.
结合注意力机制与几何信息的特征融合框架
Feature Fusion Framework Combining Attention Mechanism and Geometric Information
计算机科学, 2022, 49(5): 129-134. https://doi.org/10.11896/jsjkx.210300180
[10] 李鹏祖, 李瑶, Ibegbu Nnamdi JULIAN, 孙超, 郭浩, 陈俊杰.
基于多特征融合的重叠组套索脑功能超网络构建及分类
Construction and Classification of Brain Function Hypernetwork Based on Overlapping Group Lasso with Multi-feature Fusion
计算机科学, 2022, 49(5): 206-211. https://doi.org/10.11896/jsjkx.210300049
[11] 范新南, 赵忠鑫, 严炜, 严锡君, 史朋飞.
结合注意力机制的多尺度特征融合图像去雾算法
Multi-scale Feature Fusion Image Dehazing Algorithm Combined with Attention Mechanism
计算机科学, 2022, 49(5): 50-57. https://doi.org/10.11896/jsjkx.210400093
[12] 高心悦, 田汉民.
基于改进U-Net网络的液滴分割方法
Droplet Segmentation Method Based on Improved U-Net Network
计算机科学, 2022, 49(4): 227-232. https://doi.org/10.11896/jsjkx.210300193
[13] 徐涛, 陈奕仁, 吕宗磊.
基于改进YOLOv3的机坪工作人员反光背心检测研究
Study on Reflective Vest Detection for Apron Workers Based on Improved YOLOv3 Algorithm
计算机科学, 2022, 49(4): 239-246. https://doi.org/10.11896/jsjkx.210200119
[14] 许华杰, 秦远卓, 杨洋.
基于多级特征融合与注意力模块的场景识别方法
Scene Recognition Method Based on Multi-level Feature Fusion and Attention Module
计算机科学, 2022, 49(4): 209-214. https://doi.org/10.11896/jsjkx.210100135
[15] 杨晓宇, 殷康宁, 候少麒, 杜文仪, 殷光强.
基于特征定位与融合的行人重识别算法
Person Re-identification Based on Feature Location and Fusion
计算机科学, 2022, 49(3): 170-178. https://doi.org/10.11896/jsjkx.210100132
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!