计算机科学 ›› 2026, Vol. 53 ›› Issue (4): 66-77.doi: 10.11896/jsjkx.251000012
黄贝贝, 刘进锋
HUANG Beibei, LIU Jinfeng
摘要: 深度学习模型由于其“黑盒”特性,特征表示缺乏可解释性。现有的解耦表征学习方法虽然在一定程度上能够通过识别数据中的独立因素来增强模型的解释能力,但它们通常忽视了数据中的复杂关联性及潜在因果结构,从而限制了模型在自动驾驶、医疗诊断等关键领域的应用,特别是在需要理解和干预因果关系的场景中表现不佳。针对当前解耦表征学习中因果关系建模不足的问题,提出了一种融合稀疏编码与因果推断的解耦表征学习框架。该框架在适当监督下通过因果推断机制精准建模数据中的因果关系,不仅能够生成高质量结构化表征,更具备对潜在因果机制的建模与干预能力,进而显著提升模型在因果任务中的适应性与鲁棒性;同时通过嵌入的卷积稀疏编码层施加稀疏性约束,有效筛选与因果结构高度相关的关键表征,进一步强化模型对高阶因果关系的敏感度与表达能力。实验结果表明,该框架在Pendulum和CelebA数据集上表现出色。样本效率在Pendulum数据集上达98.65%,在CelebA数据集上达99.55%,此外,在因果干预有效性和分布鲁棒性方面优于现有方法,证实了该方法在复杂因果场景下的优越性。
中图分类号:
| [1]CHENG K Y,MENG C Y,WANG W S,et al.Research Progress in Disentangled Representation Learning[J].Journal of Computer Applications,2021,41(12):3409-3418. [2]HIGGINS I,AMOS D,PFAU D,et al.Towards a definition ofdisentangled representations[J].arXiv:1812.02230,2018. [3]KINGMA D P,WELLING M.Auto-encoding variational Bayes[J].arXiv:1312.6114,2013. [4]GOODFELLOW I J,POUGET-ABADIE J,MIRZA M,et al.Generative adversarial nets[C]//Proceedings of the 28th International Conference on Neural Information Processing Systems.Cambridge,MA:MIT Press,2014:2672-2680. [5]SIKKA H D.A Deeper Look at the Unsupervised Learning of Disentangled Representations in β-VAE from the perspective of Core Object Recognition[M].Harvard University,2020. [6]PEARL J.Models,reasoning and inference[J].Cambridge,UK:Cambridge University Press,2000,19(2):3. [7]YANG T S.Disentangled Representation Learning Based onCausal Inference[D].Qingdao:Qingdao University of Science and Technology,2023. [8]SCHÖLKOPF B,LOCATELLO F,BAUER S,et al.Towardcausal representation learning[J].Proceedings of the IEEE,2021,109(5):612-634. [9]WEN Z D,WANG J R,WANG X X,et al.A Survey on Disentangled Representation Learning[J].Acta Automatica Sinica,2022,48(2):351-374. [10]HIGGINS I,MATTHEY L,PAL A,et al.beta-vae:Learningbasic visual concepts with a constrained variational framework[C]//International Conference on Learning Representations.2017. [11]CHEN X,DUAN Y,HOUTHOOFT R,et al.Infogan:Inter-pretable representation learning by information maximizing generative adversarial nets[C]//Advances in Neural Information Processing Systems.2016. [12]CHEN R T Q,LI X,GROSSE R,et al.Isolating sources of disentanglement in VAEs[C]//Proceedings of the 32nd International Conference on Neural Information Processing Systems.2018:2615-2625. [13]REZAABAD A L,VISHWANATH S.Learning representations by maximizing mutual information in variational autoencoders[C]//2020 IEEE International Symposium on Information Theory(ISIT).IEEE,2020:2729-2734. [14]LOCATELLO F,BAUER S,LUCIC M,et al.Challenging common assumptions in the unsupervised learning of disentangled representations[C]//International Conference on Machine Learning.PMLR,2019:4114-4124. [15]SUTER R,MILADINOVIC D,SCHÖLKOPF B,et al.Robustly disentangled causal mechanisms:Validating deep representations for interventional robustness[C]//International Conference on Machine Learning.PMLR,2019:6056-6065. [16]REDDY A G,BALASUBRAMANIAN V N.On causally disentangled representations[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2022:8089-8097. [17]YANG M,LIU F,CHEN Z,et al.Causalvae:Disentangled representation learning via neural structural causal models[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition.2021:9593-9602. [18]SHEN X,LIU F,DONG H,et al.Weakly supervised disentangled generative causal representation learning[J].Journal of Machine Learning Research,2022,23(241):1-55. [19]KOMANDURI A,WU Y,CHEN F,et al.Learning CausallyDisentangled Representations via the Principle of Independent Causal Mechanisms[J].arXiv:2306.01213,2023. [20]YOU D,LI Z,SHEN J,et al.Disentangled representation learning with causal effect transmission in variational autoencoder[J].Pattern Recognition,2026,170:112018. [21]NASR-ESFAHANY A,ALIZADEH M,SHAH D.Counterfactual identifiability of bijective causal models[C]//International Conference on Machine Learning.PMLR,2023:25733-25754. [22]LI M,ZHAI P,TONG S,et al.Revisiting sparse convolutional model for visual recognition[J].Advances in Neural Information Processing Systems,2022,35:10492-10504. [23]PAPAMAKARIOS G,NALISNICK E,REZENDE D J,et al.Normalizing flows for probabilistic modeling and inference[J].Journal of Machine Learning Research,2021,22(57):1-64. [24]KHEMAKHEM I,MONTI R,LEECH R,et al.Causal autoregressive flows[C]//International Conference on Artificial Intelligence and Statistics.PMLR,2021:3520-3528. [25]LIU Z,LUO P,WANG X,et al.Deep learning face attributes in the wild[C]//Proceedings of the IEEE International Conference on Computer Vision.2015:3730-3738. [26]GONDAL M W,WUTHRICH M,MILADINOVIC D,et al.On the transfer of inductive bias from simulation to the real world:a new disentanglement dataset[C]//Advances in Neural Information Processing Systems.2019. [27]KINNEY J B,ATWAL G S.Equitability,mutual information,and the maximal information coefficient[C]//Proceedings of the National Academy of Sciences.2014:3354-3359. |
| [1] | 刘钰婷, 顾晶晶, 周强. 基于结构因果模型的城市出行流量预测方法 Urban Flow Prediction Method Based on Structural Causal Model 计算机科学, 2025, 52(10): 70-78. https://doi.org/10.11896/jsjkx.241000088 |
| [2] | 吕佳豪, 刘进锋. 基于因果关系的领域泛化长尾学习 Domain Generalization and Long-tailed Learning Based on Causal Relationships 计算机科学, 2024, 51(11A): 240300041-8. https://doi.org/10.11896/jsjkx.240300041 |
| [3] | 黄露, 倪葎, 金澈清. 基于双视角纠偏的推荐模型 Rectifying Dual Bias for Recommendation 计算机科学, 2023, 50(9): 152-159. https://doi.org/10.11896/jsjkx.220900035 |
| [4] | 张涛, 程毅飞, 孙欣煦. 基于因果推断的图注意力网络 Graph Attention Networks Based on Causal Inference 计算机科学, 2023, 50(6A): 220600230-9. https://doi.org/10.11896/jsjkx.220600230 |
| [5] | 梁美彦, 张宇, 梁建安, 陈庆辉, 王茹, 王琳. 基于稀疏编码非局部注意力对偶网络的病理图像超分辨率重建 Pathological Image Super-resolution Reconstruction Based on Sparse Coding Non-local AttentionDual Network 计算机科学, 2023, 50(6A): 220700016-8. https://doi.org/10.11896/jsjkx.220700016 |
| [6] | 尹海涛, 王天由. 基于深度多尺度卷积稀疏编码的图像去噪算法 Image Denoising Algorithm Based on Deep Multi-scale Convolution Sparse Coding 计算机科学, 2023, 50(4): 133-140. https://doi.org/10.11896/jsjkx.220100090 |
| [7] | 袁振, 刘进锋. 一种基于因果推理的垃圾分类方法 Novel Method for Trash Classification Based on Causal Inference 计算机科学, 2023, 50(11A): 220800218-6. https://doi.org/10.11896/jsjkx.220800218 |
| [8] | 刘楠, 张凤荔, 尹嘉奇, 陈学勤, 王瑞锦. 基于边推断增强对比学习的社交媒体谣言检测模型 Rumor Detection Model on Social Media Based on Contrastive Learning with Edge-inferenceAugmentation 计算机科学, 2023, 50(11): 49-54. https://doi.org/10.11896/jsjkx.221000043 |
| [9] | 徐佳庆, 万文, 吕启. 基于二阶矩稀疏编码的高光谱遥感图像分类 Classification of Hyperspectral Remote Sensing Imagery Based on Second Order Moment Sparse Coding 计算机科学, 2018, 45(9): 288-293. https://doi.org/10.11896/j.issn.1002-137X.2018.09.048 |
| [10] | 刘颖,张明慧,阳维,卢振泰,冯前进,苏榆生. 基于稀疏编码与方向-尺度描述子的海马体自动分割 Hippocampus Segmentation Based on Spare Coding and Orientation-Scale Descriptor 计算机科学, 2017, 44(1): 314-320. https://doi.org/10.11896/j.issn.1002-137X.2017.01.058 |
| [11] | 徐煜明,宋佳伟,肖贤建. 基于亚像素块匹配和字典学习的超分辨率算法 Super Resolution Algorithm Based on Sub-pixel Block Matching and Dictionary Learning 计算机科学, 2016, 43(8): 304-308. https://doi.org/10.11896/j.issn.1002-137X.2016.08.062 |
| [12] | 张旭,蒋建国,洪日昌,杜跃. 基于低秩稀疏分解与协作表示的图像分类算法 Image Classification Algorithm Based on Low Rank and Sparse Decomposition and Collaborative Representation 计算机科学, 2016, 43(7): 83-88. https://doi.org/10.11896/j.issn.1002-137X.2016.07.014 |
| [13] | 施静兰,常侃,张智勇,覃团发. 人脸识别中基于系数相似性的字典学习算法 Coefficient-similarity-based Dictionary Learning Algorithm for Face Recognition 计算机科学, 2016, 43(6): 298-302. https://doi.org/10.11896/j.issn.1002-137X.2016.06.059 |
| [14] | 王瑞霞,彭国华,郑红婵. 拉普拉斯稀疏编码的图像检索算法 Image Retrieval Algorithm Based on Laplacian Sparse Coding 计算机科学, 2014, 41(8): 278-280. https://doi.org/10.11896/j.issn.1002-137X.2014.08.058 |
| [15] | 刘洋,李一波. 基于线性动态系统稀疏编码的异常事件检测 Abnormal Event Detection Using Linear Dynamical System Combined with Sparse Coding 计算机科学, 2014, 41(10): 300-305. https://doi.org/10.11896/j.issn.1002-137X.2014.10.063 |
|
||