计算机科学 ›› 2026, Vol. 53 ›› Issue (4): 66-77.doi: 10.11896/jsjkx.251000012

• 人工智能与理论计算机科学交叉融合 • 上一篇    下一篇

融合稀疏编码的因果解耦表征学习

黄贝贝, 刘进锋   

  1. 宁夏大学信息工程学院 银川 750021
  • 收稿日期:2025-10-09 修回日期:2026-01-19 出版日期:2026-04-15 发布日期:2026-04-08
  • 通讯作者: 刘进锋(jfliu@nxu.edu.cn)
  • 作者简介:(12023132024@stu.nxu.edu.cn)
  • 基金资助:
    宁夏自然科学基金(2025AAC030154)

Causal Disentangled Representation Learning with Integrated Sparse Coding

HUANG Beibei, LIU Jinfeng   

  1. School of Information Engineering, Ningxia University, Yinchuan 750021, China
  • Received:2025-10-09 Revised:2026-01-19 Published:2026-04-15 Online:2026-04-08
  • About author:HUANG Beibei,born in 2001,master candidate,is a student member of CCF(No.A06163G).Her main research interests include computer vision and representation learning.
    LIU Jinfeng,born in 1971,Ph.D,professor,is a professional member of CCF(No.75718M).His main research interests include deep learning andhete-rogeneous computing.
  • Supported by:
    Natural Science Foundation of Ningxia(2025AAC030154).

摘要: 深度学习模型由于其“黑盒”特性,特征表示缺乏可解释性。现有的解耦表征学习方法虽然在一定程度上能够通过识别数据中的独立因素来增强模型的解释能力,但它们通常忽视了数据中的复杂关联性及潜在因果结构,从而限制了模型在自动驾驶、医疗诊断等关键领域的应用,特别是在需要理解和干预因果关系的场景中表现不佳。针对当前解耦表征学习中因果关系建模不足的问题,提出了一种融合稀疏编码与因果推断的解耦表征学习框架。该框架在适当监督下通过因果推断机制精准建模数据中的因果关系,不仅能够生成高质量结构化表征,更具备对潜在因果机制的建模与干预能力,进而显著提升模型在因果任务中的适应性与鲁棒性;同时通过嵌入的卷积稀疏编码层施加稀疏性约束,有效筛选与因果结构高度相关的关键表征,进一步强化模型对高阶因果关系的敏感度与表达能力。实验结果表明,该框架在Pendulum和CelebA数据集上表现出色。样本效率在Pendulum数据集上达98.65%,在CelebA数据集上达99.55%,此外,在因果干预有效性和分布鲁棒性方面优于现有方法,证实了该方法在复杂因果场景下的优越性。

关键词: 稀疏编码, 因果推断, 解耦表征学习, 样本效率, 分布鲁棒性

Abstract: Deep learning models often lack of interpretability in their feature representations due to their “black-box” nature.Although existing disentangled representation learning methods can enhance interpretability to some extent by identifying independent factors within the data,they usually neglect complex correlations and potential causal structures,which limits their applicability in critical domains such as autonomous driving and medical diagnosis,especially in scenarios that require understanding and intervention of causal relationships.To address the insufficient causal modeling in current disentangled representation learning,a disentanglement framework integrating sparse coding with causal inference is constructed.Under appropriate supervision,this framework leverages a causal inference mechanism to precisely model causal relationships within the data,thereby not only generating high-quality and structured representations but also enabling the modeling and intervention of potential causal mechanisms,which significantly improves the model’s adaptability and robustness in causal tasks.Meanwhile,the embedded convolutional sparse coding layer imposes sparsity constraints to effectively filter key representations highly relevant to causal structures,further enhancing the model’s sensitivity and expressive capacity for higher-order causal relationships.Experimental results demonstrate that the proposed framework performs excellently on both the Pendulum and CelebA datasets,achieving a sample efficiency of 98.65% on the Pendulum dataset and 99.55% on the CelebA dataset.Moreover,it outperforms existing methods in terms of causal intervention effectiveness and distribution robustness,confirming its superiority in complex causal scenarios.

Key words: Sparse coding, Causal inference, Disentangled representation learning, Sample efficiency, Distribution robustness

中图分类号: 

  • TP181
[1]CHENG K Y,MENG C Y,WANG W S,et al.Research Progress in Disentangled Representation Learning[J].Journal of Computer Applications,2021,41(12):3409-3418.
[2]HIGGINS I,AMOS D,PFAU D,et al.Towards a definition ofdisentangled representations[J].arXiv:1812.02230,2018.
[3]KINGMA D P,WELLING M.Auto-encoding variational Bayes[J].arXiv:1312.6114,2013.
[4]GOODFELLOW I J,POUGET-ABADIE J,MIRZA M,et al.Generative adversarial nets[C]//Proceedings of the 28th International Conference on Neural Information Processing Systems.Cambridge,MA:MIT Press,2014:2672-2680.
[5]SIKKA H D.A Deeper Look at the Unsupervised Learning of Disentangled Representations in β-VAE from the perspective of Core Object Recognition[M].Harvard University,2020.
[6]PEARL J.Models,reasoning and inference[J].Cambridge,UK:Cambridge University Press,2000,19(2):3.
[7]YANG T S.Disentangled Representation Learning Based onCausal Inference[D].Qingdao:Qingdao University of Science and Technology,2023.
[8]SCHÖLKOPF B,LOCATELLO F,BAUER S,et al.Towardcausal representation learning[J].Proceedings of the IEEE,2021,109(5):612-634.
[9]WEN Z D,WANG J R,WANG X X,et al.A Survey on Disentangled Representation Learning[J].Acta Automatica Sinica,2022,48(2):351-374.
[10]HIGGINS I,MATTHEY L,PAL A,et al.beta-vae:Learningbasic visual concepts with a constrained variational framework[C]//International Conference on Learning Representations.2017.
[11]CHEN X,DUAN Y,HOUTHOOFT R,et al.Infogan:Inter-pretable representation learning by information maximizing generative adversarial nets[C]//Advances in Neural Information Processing Systems.2016.
[12]CHEN R T Q,LI X,GROSSE R,et al.Isolating sources of disentanglement in VAEs[C]//Proceedings of the 32nd International Conference on Neural Information Processing Systems.2018:2615-2625.
[13]REZAABAD A L,VISHWANATH S.Learning representations by maximizing mutual information in variational autoencoders[C]//2020 IEEE International Symposium on Information Theory(ISIT).IEEE,2020:2729-2734.
[14]LOCATELLO F,BAUER S,LUCIC M,et al.Challenging common assumptions in the unsupervised learning of disentangled representations[C]//International Conference on Machine Learning.PMLR,2019:4114-4124.
[15]SUTER R,MILADINOVIC D,SCHÖLKOPF B,et al.Robustly disentangled causal mechanisms:Validating deep representations for interventional robustness[C]//International Conference on Machine Learning.PMLR,2019:6056-6065.
[16]REDDY A G,BALASUBRAMANIAN V N.On causally disentangled representations[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2022:8089-8097.
[17]YANG M,LIU F,CHEN Z,et al.Causalvae:Disentangled representation learning via neural structural causal models[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition.2021:9593-9602.
[18]SHEN X,LIU F,DONG H,et al.Weakly supervised disentangled generative causal representation learning[J].Journal of Machine Learning Research,2022,23(241):1-55.
[19]KOMANDURI A,WU Y,CHEN F,et al.Learning CausallyDisentangled Representations via the Principle of Independent Causal Mechanisms[J].arXiv:2306.01213,2023.
[20]YOU D,LI Z,SHEN J,et al.Disentangled representation learning with causal effect transmission in variational autoencoder[J].Pattern Recognition,2026,170:112018.
[21]NASR-ESFAHANY A,ALIZADEH M,SHAH D.Counterfactual identifiability of bijective causal models[C]//International Conference on Machine Learning.PMLR,2023:25733-25754.
[22]LI M,ZHAI P,TONG S,et al.Revisiting sparse convolutional model for visual recognition[J].Advances in Neural Information Processing Systems,2022,35:10492-10504.
[23]PAPAMAKARIOS G,NALISNICK E,REZENDE D J,et al.Normalizing flows for probabilistic modeling and inference[J].Journal of Machine Learning Research,2021,22(57):1-64.
[24]KHEMAKHEM I,MONTI R,LEECH R,et al.Causal autoregressive flows[C]//International Conference on Artificial Intelligence and Statistics.PMLR,2021:3520-3528.
[25]LIU Z,LUO P,WANG X,et al.Deep learning face attributes in the wild[C]//Proceedings of the IEEE International Conference on Computer Vision.2015:3730-3738.
[26]GONDAL M W,WUTHRICH M,MILADINOVIC D,et al.On the transfer of inductive bias from simulation to the real world:a new disentanglement dataset[C]//Advances in Neural Information Processing Systems.2019.
[27]KINNEY J B,ATWAL G S.Equitability,mutual information,and the maximal information coefficient[C]//Proceedings of the National Academy of Sciences.2014:3354-3359.
[1] 刘钰婷, 顾晶晶, 周强.
基于结构因果模型的城市出行流量预测方法
Urban Flow Prediction Method Based on Structural Causal Model
计算机科学, 2025, 52(10): 70-78. https://doi.org/10.11896/jsjkx.241000088
[2] 吕佳豪, 刘进锋.
基于因果关系的领域泛化长尾学习
Domain Generalization and Long-tailed Learning Based on Causal Relationships
计算机科学, 2024, 51(11A): 240300041-8. https://doi.org/10.11896/jsjkx.240300041
[3] 黄露, 倪葎, 金澈清.
基于双视角纠偏的推荐模型
Rectifying Dual Bias for Recommendation
计算机科学, 2023, 50(9): 152-159. https://doi.org/10.11896/jsjkx.220900035
[4] 张涛, 程毅飞, 孙欣煦.
基于因果推断的图注意力网络
Graph Attention Networks Based on Causal Inference
计算机科学, 2023, 50(6A): 220600230-9. https://doi.org/10.11896/jsjkx.220600230
[5] 梁美彦, 张宇, 梁建安, 陈庆辉, 王茹, 王琳.
基于稀疏编码非局部注意力对偶网络的病理图像超分辨率重建
Pathological Image Super-resolution Reconstruction Based on Sparse Coding Non-local AttentionDual Network
计算机科学, 2023, 50(6A): 220700016-8. https://doi.org/10.11896/jsjkx.220700016
[6] 尹海涛, 王天由.
基于深度多尺度卷积稀疏编码的图像去噪算法
Image Denoising Algorithm Based on Deep Multi-scale Convolution Sparse Coding
计算机科学, 2023, 50(4): 133-140. https://doi.org/10.11896/jsjkx.220100090
[7] 袁振, 刘进锋.
一种基于因果推理的垃圾分类方法
Novel Method for Trash Classification Based on Causal Inference
计算机科学, 2023, 50(11A): 220800218-6. https://doi.org/10.11896/jsjkx.220800218
[8] 刘楠, 张凤荔, 尹嘉奇, 陈学勤, 王瑞锦.
基于边推断增强对比学习的社交媒体谣言检测模型
Rumor Detection Model on Social Media Based on Contrastive Learning with Edge-inferenceAugmentation
计算机科学, 2023, 50(11): 49-54. https://doi.org/10.11896/jsjkx.221000043
[9] 徐佳庆, 万文, 吕启.
基于二阶矩稀疏编码的高光谱遥感图像分类
Classification of Hyperspectral Remote Sensing Imagery Based on Second Order Moment Sparse Coding
计算机科学, 2018, 45(9): 288-293. https://doi.org/10.11896/j.issn.1002-137X.2018.09.048
[10] 刘颖,张明慧,阳维,卢振泰,冯前进,苏榆生.
基于稀疏编码与方向-尺度描述子的海马体自动分割
Hippocampus Segmentation Based on Spare Coding and Orientation-Scale Descriptor
计算机科学, 2017, 44(1): 314-320. https://doi.org/10.11896/j.issn.1002-137X.2017.01.058
[11] 徐煜明,宋佳伟,肖贤建.
基于亚像素块匹配和字典学习的超分辨率算法
Super Resolution Algorithm Based on Sub-pixel Block Matching and Dictionary Learning
计算机科学, 2016, 43(8): 304-308. https://doi.org/10.11896/j.issn.1002-137X.2016.08.062
[12] 张旭,蒋建国,洪日昌,杜跃.
基于低秩稀疏分解与协作表示的图像分类算法
Image Classification Algorithm Based on Low Rank and Sparse Decomposition and Collaborative Representation
计算机科学, 2016, 43(7): 83-88. https://doi.org/10.11896/j.issn.1002-137X.2016.07.014
[13] 施静兰,常侃,张智勇,覃团发.
人脸识别中基于系数相似性的字典学习算法
Coefficient-similarity-based Dictionary Learning Algorithm for Face Recognition
计算机科学, 2016, 43(6): 298-302. https://doi.org/10.11896/j.issn.1002-137X.2016.06.059
[14] 王瑞霞,彭国华,郑红婵.
拉普拉斯稀疏编码的图像检索算法
Image Retrieval Algorithm Based on Laplacian Sparse Coding
计算机科学, 2014, 41(8): 278-280. https://doi.org/10.11896/j.issn.1002-137X.2014.08.058
[15] 刘洋,李一波.
基于线性动态系统稀疏编码的异常事件检测
Abnormal Event Detection Using Linear Dynamical System Combined with Sparse Coding
计算机科学, 2014, 41(10): 300-305. https://doi.org/10.11896/j.issn.1002-137X.2014.10.063
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!