计算机科学 ›› 2026, Vol. 53 ›› Issue (4): 173-179.doi: 10.11896/jsjkx.250300086

• 数据库&大数据&数据科学 • 上一篇    下一篇

基于点态流形与一致正则的半监督学习算法

徐亚敏, 李晓斌, 张润   

  1. 西南交通大学数学学院 成都 611756
  • 收稿日期:2025-03-17 修回日期:2025-06-16 出版日期:2026-04-15 发布日期:2026-04-08
  • 通讯作者: 李晓斌(lixiaobin@home.swjtu.edu.cn)
  • 作者简介:(xuyaminswjtu@foxmail.com)
  • 基金资助:
    理科培育专项一般项目(2682021ZTPY043,2682025ZTPY001);国家自然科学基金(11501470,11426187)

Semi-supervised Learning Algorithm Based on Pointwise Manifold Structures and Uniform Regularity Constraints

XU Yamin, LI Xiaobin, ZHANG Run   

  1. School of Mathematics, Southwest Jiaotong University, Chengdu 611756, China
  • Received:2025-03-17 Revised:2025-06-16 Published:2026-04-15 Online:2026-04-08
  • About author:XU Yamin,postgraduate,is a member of CCF(No.Z1659G).Her main research interests include semi-supervised learning and so on.
    LI Xiaobin,born in 1983,Ph.D,master’ssupervisor,is a member of CCF(No.Z1644M).His main research interests include geometric topology and its applications,and so on.
  • Supported by:
    Fundamental Research Funds for the Central Universities(2682021ZTPY043,2682025ZTPY001) and National Natural Science Foundation of China(11501470,11426187).

摘要: 流形正则化(Manifold Regularization,MR) 提供了一个有效的框架,利用有标签数据集和无标签数据集进行半监督分类。在基于流形假设的情况下,约束相似实例在样本构图上应具有相似的分类结果。值得注意的是,MR的核心在于样本构图上的成对平滑,即所有实例对中都应用了平滑约束,把每一对实例都看作一个整体。然而,平滑性本质上可以是点对点的,这意味着平滑性应当“无处不在”,以关联每个点或实例与其邻近点的行为。因此,提出了一种新型的基于点态流形正则化以及一致性正则化的半监督学习算法 URC-PW-MR。该方法不仅保留了平滑性的点对点特性,还通过考虑单个实例而非实例对,引入了单个实例的重要性。这种重要性可以通过局部密度等因素来描述。URC-PW-MR 提供了一种新的实现流形平滑性的方法,通过约束单个局部实例并引入融合一致性正则来实现半监督学习。实证结果表明,URC-PW-MR 在性能上与传统的MR 相比更为精细。

关键词: 流形正则化, 点态流形正则化, 融合一致正则, 半监督学习

Abstract: MR(Manifold Regularization) provides a powerful framework for semi-supervised classification using labeled and unlabeled datasets.Under the assumption of manifold,it enforces that similar instances should have similar classification results on the sample graph.It is notable that the core of MR lies in the pairwise smoothing on the sample graph,where smoothing constraints are applied to all instance pairs,treating each pair of instances as a whole.However,the smoothness can be point-to-point in essence,meaning that smoothness should be “everywhere” to correlate the behavior of each point or instance with that of its neighboring points.Therefore,this paper proposes a novel semi-supervised learning algorithm based on pointwise manifold and uniform regularity constraints(URC-PW-MR),which achieves semi-supervised learning by constraining individual local instances and introducing a fusion consistency regularization.This approach not only contains the pointwise nature of smoothness but also introduces the importance of individual instance by considering each instance rather than pairs of instances.The significance can be quantitatively characterized through factors such as local density.URC-PW-MR proposes a novel manifold smoothness realization approach that achieves semi-supervised learning through dual constraints:individual local instance regularization and fusion consistency regularization.Empirical evaluations demonstrate that URC-PW-MR exhibits more refined performance characteristics compared with conventional MR frameworks.

Key words: Manifold regularization, Pointwise manifold regularization, Uniform regularity constraints, Semi-supervised learning

中图分类号: 

  • TP311
[1]HE Q,LI N,LUO W J,et al.A Survey of Machine Learning Algorithms for Big Data[J].Pattern Recognition and Artificial Intelligence,2014,27(4):327-336.
[2]LIU J W,LIU Y,LUO X L.Semi-supervised Learning Methods[J],2015,38(8):1592-1617.
[3]YANG J C.Design and implementation of a semi-supervisedcontinuous learning framework[D].Nanjing:Nanjing University of Posts and Telecommunications,2020.
[4]宋雨,许王琴,李荣鹏,等.基于自适应流形正则化自表示的无监督特征选择算法[J].重庆工商大学学报(自然科学版),2023,40(6):44-52.
[5]JIA M J,LI X,KONG L,et al.Research on the Application of Deep Learning in Big Data Analysis[J].Software Engineering and Application,2022,11(3):549-557.
[6]YAO K,CAO F,LEUNG Y,et al.Deep neural network compression through interpretability-based filter pruning[J].Pattern Recognition,2021,119:108056.
[7]LIANG J Y,GAO J W,CHANG Y.Research Progress in Semi-supervised Learning[J].Shanxi University(Natural Science Edition),2009,32(4):528-534.
[8]LEI Y K.Research on manifold learning algorithm and its application[D].Hefei:University of Science and Technology of China,2011.
[9]WANG Y,HAN J,SHEN Y,et al.Pointwise manifold regularization for semi-supervised learning[J].Frontiers of Computer Science,2021,15(1):1-8.
[10]WANG J,ZHANG S Y,LIANG J Y.Semi-supervised DeepLearning Algorithm Integrating Consensus Regulars and Manifold Regulars[J].Big Data Research,2022,8(3):103-114.
[11]LI Y F,KWOK J T,ZHOU Z H.Semi-supervised learning using label mean[C]//Proceedings of the 26th International Confe-rence on Machine Learning.ACM,2009.
[12]ZHU X J,GHAHRAMANI Z.Learning from labeled and unlabeled data with label propagation[C]//NIPS 2002.2002:1-8.
[13]BELKIN M,NIYOGI P,SINDHWANI V.Manifold Regularization:A Geometric Framework for Learning from Labeled and Unlabeled Examples[J].Journal of Machine Learning Research,2006,7(1):2399-2434.
[14]WANG J,LIANG J,CUI J,et al.Semi-supervised Learning with Mixed-order Graph Convolutional Networks[J].Information Sciences,2021,573(8):171-181.
[15]LIANG J Y,CUI J B,WANG J,et al.Graph-based semi-supervised learning via improving the quality of the graph dynamically[J].Machine Learning,2021,110(6):1345-1388.
[16]LU C H.Euclid and Geometry(Part II)[M]//Science World.2019:126-127.
[17]MEI X M,HE L G.Differential manifold and Riemannian geometry[M].Beijing:Beijing Normal University Press,1987.
[18]LIU W H,QIN B Z.Mapping properties of Meso compact space and Hausdorff space[J].Journal of Texas College,2005,4(2):21-22.
[19]CHEN W H.Differential manifold preliminary.2nd edition[M].Beijing:Higher Education Press,2001.
[20]MIYATO T,MAEDASI,KOYAMA M,et al.Virtual adversarial training:aregularization method for supervised and semi-supervised learning[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2019,41(8):1979-1993.
[21]VERMA V,KAWAGUCHI K,LAMB A,et al.Interpolationconsistency training for semi-supervised learning[J].Neural Networks,2022,145:90-106.
[22]BERTHELOT D,CARLINI N,GOODFELLOW I,et al.Mix-match:A holistic approach to semi-supervised learning[J].Advances in Neural Information Processing Systems,2019,32:1-11.
[23]LAINE S,AILA T.Temporal Ensembling for Semi-Supervised Learning[C]//International Conference on Learning Representations.2017:1-13.
[24]TARVAINEN A,VALPOLA H.Mean teachers are better role models:Weight-averaged consistency targets improve semi-supervised deep learning results[J].arXiv:1703.01780,2017.
[25]OLIVER A,ODENA A,RAFFEL C A,et al.Realistic evaluation of deep semi-supervised learning algorithms[C]//Procee-dings of the 32nd International Conference on Neural Information Processing Systems.2018:3239-3250.
[1] 葛泽庆, 黄圣君.
针对多标记表格数据的半监督学习方法
Semi-supervised Learning Method for Multi-label Tabular Data
计算机科学, 2026, 53(3): 151-157. https://doi.org/10.11896/jsjkx.250600149
[2] 王一鸣, 焦敏, 赵素云, 陈红, 李翠平.
基于指示词表征学习的半监督聚类方法
Prompt-conditioned Representation Learning with Diffusion Models for Semi-supervised Clustering
计算机科学, 2026, 53(3): 158-165. https://doi.org/10.11896/jsjkx.250600063
[3] 陈麒瑞, 王宝会, 戴辰程.
基于迁移学习的心电图分类识别算法的研究
Research on Electrocardiogram Classification and Recognition Algorithm Based on Transfer Learning
计算机科学, 2025, 52(6A): 240900073-8. https://doi.org/10.11896/jsjkx.240900073
[4] 杜元花, 陈盼, 周楠, 施开波, 陈二阳, 张远鹏.
基于相关熵的多视角低秩矩阵分解和多视角数据聚类中的约束图学习
Correntropy Based Multi-view Low-rank Matrix Factorization and Constraint Graph Learning for Multi-view Data Clustering
计算机科学, 2025, 52(6A): 240900131-10. https://doi.org/10.11896/jsjkx.240900131
[5] 包晟宏, 姚有健, 李小丫, 陈文.
集成式PU学习方法PUEVD及其在软件源码漏洞检测中的应用
Integrated PU Learning Method PUEVD and Its Application in Software Source CodeVulnerability Detection
计算机科学, 2025, 52(6A): 241100144-9. https://doi.org/10.11896/jsjkx.241100144
[6] 王枭, 李冠雄, 李娜, 袁东风.
基于高斯混合判别的半监督学习流场预测方法
Semi-supervised Learning Flow Field Prediction Method Based on Gaussian Mixture Discrimination
计算机科学, 2025, 52(6): 88-95. https://doi.org/10.11896/jsjkx.241100026
[7] 武优, 王静, 李培培, 胡学钢.
半监督偏多标签特征选择
Semi-supervised Partial Multi-label Feature Selection
计算机科学, 2025, 52(4): 161-168. https://doi.org/10.11896/jsjkx.240600008
[8] 沈雅馨, 高利剑, 毛启容.
基于元学习的半监督声音事件检测方法
Semi-supervised Sound Event Detection Based on Meta Learning
计算机科学, 2025, 52(3): 222-230. https://doi.org/10.11896/jsjkx.240100191
[9] 康伟, 黎利辉, 文益民.
基于聚簇模型重用的概念漂移数据流半监督分类算法
Semi-supervised Classification of Data Stream with Concept Drift Based on Clustering Model Reuse
计算机科学, 2024, 51(4): 124-131. https://doi.org/10.11896/jsjkx.230300023
[10] 戴伟, 柴晶, 刘雅娇.
基于最大间隔和流形假设的半监督学习算法
Semi-supervised Learning Algorithm Based on Maximum Margin and Manifold Hypothesis
计算机科学, 2024, 51(2): 259-267. https://doi.org/10.11896/jsjkx.221100136
[11] 李辉, 李文根, 关佶红.
双编码半监督异常检测模型
Dually Encoded Semi-supervised Anomaly Detection
计算机科学, 2023, 50(7): 53-59. https://doi.org/10.11896/jsjkx.220900027
[12] 顾宇航, 郝洁, 陈兵.
基于数据融合的半监督高分遥感影像语义分割
Semi-supervised Semantic Segmentation for High-resolution Remote Sensing Images Based on DataFusion
计算机科学, 2023, 50(6A): 220500001-6. https://doi.org/10.11896/jsjkx.220500001
[13] 王清宇, 王海瑞, 朱贵富, 孟顺建.
基于FlexUDA模型的SQL注入检测研究
Study on SQL Injection Detection Based on FlexUDA Model
计算机科学, 2023, 50(6A): 220600172-6. https://doi.org/10.11896/jsjkx.220600172
[14] 崔冰晶, 张懿璞, 王飚.
基于超图正则化的多模态信息融合算法
Multimodal Data Fusion Algorithm Based on Hypergraph Regularization
计算机科学, 2023, 50(6): 167-174. https://doi.org/10.11896/jsjkx.220900144
[15] 秦亮, 谢良, 陈盛双, 徐海蛟.
基于锚图分类的在线半监督跨模态哈希
Online Semi-supervised Cross-modal Hashing Based on Anchor Graph Classification
计算机科学, 2023, 50(6): 183-193. https://doi.org/10.11896/jsjkx.220400038
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!