计算机科学 ›› 2026, Vol. 53 ›› Issue (4): 173-179.doi: 10.11896/jsjkx.250300086
徐亚敏, 李晓斌, 张润
XU Yamin, LI Xiaobin, ZHANG Run
摘要: 流形正则化(Manifold Regularization,MR) 提供了一个有效的框架,利用有标签数据集和无标签数据集进行半监督分类。在基于流形假设的情况下,约束相似实例在样本构图上应具有相似的分类结果。值得注意的是,MR的核心在于样本构图上的成对平滑,即所有实例对中都应用了平滑约束,把每一对实例都看作一个整体。然而,平滑性本质上可以是点对点的,这意味着平滑性应当“无处不在”,以关联每个点或实例与其邻近点的行为。因此,提出了一种新型的基于点态流形正则化以及一致性正则化的半监督学习算法 URC-PW-MR。该方法不仅保留了平滑性的点对点特性,还通过考虑单个实例而非实例对,引入了单个实例的重要性。这种重要性可以通过局部密度等因素来描述。URC-PW-MR 提供了一种新的实现流形平滑性的方法,通过约束单个局部实例并引入融合一致性正则来实现半监督学习。实证结果表明,URC-PW-MR 在性能上与传统的MR 相比更为精细。
中图分类号:
| [1]HE Q,LI N,LUO W J,et al.A Survey of Machine Learning Algorithms for Big Data[J].Pattern Recognition and Artificial Intelligence,2014,27(4):327-336. [2]LIU J W,LIU Y,LUO X L.Semi-supervised Learning Methods[J],2015,38(8):1592-1617. [3]YANG J C.Design and implementation of a semi-supervisedcontinuous learning framework[D].Nanjing:Nanjing University of Posts and Telecommunications,2020. [4]宋雨,许王琴,李荣鹏,等.基于自适应流形正则化自表示的无监督特征选择算法[J].重庆工商大学学报(自然科学版),2023,40(6):44-52. [5]JIA M J,LI X,KONG L,et al.Research on the Application of Deep Learning in Big Data Analysis[J].Software Engineering and Application,2022,11(3):549-557. [6]YAO K,CAO F,LEUNG Y,et al.Deep neural network compression through interpretability-based filter pruning[J].Pattern Recognition,2021,119:108056. [7]LIANG J Y,GAO J W,CHANG Y.Research Progress in Semi-supervised Learning[J].Shanxi University(Natural Science Edition),2009,32(4):528-534. [8]LEI Y K.Research on manifold learning algorithm and its application[D].Hefei:University of Science and Technology of China,2011. [9]WANG Y,HAN J,SHEN Y,et al.Pointwise manifold regularization for semi-supervised learning[J].Frontiers of Computer Science,2021,15(1):1-8. [10]WANG J,ZHANG S Y,LIANG J Y.Semi-supervised DeepLearning Algorithm Integrating Consensus Regulars and Manifold Regulars[J].Big Data Research,2022,8(3):103-114. [11]LI Y F,KWOK J T,ZHOU Z H.Semi-supervised learning using label mean[C]//Proceedings of the 26th International Confe-rence on Machine Learning.ACM,2009. [12]ZHU X J,GHAHRAMANI Z.Learning from labeled and unlabeled data with label propagation[C]//NIPS 2002.2002:1-8. [13]BELKIN M,NIYOGI P,SINDHWANI V.Manifold Regularization:A Geometric Framework for Learning from Labeled and Unlabeled Examples[J].Journal of Machine Learning Research,2006,7(1):2399-2434. [14]WANG J,LIANG J,CUI J,et al.Semi-supervised Learning with Mixed-order Graph Convolutional Networks[J].Information Sciences,2021,573(8):171-181. [15]LIANG J Y,CUI J B,WANG J,et al.Graph-based semi-supervised learning via improving the quality of the graph dynamically[J].Machine Learning,2021,110(6):1345-1388. [16]LU C H.Euclid and Geometry(Part II)[M]//Science World.2019:126-127. [17]MEI X M,HE L G.Differential manifold and Riemannian geometry[M].Beijing:Beijing Normal University Press,1987. [18]LIU W H,QIN B Z.Mapping properties of Meso compact space and Hausdorff space[J].Journal of Texas College,2005,4(2):21-22. [19]CHEN W H.Differential manifold preliminary.2nd edition[M].Beijing:Higher Education Press,2001. [20]MIYATO T,MAEDASI,KOYAMA M,et al.Virtual adversarial training:aregularization method for supervised and semi-supervised learning[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2019,41(8):1979-1993. [21]VERMA V,KAWAGUCHI K,LAMB A,et al.Interpolationconsistency training for semi-supervised learning[J].Neural Networks,2022,145:90-106. [22]BERTHELOT D,CARLINI N,GOODFELLOW I,et al.Mix-match:A holistic approach to semi-supervised learning[J].Advances in Neural Information Processing Systems,2019,32:1-11. [23]LAINE S,AILA T.Temporal Ensembling for Semi-Supervised Learning[C]//International Conference on Learning Representations.2017:1-13. [24]TARVAINEN A,VALPOLA H.Mean teachers are better role models:Weight-averaged consistency targets improve semi-supervised deep learning results[J].arXiv:1703.01780,2017. [25]OLIVER A,ODENA A,RAFFEL C A,et al.Realistic evaluation of deep semi-supervised learning algorithms[C]//Procee-dings of the 32nd International Conference on Neural Information Processing Systems.2018:3239-3250. |
| [1] | 葛泽庆, 黄圣君. 针对多标记表格数据的半监督学习方法 Semi-supervised Learning Method for Multi-label Tabular Data 计算机科学, 2026, 53(3): 151-157. https://doi.org/10.11896/jsjkx.250600149 |
| [2] | 王一鸣, 焦敏, 赵素云, 陈红, 李翠平. 基于指示词表征学习的半监督聚类方法 Prompt-conditioned Representation Learning with Diffusion Models for Semi-supervised Clustering 计算机科学, 2026, 53(3): 158-165. https://doi.org/10.11896/jsjkx.250600063 |
| [3] | 陈麒瑞, 王宝会, 戴辰程. 基于迁移学习的心电图分类识别算法的研究 Research on Electrocardiogram Classification and Recognition Algorithm Based on Transfer Learning 计算机科学, 2025, 52(6A): 240900073-8. https://doi.org/10.11896/jsjkx.240900073 |
| [4] | 杜元花, 陈盼, 周楠, 施开波, 陈二阳, 张远鹏. 基于相关熵的多视角低秩矩阵分解和多视角数据聚类中的约束图学习 Correntropy Based Multi-view Low-rank Matrix Factorization and Constraint Graph Learning for Multi-view Data Clustering 计算机科学, 2025, 52(6A): 240900131-10. https://doi.org/10.11896/jsjkx.240900131 |
| [5] | 包晟宏, 姚有健, 李小丫, 陈文. 集成式PU学习方法PUEVD及其在软件源码漏洞检测中的应用 Integrated PU Learning Method PUEVD and Its Application in Software Source CodeVulnerability Detection 计算机科学, 2025, 52(6A): 241100144-9. https://doi.org/10.11896/jsjkx.241100144 |
| [6] | 王枭, 李冠雄, 李娜, 袁东风. 基于高斯混合判别的半监督学习流场预测方法 Semi-supervised Learning Flow Field Prediction Method Based on Gaussian Mixture Discrimination 计算机科学, 2025, 52(6): 88-95. https://doi.org/10.11896/jsjkx.241100026 |
| [7] | 武优, 王静, 李培培, 胡学钢. 半监督偏多标签特征选择 Semi-supervised Partial Multi-label Feature Selection 计算机科学, 2025, 52(4): 161-168. https://doi.org/10.11896/jsjkx.240600008 |
| [8] | 沈雅馨, 高利剑, 毛启容. 基于元学习的半监督声音事件检测方法 Semi-supervised Sound Event Detection Based on Meta Learning 计算机科学, 2025, 52(3): 222-230. https://doi.org/10.11896/jsjkx.240100191 |
| [9] | 康伟, 黎利辉, 文益民. 基于聚簇模型重用的概念漂移数据流半监督分类算法 Semi-supervised Classification of Data Stream with Concept Drift Based on Clustering Model Reuse 计算机科学, 2024, 51(4): 124-131. https://doi.org/10.11896/jsjkx.230300023 |
| [10] | 戴伟, 柴晶, 刘雅娇. 基于最大间隔和流形假设的半监督学习算法 Semi-supervised Learning Algorithm Based on Maximum Margin and Manifold Hypothesis 计算机科学, 2024, 51(2): 259-267. https://doi.org/10.11896/jsjkx.221100136 |
| [11] | 李辉, 李文根, 关佶红. 双编码半监督异常检测模型 Dually Encoded Semi-supervised Anomaly Detection 计算机科学, 2023, 50(7): 53-59. https://doi.org/10.11896/jsjkx.220900027 |
| [12] | 顾宇航, 郝洁, 陈兵. 基于数据融合的半监督高分遥感影像语义分割 Semi-supervised Semantic Segmentation for High-resolution Remote Sensing Images Based on DataFusion 计算机科学, 2023, 50(6A): 220500001-6. https://doi.org/10.11896/jsjkx.220500001 |
| [13] | 王清宇, 王海瑞, 朱贵富, 孟顺建. 基于FlexUDA模型的SQL注入检测研究 Study on SQL Injection Detection Based on FlexUDA Model 计算机科学, 2023, 50(6A): 220600172-6. https://doi.org/10.11896/jsjkx.220600172 |
| [14] | 崔冰晶, 张懿璞, 王飚. 基于超图正则化的多模态信息融合算法 Multimodal Data Fusion Algorithm Based on Hypergraph Regularization 计算机科学, 2023, 50(6): 167-174. https://doi.org/10.11896/jsjkx.220900144 |
| [15] | 秦亮, 谢良, 陈盛双, 徐海蛟. 基于锚图分类的在线半监督跨模态哈希 Online Semi-supervised Cross-modal Hashing Based on Anchor Graph Classification 计算机科学, 2023, 50(6): 183-193. https://doi.org/10.11896/jsjkx.220400038 |
|
||