基于动态可行域划分的SVM主动学习

计算机科学 ›› 2012, Vol. 39 ›› Issue (7): 175-177.

基于动态可行域划分的SVM主动学习

张晓宇

(中国科学技术信息研究所战略研究中心北京100038)

出版日期:2018-11-16 发布日期:2018-11-16

SVM Active Learning via Dynamic Version Space Division

Online:2018-11-16 Published:2018-11-16

摘要/Abstract

摘要： 针对传统SVM主动学习中批量采样方法的不足，提出了动态可行域划分算法。从特征空间与参数空间的对偶关系入手，深入分析SVM主动学习的本质，将特征空间中对样本的标注视为参数空间中对可行域的划分;通过综合利用当前分类模型和先前标注样本两方面信息，动态地优化可行域划分方案，以确保选取的样本对模型改进的价值，最终实现更为高效的选择性采样。实验结果表明，基于动态可行域划分的SVM主动学习算法能够显著提高所选样本的信息量，从而能够在有限的标注代价下大幅提高其分类性能。

关键词: 半监督学习，主动学习，选择性采样，支持向量机，可行域

Abstract: This paper presented a dynamic version space division algorithm for SVM active learning, in view of the drawbacks of traditional batch sampling methods. Based on the duality property of feature space and parameter space, we discussed SVM active learning in dual space and concluded that example labeling in feature space corresponds to ver- sion space division in parameter space. Faking both the existing classification model and the previously labeled examples into consideration, we optimized the version space division process and maximized the value of selected examples for model refinement. In this way, a more effective selective sampling was achieved. Experimental results demonstrate the effectiveness of the dynamic version space division algorithm, which remarkably improves the classification performance under the cost of limited labeling effort.

Key words: Semi supervised learning, Active learning, Selective sampling, Support vector machine, Version space

张晓宇. 基于动态可行域划分的SVM主动学习[J]. 计算机科学, 2012, 39(7): 175-177. https://doi.org/

参考文献

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed