计算机科学 ›› 2012, Vol. 39 ›› Issue (7): 175-177.

• 人工智能 • 上一篇    下一篇

基于动态可行域划分的SVM主动学习

张晓宇   

  1. (中国科学技术信息研究所战略研究中心 北京100038)
  • 出版日期:2018-11-16 发布日期:2018-11-16

SVM Active Learning via Dynamic Version Space Division

  • Online:2018-11-16 Published:2018-11-16

摘要: 针对传统SVM主动学习中批量采样方法的不足,提出了动态可行域划分算法。从特征空间与参数空间的 对偶关系入手,深入分析SVM主动学习的本质,将特征空间中对样本的标注视为参数空间中对可行域的划分;通过 综合利用当前分类模型和先前标注样本两方面信息,动态地优化可行域划分方案,以确保选取的样本对模型改进的价 值,最终实现更为高效的选择性采样。实验结果表明,基于动态可行域划分的SVM主动学习算法能够显著提高所选 样本的信息量,从而能够在有限的标注代价下大幅提高其分类性能。

关键词: 半监督学习,主动学习,选择性采样,支持向量机,可行域

Abstract: This paper presented a dynamic version space division algorithm for SVM active learning, in view of the drawbacks of traditional batch sampling methods. Based on the duality property of feature space and parameter space, we discussed SVM active learning in dual space and concluded that example labeling in feature space corresponds to ver- sion space division in parameter space. Faking both the existing classification model and the previously labeled examples into consideration, we optimized the version space division process and maximized the value of selected examples for model refinement. In this way, a more effective selective sampling was achieved. Experimental results demonstrate the effectiveness of the dynamic version space division algorithm, which remarkably improves the classification performance under the cost of limited labeling effort.

Key words: Semi supervised learning, Active learning, Selective sampling, Support vector machine, Version space

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!