Computer Science ›› 2010, Vol. 37 ›› Issue (3): 245-247.

• Artificial Intelligence •

A Hybrid Kernel Method for Outlier Mining

Tian Jiang, Gu Hong

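  1. (School of Electronic and Information Engineering, Dalian University of Technology, Dalian 116023)
  • Publication date: 2018-12-01 Release date: 2018-12-01
  • Funding:
    This work was supported by the National Natural Science Foundation of China (60605022).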

Hybrid Method for Outliers Detection Using GPLVM and SVM

TIAN Jiang, GU Hong

  • Online:2018-12-01 Published:2018-12-01

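Abstract: Outliers are data objects that do not share the general characteristics of the data. A support vector machine (SVM) maps data points into a high-dimensional feature space and separates outliers from normal points with a maximum-margin hyperplane. Exploiting the strengths of SVMs in handling small samples and high-dimensional data and in generalizing well, this paper proposes a new detection algorithm based on the Gaussian process latent variable model (GPLVM) and support vector classification. GPLVM provides a smooth probabilistic mapping from the latent variables to the data space, which is used to reduce the dimensionality of the data; outliers are then detected by an SVM with cross-validation. Simulation experiments on the KDD99 dataset show that the algorithm effectively improves the detection rate while keeping the false alarm rate low, which demonstrates the effectiveness of the method.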

Keywords: Outlier detection, Support vector machine, Dimensionality reduction, Gaussian process latent variable model
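
The abstract reports results in terms of detection rate and false alarm rate. As a minimal sketch of how these two quantities are commonly computed for a binary outlier detector, the following assumes the usual definitions (detection rate = detected outliers / all outliers; false alarm rate = normal records flagged as outliers / all normal records). The function name `detection_metrics`, the label convention (1 = outlier, 0 = normal), and the toy labels are illustrative assumptions, not taken from the paper.

```python
def detection_metrics(y_true, y_pred):
    """Return (detection_rate, false_alarm_rate) for binary labels.

    Assumed convention: 1 marks an outlier, 0 marks a normal record.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count the four cells of the confusion matrix.
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # outliers caught
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # outliers missed
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # normals passed

    # Guard against empty classes to avoid division by zero.
    detection_rate = tp / (tp + fn) if (tp + fn) else 0.0
    false_alarm_rate = fp / (fp + tn) if (fp + tn) else 0.0
    return detection_rate, false_alarm_rate


# Toy labels (hypothetical): 3 outliers and 5 normal records.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 1, 0]
dr, far = detection_metrics(y_true, y_pred)
print(f"detection rate = {dr:.3f}, false alarm rate = {far:.3f}")
```

On heavily imbalanced data such as intrusion records, reporting both rates together is more informative than plain accuracy, since a detector that flags nothing can still score high accuracy.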

Abstract: Outliers are objects that do not comply with the general behavior of the data. An SVM (support vector machine) finds the maximal-margin hyperplane in feature space for the purpose of distinguishing the outliers from normal samples. Based on the high performance of SVMs in tackling small sample sizes and high dimensionality, and on their good generalization, we propose a new method for outlier detection, which combines a novel unsupervised algorithm, GPLVM (Gaussian process latent variable model), with a standard SVM. GPLVM provides a smooth probabilistic mapping from latent to data space and embeds the dataset in a low-dimensional space, which is used for cross-validation of the SVM. The proposed approach was applied to KDD99 benchmark problems, and the simulation results show its validity.

Key words: Outlier detection, Support vector machine, Dimensionality reduction, GPLVM
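
For orientation, the two components named in the abstract can be written in their standard textbook forms. This is a sketch of the usual formulations, not necessarily the exact objective or kernel used in the paper. The symbols are assumptions introduced here: $Y \in \mathbb{R}^{N \times D}$ is the data matrix, $X \in \mathbb{R}^{N \times q}$ with $q < D$ holds the latent points, $K$ is the kernel matrix with entries $K_{ij} = k(x_i, x_j)$, $\phi$ is the SVM feature map, and $C$ is the soft-margin penalty.

```latex
% GPLVM: the D data dimensions are modelled as independent Gaussian
% processes over the latent points X; X and the kernel parameters are
% found by maximising the log marginal likelihood.
\ln p(Y \mid X) = -\frac{DN}{2}\ln 2\pi
                  - \frac{D}{2}\ln\lvert K\rvert
                  - \frac{1}{2}\operatorname{tr}\!\left(K^{-1} Y Y^{\top}\right)

% Soft-margin SVM on the embedded points x_i with labels y_i in {-1, +1}.
\min_{w,\,b,\,\xi}\ \frac{1}{2}\lVert w\rVert^{2} + C\sum_{i=1}^{N}\xi_i
\quad\text{s.t.}\quad
y_i\left(w^{\top}\phi(x_i) + b\right) \ge 1 - \xi_i,\qquad \xi_i \ge 0 .
```

In this reading, the rows of the optimized $X$ serve as the low-dimensional inputs $x_i$ to the classifier, and $C$ and any kernel parameters would be chosen by cross-validation.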
