计算机科学 ›› 2012, Vol. 39 ›› Issue (12): 211-213.

• 人工智能 • 上一篇    下一篇

一种基于分类器相似性集成的数据流分类研究

刘余霞 吕虹 刘三民   

  1. (安徽工程大学电气工程学院 芜湖 241000) (安徽建筑工业学院电子与信息工程学院 合肥 230022) (安徽工程大学计算机与信息学院 芜湖 241000)
  • 出版日期:2018-11-16 发布日期:2018-11-16

Classifier-similarity-based Ensemble Classification Research for Data Streams

  • Online:2018-11-16 Published:2018-11-16

摘要: 数据流分类已成为当前研究热点之一,如何解决其中的概念漂移和噪声是关键问题,为此提出了一种新的基 于分类器相似性的动态集成算法。由于数据流中相部数据具有相同概念的概率较大,因此用最新基分类器代表数据 流中即将出现的概念,同时基于此分类器求出基分类器之间的相似性作为权值进行加权多数投票,并根据相似性大小 淘汰较弱基分类器以适应概念漂移和噪声。在标准仿真数据集上进行了仿真实验,结果表明该算法相比其他集成方 法在抗噪性能和分类准确性方面均得到显著提高。

关键词: 概念漂移,相似性,集成学习,数据流分类,加权多数投票

Abstract: Classification of data streams has become one of hot research spots, and a new similarity-based dynamic en- semble algorithm was presented to deal with two critical problems, namely, concept drift and noise. Because adjacent data in data stream has the same concept with more probability, the new sub-classifier stands for the coming concept Based on it, ensemble classifier is got by similarity weighted majority voting, and sulrclassificr with worst performance is dele- ted for suiting for concept drift and noise. Experiment result on simulation data set shows the algorithm is best than other schema in classification accuracy and anti-noise.

Key words: Concept drift, Similarity, Ensemble learning, Data stream classification, Weighted majority voting

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!