Computer Science ›› 2010, Vol. 37 ›› Issue (12): 138-142.

Previous Articles     Next Articles

Classification Algorithm for Data Stream Based on Mixture Models of C4. 5 and NB

LI Yan,ZHANG Yu-hong,HU Xue-gang   

  • Online:2018-12-01 Published:2018-12-01

Abstract: Classification on the noisy data stream with concept drifts has recently become one of the most popular topics in streaming data mining. Classification algorithm for mining Data Streams based on Mixture Models of C4. 5 and NB was proposed,called CDSMM,in which decision trees based on C;4. 5 are selected as the basic classifiers and the classifier of NaW a I3ayes is adopted to filter noise data. Meanwhile, it introduces the p-hypothesis testing method to detect concept drifts. Extensive studies demonstrate that CDSMM is superior to several existing algorithms in the predictive accuracy when handling noisy data streams with concept drifts.

Key words: Data streams,Concept drifts,Classification,Noise

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!