计算机科学 ›› 2010, Vol. 37 ›› Issue (10): 27-32.

• 综述 • 上一篇    下一篇

不平衡类数据挖掘研究综述

翟云,杨炳儒,曲武   

  1. (北京科技大学信息工程学院 北京100083) (聊城大学计算机学院 聊城252059)
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受国家自然科学基金(60675030,60875029)资助。

Survey of Mining Imbalanced Datasets

ZHAI Yun,YANG Bing-ru,QU Wu   

  • Online:2018-12-01 Published:2018-12-01

摘要: 综述了近年来国内外对不平衡类数据挖掘的主要研究进展。首先分析了不平衡类数据挖掘的本质。其次,详细探讨了处理不平衡类数据挖掘的各种技术,并根据其本质区别,从数据层次和算法层次分别对目前存在的各种技术方法进行了深入剖析和全面比较。最后,指出当前不平衡类数据挖掘研究的热点以及将来需要重点关注的主要问题。

关键词: 机器学习,不平衡类数据,重采样,代价敏感学习

Abstract: This paper reviewed the present situation of mining data in unbalanced classes at home and abroad in recent years. Firstly, it analysed in-depth the existing problems and their resulting nature. hhen it in detail dealt with various state-of-the-art data mining techniques under the unbalanced learning scenario. Moreover, from the data-level and algorithm-level respectively it analysed and compared them omprehensively in accordance with essential difference. At the same time, the paper summaricd measure metrics evaluating performance of mining imbalance data sets. Also, the paper pointed out recent hot issues of theoretic studies and applications. Finally, the perspectives on future work were also discussed.

Key words: Machine learning, Imbalanced classification, Resampling, Cost sensitive learning

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!