基于类权重的模糊不平衡数据分类方法

计算机科学 ›› 2008, Vol. 35 ›› Issue (11): 170-173.

基于类权重的模糊不平衡数据分类方法

出版日期:2018-11-16 发布日期:2018-11-16
基金资助:
国家自然科学基金（60574075）,国家自然科学基金项目（60703118）.

Online:2018-11-16 Published:2018-11-16

摘要/Abstract

摘要： 针对现有分类算法通常对不平衡数据挖掘表现出有偏性，即正类样本（通常是更重要的一类）的分类和预测性能差于负类样本的分类和预测性能，提出一种不平衡数据分类方法。该方法通过一个超球面将两类数据以最大分离比率分离，并且引入类权重因子和样本模糊隶属度，同时考虑了不同类的重要性和不同样本对该类的不同贡献，从而提高了不平衡数据中正类的分类和预测的性能以及整体的推广能力。分别在人造数据和UCI真实数据上进行了实验，结果验证了该方法的有效性。

关键词: 不平衡数据类加权模糊隶属度分类算法

Abstract: Using data sets that contain very few instances of the positive class usually produces biased classifiers and has a lower predictive accuracy over the positive class （usually the more important class） than over the negative class. Proposed a classificatio

Key words: Imbalanced data set, Weighted-class, Fuzzy membership, Classification algorithm

. 基于类权重的模糊不平衡数据分类方法[J]. 计算机科学, 2008, 35(11): 170-173. https://doi.org/

参考文献

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed