计算机科学 ›› 2022, Vol. 49 ›› Issue (11): 212-220.doi: 10.11896/jsjkx.210900054

• 人工智能 • 上一篇    下一篇

一种自适应权重的多分类通用集成方法

魏军胜1, 刘琰1, 陈静2, 段顺然1   

  1. 1 河南省网络空间态势感知重点实验室 郑州 450001
    2 信息工程大学大数据分析教研室 郑州 450001
  • 收稿日期:2021-09-06 修回日期:2022-03-21 出版日期:2022-11-15 发布日期:2022-11-03
  • 通讯作者: 刘琰(ms.liuyan@foxmail.com)
  • 作者简介:(1509451874@qq.com)
  • 基金资助:
    国家自然科学基金(U1804263,62002386)

Universal Multi-class Ensemble Method with Self Adaptive Weights

WEI Jun-sheng1, LIU Yan1, CHEN Jing2, DUAN Shun-ran1   

  1. 1 Key Laboratory of Cyberspace Situation Awareness of Henan,Zhengzhou 450001,China
    2 Department of Big Data Analysis,Information Engineering University,Zhengzhou 450001,China
  • Received:2021-09-06 Revised:2022-03-21 Online:2022-11-15 Published:2022-11-03
  • About author:WEI Jun-sheng,born in 1992,postgra-duate.His main research interests include big data analysis and so on.
    LIU Yan,born in 1979,Ph.D,associate professor,Ph.D supervisor.Her main research interests include network topology discovery and network data analysis.
  • Supported by:
    National Natural Science Foundation of China(U1804263,62002386).

摘要: 集成学习一直是构建强大和稳定的预测模型的策略之一,它能通过融合多个模型来提升结果的准确性和稳定性。但是,现有的集成方法在权重计算上还存在一定的缺陷,面对多种分类问题时无法自适应地选择集成权重,不具有通用性。针对以上问题,提出了一种自适应权重的多分类通用集成方法(UMEAW)。与通常的集成分类方法只针对一种分类任务不同,UMEAW面对不同的分类问题,首先根据分类个数计算权重调配系数,然后利用指数函数分布特性,根据模型评价指标与权重调配系数自动计算一次模型融合的权重,最后通过不断迭代的方法自适应地调整融合权重,实现不同分类任务下的模型融合。实验结果表明,UMEAW在9个不同分类个数、不同领域、不同规模的数据集上都能实现模型融合,其融合效果在大部分任务上都优于基线方法。与单个模型相比,用UMEAW融合后的结果F1值稳定增加了3%~25%;与其他集成方法相比,F1值稳定提升了1%~2%,证明了UMEAW的通用性和有效性。

关键词: 集成学习, 权重, 分类, 融合, 通用方法

Abstract: Ensemble learning has always been one of the strategies to build a powerful and stable predictive model.It can improve the accuracy and stability of the results by fusing multiple models.However,existing ensemble methods still have certain shortcomings in the calculation of weights.When facing a variety of classification problems,they cannot adaptively select ensemble weights,and they are not universal.In view of the above problems,a universal multi-class ensemble method with self-adaptive weights(UMEAW) is proposed.Different to usual ensemble classification method that only targets one kind of classification task,when facing different classification problems,firstly,UMEAW calculates the weight allocation coefficient according to the number of classification,and then the weights of base classifiers is automatically calculated according to the model evaluation index and the weight allocation coefficient by using the distribution characteristics of exponential function.Finally,the weights is adjusted adaptively through continuous iteration to realize model ensemble under different classification tasks.Experimental results show that UMEAW can achieve model ensemble on 9 datasets with different classification numbers,different fields and different scales,and the effect of UMEAW is better than the baselines in most tasks.Compared with a single model,F1 value increases by 3%~25% after UMEAW fusion.Compared with other ensemble methods,the F1 value improves by 1%~2%.It is proved that UMEAW is universal and effective.

Key words: Ensemble learning, Weight, Classification, Fusion, Universal method

中图分类号: 

  • TP391
[1]OPITZ D,MACLIN R.Popular Ensemble Methods:An Empirical Study[J].Journal of Artificial Intelligence Research,1999,11:169-198.
[2]HANSEN L K,SALAMON P.Neural network ensembles[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2002,12(10):993-1001.
[3]FREUND Y.Boosting a weak learning algorithm by majority[J].Information and Computation,1995,121(2):256-285.
[4]SHAPIRE R.The strength of weak learnability[J].Machine Learning,1990,5(2):197-227.
[5]JI Z,HUI Z,ROSSET S,et al.Multi-class adaboost[J].Statistics and its Interface,2009,2(3):349-360.
[6]WOLPERT D H.Stacked generalization[J].Neural Networks,2017,5(2):241-259.
[7]LITTLESTONE N,WARMUTH M K.The weighted majority algorithm[J].Information and Computation,1994,108(2):212-261.
[8]YU Z,WANG D,ZHAO Z,et al.Hybrid incremental ensemble learning for noisy real-world data classification[J].IEEE Transa-ctions on Cybernetics,2017,49(2):403-416.
[9]DONG M G,ZHANG W,JING C.Dynamic weight ensemble classification algorithm for unbalanced data streams[J].Miniature Microcomputer Systems,2020,41(8):1649-1655.
[10]GAO W,ZHOU Z H.Approximation stability and boosting[C]//International Conference on Algorithmic Learning Theory.Berlin:Springer,2010:59-73.
[11]YIN X C,HUANG K,HAO H W,et al.A novel classifier ensemble method with sparsity and diversity[J].Neurocomputing,2014,134:214-221.
[12]ZHANG L,SUGANTHAN P N.Random Forests with ensemble of feature spaces[J].Pattern Recognition,2014,47(10):3429-3437.
[13]KOLTER J Z,MALOOF M A.Dynamic weighted majority:An ensemble method for drifting concepts[J].Journal of Machine Learning Research,2007,8(Dec):2755-2790.
[14]MAO S,JIAO L,XIONG L,et al.Weighted classifier ensemble based on quadratic form[J].Pattern Recognition,2015,48(5):1688-1706.
[15]MUHLBAIER M D,TOPALIS A,POLIKAR R.Learn++.NC:Combining Ensemble of Classifiers With Dynamically Weighted Consult-and-Vote for Efficient Incremental Learning of New Classes[J].IEEE Transactions on Neural Networks,2009,20(1):152-168.
[16]HUANG Y,MONEKOSSO D,WANG H.Clustering Ensembles Based on Multi-classifier Fusion[C]//IEEE International Conference on Intelligent Computing and Intelligent Systems(ICIS 2010).Shanghai:IEEE,2010:393-397.
[17]NGUYEN T T,LIEW A W C,TRAN M T,et al.Combining multi classifiers based on a genetic algorithm-a gaussian mixture model framework[C]//International Conference on Intelligent Computing.Cham:Springer,2014:56-67.
[18]SONG J Z,GUO C Y,LIU H S.Selective SVM Ensemble Base on Clustering Analysis Apply for Analog Circuit Fault Diagnosis with Small Samples[J].Applied Mechanics & Materials,2013,380-384:841-845.
[19]YANG J,WANG F.Auto-Ensemble:An Adaptive LearningRate Scheduling Based Deep Learning Model Ensembling[J].IEEE Access,2020,8(12):217499-217509.
[20]LU Z,WU X,ZHU X,et al.Ensemble Pruning via Individual Contribution Ordering[C]//ACM Sigkdd International Confe-rence on Knowledge Discovery & Data Mining.Washington:ACM,2010:871.
[21]MARTÍNEZ-MUÑOZ G,HERNÁNDEZ-LOBATO D,SUÁREZA.An Analysis of Ensemble Pruning Techniques Based on Ordered Aggregation[J].IEEE Transactions on Pattern Analysis &Machine Intelligence,2009,31(2):245-259.
[22]HASAN M A,ABDULLAH N,RAHMAN M,et al.Dental Impression Tray Selection From Maxillary Arch Images Using Multi-Feature Fusion and Ensemble Classifier[J].IEEE Access,2021,9(2):30573-30586.
[23]BANIA R K,HALDER A.R-HEFS:Rough set based heteroge-neous ensemble feature selection method for medical data classification[J].Artificial Intelligence in Medicine,2021,114:102049-102080.
[24]LAI C M,HUANG H P.A gene selection algorithm using simplified swarm optimization with multi-filter ensemble technique[J].Applied Soft Computing,2021,100:106994-107042.
[25]ZHOUG,GUO F L.Research on high-dimensional data integration learning method based on feature selection[J].Computer Science,2021,48(S1):250-254.
[26]DEVLIN J,CHANG M,LEE K,et al.Bert:Pre-training of deep bidirectional transformers for language understanding[J].ar-Xiv:1810.04805,2018.
[27]CUI Y,CHE W,LIU T,et al.Pre-training with whole word masking for chinese bert[J].arXiv:1906.08101,2019.
[28]YANG Z,DAI Z,YANG Y,et al.XLNet:Generalized Autoregressive Pretraining for Language Understanding[J].Advances in Neural Information Processing Systems,2019,32:5753-5763.
[1] 陈志强, 韩萌, 李慕航, 武红鑫, 张喜龙.
数据流概念漂移处理方法研究综述
Survey of Concept Drift Handling Methods in Data Streams
计算机科学, 2022, 49(9): 14-32. https://doi.org/10.11896/jsjkx.210700112
[2] 曹晓雯, 梁美玉, 鲁康康.
基于细粒度语义推理的跨媒体双路对抗哈希学习模型
Fine-grained Semantic Reasoning Based Cross-media Dual-way Adversarial Hashing Learning Model
计算机科学, 2022, 49(9): 123-131. https://doi.org/10.11896/jsjkx.220600011
[3] 周旭, 钱胜胜, 李章明, 方全, 徐常胜.
基于对偶变分多模态注意力网络的不完备社会事件分类方法
Dual Variational Multi-modal Attention Network for Incomplete Social Event Classification
计算机科学, 2022, 49(9): 132-138. https://doi.org/10.11896/jsjkx.220600022
[4] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[5] 吴子仪, 李邵梅, 姜梦函, 张建朋.
基于自注意力模型的本体对齐方法
Ontology Alignment Method Based on Self-attention
计算机科学, 2022, 49(9): 215-220. https://doi.org/10.11896/jsjkx.210700190
[6] 武红鑫, 韩萌, 陈志强, 张喜龙, 李慕航.
监督和半监督学习下的多标签分类综述
Survey of Multi-label Classification Based on Supervised and Semi-supervised Learning
计算机科学, 2022, 49(8): 12-25. https://doi.org/10.11896/jsjkx.210700111
[7] 汪鸣, 彭舰, 黄飞虎.
基于多时间尺度时空图网络的交通流量预测模型
Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction
计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188
[8] 秦琪琦, 张月琴, 王润泽, 张泽华.
基于知识图谱的层次粒化推荐方法
Hierarchical Granulation Recommendation Method Based on Knowledge Graph
计算机科学, 2022, 49(8): 64-69. https://doi.org/10.11896/jsjkx.210600111
[9] 魏恺轩, 付莹.
基于重参数化多尺度融合网络的高效极暗光原始图像降噪
Re-parameterized Multi-scale Fusion Network for Efficient Extreme Low-light Raw Denoising
计算机科学, 2022, 49(8): 120-126. https://doi.org/10.11896/jsjkx.220200179
[10] 王灿, 刘永坚, 解庆, 马艳春.
基于软标签和样本权重优化的Anchor Free目标检测算法
Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization
计算机科学, 2022, 49(8): 157-164. https://doi.org/10.11896/jsjkx.210600240
[11] 沈祥培, 丁彦蕊.
多检测器融合的深度相关滤波视频多目标跟踪算法
Multi-detector Fusion-based Depth Correlation Filtering Video Multi-target Tracking Algorithm
计算机科学, 2022, 49(8): 184-190. https://doi.org/10.11896/jsjkx.210600004
[12] 檀莹莹, 王俊丽, 张超波.
基于图卷积神经网络的文本分类方法研究综述
Review of Text Classification Methods Based on Graph Convolutional Network
计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064
[13] 闫佳丹, 贾彩燕.
基于双图神经网络信息融合的文本分类方法
Text Classification Method Based on Information Fusion of Dual-graph Neural Network
计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042
[14] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[15] 张源, 康乐, 宫朝辉, 张志鸿.
基于Bi-LSTM的期货市场关联交易行为检测方法
Related Transaction Behavior Detection in Futures Market Based on Bi-LSTM
计算机科学, 2022, 49(7): 31-39. https://doi.org/10.11896/jsjkx.210400304
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!