计算机科学 ›› 2023, Vol. 50 ›› Issue (2): 166-172.doi: 10.11896/jsjkx.211200292

• 数据库&大数据&数据科学 • 上一篇    下一篇

超约简求解:效率与性能的提升

王笑笑, 巴婧, 陈建军, 宋晶晶, 杨习贝   

  1. 江苏科技大学计算机学院 江苏 镇江 212100
  • 收稿日期:2021-12-27 修回日期:2022-06-28 出版日期:2023-02-15 发布日期:2023-02-22
  • 通讯作者: 宋晶晶(songjingjing108@163.com)
  • 作者简介:(w_xiaoxiao14@163.com)
  • 基金资助:
    国家自然科学基金(62076111,61906078,62006099,62006128)

Searching Super-reduct:Improvement on Efficiency and Effectiveness

WANG Xiaoxiao, BA Jing, CHEN Jianjun, SONG Jingjing, YANG Xibei   

  1. School of Computer,Jiangsu University of Science and Technology,Zhenjiang,Jiangsu 212100,China
  • Received:2021-12-27 Revised:2022-06-28 Online:2023-02-15 Published:2023-02-22
  • Supported by:
    National Natural Science Foundation of China(62076111,61906078,62006099,62006128)

摘要: 利用多重约简的结果搭建一个集成分类框架,已被证实可以显著提升后续学习的性能。超约简方法正是借鉴了这一理念,在约简求解的基础上,通过随机添加额外属性以达到获取多重超约简的目的。显然,基本的约简求解将直接影响超约简方法的效果。鉴于此,从兼顾效率和性能的角度出发,在超约简方法中同时引入属性簇和集成选择机制:属性簇用于加速基本约简的求解过程,集成选择则用于在求解过程中找到更为稳健的属性。在20组UCI数据上的实验结果表明,相比4种前沿的集成策略,所提方法不仅能够显著减少约简求解的时间消耗,而且能够提供更好的分类稳定性和准确率。

关键词: 属性簇, 集成选择, 约简求解, 超约简

Abstract: Following the derivation of multiple reducts,an ensemble based classification framework can be constructed,which has been demonstrated to be useful in improving the performance of subsequent learning tasks.The approach called super-reduct is exactly suggested with such thinking.Generally,multiple super-reducts are obtained by randomly adding more extra attributes into the fundamental reduct.Therefore,how to search fundamental reduct is the key to performing super-reduct.In view of this,considering both efficiency and effectiveness,not only attribute group but also ensemble selector is introduced into the mechanism of super-reduct:the device of attribute group is used to speed up the process of searching fundamental reduct,the device of ensemble selector is used to find more robust attributes in the procedure of searching reduct.Comprehensive experiments on 20 UCI data sets show that compared with 4 popular strategies,our approach can not only significantly reduce the computational cost but also provide superior stabilities and accuracies for classification tasks.

Key words: Attribute group, Ensemble selector, Searching reduct, Super-reduct

中图分类号: 

  • TP181
[1]PAWLAK Z.Rough Sets:Theoretical Aspects of Reasoningabout Data[M].Dordrecht:Kluwer Academic Publishers,1992.
[2]SONG J J,TSANG E C C,CHEN D G,et al.Minimal Decision Cost Reduct in Fuzzy Decision-theoretic Rough Set Model[J].Knowledge-Based Systems,2017,126:104-112.
[3]BA J,CHEN Y,YANG X B.Attribute Partition Strategy for Quick Searching Reducts Based on Granular Ball Rough Sets[J].Journal of Nanjing University of Science and Technology,2021,45(4):394-400.
[4]LI J Z,YANG X B,DOU H L,et al.Research on Ensemble Significance Based Attribute Reduction Approach[J].CAAI Transac-tions on Intelligent Systems,2018,13(3):414-421.
[5]YANG X B,YAO Y Y.Ensemble Selector for Attribute Reduction[J].Applied Soft Computing,2018,70:1-11.
[6]LIU K Y,YANG X B,YU H L,et al.Rough Set Based Semi-supervised Feature Selection via Ensemble Selector[J].Know-ledge-Based Systems,2019,165:282-296.
[7]JIANG Z H,WANG Y B,XU G,et al.Multi-scale Based Acce-lerator for Attribute Reduction[J].Computer Science,2019,46(12):250-256.
[8]JIANG F,YU X,ZHAO H B,et al.Ensemble Learning Based on Random Super-reduct and Resampling[J].Artificial Intelligence Review,2021,54(2):1-26.
[9]BANIA R K,HALDER A.R-Ensembler:A Greedy Rough Set Based Ensemble Attribute Selection Algorithm with KNN Imputation for Classification of Medical Data[J].Computer Me-thods and Programs in Biomedicine,2019,184(4):105122.
[10]NIU K,ZHANG Z M,LIU Y,et al.Resampling Ensemble Model Based on Data Distribution for Imbalanced Credit Risk Evaluation in P2P Lending[J].Information Sciences,2020,536:120-134.
[11]QIAN Y H,WANG Q,CHENG H H,et al.Fuzzy-rough Feature Selection Accelerator[J].Fuzzy Sets and Systems,2015,258:61-78.
[12]CHEN Y,LIU K Y,SONG J J,et al.Attribute Group for Attribute Reduction[J].Information Sciences,2020,535:64-80.
[13]MENG J,ZHANG J,JIANG D L,et al.Selective EnsembleClassification Integrated with Affinity Propagation Clustering[J].Journal of Computer Research and Development,2018,55(5):986-993.
[14]JIA X Y,SHANG L,ZHOU B,et al.Generalized Attribute Reduct in Rough Set Theory[J].Knowledge-Based Systems,2016,91:204-218.
[15]LI W W,JIA X Y,WANG L,et al.Multi-objective Attribute Reduction in Three-way Decision-theoretic Rough Set Model[J].International Journal of Approximate Reasoning,2019,105:327-341.
[16]SINGH U,SINGH S N.A New Optimal Feature SelectionScheme for Classification of Power Quality Disturbances Based on Ant Colony Framework[J].Applied Soft Computing,2019,74:216-225.
[17]ROMAGNOLI P P.Local Conditional Entropy in Measure for Covers with Respect to a Fixed Partition[J].Nonlinearity,2018,31(5):2201-2220.
[18]JIANG Z H,LIU K Y,YANG X B,et al.Accelerator for Supervised Neighborhood Based Attribute Reduction[J].International Journal of Approximate Reasoning,2020,119:122-150.
[19]YANG X B,LIANG S C,YU H L.Pseudo-label Neighborhood Rough Set:Measures and Attribute Reductions[J].International Journal of Approximate Reasoning,2019,105:112-129.
[20]BA J,LIU K Y,JU H R,et al.Triple-G:A New MGRS and Attribute Reduction[J].International Journal of Machine Learning and Cybernetics,2022,13(2):337-356.
[21]YANG X B,YAN X,XU S P,et al.New Heuristic Attribute Reduction Algorithm Based on Sample Selection[J].Computer Science,2016,43(1):40-43.
[22]WANG N,PENG Z H,CUI L.EasiFFRA:A Fast Feature Reduction Algorithm Based on Neighborhood Rough Set[J].Journal of Computer Research and Development,2019,56(12):2578-2588.
[23]JIANG Z H,DOU H L,SONG J J,et al.Data-guided Multi-granularity Selector for Attribute Reduction[J].Applied Intelligence,2021,51:876-888.
[24]YUAN Z,CHEN H M,LI T R,et al.Unsupervised Attribute Reduction for Mixed Data Based on Fuzzy Rough Sets[J].Information Sciences,2021,572:67-87.
[1] 蒋川宇, 韩翔宇, 杨文蕊, 吕博涵, 黄小欧, 谢夏, 谷阳.
医学知识图谱研究与应用综述
Survey of Medical Knowledge Graph Research and Application
计算机科学, 2023, 50(3): 83-93. https://doi.org/10.11896/jsjkx.220700241
[2] 华晓凤, 冯娜, 于俊清, 何云峰.
基于规则推理的足球视频任意球射门事件检测
Shooting Event Detection of Free Kick in Soccer Video Based on Rule Reasoning
计算机科学, 2023, 50(3): 181-190. https://doi.org/10.11896/jsjkx.220300062
[3] 刘哲, 殷成凤, 李天瑞.
基于BERT和多特征融合嵌入的中文拼写检查
Chinese Spelling Check Based on BERT and Multi-feature Fusion Embedding
计算机科学, 2023, 50(3): 282-290. https://doi.org/10.11896/jsjkx.220100104
[4] 彭钺峰, 赵波, 刘会, 安杨.
针对机器学习的成员推断攻击综述
Survey on Membership Inference Attacks Against Machine Learning
计算机科学, 2023, 50(3): 351-359. https://doi.org/10.11896/jsjkx.220100016
[5] 崔竞松, 张童桐, 郭迟, 郭文飞.
基于时延特征的网络设备异常检测
Network Equipment Anomaly Detection Based on Time Delay Feature
计算机科学, 2023, 50(3): 371-379. https://doi.org/10.11896/jsjkx.211200280
[6] 李海涛, 王瑞敏, 董卫宇, 蒋烈辉.
一种基于GRU的半监督网络流量异常检测方法
Semi-supervised Network Traffic Anomaly Detection Method Based on GRU
计算机科学, 2023, 50(3): 380-390. https://doi.org/10.11896/jsjkx.220100032
[7] 湛进, 王雪飞, 成雨蓉, 袁野.
贝叶斯推理与并行回火研究综述
Overview of Research on Bayesian Inference and Parallel Tempering
计算机科学, 2023, 50(2): 89-105. https://doi.org/10.11896/jsjkx.220100001
[8] 王雷, 杜亮, 周芃.
基于稀疏连接的层次化多核K-Means算法
Hierarchical Multiple Kernel K-Means Algorithm Based on Sparse Connectivity
计算机科学, 2023, 50(2): 138-145. https://doi.org/10.11896/jsjkx.220400230
[9] 王少将, 刘佳, 郑锋, 潘祎诚.
机器学习层谱聚类综述
Survey on Hierarchical Clustering for Machine Learning
计算机科学, 2023, 50(1): 9-17. https://doi.org/10.11896/jsjkx.211000185
[10] 郝敬宇, 文静轩, 刘华锋, 景丽萍, 于剑.
结合全局信息的深度图解耦协同过滤
Deep Disentangled Collaborative Filtering with Graph Global Information
计算机科学, 2023, 50(1): 41-51. https://doi.org/10.11896/jsjkx.220900255
[11] 徐平安, 刘全.
基于相似度约束的双策略蒸馏深度强化学习方法
Deep Reinforcement Learning Based on Similarity Constrained Dual Policy Distillation
计算机科学, 2023, 50(1): 253-261. https://doi.org/10.11896/jsjkx.211100167
[12] 张启阳, 陈希亮, 张巧.
基于轨迹感知的稀疏奖励探索方法
Sparse Reward Exploration Method Based on Trajectory Perception
计算机科学, 2023, 50(1): 262-269. https://doi.org/10.11896/jsjkx.220700010
[13] 魏楠, 魏祥麟, 范建华, 薛羽, 胡永扬.
面向频谱接入深度强化学习模型的后门攻击方法
Backdoor Attack Against Deep Reinforcement Learning-based Spectrum Access Model
计算机科学, 2023, 50(1): 351-361. https://doi.org/10.11896/jsjkx.220800269
[14] 梁文雅, 刘波, 林伟伟, 严远超.
联邦学习激励机制研究综述
Survey of Incentive Mechanism for Federated Learning
计算机科学, 2022, 49(12): 46-52. https://doi.org/10.11896/jsjkx.220500272
[15] 郭艳卿, 李宇航, 王湾湾, 付海燕, 吴铭侃, 李祎.
基于联邦学习的Gamma回归算法
FL-GRM:Gamma Regression Algorithm Based on Federated Learning
计算机科学, 2022, 49(12): 66-73. https://doi.org/10.11896/jsjkx.220600034
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!