Computer Science ›› 2022, Vol. 49 ›› Issue (11A): 210800105-7.doi: 10.11896/jsjkx.210800105

• Artificial Intelligence • Previous Articles     Next Articles

Study on Risk Control Model of Selective Ensemble Algorithm Based on Hierarchical Clustering and Simulated Annealing

WANG Mao-guang, JI Hao-yue, WANG Tian-ming   

  1. School of Information,Central University of Finance and Economics,Beijing 100081,China
  • Online:2022-11-10 Published:2022-11-21
  • About author:WANG Mao-guang,born in 1974,Ph.D,professor,is a member of China Computer Federation.His main research interests include intelligent risk control models and algorithms,big data and intelligent software engineering etc.
    JI Hao-yue,born in 1998,postgraduate.Her main research interests include Internet financial risk control and credit investigation.
  • Supported by:
    National Natural Science Foundation of China(62072487) and Research Projects of Central University of Finance and Technology(020676116004,020676114004).

Abstract: Eensemble learning model can effectively solve the problems of single model structure,stability and weak predictive ability.However,due to the complexity of its structure,problems such as low operating efficiency and excessive storage cost often occur.Selective ensemble algorithms are often used to optimize ensemble learning models to solve these problems.The currently proposed selective ensemble algorithm still has the phenomenon of insufficient operating effect and efficiency improvement.In order to make up for these shortcomings,a selective ensemble algorithm based on the stacking ensemble framework is proposed.It mainly uses the agglomerated hierarchical clustering(AHC) algorithm and the metropolis criterion of simulated annealing to select the type and number of base learners.In terms of empirical analysis,domestic and foreign online loan data are used separately to build the model.Experimental results prove that the selective ensemble model of AHC-Metropolis can effectively improve the computational efficiency,predictive ability,stability and generalization ability.It is helpful for regulating the order of the Internet financial industry,assist in financial supervision tasks,and provide an effective basis for establishing our country’s financial risk control management system and guaranteeing national financial security.

Key words: Hierarchical clustering, Simulated annealing, Selective ensemble, Financial risk control

CLC Number: 

  • TP181
[1]HAND D J,HENLEY W E.Statistical lassification methods in consumer credit scoring [J].Journal of the Royal Statistical Society,1997,160(3):523-541.
[2]SHENG J.Credit card cash out detection scoring model based on Logistic [J].Computer Applications,2009,29(11):3088-3091,3095.
[3]FANG K N,ZHANG G J,ZHANG H Y.Personal credit risk early warning method based on Lasso-logistic model[J].Quantitative Economics and Technical Economics,2014,31(2):125-136.
[4]ZHANG Y J,JIA H Y,DIAO Y F,et al.Research on CreditScoring by Fusing Social Media Information in Online Peer-to-Peer Lending[J].Procedia Computer Science,2016,91:168-174.
[5]PANG S L,HOU X Y,XIA L H.Borrowers’ credit quality scoring model and applications,with default discriminant analysis based on the extreme learning machine[J].Technological Forecasting and Social Change,2021:120462.
[6]LI X S,GUO Y H.Personal credit evaluation model based on Naive Bayes classifier[J].Computer Engineering and Applications,2006(30):197-201.
[7]WEST D.Neural network credit scoring models[J].Computers &Operations Research,2000,27:1131-1152.
[8]LI Y,JIANG T Y,LIU Y R.Research on Internet PersonalCredit Evaluation Based on Unbalanced Samples[J].Statistics and Information Forum,2017,32(2):84-90.
[9]PIERRE G,ERNST D,WEHENKEL L.Extremely randomized trees[J].Machine Learning,2006,63(1):3-42.
[10]ZHOU Q Y.Application Research of Improved AdaBoost Algorithm in Credit Imbalance Classification[D].Huangzhou:Zhejiang Gongshang University,2020.
[11]FINLAY S.Multiple cassifer achitectures and their apication to credit risk asessment[J].European Jourmal of Operational Research,2011,210(2):368-378.
[12]SUN J,LI H,CHANG P C,et al.Dynamic credit scoring using B & B with incremental-SVM-ensemble[J].Kybernetes,2015,44(4):518-535.
[13]DŽELIHODŽIĆ A,DONKO D,KEVRIĆ J.Improved CreditScoring Model Based on Bagging Neural Network[J].International Journal of Information Technology & Decision Making,2018,17(6):17.
[14]NASCIMENTO D S C,COELHO A L V,CANUTO A M P.Integrating complementary techniques for promoting diversity in classifier ensembles:A systematic study[J].Neurocomputing,2014,138:347-357.
[15]LESSMANN S,BAESENS B,SCOW H V,et al.Benchmarking stat-of-the-art lassification algorithms for credit scoring:an update of research [J].European Joumal of Operational Rescarch,2015,247(1):124-136.
[16]LIU C Z,MA D L,XIA Y F.Application of Dynamic Heterogeneous Integrated Credit Scoring Model in P2P Network Lending[J].Financial Development Research,2018(9):24-31.
[17]QI H,WANG W J,GUO H S.A SVM Bagging ensemblemethod based on feature selection[J].Small Microcomputer System,2014,35(11):2533-2537.
[18]LI Y J,GUO H X,LI Y N,et al.Classification of an ensemble learning algorithm based on Boosting in imbalanced data[J].System Engineering Theory and Practice,2016,36(1):189-199.
[19]YU L,YANG Z B,TANG L.A novel multistage deep belief network based extreme learning machine ensemble learning paradigm for credit risk assessment[J].Flexible Services and Manufacturing Journal,2016,28(4):576-592.
[20]WANG M,CAO Q,SUN J Z,et al.A method of user basic attribute prediction based on ensemble learning [J].Small Micro Computer System,2020,41(12):2509-2515.
[21]CAO Z H,YU D X,SHI J F,et al.Two-layer classifier model applied to personal credit evaluation [J].Control Engineering,2019,26(12):2231-2234.
[22]ZHOU Z H,WU J X,TANG W.Ensembling neural networks:Many could be better than al1[J].Artificial Intelligence,2002,137(1/2):239-263.
[23]ZHANG C X,ZHANG J S.Overview of selective ensemblelearning algorithms [J].Chinese Journal of Computers,2011,34(8):1399-1410.
[24]XIA Y F.A novel heterogeneous ensemble credit scoring model based on bstacking approach[J].Expert Systems with Applications,2018,93:182-199.
[25]WU M H,GUO J S,JU Y,et al.Parallel selective ensemble algorithm based on hierarchical filtering and dynamic update [J].Computer Science,2017,44(1):48-52.
[26]DU H L,ZHANG Y.Network anomaly detection based on selective ensemble algorithm[J].The Journal of Supercomputing,2020(prepublish):1-22.
[27]YU J Y.Research on corporate credit risk assessment based on heterogeneous learner integration strategy [D].Beijing:Central University of Finance and Economics,2019.
[28]YANG H.Design and research of risk control model of micro-online loan platform based on migration learning [D].Beijing:Central University of Finance and Economics,2021.
[29]COHEN J.A Coefficient of Agreement for Nominal Scales[J].Educational and Psychological Measurement,1960,20(1):37-46.
[1] WU Xiao-wen, ZHENG Qiao-xian, XU Xin-qiang. Improved Ant Colony Algorithm for Solving Multi-objective Unilateral Assembly Line Balancing Problem [J]. Computer Science, 2022, 49(11A): 210900165-5.
[2] GAO Shi-shun, ZHAO Hai-tao, ZHANG Xiao-ying, WEI Ji-bo. Self-adaptive Intelligent Wireless Propagation Model to Different Scenarios [J]. Computer Science, 2021, 48(7): 324-332.
[3] WANG Guo-wu, CHEN Yuan-yan. Improvement of DV-Hop Location Algorithm Based on Hop Correction and Genetic Simulated Annealing Algorithm [J]. Computer Science, 2021, 48(6A): 313-316.
[4] WANG Mao-guang, YANG Hang. Risk Control Model and Algorithm Based on AP-Entropy Selection Ensemble [J]. Computer Science, 2021, 48(11A): 71-76.
[5] WANG Zhe, TANG Qi, WANG Ling, WEI Ji-bo. Joint Optimization Algorithm for Partition-Scheduling of Dynamic Partial Reconfigurable Systems Based on Simulated Annealing [J]. Computer Science, 2020, 47(8): 26-31.
[6] CHEN Qing-chao, WANG Tao, FENG Wen-bo, YIN Shi-zhuang, LIU Li-jun. Unknown Binary Protocol Format Inference Method Based on Longest Continuous Interval [J]. Computer Science, 2020, 47(8): 313-318.
[7] JIN Xiao-min, HUA Wen-qiang. Energy Optimization Oriented Resource Management in Mobile Cloud Computing [J]. Computer Science, 2020, 47(6): 247-251.
[8] ZHANG Yun-fan,ZHOU Yu,HUANG Zhi-qiu. Semantic Similarity Based API Usage Pattern Recommendation [J]. Computer Science, 2020, 47(3): 34-40.
[9] ZHANG De-gan, YANG Peng, ZHANG Jie, GAO Jin-xin, ZHANG Ting. New Method of Traffic Flow Forecasting of Connected Vehicles Based on Quantum Particle Swarm Optimization Strategy [J]. Computer Science, 2020, 47(11A): 327-333.
[10] XU Fei-xiang,YE Xia,LI Lin-lin,CAO Jun-bo,WANG Xin. Comprehensive Calculation of Semantic Similarity of Ontology Concept Based on SA-BP Algorithm [J]. Computer Science, 2020, 47(1): 199-204.
[11] WANG Gai-yun, WANG Lei-yang, LU Hao-xiang. RSSI-based Centroid Localization Algorithm Optimized by Hybrid Swarm Intelligence Algorithm [J]. Computer Science, 2019, 46(9): 125-129.
[12] ZHANG Huan-long, GAO Zeng, ZHANG Xiu-jiao, SHI Kun-feng. Image Matching Method Combining Hybrid Simulated Annealing and Antlion Optimizer [J]. Computer Science, 2019, 46(6): 328-333.
[13] XIA Ying, LI Liu-jie, ZHANG XU, BAE Hae-young. Weighted Oversampling Method Based on Hierarchical Clustering for Unbalanced Data [J]. Computer Science, 2019, 46(4): 22-27.
[14] WU Yi-fan, CUI Yan-peng, HU Jian-wei. Alert Processing Method Based on Hierarchical Clustering [J]. Computer Science, 2019, 46(4): 203-209.
[15] LIU Jing-fa, LI Fan, JIANG Sheng-yi. Focused Annealing Crawler Algorithm for Rainstorm Disasters Based on Comprehensive Priority and Host Information [J]. Computer Science, 2019, 46(2): 215-222.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!