Computer Science ›› 2022, Vol. 49 ›› Issue (4): 134-139.doi: 10.11896/jsjkx.210300075

• Database & Big Data & Data Science • Previous Articles     Next Articles

Application of Gray Wolf Optimization Algorithm on Synchronous Processing of Sample Equalization and Feature Selection in Credit Evaluation

CHU An-qi, DING Zhi-jun   

  1. Key Laboratory of Embedded System and Service Computing of Ministry of Education (Tongji University), Shanghai 201804, China; Shanghai Electronic Transactions and Information Service Collaborative Innovation Center (Tongji University), Shanghai 201804, China
  • Received:2021-03-08 Revised:2021-07-14 Published:2022-04-01
  • Supported by:
    This work was supported by the Shanghai Science and Technology Innovation Action Plan(19511101300).

Abstract: With the rapid development of Internet finance industry, traditional credit risk evaluation is facing challenges in the face of massive data.Due to the unbalanced sample categories and high feature redundancy in credit evaluation, it has become the key factor affecting the classification accuracy of current evaluation.In order to solve the above problems, a method based on gray wolf optimization algorithm is proposed to process the samples under sampling and feature selection synchronously.In this me-thod, the performance of the classifier is taken as the heuristic information of the gray wolf optimization algorithm, and then the intelligent search is carried out to obtain the combination of the optimal sample and the feature set, and the tabu table strategy is introduced into the original gray wolf algorithm to avoid the algorithm falling into the local optimum.Experimental results show that the proposed method has a great improvement compared with other methods, and its performance on different data sets proves that it can effectively solve the problem of sample imbalance, reduce the dimension of feature space, and improve the accuracy of classification.Compared with the original data, the accuracy of credit risk evaluation is improved by about 3%, which proves the applicability and superiority of this method in the field of credit evaluation.

Key words: Credit evaluation, Feature selection, Gray wolf optimization algorithm, Sample imbalance

CLC Number: 

  • TP3-05
[1] SUN H,WANG B.Research on Credit Risk Assessment of Online Network Credit Based on GBDT[C]//2020 International Conference on Big Data in Management.2020.
[2] PENG M,ZHANG Q,XING X,et al.Trainable Undersampling for Class-Imbalance Learning[C]//AAAI Conference on Artificial Intelligence.2019.
[3] WILSON D L.Asymptotic Properties of Nearest NeighborRules Using Edited Data[J].IEEE Transactions on Systems Man & Cybernetics,1972,SMC-2(3):408-421.
[4] MANI I,ZHANG J.KNN Approach to Unbalanced Data Distributions:A Case Study Involving Information Extraction[C]//ICML Workshop on Learning from Imbalanced Datasets.2003.
[5] LIU X Y,WU J,ZHOU Z H.Exploratory Undersampling for Class-Imbalance Learning[J].IEEE Transactions on Cyberne-tics,2009,39(2):539-550.
[6] LEMAITRE G,NOGUEIRA F,ARIDAS C K.Imbalanced-learn:A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning[J].Journal of Machine Learning Research,2016,18(17):1-5.
[7] LIU Y,YANG K.Credit Fraud Detection for Extremely Imba-lanced Data Based on Ensembled Deep Learning[J].Journal of Computer Research and Development,2021,58(3):539-547.
[8] FRITZ S,HOSEMANN D.Restructuring the credit process:behaviour scoring for German corporates[J].Intelligent Systems in Accounting Finance & Management,2000,9(1):9-21.
[9] DING C,PENG H.Minimum redundancy feature selection from microarray gene expression data[J].Journal of Bioinformatics and Computational Biology,2005,3(2):185-206.
[10] HALL M.Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning[C]//Proceedings of the 17th International Conference on Machine Learning.San Francisco,Morgan Kaufmann,2000:359-366.
[11] TRAN B,XUE B,ZHANG M.A New Representation in PSO for Discretization-Based Feature Selection[J].IEEE Transactions on Cybernetics,2018,48(6):1733-1746.
[12] ZHANG X,LI Z S.Research on Feature Selection Algorithm Based on Natural Evolution Strategy[J].Journal of Software,2020,31(12):3733-3752.
[13] MIRJALILI S,MIRJALILI S M,LEWIS A.Grey Wolf Optimizer[J].Advances in Engineering Software,2014,69:46-61.
[14] ZHANG P Y,HUANG X Z,LI M Z,et al.Hybridization between Neural Computing and Nature-Inspired Algorithms for a Sentence Similarity Model Based on the Attention Mechanism[J].ACM Transactions on Asian and Low-Resource Language Information Processing,2021,20(1):1-21.
[15] MISHRA S,DWIVEDULA R,KSHIRSAGAR V,et al.Robust Detection of Network Intrusion using Tree-based Convolutional Neural Networks[C]//8th ACM IKDD CODS and 26th COMAD.2021.
[16] INDU S,SRIVASTAVA S,SHARMA V.Optimal CameraPlacement and Orientation of A Multi-camera System for Self Driving Cars[C]//Proceedings of the 2020 4th International Conference on Vision,Image and Signal Processing.2020:1-5.
[17] LIU J,CHEN Z,ZHANG Y,et al.Path Planning of Mobile Robots based on Improved Genetic Algorithm[C]//2020 2nd International Conference on Robotics,Intelligent Control and Artificial Intelligence.2020.
[18] WANG W J,SUN Y Y,SUN H L,et al.Research on Multi-source Heterogeneous Data Classification Based on Multi-objective Optimization Technology[J].Computer and Digital Engineering,2020,48(1):130-136.
[19] ZHOU M.Credit Evaluation for Hybrid Grey Wolf Optimization and Least Squares Support Vector Machine Approach[J].Journal of Chengdu University of Technology(Science & Technology Edition),2019,46(4):507-512.
[20] CHANG C C,LIN C J.LIBSVM:A library for support vector machines[J].ACM Transactions on Intelligent Systems and Technology,2011,2(3):1-27.
[1] LI Bin, WAN Yuan. Unsupervised Multi-view Feature Selection Based on Similarity Matrix Learning and Matrix Alignment [J]. Computer Science, 2022, 49(8): 86-96.
[2] HU Yan-yu, ZHAO Long, DONG Xiang-jun. Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification [J]. Computer Science, 2022, 49(7): 73-78.
[3] KANG Yan, WANG Hai-ning, TAO Liu, YANG Hai-xiao, YANG Xue-kun, WANG Fei, LI Hao. Hybrid Improved Flower Pollination Algorithm and Gray Wolf Algorithm for Feature Selection [J]. Computer Science, 2022, 49(6A): 125-132.
[4] SUN Lin, HUANG Miao-miao, XU Jiu-cheng. Weak Label Feature Selection Method Based on Neighborhood Rough Sets and Relief [J]. Computer Science, 2022, 49(4): 152-160.
[5] LI Zong-ran, CHEN XIU-Hong, LU Yun, SHAO Zheng-yi. Robust Joint Sparse Uncorrelated Regression [J]. Computer Science, 2022, 49(2): 191-197.
[6] YUAN Lei, LIU Zi-yan, ZHU Ming-cheng, MA Shan-shan, CHEN Lin-zhou-ting. Improved YOLOv3 Remote Sensing Target Detection Based on Improved Dense Connection and Distributional Ranking Loss [J]. Computer Science, 2021, 48(9): 168-173.
[7] ZHANG Ye, LI Zhi-hua, WANG Chang-jie. Kernel Density Estimation-based Lightweight IoT Anomaly Traffic Detection Method [J]. Computer Science, 2021, 48(9): 337-344.
[8] YANG Lei, JIANG Ai-lian, QIANG Yan. Structure Preserving Unsupervised Feature Selection Based on Autoencoder and Manifold Regularization [J]. Computer Science, 2021, 48(8): 53-59.
[9] HOU Chun-ping, ZHAO Chun-yue, WANG Zhi-peng. Video Abnormal Event Detection Algorithm Based on Self-feedback Optimal Subclass Mining [J]. Computer Science, 2021, 48(7): 199-205.
[10] HU Yan-mei, YANG Bo, DUO Bin. Logistic Regression with Regularization Based on Network Structure [J]. Computer Science, 2021, 48(7): 281-291.
[11] ZHOU Gang, GUO Fu-liang. Research on Ensemble Learning Method Based on Feature Selection for High-dimensional Data [J]. Computer Science, 2021, 48(6A): 250-254.
[12] LI Si-di, GUO Bing-hui, YANG Xiao-bo. Study on Financial Credit Information Based on Graph Neural Network [J]. Computer Science, 2021, 48(4): 85-90.
[13] DING Si-fan, WANG Feng, WEI Wei. Relief Feature Selection Algorithm Based on Label Correlation [J]. Computer Science, 2021, 48(4): 91-96.
[14] TENG Jun-yuan, GAO Meng, ZHENG Xiao-meng, JIANG Yun-song. Noise Tolerable Feature Selection Method for Software Defect Prediction [J]. Computer Science, 2021, 48(12): 131-139.
[15] ZHANG Ya-chuan, LI Hao, SONG Chen-ming, BU Rong-jing, WANG Hai-ning, KANG Yan. Hybrid Artificial Chemical Reaction Optimization with Wolf Colony Algorithm for Feature Selection [J]. Computer Science, 2021, 48(11A): 93-101.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!