Computer Science ›› 2022, Vol. 49 ›› Issue (6A): 242-246. doi: 10.11896/jsjkx.210200108

• Big Data & Data Science

Adaptive Ensemble Ordering Algorithm

WANG Wen-qiang1, JIA Xing-xing1,2, LI Peng1   

  1 School of Mathematics and Statistics, Lanzhou University, Lanzhou 730000, China
  2 Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin, Guangxi 451000, China
  • Online: 2022-06-10 Published: 2022-06-08
  • About author: WANG Wen-qiang, born in 1996, postgraduate. His main research interests include statistical theory and its applications.
    JIA Xing-xing, born in 1982, associate professor, master's supervisor. Her main research interests include secret sharing, visual cryptography and data science.
  • Supported by:
    National Natural Science Foundation of China (61902164, 61972225), Fundamental Research Funds for the Chinese Central Universities (lzujbky-2021-53), Natural Science Foundation of Gansu Province of China (20JR5RA286) and Guangxi Key Laboratory of Trusted Software (KX201907).

Abstract: Ordinal variables express people's attitudes and preferences. For example, in a recommendation system, consumers' ratings of goods are ordinal variables, and so are the sentiment labels used in NLP sentiment analysis. At present, the ordered Logit model is commonly adopted to handle ordinal variables. However, the ordered Logit regression model requires that the ordinal variable roughly follow a uniform distribution; when it does not, the prediction results of ordered Logit regression are not ideal. To address this, this paper proposes an adaptive ensemble ordering algorithm. First, drawing on the idea of boosting, a boosting-like algorithm is proposed. Following the concept of the ordered Logit regression model, an ordered multi-layer perceptron model and an ordered random forest model are constructed. These two models, together with the Softmax multi-class classification model and the ordered Logit model, constitute the boosting-like algorithm. During data processing, when the predictions of the four models for a sample are not identical, the sample enters the boosting-like model and continues to be trained until the number of training rounds exceeds a given threshold. Then, a random forest model is used to construct the mapping function from all the predicted values on the training set to the true values. The proposed algorithm achieves high prediction accuracy when the ordinal variable is arbitrarily distributed, which greatly broadens the applicability of the ordered Logit regression model. When applied to the Baijiu quality and red wine quality datasets, its prediction accuracy is superior to that of the ordered Logit model, the Softmax algorithm, the multi-layer perceptron and KNN.
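
As a rough illustration of two ideas named in the abstract, the following Python sketch shows (a) how class probabilities are obtained in the standard proportional-odds formulation of the ordered Logit model, and (b) a check that separates samples on which a set of base models agree from those on which they disagree. The abstract does not give the paper's exact formulation, so the function names (ordered_logit_probs, split_by_agreement), the threshold parameterization and the toy models are assumptions made for illustration only; this is not the authors' implementation, and the retraining loop and the random forest mapping step are omitted.

```python
import math
from typing import Callable, List, Sequence, Tuple


def sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def ordered_logit_probs(
    x: Sequence[float], beta: Sequence[float], thresholds: Sequence[float]
) -> List[float]:
    """Class probabilities under a textbook proportional-odds model.

    Assumes P(y <= k | x) = sigmoid(theta_k - x . beta) with strictly
    increasing thresholds theta_1 < ... < theta_{K-1}; returns K values.
    """
    eta = sum(xi * bi for xi, bi in zip(x, beta))
    cumulative = [sigmoid(t - eta) for t in thresholds] + [1.0]
    probs, previous = [], 0.0
    for c in cumulative:
        probs.append(c - previous)  # P(y = k) = P(y <= k) - P(y <= k-1)
        previous = c
    return probs


def split_by_agreement(
    samples: Sequence, models: Sequence[Callable]
) -> Tuple[list, list]:
    """Partition samples by whether all base models predict the same grade.

    Hypothetical helper: 'disputed' samples are the ones that, per the
    abstract, would be passed on for further training rounds.
    """
    agreed, disputed = [], []
    for s in samples:
        predictions = {m(s) for m in models}
        (agreed if len(predictions) == 1 else disputed).append(s)
    return agreed, disputed


if __name__ == "__main__":
    # Three ordered grades, one feature, illustrative parameters.
    print(ordered_logit_probs([0.0], [1.0], [-1.0, 1.0]))
    # Two toy "models" that disagree only on the middle sample.
    toy_models = [lambda s: int(s > 0), lambda s: int(s > 3)]
    print(split_by_agreement([-1, 2, 5], toy_models))
```

In a fuller version, the disputed samples would be fed back for additional training rounds up to a round limit, and the base models' predictions on the training set would then serve as inputs to a random forest that maps them to the true grades.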

Key words: Ensemble algorithm, Multi-layer perceptron, Ordered Logit regression model, Ordinal variables, Random forest algorithm

CLC Number: TP391