计算机科学 ›› 2024, Vol. 51 ›› Issue (11A): 240300180-5.doi: 10.11896/jsjkx.240300180

• 交叉&应用 • 上一篇    下一篇

基于演化CatBoost算法的房价预测模型

王成章1, 白晓明2, 汤文英1, 陈书涵1   

  1. 1 中央财经大学统计与数学学院 北京 100081
    2 首都经济贸易大学管理工程学院 北京 100070
  • 出版日期:2024-11-16 发布日期:2024-11-13
  • 通讯作者: 王成章(czwang@cufe.edu.cn)
  • 基金资助:
    北京市社会科学基金(21GGB018);中央财经大学教育教学改革基金(2022ZXJG09)

Evolutionary CatBoost Based Housing Price Prediction Model

WANG Chengzhang1, BAI Xiaoming2, TANG Wenying1, CHEN Shuhan1   

  1. 1 School of Statistics and Mathematics,Central University of Finance and Economics,Beijing 100081,China
    2 School of Management and Engineering,Capital University of Economics and Business,Beijing 100070,China
  • Online:2024-11-16 Published:2024-11-13
  • About author:WANG Chengzhang,born in 1977,Ph.D,associate professor,master supervisor.His main research interests include machine learning,pattern recognition and big data analysis.
  • Supported by:
    Social Science Foundation of Beijing(21GGB018) and the Educational Reform Foundation of Central University of Finance and Economics(2022ZXJG09).

摘要: 遗传规划算法采用函数变换将原有变量张成的空间映射到新的特征空间,通过遗传算子操作实现目标函数的最优化。影响房价波动的因素有很多,各影响因素与房价之间呈现复杂的非线性关系。文中提出了一种基于演化CatBoost算法的房价预测模型,将影响房价波动的各因素变量编码为遗传规划算法的终端变量,采用CatBoost算法作为基学习器构建适应度函数,针对房价预测的特点设计合理的遗传算子,在函数映射后的特征空间上实现目标函数的最优化,以提升预测模型的性能。实验结果表明,基于演化CatBoost算法的房价预测模型的预测性能优于传统的基于随机森林算法、支持向量机算法、自适应增强算法、极致梯度提升算法等的预测模型,能够更好地实现房价的预测,在相同条件下具有更高的预测准确度。

关键词: 遗传规划, CatBoost算法, 预测模型, 决策树, 最优化

Abstract: Genetic programming algorithm uses function transformation to map the space formed by the original variables to a new feature space,and optimizes the objective function through genetic operator operations.There are many factors that affect housing price fluctuations,and each influencing factor exhibits a complex nonlinear relationship with housing prices.This paper proposes an evolutionary CatBoost algorithm based housing price prediction model.Various factor variables that affect housing price fluctuations are encoded as terminal variables of the genetic programming algorithm.CatBoost algorithm is employed as the base learner to construct a fitness function,and reasonable genetic operators are designed according to the characteristics of hou-sing price prediction.The objective function is optimized in the feature space after function mapping to improve the performance of the prediction model.Experimental results show that the prediction performance of evolutionary CatBoost algorithm based housing price prediction model is superior to that of traditional prediction models based on random forest algorithm,support vector machine algorithm,adaptive enhancement algorithm,extreme gradient enhancement algorithm,etc.It can predict housing prices more accurately than the rivals under the same conditions.

Key words: Genetic programming, CatBoost algorithm, Prediction model, Decision tree, Optimization

中图分类号: 

  • TP391
[1]LIU X Y,DU C,LI S L.Natural geographical constraints,land use regulations,and China's housing supply elasticity[J].Economic Research,2019,54(4):99-115.
[2]LIN Z L,LI X Y.Empirical study on the influencing factors of commodity housing prices based on panel model[J].Economic Mathematics,2017,34(4):73-78.
[3]XU J,YE Z Q.Analysis of Factors Influencing the Price ofCommercial Housing Based on VAR Model[J].Statistics and Decision Making,2017(11):93-97.
[4]GAO T.Empirical Study on Factors Influencing Housing Prices in Nanjing City[J].Economic Research Guide,2019(12):125-127,172.
[5]ZHENG M,WANG H F,WANG C Z,et al.Speculative beha-vior in a housing market:Boom and bust[J],Economic Modelling,2017(61):50-64.
[6]ALFREDAS L,ANTANAS L,ALGIMANTAS L.Macroeco-nomic Variables Influencing Housing Prices in Vilnius[J].International Journal of Strategic Property Management,2022,26(1):24-34.
[7]YANG H,LI C.Research on the influencing factors and contribution of housing prices in Chinese cities:a relative importance decomposition based on R~2[J].Exploration of Economic Issues,2019(11):49-62.
[8]ZHANG Y,ZHANG D,MILLER E J.Spatial AutoregressiveAnalysis and Modeling of Housing Prices in City of Toronto[J].Journal of Urban Planning and Development,2021,147(1):05021003.
[9]WANG S,ZENG Y N,YAO J Y,et al.Economic policy uncertainty,monetary policy,and housing price in China[J],Journal of Applied Economics,2020,23(1):235-252.
[10]SU C W,LI X,TAO R.How does economic policy uncertainty affect prices of housing? Evidence from Germany[J].Argumenta Oeconomica,2019(1):131-153.
[11]CUI Z,ZHOU M Q,KONG L Z.Research on Heterogeneity of Factors Influencing Urban Housing Prices in China[J].Taxation and Economics,2022(6):65-74.
[12]LI C Q.Research on the Regional Heterogeneity of the Impact of Monetary Policy on the Real Economy and Housing Prices:A GVAR Model Based on the Construction of Payment Data Weight Matrix[J].Shanghai Finance,2023(6):56-70.
[13]BIN O.A prediction comparison of housing sales prices by parametric versus semi-parametric regressions[J].Journal of Hou-sing Economics,2004,13(1):68-84.
[14]KUSAN H,AYTEKIN O,ÖZDEMIR I.The use of fuzzy logic in predicting house selling price[J].Expert Systems with Applications,2010(37):1808-1813.
[15]BALCILAR M,GUPTA R,MILLER S M.The out-of-sample forecasting performance of nonlinear models of regional housing prices in the US[J].Applied Economics,2015(47):2259-2277.
[16]JIANG X,JIA Z,LI L,et al.Understanding Housing PricesUsing Geographic Big Data:A Case Study in Shenzhen[J].Sustainability,2022,14(9):5307.
[17]CHEN J H,ONG C F,ZHENG L,et al.Forecasting spatial dynamics of the housing market using Support Vector Machine[J].International Journal of Strategic Property Management,2017(4):273-283.
[18]WU J Y,WANG S Y,SHI H W,et al. Analysis and prediction of housing market prices in Beijing based on multi wavelet analysis[J].Journal of Beijing University of Chemical Technology:Natural Science Edition,2019,46(5):101-106.
[19]ZHOU L J,ZHAO M Y.Analysis of house price predictionbased on several types of machine learning models[J].National Circulation Economy,2022(6):111-116.
[20]PARKB,BAE J K.Using machine learning algorithms for hou-sing price prediction:The case of Fairfax County,Virginia hou-sing data[J].Expert Systems with Applications,2015,42(6):2928-2934.
[21]TRAWIŃSKI B,et al.Comparison of expert algorithms with machine learning models for real estate appraisal[C]//IEEE International Conference on Inovations in Intelligent Systems and Applications.2017:51-54.
[22]ZHU H Y,WANG Z J,YE C C,Prediction of Housing Prices in Urban Hotspot Areas Based on XGBoost Algorithm:A Case Study of Jiangbei New Area in Nanjing[J].Building Economy,2022,43(S2):433-437.
[23]GU J R,ZHU M C,JIANG L G Y.Housing price forecasting based on genetic algorithm and support vector machine[J].Expert Systems with Applications,2011(38):3383-3386.
[24]AGAPITOS R,LOUGHRAN M,NICOLAU S,et al.A Survey of Statistical Machine Learning Elements in Genetic Programming[J].IEEE Transactions on Evolutionary Computation,2019,23(6):1029-1048.
[25]ZHANG H,ZHOU A,ZHANG H.An Evolutionary Forest for Regression[J].IEEE Transactions on Evolutionary Computation,2022,26(4):735-749.
[26]PROKHORENKOVA L,GUSEV G,VOROBEV A,et al.Catboost:unbiased boosting with categorical features[J].Advances in Neural Information Processing Systems,2018(31):6638-6648.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!