计算机科学 ›› 2020, Vol. 47 ›› Issue (6A): 467-473.doi: 10.11896/JsJkx.190900128

• 数据库 & 大数据 & 数据科学 • 上一篇    下一篇

基于LSTM-GA的股票价格涨跌预测模型

包振山1, 郭俊南1, 谢源2, 张文博1   

  1. 1 北京工业大学信息学部 北京 100124;
    2 康曼德资本管理有限公司 北京 100600
  • 发布日期:2020-07-07
  • 通讯作者: 张文博(zhangwenbo@bJut.edu.cn)
  • 作者简介:baozhenshan@bJut.edu.cn
  • 基金资助:
    国家重点研发计划项目(2017YFC0803300)

Model for Stock Price Trend Prediction Based on LSTM and GA

BAO Zhen-shan1, GUO Jun-nan1, XIE Yuan2 and ZHANG Wen-bo1   

  1. 1 Faulty of Information Technology,BeiJing University of Technology,BeiJing 100124,China
    2 Commando Capital Company,BeiJing 100600,China
  • Published:2020-07-07
  • About author:BAO Zhen-shan, born in 1965, is a mem-ber of China Computer Federation.His main research interests include machine learning and Financial technology.
    ZHANG Wen-bo, born in 1980, Ph.D, lecturer, is a member of China Compu-ter Federation.Her main research inte-rests include heterogeneous computing and trust computing.
  • Supported by:
    This work was supported by the National Key R&D Program of China (2017YFC0803300).

摘要: 如何准确地进行股票预测一直是量化金融领域的重要问题。长短期记忆细胞神经网络(LSTM)的出现较好地解决了股票预测这类的复杂序列化数据学习的问题。然而前期研究结果表明单一使用该方法仍存在预测不平衡、陷入局部极值导致能力不佳的问题。基于上述问题,文中利用将遗传算法(GA)解决调参问题来保证模型预测的平衡性,由此构建了新型股票预测模型。该模型分为三部分,首先利用LSTM网络进行收盘价的预测,再利用基于遗传算法的判别机制,最终获取下一刻股票的涨跌信号。这一模型不同于先前的研究,主要针对LSTM模型的输出模块进行了改进。文中使用了中证500的日内分钟数据进行测试验证。实验得出,改进模型的各方面指标均优于单独的LSTM模型。

关键词: 长短期记忆神经网络, 股票预测, 机器学习, 遗传算法

Abstract: How to make an accurate financial time series prediction is one of the important quantitative financial problems.Long and short term memory neural network (LSTM) has solved the complex serialized data learning problems such as stock prediction much better.However,the results of previous studies show that there are still some problems such as unbalanced prediction and local minimum value,which lead to poor prediction ability.Based on the above problems,the genetic algorithm (GA) is used to solve the parameter adJustment problem to ensure the balance of model prediction,and a new stock prediction model is constructed.First,LSTM neural network is used to predict closing price.Then,the prediction results are calculated to the Judgment method based on genetic algorithm.Finally,the predicted stock’s ups and downs signals are gained as the output.This model is different from the previous state-of-the-art and is mainly improved for the output module of the LSTM model.High-frequency trading data of Index China are used for verification.The results show that the improved model is better than the LSTM model.

Key words: Genetic Algorithm, Long short-term memory, Machine learning, Stock prediction

中图分类号: 

  • TP183
[1] GANDHMAL D P,KUMAR K.Systematic analysis and review of stock market prediction techni-ques.Computer Science Review,2019,34:100-190.
[2] CHAKRAVARTY S,DASH P K.A PSO based integrated functional link net and interval type-2 fuzzy logic system for predicting stock market indices.Applied Soft Computing,2012,12(2):931-941.
[3] NI L,NI Z,GAO Y,et al.Stock trend prediction based on fractal feature selection and support vector machine.Expert Systems with Applications,2011,38(5):5569-5576.
[4] KRAUSS C,DO X A,HUCK N,et al.Deep neural networks,gradient-boosted trees,random forests:Statistical arbitrage on the S&P 500.European Journal of Operational Research,2017,259(2):689-702.
[5] FISCHER T,KRAUSS C.Deep learning with long short-term memory networks for financial market predictions.European Journal of Operational Research,2018,270(2):654-669
[6] HENRIQUE B M,SOBREIRO V A,KIMURA H,et al.Literature review:Machine learning techniques applied to financial market prediction.Expert Systems with Applications,2019,124(JUN):226-251.
[7] HOCHREITER S,SCHMIDHUBER J.Long short-term memory.Neural Computation,1997,9(8):1735-1780.
[8] GERS F A,SCHMIDHUBER,JRGEN,et al.Learning to Forget:Continual Prediction with LSTM.Neural Computation,2000,12(10):2451-2471.
[9] GRAVES A.Supervised Sequence Labelling with Recurrent Neural Networks.Springer,2012.
[10] QIN H M,SUN X.Classifying Bug Reports into Bugs and Nonbugs Using LSTM//The Tenth Asia-Pacifc Symposium on Internet-ware.2018.
[11] HUANG Y M,JIANG Y,HASAN T,et al.A Topic BiLSTM Model for Sentiment Classification//Innovation in Artificial Intelligence (ICIAI).2018.
[12] NELSON D M Q , PEREIRA A C M , OLIVEIRA R A D.Stock market’s price movement prediction with LSTM neural networks//International Joint Conference on Neural Networks (IJCNN).2017.
[13] LIN M,CHEN C X.Short-term prediction of stock market price based on GA optimization LSTM neurons//International Conference on Deep Learning Technologies (ICDLT).2018.
[14] SHIU J N,ZOU J Z,ZHANG J,et al.Research of stock price prediction based on dmd-lstm model .Application Research of Computers.https://doi.org/10.19734/J.issn.1001-3695.2018.08.0657.
[15] CHEN J,LIU D X,WU D S.Stock index forecasting method based on feature selection and LSTM model.Computer Engineering and Applications,2019,55(6):108-112.
[16] HO T K.Random decision forests//International Conference on Document Analysis and Recognition.1995:278-282.
[17] CHOLLET F.Keras.https://github.com/fchollet/keras,2016.
[18] Keras Documentation.https://keras.io.
[19] GRANGER C W J.Strategies for Modelling Nonlinear Time‐Series Relationships.Economic Record,2010,69(3):233-238.
[1] 冷典典, 杜鹏, 陈建廷, 向阳.
面向自动化集装箱码头的AGV行驶时间估计
Automated Container Terminal Oriented Travel Time Estimation of AGV
计算机科学, 2022, 49(9): 208-214. https://doi.org/10.11896/jsjkx.210700028
[2] 宁晗阳, 马苗, 杨波, 刘士昌.
密码学智能化研究进展与分析
Research Progress and Analysis on Intelligent Cryptology
计算机科学, 2022, 49(9): 288-296. https://doi.org/10.11896/jsjkx.220300053
[3] 李瑶, 李涛, 李埼钒, 梁家瑞, Ibegbu Nnamdi JULIAN, 陈俊杰, 郭浩.
基于多尺度的稀疏脑功能超网络构建及多特征融合分类研究
Construction and Multi-feature Fusion Classification Research Based on Multi-scale Sparse Brain Functional Hyper-network
计算机科学, 2022, 49(8): 257-266. https://doi.org/10.11896/jsjkx.210600094
[4] 张光华, 高天娇, 陈振国, 于乃文.
基于N-Gram静态分析技术的恶意软件分类研究
Study on Malware Classification Based on N-Gram Static Analysis Technology
计算机科学, 2022, 49(8): 336-343. https://doi.org/10.11896/jsjkx.210900203
[5] 何强, 尹震宇, 黄敏, 王兴伟, 王源田, 崔硕, 赵勇.
基于大数据的进化网络影响力分析研究综述
Survey of Influence Analysis of Evolutionary Network Based on Big Data
计算机科学, 2022, 49(8): 1-11. https://doi.org/10.11896/jsjkx.210700240
[6] 陈明鑫, 张钧波, 李天瑞.
联邦学习攻防研究综述
Survey on Attacks and Defenses in Federated Learning
计算机科学, 2022, 49(7): 310-323. https://doi.org/10.11896/jsjkx.211000079
[7] 李亚茹, 张宇来, 王佳晨.
面向超参数估计的贝叶斯优化方法综述
Survey on Bayesian Optimization Methods for Hyper-parameter Tuning
计算机科学, 2022, 49(6A): 86-92. https://doi.org/10.11896/jsjkx.210300208
[8] 赵璐, 袁立明, 郝琨.
多示例学习算法综述
Review of Multi-instance Learning Algorithms
计算机科学, 2022, 49(6A): 93-99. https://doi.org/10.11896/jsjkx.210500047
[9] 杨浩雄, 高晶, 邵恩露.
考虑一单多品的外卖订单配送时间的带时间窗的车辆路径问题
Vehicle Routing Problem with Time Window of Takeaway Food ConsideringOne-order-multi-product Order Delivery
计算机科学, 2022, 49(6A): 191-198. https://doi.org/10.11896/jsjkx.210400005
[10] 肖治鸿, 韩晔彤, 邹永攀.
基于多源数据和逻辑推理的行为识别技术研究
Study on Activity Recognition Based on Multi-source Data and Logical Reasoning
计算机科学, 2022, 49(6A): 397-406. https://doi.org/10.11896/jsjkx.210300270
[11] 姚烨, 朱怡安, 钱亮, 贾耀, 张黎翔, 刘瑞亮.
一种基于异质模型融合的 Android 终端恶意软件检测方法
Android Malware Detection Method Based on Heterogeneous Model Fusion
计算机科学, 2022, 49(6A): 508-515. https://doi.org/10.11896/jsjkx.210700103
[12] 王飞, 黄涛, 杨晔.
基于Stacking多模型融合的IGBT器件寿命的机器学习预测算法研究
Study on Machine Learning Algorithms for Life Prediction of IGBT Devices Based on Stacking Multi-model Fusion
计算机科学, 2022, 49(6A): 784-789. https://doi.org/10.11896/jsjkx.210400030
[13] 许杰, 祝玉坤, 邢春晓.
机器学习在金融资产定价中的应用研究综述
Application of Machine Learning in Financial Asset Pricing:A Review
计算机科学, 2022, 49(6): 276-286. https://doi.org/10.11896/jsjkx.210900127
[14] 魏辉, 陈泽茂, 张立强.
一种基于顺序和频率模式的系统调用轨迹异常检测框架
Anomaly Detection Framework of System Call Trace Based on Sequence and Frequency Patterns
计算机科学, 2022, 49(6): 350-355. https://doi.org/10.11896/jsjkx.210500031
[15] 么晓明, 丁世昌, 赵涛, 黄宏, 罗家德, 傅晓明.
大数据驱动的社会经济地位分析研究综述
Big Data-driven Based Socioeconomic Status Analysis:A Survey
计算机科学, 2022, 49(4): 80-87. https://doi.org/10.11896/jsjkx.211100014
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!