一种基于深度LSTM和注意力机制的金融数据预测方法

doi:10.11896/jsjkx.200700050

计算机科学 ›› 2020, Vol. 47 ›› Issue (12): 125-130.doi: 10.11896/jsjkx.200700050

所属专题：大数据&数据科学虚拟专题

• 数据库&大数据&数据科学 • 上一篇下一篇

一种基于深度LSTM和注意力机制的金融数据预测方法

刘翀, 杜军平

北京邮电大学计算机学院智能通信软件与多媒体北京市重点实验室北京 100876

收稿日期:2020-07-08 修回日期:2020-09-03 出版日期:2020-12-15 发布日期:2020-12-17
通讯作者: 杜军平(junpingdu@126.com)
作者简介:Alen123456@163.com
基金资助:
国家自然科学基金项目(61902037615320066177208361802028);广西科技重大专项(桂科AA18118054)

Financial Data Prediction Method Based on Deep LSTM and Attention Mechanism

LIU Chong, DU Jun-ping

Beijing Key Laboratory of Intelligent Telecommunication Software and Multimedia School of Computer Science Beijing University of Posts and Telecommunications Beijing 100876,China

Received:2020-07-08 Revised:2020-09-03 Online:2020-12-15 Published:2020-12-17
About author:LIU Chong,born in 1995postgraduateis a member of China Computer Federation.His main research interests include nature language processingcomputer vision and deep learning.
DU Jun-ping,born in 1963Ph.Dprofessoris a fellow of China ComputerFederation and CAAI.Her main research interests include artificial intelligenceimage processing and pattern recognition.
Supported by:
National Natural Science Foundation of China(61902037,61532006,61772083,61802028) and Science and Technology Major Project of Guangxi (Guike AA18118054).

摘要/Abstract

摘要： 随着互联网的迅速发展金融市场每日产生了大量在线金融数据如每日的交易次数以及交易的总金额等.近年来金融市场数据的动态预测成为了研究热点.金融市场数据量大输入序列较多且会随着时间发生变化.针对这些问题文中提出了基于深度LSTM和注意力机制的金融数据预测模型.首先该模型能处理复杂的金融市场数据输入主要是多序列的输入;其次该模型使用深度LSTM网络对金融数据进行建模解决了数据间长依赖的问题并能学习到更加复杂的市场动态特征;最后该模型引入了注意力机制使得不同时间的数据对预测的重要程度不同预测更加精准.在真实的金融大数据集上的实验表明所提模型在动态预测领域具有准确性高、稳定性好的特点.

关键词: 金融预测, 深层LSTM, 序列模型, 注意力机制

Abstract: With the rapid development of the Internetfinancial markets generate a large amount of online financial data every daysuch as the number of daily transactions and the total amount of transactions.The dynamic prediction of financial market data has become a research hotspot in recent years.Howeverthe financial market has a large amount of datamany input sequencesand changes over time.Aiming at solving these problemsthis paper proposes a financial data prediction model based on deep LSTM and attention mechanism.Firstthe model can handle complex financial market data which are mainly multi-sequence data.Secondthe model uses deep LSTM networks to model financial datasolves the problem of long dependence between dataand can learn more complex market dynamic characteristics.Finallythe model introduces the attention mechanismwhich makes the data of different time have different importance to the prediction and make the prediction more accurate.Experiments on real large data sets show that the proposed model has the characteristics of high accuracy and good stability in the field of dynamic prediction.

Key words: Attention mechanism, Deep LSTM, Financial forecasting, Sequence model

中图分类号:

TP391

刘翀, 杜军平. 一种基于深度LSTM和注意力机制的金融数据预测方法[J]. 计算机科学, 2020, 47(12): 125-130. https://doi.org/10.11896/jsjkx.200700050

LIU Chong, DU Jun-ping. Financial Data Prediction Method Based on Deep LSTM and Attention Mechanism[J]. Computer Science, 2020, 47(12): 125-130. https://doi.org/10.11896/jsjkx.200700050

参考文献

[1] ZHANG J,SUN Q.Research on Financing Cost of Small and Medium-Sized Enterprises by Internet Finance[J].Open Journal of Social Sciences,2017,5(11):95.
[2] LIN Y H,CHEN C F.Research on Enterprise Financial RiskEvaluation Based on Association Rules[J].Friends of Accounting,2017(1):32-35.
[3] LIU J X,JIA X Y.A Multi-label Classification Algorithm Based on Association Rules Mining[J].Journal of Software,2017,28(11):2865-2878.
[4] GREFF K,SRIVASTAVA R K,KOUTNÍK J,et al.LSTM:A search space odyssey[J].IEEE Transactions on Neural Networks and Learning Systems,2016,28(10):2222-2232.
[5] MERITY S,KESKAR N S,SOCHER R.Regularizing and optimizing LSTM language models[J].arXiv:1708.02182.
[6] ZHAO Z,CHEN W,WU X,et al.LSTM network:a deep learning approach for short-term traffic forecast[J].IET Intelligent Transport Systems,2017,11(2):68-75.
[7] KARIM F,MAJUMDAR S,DARABI H,et al.LSTM fully convolutional networks for time series classification[J].IEEE Access,2017,6:1662-1669.
[8] FU R,ZHANG Z,LI L.Using LSTM and GRU neural network methods for traffic flow prediction[C]//2016 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC).IEEE,2016:324-328.
[9] CHOI H,CHO K,BENGIO Y.Fine-grained attention mechanism for neural machine translation[J].Neurocomputing,2018,284:171-176.
[10] TILK O,ALUMÄE T.Bidirectional Recurrent Neural Network with Attention Mechanism for Punctuation Restoration[C]//Interspeech.2016:3047-3051.
[11] WANG J,SUN T,LIU B,et al.CLVSA:A convolutional LSTM based variational sequence-to-sequence model with attention for predicting trends of financial markets[C]//Proceedings of the 28th International Joint Conference on Artificial Intelligence.AAAI Press,2019:3705-3711.
[12] CHEN L,CHI Y,GUAN Y,et al.A Hybrid Attention-BasedEMD-LSTM Model for Financial Time Series Prediction[C]//2019 2nd International Conference on Artificial Intelligence and Big Data (ICAIBD).IEEE,2019:113-118.
[13] JIANG M,WANG J,LAN M,et al.An effective gated and attention-based neural network model for fine-grained financial target-dependent sentiment analysis[C]//International Conference on Knowledge Science,Engineering and Management.Springer,Cham,2017:42-54.
[14] CONTRERAS J,ESPINOLA R,NOGALES F J,et al.ARIMA models to predict next-day electricity prices[J].IEEE Transactions on Power Systems,2003,18(3):1014-1020.
[15] GAO Y,GLOWACKA D.Deep gate recurrent neural network[C]//Asian Conference on Machine Learning.2016:350-365.
[16] HOCHREITER S,SCHMIDHUBER J.Long short-term memory[J].Neural Computation,1997,9(8):1735-1780.
[17] ZHAO H K,WU L K,LI H,et al.Predicting the Dynamics in Internet Finance Based on Deep Neural Network Structure[J].Journal of Computer Research and Development,2019,56(8):1621-1631.

相关文章 15

[1]	周芳泉, 成卫青. 基于全局增强图神经网络的序列推荐 Sequence Recommendation Based on Global Enhanced Graph Neural Network 计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[2]	戴禹, 许林峰. 基于文本行匹配的跨图文本阅读方法 Cross-image Text Reading Method Based on Text Line Matching 计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032
[3]	周乐员, 张剑华, 袁甜甜, 陈胜勇. 多层注意力机制融合的序列到序列中国连续手语识别和翻译 Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion 计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[4]	熊丽琴, 曹雷, 赖俊, 陈希亮. 基于值分解的多智能体深度强化学习综述 Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization 计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112
[5]	饶志双, 贾真, 张凡, 李天瑞. 基于Key-Value关联记忆网络的知识图谱问答方法 Key-Value Relational Memory Networks for Question Answering over Knowledge Graph 计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[6]	汪鸣, 彭舰, 黄飞虎. 基于多时间尺度时空图网络的交通流量预测模型 Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction 计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188
[7]	姜梦函, 李邵梅, 郑洪浩, 张建朋. 基于改进位置编码的谣言检测模型 Rumor Detection Model Based on Improved Position Embedding 计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[8]	朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥. 基于注意力机制的医学影像深度哈希检索算法 Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism 计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[9]	孙奇, 吉根林, 张杰. 基于非局部注意力生成对抗网络的视频异常事件检测方法 Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection 计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[10]	闫佳丹, 贾彩燕. 基于双图神经网络信息融合的文本分类方法 Text Classification Method Based on Information Fusion of Dual-graph Neural Network 计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042
[11]	张颖涛, 张杰, 张睿, 张文强. 全局信息引导的真实图像风格迁移 Photorealistic Style Transfer Guided by Global Information 计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036
[12]	曾志贤, 曹建军, 翁年凤, 蒋国权, 徐滨. 基于注意力机制的细粒度语义关联视频-文本跨模态实体分辨 Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism 计算机科学, 2022, 49(7): 106-112. https://doi.org/10.11896/jsjkx.210500224
[13]	徐鸣珂, 张帆. Head Fusion:一种提高语音情绪识别的准确性和鲁棒性的方法 Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition 计算机科学, 2022, 49(7): 132-141. https://doi.org/10.11896/jsjkx.210100085
[14]	孟月波, 穆思蓉, 刘光辉, 徐胜军, 韩九强. 基于向量注意力机制GoogLeNet-GMP的行人重识别方法 Person Re-identification Method Based on GoogLeNet-GMP Based on Vector Attention Mechanism 计算机科学, 2022, 49(7): 142-147. https://doi.org/10.11896/jsjkx.210600198
[15]	金方焱, 王秀利. 融合RACNN和BiLSTM的金融领域事件隐式因果关系抽取 Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM 计算机科学, 2022, 49(7): 179-186. https://doi.org/10.11896/jsjkx.210500190

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

一种基于深度LSTM和注意力机制的金融数据预测方法

Financial Data Prediction Method Based on Deep LSTM and Attention Mechanism

PDF (PC)

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

Metrics

本文评价

推荐阅读 0