Computer Science ›› 2023, Vol. 50 ›› Issue (5): 128-136.doi: 10.11896/jsjkx.220400089

• Database & Big Data & Data Science • Previous Articles     Next Articles

Deep Cross-modal Information Fusion Network for Stock Trend Prediction

CHENG Haiyang1, ZHANG Jianxin2, SUN Qisen1, ZHANG Qiang1,3, WEI Xiaopeng1,3   

  1. 1 Ministry of Education Key Laboratory of Advanced Design and Intelligent Computing,Dalian University,Dalian,Liaoning 116622,China
    2 School of Computer Science and Engineering,Dalian Minzu University,Dalian,Liaoning 116600,China
    3 School of Computer Science and Technology,Dalian University of Technology,Dalian,Liaoning 116024,China
  • Received:2022-04-11 Revised:2022-09-13 Online:2023-05-15 Published:2023-05-06
  • About author:CHENG Haiyang,born in 1998,master,is a member of China Computer Federation.His main research interests include data mining,series forecasting,and machine learning.
    ZHANG Qiang,born in 1971,Ph.D,professor,Ph.D supervisor,is a member of China Computer Federation.His main research interests include Biocomputing and artificial intelligence,and intelligent big data processing.
  • Supported by:
    National Natural Science Foundation of Liaoning Provincial Joint Fund(U1908214),National Natural Science Foundation of China(61972062),Liaoning Revitalization Talents Program(XLYC2008017) and Liaoning Province Key R&D Program(2019JH2/10100030).

Abstract: Stock trend prediction,as a classic and challenging task,can help traders make trading decisions for greater returns.Recently,deep learning related models have achieved obvious performance improvement on this task.However,most of the current deep learning related works only leverage the historical data on stock price to complete the trend prediction,which cannot capture the market dynamics other than price indicators,thus having an accuracy limitation to a certain extent.To this end,this paper combines social media texts with stock historical price information,and proposes a novel deep cross-modal information fusion network(DCIFNet) for stock trend prediction.DCIFNet first utilizes temporal convolution operations to encode stock prices and twitter texts,so that each element can have sufficient knowledge of its neighborhood elements.Then,the results are fed into a transformer-based cross-modal fusion structure to fuse stock prices and important information in Twitter texts more effectively.Finally,a multi-graph attention convolutional network is introduced to describe the interrelationships among different stocks,which well captures the industry,wiki and correlation relationship among related stocks,leading to the accuracy improvement of stock prediction.We have performed trend prediction and simulated trading experiments on high-frequency trading datasets in nine different industries,and ablation studies as well as compared experiments with multipronged attention network for stock forecasting(MAN-SF) demonstrate the effectiveness of the proposed DCIFNet method.In addition,with the optimal accuracy of 0.6309,it obviously outperforms representative methods on the stock prediction application.

Key words: Stock trend prediction, Social media text, Cross-modal information fusion, Graph convolutional network, Temporal convolution

CLC Number: 

  • TP391
[1]DEVI B U,SUNDAR D,ALLI P.An effective time series analysis for stock trend prediction using ARIMA model for nifty midcap-50[J].International Journal of Data Mining Knowledge Management Process,2013,3(1):65-78.
[2]DENG S M,ZHANG N Y,ZHANG W,et al.Knowledge-driven stock trend prediction and explanation via temporal convolu-tional network[C]//Companion Proceedings of The 2019 World Wide Web Conference.San Francisco,New York,USA:ACM,2019:678-685.
[3]HU Z N,LIU W Q,BIAN J,et al.Listening to chaotic whispers:A deep learning framework for news-oriented stock trend prediction[C]//Proceedings of the 18th ACM International Conference on Web Search and Data Mining.Marina Del Rey,New York,USA:ACM,2018:261-269.
[4]WENG B,AHMED M A,MEGAHED F M.Stock market one-day ahead movement prediction using disparate data sources[J].Expert Systems with Applications,2017,79(2017):153-163.
[5]HUANG J Y,ZHANG Y J,ZHANG J L,et al.A tensor-based sub-mode coordinate algorithm for stock prediction[C]//IEEE 3rd International Conference on Data Science in Cyberspace(DSC).Guangzhou,China.Piscataway:IEEE,2018:716-721.
[6]SAWHNEY R,AGARWAL S,WADHWA A,et al.Deep attentive learning for stock movement prediction from social media text and company correlations[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Proces-sing(EMNLP).Online.New York:ACM,2020:8415-8426.
[7]TSAI Y H H,BAI S J,LIANG P P,et al.Multimodal transformer for unaligned multimodal language sequences[C]//Proceedings of the Conference Association for Computational Linguistics Meeting.Bethesda:NIH Public Access,2019:6558-6569.
[8]CHEN L,CHI Y G,GUAN Y Y,et al.A Hybrid Attention-Based EMD-LSTM Model for Financial Time Series Prediction[C]//2nd International Conference on Artificial Intelligence and Big Data(ICAIBD).Chengdu,China.Piscataway:IEEE,2019:113-118.
[9]LIU M,SHAN Y Y.Predicition of Closing Price of Stock Index Based on EMD-LSTM Model[J].Journal of Chongqing University of Technology(Natural Science),2021,35(12):269-276.
[10]YANG B,GONG Z,YANG W.Stock market index predictionusing deep neural network ensemble[C]//2017 36th Chinese Control Conference(CCC).Dalian,China.Piscataway:IEEE,2017:3882-3887.
[11]CHENG L C,HUANG Y H,WU M E.Applied attention-based LSTM neural networks in stock prediction[C]//IEEE International Conference on Big Data(Big Data).Seattle,WA,USA.Piscataway:IEEE,2018 :4716-4718.
[12]BOLLERSLEV T.Generalized autoregressive conditional he-teroskedasticity[J].Journal of Econometrics,1986,31(3):307-327.
[13]ETHEM A.Introduction to machine learning[M].Cambridge:MIT press,2020.
[14]SANBOON T,KEATRUANGKAMALA K,JAIYEN S.ADeep Learning Model for Predicting Buy and Sell Recommendations in Stock Exchange of Thailand using Long Short-Term Memory[C]//2019 IEEE 4th International Conference on Computer and Communication Systems(ICCCS).Singapore.Piscataway:IEEE,2019:757-760.
[15]WANG D,WANG X P,YANG C D.A Study of Stock Forecasting Based on LSTM Model of Principal Component Analysis[J].Journal of Chongqing University of Technology(Natural Science),2021,35(2):282-288.
[16]ALEXIEI D,KARL F.Financial time series forecasting-a deep learning approach[J].International Journal of Machine Learning Computing,2017,7(5):118-122.
[17]LIU Q K,CHENG X,SU S,et al.Hierarchical Complementary Attention Network for Predicting Stock Price Movements with News[C]//Proceedings of the 27th ACM International Confe-rence on Information and Knowledge Management.Torino,Italy; ACM.2018:1603-1606.
[18]DING X,ZHANG Y,LIU T,et al.Deep learning for event-dri-ven stock prediction[C]//Proceedings of the 24th International Conference on Artificial Intelligence.Buenos Aires,Argentina:AAAI.2015:2327-2333.
[19]NIKFARJAM A,EMADZADEH E,MUTHAIYAH S.Textmining approaches for stock market prediction[C]//Interna-tional Conference on Computer and Automation Engineering.Singapore.Piscataway:IEEE,2010:256-260.
[20]WU H Z,ZHANG W,SHEN W W,et al.Hybrid deep sequential modeling for social text-driven stock prediction[C]//Proceedings of the 27th ACM International Conference on Information and Knowledge Management.Torino Italy.New York:ACM,2018:1627-1630.
[21]FENG F L,HE X N,WANG X,et al.Temporal relational ran-king for stock prediction[J].ACM Transactions on Information Systems,2019,37(2):1-30.
[22]CHEN Y M,WEI Z Y,HUANG X J.Incorporating corporation relationship via graph convolutional neural networks for stock price prediction[C]//ACM International Conference on Information and Knowledge Management.New York,United States.New York:ACM,2018:1655-1658.
[23]MATSUNAGA D,SUZUMURA T,TAKAHASHI T.Exploring graph neural networks for stock market predictions with rolling window analysis[J].arXiv:1909.10660,2019.
[24]KIM R,SO C H,JEONG M,et al.Hats:A hierarchical graph attention network for stock movement prediction[J].arXiv:1908.07999,2019.
[25]XU W T,LIU W Q,XU C,et al.REST:Relational Event-driven Stock Trend Forecasting[C]//Proceedings of the Web Confe-rence.Ljubljana,Slovenia.New York:ACM,2021:1-10.
[26]SERGEY L,CHRISTIAN S.Batch normalization:Accelerating deep network training by reducing internal covariate shift[C]//International Conference on Machine Learning.Lille,France.New York:PMLR,2015:448-456.
[27]XU Y,COHEN S B.Stock movement prediction from tweets and historical prices[C]//Proceedings of the 56th Annual Mee-ting of the Association for Computational Linguistics(Volume 1:Long Papers)Melbourne,Australia.Melbourne:ACL,2018:1970-1979.
[28]LEI B J,RYAN K J,E H G.Layer normalization[J].arXiv:1607.06450,2016.
[29]LIU G,WANG X J,LI R F.Multi-scale RCNN model for financial time-series classification[J].arXiv:1911.09359,2019.
[30]CHEN Q K,ROBERT C Y.Graph-Based Learning for Stock Movement Prediction with Textual and Relational Data[J].ar-Xiv:2107.10941,2021.
[31]YE J X,ZHAO J J,YE K J,et al.Multi-graph convolutional network for relationship-driven stock movement prediction[C]//2020 25th International Conference on Pattern Recognition(ICPR).Milan,Italy.Piscataway:IEEE,2021:6702-6709.
[32]BENESTY J,CHEN J D,HUANG Y T,et al.Pearson correlation coefficient[M]//Noise Reduction in Speech Processing.Berlin,Heidelberg:Springer.2009:1-4.
[33]KIPF T N,WELLINGS M.Semi-supervised classification with graph convolutional networks[J].arXiv:1609.02907,2016.
[34]GLOROT X,BORDES A,BENGIO Y.Deep sparse rectifierneural networks[C]//Proceedings of the 14th International Conference on Artificial Intelligence and Statistics.Microtome Publishing:JMLR Workshop and Conference Proceedings,2011:315-323.
[35]SHARPE W F.The sharpe ratio[J].Journal of portfolio ma-nagement,1998,21(1):169-185.
[36]CHICCO D,JURMAN G.The advantages of the Matthews correlation coefficient(MCC) over F1 score and accuracy in binary classification evaluation[J].BMC Genomics,2020,21(1):1-13.
[37]NICOLAS H.Pairs selection and outranking:An application to the S&P 100 index[J].European Journal of Operational Research,2009,196(2):819-825.
[38]NICOLAS H.Pairs trading and outranking:The multi-step-ahead forecasting case[J].European Journal of Operational Research,2010,207(3):1702-1716.
[1] DUAN Jianyong, YANG Xiao, WANG Hao, HE Li, LI Xin. Document-level Relation Extraction of Graph Attention Convolutional Network Based onInter-sentence Information [J]. Computer Science, 2023, 50(6A): 220800189-6.
[2] YANG Ying, ZHANG Fan, LI Tianrui. Aspect-based Sentiment Analysis Based on Dual-channel Graph Convolutional Network with Sentiment Knowledge [J]. Computer Science, 2023, 50(5): 230-237.
[3] WANG Yali, ZHANG Fan, YU Zeng, LI Tianrui. Aspect-level Sentiment Classification Based on Interactive Attention and Graph Convolutional Network [J]. Computer Science, 2023, 50(4): 196-203.
[4] LI Shuai, XU Bin, HAN Yike, LIAO Tongxin. SS-GCN:Aspect-based Sentiment Analysis Model with Affective Enhancement and Syntactic Enhancement [J]. Computer Science, 2023, 50(3): 3-11.
[5] ZHOU Mingqiang, DAI Kailang, WU Quanwang, ZHU Qingsheng. Attention-aware Multi-channel Graph Convolutional Rating Prediction Model for Heterogeneous Information Networks [J]. Computer Science, 2023, 50(3): 129-138.
[6] CAO Jinjuan, QIAN Zhong, LI Peifeng. End-to-End Event Factuality Identification with Joint Model [J]. Computer Science, 2023, 50(2): 292-299.
[7] ZHENG Cheng, MEI Liang, ZHAO Yiyan, ZHANG Suhang. Text Classification Method Based on Bidirectional Attention and Gated Graph Convolutional Networks [J]. Computer Science, 2023, 50(1): 221-228.
[8] WANG Ming, PENG Jian, HUANG Fei-hu. Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction [J]. Computer Science, 2022, 49(8): 40-48.
[9] TAN Ying-ying, WANG Jun-li, ZHANG Chao-bo. Review of Text Classification Methods Based on Graph Convolutional Network [J]. Computer Science, 2022, 49(8): 205-216.
[10] ZHAO Xiao-hu, YE Sheng, LI Xiao. Multi-algorithm Fusion Behavior Classification Method for Body Bone Information Reconstruction [J]. Computer Science, 2022, 49(6): 269-275.
[11] ZHOU Hai-yu, ZHANG Dao-qiang. Multi-site Hyper-graph Convolutional Neural Networks and Application [J]. Computer Science, 2022, 49(3): 129-133.
[12] PAN Zhi-hao, ZENG Bi, LIAO Wen-xiong, WEI Peng-fei, WEN Song. Interactive Attention Graph Convolutional Networks for Aspect-based Sentiment Classification [J]. Computer Science, 2022, 49(3): 294-300.
[13] WANG Xiao-ming, WEN Xu-yun, XU Meng-ting, ZHANG Dao-qiang. Graph Convolutional Network Adversarial Attack Method for Brain Disease Diagnosis [J]. Computer Science, 2022, 49(12): 340-345.
[14] ZHANG Bin, LIU Chang-hong, ZENG Sheng, JIE An-quan. Speech-driven Personal Style Gesture Generation Method Based on Spatio-Temporal GraphConvolutional Networks [J]. Computer Science, 2022, 49(11A): 210900094-5.
[15] XIAO Zheng-ye, LIN Shi-quan, WAN Xiu-an, FANGYu-chun, NI Lan. Temporal Relation Guided Knowledge Distillation for Continuous Sign Language Recognition [J]. Computer Science, 2022, 49(11): 156-162.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!