Computer Science ›› 2026, Vol. 53 ›› Issue (6A): 250600172-8.doi: 10.11896/jsjkx.250600172

• Big Data & Data Science • Previous Articles     Next Articles

Time Series Prediction Method Based on Multi-level Wavelet Decomposition Bidirectional Mamba

LIU Pneg1, SHEN Jiying2, LIU Dongsheng1, CHEN Guibo1, SONG Yuanwei1   

  1. 1 College of Computer Science and Technology,Zhejiang Gongshang University,Hangzhou 310018,China
    2 Hangzhou Shuzheng Technology Co.,Ltd.,Hangzhou 310020,China
  • Online:2026-06-16 Published:2026-06-12
  • About author:LIU Peng,born in 1999,postgraduate.His main research interests include time series analysis and deep learning.
    LIU Dongsheng,born in 1971,professor,master's supervisor.His main research interests include multimodal data fusion algorithms,time series prediction,anomaly detection,and other deep reinforcement learning algorithms in artificial intelligence.
  • Supported by:
    “Pioneer” and “Leading Goose” R&D Program of Zhejiang(2026C01017,2026C01018,2025C04022,2025C01037) and Open Project of State Key Laboratory of Industrial Control Technology(ICT2025C02).

Abstract: Time series forecasting can provide insights into future trends and patterns,which is crucial for various applications.For example,weather forecast,power load forecast,etc.However,existing time series prediction models suffer from problems such as large model parameters,high computational complexity,and insufficient utilization of frequency domain information in data.To address these issues,a new model based on state space is proposed for long-term time series prediction.The model firstly uses multi-level wavelet decomposition to decouple the original time-series data into multiple sub sequences of different frequency bands.Secondly,it designs independent bidirectional Mamba modules for each subsequence to capture its unique dynamic patterns.Finally,the prediction results of each frequency band are accurately fused into the final prediction through wavelet reconstruction.Experimental results on seven publicly available datasets,including ETT,show that the method achieves optimal performance at multiple prediction lengths,with an average MSE reduction of 4.12% compared to the current best baseline model.This method has demonstrated its effectiveness and potential for practical applications on time series public datasets.

Key words: Time series prediction, Multi resolution wavelet decomposition, State space model, Linear model, Power load forecasting

CLC Number: 

  • TP391
[1] SIMS C A.Macroeconomics and reality[J].Econometrica:journal of the Econometric Society,1980:1-48.
[2] BOX G E P,JENKINS G M.Time series analysis:forecastingand control[M].San Francisco:Holden-Day,1970.
[3] SALINAS D,FLUNKERT V,GASTHAUS J,et al.DeepAR:Probabilistic forecasting with autoregressive recurrent networks[J].International Journal of Forecasting,2020,36(3):1181-1191.
[4] LIU M,ZENG A,CHEN M,et al.Scinet:Time Series Modeling and Prediction Based on Sample Convolution and Interaction [C]//Advances in Neural Information Processing Systems.2022:5816-5828.
[5] HAN K,XIAO A,WU E,et al.Transformers in Transformers[C]//Advances in Neural Information Processing Systems.2021:15908-15919.
[6] ZHOU H,ZHANG S,PENG J,et al.Information provider:Effi-cient Converter Beyond Long Range Time Series Prediction [C]//Proceedings of the AAAI Conference on Artificial Intelligence.2021:11106-11115.
[7] WU H,XU J,WANG J,et al.Autoformer:Autocorrelated Decomposition Converter for Long term Sequence Prediction [C]//Advances in Neural Information Processing Systems.2021:22419-22430.
[8] ZHOU T,MA Z,WEN Q,et al.Fedformer:Frequency enhanceddecomposition transformer for long-term sequence prediction [C]//Proceedings of the International Conference on Machine Learning.2022:27268-27286.
[9] ZHANG Y,YAN J.Crossformer:Transformer for MultivariateTime Series Prediction Using Cross Dimensional Dependence [C]//Proceedings of the 11th International Conference on Learning Representations.2023.
[10] ZENG A L,WEI C X,ZENG M X,et al.Are Transformers Effective for Time Series Forecasting?[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2023:11121-11128.
[11] YI K,ZHANG Q,FAN W,et al.Frequency-domain MLPs are More Effective Learners in Time Series Forecasting[C]//Proceedings of the 37th Conference on Neural Information Processing Systems.2023.
[12] CHEN Z,LI Y,ZHOU T,et al.HyperAttention:Long-context Attention in Near-Linear Time[J].arXiv:2310.05869,2023.
[13] LIU S,YU H,LIAO C,et al.Pyraformer:Low complexity Pyramid Attention for Remote Time Series Modeling and Prediction [C]//Proceedings of the International Conference on Learning Representations.2022.
[14] NIE Y Q,NGUYEN N H,SINTHONG P,et al.A time series is worth64 words:Long-term forecasting with transformers[C]//Proceedings of the International Conference on Learning Representations.2023.
[15] LIU Y,HU T,ZHANG H.iTransformer:Inverted transformers are effective for time series forecasting[C]//Proceedings of the International Conference on Learning Representations.2024.
[16] GU A,GOEL K,RÉ C.Efficiently Modeling Long Sequenceswith Structured State Spaces[C]//International Conference on Learning Representations.2022:1-36.
[17] GU A,DAO T.Mamba:Linear-Time Sequence Modeling with Selective State Spaces[J].arXiv:2312.00752,2023.
[18] LIEBER O,LENZ B,BATA H,et al.Jamba:a hybrid transformer-mamba language model[C]//Proceedings of the International Conference on Learning Representations.2024.
[19] WANG Z,KONG F,FENG S,et al.Is mamba effective for time series forecasting?[J].Neurocomputing,2025,619:129178.
[20] ZHAN Y,MA Z,LIU Q,et al.MambaMixer:Efficient Multi-Scale Sequence Modeling with Selective State Spaces[C]//Proceedings of the 12th International Conference on Learning Representations.2024:1-22.
[21] YANG J,WANG L,ZHANG H,et al.TimeMachine:Forecasting Time Series with Regime Switching via State Space Models[J].Neural Networks,2024:45-60.
[22] KIM T,KIM J,TAE Y,et al.Reversible instance normalization for accurate time-series forecasting against distributionshift[C]//Proceedings of the 10thInternational Conference on Learning Representations.2022.
[23] COTTER F.Uses of complex wavelets in deep convolutionalneural networks[D].Cambridge:University of Cambridge,2019.
[24] LIANG A,JIANG X,SUN Y,et al.Bi-Mamba+:Bidirectional Mamba for Time Series Forecasting[J].arXiv:2404.15772,2024.
[25] MD ATIK A,CHENG Q.Timemachine: A timeseries is worth 4 mambas for long-term forecasting[J].arXiv:2403.09898,2024.
[26] WANG Z H,KONG F H,FENG S,et al.Is mamba effectivefortime series forecasting?[J].arXiv:2403.11144,2024.
[1] LI Xiuying, CHEN Xuesong, LI Haoze, LIAO Hongwei, HAN Jiameng, DUAN Xiaoyi. MambaCS:Mamba-based Image Compressed Sensing Algorithm [J]. Computer Science, 2026, 53(6): 232-241.
[2] LIU Yichen, LIN Yan, ZHOU Zeyu, GUO Shengnan, LIN Youfang, WAN Huaiyu. Efficient Semantic-aware Trajectory Representation Learning Method via State Space Model [J]. Computer Science, 2026, 53(4): 134-142.
[3] DING Zhengze, NIE Rencan, LI Jintao, SU Huaping, XU Hang. MTFuse:An Infrared and Visible Image Fusion Network Based on Mamba and Transformer [J]. Computer Science, 2025, 52(8): 188-194.
[4] HONG Yi, SHEN Shikai, SHE Yumei, YANG Bin, DAI Fei, WANG Jianxiao, ZHANG Liyi. Multivariate Time Series Prediction Based on Dynamic Graph Learning and Attention Mechanism [J]. Computer Science, 2025, 52(6A): 240700047-8.
[5] LI Ao, BAI Xueru, JIANG Jiali, QIAO Ye. Group Cross Adversarial Application in Stock Price Prediction [J]. Computer Science, 2025, 52(10): 22-32.
[6] LIN Binwei, YU Zhiyong, HUANG Fangwan, GUO Xianwei. Data Completion and Prediction of Street Parking Spaces Based on Transformer [J]. Computer Science, 2024, 51(4): 165-173.
[7] HENG Hongjun, ZHOU Wenhua. Anomaly Detection Method Based on Context Information Fusion and Noise Adaptation [J]. Computer Science, 2023, 50(7): 237-245.
[8] PAN Liqun, WU Zhonghua, HONG Biao. Prediction Method of Long Series Time Series Based on Improved Informer Model with Kernel Technique [J]. Computer Science, 2023, 50(11A): 221100186-6.
[9] LIU Bao-bao, YANG Jing-jing, TAO Lu, WANG He-ying. Study on Prediction of Educational Statistical Data Based on DE-LSTM Model [J]. Computer Science, 2022, 49(6A): 261-266.
[10] CAI Xin-yu, FENG Xiang, YU Hui-qun. Adaptive Weight Based Broad Learning Algorithm for Cascaded Enhanced Nodes [J]. Computer Science, 2022, 49(6): 134-141.
[11] WANG Xue-guang, ZHANG Ai-xin, DOU Bing-lin. Non-linear Load Capacity Model of Complex Networks [J]. Computer Science, 2021, 48(6): 282-287.
[12] WANG Xiao-di, LIU Xin, YU Xiao. Adaptive Frequency Domain Model for Multivariate Time Series Forecasting [J]. Computer Science, 2021, 48(11A): 204-210.
[13] DONG Xin-yue, FAN Rui-dong, HOU Chen-ping. Active Label Distribution Learning Based on Marginal Probability Distribution Matching [J]. Computer Science, 2020, 47(9): 190-197.
[14] YANG Hai-min, PAN Zhi-song, BAI Wei. Review of Time Series Prediction Methods [J]. Computer Science, 2019, 46(1): 21-28.
[15] TANG Ying, SUN Kang-gao, QIN Xu-jia, ZHOU Jian-mei. Local Model Weighted Ensemble for Top-N Movie Recommendation [J]. Computer Science, 2018, 45(11A): 439-444.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!