Computer Science ›› 2026, Vol. 53 ›› Issue (6A): 250700136-8.doi: 10.11896/jsjkx.250700136

• Computer Software & Architecture • Previous Articles     Next Articles

ALHC:Floating-point Time Series Adaptive Lossless Compression Tool Based on HybridPrediction Architecture

ZHANG Xulin, WANG Lei, NIU Mengjin, LIANG Junda   

  1. School of Cyberspace Security,Zhongyuan University of Technology,Zhengzhou 450007,China
  • Online:2026-06-16 Published:2026-06-12
  • About author:ZHANG Xulin,born in 1999,postgra-duate.His main research intersts include high performance computing,and so on.
    WANG Lei,born in 1977,professor,is a member of CCF(No.12516M).His main research interests include research and development of high performance computing and domestic independent and controlable basic software.
  • Supported by:
    Research and Development of Basic Mathematical Library Software Based on High-Performance Computing(22001742).

Abstract: Against the backdrop of explosive growth in floating-point time series data within domains such as industrial IoT and financial technology,significant challenges have emerged for data storage and transmission,rendering floating-point time series data compression of paramount importance.Typically,time-series data suffers from encoding redundancy caused by outlier contamination,disruption of temporal continuity due to missing values,and failure of prediction models triggered by unstructured time series.Moreover,single prediction models exhibit marked performance disparities when dealing with linear short-cycle scenarios versus sparse and complex ones.To address these issues,this paper proposes the ALHC algorithm.Aiming to meet the requirements of floating-point compression concerning data accuracy,continuity,distribution characteristics,and residual calculation,a targeted data cleaning process is designed.A hybrid architecture is constructed,dynamically switching between a TCN-LSTM neural network predictor and an adaptive linear predictor based on the LMS algorithm,enabling adaptation to different scenarios.The adaptive predictor is employed as a post-processing module for residual correction,enhancing prediction accuracy,reducing residuals,and improving entropy coding efficiency.Experimental evaluations of the proposed compression algorithm on 14 public time series datasets demonstrate that under lossless conditions,the average compression ratio reaches 0.24.

Key words: Floating-point time series, Lossless compression, Preprocessing link, TCN-LSTM, Hybrid prediction architecture, Adaptive linear predictor

CLC Number: 

  • TP391
[1] CHEN H,LIU L,MENG J,et al.AFC:An adaptive lossless floating-point compression algorithm in time series database[J].Information Sciences,2024,654:119847.
[2] KNORR F,THOMAN P,FAHRINGER T.DNZIP:A High-Throughput Parallel Lossless Compressor for Scientific Data[C]//2021 Data Compression Conference(DCC).2021:103-112.
[3] ZHU M H,LI Z,LI R Y,et al Survey on Floating-point Time Series Data Compression[J].Journal of Software,2025,36(6):2844-2874.
[4] HE K J.High Performance Lossless Compression of Floating-Point Data for Scientific Computing[J].Chinese Journal of Computers,2010,33(6):966-976.
[5] PELKONEN T,FRANKLIN S,TELLER J,et al.Gorilla:Afast,scalable,in-memory time series database[C]//Proceedings of the VLDB Endowment.2015:1816-1827.
[6] LIAKOS P,PAPAKONSTANTINOPOULOU K,KOTIDIS Y.Chimp:Efficient lossless floating point compression for time series databases[C]//Proceedings of the VLDB Endowment.2022:3058-3070.
[7] LI R Y,LI Z,WU Y,et al.Elf:Erasing-Based Lossless Floating-Point Compression[C]//Proceedings of the VLDB Endowment.2023:1763-1776.
[8] CHIAROT G,SILVESTRI C.Time series compression survey[J].ACM Computing Surveys,2023,55(10):1-32.
[9] CHANDAK S,TATWAWADI K,WEN C,et al.LFZip:Lossy Compression of Multivariate Floating-Point Time Series Data via Improved Prediction[C]//2020 Data Compression Conference(DCC).2020:342-351.
[10] BURTSCHER M,RATANAWORABHAN P.FPC:A high-speed compressor for double-precision floating-point data[J].IEEE Transactions on Computers,2008,58(1):18-31.
[11] YU X,PENG Y,LI F,et al.Two-Level Data Compression using Machine Learning in Time Series Database[C]//2020 IEEE 36th International Conference on Data Engineering(ICDE).2020:1333-1344.
[12] BRUNO A,NARDINI F M,PIBIRI G E,et al.TSXor:A Simple Time Series Compression Algorithm[C]//String Processing and Information Retrieval(SPIRE 2021).Cham:Springer,2021.
[13] FAN J,ZHANG K,HUANG Y,et al.Parallel spatio-temporal attention-based TCN for multivariate time series prediction[J].Neural Computing and Applications,2023,35:13109-13118.
[14] BAI J,ZHU W,LIU S,et al.A Temporal Convolutional Network-Bidirectional Long Short-Term Memory(TCN-BiLSTM) Prediction Model for Temporal Faults in Industrial Equipment[J].Applied Sciences,2025,15(4):1702.
[15] ZHOU H F,SUN K W.Research on Multi-timeslot Industrial Time Series Data Prediction Method Based on Improved LSTM[J].Process Automation Instrumentation,2025,46(4):86-91.
[16] YU Y,SI X S,HU C H,J,et al.A Review of Recurrent Neural Networks:LSTM Cells and Network Architectures[J].Neural Comput,2019,31(7):1235-1270.
[17] CHANDAK S,TATWAWADI K,WEN C,et al.LFZip:Lossy Compression of Multivariate Floating-Point Time Series Data via Improved Prediction[C]//2020 Data Compression Confe-rence(DCC).2020:342-351.
[18] HUFFMAN D A.A method for the construction of minimum-redundancy codes[C]//Proceedings of the IRE.1952:1098-1101.
[19] Daily temperature of major cities[EB/OL].www.kaggle.com/sudalairajkumar/daily-temperature-of-major-cities./
[20] Financial data set used in infore project[EB/OL].https://zenodo.org/record/3886895.
[21] National Ecological Observatory Network(NEON):Ir biological temperature[EB/OL].https://data.neonscience.org/data-products/DP1.00005.001/RELEASE-2022.
[22] Historical weather data download[EB/OL].https://www.me-teoblue.com/en/weather/archive/export/basel_switzerland.
[23] National Ecological Observatory Network(NEON):Relative humidity above wateron-buoy(dp1.20271.001)[EB/OL].https://data.neonscience.org/data-products/DP1.20271.001/RELEASE-2022.
[24] National Ecological Observatory Network(NEON):Barometric pressure(dp1.00004.001)[EB/OL].https://data.neonscience.org/data-products/DP1.00004.001/RELEASE-2022.
[25] National Ecological Observatory Network(NEON):Dust andparticulate size dis-tribution(dp1.00017.001)[EB/OL].https://data.neonscience.org/data-products/DP1.00017.001/RELEASE-2022.
[26] National Ecological Observatory Network(NEON):2d windspeed and di-rection(dp1.00001.001)[EB/OL].https://data.neonscience.org/data-products/DP1.00001.001/RELEASE-2022.
[27] Influxdb 2.0 sample data[EB/OL].https://github.com/influxdata/influxdb2-sample-data.
[28] The.xz file format[EB/OL].https://tukaani.org/xz/xz-file-format.txt.
[29] ALAKUIJALA J,FARRUGGIA A,FERRAGINA P,et al.Brotli:A general-purpose data compressor[C]//ACM Transactions on Information Systems(TOIS).2018:1-30.
[30] COLLET Y.Lz4:Extremely fast compression algorithm[EB/OL].https://github.com/lz4/lz4.
[31] COLLET Y.Zstdgithub repository from facebook[EB/OL].https://github.com/facebook/zstd.
[32] GUNDERSON S H.Snappy:A fast compressor/decompressor[EB/OL].http://google.github.io/snappy/.
[1] LI Qing and LI Dong-hui. Lossless Color Image Compression Method Based on Fuzzy Logic [J]. Computer Science, 2014, 41(Z11): 103-106.
[2] LI Heng-jian,WANG Lian-hai and ZHANG Jia-shu. Secure Palmprint Image Coding Algorithm for JPEG-LS [J]. Computer Science, 2013, 40(9): 141-146.
[3] ZHANG Wei,DAI Ming,YIN Chuan-li,FENG Yu-ping. New Compression Approach to Hyper-spectral Images Based on Second Order Difference Predictive [J]. Computer Science, 2010, 37(5): 240-242246.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!