Computer Science ›› 2023, Vol. 50 ›› Issue (12): 97-103.doi: 10.11896/jsjkx.221100112

• Database & Big Data & Data Science • Previous Articles     Next Articles

Transformer Feature Fusion Network for Time Series Classification

DUAN Mengmeng1, JIN Cheng2   

  1. 1 School of Software,Fudan University,Shanghai 200438,China
    2 School of Computer Science,Fudan University,Shanghai 200438,China
  • Received:2022-11-11 Revised:2023-03-30 Online:2023-12-15 Published:2023-12-07
  • About author:DUAN Mengmeng,born in 1998,postgraduate.Her main research interest is time series classification and prediction.
    JIN Cheng,born in 1978,Ph.D,professor,Ph.D supervisor,is a member of China Computer Federation.His main research interests include computer vision and multimedia information retrieval.
  • Supported by:
    Shanghai Municipal Science and Technology Commission(22dz1204900).

Abstract: Model ensemble methods train multiple basic models and use a certain rule to aggregate the output of the basic models for time series classification.However,they mainly focus on two aspects.The first one is which model is chose as the basic mo-del.And the Second one is how to increase the difference and the diversity of the basic models.They all ignore the exploration of aggregation rules.Aiming at this problem,Transformer feature fusion network for time series classification(TFFN) is proposed.TFFN have two key components,dual Transformer encoder decoder(Dual TED) and Transformer encoder head(TEH).Dual TED leverage attention module to fuse the basic feature into more discriminative fusion features.Transformer encoder head,a sample-distribution-aware classifier,is adopted to classify time series more accurately.Experiments show that TFFN achieves state-of-the-art results on multiple mainstream time series classification datasets.

Key words: Time series classification, Model ensemble, Transformer, Feature fusion, Deep learning

CLC Number: 

  • TP391
[1]BAJWA M N,KHURRAM S,MUNIR M,et al.Confident classification using a hybrid between deterministic and probabilistic convolutional neural networks[J].IEEE Access,2020,8:115476-115485.
[2]LIU M H,ZENG A L,LAI Q X,et al.T-WaveNet:A Tree-Structured Wavelet Neural Network for Time Series Signal Analysis[C]//International Conference on Learning Representations(ICLR).2021.
[3]XIAO Z,XU X,XING H,et al.RNTS:Robust Neural Temporal Search for Time Series Classification[C]//International Joint Conference on Neural Networks(IJCNN).2021:1-8.
[4]YAN W,LI G,WU Z,et al.Extracting diverse-shapelets for early classification on time series[J].World Wide Web,2020,23(6):3055-3081.
[5]LI H,JIA R,WAN X.Time series classification based on complex network[J].Expert Systems with Applications,2022,194:116502.
[6]YAGHOUBI V,CHENG L,VAN PAEPEGEM W,et al.An ensemble classifier for vibration-based quality monitoring[J].Mechanical Systems and Signal Processing,2022,165:108341.
[7]MELIN P,MONICA J C,SANCHEZ D,et al.Multiple ensemble neural network models with fuzzy response aggregation for predicting COVID-19 time series:the case of Mexico[J].Healthcare,2020,8(2):181.
[8]TSAI Y H H,BAI S,LIANG P P,et al.Multimodal transformer for unaligned multimodal language sequences[C]//Proceedings of the Conference.Association for Computational Linguistics.Meeting.NIH Public Access,2019:6558.
[9]YU Z,YU J,CUI Y,et al.Deep modular co-attention networks for visual question answering[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:6281-6290.
[10]SCHÄFER P,LESER U.Multivariate Time Series Classification with WEASEL+MUSE[EB/OL].(2017-11)[2022-11].https://arxiv.org/abs/1711.11343.
[11]FAWAZ H I,LUCAS B,FORESTIER G,et al.InceptionTime:Finding AlexNet for time series classification[J].Data Mining and Knowledge Discovery,2020,34(6):1936-1962.
[12]TANG W S,LONG G D,LIU L,et al.Omni-Scale CNNs:A Simple and Effective Kernel Size Configuration for Time Series Classification[C]//International Conference on Learning Representations(ICLR).2022.
[13]ZHANG X,GAO Y,LIN J,et al.TapNet:Multivariate Time Series Classification with Attentional Prototypical Network[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2020:6845-6852.
[14]DEMPSTER A,PETITJEAN F,WEBB G I.ROCKET:exceptionally fast and accurate time series classification using random convolutional kernels[J].Data Mining and Knowledge Discove-ry,2020,34(5):1454-1495.
[15]DEMPSTER A,SCHMIDT D F,WEBB G I.MiniRocket:A Very Fast(Almost) Deterministic Transform for Time Series Classification[C]//The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining(KDD' 21).ACM,2021:248-257.
[16]ZERVEAS G,JAYARAMAN S,PATEL D,et al.A Transfor-mer-based Framework for Multivariate Time Series Representation Learning[C]//The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining(KDD'21).ACM,2021:2114-2124.
[17]WORTSMAN M,ILHARCO G,GADRE S Y,et al.Modelsoups:averaging weights of multiple fine-tuned models improves accuracy without increasing inference time[C]//International Conference on Machine Learning.PMLR,2022:23965-23998.
[18]LINES J,TAYLOR S,BAGNALL A.Hive-cote:The hierarchical vote collective of transformation-based ensembles for time series classification[C]//2016 IEEE 16th International Confe-rence on Data Mining(ICDM).IEEE,2016:1041-1046.
[19]SHIFAZ A,PELLETIER C,PETITJEAN F,et al.TS-CHIEF:a scalable and accurate forest algorithm for time series classification[J].Data Mining and Knowledge Discovery,2020,34(3):742-775.
[20]MIDDLEHURST M,LARGE J,FLYNN M,et al.HIVE-COTE 2.0:a new meta ensemble for time series classification[J].Machine Learning,2021,110(11):3211-3243.
[21]LI X,JIANG H,NIU M,et al.An enhanced selective ensemble deep learning method for rolling bearing fault diagnosis with beetle antennae search algorithm[J].Mechanical Systems and Signal Processing,2020,142:106752.
[22]ZHOU Y,CHENG G,JIANG S,et al.Building an efficient intrusion detection system based on feature selection and ensemble classifier[J].Computer Networks,2020,174:107247.
[23]ISHAQ M,KWON S.Short-term energy forecasting framework using an ensemble deep learning approach[J].IEEE Access,2021,9:94262-94271.
[24]XIA T,SONG Y,ZHENG Y,et al.An ensemble frameworkbased on convolutional bi-directional LSTM with multiple time windows for remaining useful life estimation[J].Computers in Industry,2020,115:103182.
[25]DOSOVITSKIY A,BEYER L,KOLESNIKOV A,et al.Animage is worth 16x16 words:Transformers for image recognition at scale[C]//International Conference on Learning Representations(ICLR).2021.
[26]DAU H A,BAGNALL A,KAMGAR K,et al.The UCR Time Series Archive[J].IEEE/CAA Journal of Automatica Sinica,2019,6(6):6-18.
[27]BAGNALL A,DAU H A,LINES J,et al.The UEA multiva-riate time series classification archive,2018[A/OL].[2018-10].https://arxiv.org/abs/1811.00075.
[28]CHEN Y P,EAMONN K,HU B,et al.The ucr time series classification archive[DS/OL].[2015-07].http://www.cs.ucr.edu/~eamonn/time_seires_data/.
[29]LUCAS B,SHIFAZ A,CHARLOTTE P,et al.Proximity fo-rest:an effective and scalable distance-based classifier for time series[J].Data Mining and Knowledge Discovery,2019,33(3):607-635.
[30]BENAVOLI A,CORANI G,MANGILI F.Should we really use post-hoc tests based on mean-ranks?[J].The Journal of Machine Learning Research,2016,17(1):152-161.
[1] ZHAO Mingmin, YANG Qiuhui, HONG Mei, CAI Chuang. Smart Contract Fuzzing Based on Deep Learning and Information Feedback [J]. Computer Science, 2023, 50(9): 117-122.
[2] LI Haiming, ZHU Zhiheng, LIU Lei, GUO Chenkai. Multi-task Graph-embedding Deep Prediction Model for Mobile App Rating Recommendation [J]. Computer Science, 2023, 50(9): 160-167.
[3] HUANG Hanqiang, XING Yunbing, SHEN Jianfei, FAN Feiyi. Sign Language Animation Splicing Model Based on LpTransformer Network [J]. Computer Science, 2023, 50(9): 184-191.
[4] CHEN Guojun, YUE Xueyan, ZHU Yanning, FU Yunpeng. Study on Building Extraction Algorithm of Remote Sensing Image Based on Multi-scale Feature Fusion [J]. Computer Science, 2023, 50(9): 202-209.
[5] ZHU Ye, HAO Yingguang, WANG Hongyu. Deep Learning Based Salient Object Detection in Infrared Video [J]. Computer Science, 2023, 50(9): 227-234.
[6] WANG Yu, WANG Zuchao, PAN Rui. Survey of DGA Domain Name Detection Based on Character Feature [J]. Computer Science, 2023, 50(8): 251-259.
[7] ZHOU Fengfan, LING Hefei, ZHANG Jinyuan, XIA Ziwei, SHI Yuxuan, LI Ping. Facial Physical Adversarial Example Performance Prediction Algorithm Based on Multi-modal Feature Fusion [J]. Computer Science, 2023, 50(8): 280-285.
[8] ZHANG Yian, YANG Ying, REN Gang, WANG Gang. Study on Multimodal Online Reviews Helpfulness Prediction Based on Attention Mechanism [J]. Computer Science, 2023, 50(8): 37-44.
[9] SONG Xinyang, YAN Zhiyuan, SUN Muyi, DAI Linlin, LI Qi, SUN Zhenan. Review of Talking Face Generation [J]. Computer Science, 2023, 50(8): 68-78.
[10] WANG Xu, WU Yanxia, ZHANG Xue, HONG Ruize, LI Guangsheng. Survey of Rotating Object Detection Research in Computer Vision [J]. Computer Science, 2023, 50(8): 79-92.
[11] ZHOU Ziyi, XIONG Hailing. Image Captioning Optimization Strategy Based on Deep Learning [J]. Computer Science, 2023, 50(8): 99-110.
[12] TENG Sihang, WANG Lie, LI Ya. Non-autoregressive Transformer Chinese Speech Recognition Incorporating Pronunciation- Character Representation Conversion [J]. Computer Science, 2023, 50(8): 111-117.
[13] ZHANG Xiao, DONG Hongbin. Lightweight Multi-view Stereo Integrating Coarse Cost Volume and Bilateral Grid [J]. Computer Science, 2023, 50(8): 125-132.
[14] LI Kun, GUO Wei, ZHANG Fan, DU Jiayu, YANG Meiyue. Adversarial Malware Generation Method Based on Genetic Algorithm [J]. Computer Science, 2023, 50(7): 325-331.
[15] WANG Mingxia, XIONG Yun. Disease Diagnosis Prediction Algorithm Based on Contrastive Learning [J]. Computer Science, 2023, 50(7): 46-52.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!