Computer Science ›› 2020, Vol. 47 ›› Issue (11A): 459-463.doi: 10.11896/jsjkx.200500128

• Big Data & Data Science • Previous Articles     Next Articles

Mining Trend Similarity of Multivariate Hydrological Time Series Based on XGBoost Algorithm

DING Wu1,3, MA Yuan2, DU Shi-lei2, LI Hai-chen3, DING Gong-bo3, WANG Chao3   

  1. 1 School of Hydropower and Information Engineering,Hua Zhong University of Science and Technology,Wuhan 430074,China
    2 Taihu Basin Authority of Ministry of Water Resources(Information Center),Shanghai 200434,China
    3 China Institute of Water Resources and Hydropower Research,Beijing 100038,China
  • Online:2020-11-15 Published:2020-11-17
  • About author:DING Wu,born in 1996,postgraduate.His main research interests include hydrological big data analysis and optimized operation of hydropower.
    WANG Chao,born in 1989,Ph.D,senior engineer.His main interests research include basin water resources scheduling and intelligent water conservancy.
  • Supported by:
    This work was supported by the Young Elite Scientists Sponsorship Program by the CAST (2019QNRC001)and Fundamental Research Funds of China Institute of Water Resources and Hydropower Research (WR0145B012020).

Abstract: In view of the shortcomings of the traditional hydrological trend prediction using neural networks and other tools,the results are not interpretable and so on.This paper proposes a method of hydrological trend prediction based on machine learning algorithms,which aims to use the XGBOOST machine learning algorithm to establish a similarity mapping model for each hydrological feature between the reference period and the hydrological prediction period,thus,the most similar sequence to the hydrological trend in the foreseeing period is matched in the historical hydrological time series,so as to achieve the purpose of hydrological trend prediction.In order to prove the efficiency and feasibility of the proposed method,it was verified with the Taihu hydrological time series data as the object.The analysis results show that the multi-variable hydrological time series trend simila-rity analysis based on machine learning can meet therequirements of dispatchers for the prediction effect of future hydrological trends.

Key words: Hydrological trend prediction, Machine learning, Multivariate time series, Similarity measure, Time series data mining

CLC Number: 

  • TV121
[1] ZHANG J Y,PAN Q,ZHANG P,et al.Time series similarity measurement method based on slope representation [J].Pattern Recognition and Artificial Intelligence,2007,20(2):271-274.
[2] DONG X L,GU C K,WANG Z G.Research on morphology-based time series similarity measurement [J].Journal of Electronics and Information Technology,2007,29 (5):1228-1231.
[3] LI H L,YANG L B.Time series similarity measurement method based on incremental dynamic time warping [J].Computer Science,2013,40(4):227-230.
[4] BAGNALL A,HILLS J,LINES J.Finding Motif Sets in Time Series[J].Bmc Public Health,2014,12(1):1-11.
[5] LI Z X,LI K W,WU H S.Similarity measure for multivariate time series based on dynamic time warping[C]//The 2016 International Conference.2016.
[6] DUCHNE F,GARBAY C,RIALLE V.Similarity measure for heterogeneous multivariate time-series[C]//Proceeding of the 12th European Signal Processing Conference.2004:7-1.
[7] SHEN J Y,HUANG W P,ZHU D Y,et al.A Novel Similarity Measure Model for Multivariate Time Series Based on LMNN and DTW[J].Neural Processing Letters,2017,45(3):925-937.
[8] LI S J,ZHU Y L,ZHANG X H,et al.Similarity analysis ofmultiple hydrological time series based on BORDA counting method [J].Journal of Hydraulic Engineering,2009,40(3):378-384.
[9] KARAMITOPOULOS L,EVANGELIDIS G,DERVOS D.PCA-based Time Series Similarity Search[C]//Data Mining.2010:255-276.
[10] VAN DER MAATEN L.Accelerating t-SNE using tree-basedalgorithms[J].The Journal of Machine Learning Research,2014,15(1):3221-3245.
[11] TANG J,LIU J Z,ZHANG M,et al.Visualizing Large-scale and High-dimensional Data[C]//International Conference on World Wide Web.2016:287-297.
[12] CHEN T,GUESTRIN C.XGBoost:A Scalable Tree Boosting System[C]//Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.2016.
[13] MURATA N,YOSHIZAWA S,AMARI S.Learning curves,model selection and complexity of neural networks[C]//Proceedings of the 1992 Conference.San Mateo.CA.Morgan Kaufmann,1993: 607-614.
[1] LENG Dian-dian, DU Peng, CHEN Jian-ting, XIANG Yang. Automated Container Terminal Oriented Travel Time Estimation of AGV [J]. Computer Science, 2022, 49(9): 208-214.
[2] NING Han-yang, MA Miao, YANG Bo, LIU Shi-chang. Research Progress and Analysis on Intelligent Cryptology [J]. Computer Science, 2022, 49(9): 288-296.
[3] HE Qiang, YIN Zhen-yu, HUANG Min, WANG Xing-wei, WANG Yuan-tian, CUI Shuo, ZHAO Yong. Survey of Influence Analysis of Evolutionary Network Based on Big Data [J]. Computer Science, 2022, 49(8): 1-11.
[4] LI Yao, LI Tao, LI Qi-fan, LIANG Jia-rui, Ibegbu Nnamdi JULIAN, CHEN Jun-jie, GUO Hao. Construction and Multi-feature Fusion Classification Research Based on Multi-scale Sparse Brain Functional Hyper-network [J]. Computer Science, 2022, 49(8): 257-266.
[5] ZHANG Guang-hua, GAO Tian-jiao, CHEN Zhen-guo, YU Nai-wen. Study on Malware Classification Based on N-Gram Static Analysis Technology [J]. Computer Science, 2022, 49(8): 336-343.
[6] CHEN Ming-xin, ZHANG Jun-bo, LI Tian-rui. Survey on Attacks and Defenses in Federated Learning [J]. Computer Science, 2022, 49(7): 310-323.
[7] XIAO Zhi-hong, HAN Ye-tong, ZOU Yong-pan. Study on Activity Recognition Based on Multi-source Data and Logical Reasoning [J]. Computer Science, 2022, 49(6A): 397-406.
[8] YAO Ye, ZHU Yi-an, QIAN Liang, JIA Yao, ZHANG Li-xiang, LIU Rui-liang. Android Malware Detection Method Based on Heterogeneous Model Fusion [J]. Computer Science, 2022, 49(6A): 508-515.
[9] LI Ya-ru, ZHANG Yu-lai, WANG Jia-chen. Survey on Bayesian Optimization Methods for Hyper-parameter Tuning [J]. Computer Science, 2022, 49(6A): 86-92.
[10] ZHAO Lu, YUAN Li-ming, HAO Kun. Review of Multi-instance Learning Algorithms [J]. Computer Science, 2022, 49(6A): 93-99.
[11] WANG Fei, HUANG Tao, YANG Ye. Study on Machine Learning Algorithms for Life Prediction of IGBT Devices Based on Stacking Multi-model Fusion [J]. Computer Science, 2022, 49(6A): 784-789.
[12] XU Jie, ZHU Yu-kun, XING Chun-xiao. Application of Machine Learning in Financial Asset Pricing:A Review [J]. Computer Science, 2022, 49(6): 276-286.
[13] LI Ye, CHEN Song-can. Physics-informed Neural Networks:Recent Advances and Prospects [J]. Computer Science, 2022, 49(4): 254-262.
[14] YAO Xiao-ming, DING Shi-chang, ZHAO Tao, HUANG Hong, LUO Jar-der, FU Xiao-ming. Big Data-driven Based Socioeconomic Status Analysis:A Survey [J]. Computer Science, 2022, 49(4): 80-87.
[15] SHEN Shao-peng, MA Hong-jiang, ZHANG Zhi-heng, ZHOU Xiang-bing, ZHU Chun-man, WEN Zuo-cheng. Three-way Drift Detection for State Transition Pattern on Multivariate Time Series [J]. Computer Science, 2022, 49(4): 144-151.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!