Computer Science ›› 2021, Vol. 48 ›› Issue (2): 128-133.doi: 10.11896/jsjkx.191200152

• Database & Big Data & Data Science • Previous Articles     Next Articles

Power Load Data Completion Based on Sparse Representation

LI Pei-guan1, YU Zhi-yong1,2, HUANG Fang-wan1,2   

  1. 1 College of Mathematics and Computer Science,Fuzhou University,Fuzhou 350116,China
    2 Fujian Provincial Key Laboratory of Network Computing and Intelligent Information Processing,Fuzhou University,Fuzhou 350116,China
  • Received:2019-12-25 Revised:2020-04-16 Online:2021-02-15 Published:2021-02-04
  • About author:LI Pei-guan,born in 1996,B.S.His main research interests include machine learning and so on.
    HUANG Fang-wan,born in 1980,M.S.,senior lecturer,is a member of China Computer Federation.Her main research interests include computational intelligence,machine learning and big data analysis.
  • Supported by:
    The National Natural Science Foundation of China(61772136)and Research Project for Young and Middle-aged Teachers of Fujian Province (JT180045).

Abstract: Data loss often occurs in the process of power load data collection,which adversely affects the accuracy of algorithm prediction.The existing missing data completion algorithm is only suitable for the case with less missing data,but performs poorly for the case with more missing data.Faced with the challenge of severe data loss,a method for power load missing data completion based on sparse representation is proposed.First of all,we assume that the data is randomly missing,and stitch the assumed missing data in the training data and the complete training data to form a training matrix.Secondly,an over-complete dictionary is generated by discrete cosine transform (DCT),and is learned according to the training matrix,aims to obtain a suitable dictionary for the best sparse representations of the samples in the training matrix.Finally,in the test phase,the upper part of the learned dictionary is used to obtain sparse representations of the missing data in the test set,and then the sparse representations and the lower part of the learned dictionary are used to reconstruct the complete data without missing.Experimental results show that using this method to complete missing values of power load data can achieve higher accuracy than traditional interpolation me-thods,correlation-based KNN algorithm,spatiotemporal compressed sensing estimation algorithm and time-series compressed sen-sing prediction algorithm.Even if the data miss rate is as high as 95%,this method can still effectively complete the missing data.

Key words: Data completion, Missing data, Power load, Sparse representation

CLC Number: 

  • TP181
[1] FAN W Q,ZHANG W,LI Y G,et al.Ultra short-term loadforecasting for micro-grid based on improved human comfort index[J].Guangdong Electric Power,2017,30(4):137-142.
[2] LI H.Residual error GM(1,1) model improved by Markovmethod for long-term and medium-term load forecasting[J].Shaanxi Electric Power,2017,45(5):75-77.
[3] YANG H X,DENG Y J,LIU Z B,et al.Study on electric load forecasting with historical bad data[J].Power System Protection and Control,2017,45(15):62-68.
[4] CHEN Z H,ZHANG Y,WU Z G.Application of RBF neural network in medium and long-term load forecasting[J].Procee-dings of the CSU-EPSA,2006,18(1):15-19.
[5] LAKSHMINARAYAN K.Imputation of missing data in industrial databases[J].Applied Intelligence,1999,11:259-275.
[6] WU S F,CHANG C Y,LEE S J.Time series forecasting with missing values[C]//2015 1st International Conference on Industrial Networks and Intelligent Systems (INISCom).2015:151-156.
[7] DING Q,LI S.Research on Resampling Application of Intelli-gent Substation and Error Analysis of Linear Interpolation Method[J].Power System Protection and Control,2015,43(23):132-136.
[8] ZHU Q W,YE L,ZHAO Y N,et al.Research on Identification and Reconstruction Method of Wind Farm Output Power Abnormal Data[J].Power System Protection and Control,2015,43(3):38-45.
[9] RUAN Q Z,CHEN J B,ZHU G,et al.Instantaneous test data analysis of low voltage electrical equipment based on cubic spline interpolation[J].Low Voltage Electrical Appliance,2012(10):27-31.
[10] TAO T Y,WANG H.Simplified wind field simulation based on Hermite interpolation[J].Engineering Mechanics,2017,34(3):182-188.
[11] GERHARD T,SHAHLA R.Improved methods for the imputation of missing data by nearest neighbor methods[J].Computational Statistics and Data Analysis,2015,90:84-99.
[12] NEWSHAM G R,BIRT B J.Building-level occupancy data to improve arima-based electricity use forecasts[C]//Proceedings of the 2nd ACM Workshop on Embedded Sensing Systems for Energy-Efficiency in Building,ACM.New York,USA,2010:13-18.
[13] SHI W,ZHU Y,ZHANG J,et al.Improving power grid monitoring data quality:An efficient machine learning framework for missing data prediction[C]//2015 IEEE 17th International Conference on High Performance Computing and Communications.IEEE,2015:417-422.
[14] KONG L,XIA M,LIU X Y,et al.Data loss and reconstruction in sensor networks[C]//INFOCOM.2013:1654-1662.
[15] ZHU Y,LI Z,ZHU H,et al.A Compressive Sensing Approach to Urban Traffic Estimation with Probe Vehicles[J].IEEE Transactions on Mobile Computing,2013,12(11):2289-2302.
[16] SONG X X,GUO Y,LI N,et al.Missing Data Prediction Based on Compressive Sensing in Time Series[J].Computer Science,2019,46(6):35-40.
[17] ZHANG Y,ROUGHAN M,WILINGER W,et al.Spatio-tem-poral compressive sensing and internet traffic matrices[J].ACM SIGCOMM Computer Communication Review,2009,39(4):267.
[18] MEI J L,YOHANN D C,YANNIG G,et al.Nonnegative matrix factorization with side information for time series recovery and prediction[J].IEEE Transactions on Knowledge and Data Engineering,2018:1.
[19] WANG Z H,HORNG G J,HSU T H,et al.Heart sound signal recovery based on time series signal prediction using a recurrent neural network in the long short-term memory model[J].The Journal of Supercomputing,2019(1):1-18.
[20] STRAUMAN A S,BIANCHI F M,MIKALSEN K Ø.Classification of postoperative surgical site infections from blood measurements with missing data using recurrent neural networks[C]//International Conference on Biomedical & Health Informatics.Las Vegas,USA,2018:307-310.
[21] ZHANG Z,XU Y,YANG J,et al.A survey of sparse representation:algorithms and applications[J].Access IEEE,2015,3:490-530.
[22] AHARON M,ELAD M,BRUCKSTEINA.K-SVD:an algo-rithm for designing overcomplete dictionaries for sparse representation[J].IEEE Transactions on Signal Processing,2006,54(11):4311-4322.
[1] TIAN Xu, CHANG Kan, HUANG Sheng, QIN Tuan-fa. Single Image Super-resolution Algorithm Using Residual Dictionary and Collaborative Representation [J]. Computer Science, 2020, 47(9): 135-141.
[2] CHENG Zhong-Jian, ZHOU Shuang-e and LI Kang. Sparse Representation Target Tracking Algorithm Based on Multi-scale Adaptive Weight [J]. Computer Science, 2020, 47(6A): 181-186.
[3] ZHANG Wang-ce, FAN Jing, WANG Bo-ru and NI Min. (α,k)-anonymized Model for Missing Data [J]. Computer Science, 2020, 47(6A): 395-399.
[4] WU Qing-hong, GAO Xiao-dong. Face Recognition in Non-ideal Environment Based on Sparse Representation and Support Vector Machine [J]. Computer Science, 2020, 47(6): 121-125.
[5] LI Xiao-yu,GAO Qing-wei,LU Yi-xiang,SUN Dong. Image Fusion Method Based on Image Energy Adjustment [J]. Computer Science, 2020, 47(1): 153-158.
[6] LI Gui-hui,LI Jin-jiang,FAN Hui. Image Denoising Algorithm Based on Adaptive Matching Pursuit [J]. Computer Science, 2020, 47(1): 176-185.
[7] ZHANG Bing, XIE Cong-hua, LIU Zhe. Multi-focus Image Fusion Based on Latent Sparse Representation and Neighborhood Information [J]. Computer Science, 2019, 46(9): 254-258.
[8] SONG Xiao-xiang,GUO Yan,LI Ning,YU Dong-ping. Missing Data Prediction Algorithm Based on Sparse Bayesian Learning in Coevolving Time Series [J]. Computer Science, 2019, 46(7): 217-223.
[9] ZHANG Fu-wang, YUAN Hui-juan. Image Super-resolution Reconstruction Algorithm with Adaptive Sparse Representationand Non-local Self-similarity [J]. Computer Science, 2019, 46(6A): 188-191.
[10] SONG Xiao-xiang, GUO Yan, LI Ning, WANG Meng. Missing Data Prediction Based on Compressive Sensing in Time Series [J]. Computer Science, 2019, 46(6): 35-40.
[11] DU Xiu-li, ZUO Si-ming, QIU Shao-ming. Adaptive Dictionary Learning Algorithm Based on Image Gray Entropy [J]. Computer Science, 2019, 46(5): 266-271.
[12] RU Feng, XU Jin, CHANG Qi, KAN Dan-hui. High Order Statistics Structured Sparse Algorithm for Image Genetic Association Analysis [J]. Computer Science, 2019, 46(4): 66-72.
[13] FAN Zhe-ning, YANG Qiu-hui, ZHAI Yu-peng, WAN Ying, WANG Shuai. Improved ROUSTIDA Algorithm for Missing Data Imputation with Key Attribute in Repetitive Data [J]. Computer Science, 2019, 46(2): 30-34.
[14] MAO Xia, WANG Lan, LI Jian-jun. Human Action Recognition Framework with RGB-D Features Fusion [J]. Computer Science, 2018, 45(8): 22-27.
[15] GAN Ling, ZHAO Fu-chao, YANG Meng. Self-adaptive Group Sparse Representation Method for Image Inpainting [J]. Computer Science, 2018, 45(8): 272-276.
Full text



No Suggested Reading articles found!