Computer Science ›› 2023, Vol. 50 ›› Issue (7): 82-88.doi: 10.11896/jsjkx.220600209

• Database & Big Data & Data Science • Previous Articles     Next Articles

Method for Correlation Data Imputation Based on Compressed Sensing

REN Bing, GUO Yan, LI Ning, LIU Cuntao   

  1. College of Communication Engineering,Army Engineering University of PLA,Nanjing 210007,China
  • Received:2022-06-23 Revised:2022-07-20 Online:2023-07-15 Published:2023-07-05
  • About author:REN Bing,born in 1993,postgraduate.His main research interests include compressed sensing and big data.GUO Yan,born in 1971,Ph.D,professor.Her main research interests include unmanned intelligent system,compressed sensing and localization.
  • Supported by:
    National Natural Science Foundation of China(61871400) and Natural Science Foundation of Jiangsu Province,China(BK20211227).

Abstract: The phenomenon of missing data occurs frequently during the acquisition and transfer of data,and improper handling of missing data sets can adversely affect subsequent data mining efforts.In order to fill the missing data set more effectively,a method for data imputation based on compressed sensing is proposed for correlation data.First,the problem of missing data imputation is transformed into a sparse vector recovery problem under the compressed sensing framework.Second,a specialized sparse representation base is constructed for correlation data,so the data sparsity can be better realized.Finally,the fast iterative weighted thresholding algorithm(FIWTA) is proposed,which is refined based on the fast iterative shrinkage-thresholding algorithm (FISTA).The proposed algorithm adopts a new iterative weighted method and introduces a restart strategy,which greatly improves the convergence of the algorithm and the reconstruction accuracy of the data.Simulation results show that the proposed algorithm is able to fill the missing data efficiently,and both the reconstruction success rate and the reconstruction speed are improved compared with the traditional fast iterative shrinkage-thresholding algorithm.Meanwhile,even when the sparse transformation of the data is less effective,imputation of missing data sets can still be accomplished with better robustness.

Key words: Compressed sensing, Data imputation, Correlation data, Orthonormal eigenvector basis, Iterative weighted thresholding algorithm

CLC Number: 

  • TN911.7
[1]QIU X G,CHEN B,ZHANG P.Emergency Management Oriented Artificial Society Construction and Computational Experiments[M].Beijing:Science Press,2017:32-59.
[2]HE M.Introduction to big data-big data thinking and innovative applications[M].Beijing:Publishing House of Electronics Industry,2019:2-10.
[3]LIN W C,TSAI C F.Missing value imputation:a review andanalysis of the literature(2006-2017)[J].Artificial Intelligence Review,2020,53(2):1487-1509.
[4]HUANG G L.Missing data filling method based on linear interpolation and lightgbm[J].Journal of Physics:Conference Series,2021,1754(1):012187.
[5]SANJAR K,BEKHZOD O,KIM J,et al.Missing Data Imputation for Geolocation-based Price Prediction Using KNN-MCF Method[J].ISPRS International Journal of Geo-Information,2020,9(4):227.
[6]PRIETO-CUBIDES J,ARGOTY C.Dealing with Missing Data using a Selection Algorithm on Rough Sets[J].International Journal of Computational Intelligence Systems,2018,11(1):1307-1321.
[7]XIAO J Y,BULUT O.Evaluating the Performances of Missing Data Handling Methods in Ability Estimation From Sparse Data[J].Educational and Psychological Measurement,2020,80(5):932-954.
[8]KHALDY M A,KAMBHAMPATI C.Performance Analysis of Various Missing Value Imputation Methods on Heart Failure Dataset[C]//IntelliSys.Proceedings of SAI Intelligent Systems Conference.Berlin:Springer,2016:415-425.
[9]SAROJ A J,GUIN A,HUNTER M.Deep LSTM RecurrentNeural Networks for Arterial Traffic Volume Data Imputation[J].Journal of Big Data Analytics in Transportation,2021,3(2):95-108.
[10]CHEN X,SUN L.Bayesian Temporal Factorization for Multi-dimensional Time Series Prediction[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2021,44(9):4659-4673.
[11]LU W Q,ZHOU T,LI L H,et al.An improved tucker decomposition-based imputation method for recovering lane-level missing values in traffic data[J].IET Intelligent Transport Systems,2022,16(3):363-379.
[12]BECK A,TEBOULLE M.A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems[J].Siam J Imaging Sciences,2009,2(1):183-202.
[13]PAN S,YAN K,LAN H,et al.Adaptive step-size fast iterative shrinkage-thresholding algorithm and sparse-spike deconvolution[J].Computers & Geosciences,2020,134:104343.
[14]CALATRONI L,CHAMBOLLE A.Backtracking strategies for accelerated descent methods with smooth composite objectives[J].SIAM Journal on Optimization,2017,29(3):1-25.
[15]ZHU T.Accelerating monotone fast iterative shrinkage-thres-holding algorithm with sequential subspace optimization for sparse recovery[J].Signal Image and Video Processing,2020,14(1):1-10.
[16]KIM D,PARK D.Element-Wise Adaptive Thresholds forLearned Iterative Shrinkage Thresholding Algorithms[J].IEEE Access,2020,4:45874-45886.
[17]TONG C,TENG Y,YAO Y,et al.Eigenvalue-free iterativeshrinkage-thresholding algorithm for solving the linear inverse problems[J].Inverse Problems,2021,37(6):5867-5877.
[18]CANDES E J,TAO T.Decoding by linear programming[J].IEEE Transactions Information Theory,2005,51(12):4203-4215.
[19]WU X,XIONG Y,YANG P,et al.Sparsest Random Scheduling for Compressive Data Gathering in Wireless Sensor Networks[J].IEEE Transactions on Wireless Communications,2014,13(10):5867-5877.
[20]QUER G,MASIERO R,MUNARETTO D,et al.On the interplay between routing and signal representation for Compressive Sensing in wireless sensor networks[C]//Information Theory &Applications Workshop.2009:206-215.
[21]ELAD M.Optimized projections for compressed sensing[J].IEEE Transactions on Signal Processing,2007,55(12):5695-5702.
[22]ZHU W X,HUANG Z L,CHEN J L,et al.Iteratively weighted thresholding homotopy method for the sparse solution of underdetermined linear equations[J].Science China Mathematics,2021,64(3):639-664.
[23]LI J J,JIANG Y,QIU T,et al.The Estimation Algorithm ofOFDM Sparse Channel Based on Compressed Sensing[J].Journal of Chongqing University of Technology (Natural Science),2021,35(4):117-122.
[24]DONOGHUE B,CANDES E.Adaptive restart for acceleratedgradient schemes[J].Foundations of Computational Mathema-tics,2015,15(3):715-732.
[25]YANG L,LI H,LI P,et al.Sparse Representation for SARGround Moving Target Imaging Based on Greedy FISTA[J].Journal of Signal Processing,2020,35(11):1844-1852.
[1] PAN Tao, TONG Xiaojun, ZHANG Miao, WANG Zhu. Image Compression and Encryption Based on Compressive Sensing and Hyperchaotic System [J]. Computer Science, 2023, 50(6A): 220200121-6.
[2] WANG Zhenbiao, QIN Yali, WANG Rongfang, ZHENG Huan. Image Compressed Sensing Attention Neural Network Based on Residual Feature Aggregation [J]. Computer Science, 2023, 50(4): 117-124.
[3] PAN Ze-min, QIN Ya-li, ZHENG Huan, WANG Rong-fang, REN Hong-liang. Block-based Compressed Sensing of Image Reconstruction Based on Deep Neural Network [J]. Computer Science, 2022, 49(11A): 210900118-9.
[4] LIU Yu-hong,LIU Shu-ying,FU Fu-xiang. Optimization of Compressed Sensing Reconstruction Algorithms Based on Convolutional Neural Network [J]. Computer Science, 2020, 47(3): 143-148.
[5] WU Xue-lin, ZHU Rong, GUO Ying. Ghost Imaging Reconstruction Algorithm Based on Block Sparse Bayesian Model [J]. Computer Science, 2020, 47(11A): 188-191.
[6] HOU Ming-xing,QI Hui,HUANG Bin-ke. Data Abnormality Processing in Wireless Sensor Networks Based on Distributed Compressed Sensing [J]. Computer Science, 2020, 47(1): 276-280.
[7] LI Xiu-qin, WANG Tian-jing, BAI Guang-wei, SHEN Hang. Two-phase Multi-target Localization Algorithm Based on Compressed Sensing [J]. Computer Science, 2019, 46(5): 50-56.
[8] FAN Zhe-ning, YANG Qiu-hui, ZHAI Yu-peng, WAN Ying, WANG Shuai. Improved ROUSTIDA Algorithm for Missing Data Imputation with Key Attribute in Repetitive Data [J]. Computer Science, 2019, 46(2): 30-34.
[9] WANG Peng-fei, ZHANG Hang. Sub-sampling Signal Reconstruction Based on Principal Component Under Underdetermined Conditions [J]. Computer Science, 2019, 46(10): 103-108.
[10] DU Xiu-li, HU Xing, CHEN Bo, QIU Shao-ming. Multi-hypothesis Reconstruction Algorithm of DCVS Based on Weighted Non-local Similarity [J]. Computer Science, 2019, 46(1): 291-296.
[11] HENG Yang, CHEN Feng, XU Jian-feng, TANG Min. Application Status and Development Trends of Cardiac Magnetic Resonance Fast Imaging Based on Compressed Sensing Theory [J]. Computer Science, 2019, 46(1): 36-44.
[12] DU Xiu-li, ZHANG Wei, GU Bin-bin, CHEN Bo, QIU Shao-ming. GLCM-based Adaptive Block Compressed Sensing Method for Image [J]. Computer Science, 2018, 45(8): 277-282.
[13] CAI Ti-jian, FAN Xiao-ping, CHEN Zhi-jie and LIAO Zhi-fang. Sparse Representation Classification Model Based on Non-shared Multiple Measurement Vectors [J]. Computer Science, 2018, 45(3): 258-262.
[14] LIU Xin-yue, ZHAO Zhi-gang, LV Hui-xian, WANG Fu-chi and XIE Hao. Double Threshold Orthogonal Matching Pursuit Algorithm [J]. Computer Science, 2017, 44(Z6): 212-215.
[15] ZHAO Yang, WANG Wei, DONG Rong, WANG Jing-shi and TANG Min. Compressed Sensing Recovery Algorithm for Region of Interests of MRI/MRA Images Based on NLTV and NESTA [J]. Computer Science, 2017, 44(9): 308-314.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!