Computer Science ›› 2011, Vol. 38 ›› Issue (5): 149-153.

Previous Articles     Next Articles

Missing Data Imputation Based on Generalized Mahalanobis Distance

CHEN Huan,HUANG Der-cai   

  • Online:2018-11-16 Published:2018-11-16

Abstract: Missing data arc inevitable in data-collection, how to restore these data has become one of the hottest issues in data mining. Just like most algorithms,missing data imputation algorithms based on Mahalanobis Distance make full use of relationships between data. I}hough the results arc acceptable, the covariance matrixes arc not always reversible, which limit the algorithms greatly. This paper improved a traditional principal component analysis(PCA) method, proposed a new distance named Generalized Mahalanobis Distance according to SVl)and Moore-Penrose pseudoinverse. Combining with SOFM neural network and entropy, we designed GS missing data imputation algorithms. After academic analysis and simulation, it was proved that Generalized Mahalanobis Distance inherits the advantages of Mahalanobis Distance wonderfully in dealing with relatived data. Not only the new algorithm has good accuracy and stability, but also suits for any datascts.

Key words: PCA, Moore-penrose pseudoinverse, Generalized mahalanobis distance, SOFM neural network, Entropy

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!