Computer Science ›› 2013, Vol. 40 ›› Issue (2): 195-199.

Previous Articles     Next Articles

mDHT:A Search Algorithm to Extra-large Volume of Data Based on Open HDFS Platform and Multi-level Indexing

  

  • Online:2018-11-16 Published:2018-11-16

Abstract: Corresponding to the storing and fast searching needs of extra-large scale of energy monitoring and statistics data,we proposed a Multi indexed Distributed Hash Table (mDHh) algorithm based on the HDFS/Hadoop open plat- form and multi-level indexing design, and accomplished the MapReduce implementation of the algorithm. hhe simulation experiment at a scale up to 48 million data records indicates that, when the data volume reaches the scale of 12 millions to 48 millions, the proposed mDH T algorithm presents an outstanding performance in data adding operation, compared to that of traditional MS SQL Server implementation. Even compared to the singlaindex search application, the mDHT approach reduces the data searching time by 24. 5%一57. 8 0 o. The multi-level indexed DHT algorithm presented in this paper provides a key technique for developing a fast search engine to the extra large scale of data on the cloud storage architecture.

Key words: Extra large scale data processing, Cloud storage, Multi-index, Search algorithm, MapReduce

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!