Computer Science ›› 2011, Vol. 38 ›› Issue (Z10): 146-149.

Previous Articles     Next Articles

Web Data Mining Based on Cloud-computing

CHENG Miao   

  • Online:2018-11-16 Published:2018-11-16

Abstract: Internet is a huge and widely distributed information service center, the vast amounts of data generated on the Internet arc usually geographically distributed, heterogeneous, dynamic and become more complex, it can not meet the requirements if we use the existing centralized data mining methods. To solve these problems,proposed a cloud computing-based Web data mining method, the massive data and mining tasks will be decomposed on multiple computers parallely processed. We use open platform-Hadoop to establish a parallel association rules mining algorithm based on Apriori, and it tests and veriftes the efficiency of system. This paper proposed a design thinking that "migrate the calculation to the store",the calculation will be implemented on the local storage nodes, thus it can avoid the large amount of data transmission on the network, and will not take a lot of bandwidth.

Key words: Cloud-computing, Data mining, Map/Reduce, Association rules

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!