Computer Science ›› 2011, Vol. 38 ›› Issue (Z10): 146-149.
Previous Articles Next Articles
CHENG Miao
Online:
Published:
Abstract: Internet is a huge and widely distributed information service center, the vast amounts of data generated on the Internet arc usually geographically distributed, heterogeneous, dynamic and become more complex, it can not meet the requirements if we use the existing centralized data mining methods. To solve these problems,proposed a cloud computing-based Web data mining method, the massive data and mining tasks will be decomposed on multiple computers parallely processed. We use open platform-Hadoop to establish a parallel association rules mining algorithm based on Apriori, and it tests and veriftes the efficiency of system. This paper proposed a design thinking that "migrate the calculation to the store",the calculation will be implemented on the local storage nodes, thus it can avoid the large amount of data transmission on the network, and will not take a lot of bandwidth.
Key words: Cloud-computing, Data mining, Map/Reduce, Association rules
CHENG Miao. Web Data Mining Based on Cloud-computing[J].Computer Science, 2011, 38(Z10): 146-149.
0 / / Recommend
Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks
URL: https://www.jsjkx.com/EN/
https://www.jsjkx.com/EN/Y2011/V38/IZ10/146
Cited