Computer Science ›› 2012, Vol. 39 ›› Issue (3): 174-182.
WANG Jie, DAI Qing-hao, LI Huan
Abstract: Frequent pattern mining finds frequently occurring patterns in data and is an important step in association rule mining. Parallel frequent pattern (PFP) algorithms carry it into a parallel environment, which makes them suitable for massive data. Based on the Apache Mahout implementation, this paper proposes a design that optimizes the counting and sorting parts of PFP using a distributed coordination system. The design takes advantage of the distributed coordination system to reduce consumption of HDFS and of data-node memory. Another benefit is that the counting and sorting procedures start in parallel. Finally, the paper analyzes the experimental results and the implementation difficulties, for further study.
Key words: Frequent pattern growth algorithm, Parallel data mining, Distributed coordination system, Performance tuning
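For readers unfamiliar with the counting and sorting steps named in the abstract, the following is a minimal single-process sketch in Python of what those steps compute in frequent pattern growth: per-item support counts, filtered by a minimum support and sorted in descending order. It is an illustration only, not the paper's distributed design or Mahout's code; the function name `build_f_list`, the sample transactions, and the `min_support` value are all assumptions made for this example.

```python
from collections import Counter


def build_f_list(transactions, min_support):
    """Count item support, drop infrequent items, sort by descending count.

    Hypothetical helper for illustration; not taken from the paper or Mahout.
    """
    counts = Counter()
    for transaction in transactions:
        # Count each item at most once per transaction (support, not raw frequency).
        counts.update(set(transaction))
    frequent = [(item, n) for item, n in counts.items() if n >= min_support]
    # Descending count; ties broken by item name so the order is deterministic.
    frequent.sort(key=lambda pair: (-pair[1], pair[0]))
    return frequent


# Made-up sample data, used only to exercise the function.
transactions = [
    ["bread", "milk"],
    ["bread", "diapers", "beer", "eggs"],
    ["milk", "diapers", "beer", "cola"],
    ["bread", "milk", "diapers", "beer"],
    ["bread", "milk", "diapers", "cola"],
]
f_list = build_f_list(transactions, min_support=3)
print(f_list)
```

In a parallel setting the counting would be split across workers and the partial counts merged before sorting; how the paper coordinates that merge is not described in the abstract, so it is not sketched here.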
WANG Jie, DAI Qing-hao, LI Huan. Tuning of Parallel Frequent Pattern Growth Algorithm Based on Distributed Coordination System[J]. Computer Science, 2012, 39(3): 174-182.
URL: https://www.jsjkx.com/EN/
https://www.jsjkx.com/EN/Y2012/V39/I3/174