Computer Science (计算机科学) ›› 2012, Vol. 39 ›› Issue (10): 160-163.

• Databases and Data Mining •

Research on an Efficient FP-STREAM Algorithm Based on a Vertical Compression Format

Tang Yaohong (唐耀红), Wei Huiqin (魏慧琴)

  1. (School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044)
  • Publication date: 2018-11-16 Release date: 2018-11-16

Efficient FP-STREAM Algorithm Based on Vertical Compression Data Format

  • Online:2018-11-16 Published:2018-11-16

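Abstract (translated from the Chinese): In recent years, the explosive growth of information has made frequent pattern mining over data streams a research hot spot. FP-Stream, a classic algorithm for mining frequent patterns in data streams, supports mining at multiple time granularities. However, it does not compress the data itself, which limits the volume of data it can process in a given period and creates a conflict between limited memory and high-speed, massive data. This paper improves FP-Stream by applying vertical-format and Dif-bits compression transformations to the data stream, which greatly reduces memory requirements and increases data-processing capacity. Experiments show that the improved algorithm is effective.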

Keywords (translated from the Chinese): data stream, frequent pattern, FP-Stream, vertical format, Dif-bits data compression

Abstract: With the sharp growth of information, mining frequent itemsets has become a hot topic in recent years. FP-Stream is a classic algorithm for mining frequent itemsets at multiple time granularities. Its weakness is the conflict between massive data and the limited memory available in a given time, which prevents the algorithm from being used for high-speed data stream mining. This article proposes an improved FP-Stream algorithm based on a vertical, Dif-bits-compressed data format.

Key words: Data stream, Frequent itemsets, FP-Stream, Vertical format, Dif-bits data compression
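
The abstract does not spell out the transformation, so the following is only a minimal sketch of what a vertical data format generally looks like for one batch of a stream. Each item maps to a bitmask of the transactions that contain it, and the support of an itemset is the number of bits left after AND-ing the masks. The function names (`to_vertical`, `support`, `gap_encode`) and the sample batch are hypothetical. The gap encoding is a generic stand-in that shows why a vertical layout compresses well; it is not the paper's Dif-bits scheme.

```python
from collections import defaultdict


def to_vertical(transactions):
    """Map each item to an int bitmask; bit t is set when transaction t contains the item."""
    vertical = defaultdict(int)
    for tid, items in enumerate(transactions):
        for item in items:
            vertical[item] |= 1 << tid
    return dict(vertical)


def support(vertical, itemset, n_transactions):
    """Count the transactions that contain every item of itemset (bitwise AND of the masks)."""
    mask = (1 << n_transactions) - 1  # start with all transactions selected
    for item in itemset:
        mask &= vertical.get(item, 0)
    return bin(mask).count("1")


def gap_encode(bitmask):
    """Generic gap encoding (NOT Dif-bits): first set-bit position, then differences."""
    positions = [i for i in range(bitmask.bit_length()) if (bitmask >> i) & 1]
    return [p - q for p, q in zip(positions, [0] + positions[:-1])]


# Hypothetical batch of four transactions (transaction ids 0..3).
batch = [{"a", "b"}, {"b", "c"}, {"a", "b", "c"}, {"a"}]
vertical = to_vertical(batch)
ab_support = support(vertical, ("a", "b"), len(batch))
a_gaps = gap_encode(vertical["a"])
```

In this sketch a support count needs no rescan of the batch, only an intersection of per-item masks, which is the usual motivation for a vertical layout when memory is limited.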
