Computer Science ›› 2015, Vol. 42 ›› Issue (5): 82-87.doi: 10.11896/j.issn.1002-137X.2015.05.017

Previous Articles     Next Articles

Mining Frequent and High Utility Itemsets

LI Hui, LIU Gui-quan and QU Chun-yan   

  • Online:2018-11-14 Published:2018-11-14

Abstract: Mining interesting itemsets from transaction database has attracted a lot of research work for more than a decade.However,most of these studies either use frequency/support(e.g.,frequent itemset mining) or utility/profit (e.g.,high utility itemset mining) as the key interestingness measure.In other words,these two measures are consi-dered individually,which leads to some shortages that frequent itemsets may have low profit,or high profit itemsets may have very low support,so it is meaningless to recommend these itemsets to users.To this end,we considered these two measures from a unified perspective.Specifically,we proposed to identify the qualified itemsets which are both frequent and high utility.The key challenge to these problems is that the value of utility does not change monotonically when we add more items to a given itemset.Thus,we proposed an efficient algorithm named FHIMA (Frequent and High utility Itemset Mining Algorithms),where an effective upper bound based on frequency and utility is designed to further prune the search space.Moreover,FHIMA incorporates the idea of Prefixspan to avoid generating candidates,thus leading to high efficiency.Finally,the experiment results demonstrate the efficiency of FHIMA on real and synthetic datasets.

Key words: Top-k,Frequent,High utility,Qualified itemsets

[1] Agrawl R,Srikant R.Fast algorithms for mining associationrules[C]∥VLDB.1994:487-499
[2] Ahmed C F,Tanbeer S K,Jeong B-S,et al.Efficient tree structures for high utility pattern mining in incremental databases[C]∥TKDE.2009:1708-1721
[3] Ahmed C F,Tanbeer S K,Jeong B-S,et al.Efficient tree structures for high utility pattern mining in incremental databases[J].TKDE,2009,21(12):1708-1721
[4] Chan R,Yang Q,Shen Y.Ming high-utility itemsets[C]∥ICDM.2003:19-26
[5] Cheung Y L,Fu A W.Mining frequent itemsets without support threshold:with and without item constraints[C]∥TKDE.2004:1052-1069
[6] Chuang K,Huang J,Chen M.Mining top-k frequent patterns in the presence of the memory constraint[J].VLDB Journal,2008,17:1321-1488
[7] Fu A W,Kwong R W,Tang J.Mining n-most interesting itemsets[C]∥ISMIS.2000:59-67
[8] Gade K,Wang J,Karypis G.Efficient closed pattern mining inthe presence of tough block constraints[C]∥KDD.ACM,2004:138-147
[9] Han J,Pei J,Yin Y.Mining frequent patterns without candidate generation[C]∥SIGMOD.2000:1-12
[10] Han J,Wang J,Liu Y,et al.Mining top-k frequent closed pat-terns without minimum support[C]∥ICDM.2002:211-218
[11] Li H F,Huang H Y,Chen Y C,et al.Fast and memory efficient mining of high utility itemsets in data streams[C]∥ICDM.2008:881-886
[12] Liu M,Qu J.Mining high utility itemsets without candidate generation[C]∥CIKM.ACM,2012:55-64
[13] Liu Y,Liao W K,Choudhary A.A fast high utility itemsets mining algorithm[C]∥Workshop on WUDM.2005:90-99
[14] Lucchese C,Orlando S,Palmerini P,et al.KDCI:A multi-strategy algorithm for mining frequent itemsets[C]∥ICDM Workshop FIMI.2003:372-390
[15] Pei J,Han J,Mortazaviasl B,et al.Prefixspan:Mining sequential patterns efficiently by prefix-projected pattern growth[C]∥ICDE.2001:215-224
[16] Savasere A,Omiecinski E R,Navathe S B.An efficient algorithm for mining association rules in large databases[C]∥VLDB.1995:432-443
[17] Seno M,Karypis G.Lpminer:An algorithm for finding frequent itemsets using length-decreasing support constraint[C]∥ICDM.2011:505-512
[18] Tseng V S,Chu C J,Liang T.Efficient mining of temporal high-utility itemsets from data streams[C]∥KDDWorkshop on UBDM.2006:18
[19] Tseng V S,Wu C W,Shie B E,et al.Up-growth:an efficient algorithm for high utility itemset mining[C]∥KDD.2010:253-262
[20] Vo B,Nguyen H,Ho T B,et al.Parallel method for mining high utility itemsets from vertically partitioned distributed databases[C]∥KES 2009.2009:251-260
[21] Wu C,Shie B,Tseng V S,et al.Mining top-k high utility itemsets[C]∥KDD.2012:78-86
[22] Yao H,Hamilton H J,Buts C J.A foundational approach tomining itemset utilities from databases[C]∥SDM.2004:251-260
[23] Zaki M.Scalable algorithms for association mining[J].Know-ledge and Data Engineering,2000,12(2):372-390
[24] Frequent itemset mining dataset repository,2012.http://fimi.ua.ac.be

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!