Computer Science ›› 2019, Vol. 46 ›› Issue (10): 71-76.doi: 10.11896/jsjkx.190100223

• Big Data & Data Science • Previous Articles     Next Articles

Association Rule Mining Algorithm Based on Timestamp and Vertical Format

WANG Bin, MA Jun-jie, FANG Xin-xiu, WEI Tian-you   

  1. (School of Information and Control Engineering,Qingdao University of Technology,Qingdao,Shandong 266520,China)
  • Received:2019-01-26 Revised:2019-05-05 Online:2019-10-15 Published:2019-10-21

Abstract: The SLMCM algorithm (Specific Later-marketed Consequent Mining) is mainly used to solve the problem of later item,but it is inefficient and difficult to adapt to big data mining.For this problem,this paper proposed the improved algorithms E-SLMCM and DE-SLMCM .E-SLMCM algorithm is based on vertical structure,so it only traversal the database twice.Furthermore,the timestamp of each item can be directly calculated when the format is converted to vertical,and each transaction does not need to be sorted by the timestamp of the item.In addition,a new method for finding the itemset timestamp was proposed,which dose not need to traverse the database to find the timestamp of itemset.In order to adapt to dense database,DE-SLMCM algorithm was proposed based on E-SLMCM algorithm and diffset,which improves the execution efficiency on dense database.In the listed four simulation experiments based on common data sets,the time efficiency of E-SLMCM and DE-SLMCM running on sparse and dense data sets is 10-1000 times higher than that of SLMCM.

Key words: Association rules, Diffset, Later item, Timestamp

CLC Number: 

  • TP391
[1]AGRAWAL R,IMIELIHSKI T,SWAMI A.Mining association rules between sets of items in large databases[J].Acm sigmod record,1993,22(2):207-216.
[2]HAN J,PEI J,YIN Y .Mining Frequent Patterns Without Candidate Generation[C]//Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data.Dallas,Te-xas,USA:ACM,2000.
[3]ZAKI M J.Scalable algorithms for association mining[J].IEEE transactions on knowledge and data engineering,2000,12(3):372-390.
[4]RASHID M M,GONDAL I,KAMRUZZAMAN J.Mining associated patterns from wireless sensor networks[J].IEEE Tran-sactions on Computers,2015,64(7):1998-2011.
[5]WANG L,MENG J,XU P,et al.Mining temporal association rules with frequent itemsets tree[J].Applied Soft Computing,2018,62:817-829.
[6]CHEN C H,LAN G C,HONG T P,et al.Mining fuzzy temporal association rules by item lifespans[J].Applied Soft Computing,2016,41:265-274.
[7]ZOU C,DENG H,WAN J,et al.Mining and updating association rules based on fuzzy concept lattice[J].Future Generation Computer Systems,2018,82:698-706.
[8]MIHOLCA D L,CZIBULA G,CRIVEI L M.A new incremental relational association rules mining approach[J].Procedia Computer Science,2018,126:126-135.
[9]NGUYEN D,LUO W,PHUNG D,et al.LTARM:A novel temporal association rule mining method to understand toxicities in a routine cancer treatment[J].Knowledge-Based Systems,2018,161:313-328.
[10]ZHANG S C,ZHANG J L,CHEN F,et al.Negative Incremental Updating Algorithm for Maintaining Association rules[J].Computer Science,2005,32(9):153-155.(in Chinese)
张师超,张继连,陈峰,等.负增量式关联规则更新算法[J].计算机科学,2005,32(9):153-155.
[11]CAI J,XUE Y S,LIN L,et al.Updating Algorithm for Association Rules Based on Fully M ining Incremental Transactions[J].Computer Science,2007,34(2):220-222.(in Chinese)
蔡进,薛永生,林丽,等.基于充分挖掘增量事务的关联规则更新算法[J].计算机科学,2007,34(2):220-222.
[12]GAN W,LIN C W,FOURNIER-VIGER P,et al.Mining of frequent patterns with multiple minimum supports[J].Engineering Applications of Artificial Intelligence,2017,60(C):83-96.
[13]DARRAB S,ERGENC B.Vertical Pattern Mining Algorithm for Multiple Support Thresholds[J].Procedia ComputerScien-ce,2017,112:417-426.
[14]HU H W,CHANG H C,LIN W S,et al.An optimized frequent pattern mining algorithm with multiple minimum supports[C]//IEEE International Conference on Big Data.IEEE,2017.
[15]DAHBI A,BALOUKI Y,GADI T.Using Multiple Minimum Support to Auto-adjust the Threshold of Support in Apriori Algorithm[C]//International Conference on Soft Computing and Pattern Recognition.Cham:Springer,2017:111-119.
[16]WENG C H.Identifying association rules of specific later-marketed products[J].Applied Soft Computing,2016,38:518-529.
[17]GAO J,SHI B L.Research on Fast Association Rule Mining Algorithm[J].Computer Science,2005,32(3):200-201.(in Chinese)
c高俊,施伯乐.快速关联规则挖掘算法研究[J].计算机科学,2005,32(3):200-201.
[18]LIU D,LIU Y,YIN Z.Fast algorithm for discovering maximum frequent itemsets of association rules[J].Acta Scientiarium Natu-ralium Universitatis Jilinensis,2004,42(2):212-215.
[19]ZAKI M J,GOUDA K.Fast vertical mining using diffsets[C]//Proceedings of the Ninth ACM SIGKDD International Confe-rence on Knowledge Discovery and Data Mining.ACM,2003:326-335.
[20]FOURNIER-VIGER P,GOMARIZ A,SOLTANI A,et al. SPMF:OpenSource Data Mining Platform[EB/OL].http://www.philippe-fournier-viger.com/spmf.
[1] CAO Yang-chen, ZHU Guo-sheng, SUN Wen-he, WU Shan-chao. Study on Key Technologies of Unknown Network Attack Identification [J]. Computer Science, 2022, 49(6A): 581-587.
[2] XU Hui-hui, YAN Hua. Relative Risk Degree Based Risk Factor Analysis Algorithm for Congenital Heart Disease in Children [J]. Computer Science, 2021, 48(6): 210-214.
[3] SHEN Xia-jiong, YANG Ji-yong, ZHANG Lei. Attribute Exploration Algorithm Based on Unrelated Attribute Set [J]. Computer Science, 2021, 48(4): 54-62.
[4] LI Ai-ling, ZHANG Feng-li, GAO Qiang, WANG Rui-jin. Trajectory Next Footprint Prediction Model Based on Adaptive Timestamp and Multi-scale Feature Extraction [J]. Computer Science, 2021, 48(11A): 191-197.
[5] ZHU An-qing, LI Shuai, TANG Xiao-dong. Parallel FP_growth Association Rules Mining Method on Spark Platform [J]. Computer Science, 2020, 47(12): 139-143.
[6] HAN Cheng-cheng, LIN Qiang, MAN Zheng-xing, CAO Yong-chun, WANG Hai-jun, WANG Wei-lan. Mining Nuclear Medicine Diagnosis Text for Correlation Extraction Between Lesions and Their Representations [J]. Computer Science, 2020, 47(11A): 524-530.
[7] LU Xin-yun, WANG Xing-fen. Educational Administration Data Mining of Association Rules Based on Domain Association Redundancy [J]. Computer Science, 2019, 46(6A): 427-430.
[8] ZHANG Wei-guo. Decision Making of Course Selection Oriented by Knowledge Recommendation Service [J]. Computer Science, 2019, 46(6A): 507-510.
[9] LI Zhi-xing, REN Shi-ya, WANG Hua-ming, SHEN Ke. Knowledge Reasoning Method Based on Unstructured Text-enhanced Association Rules [J]. Computer Science, 2019, 46(11): 209-215.
[10] JIA Xi-bin,YE Ying-jie,CHEN Jun-cheng. Influence Factors Mining of Traffic Accidents Based on Association Rules [J]. Computer Science, 2018, 45(6A): 447-452.
[11] LIU Jie, WANG Gui-ling, ZUO Xiao-jiang. Incremental Data Extraction Model Based on Variable Time-window [J]. Computer Science, 2018, 45(11): 204-209.
[12] DING Dang, ZHANG Zhi-fei, MIAO Duo-qian and CHEN Yue-feng. Ordering Recommender Algorithm Based on Consumers’ Behavior [J]. Computer Science, 2017, 44(Z11): 46-50.
[13] LI Chao, ZHAO Shu-liang, ZHAO Jun-peng, GAO Lin and CHI Yun-xian. Scaling-up Algorithm of Multi-scale Association Rules [J]. Computer Science, 2017, 44(8): 285-289.
[14] ZHANG Yan-ping and LING Jie. Improved Algorithm for Privacy-preserving Association Rules Mining on Horizontally Distributed Databases [J]. Computer Science, 2017, 44(8): 157-161.
[15] ZHAO Yue, REN Yong-gong and LIU Yang. Improved Apriori Algorithm and Its Application Based on MapReduce [J]. Computer Science, 2017, 44(6): 250-254.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!