Computer Science ›› 2023, Vol. 50 ›› Issue (11A): 220700200-8.doi: 10.11896/jsjkx.220700200

• Big Data & Data Science • Previous Articles     Next Articles

Bio-inspired Frequent Itemset Mining Strategy Based on Genetic Algorithm

ZHAO Xuejian1,2, ZHAO Ke1   

  1. 1 Technology and Application Engineering Center of Postal Big Data,Nanjing University of Posts and Telecommunications,Nanjing 210003,China
    2 Key Lab of Broadband Wireless Communication and Sensor Network Technology of Ministry of Education,Nanjing University of Posts and Telecommunications,Nanjing 210003,China
  • Published:2023-11-09
  • About author:ZHAO Xuejian,born in 1982,Ph.D,associate professor,is a member of China Computer Federation.His main research interests include data mining and wireless sensor networks.
  • Supported by:
    National Natural Science Foundation of China(61972208) and China Postdoctoral Science Foundation(2018M640509).

Abstract: Precise frequent itemset mining algorithms usually have a low time efficiency,particularly in processing large-scale data sets.To solve this problem,a frequent itemset mining algorithm,genetic algorithm combining apriori property based frequent itemset mining(GAA-FIM),is proposed,which combines the genetic algorithm and the downward closure property of precise frequent itemset mining algorithms.The detailed operation rules of coding operation,crossover operation,mutation operation and selection operation are described in detail.In GAA-FIM algorithm,individuals with good genes are preferentially added to the latest generation of candidate population through the asexual crossover operation process and the scale of the new generation candidate can be expanded through mutation operation.Therefore,the time efficiency of the proposed algorithm can be improved greatly and the frequent itemsets with better quality can be obtained.The performance of GAA-FIM algorithm is validated based on both synthetic data sets and real data sets.Experimental results show that the proposed GAA-FIM algorithm has a better time efficiency than GAFIM algorithm and GA-Apriori algorithm.Moreover,the quality of mining frequent itemsets has been further improved.

Key words: Frequent itemset, Genetic algorithm, Bio-inspired, Downward closure property, Data mining

CLC Number: 

  • TP391
[1]CHEE C H,JAAFAR J,AZIZ I A,et al.Algorithms for fre-quent itemset mining:a literature review[J].Artificial Intelligence Review,2019,52(4):2603-2621.
[2]VALIULLIN T,HUANG Z,WEI C,et al.A new approximate method for mining frequent itemsets from big data[J].Compu-ter Science and Information Systems,2021,18(3):641-656.
[3]MATA J,ALVAREZ J L,RIQUELME J C.Mining numeric association rules with genetic algorithms[C]//Artificial Neural Nets and Genetic Algorithms.Springer,Vienna,2001:264-267.
[4]MATA J,ALVAREZ J L,RIQUELME J C.An evolutionary algorithm to discover numeric association rules[C]//Proceedings of the 2002 ACM Symposium on Applied Computing.2002:590-594.
[5]ALATA B,AKIN E.An efficient genetic algorithm for automated mining of both positive and negative quantitative association rules[J].Soft Cmputing,2006,10(3):230-237.
[6]YAN X,ZHANG C,ZHANG S.Genetic algorithm-based strategy for identifying association rules without specifying actual minimum support[J].Expert Systems with Applications,2009,36(2):3066-3076.
[7]DJENOURI Y,NOUALI-TABOUDJEMAT N,BENDJOUDI A.Association rules mining using evolutionary algorithms[C]//The 9th International Conference on Bio-inspired Computing:Theories and Applications(BIC-TA 2014).LNCS,2014.
[8]DJENOURI Y,COMUZZI M.Combining Apriori heuristic and bio-inspired algorithms for solving the frequent itemsets mining problem[J].Information Sciences,2017,420:1-15.
[9]BAGUI S,STANLEY P.Mining frequent itemsets from strea-ming transaction data using genetic algorithms[J].Journal of Big Data,2020,7(1):54.
[10]FONG S,WONG R,VASILAKOS A V.Accelerated PSOswarm search feature selection for data stream mining big data[J].IEEE Transactions on Services Computing,2015,9(1):33-45.
[11]KUO R J,LIN S Y,SHIH C W.Mining association rulesthrough integration of clustering analysis and ant colony system for health insurance database in Taiwan[J].Expert Systems with Applications,2007,33(3):794-808.
[12]WU J M T,ZHAN J,LIN J C W.An ACO-based approach to mine high-utility itemsets[J].Knowledge-Based Systems,2017,116:102-113.
[13]KUO R J,CHAO C M,CHIU Y T.Application of particleswarm optimization to association rule mining[J].Applied Soft Computing,2011,11(1):326-336.
[14]LIN J C W,YANG L,FOURNIER-VIGER P,et al.Mininghigh-utility itemsets based on particle swarm optimization[J].Engineering Applications of Artificial Intelligence,2016,55:320-330.
[15]DJENOURI Y,DRIAS H,HABBAS Z.Bees swarm optimization using multiple strategies for association rule mining[J].International Journal of Bio-Inspired Computation,2014,6(4):239-249.
[16]HERAGUEMI K E,KAMEL N,DRIAS H.Multi-swarm batalgorithm for association rule mining using multiple cooperative strategies[J].Applied Intelligence,2016,45(4):1021-1033.
[17]CAO H,YANG S,WANG Q,et al.A Closed Itemset Property based Multi-objective Evolutionary Approach for Mining Frequent and High Utility Itemsets[C]//2019 IEEE Congress on Evolutionary Computation(CEC).Wellington,New Zealand,2019:3356-3363.
[18]DJENOURIY,DJENOURI D,BELHADI A,et al.A Novel Pa-rallel Framework for Metaheuristic-based Frequent Itemset Mining[C]//2019 IEEE Congress on Evolutionary Computation(CEC).Wellington,New Zealand,2019:1439-1445.
[19]TIMUR V,HUANG Z J,WEI C,et al.A new approximatemethod for mining frequent itemsets from big data[J].Compu-ter Science and Information Systems,2021,18(3):641-656.
[20]RAMESH D F,JEYASUTHA M.A Novel Fuzzy FrequentItemsets Mining Approach for the Detection of Breast Cancer[J].International Journal of Information Retrieval Research,2021,11(1):36-53.
[21]FATEMI S M,HOSSEINI S M,KAMANDI A,et al.CL-MAX:a clustering-based approximation algorithm for mining maximal frequent itemsets[J].International Journal of Machine Learning and Cybernetics,2021,12:365-383.
[22]YU X,ZHAO J,WANG H,et al.A model of mining approximate frequent itemsets using rough set theory[J].International Journal of Computational Science and Engineering(IJCSE),2019,19(1):71-82.
[23]WU T,LIN J,YUN J,et al.An efficient algorithm for fuzzy fre-quent itemset mining[J].Journal of Intelligent & Fuzzy Systems,2020,38(5):5787-5797.
[24]VALIULLIN T,HUANG Z,WEI C,et al.A new approximate method for mining frequent itemsets from big data[J].ComputerScience and Information Systems,2021,18(3):641-656.
[1] LIU Xuanyu, ZHANG Shuai, HUO Shumin, SHANG Ke. Microservice Moving Target Defense Strategy Based on Adaptive Genetic Algorithm [J]. Computer Science, 2023, 50(9): 82-89.
[2] LIU Ziwen, YU Lijuan, SU Yixing, ZHAO Yao, SHI Zhu. Test Case Generation Based on Web Application Front-end Behavior Model [J]. Computer Science, 2023, 50(7): 18-26.
[3] SHEN Zhehui, WANG Kailai, KONG Xiangjie. Exploring Station Spatio-Temporal Mobility Pattern:A Short and Long-term Traffic Prediction Framework [J]. Computer Science, 2023, 50(7): 98-106.
[4] LI Kun, GUO Wei, ZHANG Fan, DU Jiayu, YANG Meiyue. Adversarial Malware Generation Method Based on Genetic Algorithm [J]. Computer Science, 2023, 50(7): 325-331.
[5] REN Gaoke, MO Xiuliang. Network Security Situation Assessment for GA-LightGBM Based on PRF-RFECV Feature Optimization [J]. Computer Science, 2023, 50(6A): 220400151-6.
[6] ZHANG Jian, ZHANG Ye. College Students Employment Dynamic Prediction of Multi-feature Fusion Based on GRU-LSTM [J]. Computer Science, 2023, 50(6A): 220500056-6.
[7] ZHANG Zelun, YANG Zhibin, LI Xiaojie, ZHOU Yong, LI Wei. Machine Learning Based Environment Assumption Automatic Generation for Compositional Verification of SCADE Models [J]. Computer Science, 2023, 50(6): 297-306.
[8] ZHONG Jialin, WU Yahui, DENG Su, ZHOU Haohao, MA Wubin. Multi-objective Federated Learning Evolutionary Algorithm Based on Improved NSGA-III [J]. Computer Science, 2023, 50(4): 333-342.
[9] SHANG Yuye, YUAN Jiabin. Task Offloading Method Based on Cloud-Edge-End Cooperation in Deep Space Environment [J]. Computer Science, 2023, 50(2): 80-88.
[10] CHEN Haiyan, ZHU Junlin, WANG Ping. Study on Prediction Modeling and Compensation of Circular Target Center Positioning Error Based on GA-BP [J]. Computer Science, 2023, 50(11A): 221100170-5.
[11] ZHANG Yang, WANG Rui, WU Guanfeng, LIU Hongyi. Parallel Mining Algorithm of Frequent Itemset Based on N-list and DiffNodeset Structure [J]. Computer Science, 2023, 50(11): 55-61.
[12] ZHANG Luping, XU Fei. SNPT Systems Working in Global Asynchronous and Local Synchronous Mode [J]. Computer Science, 2023, 50(1): 270-275.
[13] ZHANG Lu-ping, XU Fei. Survey on Spiking Neural P Systems with Rules on Synapses [J]. Computer Science, 2022, 49(8): 217-224.
[14] LI Rong-fan, ZHONG Ting, WU Jin, ZHOU Fan, KUANG Ping. Spatio-Temporal Attention-based Kriging for Land Deformation Data Interpolation [J]. Computer Science, 2022, 49(8): 33-39.
[15] YANG Hao-xiong, GAO Jing, SHAO En-lu. Vehicle Routing Problem with Time Window of Takeaway Food ConsideringOne-order-multi-product Order Delivery [J]. Computer Science, 2022, 49(6A): 191-198.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!