计算机科学 ›› 2017, Vol. 44 ›› Issue (8): 285-289.doi: 10.11896/j.issn.1002-137X.2017.08.049

• 人工智能 • 上一篇    下一篇

多尺度关联规则尺度上推算法

李超,赵书良,赵骏鹏,高琳,池云仙   

  1. 河北师范大学数学与信息科学学院 石家庄050024 河北师范大学河北省计算数学与应用重点实验室 石家庄050024,河北师范大学数学与信息科学学院 石家庄050024 河北师范大学河北省计算数学与应用重点实验室 石家庄050024,河北师范大学数学与信息科学学院 石家庄050024 河北师范大学河北省计算数学与应用重点实验室 石家庄050024,河北师范大学数学与信息科学学院 石家庄050024 河北师范大学河北省计算数学与应用重点实验室 石家庄050024,河北师范大学数学与信息科学学院 石家庄050024 河北师范大学河北省计算数学与应用重点实验室 石家庄050024
  • 出版日期:2018-11-13 发布日期:2018-11-13
  • 基金资助:
    本文受国家自然科学基金项目(71271067),国家社科基金重大项目(13&ZD091),河北省高等学校科学技术研究项目(QN2014196),河北师范大学硕士基金(xj2015003)资助

Scaling-up Algorithm of Multi-scale Association Rules

LI Chao, ZHAO Shu-liang, ZHAO Jun-peng, GAO Lin and CHI Yun-xian   

  • Online:2018-11-13 Published:2018-11-13

摘要: 数据挖掘在多尺度研究方面取得了一些成果。然而,多尺度数据挖掘研究还不够深入和完善。目前针对空间和图像数据的研究较多,对于一般数据的多尺度数据挖掘的研究较少。随着大数据应用的不断发展,其研究显得尤为重要。针对上述问题,进行了普适的多尺度关联规则尺度转换方法的研究。首先,基于包含度的相似度理论提出频繁项集的处理方法;然后,以图像金字塔为理论基础,提出了多尺度关联规则尺度上推算法MSARSUA(Multi-Scale Association Rules Scaling Up Algorithm);最后,利用H省1)全员人口真实数据集、UCI公用数据集和IBM数据集对所提算法进行了实验验证与分析,结果表明MSARSUA具有较高的覆盖率、较高的F1-measure值和较低的平均支持度估计误差,在效率上比Apriori算法和FP-Growth算法有较大的提升,在性能上比SU-ARMA有更好的表现。

关键词: 多尺度,关联规则,尺度上推,多尺度关联规则挖掘

Abstract: Great achievements have been made on multi-scale research of data mining.However,multi-scale data mining research is far from being deep and perfect.Current research,which mainly focuses on space and image data,pays less attention to multi-scale data mining on the general data.With the continuous development of big data applications,research of multi-scale data mining becomes particularly important.Regarding the issue above,this paper carried out a study of scale-conversion methods on universal multi-scale association rules data mining.First of all,this paper gave an approach of frequent items based on the similarity theory of including degree.Then,the paper proposed an algorithm named MSARSUA (Multi-Scale Association Rules Scaling Up Algorithm) based on the theory of image pyramid.Finally,experimental results on data sets from H province,UCI and IBM show that algorithm MSARSUA has higher coverage,higher F1-measure and lower estimation error of average support.Algorithm MSARSUA outperforms both Apriori algorithm and FP-Growth algorithm on efficiency aspect.Meanwhile,the results indicate that algorithm MSARSUA possesses superior performance compared with algorithm SU-ARMA.

Key words: Multi-scale,Association rules,Scaling-up,Multi-scale association rules mining

[1] LI H J,LI H Y,LI A H.Analysis of multi-scale stability in community structure[J].Chinese Journal of Computer,2015,8(2):301-312.(in Chinese) 李慧嘉,李慧颖,李爱华.多尺度的社团结构稳定性分析[J].计算机学报,2015,8(2):301-312.
[2] CHEN J J,SU S B,XU H L.Decision tree optimization algorithm based on multiscale rough set model[J].Journal of Computer Applications,2011,1(12):3243-3246.(in Chinese) 陈家俊,苏守宝,徐华丽.基于多尺度粗糙集模型的决策树优化算法[J].计算机应用,2011,1(12):3243-3246.
[3] LIU M M,ZHAO S L,CHEN M,et al.Scaling-up mining algorithm of multi-scale association rules mining[J].Application Research of Computers,2015,2(10):2924-2929.(in Chinese) 柳萌萌,赵书良,陈敏,等.多尺度关联规则挖掘的尺度上推算法[J].计算机应用研究,2015,2(10):2924-2929.
[4] WANG Z,ZHANG L,FANG T.A Multiscale and Hierarchical Feature Extraction Method for Terrestrial Laser Scanning Point Cloud Classification[J].IEEE Transactions on Geoscience and Remote Sensing,2015,3(5):2409-2425.
[5] ALEX J,HENRI P,SYLVIE,et al.Interactive Multiscale Classification of High-Resolution Remote Sensing Images[J].IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing,2013,6(4):2020-2034.
[6] YING D,TAO G,LEI L.Self-Organizing Map Based Multi-scaleSpectral Clustering for Image Segmentation[C]∥2012 International Conference on Computer Science and Electronics Engineering.2012:329-333.
[7] ZHANG W X,XU Z B,LIANG Y,et al.Inclusion degree theory[J].Fuzzy Systems and Mathematics,1996,10(4):1-9.(in Chines) 张文修,徐宗本,梁怡,等.包含度理论[J].模糊系统与数学,1996,10(4):1-9.
[8] YU Z M,GAO F.Laplacian pyramid and contrast pyramid based image fusion and their performance comparison[J].Application Research of Computers,2004,21(10):128-130.(in Chinese) 玉振明,高飞.基于金字塔方法的图像融合原理及性能评价[J].计算机应用研究,2004,1(10):128-130.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!