数据挖掘中关联弱化问题的解决方法分析

计算机科学 ›› 2013, Vol. 40 ›› Issue (8): 220-222.

数据挖掘中关联弱化问题的解决方法分析

杨泽民,郭显娥,王文军

山西大同大学数学与计算机科学学院大同037009;山西大同大学数学与计算机科学学院大同037009;山西大同大学数学与计算机科学学院大同037009

出版日期:2018-11-16 发布日期:2018-11-16
基金资助:
本文受国家自然科学基金(11171112)资助

Research on Solution to Association Weakening Problem in Data Mining

YANG Ze-min,GUO Xian-e and WANG Wen-jun

Online:2018-11-16 Published:2018-11-16

摘要/Abstract

摘要： 当前的支持向量机和均值聚类等数据挖掘算法中,几乎都是依靠数据之间的关联性来完成数据匹配。一旦数据库中含有大量的冗余数据,将造成数据之间的相关性降低,关联性被破坏,导致传统的数据挖掘算法效率降低。为了避免上述缺陷,提出了一种弱化关联规则修补挖掘算法。利用弱聚类方法,在数据选择过程中,不将所有的元素都进行初始分类处理,只计算某一元素属于某一个类别的概率,确定多个弱聚类中心,计算不同数据之间的弱聚类关联性,从而实现关联规则较弱的冗余环境下准确的数据挖掘。实验结果表明,这种算法能够有效提高海量冗余环境下的数据挖掘效率,取得了令人满意的效果。

关键词: 海量冗余,数据挖掘,关联规则

Abstract: The support vector machine (SVM) and mean cluster data mining algorithm,almost all rely on the correlation between data,complete data matching．Once the database contains a large amount of redundancy data,the correlation between data will be reduced,and relevance is destroyed,resulting in traditional data mining algorithm efficiency lower．In order to avoid the above defects,this paper proposed a weakening association rules repair mining algorithm．In the data selection process,the method will not make initial classification processing for all elements only calculates proba-bility that one element belongs to a category,and determines multiple weak clustering center,calculates weak clustering relevance between different data,so as to realize the association rules weaker redundancy environment accurate data mining．The experimental results show that this algorithm can effectively improve the massive redundant environment data mining efficiency,has made the satisfactory effect.

Key words: Mass redundancy,Data mining,Association rules

杨泽民,郭显娥,王文军. 数据挖掘中关联弱化问题的解决方法分析[J]. 计算机科学, 2013, 40(8): 220-222. https://doi.org/

YANG Ze-min,GUO Xian-e and WANG Wen-jun. Research on Solution to Association Weakening Problem in Data Mining[J]. Computer Science, 2013, 40(8): 220-222. https://doi.org/

参考文献

[1] 崔建,李强,杨龙坡.基于垂直数据分布的大型稠密数据库快速关联规则挖掘算法[J].计算机科学,2011(4):216-219
[2] Tojanovic Z,Dahanayake A．Service-Oriented Software SystemEngineering:Challenges and Practices [J]．Idea Group Publi-shing,2011:1-47
[3] Tasi T,Zhang D,Chen Y,et al．A software reliability model for Web services[C]∥8th IASTED International Conference on Software Engineering and Applications．Cambridge,MA,USA,2011:144-149
[4] 穆肇南,张健.数据挖掘技术在经济预测中的应用[J].计算机仿真,2012(6):347-350
[5] 王晟,赵壁芳.基于模糊数据挖掘和遗传算法的网络入侵检测技术[J].计算机测量与控制,2012(3):660-663
[6] Xu Yue,Li Yue-feng．Mining non-redundant association rulesbased on concise bases[J]．International Journal of Pattern Re-cognition and Artificial Intelligence,2007,1(4):659-675
[7] Loglisci C,Malerba D．Mining multiple level non-redundant association rules through two-fold pruning of redundancies[C]∥Proceedings of MLDM．2009:251-265
[8] Cheng J,ke Y P,Ng W.Effective elimination of redundant association rules[J]．Data mining and knowledge discovery,2008,16(2):221-249

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed