计算机科学 ›› 2014, Vol. 41 ›› Issue (12): 143-147.doi: 10.11896/j.issn.1002-137X.2014.12.030

• 人工智能 • 上一篇    下一篇

同源数据的协同挖掘算法研究

王泳,吕科,潘卫国   

  1. 中国科学院大学 北京100049;中国科学院大学 北京100049;中国科学院大学 北京100049
  • 出版日期:2018-11-14 发布日期:2018-11-14
  • 基金资助:
    本文受国家自然科学基金 (61371155)资助

Research on Collaborative Mining Algorithm on Homologous Data

WANG Yong,LV Ke and PAN Wei-guo   

  • Online:2018-11-14 Published:2018-11-14

摘要: 围绕知识管理和提高数据挖掘模型的可解释性问题展开研究,提出了 采用协同挖掘的方法对同源数据进行模式评估和知识管理的CMA算法(Collaborative Mining Algorithm)。与集成学习产生同一类型知识规则的组合学习方式不同,协同挖掘在同源数据的基础上建立不同类型的学习模型,并且每类学习模型产生的知识规则的表现形式各不相同,通过比对学习形成了一致的知识规则。实验表明,协同挖掘可以有效发现数据中的隐含信息,提高知识管理的性能。

关键词: 同源数据,协同挖掘,模型评估,知识管理

Abstract: This article explored the issues of knowledge management and improvement of the interpretability of data mining models,and proposed the collaborative mining algorithm (CMA),which performs pattern evaluation and know-ledge management based on collaborative mining of homologous data.In contrast to the ensemble learning knowledge rules by combining learning models of the same type,collaborative mining sets up learning models of different types based on homologous data,and each model owns different forms of knowledge rules.Through the comparison study,coincident knowledge rules were formed.Experiments show that collaborative mining can efficiently find the latent information in data,and improve the performance of knowledge management.

Key words: Homologous data,Collaborative mining,Model evaluation,Knowledge management

[1] Fayyad U M,Shapiro G P,Smyth P.The KDD Process for Extracting Useful Knowledge from Volumes of Data[J].Communications of the ACM,1996,39(11):27-34
[2] Han Jia-wei,Kamber M,Pei Jian.Data Mining:Concepts andTechniques(3rd edition)[M].Singapore,Elsevier,2012
[3] 郭萌,王珏.数据挖掘与数据库知识发现:综述[J].模式识别与人工智能,1998,11(3):292-299
[4] 胡包钢,王泳,杨双红,等.如何增加人工神经元网络的透明度?[J].模式识别与人工智能,2007,20(1):72-84
[5] Tan Pang-ning,Steinbach M,Kumar V.Introduction to DataMining[M].Addison Wesley,2005
[6] Mitra S,Pal S K,Mitra P.Data Mining in Soft ComputingFramework:A Survey[J].IEEE Trans.on Neural Networks,2002,13(1):3-14
[7] Lee M R,Chen T T.Revealing research themes and trends in knowledge management:From 1995 to 2010[J].Knowledge-Based Systems,2012,28(4):47-58
[8] West M.Developing High Quality Data Models[M].Singapore,Elsevier,2011
[9] 郭晓波,赵书良,刘军丹,等.基于概念图的关联规则知识表示[J].计算机科学,2013,40(8):261-265
[10] 王泳,邢红杰.对基于知识发现的神经元网络集成方法的研究[J].计算机科学,2006,33(10):189-192
[11] Duda R O,Hart P E,Stork D.Pattern Classification(2nd edi-tion)[M].New York,John Willy,2001
[12] 王泳,胡包钢.应用统计方法综合评估核函数分类能力的研究[J].计算机学报,2008,31(6):942-952
[13] Liberona D,Ruiz M,Fuenzalida D.Customer Knowledge Management in the Age of Social Networks[J].Advances in Intelligent Systems and Computing,2013,172:353-364
[14] 张春霞,张讲社.选择性集成学习算法综述[J].计算机学报,2011,34(8):1399-1410
[15] Pelleg D,Moore A W.X-means:Extending K-means with Efficient Estimation of the Number of Clusters[C]∥Seventeenth International Conference on Machine Learning.2000:727-734

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!