计算机科学 ›› 2020, Vol. 47 ›› Issue (6A): 450-456.doi: 10.11896/JsJkx.190700143

• 数据库 & 大数据 & 数据科学 • 上一篇    下一篇

基于判断聚合的分布式数据挖掘分类算法研究

李莉   

  1. 西南政法大学行政法学院 重庆 401120
  • 发布日期:2020-07-07
  • 通讯作者: 李莉(395770202@qq.com)
  • 基金资助:
    国家社科基金项目(18BZX133);西南政法大学校级项目(2016XZQN-20)

Classification Algorithm of Distributed Data Mining Based on Judgment Aggregation

LI Li   

  1. School of Administrative law,Southwest University of Political Science and Law,Chongqing 401120,China
  • Published:2020-07-07
  • About author:LI Li, born in 1982, Ph. D, lecturer.Her research interests include modern logic and artificial intelligence.
  • Supported by:
    This work was supported by the National Social Science Foundation of China (18BZX133) and University Level ProJect of Southwest University of Political Science and Law (2016XZQN-20).

摘要: 随着互联网的发展和云计算技术的广泛应用,许多数据存储在不同的服务器上,分布式数据挖掘技术应运而生。智能agent在各自的站点上得到部分挖掘结果,分布式数据挖掘可以将这些部分的挖掘结果聚合成为全局的结果。文中主要处理的是分布式数据挖掘过程中的分类问题,针对一些特征的数据分别存储于不同的数据源上,提出了一种基于判断聚合模型的分类算法。该算法中每一个agent要对一个案例属于某一个目标类的可能性进行判断,然后利用判断聚合模型将这些agent的判断进行聚合,形成全局的分类结果。基于判断聚合模型的分类算法将逻辑和社会选择理论的技术应用于解决分布式数据挖掘的分类问题,这种新的算法不需要大规模地传输和转化数据,节省了传输成本,提高了分类效率,同时有效地保护了数据的安全性。

关键词: 多主体系统, 分布式数据挖掘, 逻辑, 判断聚合模型, 算法

Abstract: With the development of Internet and the wide application of cloud computing,many data sets are stored on different servers,and the distributed data mining comes into being.Each agent gets partial data mining results on its respective site,and distributed data mining could aggregate this part of mining results into a global decision.This paper is focused on the classification issue in the process of distributed data mining.Aiming at some specific data are stored in difference data source,this paper puts forward a classification algorithm based on the Judgment aggregation model.Each agent should give its Judgment whether a new case belongs to a certain target class,and then use the Judgment aggregation model to aggregate the Judgments of these agents to form a global classification.This algorithm combines logic and social choice theory technologies together and applies them to the classification problem in distributed data mining.It doesn’t need to transfer and transform the data on a large scale,thus saving the transmission cost and improving the efficiency of classification.At the same time,it effectively protects the data security.

Key words: Algorithm, Distributed data mining, Judgment aggregation model, Logic, Multi-agent system

中图分类号: 

  • TP18
[1] CAO L,WEISS G,YU P S.A brief introduction to agent mining .Autonomous Agent and Multi-Agent Systems,2012,25(3):419-424.
[2] MODI P J,SHEN W M.Collaborative multiagent learning for classification tasks//Proceedings of the Fifth International Conference on Autonomous Agent.ACM,2001:37-38.
[3] SANTANA L E A,CANUTO A M P,JUNIOR J C X,et al.A comparative analysis of data distribution methods in an agent-based neural system for classification tasks//Sixth International Conference on Hybrid Intelligent Systems(HIS’06).IEEE,2006:9-9.
[4] KARGUPTA H,HUANG W,SIVAKUNMAR K,et al.Distributed clustering using collective principal component analysis .Knowledge and Information Systems,2001,3(4):422-448.
[5] PARK B H,KARGUPTA H.Distributed data mining:Algorithms,systems,and applications //Data Mining Handbook.2002. MATATOV N,ROKACH L,MAIMON O.Privacy-preserving data mining:A feature set partitioning approach .Information Sciences,2010,180(14):2696-2720.
[6] GAERTNER W.A primer in social choice theory:Revised edition .Oxford University Press,2009.
[7] BRANDT F,CONITZER V,ENDRISS U,et al.Handbook of Computational Social Choice.Cambridge University Press,2016.
[8] ARROW K J.Social choice and individual values.Yale University Press,1963.
[9] KORNHAUSER L A,SAGER L G.Unpacking the court .The Yale Law Journal,1986,96(1):82-117.
[10] LIST C,POLAK B.Introduction to Judgment aggregation.Journal of economic theory,2010,145(2):441-466.
[11] LANG J,PIGOZZI G,SLAVKOVIK M,et al.Judgment aggregation rules based on minimization//Proceedings of the 13th Conference on Theoretical Aspects of Rationality and Know-ledge.ACM,2011:238-246.
[12] LANG J,SLAVKOVIK M.How hard is it to compute maJority-preserving Judgment aggregation rules//Proceedings of the Twenty-first European Conference on Artificial Intelligence.IOS Press,2014:501-506.
[13] LANG J,PIGOZZI G,SLAVKOVIK M,et al.A partial taxonomy of Judgment aggregation rules and their properties .Social Choice and Welfare,2017,48(2):327.
[14] LANG J,MENGIN J,XIA L.Voting on multi-issue domains with conditionally lexicographic preferences .Artificial Intelligence,2018,265:18-44.
[15] LI L,TANG X.A solution to the cold-start problem in recommender systems based on social choice theory //Intelligent and Evolutionary Systems.Springer,Cham,2016:267-279.
[16] LI L I,NIU B,TANG S X.Online marketing research based on social choice theory and cloud computing//2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing (UCC).IEEE,2015:391-396.
[1] 刘鑫, 王珺, 宋巧凤, 刘家豪.
一种基于AAE的协同多播主动缓存方案
Collaborative Multicast Proactive Caching Scheme Based on AAE
计算机科学, 2022, 49(9): 260-267. https://doi.org/10.11896/jsjkx.210800019
[2] 宁晗阳, 马苗, 杨波, 刘士昌.
密码学智能化研究进展与分析
Research Progress and Analysis on Intelligent Cryptology
计算机科学, 2022, 49(9): 288-296. https://doi.org/10.11896/jsjkx.220300053
[3] 姜洋洋, 宋丽华, 邢长友, 张国敏, 曾庆伟.
蜜罐博弈中信念驱动的攻防策略优化机制
Belief Driven Attack and Defense Policy Optimization Mechanism in Honeypot Game
计算机科学, 2022, 49(9): 333-339. https://doi.org/10.11896/jsjkx.220400011
[4] 柴慧敏, 张勇, 方敏.
基于特征相似度聚类的空中目标分群方法
Aerial Target Grouping Method Based on Feature Similarity Clustering
计算机科学, 2022, 49(9): 70-75. https://doi.org/10.11896/jsjkx.210800203
[5] 陈俊, 何庆, 李守玉.
基于自适应反馈调节因子的阿基米德优化算法
Archimedes Optimization Algorithm Based on Adaptive Feedback Adjustment Factor
计算机科学, 2022, 49(8): 237-246. https://doi.org/10.11896/jsjkx.210700150
[6] 刘卫明, 安冉, 毛伊敏.
基于聚类和WOA的并行支持向量机算法
Parallel Support Vector Machine Algorithm Based on Clustering and WOA
计算机科学, 2022, 49(7): 64-72. https://doi.org/10.11896/jsjkx.210500040
[7] 唐枫, 冯翔, 虞慧群.
基于自适应知识迁移与资源分配的多任务协同优化算法
Multi-task Cooperative Optimization Algorithm Based on Adaptive Knowledge Transfer andResource Allocation
计算机科学, 2022, 49(7): 254-262. https://doi.org/10.11896/jsjkx.210600184
[8] 张翀宇, 陈彦明, 李炜.
边缘计算中面向数据流的实时任务调度算法
Task Offloading Online Algorithm for Data Stream Edge Computing
计算机科学, 2022, 49(7): 263-270. https://doi.org/10.11896/jsjkx.210300195
[9] 李瑭, 秦小麟, 迟贺宇, 费珂.
面向多无人系统的安全协同模型
Secure Coordination Model for Multiple Unmanned Systems
计算机科学, 2022, 49(7): 332-339. https://doi.org/10.11896/jsjkx.210600107
[10] 单晓英, 任迎春.
基于改进麻雀搜索优化支持向量机的渔船捕捞方式识别
Fishing Type Identification of Marine Fishing Vessels Based on Support Vector Machine Optimized by Improved Sparrow Search Algorithm
计算机科学, 2022, 49(6A): 211-216. https://doi.org/10.11896/jsjkx.220300216
[11] 李丹丹, 吴宇翔, 朱聪聪, 李仲康.
基于多种改进策略的改进麻雀搜索算法
Improved Sparrow Search Algorithm Based on A Variety of Improved Strategies
计算机科学, 2022, 49(6A): 217-222. https://doi.org/10.11896/jsjkx.210700032
[12] 王文强, 贾星星, 李朋.
自适应的集成定序算法
Adaptive Ensemble Ordering Algorithm
计算机科学, 2022, 49(6A): 242-246. https://doi.org/10.11896/jsjkx.210200108
[13] 刘宝宝, 杨菁菁, 陶露, 王贺应.
基于DE-LSTM模型的教育统计数据预测研究
Study on Prediction of Educational Statistical Data Based on DE-LSTM Model
计算机科学, 2022, 49(6A): 261-266. https://doi.org/10.11896/jsjkx.220300120
[14] 肖治鸿, 韩晔彤, 邹永攀.
基于多源数据和逻辑推理的行为识别技术研究
Study on Activity Recognition Based on Multi-source Data and Logical Reasoning
计算机科学, 2022, 49(6A): 397-406. https://doi.org/10.11896/jsjkx.210300270
[15] 高文龙, 周天阳, 朱俊虎, 赵子恒.
基于双向蚁群算法的网络攻击路径发现方法
Network Attack Path Discovery Method Based on Bidirectional Ant Colony Algorithm
计算机科学, 2022, 49(6A): 516-522. https://doi.org/10.11896/jsjkx.210500072
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!