计算机科学 ›› 2017, Vol. 44 ›› Issue (Z11): 407-410.doi: 10.11896/j.issn.1002-137X.2017.11A.086

• 大数据与数据挖掘 • 上一篇    下一篇

面向大数据的多维粒矩阵关联分析及应用

吴珺,王春枝   

  1. 湖北工业大学计算机学院 武汉430068;武汉理工大学交通物联网技术湖北省重点实验室 武汉430070,湖北工业大学计算机学院 武汉430068
  • 出版日期:2018-12-01 发布日期:2018-12-01
  • 基金资助:
    本文受国家自然科学基金(61602161),湖北省自然基金(2014CFB590),交通物联网技术湖北省重点实验室(2015III015-A03)资助

Multiple Correlation Analysis and Application of Granular Matrix Based on Big Data

WU Jun and WANG Chun-zhi   

  • Online:2018-12-01 Published:2018-12-01

摘要: 当前日益增长的大数据备受青睐,大数据的核心是数据分析。然而聚焦大数据的动态、多维特性,传统数据分析方法难以获取可靠且准确的分析结果,数据分析方法面临着重要的发展机遇和严峻的挑战。对动态大数据的多维关联性分析问题进行研究和探讨,以动态大数据为研究对象,以粒计算(Granular Computing,GrC)理论为研究基础,提出粒矩阵思想,研究构建面向动态大数据的粒矩阵方法,分析粒矩阵的逻辑约简运算,确定了基于粒矩阵的动态大数据多维关联性分析模型。本文旨在为高效利用动态大数据进行多维关联性分析和揭示数据隐含的客观规律提供科学依据,对大数据的可持续发展也具有重要意义。

关键词: 大数据,粒计算,多维关联分析

Abstract: The ever increasing big data is acclaimed,and the key point of big data is data analysis.However focusing on the big data with dynamic and multiple-dimensional characteristic,it is difficult for traditional data analysis methods to obtain reliable and accurate analytical results.Therefore there is an important opportunity and a great challenge for the data analysis methods to be developed.It aims to make an important research and investigation of the multiple correlation analysis for dynamic big data.The research object is dynamic big data,and the research is based on the theory of granular computing (GrC).We got the theoretical thought of granular matrix completely new in this paper.It was expected to reveal the multiple correlation analysis for dynamic big data.On one hand the achievements of this research would provide a scientific basis for multiple correlation analysis and revelation of the objective law in big data area.On the other hand it is also an important implication for sustainable development of big data.

Key words: Big data,Granular computing,Multiple correlation analysis

[1] http://www.ibm.com.
[2] http://www.google.com.
[3] KUMAR A,NIU F,R C.Hazy:making it easier to build and maintain big-data analytics[J].Communications of the ACM,2013,56(3):40-49.
[4] GUO P,WANG K,LUO A L,et al.Computational Intelligence for Big Data Analysis:Current Status and Future Prospect[J].Journal of Software,2015,6(11):3010-3025.
[5] BENNETT P,GILES L,HALEVY A,et al.Channeling the de-luge:research challenges for big data and information systems[C]∥Proceedings of the 22nd ACM International Conference on Conference on Information & Knowledge Management.ACM,2013:2537-2538.
[6] LU C W,HSIEH C M,CHANG C H,et al.An Improvement to Data Service in Cloud Computing with Content Sensitive Tran-saction Analysis and Adaptation[C]∥2013 IEEE 37th Annual Computer Software and Applications Conference Workshops.2013:463-468.
[7] YANG Q.Big data,lifelong machine learning and transfer lear-ning[C]∥Proceedings of the Sixth ACM International Confe-rence on Web Search and Data Mining.ACM,2013:505-506.
[8] ORDONEZ C.Can we analyze big data inside a DBMS[C]∥Proceedings of the Sixteenth International Workshop on Data Warehousing and OLAP.ACM,2013:85-92.
[9] NGUYEN D,VO B.Mining Class-Association Rules with Constraints[M]∥Knowledge and Systems Engineering.Springer International Publishing,2014:307-318.
[10] 李志杰,李元香,王峰,等.面向大数据分析的在线学习算法综述[J].计算机研究与发展,2015,52(8):1707-1721.
[11] 张蕾,章毅.大数据分析的无限深度神经网络方法[J].计算机研究与发展,2016,53(1):68-79.
[12] 罗军舟,王兴伟,尹浩.大数据驱动网络科学研究专题前言[J].计算机研究与发展,2015,52(4):777-782.
[13] 陈世敏.大数据分析与高速数据更新[J].计算机研究与发展,2015,52(2):333-342.
[14] 王文剑,于剑,高阳.面向大数据的人工智能技术专题前言[J].计算机研究与发展,2015,52(8):1705-1706.
[15] 孟小峰,杜治娟.大数据融合研究:问题与挑战[J].计算机研究与发展,2016,53(2):231-246.
[16] AGRAWAL R,SRIKANT R.Fast Algorithms for Mining Association Rules in Large Databases[C]∥Proceedings of the 20th International Conference on Very Large Data Bases.Morgan Kaufmann,1994:487-499.
[17] HAN J W,KOPERSKI K.Discovery of spatial association rules in geographic information databases[C]∥Proceedings of the 4th International Symposium on Advances in Spatial Databases,Maine.USA,1995:47-66.
[18] HOUTSMA M,SWAMI A.Set-oriented mining for association rules in relational databases[C]∥Proceedings of the Eleventh International Conference on Data Engineering.1995:25-33.
[19] CHEN C,YAN X F,ZHU F D,et al.Graph OLAP:a multi-dimensional framework for graph data analysis[J].Knowledge and Information Systems,2009,21(1):41-63.
[20] 何清,李宁,罗文娟,等.大数据下的机器学习算法综述[J].模式识别与人工智能,2014,7(4):327-337.
[21] 沈斌,姚敏.一种新的动态关联规则及其挖掘算法[J].控制与决策,2009,24(9):1310-1315.
[22] 李玲娟,张敏.云计算环境下关联规则挖掘算法的研究[J].计算机技术与发展,2011,21(2):43-46.
[23] 杨勇,王伟.一种基于 MapReduce 的并行 FP-growth 算法[J].重庆邮电大学学报(自然科学版),2013,25(5):651-659.
[24] BOUKOUVALA F,DUBEY A,et al.Computational Approa-ches for Studying the Granular Dynamics of Continuous Blen-ding Processes,2-Population Balance and Data-Based Methods[J].Macromolecular Materials and Engineering,2013,297(1):9-19.
[25] LIN T Y.Granular computing[M]∥Rough Sets,Fuzzy Sets,Data Mining,and Granular Computing.Springer Berlin Heidelberg,2003:16-24.
[26] ZADEH L A.Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic[J].Fuzzy Sets System,1997,90(2):111-127.
[27] PAWLAK Z.Rough sets[J].International Journal of Computer and Science,1982,11:341-356.
[28] 张铃,张钹.模糊商空间理论(模糊粒度计算方法)[J].软件学报,2003,14(4):770-776.
[29] 徐计,王国胤,于洪.基于粒计算的大数据处理[J].计算机学报,2014,7(11):1-22.
[30] 王国胤,张清华.不同知识粒度下粗糙集的不确定性研究[J].计算机学报,2008,1(9):1588-1598.
[31] YAO Y.Perspectives of granular computing[C]∥2005 IEEE International Conference on Granular Computing.2005:85-90.
[32] YAO Y Y.Granular computing:basic issues and possible solutions[C]∥Proceedings of the Fifth Joint Conference on Information Sciences.2000:186-189.[32]张清华,王国胤,胡军.多粒度知识获取与不确定性度量[M].北京:科学出版社,2013.
[33] 苗夺谦,王国胤,刘清.粒计算:过去、现在与展望[M].北京:科学出版社,2007.
[34] 王国胤,李德毅,姚一豫,等.云模型与粒计算[M].北京:科学出版社,2012.
[35] 张钹,张铃.粒计算未来发展方向探讨[J].重庆邮电大学学报(自然科学版),2010(5):538-540.
[36] 钟珞,吴珺.粒度计算在数据仓库挖掘中的应用[J].华中师范大学学报(自然科学版),2009,43(3):392-395.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!