一种大数据放置方法

doi:10.11896/j.issn.1002-137X.2014.06.001

计算机科学 ›› 2014, Vol. 41 ›› Issue (6): 1-4.doi: 10.11896/j.issn.1002-137X.2014.06.001

• 综述 • 下一篇

一种大数据放置方法

张桂刚

清华大学信息技术研究院北京100084 首都经济贸易大学北京100070

出版日期:2018-11-14 发布日期:2018-11-14
基金资助:
本文受高等学校博士学科点专项科研基金课题(20100002110082)资助

A kind of Big Data Placement Method

ZHANG Gui-gang

Online:2018-11-14 Published:2018-11-14

摘要/Abstract

摘要： 数据密集型应用越来越多,如何将大数据在数据中心实现有效放置变得日益重要。研究了大数据的放置模型。影响大数据放置的因素主要有:能耗、异构节点的服务能力及具有关联计算的数据集。基于这3个因素设计了一种节能、充分考虑异构节点服务能力及提升MapReduce处理Join连接的效率的大数据放置模型。该模型将有效实现大数据的有效放置管理,同时也为未来软件定制数据中心奠定了基础。

关键词: 大数据,数据放置,节能,异构节点,关联计算中图法分类号TP391.41文献标识码A

Abstract: More and more data-intensive applications have come into being．It is becoming more and more important for the big data’ efficient placement in the data center．This paper proposed a kind of big data placement model．The major factors that influce the big data placement have the following three points:energy consumption,sevice capability of heterogeneous node and the data sets which have associated computing.Based on these three factors,our big data placement model considers the energy-saving,service capability of heterogeneous node and the complex Join query mapreduce computing so on．This model can implement the big data’s efficient placement management efficiently．At the same time,it will estabilish a foundation for software customized data center in the future.

Key words: Big data,Data placement,Energy-saving,Heterogeneous nodes,Associated computing

张桂刚. 一种大数据放置方法[J]. 计算机科学, 2014, 41(6): 1-4. https://doi.org/10.11896/j.issn.1002-137X.2014.06.001

ZHANG Gui-gang. A kind of Big Data Placement Method[J]. Computer Science, 2014, 41(6): 1-4. https://doi.org/10.11896/j.issn.1002-137X.2014.06.001

参考文献

[1] 覃雄派,王会举,李芙蓉,等.数据管理技术的新格局[J].软件学报,2013,4(2):175-197
[2] Kaushik R T,Bhandarkar M.GreenHDFS:towards an energy-conserving,storage-efficient,hybrid Hadoop compute cluster[C]∥HotPower’10 Proceedings of the 2010International Conference on Power Aware Computing and Systems.2010:1-5
[3] Arasanal R M,Rumani D U.Improving MapReduce Perform-ance through Complexity and Performance Based Data Placement in Heterogeneous Hadoop Clusters[C]∥Distributed Computing and Internet Technology,Lecture Notes in Computer Science．2013,7753:115-125
[4] Xie J,Yin S,Ruan X,et al.Improving MapReduce performance through data placement in heterogeneous Hadoop clusters[C]∥Proceedings of IPDPS Workshops．2010:1-9
[5] 王俊伟．大规模多媒体存储系统中数据放置与调度策略的研究[D].长沙:国防科学技术大学,2005
[6] 林伟伟．一种改进的Hadoop数据放置策略[J]．华南理工大学学报:自然科学版,2012(01):152-158
[7] Mohamed Y,Tian Yuan-yuan,zcan F,et al.CoHadoop:flexi-ble data placement and its exploitation in Hadoop[J].Procee-dings of the VLDB Endowment VLDB2011,2011,4(9):575-585
[8] Dittrich J,Quian’e-Ruiz J A,Jindal A,et al.Hadoop++:Ma-king a yellow elephant run like a cheetah (without it even noticing)[J]．PVLDB2010,2010,3(1/2):518-529
[9] Zhao Yan-rong,Wang Wei-ping,Meng Dan,et al.A data locality optimization algorithm for large-scale data processing in Hadoop[C]∥ISCC 2012．2010:655-661
[10] 赵彦荣,王伟平,孟丹,等.基于Hadoop 的高效连接查询处理算法CHMJ[J].软件学报,2012,3(8):2032-2041

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

一种大数据放置方法

A kind of Big Data Placement Method

PDF (PC)

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 0

Metrics

本文评价

推荐阅读 0