Computer Science ›› 2014, Vol. 41 ›› Issue (Z6): 42-46.

Previous Articles     Next Articles

Task Scheduling Algorithm for MapReduce Based on DAG

TANG Yi-tao,HUANG Jing and XIAO Qiu   

  • Online:2018-11-14 Published:2018-11-14

Abstract: Hadoop has been the basic platform of cloud computing research,and MapReduce is the computing mode for distributed processing of big data.For heterogeneous cluster,considering MapReduce’s defects in data distribution,data locality and process of the job execution,we proposed a DAG based MapReduce scheduling algorithm.The algorithm groups the nodes based on their computing ability,transforms MapReduce job execution to DAG model and improves upward ranking to achieve better accuracy and a more reasonable sequencing of task priority.By combining the computing ability of nodes,data locality and cluster utilization,choosing the proper data nodes for task distribution and execution,our algorithm shortens task completion time.The experimental result shows that the proposed algorithm can distribute data reasonably,improve data locality effectively,reduce communication overhead,shorten schedule length of set of job,thus improving utilization of cluster.

Key words: DAG,Scheduling algorithm,MapReduce,Hadoop,Heterogeneous environment,Big data

[1] Dean J,Ghemawat S.MapReduce:Simplified Data Processing on Large Clusters[J].Communications of the ACM,2008,51(1):107-113
[2] Apache Hadoop.Hadoop[EB/OL].http://hadoop.apache.org/,2009-03-06
[3] Vaquero L M,Rodero-Merino L,Caceres J,et al.A Break In the cloud:Towards a Ckoud Definition[J].ACM SIGCOMM Computer Communication Review,2009,39(1):50-55
[4] 陆嘉恒.Hadoop实战(第3版)[M].北京:机械工业出版社,2011
[5] Zaharia M,Borthakur D,Sarma J S,et al.Delay Scheduling:A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling[C]∥Proceedings of the 5th European Conference on Computer Systems.2010:265-278
[6] Xie J,Yin S,Ruan X J,et al.Improving MapReduce Performancethrough Data Placement in Heterogeneous Hadoop Clusters[C]∥IEEE International Symposium on Parallel & Distributed Processing,Workshops and PhdForum.2010:1-9
[7] Zhang X H,Zhong Z Y,Feng S Z,et al.Improving Data Localityof MapReduce by Scheduling in Homogeneous Computing Environments[C]∥IEEE 9th International Symposium on Parallel and Distributed Processing with Applications.2011:120-126
[8] Guo Lei-tao,Sun Hong-wei,et al.A data distribution aware taskscheduling strategy for mapreduce system[C]∥First International Conference on Cloud Computing.2009
[9] Verma A,Cherkasova L,Campbell R.Resource ProvisioningFramework for MapReduce Jobs withPerformance Goals[J].Lecture Notes in Computer Science,2011,9:165-186
[10] Polo J,Carrera D,et al.Performance-driven task co-scheduling for mapreduce environments[C]∥Proc of IEEE/IFIP Network Operations and Management Symposium.2010
[11] Kc K,Anyanwu K.Scheduling Hadoop Jobs to Meet Deadlines[C]∥IEEE Second International Conference on Cloud Computing Technology and Science.2010:388-392
[12] Polo J,Carrera D,Becerra Y,et al.Performance-Driven Task Co-Scheduling for MapReduce Environments[C]∥IEEE proceedings of Network Operations and Management Symposium.2010:373-380
[13] Tang Zhuo,Zhou Jun-qing,Li Ken-li,et al.MTSD:A taskscheduling algorithm for MapReduce base on deadline constraints[C]∥IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum.2012
[14] Zaliva V,Orlov V.Hamake:A Data Flow Approach to DataProcessing in Hadoop[C]∥CLOSER.2012:457-461
[15] Furst S.Challenges in the design of automotive software[C]∥Proceedings of the Conference on Design,Automation and Test in Europe.European Design and Automation Association,2010:256-258
[16] Arabnejad H,Barbosa J.Fairness resource sharing for dynamicworkflow scheduling on Heterogeneous Systems[C]∥Parallel and Distributed Processing with Applications (ISPA),2012IEEE 10th International Symposium on.IEEE,2012:633-639
[17] Klobedanz K,Koenig A,Mueller W.A reconfiguration approach for fault-tolerant flexray networks[C]∥Design,Automation & Test in Europe Conference & Exhibition (DATE),2011.IEEE,2011:1-6

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!