Computer Science (计算机科学) ›› 2009, Vol. 36 ›› Issue (9): 248-251.

• Graphics, Image Processing and Architecture •

Performance Evaluation for Fault-tolerant Parallel Algorithm

DU Yun-fei, TANG Yu-hua, YANG Xue-jun

  1. (National Key Laboratory for Parallel and Distributed Processing, School of Computer Science, National University of Defense Technology, Changsha 410073)
  • Online: 2018-11-16 Published: 2018-11-16
  • Funding:
    This work was supported by the National Natural Science Foundation of China (60621003, 60633050, and 60873014) and the National 863 Program (2008AA01Z110).

Abstract: The fault-tolerant parallel algorithm (FTPA) is an application-level technique for tolerating hardware failures; it achieves fast failure recovery through parallel recomputing. Because FTPA adds a fault-tolerance component to the design of a parallel algorithm, evaluating its performance must account for the effect of failures on program performance. This study presents metrics for evaluating the performance of FTPA under failures and builds a performance model that predicts the expected execution time of FTPA. Based on this model, the influence of system parameters on the performance of FTPA is evaluated, including program section execution time, data-saving (checkpointing) cost, failure rate, and the speedup of parallel recomputing.

Key words: Fault-tolerant parallel algorithm, Application completion time, Speedup, Efficiency
