Computer Science ›› 2011, Vol. 38 ›› Issue (5): 287-289.

Previous Articles     Next Articles

Cache-style Parallel Checkpointing for Large-scale Computing System

LIU Yong-yan,LIU Yong-peng,FENG Hua,CHI Wan-qing   

  • Online:2018-11-16 Published:2018-11-16

Abstract: Checkpointing is a typical technique for fault tolerance, whereas its scalability is limited by the overhead of file access. According to the multi level file system architecture, the cache-style parallel checkpointing was introduced,which translates global coordinated checkpointing into local file operation by out of-order pipelining of checkpoint flushing opportunity. The overhead of writcback is hidden effectively to increase the performance and the scalability of parallel checkpointing.

Key words: Cachcstylc checkpointing, Parallel computing, Multi-level file system, Multi-processor, Out-of-order pipeline

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!