loading...
Evaluating Distributed Checkpointing Protocol
Providence, Rhode Island May 19-May 22
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDCS.2003.120347523rd IEEE International Conference on ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Adnan Agbaria, University of Illinois at Urbana-Champaign
Ari Freund, Technion - Israel Institute of Technology
Roy Friedman, Technion - Israel Institute of Technology
This paper presents an objective measure, called overhead ratio, for evaluating distributed checkpointing protocols. This measure extends previous evaluation schemes by incorporating several additional parameters that are inherent in distributed environments. In particular, we take into account the rollback propagation of the protocol, which impacts the length of the recovery process, and therefore the expected program run-time in executions that involve failures and recoveries. The paper also analyzes several known protocols and compares their overhead ratio.
Citation:
Adnan Agbaria, Ari Freund, Roy Friedman, "Evaluating Distributed Checkpointing Protocol," icdcs, pp.266, 23rd IEEE International Conference on Distributed Computing Systems (ICDCS'03), 2003
Usage of this product signifies your acceptance of the Terms of Use.