loading...
Locks and barriers in checkpointing and recovery
Chicago, IL, USA April 19-April 22
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/CCGrid.2004.1336601Fourth IEEE International Symposium o ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
R. Badrinath, CSE Dept., Indian Inst. of Technol., Kharagpur, India
C. Morin, Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
Dependency tracking between communicating tasks is an important concept in backward error recovery for parallel applications. One can extend the traditional dependence tracking model for message passing systems to track dependencies between shared memory and task private states for shared memory applications. The objective of this paper is to analyze the issues generated by locks and barriers in parallel applications so that we can checkpoint tasks at any time (even when holding or waiting for locks and barriers). In particular we attempt to extend earlier dependency tracking mechanisms to locks and barriers. We address both coordinated and uncoordinated checkpointing schemes.
Citation:
R. Badrinath, C. Morin, "Locks and barriers in checkpointing and recovery," ccgrid, pp.459-466, Fourth IEEE International Symposium on Cluster Computing and the Grid (CCGrid'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.