loading...
Towards Easy-to-Use Checkpointing of MPI Applications within CLUSTERIX
Dresden, Germany September 07-September 10
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/PCEE.2004.72International Conference on Parallel ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Pawel Czarnul, Gdansk University of Technology, Poland
Arkadiusz Urbaniak, Gdansk University of Technology, Poland
Marcin Fraczak, Gdansk University of Technology, Poland
Maciej Dyczkowski, Wroclaw University of Technology
Bartlomiej Balcerek, Wroclaw University of Technology
While there exist many kernel and user level libraries/systems which support checkpointing working processes and resuming their operations, it is still very difficult to provide an easy-to-use tool to assist checkpointing parallel applications. In this work, we aim at the development of an easy-to-use user-guided library to support checkpointing parallel MPI applications to be executed within the CLUSTERIX environment i.e. a collection of distributed HPC clusters. We propose a programmer-assisted approach with process state packing and unpacking at the code level for SPMD HPC applications. Although the library is in its early stage of development we present checkpoint/restart times and application execution (interrupted by checkpointing) times for the proposed approach compared to the same application linked with the ckpt user level library.
Index Terms:
Process Checkpointing, Checkpointing Parallel Applications, Parallel Software Environments
Citation:
Pawel Czarnul, Arkadiusz Urbaniak, Marcin Fraczak, Maciej Dyczkowski, Bartlomiej Balcerek, "Towards Easy-to-Use Checkpointing of MPI Applications within CLUSTERIX," parelec, pp.390-393, International Conference on Parallel Computing in Electrical Engineering, (PARELEC'04), 2004
Usage of this product signifies your acceptance of the Terms of Use.


Suggestions