A. Mendelson, Dept. of Electr. Eng., Technion-Israel Inst. of Technol., Haifa, Israel
N. Suri, Dept. of Electr. Eng., Technion-Israel Inst. of Technol., Haifa, Israel
No cache based techniques for roll-forward fault recovery exist at present. A split-cache approach is proposed that provides efficient support for checkpointing and roll-forward fault recovery in distributed systems. This approach obviates the use of discrete stable storage or explicit synchronization among the processors. Stability of the checkpoint intervals is used as a driver for real time operations.
Index Terms:
synchronisation; cache based fault recovery; distributed systems; roll-forward fault recovery; split-cache approach; checkpointing; discrete stable storage; explicit synchronization
Citation:
A. Mendelson, N. Suri, "Cache based fault recovery for distributed systems," iceccs, pp.119, Third IEEE International Conference on Engineering of Complex Computer Systems (ICECCS '97), 1997