Rejuvenation and Failure Detection in Partitionable Systems
Seoul, Korea December 17-December 19
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/PRDC.2001.992692
Eighth Pacific Rim International Symp ...

Certain gateways (e.g., some cable or DSL modems) are known to have low reliability and low availability. Most failures of these devices can, however, be "fixed" by rejuvenating the device once a failure has been detected. Such a detection-based rejuvenation strategy increases the availability of these gateways. In the scenario considered, rejuvenation is non-trivial because a failed gateway is partitioned away from the network. In particular, network operators who want to rejuvenate these gateways are in a different network partition and therefore cannot initiate a remote rejuvenation.

In this paper we propose a failure-detection-based rejuvenation service and a remote detection service. The rejuvenation service detects and fixes "soft" failures automatically (in one partition), and the detection service detects (in another partition) every rejuvenation exactly once, within a bounded amount of time, even when the gateway is rejuvenated several times in succession. The detection service also allows the detection of "hard" failures and the filtering of notifications of soft failures.

Index Terms:
Failure detection, failure detection based rejuvenation, distributed systems, fault tolerant systems, home networking, remote system management.
Citation:
Christof Fetzer, Karin Högstedt, "Rejuvenation and Failure Detection in Partitionable Systems," prdc, pp.154, Eighth Pacific Rim International Symposium on Dependable Computing (PRDC'01), 2001