loading...
A Reinforcement-Learning Approach to Failure-Detection Scheduling
Portland, Oregon, USA October 11-October 12
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/QSIC.2007.7Seventh International Conference on Q ...
 This Article 
 
PDF
HTML
IEEE Xplore Subscribers
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Fancong Zeng, BEA Systems, Inc.
A failure-detection scheduler for an online production system must strike a tradeoff between performance and reli- ability. If failure-detection processes are run too frequently, valuable system resources are spent checking and recheck- ing for failures. However, if failure-detection processes are run too rarely, a failure can remain undetected for a long time. In both cases, system performability suffers. We present a model-based learning approach that estimates the failure rate and then performs an optimization to find the tradeoff that maximizes system performability. We show that our approach is not only theoretically sound but prac- tically effective, and we demonstrate its use in an imple- mented automated deadlock-detection system for Java.
Citation:
Fancong Zeng, "A Reinforcement-Learning Approach to Failure-Detection Scheduling," qsic, pp.161-170, Seventh International Conference on Quality Software (QSIC 2007), 2007
Usage of this product signifies your acceptance of the Terms of Use.