In a multiprocessor under normal loading conditions, idle processors over a natural spare capacity. Previous work attempted to utilize this redundancy to overcome the limitations of classic diagnosability and modular redundancy techniques while pro viding significant fault tolerance. A common approach is task duplexing. The usefulness of this approach for critical applications, unfortunately, is seriously undermined by its susceptibility to agreement onfaulty outcomes (malicious agreement). To assess dependability of duplexing under malicious agreement, we propose a stochastic model which dynamically profiles behavior in the presence of malicious faults. The model uses the so-called policy referred to as NMR on demand (NMROD). Each task in a multiprocessor is duplicated, with additional processors allocated for recovery as needed. NMROD relies on a fault model favoring response correctness over actual fault status, and integrates on-line repair to provide non-stop operation over an extended period.
Index Terms:
Malicious agreement, malicious faults, dependabilit y, processor duplex, TMR.
Citation:
M. Al-Hashimi, H.H. Pu, N. Park, F. Lombardi, "Dependability under Malicious Agreement in N-modular Redundancy-on-Demand Systems," nca, pp.0080, IEEE International Symposium on Network Computing and Applications (NCA'01), 2001