Study of the Effects of SEU-Induced Faults on a Pipeline Protected Microprocessor
|
This paper presents a detailed analysis of the behavior of a novel, fault-tolerant, 32-bit embedded CPU when compared to a default (non fault-tolerant) implementation of the same processor, during a fault injection campaign of single and double faults. The fault-tolerant processor tested is characterized by per-cycle voting of microarchitectural and the flop-based architectural states, redundancy at the pipeline level and a distributed voting scheme. Its fault-tolerant behavior is characterized for three different workloads from the automotive application domain. The study proposes statistical methods for both the single and dual fault injection campaigns and demonstrates the fault-tolerant capability of both processors in terms of fault latencies, probability of fault manifestation and the behavior of latent faults.
[1] 1585 T.C. May and M.H. Woods, “Alpha-Particle-Induced Soft Errors in Dynamic Memories,” IEEE Trans. Electron Devices, vol. 26, no. 1, 1979.
[2] J.F. Ziegler, H.W. Curtis, H.P. Muhlfeld, C.J. Montrose, B. Chin, M. Nicewicz, C.A. Russell, W.Y. Wang, L.B. Freeman, P. Hosier, L.E. LaFave, J.L. Walsh, J.M. Orro, G.J. Unger, J.M. Ross, T.J. O'Gorman, B. Messina, T.D. Sullivan, A.J. Sykes, H. Yourke, T.A. Enger, V. Tolat, T.S. Scott, A.H. Taber, R.J. Sussman, W.A. Klein, and C.W. Wahaus, “IBM Experiments in Soft Fails in Computer Electronics (1978-1994),” IBM J. Research and Development, vol. 40, no. 1, pp. 3-18, 1996.
[3] R.C. Baumann, “Soft Errors in Advanced Semiconductor Devices —Part I: The Three Radiation Sources,” IEEE Trans. Device and Material Reliability, vol. 1, no. 1, pp. 17-22, 2001.
[4] J.F. Ziegler, “Terrestrial Cosmic Ray Intensities,” IBM J. Research and Development, vol. 42, no. 1, pp. 117-139, 1998.
[5] “EMC for ICs,” http:/www.ic-emc.org/, 2007.
[6] G.E. Moore, “Cramming More Components onto Integrated Circuits,” Electronics, vol. 38, no. 8, 1965.
[7] A.H. Johnston, “Scaling and Technology Issues for Soft Error Rates,” Proc. Fourth Ann. Research Conf. Reliability, Apr. 2000.
[8] E. Dupont, M. Nicolaidis, and P. Rohr, “Embedded Robustness IPs for Transient-Error-Free ICs,” IEEE Design and Test of Computers, vol. 40, no. 3, pp. 56-70, May/June 2002.
[9] C. Constantinescu, “Trends and Challenges in VLSI Circuit Reliability,” IEEE Micro, vol. 23, no. 4, pp. 14-19, July-Aug. 2003.
[10] R. Baumann, “Soft Errors in Advanced Computer Systems,” IEEE Design and Test of Computers, vol. 22, no. 3, pp. 258-266, May/June 2005.
[11] S. Buchner, M. Baze, D. Brown, D. McMorrow, and J. Melinger, “Comparison of Error Rates in Combinational and Sequential Logic,” IEEE Trans. Nuclear Science, vol. 44, no. 6, pp. 2209-2216, 1997.
[12] F. Irom and F.F. Farmanesh, “Frequency Dependence of Single-Event Upset in Advanced Commercial PowerPC Microprocessors,” IEEE Trans. Nuclear Science, vol. 51, no. 6, pp. 3505-3509, 2004.
[13] P.-C. Li and T.K. Young, “Electromigration: The Time Bomb in Deep-Submicron ICs,” IEEE Spectrum, vol. 33, no. 9, pp. 75-78, Sept. 1996.
[14] “The LEON2 Processor User's Manual,” http:/www.gaisler.com, 2007.
[15] E. Touloupis, J.A. Flint, V.A. Chouliaras, and D.D. Ward, “A Fault-Tolerant Processor Core Architecture for Safety-Critical Automotive Applications,” SAE 2005 Trans. J. Passenger Cars: Electronic and Electrical Systems, pp. 1-6, Feb. 2006.
[16] J. Gaisler, “A Portable and Fault-Tolerant Microprocessor Based on the SPARC v8 Architecture,” Proc. Int'l Conf. Dependable Systems and Networks (DSN '02), pp. 409-415, 2002.
[17] F. Faure, R. Velazco, M. Violante, M. Rebaudengo, and M.S. Reorda, “Impact of Data Cache Memory on the Single-Event-Upset-Induced Error Rate of Microprocessors,” IEEE Trans. Nuclear Science, vol. 50, no. 6, pp. 2101-2106, 2003.
[18] M. Rebaudengo, M.S. Reorda, and M. Violante, “Accurate Analysis of Single Event Upsets in a Pipelined Microprocessor,” J. Electronic Testing: Theory and Applications, vol. 19, no. 5, pp. 577-584, 2003.
[19] D. Gil, R. Martínez, J.V. Busquets, J.C. Baraza, and P.J. Gil, “Fault Injection into VHDL Models: Experimental Validation of a Fault Tolerant Microcomputer System,” Proc. Third European Dependable Computing Conf. (EDCC '99), pp. 191-208, 1999.
[20] N.J. Wang, J. Quek, T.M. Rafacz, and S.J. Patel, “Characterizing the Effects of Transient Faults on a High-Performance Processor Pipeline,” Proc. 2004 Int'l Conf. Dependable Systems and Networks (DSN), 2004.
[21] M. Rimén, J. Ohlsson, and J. Karlsson, “Experimental Evaluation of Control Flow Errors,” Proc. 1995 Pacific Rim Int'l Symp. Fault Tolerant Systems (PRFTS), 1995.
[22] R. Velazco, S. Rezgui, and H. Ziade, “Assessing the Soft Error Rate of Digital Architectures Devoted to Operate in Radiation Environment: A Case Studied,” J. Electronic Testing: Theory and Applications, vol. 19, no. 1, pp. 83-90, 2003.
[23] R. Velazco, S. Rezgui, and R. Ecoffet, “Predicting Error Rate for Microprocessor-Based Digital Architectures through C.E.U. (Code Emulating Upsets) Injection,” IEEE Trans. Nuclear Science, vol. 47, no. 6, pp. 2405-2411, 2000.
[24] M.S. Reorda and M. Violante, “Efficient Analysis of Single Event Transients,” J. Systems Architecture, vol. 50, no. 5, pp. 239-246, 2004.
[25] Z. Segall, D. Vrsalovic, D. Siewiorek, D. Yaskin, J. Kownacki, J. Barton, D. Rancey, A. Robinson, and T. Lin, “FIAT: Fault Injection Based Automated Testing Environment,” Proc. 18th Int'l Symp. Fault-Tolerant Computing (FTCS-18), pp. 102-107, 1988.
[26] W. Kao, R.K. Iyer, and D. Tang, “FINE: A Fault Injection and Monitoring Environment for Tracing the UNIX System Behavior under Faults,” IEEE Trans. Software Eng., vol. 19, no. 11, pp. 1105-1118, Nov. 1993.
[27] S. Han, K.G. Shin, and H.A. Rosenberg, “DOCTOR: An Integrated Software Fault Injection Environment for Distributed Real-Time Systems,” Proc. Int'l Computer Performance and Dependability Symp. (IPDS '95), pp. 204-213, 1995.
[28] G.A. Kanawati, N.A. Kanawati, and J.A. Abraham, “FERRARI: A Flexible Software-Based Fault and Error Injection System,” IEEE Trans. Computers, vol. 44, no. 2, pp. 248-260, Feb. 1995.
[29] J. Carreira, H. Madeira, and J. Gabriel Silva, “Xception: A Technique for the Experimental Evaluation of Dependability in Modern Computers,” Software Eng., vol. 24, no. 2, pp. 125-136, 1998.
[30] J. Arlat, M. Aguera, L. Amat, Y. Crouzet, J.-C. Fabre, J.-C. Laprie, E. Martins, and D. Powell, “Fault Injection for Dependability Validation: A Methodology and Some Applications,” IEEE Trans. Software Eng., vol. 16, no. 2, pp. 166-182, 1990.
[31] H. Madeira, M.Z. Rela, F. Moreira, and J.G. Silva, “RIFLE: A General Purpose Pin-Level Fault Injector,” Proc. First European Dependable Computing Conf. (EDCC '94), pp. 199-216, 1994.
[32] J.K.U. Gunneflo and J. Torin, “Evaluation of Error Detection Schemes Using Fault Injection by Heavy-Ion Radiation,” Proc. 19th Int'l Symp. Fault Tolerant Computing, pp. 340-347, 1989.
[33] J. Karlsson, P. Liden, P. Dahlgren, R. Johansson, and U. Gunneflo, “Using Heavy-Ion Radiation to Validate Fault-Handling Mechanisms,” IEEE Micro, vol. 14, no. 1, pp. 8-11, 13-23, Feb. 1994.
[34] J. Karlsson, P. Folkesson, J. Arlat, Y. Crouzet, G. Leber, and J. Reisinger, “Application of Three Physical Fault Injection Techniques to the Experimental Assessment of the MARS Architecture,” Proc. Fifth IFIP Working Conf. Dependable Computing for Critical Applications (DCCA-5), pp. 267-287, 1995.
[35] J.R. Samson, W. Moreno, and F. Falquez, “A Technique for Automated Validation of Fault Tolerant Designs Using Laser Fault Injection (LFI),” Proc. 28th Int'l Symp. Fault Tolerant Computing, pp.162-167, 1998.
[36] J. Aidemark, J. Vinter, P. Folkesson, and J. Karlsson, “GOOFI: Generic Object-Oriented Fault Injection Tool,” Proc. Int'l Conf. Dependable Systems and Networks (DSN '01), 2001.
[37] N.A. Kanawati, G. Kanawati, and J. Abraham, “Dependability Evaluation Using Hybrid Fault/Error Injection,” Proc. IEEE Int'l Computer Performance and Dependability Symp. (IPDS '95), p. 0224, 1995.
[38] P.L. Civera, L. Macchiarulo, M. Rebaudengo, M. SonzaReorda, and M. Violante, “Exploiting FPGA for Accelerating Fault Injection Experiments,” Proc. IEEE On-Line Testing Workshop (IOLTW '01), pp. 9-13, 2001.
[39] G.S. Choi and R.K. Iyer, “FOCUS: An Experimental Environment for Fault Sensitivity Analysis,” IEEE Trans. Computers, vol. 41, no. 12, pp. 1515-1526, Dec. 1992.
[40] E. Jenn, J. Arlat, M. Rimén, J. Ohlsson, and J. Karlsson, “Fault Injection into VHDL Models: The MEFISTO Tool,” Proc. 24th Ann. Int'l Symp. Fault-Tolerant Computing, pp. 66-75, 1994.
[41] T.A. Delong, B.W. Johnson, and J.A. Profeta III, “A Fault Injection Technique for VHDL Behavioral-Level Models,” IEEE Design and Test of Computers, vol. 13, no. 4, pp. 24-33, Winter 1996.
[42] K.K. Goswami, R.K. Iyer, and L.T. Young, “DEPEND: A Simulation-Based Environment for System Level Dependability Analysis,” IEEE Trans. Computers, vol. 46, no. 1, pp. 60-74, Jan. 1997.
[43] B. Parrotta, M. Rebaudengo, M. SonzaReorda, and M. Violante, “New Techniques for Accelerating Fault Injection in VHDL Descriptions,” Proc. Int'l On-Line Test Workshop (IOLTW '00), pp.61-66, 2000.
[44] S. Mitra, N.R. Saxena, and E.J. McCluskey, “A Design Diversity Metric and Analysis of Redundant Systems,” IEEE Trans. Computers, vol. 51, no. 5, pp. 498-510, May 2002.
[45] E. Touloupis, J.A. Flint, V.A. Chouliaras, and D.D. Ward, “Modelling Multiple Faults in Fault-Tolerant Processor Architectures,” IEE Electronics Letters, vol. 41, no. 21, pp. 1162-1163, 2005.
[46] M.R. Guthaus, J.S. Ringenberg, D. Ernst, T.M. Austin, T. Mudge, and R.B. Brown, “MiBench: A Free, Commercially Representative Embedded Benchmark Suite,” Proc. Fourth IEEE Ann. Workshop Workload Characterization, Dec. 2001.
[47] The SPARC Architecture Manual Version 8, http://www.sparc.org/standardsV8.pdf, 2007.
Index Terms:
fault injection, fault modeling and simulation, SEU, soft error, microprocessor test, fault tolerance
Citation:
Emmanuel Touloupis, James A. Flint, Vassilios A. Chouliaras, David D. Ward, "Study of the Effects of SEU-Induced Faults on a Pipeline Protected Microprocessor," IEEE Transactions on Computers, vol. 56, no. 12, pp. 1585-1596, June 2007, doi:10.1109/TC.2007.70766