Publication Details
Coarse-Grained TMR Soft-Core Processor Fault Tolerance Methods and State Synchronization for Run-Time Fault Recovery
soft-core processor, triple modular redundancy, state synchronization, partial dynamic reconfiguration, SEU mitigation
Triple Modular Redundancy (TMR) applied with various granularity combined with periodical scrubbing of a configuration memory or with run-time Partial Dynamic Reconfiguration (PDR) for fault recovery are one of the most preferred Single Event Upset (SEU) mitigation techniques used by Fault Tolerant Systems (FTS) implemented into SRAM-based FPGAs. Usage of PDR and TMR allows recovering of FTSs from all transient SEU faults and offers run-time fault mitigation compared to scrubbing methods which only correct configuration upsets and are limited by scrubbing period latency. Reconfigurable TMR architecture may require global state maintenance after PDR is applied for fault removal. In such situation, operational state of reconfigured circuit copy needs to be synchronized with remaining circuit copies which were active during PDR. This paper evaluates existing synchronization methods for reconfigurable TMR architectures and soft-core processors; presents our recent research focused on a state synchronization methodology compared to the state of the art methods and further investigates strategy for a state synchronization of TMR-protected soft-core processor neo430.
@INPROCEEDINGS{FITPUB11879, author = "Karel Szurman and Zden\v{e}k Kot\'{a}sek", title = "Coarse-Grained TMR Soft-Core Processor Fault Tolerance Methods and State Synchronization for Run-Time Fault Recovery", pages = "32--35", booktitle = "20th IEEE Latin American Test Symposium (LATS 2019)", year = 2019, location = "Santiago, CL", publisher = "IEEE Computer Society", ISBN = "978-1-7281-1756-0", doi = "10.1109/LATW.2019.8704639", language = "english", url = "https://www.fit.vut.cz/research/publication/11879" }