Publication Details
Towards a State Synchronization Methodology for Recovery Process after Partial Reconfiguration of Fault Tolerant Systems
Mičulka Lukáš, Ing. (DCSY FIT BUT)
Kotásek Zdeněk, doc. Ing., CSc. (DCSY FIT BUT)
state synchronization, recovery, partial dynamic reconfiguration, fault tolerance, FPGA
The paper describes essential points of new methodology for design, implementation and evaluation of state synchronization methods for fault tolerant system implemented on SRAM FPGA after its recovery through partial dynamic reconfiguration. Essential problems which must be satisfied by each synchronization method are described. Then, dynamic and static parameters of synchronization methods which can have direct effect on target fault tolerant system are defined. Basic principles of presented methodology are verified on implementation of specific synchronization method for reconfigurable fault tolerant CAN bus control system.
Modern fault tolerant systems implemented into FPGAs integrate very often hardware redundancy together with fault tolerant approaches based on active fault recovery and the system reconfiguration. Space and safety-critical applications are examples of systems where the principles of fault tolerance and recovery techniques have increasing importance. Except of fault-masking behavior and FPGA partial reconfiguration, also the synchronization of reconfigured circuit copy with remaining circuits which are during the recovery process still operating, is an integral part of the recovery process in these systems. The synchronization process is closely related to the system architecture, specific requirements and functionality. Our aim is to propose specific methodology to design and implement the most suitable synchronization procedure for the recovery of target fault tolerant system. In this paper, basic principles of our synchronization methodology are described together with generic architecture for synchronization in fault tolerant systems, which was designed for reconfigurable fault tolerant CAN bus control system. This system and performed experiments are in the paper described as well.
@INPROCEEDINGS{FITPUB10793, author = "Karel Szurman and Luk\'{a}\v{s} Mi\v{c}ulka and Zden\v{e}k Kot\'{a}sek", title = "Towards a State Synchronization Methodology for Recovery Process after Partial Reconfiguration of Fault Tolerant Systems", pages = "231--236", booktitle = "9th IEEE International Conference on Computer Engineering and Systems", year = 2014, location = "K\'{a}hira, EG", publisher = "IEEE Computer Society", ISBN = "978-1-4799-6594-6", doi = "10.1109/ICCES.2014.7030963", language = "english", url = "https://www.fit.vut.cz/research/publication/10793" }