Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs
description
Transcript of Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs
Chandrasekhar 1 MAPLD 2005/204
Reduced Triple Modular Redundancy for Tolerating SEUs in SRAM based FPGAs
Vikram Chandrasekhar, Sk. Noor Mahammad, V. Muralidharan
Dr. V. Kamakoti
Department of Computer Science and Engineering
Indian Institute of Technology Madras, India
Dr. N. Vijaykrishnan
Department of Computer Science and Engineering
Pennsylvania State University, U.S.A.
MAPLD 2005
Chandrasekhar 2 MAPLD 2005/204
Outline of the talk
Single Event Upsets (SEUs) in FPGAs Motivation Signal probability propagation for LUTs Sensitivity of LUTs Reduced Triple Modular Redundancy (RTMR) SEU simulator Experimental Results Conclusions
MAPLD 2005
Chandrasekhar 3 MAPLD 2005/204
Single Event Upsets (SEUs)
• Circuit errors caused due to excess charge carriers induced primarily by external radiations
• Cause an upset event but the circuit itself is not damaged
MAPLD 2005
Chandrasekhar 4 MAPLD 2005/204
SEUs in LUTs
• Single Event Upset changes the function stored in LUT
• Behavior of the circuit configured on to the FPGA is modified
MAPLD 2005
Chandrasekhar 5 MAPLD 2005/204
SEUs in a Switch Matrix
Fault-free switch matrix
Formation of a new net Two nets are shorted
Deletion of a net
MAPLD 2005
Chandrasekhar 6 MAPLD 2005/204
Motivation
• TMR hardened design takes 200% extra area• Certain logic blocks can halt the SEU propagation
in the circuit• Focus is on the Boolean network of LUTs
obtained after technology mapping• LUTs are more likely to stop SEUs rather than
simple gates• Lesser redundancy in LUTs is required compared
to redundant gates
MAPLD 2005
Chandrasekhar 7 MAPLD 2005/204
Signal Probability
• Probability that a line carries a value ‘1’
• Largely governed by the functions stored in the LUTs
• Can be used to estimate the value carried by a line
• The inputs are assigned random signal probabilities
• Input values are propagated along Boolean network to the primary outputs
• A threshold is fixed to get the expected value of a line from its signal probability
MAPLD 2005
Chandrasekhar 8 MAPLD 2005/204
Signal Probability Propagation
• Calculation of signal probability of an LUT’s output is dependent on the stored function
• Signal probability of the LUT output is the probability of the input accessing a cell storing ‘1’
• Can be computed as a sum of probability products similar to the sum-of-products form of the function
• Let Mi {0,1}, for 1 i 4, and
V = F(M1, M2, M3, M4)
MAPLD 2005
Chandrasekhar 9 MAPLD 2005/204
Signal Probability Propagation
S (M1, M2, M3, M4) = R(M1, P1) x R(M2, P2) x R(M3, P3) x R(M4, P4)
0 If V = 0
If V = 1
Where
R (M1, M2, M3, M4) = B
1 - B If A = 0
If A = 1
The output signal probability is defined as
S (i, j, k, l ) i=0 j=0 k=0 l=0
1 1 1 1
MAPLD 2005
Chandrasekhar 10 MAPLD 2005/204
Sensitive and Insensitive LUTs
• Input to an LUT can at most change by one bit - guideline for voter insertion
• Sensitive LUT - a change in any one of the expected input values changes the output
• Insensitive LUT - same value in all cells accessed due to a one-bit change in expected input
• Insensitive LUTs stop the SEU effect from propagating any further
• Form chains of sensitive LUTs that end at a voter followed by an insensitive LUT
MAPLD 2005
Chandrasekhar 11 MAPLD 2005/204
Insensitive LUTs
MAPLD 2005
Chandrasekhar 12 MAPLD 2005/204
Sensitive LUTs
MAPLD 2005
Chandrasekhar 13 MAPLD 2005/204
Pseudo-insensitive LUTs
• A sensitive LUT whose fanouts are all insensitive LUTs
• An SEU effect passing through this LUT will not get past any of its fanouts
• Such an LUT need not be triplicated
• Cannot be treated as an insensitive LUT in identifying more pseudo-insensitive LUTs
MAPLD 2005
Chandrasekhar 14 MAPLD 2005/204
Reduced Triple Modular Redundancy (RTMR)
Assign random signal probabilities to primary inputs of the LUT network
Propagate the signal probabilities through the network till the primary outputs
Assign expected values to lines based on the threshold Mark LUTs as either sensitive or insensitive Find pseudo-insensitive LUTs and remark them Triplicate every sensitive LUT identified in the circuit
MAPLD 2005
Chandrasekhar 15 MAPLD 2005/204
RTMR
For every sensitive LUT L
Triplicate L If all fanouts of L are sensitive LUTs
Connect the fanout of each copy of L to the corresponding copies of the fanout LUTs
Else Connect the fanout of each copy of the LUT to the
corresponding copies of the fanout sensitive LUTs Insert a voter for the outputs of the three copies of the LUT Connect the output of the voter to all the fanout insensitive
and pseudo insensitive LUTs
MAPLD 2005
Chandrasekhar 16 MAPLD 2005/204
SEU Simulation
• For given technology mapping circuit– A place and routing is performed using VPR tool– Net adjacencies are generated from the routing to
simulate possible bridge faults– Using the technology mapping circuit, a Verilog
model of the Boolean network is generated – The delays of the nets between the LUTs are
extracted from the routing provided by the VPR tool
MAPLD 2005
Chandrasekhar 17 MAPLD 2005/204
SEU Simulator
SEU Simulator
RTMR Circuit
Original Circuit
Net adjacencies generation
Compare
Errors
simulatesimulat
e
Faults
MAPLD 2005
Chandrasekhar 18 MAPLD 2005/204
Fault generation
• A random fault list is generated which consists errors like– Bridge Faults– Nets disconnections– Changes in the CLB SRAM cells
MAPLD 2005
Chandrasekhar 19 MAPLD 2005/204
Simulation
• Each fault is simulated for a flexible duration of time to check the effectiveness of the RTMR circuit in tolerating the faults
• Bridge faults are most important because errors are propagated through multiple paths in the network
• Tougher to simulate since the RTMR circuit and the original circuit have different sets of possible SEU routing errors
MAPLD 2005
Chandrasekhar 20 MAPLD 2005/204
Experimental Results
• Implemented on MCNC benchmark circuits to determine the required redundancy
• Threshold for signal probability was varied between 0.5-0.8 with nearly similar results
• Berkeley Logic Interchange Format (BLIF) files of the circuits are used for RTMR
MAPLD 2005
Chandrasekhar 21 MAPLD 2005/204
Area overheads of RTMR vs. TMR
4200276648.791400Tseng
1159545448.783865spla
5733253516.331911seq
1605959475.555353pdc
4431201318.151477misex3
17559916628.35853frisc
3486198635.461162ex5p
12543547515.484181elliptic
5625299929.981875diffeq
6465371536.22155des
394801632612.0313160clma
3864164813.981288apex4
No. of LUTs in TMR circuit
No. of LUTs in RTMR circuit
Percentage of sensitive
LUTs
No. of LUTs in original
circuit
Benchmark Circuit
MAPLD 2005
Chandrasekhar 22 MAPLD 2005/204
Conclusion and future work
• On an average, only 38% additional redundancy is required
• Insignificant loss of SEU immunity is observed using the SEU simulator
• Further study of the tradeoff between SEU immunity and LUT redundancy is required
MAPLD 2005