Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of...
Transcript of Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of...
![Page 1: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/1.jpg)
Failure Mode and Effects Analysis of FPGA-Based Nuclear Power
Plant Safety Systems
Failure Mode and Effects Analysis of FPGA-Based Nuclear Power
Plant Safety Systems
8th International Workshop on the Applications of FPGAs in NPPs
Shanghai, ChinaOctober 13-16, 2015
Phillip McNelles, Zhao Chang Zeng, and Guna Renganathan
Phillip McNelles, Zhao Chang Zeng, and Guna Renganathan
![Page 2: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/2.jpg)
Presentation OutlinePresentation Outline
• Introduction• Potential use of FPGAs in Canadian NPPs
• FMEA• Purpose of performing FPGA FMEA (Research
Program)
• FMEA Background
2Canadian Nuclear Safety Commission
• FMEA Background
• FMEA Results
• Failure Mode Categorization • Failure Categories
• “When and Why” Matrix
• Failure Types and Parameters
• Design Suggestions
• Conclusions
![Page 3: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/3.jpg)
IntroductionIntroduction
• Nuclear Power Plants (NPPs) in Canada constructed 1971-1992• FPGAs not implemented in NPPs at that time
• Later implemented in non-safety systems
3Canadian Nuclear Safety Commission
• FPGAs have seen more use in NPP I&C • International implementations
• New builds
• Replacement of older systems
• Potential for future use in operating plants in Canada
![Page 4: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/4.jpg)
Purpose of FMEA (Research Project)Purpose of FMEA (Research Project)
• If FPGA-based systems are implemented in safety systems:• Must be functionally safe and reliable
• Potential faults and failures must be known
4Canadian Nuclear Safety Commission
• FMEA Research Program• Identify potential failure modes and causes
• Identify methods to avoid or mitigate those failures
• Ensure FPGA-based systems are safe to use
![Page 5: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/5.jpg)
Failure Mode and Effects AnalysisFailure Mode and Effects Analysis
• Failure Mode and Effects Analysis (FMEA)• Common Method in Reliability and Safety Analysis
• Start of Reliability Program (Study)
• Reviewed available data from international community (Extensive Literature Review)
5Canadian Nuclear Safety Commission
community (Extensive Literature Review)
• Extensive Literature Review• US NRC and ORNL, VTT, EPRI, OECD-NEA
• Standards from IEC, IEEE and CSA
• White papers from FPGA suppliers
• Scientific/technical literature
![Page 6: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/6.jpg)
Failure Mode and Effects AnalysisFailure Mode and Effects Analysis
• Failure Mode and Effects Analysis (FMEA)• Performed on FPGAs to identify Failure Mode data
• Potential Failure Modes
• Cause(s)
• Potential Effects on FPGA-based system
6Canadian Nuclear Safety Commission
• Potential Effects on FPGA-based system
• Effects of Latent Design Errors
• Eliminate or Mitigate/Control Failure Modes
• Produced a list of failure modes and information• Identify most common/most severe failures
![Page 7: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/7.jpg)
FMEA ResultsFMEA Results
• Identified potential issues • Failure modes, faults, logic errors, human factors…
• Failures divided into categories
7Canadian Nuclear Safety Commission
• 1st : Lifecycle: “Design (Fabrication)”, “Operation”
• 2nd : Cause: “Design Defect”, “Manufacturer Defect”, “Environmental”, “Stress/Aging”, “Maintenance (Human Factors)”
• Causes, potential effects, and methods to eliminate/mitigate those failures for each set
![Page 8: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/8.jpg)
FMEA ResultsFMEA Results
• Design Defect:• Logic (“Programming”), Hardware Faults
• Manufacturer Defects:• Failures due to issues with the physical chip/board
8Canadian Nuclear Safety Commission
• Environment:• Radiation induced failures (SEE)
• Stress/Aging:• Aging Effects
• Maintenance/Human Factors:• Personnel/Security
![Page 9: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/9.jpg)
FMEA ResultsFMEA Results
• Certain failure modes specific to FPGA• Clock/timing failures
• Significance of proper clock/timing behavior
• Optimization by synthesis software may alter intended behavior
9Canadian Nuclear Safety Commission
• Certain failure modes common to digital technology • Single Event Effects (SEE)
• Importance of SEE mitigation
• Programming Errors (HDL code)• FPGA design has some similarities with software-based design
• Non-standard language additions may introduce failures
• Aging Failures
![Page 10: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/10.jpg)
FMEA Results (Failure Sets)FMEA Results (Failure Sets)
• Failure “Causes” divided into “Failure Sets” based on “Failure Effects”
• Failure Effect:• “Consequence of a failure mode in terms of the
10Canadian Nuclear Safety Commission
• “Consequence of a failure mode in terms of the operation, function or status of the item”
• IEC 60812 standard (FMEA)
• Each set includes a description and mitigation
• Grouped for easier identification and mitigation
![Page 11: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/11.jpg)
FMEA Results (Failure Sets)FMEA Results (Failure Sets)
Failure Modes
Design
Design Defects
Manufacturer Defects
Operation
Environmental Stress/AgingMaintenance
(Human Factors)
(Lifecycle)
(Cause)
11Canadian Nuclear Safety Commission
Clock/TimingLogic Errors
(HDL)
State Machines
Sneak Circuit
Input and Data Type
Board Level
Common Cause Failure
Chip and Board
Radiation Induced
Bit ErrorAging
ProcessMaintenance
Induced
Failure Category Diagram
(Failure Sets)
![Page 12: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/12.jpg)
FMEA Results (Failure Sets)FMEA Results (Failure Sets)
• Sample of Failure Sets
• Design• Clock/Timing
• State Machines
12Canadian Nuclear Safety Commission
• State Machines
• Common Cause Failure
• Logic Errors (HDL)
• Operation• Aging Process Failures
• Radiation-Induced Failures
![Page 13: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/13.jpg)
FMEA Results (When and Why Matrix)FMEA Results (When and Why Matrix)
• Additional way to categorize failure modes• Presented in “When and Why” Figure
• When:
13Canadian Nuclear Safety Commission
• When:• Stage in system lifecycle that the failure occurs
• Why:• Failure Category (Failure Modes)
![Page 14: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/14.jpg)
FMEA Results (When and Why Matrix)FMEA Results (When and Why Matrix)
• Lifecycle Categories (“When”):• Design (Fabrication)
• Operation
14Canadian Nuclear Safety Commission
• Cause Categories (“Why”):• Design Defects
• Manufacturer Defects
• Environmental
• Stress/Aging
• Maintenance (Human Factors)
![Page 15: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/15.jpg)
FMEA Results (When and Why Graph)FMEA Results (When and Why Graph)
15Canadian Nuclear Safety Commission
FPGA FMEA “When and Why” Results
![Page 16: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/16.jpg)
FMEA Results (When and Why Results)FMEA Results (When and Why Results)
• Two important results from the graph
• “Design Stage” had most results (73)• Includes Logic, Timing and general Hardware faults
16Canadian Nuclear Safety Commission
• Includes Logic, Timing and general Hardware faults
• “Design Defect” most populous category
• Most failures eliminated before implementation
• Stress/Aging Failure Mitigation• Aging process failures cannot be avoided
• Revealed using self-tests and periodic testing
![Page 17: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/17.jpg)
Failure Types and ParametersFailure Types and Parameters
• Failure Causes divided into “Types” and “Parameters”
Failure Type
Definition Failure Parameter
Definition
17Canadian Nuclear Safety Commission
A Hardware Failure
B Logic Failure
C Radiation Failures
V Electric
T Temperature
M Material
S Chip size
R Radiation Failures
L Logic Failure
H Hardware (General)
FPGA Failure Types FPGA Failure Parameters
![Page 18: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/18.jpg)
Failure Types and ParametersFailure Types and Parameters
18Canadian Nuclear Safety Commission
FPGA Failure Types
![Page 19: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/19.jpg)
Failure Types and ParametersFailure Types and Parameters
19Canadian Nuclear Safety Commission
FPGA Failure Parameters
![Page 20: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/20.jpg)
Failure Type and Parameter ResultsFailure Type and Parameter Results
• “Failure Type” Results:• Hardware Faults most numerous (52)
• More Hardware and Logic faults than Radiation
• Significant overlap (Timing, Common Cause)
20Canadian Nuclear Safety Commission
• “Failure Parameter” Results:• Shows in detail the factors affecting FPGA reliability
• Hardware (Aging Process) failures show strong environmental dependency
• FPGA material (technology) affects both Hardware and Radiation failures
![Page 21: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/21.jpg)
Design and Review SuggestionsDesign and Review Suggestions
• Research provided suggestions for design and review of FPGA systems
• Design Suggestions• Use of Antifuse FPGAs for radiation tolerance
21Canadian Nuclear Safety Commission
• Use of Antifuse FPGAs for radiation tolerance
• Use of synchronous designs
• Use of self-tests to monitor FPGA chip health
• Use of coding standards/guides to prevent logic errors
![Page 22: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/22.jpg)
Design and Review SuggestionsDesign and Review Suggestions
• Research provided suggestions for design and review of FPGA systems
• Review Suggestions• Review system for tolerance of radiation effects
22Canadian Nuclear Safety Commission
• Review system for tolerance of radiation effects– Design should eliminate effects of SEE (where possible)
– Design should mitigate any effects of residual SEE
• Review system for mitigation of aging effects– Design should incorporate methods to detect aging failures (Self-tests)
– Design should include mitigations for effects of residual aging failures
![Page 23: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/23.jpg)
ConclusionsConclusions
• Detailed FMEA was performed to identify:• Failure modes, causes, and effects
• Expanded to include avoidance and mitigation
• FMEA Categorization
23Canadian Nuclear Safety Commission
• FMEA Categorization• Categories facilitate detection and avoidance/ mitigation of
failure modes• Failure modes divided by Lifecycle (“Design” and “Operation”)
• Lifecycle failure modes divided by “Causes”
• “Failure sets” group failure modes by similar cause/effects
• Failure “Types” and “Parameters” provide additional information on root cause of failure modes
![Page 24: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/24.jpg)
ConclusionsConclusions
• Additional Conclusions from FMEA Study• Many failure modes not specific to FPGAs
• Common to digital technology
• FPGA design shares aspects of software-based design
24Canadian Nuclear Safety Commission
• FPGA design shares aspects of software-based design
• Clock and timing behavior critical to correct operation
• Non-standard languages can introduce failure modes
• Synthesizer code optimization features are to be avoided
![Page 25: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/25.jpg)
ConclusionsConclusions
• Primary Results• Methods to avoid or mitigate all identified failure modes
• Majority of failures during the design stage (eliminated)
• Several aging failures that must be mitigated (self tests and periodic tests)
25Canadian Nuclear Safety Commission
and periodic tests)
• Hardware (aging) failures have environmental factors
• Large number of potential logic and timing errors
![Page 26: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/26.jpg)
Future WorkFuture Work
• Future work on FPGA-based systems:• Failure mode information utilized for FPGA-based
system modelling and analysis
• Comparison of reliability analysis methods
• Defenses against SEE failures
26Canadian Nuclear Safety Commission
• Defenses against SEE failures – (Error Correcting Codes, Modular Redundancy, etc.)
![Page 27: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/27.jpg)
The EndThe End
• Thank you for your time• Questions?
27Canadian Nuclear Safety Commission
![Page 28: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/28.jpg)
ReferencesReferences
[1] McNelles, P., Zeng, Z.C., Renganathan, G., 2015, “Modelling of Field Programmable GateArray Based Nuclear Power Plant Safety Systems Part I: Failure Mode and Effects Analysis”,Proc. Of the 7th International Conference on Modelling and Simulation in Nuclear Science andEngineering, Ottawa, Canada.
[2] Bobrek, M., & Bouldin, D., Review Guidelines for FPGAs in NPP Safety Systems, Oak Ridge, Tennessee, 2010.
[3] United States Nuclear Regulatory Commission (U.S. NRC), NUREG-7006, Review Guidelines for Field Programmable Gate Arrays in Nuclear Power Plant Safety Systems,
28Canadian Nuclear Safety Commission
Guidelines for Field Programmable Gate Arrays in Nuclear Power Plant Safety Systems, Washington D.C., 2010.
[4] Electric Power Research Institute (EPRI), TR-1019181, Guidelines on the Use of Field Programmable Gate Arrays (FPGAs) in Nuclear Power Plant I&C System. Palo Alto, California, 2009.
[5] EPRI, TR-1022983, Recommended Approaches and Design Criteria for Application of Field Programmable Gate Arrays in Nuclear Power Plant Instrumentation and Control Systems,Palo Alto, California, 2011.
[6] Valtion Teknillinen Tutkimuskeskus (VTT), The current state of FPGA technology in the nuclear domain. Vuorimiehentie, Finland, 2011.
![Page 29: Failure Mode and Effects Analysis of FPGA-Based Nuclear ... · Failure Mode and Effects Analysis of FPGA-Based Nuclear Power Plant Safety Systems 8th International Workshop on the](https://reader033.fdocuments.us/reader033/viewer/2022042612/5f58339bde6293218441b255/html5/thumbnails/29.jpg)
Canadian Nuclear Safety Commission
Canadian Nuclear Safety Commission
nuclearsafety.gc.canuclearsafety.gc.canuclearsafety.gc.canuclearsafety.gc.ca
© CNSC Copyright 2013
facebook.com/CanadianNuclearSafetyCommission
youtube.ca/cnscccsn