Lecture Set 10 s03 P2homepages.cae.wisc.edu/~ece551/spring03/lectures/Lecture_Set_10… · ECE 551...

04/23/2003 1

ECE 551: Digital System Design & Synthesis Lecture Set 10

10.1: Functional & Timing Verification (Separate File)10.2: Faults & Testing10.2 Appendix: PODEM Example

Slides adapted from ECE 553 Slides by Prof.

Kewal Saluja

04/23/2003 2

ECE 551 - Digital System Design & SynthesisLecture 10.2 – Faults and Testing

OverviewIntroductionFault ModelsTest Pattern GenerationDesign for Testability (DFT) – Serial ScanBuilt-In Self-Test (BIST)Boundary Scan (JTAG/IEEE 1149.1)Quiescent Drain Current (IDDQ) Testing

04/23/2003 3

IntroductionThe manufacturing process for ICs is so complex that only a portion of all chips produced are good – the percentage of such good chips is referred to as the yieldIn order to avoid shipping defective products, manufacturing test at the die and packaged chip level is required.Complex chips => complex tests

The objective: summary the elements of contemporary IC test

04/23/2003 4

Overview

IntroductionFault Models Test Pattern GenerationDesign for Testability (DFT) – Serial ScanBuilt-In Self-Test (BIST)Boundary Scan (JTAG/IEEE 1149.1)Quiescent Drain Current (IDDQ) Testing

04/23/2003 5

Common Fault Models

Single stuck-at faults Transistor open and short faultsMemory faultsPLA faults (stuck-at, cross-point, bridging)Functional faults (processors)Delay faults (transition, path)Analog faults

04/23/2003 6

Stuck-at Faults

•Single stuck-at fault model•What does it achieve in practice?

•Fault equivalence•Checkpoint faults

04/23/2003 7

Single Stuck-at FaultThree properties define a single stuck-at fault

• Only one line is faulty• The faulty line is permanently set to 0 or 1• The fault can be at an input or output of a gate

Example: XOR circuit has 12 fault sites ( ) and 24 single stuck-at faults

a

b

c

d

e

f

10

g h i 1

s-a-0j

k

z

0(1)1(0)

1

Test vector for h s-a-0 fault

Good circuit valueFaulty circuit value

04/23/2003 8

Single Stuck-at Faults (contd.)

How effective is this model?Empirical evidence supports the use of this modelHas been found to be effective to detect other types of faultsRelates to yield modelingSimple to use

04/23/2003 9

Fault EquivalenceNumber of fault sites in a Boolean gate circuit = #PI + #gates + # (fanout branches).Fault equivalence: Two faults f1 and f2 are equivalent if all tests that detect f1 also detect f2.If faults f1 and f2 are equivalent then the corresponding faulty functions are identical.Fault collapsing: All single faults of a logic circuit can be divided into disjoint equivalence subsets, where all faults in a subset are mutually equivalent. A collapsed fault set contains one fault from each equivalence subset.

04/23/2003 10

Equivalence Example

sa0 sa1sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

sa0 sa1

Faults in blueremoved byequivalencecollapsing

20Collapse ratio = ----- = 0.625

32

04/23/2003 11

CheckpointsPrimary inputs and fanout branches of a combinational circuit are called checkpoints.Checkpoint theorem: A test set that detects all single (multiple) stuck-at faults on all checkpoints of a combinational circuit, also detects all single (multiple) stuck-at faults in that circuit.

Total fault sites = 16

Checkpoints ( ) = 10

04/23/2003 12

Summary

Gate level models are most prevalent in logic testingFault models are analyzable approximations of defects and are essential for a test methodology.For digital logic single stuck-at fault model offers advantage of effective tools and much experience.Many other faults (bridging, stuck-open and multiple stuck-at) are largely covered by stuck-at fault tests.Stuck-short and delay faults and technology-dependent faults require special tests.Memory and analog circuits need other specialized fault models and tests.

04/23/2003 13

Overview


04/23/2003 14

Definition of Automatic Test-Pattern Generator

Operations on digital hardware:Inject fault into sequential circuit modeled in computerUse various ways to activate and propagate fault effect through hardware to circuit outputOutput flips from expected to faulty signal

Scan design – add test hardware to all flip-flops to make them a giant shift register in test mode

Can shift state in, scan state outWidely used – makes sequential circuit into combinational circuit for testing!

Costs: 5 to 20% chip area, circuit delay, extra pin, longertest sequence

04/23/2003 15

Notation

SymbolDD01X

Meaning

1/00/10/01/1X/X

FailingMachine

0101X

GoodMachine

1001X

Roth’s DAlgebra

04/23/2003 16

Conditions for Finding a Test

Fault excitation – the signal value at the fault site must be different from the value of the stuck-at fault (thus fault site must contain a D or a D)The fault effect must be propagated to a primary output (A D or a D must appear at the output)Some simple observations

There must be at least a D or a D on some circuit nets)D’s must form a chain to some output

04/23/2003 17

Exhaustive Algorithm

For n-input circuit, generate all 2n input patternsInfeasible, unless circuit is partitioned into cones of logic, with 15 inputs

Perform exhaustive ATPG for each coneMisses faults that require specific activation patterns for multiple cones to be tested

≤

04/23/2003 18

Random-Pattern Generation

Flow chart for methodUse to get tests for 60-80% of faults, then switch to D-algorithm or other ATPG for rest

04/23/2003 19

Path Sensitization Method -Example 1 Fault

Sensitization2 Fault

Propagation3 Line Justification

04/23/2003 20

Path Sensitization Method -Example

Try path f – h – k – L. This path is blocked at j, since there is no way to justify the 1 on i

10

D

D1

1

1DD

D

04/23/2003 21

Path Sensitization MethodTry simultaneous paths f – h – k – L and

g – i – j – k – L. These paths blocked at kbecause D-frontier (chain of D or D) disappears

1

DD D

DD

1

1

1

04/23/2003 22

Path Sensitization Method Circuit Example

Final try: path g – i – j – k – L – test found!

0

D D D

1 DD

1

0

1

04/23/2003 23

OverviewMajor ATPG algorithms

• Definitions• D-Algorithm (Roth) – 1964-66

– D-cubes– Bridging faults– Logic gate function change faults

• PODEM (Goel) -- 1981– X-Path-Check– Backtracing

• Summary

04/23/2003 24

Overview


04/23/2003 25

Definition

Design for testability (DFT) refers to those design techniques that make test generation and test application cost-effective.DFT methods for digital circuits:

Ad-hoc methodsStructured methods:

• Scan• Partial Scan• Built-in self-test (BIST)• Boundary scan

04/23/2003 26

Scan Design

Objectives• Simple read/write access to all or subset of storage

elements in a design.• Direct control of storage elements to an arbitrary

value (0 or 1).• Direct observation of the state of storage elements

and hence the internal state of the circuit.

Key is – Enhanced controllability and observability.

04/23/2003 27

Scan DesignCircuit is designed using pre-specified design rules.Test structure (hardware) is added to the verified design:

• Add one (or more) test control (TC) primary input.• Replace flip-flops by scan flip-flops and connect to form one or

more shift registers in the test mode.• Make input/output of each scan shift register

controllable/observable from PI/PO.

Use combinational ATPG to obtain tests for all testable faults in the combinational logic.Add shift register tests and convert ATPG tests into scan sequences for use in manufacturing test.

04/23/2003 28

Scan Flip-Flop (master-slave)

DTC

SD

CK

Q

QMUX

D flip-flop

Master latch Slave latch

CK

TC Normal mode, D selected Scan mode, SD selected

Master open Slave opent

t

Logicoverhead

04/23/2003 29

Adding Scan Structure

SFF

SFF

SFF

Combinational

logic

PI PO

SCANOUT

SCANINTC or TCK Not shown: CK or

MCK/SCK feed allSFFs (scan Flip-flops).

04/23/2003 30

Comb. Test Vectors

I2I1 O1 O2

S2S1 N2N1

Combinational

logic

PI

Presentstate

PO

Nextstate

SCANINTC SCANOUT

04/23/2003 31

Comb. Test Vectors

I2I1

O1 O2

PI

PO

SCANIN

SCANOUT

S1 S2

N1 N2

0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0TC

Don’t careor random

bits

Sequence length = (nsff + 1) ncomb + nsff clock periods

ncomb = number of combinational vectorsnsff = number of scan flip-flops

04/23/2003 32

Scan OverheadIO pins: One pin necessary.Area overhead:

Gate overhead = [4 nsff/(ng+10nff)] x 100%, where ng = comb. gates; nff = flip-flops; • Example – ng = 100k gates, nff = 2k flip-flops, overhead =

6.7%.

More accurate estimate must consider scan wiring and layout area.

Performance overhead:Multiplexer delay added in combinational path; approx. two gate-delays.Flip-flop output loading due to one additional fanout; approx. 5-6%.

04/23/2003 33

Overview


04/23/2003 34

BIST Process

Test controller – Hardware that activates self-test simultaneously on all PCBsEach board controller activates parallel chip BIST Diagnosis effective only if very high fault coverage

04/23/2003 35

BIST Architecture

Note: BIST cannot test wires and transistors:

From PI pins to Input MUXFrom POs to output pins

04/23/2003 36

Pattern GenerationStore in ROM – too expensiveExhaustivePseudo-exhaustivePseudo-random (LFSR) – Preferred methodBinary counters – use more hardware than LFSRModified countersTest pattern augmentation

LFSR combined with a few patterns in ROMHardware diffracter – generates pattern cluster in neighborhood of pattern stored in ROM

04/23/2003 37

Exhaustive Pattern Generation (A Counter)

Shows that every state and transition worksFor n-input circuits, requires all 2n vectorsImpractical for large n ( > 20 )

04/23/2003 38

Random Pattern Testing

Bottom:Random-PatternResistantcircuit

04/23/2003 39

Pseudo-Random Pattern Generation

Standard Linear Feedback Shift Register (LFSR)Normally known as External XOR type LFSRProduces patterns algorithmically – repeatableHas most of desirable random # properties

Need not cover all 2n input combinationsLong sequences needed for good fault coverage

04/23/2003 40

Test Pattern AugmentationSecondary ROM – to get LFSR to 100% SAF coverage

Add a small ROM with missing test patternsAdd extra circuit mode to Input MUX – shift to ROM patterns after LFSR doneImportant to compact extra test patterns

Use diffracter:Generates cluster of patterns in neighborhood of stored ROM pattern

Transform LFSR patterns into new vector setPut LFSR and transformation hardware in full-scan chain

04/23/2003 41

Response Compaction

Severe amounts of data in CUT response to LFSR patterns – example:

Generate 5 million random patternsCUT has 200 outputsLeads to: 5 million x 200 = 1 billion bits response

Not economical to store and check all of these responses on chipResponses must be compacted

04/23/2003 42

DefinitionsAliasing – Due to information loss, signatures of good and some bad machines matchCompaction – Drastically reduce # bits in original circuit response – lose informationCompression – Reduce # bits in original circuit response – no information loss – fully invertible(can get back original response)Signature analysis – Compact good machine response into good machine signature. Actual signature generated during testing, and compared with good machine signature

04/23/2003 43

LFSR for Response CompactionUse cyclic redundancy check code (CRCC) generator (LFSR) for response compacterTreat data bits from circuit POs to be compacted as a decreasing order coefficient polynomialCRCC divides the PO polynomial by its characteristic polynomial

Leaves remainder of division in LFSRMust initialize LFSR to seed value (usually 0) before testing

After testing – compare signature in LFSR to known good machine signatureCritical: Must compute good machine signature

04/23/2003 44

Example Modular LFSR Response Compacter

LFSR seed value is “00000”

04/23/2003 45

SummaryLFSR pattern generator and MISR response compacter – preferred BIST methodsBIST has overheads: test controller, extra circuit delay, Input MUX, pattern generator, response compacter, DFT to initialize circuit & test the test hardwareBIST benefits:

At-speed testing for delay & stuck-at faultsDrastic ATE cost reductionField test capabilityFaster diagnosis during system testLess effort to design testing processShorter test application times

04/23/2003 46

Overview


04/23/2003 47

Overview: Boundary Scan

MotivationSystem view of boundary scan hardwareElementary scan cellTest Access Port (TAP) controllerBoundary scan instructionsSummary

04/23/2003 48

Motivation for StandardBed-of-nails printed circuit board tester gone

We put components on both sides of PCB & replaced DIPs with flat packs to reduce inductance

Nails would hit componentsReduced spacing between PCB wires

Nails would short the wiresPCB Tester must be replaced with built-in test delivery system -- JTAG does that Need standard System Test Port and BusIntegrate components from different vendors

Test bus identical for various componentsOne chip has test hardware for other chips

04/23/2003 49

System Test Logic

04/23/2003 50

System View of Interconnect

04/23/2003 51

Boundary Scan Chain View

04/23/2003 52

Elementary Boundary Scan Cell

04/23/2003 53

Tap Controller SignalsTest Access Port (TAP) includes these signals:

Test Clock Input (TCK) -- Clock for test logicCan run at different rate from system clock

Test Mode Select (TMS) -- Switches system from functional to test modeTest Data Input (TDI) -- Accepts serial test data and instructions -- used to shift in vectors or one of many test instructionsTest Data Output (TDO) -- Serially shifts out test results captured in boundary scan chain (or device ID or other internal registers)Test Reset (TRST) -- Optional asynchronous TAP controller reset

04/23/2003 54

Tap Controller State Diagram

04/23/2003 55

EXTEST InstructionPurpose: Test off-chip circuits and board-level interconnections

04/23/2003 56

SummaryBoundary Scan Standard has become absolutely essential --

No longer possible to test printed circuit boards with bed-of-nails testerNot possible to test multi-chip modules at all without itSupports BIST, external testing with Automatic Test Equipment, and boundary scan chain reconfiguration as BIST pattern generator and response compacterNow getting widespread usage

04/23/2003 57

Overview


04/23/2003 58

Basic Principle of IDDQTesting

Measure IDDQ current through Vss bus

04/23/2003 59

Stuck-at Faults Detected by IDDQ Tests

Bridging faults with stuck-at fault behaviorLevi – Bridging of a logic node to VDD or VSS – few of theseTransistor gate oxide short of 1 KΩ to 5 KΩ

Floating MOSFET gate defects – do not fully turn off transistor

04/23/2003 60

NAND Open Circuit Defect – Floating gate

The fault manifests as stuck-at, weak ON for N-FET, or delay faultsome

manifestations can be tested by IDDQ tests

04/23/2003 61

Floating Gate Defects

Small break in logic gate inputs (100 –200 Angstroms) lets wires couple by electron tunneling

Delay fault and IDDQ fault

Large open results in stuck-at fault – not detectable by IDDQ test

04/23/2003 62

Bridging Faults S1 – S5

Caused by absolute short (< 50 Ω) or higher R

Segura et al. evaluated testing of bridges with 3 CMOS inverter chain

IDDQRb tests fault when

Rb > 50 KΩ or

0 ≤ Rb ≤ 100 KΩ

Largest deviation when Vin = 5 V bridged nodes at opposite logic values

04/23/2003 63

Delay Faults

Most random CMOS defects cause a timing delay fault, not catastrophic failureMany delay faults detected by IDDQ test – late switching of logic gates keeps IDDQ elevatedDelay faults not detected by IDDQ test

Resistive via fault in interconnectIncreased transistor threshold voltage fault

04/23/2003 64

Leakage Faults

Gate oxide shorts cause leaks between gate

& source or gate & drain

Weak Faults• nFET passes logic 1 as 5 V – Vtn

• pFET passes logic 0 as 0 V + |Vtp|

• Weak fault – one device in C-switch does not turn on

Causes logic value degradation in C-switch

04/23/2003 65

Transistor Stuck-Closed FaultsDue to gate oxide short (GOS)k = distance of short from drainRs = short resistanceIDDQ2 current results show 3 or 4 orders of magnitude elevation

04/23/2003 66

Gate Oxide Short

04/23/2003 67

IDDQ SummaryIDDQ tests improve reliability, find defects causing:

Delay, bridging, weak faultsChips damaged by electro-static discharge

No natural breakpoint for current thresholdGet continuous distribution – bimodal would be better

Conclusion: now need stuck-fault, IDDQ, and delay fault testing combinedStill uncertain whether IDDQ tests will remain useful as chip feature sizes shrink further

Lecture Set 10 s03 P2homepages.cae.wisc.edu/~ece551/spring03/lectures/Lecture_Set_10… · ECE 551...

Documents

Transcript of Lecture Set 10 s03 P2homepages.cae.wisc.edu/~ece551/spring03/lectures/Lecture_Set_10… · ECE 551...