Académique Documents
Professionnel Documents
Culture Documents
EE141 1 EE141 2
Outline 1. Introduction
1. Introduction Test in the past High Fault Coverage
Short Test Time
2. Energy and power modeling
Small Test Data Volume
3. Test power issues Low Test Development Efforts
4. Low-power scan testing Low area overhead
EE141 3 EE141 4
1. Introduction 1. Introduction
Power dissipation in test mode is much higher Industry generally resorts to ad-hoc solutions:
than during functional mode Over sizing power rails
The circuit is highly stressed Over sizing packages and use of cooling systems
No correlation between consecutive test vectors Test with reduced operation frequency
Test vectors ignore functional constraints Partitioning and appropriate test planning
EE141 5 EE141 6
1
EE141
EE141 7 EE141 8
EE141 9 EE141 10
EE141 11 EE141 12
2
EE141
EE141 13 EE141 14
4. Low-
Low-Power Scan Testing - Basics 4. Low-
Low-Power Scan Testing - Basics
At-speed scan testing with a LOC test scheme At-speed scan testing with a LOS test scheme
Last shift Capture Last shift
V 1 applied & Launch Response V 1 applied & Launch Response
V 2 applied capture V 2 applied capture
shift shift shift shift
CLK CLK
SE Time SE Time
LOC scheme LOS scheme
SE Time Time
LOS scheme
Launch is caused by the difference between the values loaded Launch is caused by the difference between the values loaded
by the last shift pulse (V1) and the first capture pulse (V2) Time by the next-to-last (V1) and the last (V2) shift pulses
load/unload test load/unload
SE easy to implement,
cycles but lower
cycle fault coverage than LOS
cycles Higher fault coverage than LOC, but SE not easy to implement
EE141 15 EE141 16
4. Low-
Low-Power Scan Testing - Basics 4. Low-
Low-Power Scan Testing
ATPG and X-filling techniques (1/3)
The problem of excessive power during scan testing can
be split into two sub-problems: excessive power during the The fraction of dont care bits (Xs) in a given
shift operation (called shift power) and excessive power ATPG test cube is nearly always a very large
during the capture operation (called capture power) fraction of the total number of bits despite the
application of state-of-the-art dynamic and static test
At-speed scan testing especially vulnerable to excessive pattern compaction techniques
IR drop caused by the high switching activity generated in
the CUT between launch and capture yield loss In classical ATPG, Xs are randomly filled and then
the resulting fully specified pattern is simulated to
confirm detection of all targeted faults and to
measure the amount of fortuitous detection
EE141 17 EE141 18
3
EE141
4. Low-
Low-Power Scan Testing 4. Low-
Low-Power Scan Testing
Power-aware ATPG algorithms (2/3) Power-aware X-filling heuristics (3/3)
From a set of deterministic test cubes, the main goal of these
Clever assignment of dont care bits in techniques is to assign dont care bits of each test pattern so that the
combinational ATPG in order to minimize the occurrence of transitions in the scan chain is minimized:
number of transitions between two consecutive test
Adjacent filling or MT-filling
vectors 0-filling
1-filling
Minimizing the difference between the before- 0XXX1XX0XX0XX 0000111000000 with MT-filling
capture and after-capture output values of a scan 0000100000000 with 0-filling
flip-flop 0111111011011 with 1-filling
Applicable at the end of the design process, no area overhead
Reduce test power consumption by reasonable increase of test length
A few solutions exist for reducing power during test cycle (LOC)
EE141 19 EE141 20
4. Low-
Low-Power Scan Testing 4. Low-
Low-Power Scan Testing
Low power test vector compaction Low power (gated) scan cells
Static compaction minimizes the number of test cubes generated by an
Combinational Part
ATPG tool by merging test cubes that are compatible in all bit positions
Example 1: 11XX0 and 1X0X0 are compatible ( 110X0)
Example 2: 11XX0 and 011X1 are not compatible
D Q
Conventional approaches target the minimum number of final test 0 SO
SI 1 output
cubes
CLK
[Sankaralingam 2000] used a greedy heuristic for merging test cubes SE
in a way that minimizes the number of transitions (use of weighted
transition metric) Gate scan cells block transitions during scan shifting
Significant reductions in average and peak power consumption Very effective in test power reduction
Significant area overhead and performance degradation
EE141 21 EE141 22
4. Low-
Low-Power Scan Testing 4. Low-
Low-Power Scan Testing
Scan cell ordering (1/2) Scan cell ordering (2/2)
FF1 FF2 FF3 FF4 FF2 FF4 FF1 FF3
0101 0 0 0 0 1100 0 0 0 0
1 0 0 0 0 0 0 0
0 1 0 0 0 0 0 0
1 0 1 0 1 0 0 0
0 1 0 1 1 1 0 0
10 transitions generated 2 transitions generated
during loading of V after scan cell reordering
Need to change the order of bits in each vector during test application Partition the circuit in clusters (by using geographical criteria)
Scan cell reordering may lead to significant power reduction (up to 66%) Then reorder the scan cells within each cluster so as to reduce WSA
No overhead, FC and test time unchanged, low impact on design flow Clusters are then stitched together using the nearest neighbor criteria
May lead to routing congestion problems Good tradeoff between test power reduction and scan chain length
EE141 23 EE141 24
4
EE141
4. Low-
Low-Power Scan Testing 4. Low-
Low-Power Scan Testing
Scan chain segmentation Scan architecture modification
Combinational Logic
SE Capture
Inserting logic elements (XOR gates) between
Scan Chain A scan cells in order to minimize the number of
CLKA
Scan Chain B
transitions occurring inside the scan chain
Scan In Scan Chain C
ScanOut CLKB
CLKA CLKB CLKC
EE141 25 EE141 26
4. Low-
Low-Power Scan Testing 4. Low-
Low-Power Scan Testing
Token scan architecture (1/2) Token scan architecture (2/2)
ScanIn CLK Scan Si Di
1 2 j N ScanIn 1 0
ScanOut D
1 2 j N 0 TCK
1 2 j N D1
1
Q
ScanOut CLR
CLK Ti D Q T0
D2
CLK
Multiphase
S
Generator
S0 D0
EE141 27 EE141 28
4. Low-
Low-Power Scan Testing 5. Low-
Low-Power BIST - Basics
Scan clock splitting Test Pattern Generator
(TPG)
Vdd Logic
Circuit Under Test
Circuit Under Test BIST
(CUT)
CLK Controller
Time
T 2T 3T 4T 5T
ComOut Vdd
CLK/2 CLK/2
Output Response Analyzer
CLK/2 (ORA)
ScanIn Scan Cells A Scan Cells B 1
ScanOut Time
T 3T 5T
0 Vdd
SE
CLK/2
2T 4T
Time A test pattern generator (TPG) automatically generates test patterns for
application to the inputs of the circuit under test (CUT)
In-circuit TPGs constructed from LFSRs are most commonly used
The two clocks are synchronous with the system clock and have the same
period during shift operation except that they are shifted in time LFSRs are also used for output response analyzer (ORA)
During capture operation, the two clocks operate as the system clock BIST is implemented as Test-per-scan or as test-per-clock
Lowers the transition density in the CUT, the scan chains and the clock tree Even if it is slower, test-per-scan is the industry preferred solution today
EE141 29 EE141 30
5
EE141
5. Low-
Low-Power BIST 5. Low-
Low-Power BIST
Low power test pattern generators (1/3) Low power test pattern generators (2/3)
SI ScanOut
Circuit Under Test k
TFF
Scan Chain
m
LFSR
CLK r
CLK
Normal-speed
CUT
SCLK Slow
Sel_CLK LFSR/MISR LFSR/MISR
EE141 31 EE141 32
5. Low-
Low-Power BIST 5. Low-
Low-Power BIST
Low power test pattern generators (3/3) Vector filtering BIST
Test Sequence
By carefully choosing the seed of the LFSR V0
CLK LFSR
Vi
Vj LFSR activation
By inserting translating logic between the LFSR FF
Vk LFSR inhibition
and the CUT to obtain weighted random test vectors
Vl LFSR activation Circuit Under Test
EE141 33 EE141 34
5. Low-
Low-Power BIST 5. Low-
Low-Power BIST
Circuit partitioning Power-aware test scheduling (1/3)
A B C D A B C D A B C Power
DMUX DMUX DMUX DMUX
M
U
Power limit
X
C1 C2 C1 M
C2 C1 M
C2
U U
X X
Partition the original circuit (using a graph partitioning algorithm that The goal is to determine the blocks (memory, logic, analog, etc.) of a SoC to
minimizes the cut size) into structural sub-circuits so each sub-circuit can be be tested in parallel at each stage of the BIST session in order to keep power
successively tested through different BIST sessions dissipation under a specified limit while optimizing test time
FC and test time are unchanged and area overhead is quite low Some of the test resources (pattern generators and response analyzers)
Drawbacks are a slight penalty on performance and an impact on routing must be shared among the various blocks
EE141 35 EE141 36
6
EE141
5. Low-
Low-Power BIST 5. Low-
Low-Power BIST
Power-aware test scheduling (2/3) Power-aware test scheduling (3/3)
EE141 37 EE141 38
6. Low-
Low-Power Test Data Compression 6. Low-
Low-Power Test Data Compression
High test data volume leads to a high testing time Coding-based schemes
and may exceed the limited memory depth of ATE
Test data compression involves encoding a test Use of 0-filling on ATPG test cubes and then
set so as to reduce its size encode runs of 0s with Golomb codes (run-length
ATE limitations, i.e., tester storage memory and
codes) for reducing the number of transitions (75%)
bandwidth gap between the ATE and the CUT, may Golomb coding is very inefficient for runs of 1s
hence be overcome A synchronization signal between the ATE and the
Using compressed test data involves having an CUT is required as the size of the compressed data
on-chip decoder which decompresses the data (codeword) is of variable length
Low-power test data compression techniques Alternating run-length coding improves the
are needed to concurrently reduce scan power encoding efficiency of Golomb coding (can encode
dissipation and test data volume during test both runs of 0s and runs of 1s )
EE141 39 EE141 40
6. Low-
Low-Power Test Data Compression 6. Low-
Low-Power Test Data Compression
Linear-decompression-based schemes (1/2) Linear-decompression-based schemes (2/2)
Linear Decompressors consist of XORs and flip-flops Low power LD using LFSR reseeding can be used
In LFSR reseeding LFSR reseeding not used to directly encode specified bits
Deterministic test cubes generated by expanding seeds Each test cube divided into blocks
Typically 1-5% of bits in test vector specified LFSR reseeding used only to produce blocks containing
Most bits need not be considered when seed computed transitions
Size of seed much smaller than size of vector For blocks not containing transitions
Significantly reduces test data volume and bandwidth Logic value fed into scan chain simply held constant
Problem: X's in test cubes filled randomly Reduces number of transitions in scan chain
Results in excessive switching during scan shifting Efficient solution to trade-off between test data compression and
test power reduction
EE141 41 EE141 42
7
EE141
6. Low-
Low-Power Test Data Compression 6. Low-
Low-Power Test Data Compression
Broadcast-scan-based schemes (1/2) Broadcast-scan-based schemes (2/2)
Clock Tree
Sense amplifiers & MISR
Segment 1
Segmented
Mode
Test Column line driver
Tester Channel or Column address decoder
Control
Input Decompressor
EE141 43 EE141 44
7. Low-
Low-Power RAM Testing 7. Low-
Low-Power RAM Testing
Motivated by the need to concurrently test several A second strategy is to exploit the predictability of the
banks of memories in a system to reduce test time addressing sequence to reduce the pre-charge activity during test
A first strategy is to reorder memory tests to reduce Pre-charge circuits contribute to up to 70% to power dissipation
the switching activity on each address line while retaining In functional mode, the cells are selected in random sequence, and
the fault coverage and the memory overall test time all pre-charge circuits need to be always active, while during the test
mode the access sequence is known, and hence only the columns
that are to be selected need to be pre-charged
Original Test Low-power Test Single bit
change This low-power test mode can be implemented by using a modified
Zero-One (W0); (R0); (W1); (R1); s (W0, R0, W1, R1); (SBC) pre-charge control circuitry, and by exploiting the first degree of
(W(1odd/0even)); (R(1odd/0even)); s (W(1odd/0even), R(1odd/0even),
counting freedom of March tests, which allows choosing a specific addressing
Checker Board
(W(0odd/1even)); (R(0odd/1even)); W(0odd/1even), R(0odd/1even)); sequence
Addressing sequence is fixed to word line after word line and the
Power dissipation reduced by a factor of two to sixteen pre-charge activity is restricted to only two columns for each clock
A special design of the BIST circuitry is needed cycle: the selected column and the following one
EE141 45 EE141 46
7. Low-
Low-Power RAM Testing Summary and Conclusions
BLj-1 BLBj-1 BLj BLBj BLj+1 BLBj+1 Test throughput and manufacturing yield may be
affected by excessive test power
Therefore, lowering test power has been and is still a
Additional
pre-charge focus of intense research and development
control
Prec
Prec Prec
logic
Prec
Following points have been surveyed:
EE141 47 EE141 48
8
EE141
EE141 49