Académique Documents
Professionnel Documents
Culture Documents
A Design Perspective
Jan M. Rabaey Anantha Chandrakasan Borivoje Nikolic
Semiconductor Memories
December 20, 2002
Digital Integrated Circuits2nd
Memories
Chapter Overview
Memory Classification Memory Architectures The Memory Core Periphery Reliability Case Studies
Digital Integrated Circuits2nd
Memories
Random Access
Mask-Programmed
Programmable (PROM)
SRAM DRAM
Memories
Memories
words SN 2 2 N
SN 2
1
A K2 Word N 2 2 Word N 2 1
Decoder Word N 2
Word N 2 1 K 5 log2N
Input-Output ( M bits) Intuitive architecture for N x M memory Too many select signals: N words == N select signals
K = log2N
Memories
Memories
Advantages: 1. Shorter wires within blocks 2. Block address activates only 1 block => power savings Digital Integrated Circuits2nd
Memories
128 K Array Block Subglobal row decoder SubglobalGlobal row decoder row decoder Block 30 Block 31 Block 1
Transfer gate Column decoder
CS, WE buffer
I/O buffer
x1/x4 controller
Y -address buffer
[Hirose90]
Memories
Contents-Addressable Memory
I/O Buffers
Commands
Comparand
Mask
Address Decoder
Validity Bits 9 2 Priority Enc Validity Bits Address Decoder 9 2 Address Decoder Priority Enc
Priority Encoder
Commands Commands
29 Validity Bits
Memories
Memories
BL WL WL
BL
BL WL
0
GND Diode ROM MOS ROM 1 MOS ROM 2
Memories
MOS OR ROM
BL [0] BL [1] BL [2] BL [3]
WL [0]
V DD WL [1]
WL [2]
V DD WL [3]
Memories
WL [0]
GND WL [1]
BL [0]
BL [1]
BL [2]
BL [3]
Memories
Memories
Memories
WL [1]
WL [2]
WL [3]
Memories
Memories
Memories
Memories
WL cword
r word
Memories
WL
K cells
Memories
V DD Precharge devices
WL [2]
GND WL [3]
BL [0]
BL [1]
BL [2]
BL [3]
PMOS precharge device can be made as large as necessary, but clock driver becomes harder to design.
Digital Integrated Circuits2nd
Memories
Source
tox tox n+ Substrate p
Drain
G S
n+_
Device cross-section
Schematic symbol
Memories
10 V S
5V
20 V
2 5V S
0V
2 2.5 V S
5V
Avalanche injection
Memories
A Programmable-Threshold Transistor
Memories
FLOTOX EEPROM
Floating gate Source 2030 nm Gate Drain -10 V 10 V n1 Substrate p 10 nm n1 V GD I
FLOTOX transistor
Memories
EEPROM Cell
BL WL
Absolute threshold control is hard Unprogrammed transistor might be depletion 2 transistor cell
VDD
Memories
Flash EEPROM
Control gate
Floating gate erasure n 1 source
programming p- substrate
n 1 drain
Memories
Flash
Digital Integrated Circuits2nd
EPROM
Courtesy Intel
Memories
Memories
Memories
Memories
Unit Cell
Gate ONO FG
Gate Oxide
Courtesy Toshiba
Memories
Active area
STI
Courtesy Toshiba
Memories
Memories
DYNAMIC (DRAM)
Periodic refresh required Small (1-3 transistors/cell) Slower Single Ended
Digital Integrated Circuits2nd
Memories
M2
M5 Q M1 BL
M4
Q M6
M3 BL
Memories
V DD
BL
Q= 0 M5 V DD M4 Q= 1 V DD M6 V DD BL
M1
Cbit
Cbit
Memories
1
Voltage Rise (V) 0.8 0.6 0.4 0.2 Voltage rise [V] 0 0 0.5 1 1.2 1.5 2 Cell Ratio (CR) 2.5 3
Memories
BL = 1
BL = 0
Memories
Memories
6T-SRAM Layout
VDD
M2 M4
Q
M1 M3
GND
M5 M6
WL
BL
BL
Memories
M1
M2
Static power dissipation -- Want R L large Bit lines precharged to V DD to address t p problem
Digital Integrated Circuits2nd
Memories
SRAM Characteristics
Memories
WWL
RWL X BL 1 BL 2 V DD V DD 2 V T DV V DD 2 V T
M1
CS
M2
No constraints on device ratios Reads are non-destructive Value stored at node X when writing a 1 = V WWL-VTn Digital Integrated Circuits2nd
Memories
3T-DRAM Layout
BL2 BL1 GND
RWL
M3 M2
WWL
M1
Memories
Write: C S is charged or discharged by asserting WL and BL. Read: Charge redistribution takes places between bit line and storage capacitance
CS D V = VBL V PRE = V BIT V PRE -----------C S + CBL
Memories
DRAM requires a sense amplifier for each bit line, due to charge redistribution read-out. DRAM memory cells are single ended in contrast to SRAM cells. The read-out of the 1T DRAM cell is destructive; read and refresh operations are necessary for correct operation. Unlike 3T cell, 1T cell requires presence of an extra capacitance that must be explicitly included in the design. When writing a 1 into a DRAM cell, a threshold voltage is lost. This charge loss can be circumvented by bootstrapping the word lines to a higher value than VDD
Digital Integrated Circuits2nd
Memories
V (1) V PRE
D V (1)
V (0)
Memories
n+
Poly
Field Oxide
Cross-section
Layout
Memories
Memories
Cell plate
Cell Plate Si
Refilling Poly
Transfer gate
Trench Cell
Digital Integrated Circuits2nd
Stacked-capacitor Cell
Memories
M8
M9
M5
CAM
CAM
Word Match
S
M3
int
S
M2
M1
Memories
CAM
SRAM
ARRAY
ARRAY
Hit Logic
Address Decoder
Input Drivers Sense Amps / Input Drivers
Address
Tag
Hit
R/W
Data
Memories
Periphery
Decoders
Sense Amplifiers Input/Output Buffers Control / Timing Circuitry
Memories
Row Decoders
Collection of 2M complex logic gates Organized in regular and dense fashion
(N)AND Decoder
NOR Decoder
Memories
Hierarchical Decoders
Multi-stage implementation improves performance
WL 1
WL 0
A 0A 1 A 0A 1 A 0A 1 A 0A 1
A 2A 3 A 2A 3 A 2A 3 A 2A 3
Memories
Dynamic Decoders
Precharge devices GND GND
VDD WL 3
VDD
VDD
WL 2
V DD WL0
WL 1
WL 0 VDD f A0 A0 A1 A1
A0
A0
A1
A1
Memories
A0
S0 S1
S2
A1 S3
Advantages: speed (tpd does not add to overall memory access time) Only one extra transistor in signal path Disadvantage: Large transistor count
Memories
A1
A1
D
Number of devices drastically reduced Delay increases quadratically with # of sections; prohibitive for large decoders Solutions: buffers progressive sizing combination of tree and pass transistor approaches
Memories
V DD WL
1
V DD
V DD WL
2
V DD
V DD
f f
f f R
f f
f f R
f f
f f
Memories
Sense Amplifiers
DV C tp = ---------------Iav large
small
small transition
input
s.a. output
Memories
bit
M1
M2
bit
SE
M5
Memories
PC
V DD
BL EQ WL i
BL y M3 x
V DD M4 M2 M5 2 x x
V DD 2 y 2 x SE
M1 SE
SE Output (a) SRAM sensing scheme (b) two stage differential amplifier
Memories
SE
Initialized in its meta-stable point with EQ Once adequate voltage gap created, sense amp enabled with SE Positive feedback quickly forces output to a stable operating point.
Digital Integrated Circuits2nd
Memories
Charge-Redistribution Amplifier
V ref
VL M2 M3 VS C small M1 C large
Transient Response
Concept
Memories
SE
M4 Out
Load
V casc
M3
Cout
Cascode device
Ccol
WLC
M2 BL
Column decoder
WL
Digital Integrated Circuits2nd
M1
CBL
EPROM array
Memories
Single-to-Differential Conversion
WL BL Cell x Diff. S.A.
2 x
1 2
V ref
Output
Memories
R0
R1
BLR
SE
CS
CS
CS
CS
CS
CS
Dummy cell
Dummy cell
Memories
V
0 1 t (ns)
BL
V
3
BL
1 t (ns)
reading 0
3 EQ 2 WL
reading 1
SE 1
1 t (ns)
control signals
Memories
Voltage Regulator
VDD
Mdrive VDL
VREF Vbias
Equivalent Model
VREF
Mdrive
VDL
Digital Integrated Circuits2nd
Memories
Charge Pump
Memories
DRAM Timing
Memories
RDRAM Architecture
Bus Clocks Data bus k k3 l memory array
network mux/demux
Column demux Row packet dec. packet dec.
demux
Memories
A0
ATD
ATD
A1
A N2 1
Memories
Memories
C D (1F)
smax V , DD V ,S 10 C ,S Q ,D C
100
C S (1F)
Q S (1C)
V DD (V)
Q S 5 C S V DD / 2 V smax 5 Q S / ( C S 1 C D )
4K
64K
1M 16M 256M 4G
/ chip)
64G
From [Itoh01]
Memories
electrode
Ccross
Memories
EQ
WL 1 BL C BL C
WL 0
C WBL
WL D
WL 0
WL 1
BL C BL
Memories
Folded-Bitline Architecture
WL 1
BL
WL 1
WL 0 C
WBL
WL 0
WL D
WL D x
CBL C CBL
CWBL
Sense EQ Amplifier x
BL
Memories
Transposed-Bitline Architecture
Ccross BL 9 BL BL BL 99 (a) Straightforward bit-line routing Ccross BL 9 BL BL BL 99
(b) Transposed bit-line architecture
Digital Integrated Circuits2nd
SA
SA
Memories
V DD SiO 2
Memories
Yield
Memories
Redundancy
Redundant rows Redundant columns Memory Array Row Address : Fuse Bank
Row Decoder
Column Decoder Column Address
Memories
Error-Correcting Codes
Example: Hamming Codes
e.g. B3 Wrong
with
1 1 =3
Memories
Memories
From [Itoh00]
Memories
Ileakage
700n 500n
Factor 7
0.18 m m CMOS
(A) 300n
100n
0.00 .600 1.20
1.80
VDD
Memories
sleep
V DD,int
SRAM cell
SRAM cell
SRAM cell
sleep
V SS,int
From [Itoh00]
Memories
Case Studies
Programmable
Logic Array
SRAM
Flash
Memory
Memories
Main difference
ROM: fully populated PLA: one element per minterm Note: Importance of PLAs has drastically reduced 1. slow 2. better software techniques (mutli-level logic synthesis)
But
Memories
GND
GND
V DD
X0
X0
X1
X1
X2
X2
f0
f1
AND-plane
OR-plane
Memories
Dynamic PLA
f AND GND V DD f
OR
f f AND V DD
OR
X0
X0
X1
X1
X2
X2
f0
f 1 GND
AND-plane
OR-plane
Memories
AND
tpre teval f
OR
AND
OR
Memories
PLA Layout
VDD And-Plane Or-Plane f GND
x0 x0 x1 x1 x2 x2 Pull-up devices
Digital Integrated Circuits2nd
f0 f1 Pull-up devices
Memories
Memories
Bit-line Circuitry
Bit-line load Block select ATD
Memories
ATD
Block select ATD
BEQ
SA
BS
DATA
BS
Data-cut
Memories
From [Nakamura02]
Memories
108 106
104
102 100 0V 1V 2V 3V 4V
Read
Final Distribution
From [Nakamura02]
Memories
2 125mm
10.7mm
11.7mm
Digital Integrated Circuits2nd
From [Nakamura02]
Memories
0.13m p-sub CMOS triple-well 1poly, 1polycide, 1W, 2Al Cell size 0.077m2 Chip size 125.2mm2 Organization 2112 x 8b x 64 page x 1k block Power supply 2.7V-3.6V Cycle time 50ns Read time 25s Program time 200s / page Erase time 2ms / block Technology
From [Nakamura02]
Memories
Memories
From [Itoh01]
Memories
From [Itoh01]
Memories
Memories