Académique Documents
Professionnel Documents
Culture Documents
The R
of a MOSFET is inversely
proportional to the gate overdrive, V
GS
-V
T
R
ON
During Transition
S. A. Ibrahim
14
MOS Capacitance
S. A. Ibrahim
15
Parasitic (undesirable) elements are everywhere
Devices are small enough such that inductive effects are negligible.
Capacitances dominate; and there are MANY
1. C
channel
since channel connects S/D, it is split
2. C
overlap
gate overlaps S/D, and there is fringe cap from Gate
3. C
junction
S/D form reverse-biased PN junctions with the body
4. C
j_sidewall
S/D different doping to the bottom and side
5. C
body
from channel to body, quite small b/c of small L
Gate
Source Drain
C
1
C
2
C
2
C
3
C
4
C
5
C
3
C
4
Gate Capacitance
S. A. Ibrahim
16
Channel cap characteristics
Split between S/D
Varies with region of operation
Off MOSFET
C
ch
= C
GB
On MOSFET
C
ch
= W*L
ch
*c
ox
/t
ox
Triode region
C
1A
= C
1B
= C
ch
/2
Saturated region
C
1A
= 2/3C
ch
, C
1B
= 0
C
ov
= C
ox
*W*L
ov
*k, where k is a
constant accounting for fringing
Removing technology dependence
and assuming minimum channel
length
C
gate
= C
G
* W
Gate
Source Drain
C
1A
C
1B
Self-Loading Capacitance
S. A. Ibrahim
17
A transistor pulls current from its S/D,
so the S/D junction is often called the
Self-Loading Capacitance.
C
S/D
= C
j_area
+ C
j_sidewall
C
j_area
= W*L
diff
*C
J
(V
S/D
)
C
j_sidewall
= (W+2L
diff
)*C
JSW
(V
S/D
)
Note that C
J
and C
JSW
are
layout dependent
voltage dependent
L
diff
is typically 5 (unshared and
contacted)
L
diff
W
M
O RB O J
V V C C ) / 1 /( + =
Where C
O
, V
O
, and M are
junction dependent constants
Can also be simplified as
C
junction
= C
D,S
* W
CMOS Inverter: Transient Response
t
pHL
= f(R
on
.C
L
)
= 0.69 R
on
C
L
V
out
R
p
V
DD
(a) Low-to-high (b) High-to-low
V
out
R
n
V
DD
C
L
C
L
S. A. Ibrahim
18
V
DD
V
out
V
in
= V
DD
R
on
C
L
t
pHL
= f(R
on
.C
L
)
= 0.69 R
on
C
L
t
V
out
V
DD
R
on
C
L
1
0.5
ln(0.5)
0.36
0 0.5 1 1.5 2 2.5
x 10
-10
-0.5
0
0.5
1
1.5
2
2.5
3
t (sec)
V
o
u
t
(
V
)
CMOS Inverter: Propagation Delay
t
p
= 0.69 C
L
(R
n
+R
p
)/2
t
pHL
t
pLH
S. A. Ibrahim
19
3
4
(neglecting
Channel length mod.)
Choosing W
p
/W
n
Ratio
For symmetric inverter V
M
= V
DD
/2 and t
PHL
= t
PLH
,
=W
p
/W
n
=
n
/
p
=R
eqp
/R
eqn
=r.
However, this does not guarantee minimum delay.
t
PLH
decreases with whereas t
PHL
increases with .
S. A. Ibrahim
20
Power Consumption
Static (leakage) (low for low transistor count and perfected technology)
Dynamic (to charge and discharge C
L
at a certain frequency: appreciable)
Short-Circuit (when both PMOS and NMOS are ON: reduced in fast transitions)
circuit short dynamic static total
P P P P
+ + =
may be the most important
S. A. Ibrahim
21
Static Power
leakage DD static
I V P =
S. A. Ibrahim
22
Leakage power increases exponentially with temperature.
Leakage power affects stand-by time heavily.
Techniques to Reduce Static Power
Using transistors with higher
threshold voltages (HVT). Difficult
at low supplies.
Using technologies that contain
devices with sharper turn-off
characteristic for example SOI.
Using body biasing to increase
threshold voltage when off
(dynamic biasing used especially
in memories).
Increasing length and/or stacking
devices.
Power gating of different blocks
when not used (supply off or
reduced).
S. A. Ibrahim
23
>0
Dynamic Power
* An equal energy is needed to discharge C
L
(in the next half cycle)
* Therefore in one switching cycle we need an energy of C
L
V
DD
2
activity DD L
DD L
dynamic
f V C
T
V C
P
2
2
= =
During
output
transition
S. A. Ibrahim
24
Techniques to Reduce Dynamic Power
Reducing the supply reduces
power substantially (quadratic) as
long as we can keep f
clk
(V
DD
>2V
T
).
Reducing the activity factor by
revisiting the Boolean functions on
top level. Reduce Glitches too.
Reducing capacitance by reducing
sizes of transistors and
interconnects whenever possible.
Dynamic voltage scaling (DVS).
Clock gating.
Parallel hardware may be used to
reduce global interconnect and
allow a reduction in supply voltage
without degrading system
throughput.
S. A. Ibrahim
25
[Burd, JSSC00]
Short-Circuit Power
This average short-circuit current drawn from V
DD
results in P
short-circuit
Fast rise and fall times of input waveform reduce short-circuit power
During
input
transition
S. A. Ibrahim
26
activity DD SC DD peak SC SC
f V C fV I t P
2
= =
Techniques to Reduce Short Circuit Power
Short-circuit current is reduced when we lower the
supply voltage. In the extreme case, when VDD < V
Tn
+
|V
Tp
|, short-circuit dissipation is completely eliminated.
With threshold voltages scaling at a slower rate than
the supply voltage, short-circuit power dissipation is
becoming of a lesser importance in deep-submicron
technologies.
Short-circuit power is consumed by each transition
(increases with input transition time). Reduction
requires that gate output transition should not be faster
than the input transition (faster gates can consume
more short-circuit power). Increasing the output load
capacitance reduces short-circuit power.
S. A. Ibrahim
27
Power-Delay & Energy-Delay Product
Used as quality measure of logic gates.
PDP is energy, so not so useful.
Assuming that NMOS and PMOS
transistors have comparable threshold
and saturation voltages,
where
This equation is only accurate as long as
the devices remain in velocity saturation,
which is probably not the case for the
lower supply voltages.
S. A. Ibrahim
28
Outline
The CMOS Inverter Noise Margin and Threshold
Voltage
The CMOS Inverter Propagation Delay and Switching
Speed
The CMOS Inverter Power Consumption
Sizing of the CMOS Inverter
S. A. Ibrahim
29
Inverter with Load
Load (C
L
)
Delay
Assumptions: no load -> zero delay
C
L
t
p
= k R
eq
C
L
R
eq
k is a constant, equal to 0.69 (note ln (2) =0.69)
R
eq
S. A. Ibrahim
30
Layout-Aware Inverter with Load
C
int
C
L
Delay = kR
eq
(C
int
+ C
L
) = kR
eq
C
int
+ kR
eq
C
L
= kR
eq
C
int
(1+ C
L
/C
int
)
= Delay (Internal) + Delay (Load)
2W
W
Intrinsic or internal or self-loading cap
External loading cap
S. A. Ibrahim
31
Load
Delay
kR
EQ
C
int
Load
Delay = t
p
C
int
C
L
Delay = t
p
= kR
eq
C
int
(1+ C
L
/C
int
) = t
po
(1+ C
L
/C
gin
)
C
gn
C
gp
= 2C
cgn
2W
W
t
po
C
L
C
g
C
int
~ C
g
= C
g
S. A. Ibrahim
32
C
g
vs. C
int
C
int
= C
g
( for simplicity one assumes ~ 1, it is indeed close
to 1 for most submicron processes)
f = C
L
/C
g
= effective fanout (also called electrical effort)
t
po
= 0.69R
eq
C
int
(zero-load delay or intrinsic delay)
Delay = t
p
= kR
eq
C
int
(1+ C
L
/C
int
) = t
po
(1+ C
L
/C
g
)
= t
po
(1+ f /)
Relative (Normalized) Delay d = t
p
/t
po
= (1+ f /)
S. A. Ibrahim
33
Delay Formula
S. A. Ibrahim
34
Delay = t
p
= kR
eq
C
int
(1+ C
L
/C
int
) = t
po
(1+ C
L
/SC
g
)
t
po
= k(R
eq
/S) (SC
int
) = f(S) = kR
eq
C
int
Scaling W by a factor S: R
eq
R
eq
/S and C
int
SC
int
SC
int
R
eq
/S
R
eq
/S
Size of a Single Inverter for a Given C
L
(1)
S. A. Ibrahim
35
Size of a Single Inverter for a Given C
L
(2)
large area and
wasted effort
SC
int
>>C
L
Sizing an Inverter Chain
C
L
In Out
1 2 N
t
p
= t
p1
+ t
p2
+ + t
pN
|
|
.
|
\
|
+ =
+
j g
j g
po pj
C
C
t t
,
1 ,
1
L N g
N
i
j g
j g
p
N
j
j p p
C C
C
C
t t t =
|
|
.
|
\
|
+ = =
+
=
+
=
1 ,
1
,
1 ,
0
1
,
, 1
Load cap
as if a gate N+1 exists
S. A. Ibrahim
36
The Tapered Buffer
S. A. Ibrahim
37
C
int
fC
int
f
2
C
int
f
N-1
C
int
Optimum Delay and Number of Stages
1
/
g L
N
C C F f = =
When each stage is sized by f and has same eff. fanout f:
N
F f =
( ) / 1
0
N
p p
F Nt t + =
Path (N-stages) delay
Effective fanout of each stage:
f
S. A. Ibrahim
38
Optimum Number of Stages
For a given load, C
L
and given input capacitance C
in
Find optimal sizing f
( )
|
|
.
|
\
|
+ = + =
f f
f
F t
F Nt t
p
N
p p
ln ln
ln
1 /
0
/ 1
0
0
ln
1 ln
ln
2
0
=
=
c
c
f
f f
F t
f
t
p p
f
F
N C f C F C
g
N
g L
ln
ln
with
1 1
= = =
( ) f f + = 1 exp For = 0, f = e, N = ln F Special case: no self loading:
Exponential Buffer
S. A. Ibrahim
39
Optimum vs. in Tapered Buffers
S. A. Ibrahim
40
Exponential buff
(no self-loading)
tapered buff
(self-loading at = 1)
C
int
= C
g
3.6 (~ 4)
Buffer Design
1
1
1
1
8
64
64
64
64
4
2.8
8
16
22.6
N f t
p
1 64 65
2 8 18
3 4 15
4 2.8 15.3
( exponential)
(Typical instead of 3.6)
S. A. Ibrahim
41
FO4 Delay
Want a way to characterize the delay of a circuit
(roughly) independent of technology
Most common metric:
Delay of an inverter driving four copies of itself (t
FO4
)
S. A. Ibrahim
42
UTB=ultra-thin body
FinFET Key Benefits
L
g
Scaling: Supports smaller L
eff
at same I
off
Faster
V
g
-V
t
Scaling: Supports smaller V
t
at at same I
off
lower supply
Lower power
Smaller I
off
lower sub-threshold leakage
A versatile model for double-gate, triple gate, even cylindrical gate
FET is available now from BSIM group at Berkeley. Passed
Industry FinFET standard balloting in Jan. 2012. (BSIM-CMG)
Fabs: Intel, Global, IBM, Samsung, TSMC
S. A. Ibrahim
43
FinFET vs. Planar MOSFET
Increase effective width for a given footprint: increase
H
fin
and/or reduce fin pitch
S. A. Ibrahim
44
FinFET Inverter Layout
S. A. Ibrahim
45
Reading
Rabaey: Sections 1.3 and 3.3 and chapter 5.
Weste: Chapters 2 and 5 and Sections 2.1 and 2.2.
K. Roy, S. Mukhopadhyay, and H. Mahmoodi-Meimand,
Leakage current mechanisms and leakage reduction
techniques in deep-submicrometer cmos circuits,
Proceedings of the IEEE, vol 91, no. 2, pp. 305327, Feb
2003.
S. A. Ibrahim
46