Académique Documents
Professionnel Documents
Culture Documents
Architecture
Niraj K. Jha
Dept. of Electrical Engineering
Princeton University
Si Fin
Independent-gate FinFETs
Oxide insulation
Back Gate
†J. Kao, S. Narendra and A. Chandrakasan, “Subthreshold leakage modeling and reduction
techniques,” in Proc. ICCAD, 2002.
Logic Styles: NAND Gates
IG-mode
pull up
pull up bias
voltage
Minimum-delay
synthesis in SG-mode
IG SG
netlist
Design
Synopsys libraries
Compiler
Power-optimized
IG/LP LP
mixed-mode netlists
Linear
programming SG+ SG+LP
†D. Chinnery and K. Keutzer, based cell IG/LP SG+IG
“Linear programming for sizing, Vdd selection
and Vt assignment,” in Proc.
ISLPED, 2005.
Power Consumption of Optimized Circuits
Available modes
N-type FinFETs
–
–
Vdd H Vdd L
Symmetric
Vdd H 1.08V threshold control
for P and N
Vdd L 1.0V
Vss H -0.08V
TCMS buffer
Vss L
0.0V Vss H Vss L
Exploratory Buffer Design
VHdd VLdd VHdd VLdd
i’
i
S1 S2 lopt S1 S2
• Benchmarks are nets extracted from real layouts and scaled to 32nm
http://dropzone.tamu.edu/~zhouli/GSRC/fast_buffer_insertion.html
Fin-count Savings
nor01100
X8 X2
inv101
nand01001
inv101
c b X2 c b X1
X16 X2
d inv101 d nor00111
X8 X4 nor10011 nand00110
X16 X8
X4 X4 X2 X1
inv101 nor01100
a a
X16 X8
X8 X2
X16 X8
Level: 1 2 3 4 inv101
Level : 1 2 3 4
% reduction in power
90
80
70
% reduction in power
60
50
40
30
20
10
0
110% 130% 150% 170% 190%
ATCs
Power-minimized vs Delay-
minimized Netlists at 130% ATC
TCMS TCMS (Single- Dual-Vdd
Vth
% reduction in
dynamic power
53.3 49.8 51.4
% reduction in
leakage power
95.8 95.7 95.8
% reduction in
total power
67.6 65.3 66.3
% reduction in
Fin-count
65.2 59.5 61.6
Talk Outline
• Background
• Low Power FinFET Circuits
– Unusual Logic Styles
– Unusual Dual-Vdd /Dual-Vth Circuits
• Architectural Impact
• Other Ongoing Work
• Conclusions
Orion-FinFET
• Extends ORION for FinFET-based power
simulation for interconnection networks
• FinFET power libraries for various
temperatures and technologies nodes
• Power breakdown of interconnection
networks for different FinFET modes
• Power comparison for different FinFET
modes under different traffic patterns
Router Microarchitecture & Pipeline Stages
VC allocation
arbiters
Req
Input buffers
From source
RC ……
.
From north
RC ……
.
Route
From south
RC
VC ……
Sw itch
calculation allocation. allocation
Power Simulation Flow
F in
mo d e
Run UFDG
S P IC Esim u la tio n
L o g ic-le v e l
ro u te r c irc u its F in F E Tlo
c h a ra c
F in c o u n t s p e c ific a tio n
fo r ol g ic g a te s C a p a c ita n c ele &a k a g e
ex tra c tio n
R o u te r tra ffic R o u te r F in F E
p ro file p ow e r m odel lib r
Power Breakdown for SG/LP Modes
• 4X4 mesh network: 5 ports/router, 48-flit buffer/port
• Flit width = 128 bits
• Clock frequency = 1GHz
0.03 0.7
0.025 0.6
0.5
0.02
0.4
Watt
Watt
0.015
0.3
0.01
0.2
0.005 0.1
0 0
SG LP 1.2/-0.2 LP 1.4/-0.4 SG LP 1.2/-0.2 LP1.4/-0.4
B uffer C rossba r A rbiter C lock Leakage Router Link Clk dist Driver leak
2
1.5
1
0.5
0
Bulk CMOS LP mode (1.2/-0.2)
0.012
SG
LP 1.2/-0.2
Leakage power (Watt)
0.01
0.008
0.006
0.004
0.002
0
25 35 45 55 65 75 85 95 105
Temperature
Talk Outline
• Background
• Low Power FinFET Circuits
– Unusual Logic Styles
– Unusual Dual-Vdd /Dual-Vth Circuits
• Architectural Impact
• Other ongoing work
• Conclusions
FinFET SRAM and Embedded DRAM
Design
• FinE: Two-tier FinFET simulation framework for
FinFET circuit design space exploration:
– Sentaurus TCAD+UFDG SPICE model
– Quasi Monte-Carlo simulation for process variation analysis
– Thermal analysis using ThermalScope
– Yield estimation
• Variation-tolerant ultra low-leakage FinFET SRAMs at
lower technology nodes
• Gated-diode FinFET embedded DRAMs
Extension of CACTI for FinFETs
• Selection of any of the FinFET SRAM and
embedded DRAM cells
• Use of any of the FinFET operating modes
• Scaling of FinFET designs from 32nm to 22nm,
16nm and 10nm technology nodes
• Accurately modeling the behavior of a wide
range of cache configurations
FPGA vs. ASICs
• Distributed non-volatile nano RAMs: main
storage for reconfiguration bits
CMOS fabrication Nano RAM
compatible on-chip storage • Fine-grain reconfiguration (even cycle-by-
cycle) and logic folding
More than an order of magnitude increase in logic
density and area-delay product
Competitive performance and moderate power
Run-time Temporal consumption
reconfiguration NATURE logic folding Non-volatility: useful in low power & secure
processing
• NanoMap to map application to NATURE
Significant area-delay trade-off flexibility
Design Logic
flexibility density
Conclusions
• FinFETs a necessary semiconductor evolution step
because of bulk CMOS scaling problems beyond 32nm
• Use of the FinFET back gate leads to very interesting
design opportunities
• Rich diversity of design styles, made possible by
independent control of FinFET gates, can be used
effectively to reduce total active power consumption
• TCMS able to reduce both delay and subthreshold
leakage current in a logic circuit simultaneously
• Time has arrived to start exploring the architectural
trade-offs made possible by switch to FinFETs