Académique Documents
Professionnel Documents
Culture Documents
Algorithm
Technology
Methodology
Andrew B. Kahng
1
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Introduction
zz This tutorial will cover "the latest word" in physical chip
implementation methodology and physical design (PD) algorithm
technology.
zz The target audience consists of
zz system and circuit designers who would benefit from
understanding tool capabilities in this arena,
zz CAD engineers (both R&D and support),
zz design project managers,
zz academic researchers.
zz Familiarity with basic PD methodology is assumed.
Andrew B. Kahng
2
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
1
Trade-Off: Depth vs. Breadth
zz Broad spectrum of possible material
zz Only ~6-7 hours for presentation
zz Not all possible topics covered in slides, not all slides covered
in talks
zz ask questions if you’d like to hear about something
something in
in particular,
particular,
esp. related to methodology or particular P&R techniques
z All tutorial materials will be available in softcopy at
zz http://vlsicad.cs.ucla.edu/ICCAD99TUTORIAL
zz http://www.ece.nwu.edu/nucad/ICCAD99TUTORIAL
Andrew B. Kahng
3
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Andrew B. Kahng
4
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
2
Modern Physical Design:
Algorithm
Technology
Methodology
(Part I)
Andrew B. Kahng UCLA
UCLA
Andrew B. Kahng
5
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Outline
z Technology trends
z Post-layout optimization methodologies
zz manufacturability and reliability
zz performance
z Custom or custom-on-the-fly methodologies
z Flavors of classic planning-based methodologies
z Implications for P&R
Andrew B. Kahng
6
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
3
Overall Roadmap Technology
Characteristics
YEAR OF FIRS T P RO DUCT S HIP M ENT 1997 1999 2002 2005 2008 2011 2014
TE C H N O LO G Y N O D E
250 180 130 100 70 50 35
D E N S E L IN E S (D R A M H A LF -P ITC H ) (n m )
IS O L A TE D LIN E S (M P U G A TE S ) (nm ) 200 140 100 70 50 35 25
Log ic (Low -V o lum e— A S IC )‡
U s ab le trans is tors /c m 2 (auto lay o ut) 8M 14M 24M 40M 64M 100 M 160 M
N onrec urring en gin eering c os t
50 25 15 10 5 2.5 1.3
/us able trans is tor (m ic roc ents )
N um ber o f C hip I/O s – M ax im um
C hip-to-pac k age (pa ds )
151 5 186 7 255 3 349 2 477 6 653 2 893 5
(hig h-p erform a nc e)
C hip-to-pac k age (pa ds )
758 934 127 7 174 7 238 6 326 8 447 0
(c os t-pe rform an c e)
N um ber o f P ac k ag e P ins /B alls – M ax im u m
M ic rop roc e s s or/c o ntrolle r
568 700 957 130 9 179 1 244 9 335 0
(c os t-pe rform an c e)
A S IC
113 6 140 0 191 5 261 9 358 1 489 8 670 0
(hig h-p erform a nc e)
P ac k age c os t (c ents /pin)
0.78-2.7 1 0.70-2.5 2 0.60-2.1 6 0.51-1.8 5 0.44-1.5 9 0.38-1.3 6 0.33-1.1 7
(c os t-pe rform an c e)
P ow er S u pply V oltage (V )
M in im um logic V dd (V ) 1.8–2.5 1.5–1.8 1.2–1.5 0.9–1.2 0.6–0.9 0.5–0.6 0.37-0.4 2
M ax im um P ow er
H igh-perform anc e w ith heat s ink (W ) 70 90 130 160 170 175 183
B attery (W )— (H an d-h eld ) 1.2 1.4 2 2.4 2.8 3.2 3.7
Andrew B. Kahng
7
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Andrew B. Kahng
8
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
4
Technology Scaling Trends
z Interconnect
zz Impact of scaling on parasitic capacitance
zz Impact of scaling on inductance coupling
zz Impact of new materials on parasitic capacitance & resistance
zz Trends in number of layers, routing pitch
z Device
zz Vdd
dd, Vtt, sizing
zz Circuit trends (multithreshold CMOS, multiple supply voltages,
dynamic CMOS)
zz Impact of scaling on power and reliability
Andrew B. Kahng
9
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Reachability in τcrit
crit
= 80 ps
25 x 25 mm chip
Andrew B. Kahng
10
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
5
Technology Scaling Trends
z Scaling of x0.7 every three years
zz .25u .18u .13u .10u .07u .05u
zz 1997 1999 2002 2005 2008 2011
zz 5LM 6LM 7LM 7LM 8LM 9LM
z Interconnect delay dominates system performance
zz consumes 70% of clock cycle
z Cross coupling capacitance is dominating
zz cross capacitance → 100%, ground capacitance → 0%
zz 90% in .18u
zz huge signal integrity implications (e.g., guardbands in static
analysis approaches)
z Multiple clock cycles required to cross chip
zz whether 3 or 15 not as important as fact of “multiple” > 1
Andrew B. Kahng
11
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Andrew B. Kahng
12
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
6
Technology Extrapolation Sensitivity
Andrew B. Kahng
13
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
http://vlsicad.cs.ucla.edu/GSRC/GTX/
Andrew B. Kahng
14
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
7
Andrew B. Kahng
15
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
18
Risk Factors: 14
Interconnect Delay
Signal Integrity 12
Electromigration
Process Variations 10
2
0.13
0.18
0.25
0.35
0.5
Technology (µ)
Andrew B. Kahng
16
Courtesy Hormoz/Muddu, ASIC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
8
Scaling of Noise with Process
z Cross coupling noise increases with
zz process shrink
zz frequency of operation
z Propagated noise increases with decrease in noise
margins
zz decrease in supply voltage
zz more extreme P/N ratios for high speed operation
z IR drop noise increases with
zz complexity of chip size
zz frequency of chip
zz shrinking of metal layers
layers
Andrew B. Kahng
17
Courtesy Hormoz/Muddu, ASIC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Andrew B. Kahng
18
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
9
Technical Issues in UDSM Design
z New issues and problems arising in UDSM technology
zz catastrophic yield: critical area, antennas
zz parametric yield: density control (filling) for CMP
zz parametric yield: subwavelength lithography implications
zz optical
optical proximity
proximity correction
correction (OPC)
(OPC)
zz phase-shifting
phase-shifting mask
mask design
design (PSM)
(PSM)
zz signal integrity
zz crosstalk
crosstalk and
and delay
delay uncertainty
uncertainty
zz DC
DC electromigration
electromigration
zz AC
AC self-heat
self-heat
zz hot
hot electrons
electrons
z Current context: cell-based place-and-route methodology
zz placement and routing formulations, basic technologies
zz methodology contexts
Andrew B. Kahng
19
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Andrew B. Kahng
20
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
10
Why Now?
z These effects have always existed, but become worse
at UDSM sizes because of:
zz finer geometries
zz greater
greater wire
wire and
and via
via resistance
resistance
zz higher
higher electric
electric fields
fields ifif supply
supply voltage
voltage not
not scaled
scaled
zz more metal layers
zz higher
higher ratio
ratio of
of cross
cross coupling
coupling to to grounded
grounded capacitance
capacitance
zz lower supply voltages
zz more
more current
current for
for given
given power
power
zz lower device thresholds
zz smaller
smaller noise
noise margins
margins
z Focus on interconnect
zz susceptible to patterning difficulties
zz CMP,
CMP, optical
optical exposure,
exposure, resist
resist development/etch,
development/etch, CVD,
CVD, ...
...
zz susceptible to defects
zz critical
critical area,
area, critical
critical volume
volume
Andrew B. Kahng
21
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Issues in IC Design
SYSTEM SUBSYSTEM CIRCUIT PHYSICAL MANUFACTURE
DESIGN DESIGN DESIGN DESIGN INTERFACE
New Figure 4 (Draft Rev. B, 3-12-99) Red denotes most challenging activity
11
ASSP: 44% WallTime, 39% Total Effort
After First Tape-out
Time and Effort Allocation by First Tape-out
100%
First Tape-out
Percent of Total Project Effort
80%
(Man-Weeks)
60%
40%
61%
39%
20%
56% 44%
0%
0% 20% 40% 60% 80% 100%
7X
6X
5X
4X
3X
2X
1X
0
06/94 06/95 06/96 06/97 06/98
Project Start Date
* Data Source: Collett International Inc.’s Design Productivity Management SystemTM (DPMS) database.
** Methodology: The design productivity trendline is the ordinary-least-squares (OLS) regression line. 27% is the compound
annual growth rate between 06/94 & 06/98.
*** ASSP (Application Specific Standard Product): Standard “off--the-shelf” IC product that has been designed to implement
Andrew B. Kahng
a specific application function.
24
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
12
Silicon Complexity and Design
Complexity
z Silicon complexity: physical effects cannot be ignored
zz fast but weak gates; resistive and cross-coupled interconnects
zz subwavelength lithography from 350nm generation onward
zz delay, power, signal integrity, manufacturability, reliability all
become first-class objectives along with area
z Design complexity: more functionality and
customization, in less time
zz reuse-based design methodologies for SOC
z Interactions increase complexity
zz need robust, top-down, convergent design methodology
Andrew B. Kahng
25
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Andrew B. Kahng
26
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
13
Implications of Complexity
z UDSM: Silicon complexity + Design complexity
zz convergent design: must abstract what’s beneath
zz prevention
prevention with
with respect
respect to
to analysis/verification
analysis/verification checks
checks
zz many
many issues
issues to
to worry
worry about
about (all
(all are
are “first-class
“first-class citizens”
citizens”
zz apply
apply methodology
methodology (P/G/clock
(P/G/clock design,
design, circuit
circuit tricks,
tricks, …)
…) whenever
whenever possible
possible
zz must concede loss of clean abstractions: need unifications
zz synthesis
synthesis and
and analysis
analysis in
in tight
tight loop
loop
zz logic
logic and layout : chip implementation planning
and layout : chip implementation planning methodologies
methodologies
zz layout
layout and
and manufacturing
manufacturing :: CMP/OPC/PSM,
CMP/OPC/PSM, yield,
yield, reliability,
reliability, SI,
SI, statistical
statistical
design,
design, ……
zz must hit function/cost/TAT points that maximize $/wafer
zz reuse-based
reuse-based methodology
methodology
zz need
need for differentiating IP
for differentiating IP →→ custom-ization
custom-ization
Andrew B. Kahng
27
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Outline
z Technology trends
z Post-layout optimization methodologies
zz manufacturability and reliability
zz performance
z Custom or custom-on-the-fly methodologies
z Flavors of classic planning-based methodologies
z Implications for P&R
Andrew B. Kahng
28
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
14
Example: Defect-related Yield Loss
z High susceptibility to spot defect-related yield loss,
particularly in metallization stages of process
z Most common failure mechanisms: shorts or opens due
to extra or missing material between metal tracks
z Design tools fail to realize that values in design manuals
are minimum values, not target values
z Spot defect yield loss modeling
zz extremely well-studied field
zz first-order yield prediction: Poisson yield model
zz critical-area model much more successful
zz fatal defect types (two types of short circuits, one type of open)
Andrew B. Kahng
29
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
fatal defect types (two types of short circuits, one type of open)
Andrew B. Kahng
30
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
15
Critical Area for Short Circuits
Andrew B. Kahng
31
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Critical Area
for Shorts
Andrew B. Kahng
32
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
16
Approaches to Spot Defect Yield
Loss
z Modify wire placements to minimize critical area
z Router issue
zz router understands critical-area analyses, optimizations
zz spread, push/shove (gridless, compaction technology)
zz layer reassignment, via shifting (standard capabilities)
zz related: via doubling when available, etc.
z Post-processing approaches in PV are awkward
zz breaks performance verification in layout (if layout has been
changed by physical verification)
zz no easy loop back to physical design: convergence problems
Andrew B. Kahng
33
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Example: Antennas
z Charging in semiconductor processing
zz many process steps use plasmas, charged particles
zz charge collects on conducting poly, metal surfaces
zz capacitive coupling: large electrical fields over gate oxides
zz stresses cause damage, or complete breakdown
zz induced Vtt shifts affect device matching (e.g., in analog)
Andrew B. Kahng
34
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
17
Antennas
z Charging in semiconductor processing
z Standard solution: limit antenna ratio
zz antenna ratio = (Apoly
poly + AM1
M1 + ... ) / Agate-ox
gate-ox
zz e.g., antenna ratio < 300
Mx ≡ metal (x) area electrically connected to node without using
zz AMx
metal (x+1), and not connected to an active area
Andrew B. Kahng
35
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Antennas
z Charging in semiconductor processing
z Standard solution: limit antenna ratio
z General solution == bridging (break antenna by moving
route to higher layer)
z Antennas also solved by protection diodes
zz not free (leakage power, area penalties)
z Basically, annoying-but-solved problem
zz not clear whether today’s approaches scale into the future
zz (today, mostly post-processing approaches)
Andrew B. Kahng
36
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
18
Wire Spacing and Layout Methodology
Andrew B. Kahng
37
Courtesy M. Berkens, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Andrew B. Kahng
38
Courtesy M. Berkens, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
19
Wire Spacing and Shielding
Andrew B. Kahng
39
Courtesy M. Berkens, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Andrew B. Kahng
40
Courtesy M. Berkens, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
20
Wire spacing example
Andrew B. Kahng
41
Courtesy M. Berkens, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Outline
z Technology trends
z Post-layout optimization methodologies
zz manufacturability and reliability
zz performance
z Custom or custom-on-the-fly methodologies
z Flavors of classic planning-based methodologies
z Implications for P&R
Andrew B. Kahng
42
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
21
Performance Optimization Methodology
z Tradeoffs: Speed / Power / Area
z Must compromise and choose between often competing
criteria
z For given criteria (constraints) on some variables, make
best choice for free variables (min cost) => Need to be
on boundary of feasible region
Andrew B. Kahng
43
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Optimization
Methods
Reorganize Logic
Re r
tim ffe
e Bu
Size
e
ac
Sp
Andrew B. Kahng
44
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
22
Optimization at Layout Level
z Size Transistors
z Space/size wires
z Add/delete buffers
z Modify circuit locally
Andrew B. Kahng
45
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Transistor Sizing
Area Delay Curve
Infeasible
Region X
Min cost
No assignment of sizes can produce a result here
Andrew B. Kahng
46
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
23
Transistor sizing
What will it buy me?
z Scenario: Lots of capacitance in wires
zz will it buy me speed: Yes
zz will is save me power: “Yes” (qualified)
Architecture cannot
Architecture is
Area satisfy application
an overkill for
(increase parallelism)
Delay cannot this application
be improved Architecture and
at any cost application are
well matched
Delay can be improved
at almost no cost
Delay
Andrew B. Kahng
47
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Transistor Sizing
Convexity + Dual Goals
5ns 10ns
Optimal
point for Circuits of constant cost W1 +W2 = Cte
10ns
W2
(size of Xtr2)
Note: Actually circuit delay is Posynomial ~ Convex
Andrew B. Kahng
48
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
24
Transistor Sizing
Methods
z Exact Solutions
zz gradient Search
zz convex Programming
z Approximate methods (very good solutions)
zz iterative improvement on critical path (e.g. TILOS)
Andrew B. Kahng
49
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Convex Programming
Outside Delay Case
z Add more and more bounds
zz guess new solution (deep) inside bounds
W2
Andrew B. Kahng
50
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
25
Convex Programming
Inside Delay Case
z New guess delay is adequate but try and improve cost
Bound 1
W1 Add a bound to force
New guess search into region
of lower cost. New bound
Bound 3
Required
is constant cost curve
Delay
X passing through new
N
nd 2
bo
bound.
W2
Andrew B. Kahng
51
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Transistor Sizing
Approximate Solutions
ta
nt
W2
Andrew B. Kahng
52
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
26
Transistor Sizing
TILOS method
3
Critical Path
4
Effective speedup
per unit area
Andrew B. Kahng
53
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Critical path
zz Increase Islow
slow
until capacitive power increase for driving Islow
slow
is more
than decrease in S.C. power
zz sweep
sweep circuit
circuit from
from outputs
outputs to
to inputs
inputs
Andrew B. Kahng
54
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
27
TILOS Optimization Trajectory
.
uit
Ci e
Infeasible
S. d u c
X fix timin
rc
Region g
Re
X
Andrew B. Kahng
55
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Buffer Insertion
Area delay tradeoffs
W
With
it h
Feasible Region
Area
bu
ou
t bu
feasible regions
r
ffer
Area of Min
Optimization Size buffer
Add buffer at
Trajectory
this point
Delay
Andrew B. Kahng
56
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
28
Local Re-synthesis
z Pass Xtr re-synthesis, logic reorganization
z Gate collapsing
Tp
P1 P3
P2
N3
N1 N2 Tn
Andrew B. Kahng
57
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Gate Collapsing
Example
Andrew B. Kahng
58
Courtesy Bamji, DAC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
29
Outline
z Technology trends
z Post-layout optimization methodologies
zz manufacturability and reliability
zz performance
z Custom or custom-on-the-fly methodologies
z Flavors of classic planning-based methodologies
z Implications for P&R
Andrew B. Kahng
59
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Andrew B. Kahng
60
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
30
Custom Methodology in ASIC(?) / COT
Andrew B. Kahng
61
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
z Architecture
zz heavy pipelining
zz fewer logic levels between latches
z Dynamic logic
zz used on all critical paths
z Hand-crafted circuit topologies, sizing and layout
zz good attention to design reduces guardbands
Andrew B. Kahng
62
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
31
Custom Methodology in ASIC(?) / COT
z ASIC market forces (IP differentiation) will define needs
for xtor-level analyses and syntheses
z Flexible-hierarchical top-down methodology
zz basic strategy: iteratively re-optimize chunks of the design as
defined by the layout, i.e., cut out a piece of physical hierarchy,
reoptimize it (“peephole optimization”)
zz for
for timing/power/area
timing/power/area (e.g.,
(e.g., for
for mismatched
mismatched input
input arrival
arrival times,
times,
slews)
slews)
zz for
for auto-layout
auto-layout (e.g.,
(e.g., pin
pin access
access and
and cell
cell porosity
porosity for
for router)
router)
zz for
for manufacturability (density control, critical area, phase-
manufacturability (density control, critical area, phase-
assignability)
assignability)
zz DOF’s:
DOF’s: diffusion
diffusion sharing,
sharing, sizing,
sizing, new
new mapping
mapping // circuit
circuit topology
topology
sol’s
sol’s
zz chunk
chunk size:
size: as
as large
large as
as possible
possible (tradeoff
(tradeoff between
between near-optimality,
near-optimality,
CPU
CPU time)
time)
zz antecedents: IBM C5M, Motorola CELLERITY, DEC CLEO
zz “infinite library”recovers performance, density that a 300-cell
library and classic cell-based flow leave on the table
Andrew B. Kahng
63
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
32
Custom Methodology in ASIC(?) / COT
Andrew B. Kahng
65
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Outline
z Technology trends
z Post-layout optimization methodologies
zz manufacturability and reliability
zz performance
z Custom or custom-on-the-fly methodologies
z Flavors of classic planning-based methodologies
z Implications for P&R
Andrew B. Kahng
66
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
33
Sylvester-Keutzer: Classic Picture
Andrew B. Kahng
67
Sylvester-Keutzer, Computer Nov. 99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Sylvester-Keutzer: Combining
Logical and Physical
Andrew B. Kahng
68
Sylvester-Keutzer, Computer Nov. 99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
34
Required Advance in Design System Architecture
Yesterday 1000nm Today 180nm Tomorrow 50nm
Functional
Software Performance
Perf. SPEC
Logic Design Hw/Sw Testability
Design Optimization Model Verification
Functional
Cockpit
Verification
RTL SW Auto-Pilot
RTL SW Optimize Analyze
Opt Hw/Sw Comm. Perf.
Synthesis SW Timing
EQ check
Synthesis Logic Hw/Sw Power
+ Timing Analysis Circuit Data Noise
Equivalence checking
File + Placement Opt Model
Place Test
Wire Mfg.
Timing Analysis Performance other
Repository
other
File Testability
Verification
Functional
File MASKS
Verification
Place/Wire
+ Timing Analysis
Place/Wire + Logic Opt
File
File
Multiple design files are converged into one efficient Data Model
Timing Analysis Disk accesses are eliminated in critical methodology loops
MASKS Verification of Function, Performance, Testability and other design
Performance criteria all move to earlier, higher levels of abstraction followed by
File Verification equivalence checking and
assertion driven design optimizations
Testability Industry Standard interfaces for data access and control
MASKS Verification Incremental modular tools for optimization and analysis
New Table 8
Planning / Implementation
Methodologies
z Centered on logic design
zz wire-planning methodology with block/cell global placement
zz global routing directives passed forward to chip finishing
zz constant-delay methodology may be used to guide sizing
z Centered on physical design
zz placement-driven or placement-knowledgeable logic synthesis
z Buffer between logic and layout synthesis
zz placement, timing, sizing optimization tools
z Centered on SOC, chip-level planning
zz interface synthesis between blocks
zz communications protocol, protocol implementation decisions
guide logic and physical implementation
Andrew B. Kahng
70
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
35
Planning / Implementation
Methodologies
z Centered on logic design
zz wire-planning methodology with block/cell global placement
zz global routing directives passed forward to chip finishing
zz constant-delay methodology may be used to guide sizing
z Centered on physical design
zz placement-driven or placement-knowledgeable logic synthesis
z Buffer between logic and layout synthesis
zz placement, timing, sizing optimization tools
z Centered on SOC, chip-level planning
zz interface synthesis between blocks
zz communications protocol, protocol implementation decisions
guide logic and physical implementation
Andrew B. Kahng
71
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
RTL Design
Extraction
36
Performance Optimization Methodology
z Design Optimization
zz global restructuring optimization -- logic optimization on layout
using actual RC, noise peak values etc.
zz localized optimization -- with no structural changes and least
layout impact
zz repeater/buffer insertion for global wires
z Physical optimization
zz high fanout net synthesis (eg. for clock nets); buffer trees to meet
delay/skew and fanout requirements
zz automatically determine network topology (# levels, #buffers, and
type of buffers)
zz wire sizing, spacing, shielding etc.
z Fixing timing violations automatically
zz fix setup/hold time violations
zz fix maximum slew and fanout violations
Andrew B. Kahng
73
Courtesy Hormoz/Muddu, ASIC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
GL
Total
RCw
Delay=Gi+GL+RCw
Gi
Gi = Intrinsic Gate Delay
60%
GL = Gate Delay from Load
RCw= Delay from Interconn
Loading
20% 25%
20%
10% Critical Path Delay
5%
0% 0% 0% Electrical Optimization
Gi GL RCw Logic Optimization
50K gate Block at 0.18 microns
Andrew B. Kahng
74
Courtesy Hormoz/Muddu, ASIC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
37
KEY ISSUE: PREDICTABILITY
Andrew B. Kahng
75
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Andrew B. Kahng
76
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
38
The Problem With Hierarchies
Andrew B. Kahng
77
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Interconnect Complexities
zz Interconnect effects play a major role in the increasing costs for
large hard-block or rectilinear-outline based design styles
zz Probabilistic wireload models fail
zz Without new capabilities for soft IP design and assembly,
interconnect problems will significantly impact performance and cost
for emerging IC technologies
Local wires
Occurrence Rate
blocks
(Normalized)
global
Global wires wires
~0.5 wirelength
die _ size
Andrew B. Kahng
78
Courtesy Pileggi, MARCO GSRC ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
39
Technology Scaling
zz Block sizes cannot grow as rapidly as chip sizes since block design
becomes increasingly more difficult --- each block is a chip design
over multiple configurations
zz If the blocks are inflexible, the global wiring problems begin to
dominate all aspects of performance quality and system cost
Occurrence Rate
(Normalized)
Soft Blocks
zz With soft,
soft, flexible
flexible blocks, the system assembly can more thoroughly
exploit the available technology
zz Interconnect problem is controlled via: soft boundaries for area re-
shaping; re-synthesis and re-mapping for timing; smart wires; and
top-down specified block synthesis
zz Cf. “Amoeba” placement, coloring analysis of “good” placements
with respect to original logic hierarchy, etc.
Occurrence Rate
(Normalized)
Superior timing,
wirelength power and cost
~0.5
die _ size
Andrew B. Kahng
80
Courtesy Pileggi, MARCO GSRC ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
40
Soft-Block Assembly
zz Hard rectilinear blocks make prediction of global wires extremely
difficult
zz Top-down constraint-driven assembly of soft fabrics: ability to
significantly restructure circuit level blocks during the assembly
process helps reach performance goals
zz For
For example,
example, timing-critical
timing-critical interconnect
interconnect paths
paths can
can be
be completely
completely
restructured
restructured during
during assembly
assembly without
without changing
changing any
any of
of the
the system
system level
level
specification
specification
zz Key issue: how to determine the soft blocks in the first place
zz non-classical
non-classical partitioning
partitioning objectives:
objectives: area
area sensitivity,
sensitivity, functional
functional and
and
clocking
clocking structure,
structure, critical
critical timing-path
timing-path awareness,
awareness, matching
matching capabilities
capabilities
of
of block
block placer
placer
zz block
block placement:
placement: largely
largely unsolved
unsolved issue
issue
zz unclear
unclear whether
whether packing-centric
packing-centric or
or connectivity-centric
connectivity-centric approaches
approaches are
are best
best
Andrew B. Kahng
81
Courtesy Pileggi, MARCO GSRC ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Outline
z Technology trends
z Post-layout optimization methodologies
zz manufacturability and reliability
zz performance
z Custom or custom-on-the-fly methodologies
z Flavors of classic planning-based methodologies
z Implications for P&R
Andrew B. Kahng
82
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
41
Cell-Based P&R: Classic Context
z Architecture design
zz golden
golden microarchitecture
microarchitecture design,
design, behavioral
behavioral model,
model, RT-level
RT-level structural
structural
HDL
HDL passed
passed to
to chip
chip planning
planning
zz cycle
cycle time
time and
and cycle-accurate
cycle-accurate timing
timing boundaries
boundaries established
established
zz hierarchy
hierarchy correspondences
correspondences (structural-functional,
(structural-functional, logical
logical (schematic)
(schematic)
and
and physical)
physical) well-established
well-established
z Chip planning
zz hierarchical
hierarchical floorplan,
floorplan, mixed
mixed hard-soft
hard-soft block
block placement
placement
zz block
block context-sensitivity:
context-sensitivity: no-fly,
no-fly, layer
layer usage,
usage, other
other routing
routing constraints
constraints
zz route
route planning
planning of
of all
all global
global nets
nets (control/data
(control/data signals,
signals, clock,
clock, P/G)
P/G)
zz induces
induces pin
pin assignments/orderings,
assignments/orderings, hard
hard (partial)
(partial) pre-routes,
pre-routes, etc.
etc.
z Individual block design -- various P&R methodologies
z Chip assembly -- possibly implicit in above steps
z What follows: qualitative review of key goals, purposes
Andrew B. Kahng
83
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Placement Directions
z Global placement
zz engines (analytic, top-down partitioning based, (iterative annealing
based) remain the same; all support “anytime” convergent solution
zz becomes more hierarchical
zz block
block placement,
placement, latch
latch placement
placement before
before “cell
“cell placement”
placement”
zz support placement of partially/probabilistically specified design
z Detailed placement
zz LEQ/EEQ substitution
zz shifting, spacing and alignment for routability
zz ECOs for timing, signal integrity, reliability
zz closely tied to performance analysis backplane (STA/PV)
zz support incremental “construct by correction” use model
Andrew B. Kahng
84
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
42
Function of a UDSM Router
z Ultimately responsible for meeting specs/assumptions
zz slew, noise, delay, critical-area, antenna ratio, PSM-amenable …
z Checks performability throughout top-down physical impl.
zz actively understands, invokes analysis engines and macromodels
z Many functions
zz circuit-level IP generation: clock, power, test, package substrate
routing
zz pin assignment and track ordering engines
zz monolithic topology optimization engines
zz owns key DOFs: small re-mapping, incremental placement,
device-level layout resynthesis
zz is hierarchical, scalable, incremental, controllable, well-
characterized (well-modeled), detunable (e.g., coarse/quick
routing), ...
Andrew B. Kahng
85
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Andrew B. Kahng
86
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
43
Routing Directions
z Cost functions and constraints
zz rich
rich vocabulary,
vocabulary, powerful
powerful mechanisms
mechanisms to
to capture,
capture, translate,
translate,
enforce
enforce
z Degrees of freedom
zz wire
wire widths/spacings,
widths/spacings, shielding/interleaving,
shielding/interleaving, driver/repeater
driver/repeater sizing
sizing
zz router empowered to perform small logic resyntheses
router empowered to perform small logic resyntheses
z “Methodology”
zz carefully
carefully delineated
delineated scopes
scopes of
of router
router application
application
zz instance
instance complexities remain tractable due
complexities remain tractable due to
to hierarchy
hierarchy and
and
restrictions
restrictions (e.g.,
(e.g., layer
layer assignment
assignment rules)
rules) that
that are
are part
part of
of the
the
methodology
methodology
z Change in search mechanisms
zz iterative
iterative ripup/reroute
ripup/reroute replaced
replaced by
by “atomic
“atomic topology
topology synthesis
synthesis
utilities”:
utilities”: construct
construct entire topologies to
entire topologies to satisfy
satisfy constraints
constraints in
in
arbitrary
arbitrary contexts
contexts
z Closer alignment with full-/automated-custom view
zz “peephole
“peephole optimizations”
optimizations” of
of layout
layout are
are the
the natural
natural extensions
extensions of
of
Motorola
Motorola CELLERITY,
CELLERITY, IBM
IBM CM5,
CM5, etc.
etc. methodologies
methodologies Andrew B. Kahng
87
ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
Noise Sources
z Analog design concerns are due physical noise sources
zz because of discreteness of electronic charge and stochastic
nature of electronic transport processes
zz example: thermal noise, flicker noise, shot noise
z Digital circuits due to large, abrupt voltage swings,
create deterministic noise which is several orders of
magnitude higher than stochastic physical noise
zz still digital circuits are prevalent because hey are inherently
immune to noise
z Technology scaling and performance demands made
noisiness of digital circuits a big problem
Andrew B. Kahng
88
Courtesy Hormoz/Muddu, ASIC99 ICCAD Tutorial: November 11, 1999 C Majid Sarrafzadeh
44