Académique Documents
Professionnel Documents
Culture Documents
Keynote
K t Add
Address:
Multicore, Meganonsense, and the
Future, if We Get It Right
Brought to you by:
1
3/17/2011
Yale Patt
The University of Texas at Austin
2
3/17/2011
Copyright 2011,
2011 Yale N.
N Patt
3
3/17/2011
• What is Multicore?
• Who
Wh is
i You?
Y ?
• What is Work?
4
3/17/2011
Problem
Algorithm
Program
Microarchitecture
Circuits
Electrons
5
3/17/2011
Problem
Algorithm
Program
Microarchitecture
Circuits
Electrons
6
3/17/2011
Problem
Algorithm
Program
Microarchitecture
Circuits
Electrons
7
3/17/2011
Problem
Algorithm
Program
Microarchitecture
Circuits
Electrons
8
3/17/2011
• Multi-core
– How we got to where we are
– Some of the not unexpected Mega-nonsense
• How do we satisfy
y both
9
3/17/2011
• What is Multicore?
• Who
Wh is
i You?
Y ?
• What is Work?
10
3/17/2011
• Today
– more than one billion transistors
– Frequencies
F i ini excess off 5 GHz
GH
• Tomorrow ?
11
3/17/2011
Cache
nsistors
Microprocessor
Number of Tran
Tim e
12
3/17/2011
Cache
nsistors
Microprocessor
Number of Tran
Tim e
13
3/17/2011
Cache
nsistors
Microprocessor
Number of Tran
Tim e
14
3/17/2011
Cache
nsistors
Microprocessor
Number of Tran
Tim e
15
3/17/2011
Cache
nsistors
Microprocessor
Number of Tran
Tim e
16
3/17/2011
• Tomorrow: ???
17
3/17/2011
• Yes,
Yes Multi-core
Multi core is a reality
– We will have 1000 core chips
18
3/17/2011
19
3/17/2011
20
3/17/2011
21
3/17/2011
22
3/17/2011
core core Niagara Niagara Niagara Niagara Niagara Niagara Niagara Niagara
-like -like -like -like -like -like -like -like
core core core core core core core core
23
3/17/2011
24
3/17/2011
Large Small
Core Core
• Out-of-order • In-order
• Wid fetch
Wide f t h e.g. 4-wide
4 id • Narrow Fetch e.g. 2-wide
• Deeper pipeline • Shallow pipeline
• Aggressive branch • Simple branch predictor
predictor (e.g. hybrid)
(e.g. Gshare)
• Many functional units
• Few functional units
• Trace cache
• Memory dependence
speculation
25
3/17/2011
9
Niagara
Speedup vs. 1 Large Core
8
7 Tile-Large
6 ACMP
5
4
3
2
1
0
0 0.2 0.4 0.6 0.8 1
Degree of Parallelism
26
3/17/2011
27
3/17/2011
28
3/17/2011
ILP is dead
29
3/17/2011
30
3/17/2011
Moore’s Law
• A law of physics
• A law of process technology
• A law of microarchitecture
• A law of psychology
31
3/17/2011
32
3/17/2011
• Sorting
33
3/17/2011
34
3/17/2011
35
3/17/2011
36
3/17/2011
37
3/17/2011
38
3/17/2011
• What is Multicore?
• Who
Wh is
i You?
Y ?
• What is Work?
39
3/17/2011
40
3/17/2011
• What is Multicore?
• Who
Wh is
i You?
Y ?
• What is Work?
41
3/17/2011
42
3/17/2011
43
3/17/2011
Problem
Algorithm
Program
Microarchitecture
Circuits
Electrons
44
3/17/2011
• The Refrigerator
• X + Superscalar
• Th
The algorithm,
l ith the
th language,
l the
th compiler,
il
& the microarchitecture all working together
45
3/17/2011
• Compiler,
Compiler Microarchitecture
– Multiple levels of cache
– Block-structured ISA
– Part by compiler, part by uarch
– Fast track, slow track
• Microarchitecture, Circuits
– Verification Hooks
– Internal fault tolerance
46
3/17/2011
The problem:
47
3/17/2011
48
3/17/2011
• For example:
– Factorial
– Streaming
49
3/17/2011
50
3/17/2011
51
3/17/2011
• IF we are willing
g to break the layers
y
– Because we don’t have a choice anymore
52
3/17/2011
Most importantly,
• That means
– Chip designers understanding what algorithms need
– Algorithms folks understanding what hardware provides
– Knowledgeable Software folks bridging the two interfaces
53
3/17/2011
54
3/17/2011
Thank you!
55
3/17/2011
Keynote Address:
Multicore, Meganonsense,
and the Future, if We Get It Right
56