Académique Documents
Professionnel Documents
Culture Documents
Tianhe2
K-Computer
Titan
Sequoia
Mira
Copyright
Facebook
Specifications
Performance* 27-51 PFLOP/s
• 2014 completion target Memory*
21-27 PB RAM
1900-6800 PB disk
• Cost: ~760 M$ Power 84 MW avg* (120 MW max)
Copyright
Facebook
Specifications
Performance* 27-51 PFLOP/s
• 2014 completion target Memory*
21-27 PB RAM
1900-6800 PB disk
• Cost: ~760 M$ Power 84 MW avg* (120 MW max)
Alternative
Computation Efficiency (LINPACK FLOPS/W)
Technology
Goal
1E+11
Exascale goal
(DoE), 20 MW
Goals
1E+10 Projected
Exascale Strawman
! (DARPA), Green500 #1
68 500 MW
Top500 #1
Titan
Sequoia
1E+9
K, Japan As of: 2012-11
Tianhe-1A, China
Jaguar
1E+8
2005 2010 2015 2020
IAC IAC
VDC VDC = Φ0⋅ƒclock
Ic = Ibias
Rbias Lbias Ibias
Ic Ibias
___
Ic = Ic
0.7
RSFQ ERSFQ RQL AQFP
Rapid Single Flux Quantum Energy-efficient Rapid Single Flux Reciprocal Quantum Logic (NGC) Adiabatic Quantum Flux Parametron
Quantum (Hypres)
Logic
0.3
Memory
0.3
• Toughest problem: scaling uncooled
components, primarily disk storage memory
Other
0.1 Interconnects
0.3
10 PFLOP/s
Conclusions:
system
Interconnects
• Priorities: Memory Logic System Interconnects
Logic
Tianhe-2
Coral
K-Computer Titan
10
Tianhe2
Sequoia
K-Computer
Power (MW)
Mira
Titan
Sequoia
1 Mira Top Computers
DOE Exascale Goal
Coral Goal (2017)
Superconducting, Projected
0.1
1 10 100 1000
Performance (PFLOP/s)
INTELLIGENCE ADVANCED RESEARCH PROJECTS ACTIVITY (IARPA)
2016-04, Holmes, Superconducting Computing and the IARPA C3 Program
15
Exascale Power Comparison
Conventional computer at 100 MW
Memory Superconducting
Logic
computer at 2 MW
Leakage
Interconnect
Biggest win is
superconducting
interconnect!
Cryostat
* TOP500, 2015-11
– Failure analysis
– Superconductivity expertise
• NIST Boulder
– Test & evaluation of performer circuits
– Government program support
Microelectronics Lab,
MIT Lincoln Laboratory
• Nb/Al-AlOx/Nb JJ technology
• 200 mm Si wafers, full planarization
INTELLIGENCE ADVANCED RESEARCH PROJECTS ACTIVITY (IARPA) 26
2016-04, Holmes, Superconducting Computing and the IARPA C3 Program
SFQ7ee process node (2016)
• 415 GHz maximum operating frequency of a toggle flip-flop (TFF) circuit was
demonstrated, a record for Josephson junctions with critical current density
Jc = 135 MA/m2.
• ERSFQ is an energy-efficient superconducting digital circuit family, proving
that both high speed and high efficiency are possible in the same circuit.
INTELLIGENCE ADVANCED RESEARCH PROJECTS ACTIVITY (IARPA) 30
2016-04, Holmes, Superconducting Computing and the IARPA C3 Program
RQL 8-bit Kogge-Stone Adder
• Carry look ahead adder (CLA) core
– Excluding: input shift registers, output amps
RQL “1”
– Fabricated in Hypres 45 MA/m2 process
Q. Herr et al., J. Appl. Phys. (2011)
JJs 815
time →
Ave IC 162 µA
Clock 6.2 GHz
Power 560 nW @ 6.2 GHz
with 1.2 mW clock
Energy ~5x10-5 pJ
Latency 150 ps @ 10 GHz
Area ~ 4 mm2
superconducting
H
Magnetic Tunnel Junction
Top
Bottom Electrode
Electrode Isense
H
I
Write Line 1
ON for
sensing,
OFF for
programming
I Ref
www.everspin.com
superconducting
3-terminal nanowire switch S-F transistor Holmes, Kadin, Johnson, Computer, Dec. 2015
D
Ibias
J2
Set Ls Out
Decision
Block
A J0 B J1 C J3 E
1E+9
1E+8
Switching devices per chip
1E+7
1E+6 Si-CPU
Si-GPU
1E+5
Si-FPGA
1E+4 SFQ
SFQ-design
1E+3
1E+2
1E+1
1E+0
1970 1980 1990 2000 2010 2020
1E+9
Exponential growth in
1E+8
MONEY drives
Switching devices per chip
1E+2
1E+1
1E+0
1970 1980 1990 2000 2010 2020
Nagy et al. (2013) Statistical Basis for Predicting Technological Progress. PLoS ONE 8(2): e52669
1E+8
1E+7
Switching device area density [mm-2]
1E+6
1E+5
Si-CPU
C3! Si-GPU
1E+4
Si-FPGA
SFQ
1E+3
SFQ-design
1E+2
1E+1
1E+0
1970 1980 1990 2000 2010 2020
2015-07
?