Processors
Some Examples
– AMD
– SUN Microsystems
1
Focus
2
Intel Core Duo
• Homogeneous cores
• Bus-based on-chip interconnect
• Shared Memory
• Traditional I/O
3
The Core™ micro-architecture
4
Intel Core Duo
• High memory latency due to the lack of an on-die memory controller (further aggravated by the system chipset's use of DDR2 RAM)
• Main-memory transactions have to pass through the Northbridge of the chipset
– Higher latency than AMD's Turion platform
– Weakness shared by the entire line of Pentium processors
– L2-cache is quite effective at hiding main-memory latency
• Execution units
– Three 64-bit integer exec units
– one CIU (complex) + two SIU (simple)
– Two FPUs
– Poor Floating Point Unit (FPU) throughput
• Little improvement in "performance per watt" in single-threaded applications compared to its predecessor
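The point that an effective L2 cache hides main-memory latency can be made concrete with the standard average-access-time relation. The sketch below is illustrative only: the function name and all latency and miss-rate values are hypothetical, not measurements of the Core Duo or the Turion.

```python
def average_access_time_ns(l2_hit_ns, l2_miss_rate, memory_ns):
    """Average time of an access that reaches L2: hit time plus the
    miss-weighted main-memory latency."""
    if not 0.0 <= l2_miss_rate <= 1.0:
        raise ValueError("l2_miss_rate must be between 0 and 1")
    return l2_hit_ns + l2_miss_rate * memory_ns

# Hypothetical numbers: the same L2, with main memory reached either
# through a Northbridge or through an on-die memory controller.
via_northbridge = average_access_time_ns(5.0, 0.02, 100.0)
via_on_die_controller = average_access_time_ns(5.0, 0.02, 70.0)
print(via_northbridge, via_on_die_controller)
```

With a low L2 miss rate, the gap between the two averages is much smaller than the gap between the raw memory latencies, which is the effect the slide describes.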
ECE 4100/6100 (9)
5
Intel® Core™2 Duo Processor
Process Technology 65 nm
Line Width 65 nm
6
Wide Dynamic Execution
7
Wide Dynamic Execution
8
Intelligent Power Capability
• SpeedStep technology
– Dynamic clock speed reduction
– Intel mobile processors include this already
– Enhanced SpeedStep used in Core 2 Duo
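Why reducing clock speed saves power can be sketched with the usual approximation that dynamic CMOS power scales with V² · f. The function name and the operating points below are hypothetical and are not Intel specifications; the sketch assumes that supply voltage can be lowered together with the clock.

```python
def relative_dynamic_power(freq_scale, voltage_scale):
    """Return dynamic power relative to the baseline, using P ~ V^2 * f."""
    if freq_scale <= 0 or voltage_scale <= 0:
        raise ValueError("scale factors must be positive")
    return voltage_scale ** 2 * freq_scale

# Hypothetical operating points.
clock_only = relative_dynamic_power(0.5, 1.0)         # clock halved, voltage unchanged
clock_and_voltage = relative_dynamic_power(0.5, 0.8)  # clock halved, voltage lowered by 20%
print(clock_only, clock_and_voltage)
```

Lowering the voltage along with the clock gives a larger saving than lowering the clock alone, because voltage enters the relation squared.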
9
Advanced Smart Cache
10
Smart Memory Access
Example from http://arstechnica.com/articles/paedia/cpu/core.ars/8
Execution without memory disambiguation
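As a rough illustration of the decision that memory disambiguation makes, the sketch below models one load queued behind older stores whose addresses are not yet known. It is a simplified model, not Intel's implementation: the function name, the boolean predictor input, and the addresses are all hypothetical.

```python
def issue_load(load_addr, older_store_addrs, predict_no_alias):
    """Model one load queued behind older stores with unresolved addresses.

    Returns (issued_early, must_replay).
    """
    if not predict_no_alias:
        # Conservative path (no disambiguation): the load waits until
        # every older store has executed, so it never needs a replay.
        return (False, False)
    # Speculative path: the load runs ahead of the stores. Once the store
    # addresses resolve, a match means the load read stale data.
    must_replay = load_addr in older_store_addrs
    return (True, must_replay)

print(issue_load(0x100, [0x200, 0x300], predict_no_alias=True))
print(issue_load(0x100, [0x100], predict_no_alias=True))
print(issue_load(0x100, [0x100], predict_no_alias=False))
```

The first call is the profitable case: the load does not alias any older store and completes early. The second is a misprediction that costs a replay, and the third is the conservative behavior without disambiguation.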
11
Advanced Digital Media Boost
12
Intel Conroe Vs Presler
Conroe Presler
[Figure: roadmap of enterprise, desktop, and mobile processors, labeled by core count and cache size: SC 512KB/1/2MB and SC 1MB; DC 2MB, DC 2/4MB shared, DC 4MB shared, DC 16MB, and DC 3MB/6MB shared (45nm); QC 4MB and QC 8/16MB shared (45nm); 8C 12MB shared (45nm)]
• Drivers are
– Market segments
– More cache
– More cores
• An 80-core processor prototype has been designed!
13
Intel Chipset Example
• http://www.intel.com/products/processor/coreduo/
• http://en.wikipedia.org/wiki/Intel_Core
• http://www.hothardware.com/viewarticle.aspx?articleid=845&cid=1
• http://www.bit-tech.net/hardware/2006/03/10/intel_core_microarchitecture/
• http://www.bit-tech.net/hardware/2006/05/19/intel_core_duo_t2600_on_the_desktop
• http://www.bit-tech.net/hardware/2006/07/14/intel_core_2_duo_processors/
• http://www.hardcoreware.net/reviews/review-347-1.htm
• http://www.trustedreviews.com/cpu-memory/review/2006/08/28/Intel-Core-2-Duo-Merom-Notebooks/p1
• http://www.trustedreviews.com/cpu-memory/review/2006/07/14/Intel-Core-2-Duo-Conroe-E6400-E6600-E6700-X6800/p1
• http://techreport.com/reviews/2006q2/core-duo/index.x?pg=1
• http://arstechnica.com/articles/paedia/cpu/core.ars/1
• http://www.anandtech.com/mobile/showdoc.aspx?i=2663&p=4
• http://www.extremetech.com/article2/0,1697,1988794,00.asp
• http://www.coreduoinfo.com/blog/about-intel-core-duo/
• http://67.91.114.164/intel_c2d_info.htm
• http://www.pcper.com/article.php?aid=272&type=expert
14
AMD MultiCore Processors
Source: AMD
15
AMD Multicore (Dualcore) Opteron
• Two AMD Opteron CPU cores on a single die
– Each has 1MB L2 cache
• 90nm, ~205 million transistors
– Approximately same die size as 130nm single-core AMD Opteron processor
• 95 watt power envelope
– Fits into 90nm power infrastructure
• Introduced with “K8” Revision E core in April 2005
[Figure: die layout showing Core 0, 1-MB L2, Northbridge, 1-MB L2, Core 1]
Source: Chip Architect
16
AMD Opteron Processor Core Architecture
[Figure: core front end, showing the 64KB L1 Icache, Fetch, Branch Prediction, Scan/Align/Decode, and the Fastpath and Microcode Engine paths producing µops]
• AMD64 technology
– Runs 32-bit applications and is 64-bit capable
– Compatible with the x86 software infrastructure
– Enables a single architecture across 32- and 64-bit environments
• Direct Connect Architecture
– NUMA system
– Each processor shares its memory with other processors in the system
– Integrated Memory Controller on-die
– DDR2 DRAM memory controller offers memory BW up to 10.7 GB/s per processor
– HyperTransport
– Point-to-point interconnect can be used to build a mesh of multiple-processor Opteron systems
– Scalable bandwidth interconnect between processors, I/O subsystems, and other chipsets
– 24.0 GB/s peak bandwidth per processor
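The two bandwidth figures above can be reproduced with simple peak-rate arithmetic. The sketch below rests on assumptions that the slides do not state: that 10.7 GB/s corresponds to dual-channel DDR2-667 with 64-bit channels, and that 24.0 GB/s corresponds to three 16-bit HyperTransport links at 2000 MT/s, counted in both directions. The function names are hypothetical.

```python
def ddr_bandwidth_gb_s(transfers_mt_s, channel_bytes, channels):
    """Peak DRAM bandwidth in GB/s: transfers per second * bytes per transfer * channels."""
    return transfers_mt_s * 1e6 * channel_bytes * channels / 1e9

def ht_link_bandwidth_gb_s(transfers_mt_s, width_bits):
    """Peak bandwidth of one HyperTransport link in GB/s, both directions combined."""
    per_direction = transfers_mt_s * 1e6 * width_bits / 8 / 1e9
    return 2 * per_direction

# Assumed configuration: dual-channel DDR2-667, 8 bytes (64 bits) per channel.
memory_bw = ddr_bandwidth_gb_s(667, 8, 2)

# Assumed configuration: three 16-bit links at 2000 MT/s per processor.
link_bw = ht_link_bandwidth_gb_s(2000, 16)
total_ht_bw = 3 * link_bw

print(round(memory_bw, 1), link_bw, total_ht_bw)
```

Under these assumptions a single link comes out at 8 GB/s, and three links account for the per-processor HyperTransport total.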
17
Dual Core AMD Opteron
18
AMD Opteron (SOI)
• 1MB L2 Cache
• Detailed discussion of the 64-bit core architecture at:
– http://chip-architect.com/news/2003_09_21_Detailed_Architecture_of_AMDs_64bit_Core.html
19
Multiprocessor Systems using AMD Opteron
[Figure: block diagram of multiprocessor Opteron systems. Each processor contains CPU cores, an SRQ, a crossbar, a memory controller (Mem.Ctrlr), and HyperTransport (HT) links; 8 GB/s links connect the processors to memory, PCI-E bridges, I/O hubs, USB, and PCI]
Source: XBitlabs
20
Cache coherency
AMD Athlon 64 X2
21
References and Links
• http://techreport.com/reviews/2005q2/opteron-x75/index.x?pg=1
• http://www.tomshardware.com/2005/06/03/dual_core_stress_test/index.html
• http://www.a1-electronics.net/AMD_Section/CPUs/2005/AMD_Athlon64x2_Apr.shtml
• http://en.wikipedia.org/wiki/Opteron
• http://en.wikipedia.org/wiki/Athlon_64_X2
• http://www.amd.com/us-en/Processors/ProductInformation/0,,30_118_8796_14309,00.html
• http://chip-architect.com/news/2003_09_21_Detailed_Architecture_of_AMDs_64bit_Core.html
• http://firingsquad.com/hardware/amd_dual-core_opteron_875/page2.asp
• http://www.xbitlabs.com/articles/cpu/display/opteron-ws_4.html
• http://www.extremetech.com/article2/0,1697,1675784,00.asp
• http://www.elecdesign.com/Articles/Index.cfm?AD=1&ArticleID=11991
• http://www.the3dshop.com/userimages/amd_systems/opteron_dualcore.htm
• http://www.nextcomputing.com/advantages/thruadv.shtml
• http://arstechnica.com/news.ars/post/20060817-7535.html
• http://www.bit-tech.net/hardware/2005/05/09/amd_a64x2_4800/1.html
22
SUN – UltraSPARC T1
23
SUN – UltraSPARC T1 Pipeline
24
SUN UltraSPARC T2
25
UltraSPARC T2 Memory System
26
UltraSPARC T2 Core Pipeline
• http://realworldtech.com/page.cfm?ArticleID=RWT090406012516&p=4
• http://www.opensparc.net/cgi-bin/goto.php?w=/pubs/preszo/06/HotChips06_09_ppt_master.pdf
• http://www.freescale.com/files/netcomm/doc/fact_sheet/MPC8572FS.pdf
27
The Embedded Multicores
28
Freescale MPC8572 PowerQUICC III Processor
• http://www.freescale.com/files/netcomm/doc/fact_sheet/MPC8572FS.pdf
29
Summary
30