Académique Documents
Professionnel Documents
Culture Documents
Serge Poublan
Product Marketing Manager
ARM
1
Higher level of integration
Bluetooth MP3
Camera Flash 9
Skype
2
Processors are evolving, e.g. MP
World-class market-proven technology ARMv7 Cortex
x1-4
Cortex-A8
200+ silicon partners ARMv6 ARM11 MPCore
x1-4
15Bu shipped
ARM1156T2(F)-S
ARM1136J(F)-S
ARMv5 Cortex-R4F
ARM1026EJ-S
Cortex-R4
ARM968E-S
ARM926EJ-S
ARM966E-S
ARM946E-S
ARM7EJ-S SC200
3
ARM Mali GPU - Scalable Performance to over 1G Pixel/s
Mali™-400 MP
Console 3D
Gaming
Next
Generation
Navigation HD 3D
Visual complexity
Gaming
Mali™-200
Mobile 2D/3D
Gaming Presentations
Flash Flash 10
Lite
HD Video
Post
Mali™-55 TV HD UI Processing
3D
Navigation
Web Video
Browsing Java Post
Gaming Processing
Screen resolution
4
Higher Mobile Device Resolution
Requirements of next generation Mobile
platform
- Increasing bandwidth requirements simply to
refresh the display
- Ignoring Fill rate, Input Vertex Data
and Texture bandwidth
1080p60
1080p30 1920x1080
1920x1080
WXGA
1280x800
WSVGA
1024x600
Display Refresh Bandwidth MB/s
5
Example SoC Mobile Platform
Media
CPU
L2 Media
CPU Engine
Cache Engine Graphic
Graphic Video DMA LCD
Video DMA LCD
Engine Engine Engine Ctrl
Engine Engine Engine Ctrl
L2CC
L2CC
Bandwidth
64 or 128 requirement
Latency requirement
AMBA
Interconnect
NAND
LPDDR2 UART0 SPI
Flash UART1 WDT Timer0 Timer1 RTC GPIO
6
Example SoC Mobile Platform
Media
CPU
L2 Media
CPU Engine
Cache Engine Graphic
Graphic Video DMA LCD
Video DMA LCD
Engine Engine Engine Ctrl
Engine Engine Engine Ctrl
L2CC
L2CC
64 or 128
AMBA
Interconnect
NAND
LPDDR2
Flash “Digital Highway”
7
ARM Design Flow for Digital Highway
Design Your Intelligent Digital Highway
AMBA Designer
Verification & performance exploration in simulation
AVIP
Improve your software
CoreSight
8
AMBA Ecosystem :
The on-chip infrastructure is critical to system performance
Increased focus on processor memory performance
Different types of processors have different requirements
ARM has grown the AMBA architecture eco-system to help
accelerate SoC design:
70+ Connected Community partners
have AMBA compatible products
10+ AMBA specification downloads a day
10
Design to Maximise Throughput
Effective on-chip Quality of Service depends on the co-
operation of the interconnect and memory controller
11
ARM Level2 Cache Controllers
Media
CPU Media
CPU Engine
Engine Graphic
Graphic Video DMA LCD
Video DMA LCD
L2 Engine Engine Engine Ctrl
L2 Engine Engine Engine Ctrl
Cache
Cache
64 or 128
AMBA
Interconnect
NAND
LPDDR2
Flash “Digital Highway”
12
L2CC Increases Processor Performance
Benchmark : MPEG4 decode
512K L2 +104%
256K L2
System : ARM PrimeXsys Platform for
+102%
ARM1136J-S
128K L2
+74%
CPU : 400MHz ARM1136J-S 16K I & D caches
No L2
Memory : 100MHz 32 bit SDRAM
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 2.2 L2 cache : L210 128K unified L2 cache
MPEG4 Decode on ARM1136EJ-S
Relative performance
128
Results may vary for system
First Time configuration and web content
0 Subsequent
Speed Up Compared to 0K L2
13
L2CC Increases System Performance
Reduced System Power Consumption
External memory access ~10x more energy than on-chip
External memory accesses reduced with L2 cache
Enables use of lower-power and lower-cost memory
sub-system
E.g. 16-bit instead of 32-bit external interface
Or LPDDR instead of DDR2
14
ARM AMBA Interconnect
Media
CortexA8
A8 Media
Cortex Engine
Engine Graphic
Graphic Video DMA LCD
Video DMA LCD
Engine Engine Engine Ctrl
Engine Engine Engine Ctrl
L2CC
L2CC
64 or
128
NIC-301
NAND
LPDDR2
Flash “Digital Highway”
15
AMBA Interconnect (NIC-301)
Low latency communication for ARM CPUs
Supporting:
AXI, AHB & APB
Data widths from 32- to 128-bit
Supporting both synchronous & GALS implementations
Quality of service
16
Optimise your Interconnect Topology
Real-time masters Real-time masters
Cortex A9
Freq F
Fx2.5
Cortex A9
Fx2.5
RAM SMC DMC
17
Topology Optimisation with ARM Interconnect
Cortex Neon
Cortex Neon
Graphic Video DMA LCD
Graphic Video DMA LCD
Engine Engine Engine Ctrl
Engine Engine Engine Ctrl
L2CC
L2CC
64 or
128
NIC-301
400MHz
NIC-301
Low Latency
200MHz
Interconnect
NAND
LPDDR2
Flash
18
ARM Memory Controllers
Cortex Neon
Cortex Neon
Graphic Video DMA LCD
Graphic Video DMA LCD
Engine Engine Engine Ctrl
Engine Engine Engine Ctrl
L2CC
L2CC
64 or
128
Low Latency
Interconnect
Interrupt
SMC-35x Interrupt
DMC-34x
SDRA
DMC-34x SMC-35x Controller
Controller
M Ctrl
NAND
LPDDR2
Flash
19
ARM Memory Controllers
Synthesizable, Configurable soft cores
Wide range of memory types, silicon processes & target
applications
AXI Dynamic Memory Controllers for SDR, DDR, LPDDR,
DDR2 and LPDDR2 (DMC-34x)
Over 20 licensees to date
AXI Static Memory Controllers for NOR Flash, NAND
Flash and SRAM (SMC-35x)
Over 40 licensees to date
AHB Memory Controllers for Dynamic and Static Memories
(PL24x)
Over 60 licensees to date
20
ARM Design Flow for Digital Highway
Design Your Intelligent Digital Highway
AMBA Designer
Verification & performance exploration in simulation
AVIP
Improve your software
CoreSight
21
What is AMBA Designer?
Topolology
Configure
Cross-configure
22
What is AMBA Designer?
Interface checking on:
Configure
•…
Cross-configure
23
ARM Design Flow for Digital Highway
Design Your Intelligent Digital Highway
AMBA Designer
Verification & performance exploration in simulation
AVIP
Improve your software
CoreSight
24
AVIP Features for RTL Simulation
Functional IEEE 1800 SystemVerilog Testbench
Verification
For Verification Directed
Prof.
Vectors
Data AXI Master
AXI Slave
Interface
AXI Master
Interface AXI Slave
Engineers, AVIP is a
set of System Verilog User VIP User
UUT
Prof.
Data
(Block or Sub-system) AXI Monitor
modules that enable
faster and higher AXI Slave
Interface
AXI Master
Interface
User IP
AXI Master
quality verification of
AXI based IP.
Performance
Exploration
For SoC architects, HW
and Verification
Engineers. AXI based
SoC performance can
be explored and verified.
25
AVIP Features for RTL Simulation
Protocol IEEE 1800 SystemVerilog Testbench
Checkers
AXI Slave AXI Master
assertion libraries
provided for AXI
User VIP User
UUT
(Block or Sub-system) AXI Monitor
AXI Protocol
Coverage
Channel level,
transaction level and
sequence level pre-
defined coverage points
for AXI protocol
coverage.
26
AMBA Designer + AVIP: RTL Design Flow
To optimise interconnect and memory Configuration
Configuration
architecture ARM recommends the Integration
Integration
following flow:
Simulation
Configuration Simulation
27
Fabric Design Tools: What is AVIP?
IEEE 1800 SystemVerilog Testbench
28
Fabric Design Tools: What is AVIP?
It enables System Exploration at RTL level
TTT = Time to tweak = 20s
TTS = Time to simulate = 5 mins
29
System Exploration Methods
SoC, static
Spreadsheet
Analysis
Acceleration/Emulation
VIP, Logic Tiles, SW
Real-time Behavior
Silicon/Applications
30
Iteration time vs Realism
LOW Mathematical LOW
Spreadsheet formula, not
mins/hrs
Static analysis dynamic
Statistical or
Realistic behaviour
mins/hrs AVIP recorded traffic
Internal bus simulation
profiles
Cycle time
Adding S/W,
SoC + s/w
days/wks external I/F with
Emulation/proto
realistic scenarios
Observe
Silicon + Appl actual
mths/yrs
CoreSight™ behaviour
HIGH HIGH
AVIP: the iteration time of a spreadsheet with the accuracy approaching RTL simulation
31
ARM Design Flow for Digital Highway
Design Your Intelligent Digital Highway
AMBA Designer
Verification & performance exploration in simulation
AVIP
Improve your software
CoreSight
32
Improve the Performance of Your SoC
Analyzing real silicon performance enables you to
confidently improve the next design
If you want to find out how a car really performs, drive it
CoreSight Design Kit & Performance Profiling
Provide accurate, real-time ‘telemetry’ from your system
Essential tools for delivering system performance improvements
Your SoC may be optimized, but is the software?
ARM Profiler analyzes system performance, enabling optimization via
Profile Driven Compilation
33
CoreSight Debug & Trace
The Debug & Trace Architecture for the Digital World
Open Standard available on www.arm.com
Optimise software productivity
on your multi-core SoC
SW Debug
SW Performance
Optimisation
SoC Performance
optimisation
Visibility and trace of the
whole SoC
ARM trace and performance sources (ETM, PTM, Interconnect)
Leverage CoreSight architecture for YOUR IP
34
ARM Digital Highway
ARM Digital Highway technology
delivers to YOU
35