Patrick Greene
Solution Architect – HP
HPC on Wall Street
9/19/12
© Copyright 2012 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Experience matters
HP ProLiant
#1 in x86 server market share
16+ years straight – 65 consecutive quarters in
both factory revenue and units
HP’s leadership in the datacenter has been built over years of innovation, experience and market leadership.
Source: IDC Worldwide Quarterly Server Tracker, August 2012. Includes Compaq ProLiant from Q196 through Q202
FSI-HPC Solutions for Capital Markets
• Quality infrastructure for IT cost reduction
Low Latency Systems Require Optimization at every layer in the Solution Stack
Use Cases
• Exchange Matching Engines
• Market Data Distribution
• High Frequency Algorithmic Trading
• Pre/Post Trade Analytics
• Real Time Enterprise Risk Management
Low Latency FSI Solution Stack
• Use Cases / Lines of Business
• Application Environment
• Messaging Middleware
• Precision Timing
• Server I/O Fabric
• High Speed Storage
• Integrated Acceleration
• Firmware and Operating System
• X86-64 Server Architecture
• Mgmt Fab.
Definitions:
Solution - includes messaging middleware; in-house apps; design services
System - integrated server/networking/storage infrastructure
Components - specific servers/OS/switches/file system in the “system”
Optimized Form Factors to meet a variety of needs
• DL rack-mount servers for expandability
• All top bin E5-2600 processors offered with 3DPC
HP Gen8 Servers (Sandy Bridge E5-2600)
UDIMMs (unregistered DIMMs) offer a 1-clock latency advantage when only 1 DIMM per channel (DPC) is populated
UDIMM failure rates are higher, so use these judiciously
DIMM Description 1DPC (DDR3-) 2DPC (DDR3-) 3DPC (DDR3-)
Do this with the new HPRCU, Conrep scripting tool or RBSU Advanced Menu
Conrep now available for Solaris too
See User Guide for ROM-Based Setup Utility (RBSU) for explanation of BIOS settings
Pub #347563-405 June, 2012 at: http://h20000.www2.hp.com/bc/docs/support/SupportManual/c00191707/c00191707.pdf
Figure: observed latency, in cycles and in μsecs.
VMA v6 - TCP – Improved Capability In ConnectX-3
Feature: Connection Steering
• CX-2: MAC+IP per process in addition to Server MAC+IP
• CX-3: No additional MAC+IP. Use Server’s MAC+IP
• Description: ConnectX-3 implements Flow Steering
Feature: Multithread support
• CX-2: QP per process. Multi-threaded applications will share same CQ
• CX-3: QP per thread/socket
• Description: ConnectX-3 Flow Steering enables finer performance tuning and optimizations
Figure: ½ RT latency (msec) versus message size (bytes), for message sizes from 1 to 1024 bytes.
Back-to-back configuration (no Switch), ½ Round Trip; Netperf v2.5.0; MTU size = 1470 Bytes
RHEL 6.1; ConnectX-3 FW 2.10.2220; Driver: OFED-VMA 1.5.3-0008; VMA 6.1.6
Command Line: netperf -n 16 -H <peer ip> -c -C -P 0 -t TCP_RR -l 10 -T 2,2 -- -r <message size>
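As a rough illustration of how a ½ RT figure can be derived from this kind of run: netperf's TCP_RR test reports a transaction rate (request/response pairs per second), and with one transaction in flight at a time the mean round trip is the reciprocal of that rate, so ½ RT is half of it. The Python sketch below assembles the command line above for one message size and performs that conversion. The function names, the placeholder peer address and the sample rate in the usage note are hypothetical and are not taken from the original test.

```python
def netperf_args(peer_ip, message_size):
    """Build the TCP_RR command line quoted above for one message size.

    peer_ip and message_size fill the <peer ip> and <message size> slots.
    """
    return [
        "netperf", "-n", "16", "-H", peer_ip, "-c", "-C", "-P", "0",
        "-t", "TCP_RR", "-l", "10", "-T", "2,2",
        "--", "-r", str(message_size),
    ]


def half_round_trip_usec(transactions_per_sec):
    """Convert a TCP_RR transaction rate into mean half-round-trip time (usec).

    Assumes one outstanding transaction at a time, so that
    round trip = 1 / rate.
    """
    return 1_000_000.0 / (2.0 * transactions_per_sec)


# Message sizes on the chart's x-axis: 1, 2, 4, ..., 1024 bytes.
MESSAGE_SIZES = [2 ** k for k in range(11)]
```

For example, `half_round_trip_usec(100000)` gives 5.0, meaning a hypothetical 100,000 transactions per second corresponds to a 5 μsec half round trip. The argument list from `netperf_args("192.0.2.10", 64)` could be passed to `subprocess.run` on a host where netperf is installed.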
Application Accelerator Options
FSI customers use accelerators for faster feed handlers, order execution engines, and compute-intensive risk &
pricing calculations
Rapid changes underway: FPGA vendors adding 10GbE; 10GbE vendors adding FPGAs; switches adding FPGAs…
Application Programming for Low Latency
Determine how many cores your trading strategy requires
Can it run on 8 cores? If so, match up CPU+NIC per strategy
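One way to read "match up CPU+NIC per strategy" on Linux is to pin each strategy process to the cores of the socket its NIC is attached to. The sketch below is a minimal illustration under that assumption, not a description of HP's tooling: it looks up the NIC's NUMA node through sysfs and restricts the calling process to that node's CPUs with `os.sched_setaffinity`. The function names and the interface name in the usage note are hypothetical.

```python
import os


def parse_cpulist(text):
    """Parse a Linux cpulist string such as "0-7,16-23" into a set of CPU ids."""
    cpus = set()
    for part in text.strip().split(","):
        if not part:
            continue
        if "-" in part:
            first, last = part.split("-")
            cpus.update(range(int(first), int(last) + 1))
        else:
            cpus.add(int(part))
    return cpus


def cpus_local_to_nic(iface):
    """Return the CPUs on the NUMA node that the NIC reports in sysfs."""
    with open(f"/sys/class/net/{iface}/device/numa_node") as f:
        node = int(f.read())
    if node < 0:
        # -1 means the kernel reports no NUMA locality for this device,
        # so fall back to the CPUs the process is already allowed to use.
        return set(os.sched_getaffinity(0))
    with open(f"/sys/devices/system/node/node{node}/cpulist") as f:
        return parse_cpulist(f.read())


def pin_strategy_to_nic(iface):
    """Pin the calling process to the CPUs nearest the given NIC."""
    cpus = cpus_local_to_nic(iface)
    os.sched_setaffinity(0, cpus)
    return cpus
```

A strategy process would call something like `pin_strategy_to_nic("eth2")` at startup. On an 8-core E5-2600 socket this keeps the strategy and its NIC on the same socket, provided the strategy fits within those cores.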
• Quality infrastructure for IT cost reduction
Demonstrating the value of SL6500 servers
Built on ProActive Insight Architecture
SL230s
HPC optimized for maximum performance, efficiency and density
• Purpose-built for HPC performance at scale
• Up to 1 integrated I/O Accelerator
• Maximum speed FDR IB FlexibleLOM
• Multi-node 1/2U density and efficiency
• Enhanced, simple front serviceability
• Rack level power management
• Industry Leading Mgmt with Insight Control*
SL250s
HPC optimized for efficiency and density, with balanced GPU performance
• Purpose-built for HPC performance at scale
• Up to 3 integrated GPUs
• Maximum speed FDR IB FlexibleLOM
• Multi-node 1U density and efficiency
• Enhanced, simple front serviceability
• Rack level power management
• Industry Leading Mgmt with Insight Control*
“GPUDirect RDMA” for Peer-to-Peer I/O
GPU Direct RDMA (previously known as GPU Direct 3.0)
Enables peer to peer communication directly between HCA and GPU
Dramatically reduces overall latency for GPU to GPU communications
by bypassing the host CPU’s memory
Diagram: System Memory, GPU with GDDR5 Memory, and Mellanox HCA on each of two nodes.
Mellanox VPI Availability: GPUDirect RDMA requires CUDA 5.0 and MLNX_OFED driver changes (beta 9/12 with expected GA by 12/12).
HP/Nvidia Gen 8 GPU Starter Kit V2.0 in Americas
– Configuration:
• 1 DL380 control node w/ E5-2670 8 core 2.6GHz 115W CPUs, 64 GB RAM and 2x 600 GB HDD
• 1 SL6500 enclosure
• 4 SL250s 2U server trays w/ E5-2670 8 core 2.6GHz 115W CPUs, 64 GB RAM, 600 GB HDD, 2 Nvidia M2090 GPU modules
• Mellanox IB 4x QDR 36 port managed switch
• HPN ProCurve 2910 24 port 10/100/1000 Ethernet switch
• RHEL
• CMU
• Linux Value Pack
• Rack and infrastructure
• Hardware/Software Integration
HP ‘Redstone’ Server Development Platform
Perfect for development and testing with unparalleled density, flexibility, and simplicity
Breakthrough Savings and Simplicity
Energy, cost and space savings move the industry to new
infrastructure
            Traditional x86    HP ‘Redstone Server’
Cost        $3.3M              $1.2M
Servers     400                1,600
Racks       10                 1/2
Switches    20                 2
Cables      1,600              41
Power       91 kilowatts       9.9 kilowatts
Savings: 89% less energy, 94% less space, 63% less cost, 97% less complexity
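The savings percentages can be roughly cross-checked against the side-by-side figures. The Python sketch below is only a sanity check under stated assumptions: that "less complexity" refers to cable count and "less space" to rack count, neither of which the slide states explicitly. The helper name `percent_less` is hypothetical.

```python
def percent_less(traditional, redstone):
    """Percentage reduction going from the traditional figure to the Redstone one."""
    return (traditional - redstone) / traditional * 100.0


# (traditional x86, Redstone) pairs taken from the comparison above.
comparison = {
    "cost ($M)": (3.3, 1.2),
    "energy (kW)": (91.0, 9.9),
    "space (racks)": (10.0, 0.5),
    "cables": (1600, 41),
}

savings = {
    name: round(percent_less(old, new), 1)
    for name, (old, new) in comparison.items()
}

for name, pct in savings.items():
    print(f"{name}: {pct}% less")
```

This yields 63.6% for cost, 89.1% for energy and 97.4% for cables, in line with the quoted 63%, 89% and 97%. The rack counts give 95.0%, slightly above the quoted 94%, so the slide presumably used a more precise space figure than "1/2 rack".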
Select hyperscale web and data analytics applications show tremendous promise
Based on weighted average performance projections for workloads such as web serving, memcached, and Data Analytics.
Cost estimates include infrastructure, space, and power and cooling costs over three years.
FSI-HPC Solutions for Capital Markets
• Quality infrastructure for IT cost reduction
What is Hadoop?
Your data is going unstructured
The digital universe will expand by almost half in 2012 - 90% of that data is unstructured
Risk Modeling Fraud Detection Sentiment Analysis Customer Retention Web Mining
How does Hadoop fit into existing BI ecosystems?
Click Stream Analysis using Hadoop, Vertica and Autonomy
Unstructured Click Stream Data
• Navigation paths
• Time per page
• Products Browsed
Hadoop Distributions (Cloudera, MapR, Hortonworks) with HP Insight CMU
• Data Assimilation
• Data Consolidation, Aggregation
• Transformation into structured data
Vertica: Ad hoc SQL Compliant Analytics
• Multi-dimensional analysis
• Predictive analysis
• Geographical analysis
Business Users
• User segmentation
• Software testing
• Market research
Consulting Services
HP offers the shortest route to Hadoop success
Open strategy that combines Hadoop with advanced analytics and management
Seamless analytics
• Deploy in days, not months
HP HyperStorage Server
Address the explosion of data permeating the data center
ProLiant SL 4500
Single node: 180TB Storage
2 x 75TB Storage
Shared SL 4500 HyperStorage chassis
• Pooled power: 4 HP common slot power supplies
• Shared cooling: 10 shared fans, N+1, rear-serviceable
• Shared management: reduced cabling with single iLO port
Most dense storage available in market today
• Up to 60 LFF drives in a single chassis giving a total of 180 TB of available storage
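The capacity figures imply a per-drive size that the slide does not state. The short Python sketch below works it out, assuming all drives are the same size and that the "2 x 75TB" configuration uses the same drives as the 180 TB one. Both assumptions are inferences, and the variable names are hypothetical.

```python
# Figures quoted for the single-chassis configuration.
CHASSIS_DRIVES = 60
CHASSIS_CAPACITY_TB = 180

# Implied capacity per LFF drive, assuming uniform drive sizes.
tb_per_drive = CHASSIS_CAPACITY_TB / CHASSIS_DRIVES

# Drives behind each node of the "2 x 75TB" configuration,
# assuming it uses the same drive size.
drives_per_75tb_node = 75 / tb_per_drive

print(f"{tb_per_drive} TB per drive, {drives_per_75tb_node} drives per 75 TB node")
```

Under those assumptions each drive is 3 TB, and a 75 TB node corresponds to 25 drives.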
HP ProLiant SL4500 Solution Efficiency
Three Node vs. Traditional Similar Deployment
SL45xx Overview and Features
Designed for Density
First HP ProLiant server built purely with storage-intensive applications in mind
Various configurations let customers optimize for their unique data center needs
FSI-HPC Solutions for Capital Markets
• Quality infrastructure for IT cost reduction
• ProActive Insight Architecture
• Performance Optimized Datacenters
HP ProLiant Gen8:
The World’s Most Self-Sufficient Servers
• 3X admin productivity improvement
• 6X performance increase for the most demanding workloads
• 70% more compute per watt
• 66% faster time to problem resolution
HP ProActive Insight Architecture
Designed to Simplify, Integrate and Automate your Infrastructure
HP FlexNet Adapters
Insight Online Sea of Sensors 3D
Integrated Lifecycle Automation / Dynamic Workload Acceleration / Automated Energy Optimization / ProActive Service and Support
Gen8 Smart Array Innovations
Increased performance, data availability and storage capacity
Faster access to data
• Up to 2X performance improvement*
• 2X Write Cache (up to 2 GB)
Low.latency@hp.com