
Caffe-MPI: A Parallel Framework on GPU Clusters

Shaohua Wu
Senior Software Engineer
wushh@inspur.com
Caffe-MPI

• What is Caffe-MPI?
– Developed by Inspur
– Open source: https://github.com/Caffe-MPI/Caffe-MPI.github.io
– Programmed with MVAPICH
– Based on the Berkeley Vision and Learning Center (BVLC) single-node version
– A GPU cluster version
– Supports training with 16+ GPUs
Analysis of Caffe
• Training time breakdown:
– ForwardBackward computing: 80% (data parallel)
– Weight computing: 16% (some parts can be parallelized)
– Net update: 4% (some parts can be parallelized)

• Caffe needs a long training time for large data sets on a single node.
Caffe-MPI Architecture
• HPC Technology
– Hardware arch:IB+GPU cluster+Lustre
– Software arch:MPI+Pthread+CUDA
• Data parallel on GPU Cluster
GPU Cluster Configuration

• GPU master node: multiple GPUs
• GPU slave nodes: multiple GPUs
• Storage: Lustre
• Network: 56 Gb/s IB (InfiniBand)
• Software: Linux / CUDA 7.5 / MVAPICH2
MPI Framework Design
• MPI master-slave model (a minimal skeleton is sketched below)
– Master process: multiple Pthread threads + CUDA threads
– Slave process: CUDA threads

Reference: Q. Ho, J. Cipar, H. Cui, J. K. Kim, S. Lee, P. B. Gibbons, G. A. Gibson, G. R. Ganger, E. P. Xing. "More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server."
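
Below is a minimal, hypothetical sketch of this master-slave layout (not the actual Caffe-MPI source), assuming one MPI process per GPU with rank 0 acting as the master; run_master() and run_slave() are stubs standing in for the real work.

// Minimal sketch of the master-slave MPI layout, assuming one MPI process per
// GPU and rank 0 acting as the master. run_master()/run_slave() are stubs.
#include <mpi.h>
#include <cuda_runtime.h>
#include <cstdio>

void run_master(int world_size) {
    // the real master spawns Pthread reader threads and drives weight updates
    std::printf("master: coordinating %d slave processes\n", world_size - 1);
}

void run_slave(int rank) {
    // the real slave runs the Caffe ForwardBackward pass on its GPU
    std::printf("slave %d: ready for ForwardBackward on its GPU\n", rank);
}

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int rank = 0, world_size = 0;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &world_size);

    if (rank == 0) {
        run_master(world_size);                 // master process (rank 0)
    } else {
        int devices = 0;
        cudaGetDeviceCount(&devices);           // bind each slave to a local GPU
        if (devices > 0) cudaSetDevice((rank - 1) % devices);
        run_slave(rank);
    }

    MPI_Finalize();
    return 0;
}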
Design of Master Process
• Master process (rank 0)
– Three functions:
• Reads training data and sends it to the slaves in parallel
• Weight computing and parameter update
• Parameter communication
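
As a rough illustration of those three functions, the following hypothetical sketch shows a single-threaded master loop; the real Caffe-MPI master overlaps the data reading with multiple Pthread threads, and kBatchElems, kWeightElems, load_next_batch() and sgd_update() are assumed placeholders rather than the real API. It would be called from the rank-0 branch of the skeleton above.

#include <mpi.h>
#include <algorithm>
#include <cstddef>
#include <vector>

static const int kBatchElems  = 64 * 3 * 224 * 224;   // assumed batch size
static const int kWeightElems = 1000000;               // assumed model size

// stub: a real reader pulls the next batch from Lustre (in parallel threads)
void load_next_batch(std::vector<float>& batch) {
    std::fill(batch.begin(), batch.end(), 0.f);
}

// stub: plain SGD step with an assumed learning rate
void sgd_update(std::vector<float>& w, const std::vector<float>& g) {
    for (std::size_t i = 0; i < w.size(); ++i) w[i] -= 0.01f * g[i];
}

void run_master(int world_size, int iterations) {
    std::vector<float> batch(kBatchElems);
    std::vector<float> weights(kWeightElems, 0.f), grad(kWeightElems);

    for (int it = 0; it < iterations; ++it) {
        // 1) read training data and send one batch to every slave process
        for (int dst = 1; dst < world_size; ++dst) {
            load_next_batch(batch);
            MPI_Send(batch.data(), kBatchElems, MPI_FLOAT, dst, 0, MPI_COMM_WORLD);
        }
        // 2) weight computing: collect and sum the slaves' gradients, then update
        std::vector<float> sum(kWeightElems, 0.f);
        for (int src = 1; src < world_size; ++src) {
            MPI_Recv(grad.data(), kWeightElems, MPI_FLOAT, src, 1,
                     MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            for (int i = 0; i < kWeightElems; ++i) sum[i] += grad[i];
        }
        sgd_update(weights, sum);
        // 3) parameter communication: broadcast the new net to all slaves
        MPI_Bcast(weights.data(), kWeightElems, MPI_FLOAT, 0, MPI_COMM_WORLD);
    }
}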
Design of Slave Process
• Slave process
– CPU
• Receives training data from the master process
• Sends weight data (GPU-to-GPU)
• Receives new net data (GPU-to-GPU)
– GPU
• ForwardBackward computing
• Slave node
– The number of slave processes = the number of GPUs
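
A matching hypothetical sketch of one slave process is shown below, assuming a CUDA-aware MPI build (e.g. MVAPICH2-GDR) so device pointers can be handed directly to MPI for the GPU-to-GPU transfers; forward_backward() is a stub standing in for the Caffe ForwardBackward pass.

#include <mpi.h>
#include <cuda_runtime.h>
#include <vector>

static const int kBatchElems  = 64 * 3 * 224 * 224;   // assumed batch size
static const int kWeightElems = 1000000;               // assumed model size

// stub: the real code runs the net's ForwardBackward pass and fills d_grad
void forward_backward(const float* d_batch, const float* d_weights, float* d_grad) {
    (void)d_batch; (void)d_weights;
    cudaMemset(d_grad, 0, kWeightElems * sizeof(float));
}

void run_slave(int iterations) {
    std::vector<float> batch(kBatchElems);              // host staging buffer
    float *d_batch, *d_weights, *d_grad;
    cudaMalloc(&d_batch,   kBatchElems  * sizeof(float));
    cudaMalloc(&d_weights, kWeightElems * sizeof(float));
    cudaMalloc(&d_grad,    kWeightElems * sizeof(float));

    for (int it = 0; it < iterations; ++it) {
        // CPU: receive this iteration's training data from the master (rank 0)
        MPI_Recv(batch.data(), kBatchElems, MPI_FLOAT, 0, 0,
                 MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        cudaMemcpy(d_batch, batch.data(), kBatchElems * sizeof(float),
                   cudaMemcpyHostToDevice);

        // GPU: ForwardBackward computing produces the gradients
        forward_backward(d_batch, d_weights, d_grad);

        // send weight (gradient) data to the master directly from GPU memory
        MPI_Send(d_grad, kWeightElems, MPI_FLOAT, 0, 1, MPI_COMM_WORLD);

        // receive the new net into GPU memory (matches the master's MPI_Bcast)
        MPI_Bcast(d_weights, kWeightElems, MPI_FLOAT, 0, MPI_COMM_WORLD);
    }
    cudaFree(d_batch); cudaFree(d_weights); cudaFree(d_grad);
}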
Features of the Computing & Communication

• GPU parallel computing
• Asynchronous overlap of computing and communication
• Communication optimization
– GPU RDMA: weight data and net data are exchanged directly between GPUs
Total time = max(T[read data + send data], T[ForwardBackward computing + weight computing and net update + net send])
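
The formula only reaches its max() form when the data path and the compute path really run concurrently. The hypothetical double-buffering sketch below shows one way to get that overlap on a slave with non-blocking MPI: while the current batch is being computed on, the next one is already being received, so the per-iteration cost tends toward the larger of the two terms rather than their sum (compute_on_gpu() is a stub).

#include <mpi.h>
#include <vector>

static const int kBatchElems = 64 * 3 * 224 * 224;      // assumed batch size

void compute_on_gpu(const std::vector<float>& batch) { (void)batch; }  // stub

void run_overlapped(int iterations) {
    std::vector<float> buf[2] = { std::vector<float>(kBatchElems),
                                  std::vector<float>(kBatchElems) };
    MPI_Request req;

    // prefetch the first batch from the master (rank 0)
    MPI_Irecv(buf[0].data(), kBatchElems, MPI_FLOAT, 0, 0, MPI_COMM_WORLD, &req);

    for (int it = 0; it < iterations; ++it) {
        int cur = it % 2, nxt = (it + 1) % 2;

        MPI_Wait(&req, MPI_STATUS_IGNORE);               // current batch has arrived

        if (it + 1 < iterations) {
            // start receiving the next batch while this one is being computed on
            MPI_Irecv(buf[nxt].data(), kBatchElems, MPI_FLOAT, 0, 0,
                      MPI_COMM_WORLD, &req);
        }
        compute_on_gpu(buf[cur]);                        // overlaps with the MPI_Irecv
    }
}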
The Performance of Caffe-MPI
• Speed-up ratio: 16 GPUs / 1 GPU = 10.45x
• Scaling efficiency: 65%
Tuning 1: Changing the Batch Size
• Speed-up ratio: 16 GPUs / 1 GPU = 10.74x
• Scaling efficiency: 67%
Tuning 2: Caffe-MPI + cuDNN
• 21% performance improvement from cuDNN
• Speed-up ratio: 16 GPUs / 1 GPU = 12.66x
• Scaling efficiency: 79%

[Figure: GoogLeNet training time (iterations = 4,000, batch size = 64): 1 GPU = 1,380 s; 16 GPUs (Caffe-MPI) = 132 s; 16 GPUs (Caffe-MPI + cuDNN) = 109 s]
Tuning 3: Parallelizing Read and Send Data
• Parallelize reading training data from Lustre storage and sending it to the different GPUs (a sketch follows this list)
– The GPU cluster is divided into sub-groups
– Each group has a master node
– Each master node reads and sends data in parallel with multiple processes and threads
• Supports a large-scale GPU computing platform for large training data sets
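
A minimal, hypothetical sketch of this grouped data path is shown below, using MPI_Comm_split so that each sub-group gets its own data-reading master; kGroupSize and read_batch_from_lustre() are assumed placeholders, and the real Caffe-MPI masters additionally read with multiple processes and threads.

#include <mpi.h>
#include <algorithm>
#include <vector>

static const int kGroupSize  = 4;                       // GPUs per sub-group, assumed
static const int kBatchElems = 64 * 3 * 224 * 224;      // assumed batch size

// stub: a real reader would pull the next shard of the training set from Lustre
void read_batch_from_lustre(std::vector<float>& batch) {
    std::fill(batch.begin(), batch.end(), 0.f);
}

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int world_rank = 0;
    MPI_Comm_rank(MPI_COMM_WORLD, &world_rank);

    // split the cluster into sub-groups; rank 0 of each group is its master
    MPI_Comm group_comm;
    MPI_Comm_split(MPI_COMM_WORLD, world_rank / kGroupSize, world_rank, &group_comm);
    int group_rank = 0;
    MPI_Comm_rank(group_comm, &group_rank);

    std::vector<float> send_buf, my_batch(kBatchElems);
    if (group_rank == 0) {
        // each group master reads enough data for its whole group, in parallel
        // with the other group masters
        send_buf.resize(kBatchElems * kGroupSize);
        std::vector<float> batch(kBatchElems);
        for (int i = 0; i < kGroupSize; ++i) {
            read_batch_from_lustre(batch);
            std::copy(batch.begin(), batch.end(), send_buf.begin() + i * kBatchElems);
        }
    }
    // distribute one batch to every member of the group, including the master
    MPI_Scatter(send_buf.data(), kBatchElems, MPI_FLOAT,
                my_batch.data(), kBatchElems, MPI_FLOAT, 0, group_comm);

    MPI_Comm_free(&group_comm);
    MPI_Finalize();
    return 0;
}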
The Performance of Caffe-MPI
• Speed-up ratio: 16 GPUs / 1 GPU = 13x
• Scaling efficiency: 81%
Caffe-MPI Plan
• Plan:
– Support cuDNN 4.0
– MPI tuning
• Symmetric model
Conclusions
• Caffe-MPI
– 13x performance improvement: 16 GPUs vs. 1 GPU
• Supports 16+ GPUs for large data sets
– Improved master-slave model
• Open source
