Vous êtes sur la page 1sur 10

Hercules:

A System for Massive Parallel


End-to-End Finite Element
Ground Motion Simulations
David R. O’Hallaron
Associate Professor of CS and ECE
Carnegie Mellon Quake Group
www.cs.cmu.edu/~quake

Joint work with (alphabetical order): Jacobo Bielak (CMU CEE),


Leonardo Ramirez-Guzman (CMU CEE), Julio Lopez (CMU ECE),
Kwan-Liu Ma (UC-Davis Visualization), Ricardo Taborda-Rios, Tiankai Tu
(CMU CS), Hongfeng Yu (UC-Davis Visualization)

Funded by SCEC, NSF


Hercules
- An end-to-end approach for parallel simulation
- Why end-to-end?
- Eliminate intermediate files.
- Eliminate “heroic efforts” to run simulations
- Fast turnaround on simulation/analysis cycle.
- Latest result of long term (since 1993)
collaboration among Carnegie Mellon computer
scientists and domain scientists at

Tiankai Tu, Hongfeng Yu, Leonardo Ramirez-Guzman, Jacobo Bielak, Omar


Ghattas, Kwan-Liu Ma, David R. O’Hallaron, From Mesh Generation to
Scientific Visualization: An End-to-End Approach to Parallel Supercomputing,
Proceedings of SC06, Tampa, FL, November, 2006.

David O’Hallaron, SCEC/CME All-Hands Meeting, July 2006


Finite Element Powerful
workstation
Ground Motion Modeling
~1-10 TB
4D vel/disp
Powerful Parallel
workstation supercomputer wavefield

Mesh
Solving Analysis/Viz
generation
10’s MB
Velocity Unstructured
model mesh ~10-100s GB

10’s MB
David O’Hallaron, SCEC/CME All-Hands Meeting, July 2006
Key Hercules Idea #1 4D vel/disp
wavefield

Mesh
Solving Analysis/Viz
generation

Velocity Unstructured
Output model
model mesh

Key Idea #1: Use octrees for the input


datasets.

David O’Hallaron, SCEC/CME All-Hands Meeting, July 2006


Why Octrees?
Octrees can be used to represent velocity fields and
unstructured FEM meshes

Octrees have an efficient etree database


representation (Tu et al)
– Draws on decades of spatial database research
– Fast approximate query times allow for fast mesh generation

Octree meshes a nice tradeoff between structured


meshes and unstructured tetrahedral meshes
+ Wavelength adaptive
+ Fast parallel mesh generation (e.g. 8 -> 10 pts)
+ No need to store mesh topology
- Unable to resolve complex geometries
David O’Hallaron, SCEC/CME All-Hands Meeting, July 2006
Key Hercules Idea #2 4D velocity
wavefield

Mesh
Solving Analysis/Viz
generation
etree library etree library etree library

Velocity Unstructured Output model


model mesh

Key Idea #2: Each octree is an etree database


– Portable across different machine types
– Fast sampling of velocity model
– Code and documentation available at
http://www.cs.cmu.edu/~euclid

David O’Hallaron, SCEC/CME All-Hands Meeting, July 2006


Key Hercules Idea #3 4D velocity
Parallel Supercomputer System wavefield

Meshgen/
Solving Analysis/Viz
partition
etree lib herc lib herc lib herc lib

Partitioned in-situ distributed mesh data structure

Velocity Output model


model

Key Idea #3: Run entire finite element simulation (mesh


generation, solving, and visualizing), in place and in
parallel (end-to-end simulation).

David O’Hallaron, SCEC/CME All-Hands Meeting, July 2006


Performance (Tu et al, SC06)
Scenario: LA Basin (100km x 100km x 37.5 km), SCEC CVM Version
2.0; minimum shear wave velocity 100m/s, Lemieux at PSC
Processors 1 16 52 184 748 2000
Frequency (Hz) 0.23 0.5 0.75 1 1.5 2
Elements 6.61E+5 9.92E+6 3.13E+7 1.14E+8 4.62E+8 1.22E+9 1
Nodes 8.11E+5 1.13E+7 3.57E+7 1.34E+8 5.34E+8 1.37E+9
Anchored 6.48E+5 9.87E+6 3.12E+7 1.14E+8 4.61E+8 1.22E+9
Hanging 1.63E+5 1.44E+6 4.57+6 2.03E+7 7.32E+7 1.48+8
Max leaf level 11 13 13 14 14 15
Min leaf level 6 7 8 8 9 9
Elements/PE 6.61E+5 6.20E+5 6.02E+5 6.20E+5 6.18E+5 6.12E+5
Time steps 2000 4000 10000 8000 2500 2500
E2E time (s) 12911 19804 38165 48668 13033 16709
Replication(s) 22 71 85 94 187 251
Meshing(s) 20 75 128 150 303 333 1
Solver(s) 8381 16060 31781 42892 11960 16097
Vis(s) 4488 3596 6169 5528 558 * 3

E2E t/ts/e/PE(μs) 9.77 7.98 7.93 7.86 8.44 10.92


Sol t/ts/e/PE (μs) 6.34 6.48 6.60 6.92 7.74 10.52
2
Mflops/sec/PE 569 638 653 655 * *
David O’Hallaron, SCEC/CME All-Hands Meeting, July 2006
Future CS Directions
1. Standard Velocity Model
Representations
Proposal: Adopt etree database as the standard
queryable representation for CVMs and other fields
– Queryable dataset required for any unstructured FE code
– Models currently based on etree databases:
» USGS Bay Area Velocity Model
» Southern California Velocity Model
– Other candidates: Shaw model, Parkfield model

Proposal: Develop tool for automatic generation of


sampled etree fields
– Quantize and aggregate given some global error constraint
– Standalone C program
– Arbitrary field function dynamically linked from shared library (dll)

David O’Hallaron, SCEC/CME All-Hands Meeting, July 2006


Future CS Directions
2. Wavefield Database Systems
At present, we only look at a small fraction of the
volume data produced by our models

Proposal: Develop compressed and queryable


database representations of synthetic 4D wavefields

WaveDB: Wavefield Database System (Julio Lopez,


CMU)
– Lossless compression through
» Frequency domain filtering
» Delta encoding of waveforms
» “micro solver” queries based on Greens functions
– Fast interpolated access methods using mesh as index
» “Give me the surface values on a regular grid at 100 meter
spacing for all time steps”
David O’Hallaron, SCEC/CME All-Hands Meeting, July 2006

Vous aimerez peut-être aussi