SlideShare una empresa de Scribd logo
1 de 45
Descargar para leer sin conexión
Steve Oberlin, CTO Accelerated Computing
HPC + AI:
MACHINE LEARNING MODELS IN SCIENTIFIC COMPUTING
2
GRAND CHALLENGES REQUIRE MASSIVE COMPUTING
REINVENTING THE LI-ION BATTERY
3M Node Hours | 7 Days on Titan
UNDERSTANDING HIV’S STRUCTURE
10M node Hours |16 Days on BlueWaters
CLOUD RESOLVING CLIMATE
SIMULATIONS
100M Node Hours | 840 Days on Piz Daint
3
BAD TIMING
THE SLOW DEATH OF MOORE’S LAW
5
TOP500 EFFECTS
All
#1
#500
1
TFLOPS
100
GFLOPS
10
TFLOPS
100
TFLOPS
1
PFLOPS
10
PFLOPS
100
PFLOPS
6
SOMETHING NEW:
AI + HPC = REVOLUTION
7
INGREDIENTS: BIG DATA
Cloud Apps
(We are the sensors for our cloud service providers)
8
INGREDIENTS: BIG DATA
9
INGREDIENTS: AI ALGORITHMS
10
NOW, JUST ADD HPC AND STIR…
4
60
110
0
20
40
60
80
100
120
2010 2011 2012 2013 2014
GPU Entries
Classification Error Rates
28%
26%
16%
12%
7%
0%
5%
10%
15%
20%
25%
30%
2010 2011 2012 2013 2014
Team Date Top-5 Test Error
GoogLeNet 2014 6.66%
Baidu Deep Image 01/12/2015 5.98%
Baidu Deep Image 02/05/2015 5.33%
Microsoft 02/05/2015 4.94%
Google 03/02/2015 4.82%
Baidu Deep Image 03/17/2015 4.83%
Classification Task:
1.2M images • 1000 object categoriesEnter Deep Learning
Trained Human Performance: 5.1%
11
ALGORITHMS + BIG DATA + GPUS =
THE BIG BANG OF MODERN AI
Auto
Encoders
GANLSTM
IDSIA
CNN on GPU
Stanford &
NVIDIA
Large-scale
DNN on GPU
U Toronto
AlexNet
on GPU
CaptioningNVIDIA BB8 Style TransferBRETTImageNet
Google Photo
Arterys
FDA Approved AlphaGo
Super
Resolution Deep Voice
Baidu
DuLight
NMT
Superhuman
ASR
Reinforcement
Learning
Transfer
Learning
recognition/classification -> recursion/time series -> generative
12
BEYOND RECOGNITION
DNNs Go Generative
Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, and Bryan Catanzaro. "High-
Resolution Image Synthesis and Semantic Manipulation with Conditional GANs", in CVPR, 2018.
13
BEYOND RECOGNITION
DNNs Go Generative
14
BEYOND RECOGNITION
DNNs Go Generative
“WaveNet: A Generative Model for Raw Audio”, https://arxiv.org/pdf/1609.03499.pdf, {avdnoord, sedielem,
heigazen, simonyan, vinyals, gravesa, nalk, andrewsenior, korayk}@google.com, Google DeepMind, London, UK
15
WHAT DOES THIS HAVE TO DO WITH
SCIENCE?
(HPC + AI = ?)
16
AI on a super-Moore’s Law progression
0
10
20
30
40
50
60
K40
(2014)
K80
(2015)
P100
(2016)
V100
(2017)
AMBER Performance (ns/ day)
AMBER 14
CUDA 4
AMBER 14
CUDA 6
AMBER 16
CUDA 8
AMBER 16
CUDA 9
0
2400
4800
7200
9600
12000
8X K80
(2014)
8X MAXWELL
(2015)
DGX-1
(2016)
DGX-1V
(2017)
GoogleNet Performance (i/s)
cuDNN 2
CUDA 6
cuDNN 4
CUDA 7
cuDNN 6
CUDA 8
NCCL 1.6
cuDNN 7
CUDA 9
NCCL 2
Amber dataset: Cellulose NVE; GoogLeNet dataset: Imagenet
4x in 3 years 12x in 3 years
(65x in 5 years)
AI: A NEW COMPUTING PARADIGM
17
2018: 10X AI GAIN IN ONE YEAR
DGX-1, SEP’17 DGX-2, Q3‘18
PyTorch Stack: Time to Train FAIRSEQ
software improvements across the stack including NCCL, cuDNN, etc.
0
5
10
15
DGX-1V DGX-2
15 days
1.5 days
18NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.
Mix freely with conventional
software and algorithms
SOFTWARE, BY EXAMPLE
Deep Learning builds functions from examples of desired behavior
Functions are the building
blocks of software. DL can
approximate any function.
Some functions are too complex to code by hand.
Generate complex functions by example.
Hurricane
Not a hurricane
HURRICANE
DETECTOR
Neural
network
!" = $(obs)
Optimizer
19
THE POWER OF LEARNING FROM DATA
Predicting Chaos
“Model-Free Prediction of Large Spatiotemporally Chaotic Systems from Data: A
Reservoir Computing Approach”, Jaideep Pathak, Brian Hunt, Michelle Girvan,
Zhixin Lu, and Edward Ott
Phys. Rev. Lett. 120, 024102 – 12 January 2018
20NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE.
BIG DATA IN SCIENCE
Big Science ingests/outputs Big Data
Large Hadron Collider Square Kilometer Array
Johns Hopkins
Turbulence Database
21
RECOGNITION/CLASSIFICATION
Heterogeneous event selection at CMS experiment
DNN to reconstruct higher rate of events at lower power
Heterogeneous Event Selection at the CMS experiment
http://drive.google.com/file/d/0B596cb8D9K9kZjJzdzBRdGY0NFk/preview
22
RECOGNITION/CLASSIFICATION -> CONTROL
DL for plasma fusion stability
DL enabling better accuracy –- ~95% TP vs. old ~80% -- promising control of live ITER Tokomac
Accelerated Deep Learning Discovery in Fusion Energy Science
http://on-demand-gtc.gputechconf.com/gtc-quicklink/7zGB7j
23
DL for adaptive optics
DL enabling clearer views from the world’s largest ground-based telescopes
RECOGNITION/CLASSIFICATION -> CONTROL
Helping the Discovery of New Galaxies on the World's Largest Telescopes Using a Large GPU Cluster
http://on-demand-gtc.gputechconf.com/gtc-quicklink/ewiELDW
24
RECOGNITION/CLASSIFICATION -> FILTER
De-noising gravitational waves
DL enabling 5000x faster filtering for real-time multi-messenger astronomy
25
2015: USING NUMERIC SIMULATIONS TO TRAIN AI
“Data-driven Fluid Simulations using Regression Forests” http://people.inf.ethz.ch/ladickyl/fluid_sigasia15.pdf
26
2015: USING NUMERIC SIMULATIONS TO TRAIN AI
“Data-driven Fluid Simulations using Regression Forests” http://people.inf.ethz.ch/ladickyl/fluid_sigasia15.pdf
27
TRAINING A DEEP LEARNING HPC MODEL
ERRORS
REGRESSION TESTING
(FP16/INT8)
INFERENCE
(FP16/INT8)
TRAINING
(FP32/FP16)
SIMULATION
(FP64/FP32)
DATA
REGRESSION SET
NEW DATA
TRAINING SET
28
IS A ML MODEL USEFUL FOR SCIENCE?
29
Background
Developing a new drug costs $2.5B and takes 10-15 years. Quantum chemistry
(QC) simulations are important to accurately screen millions of potential drugs to
a few most promising drug candidates.
Challenge
QC simulation is computationally expensive so researchers use approximations,
compromising on accuracy. To screen 10M drug candidates, it takes 5 years to
compute on CPUs.
Solution
Researchers at the University of Florida and the University of North Carolina
leveraged GPU deep learning to develop ANAKIN-ME, to reproduce molecular
energy surfaces with super speed (microseconds versus several minutes),
extremely high (DFT) accuracy, and at 5-6 orders of magnitude lower cost.
Impact
Faster, more accurate screening at far lower cost
DEEP LEARNING FOR
QUANTUM CHEMISTRY
30
NEURAL NETWORK MODEL APPROACH
Training set: ~20M DFT data points.
Molecules with 1 to 8 atoms from GDB database
31
CORRELATION VALIDATION
32
CORRELATION VALIDATION
33
SATELLITE TO MODEL TRANSLATION
Automatically generate inverse map from radiances to model variables
SATELLITE RADIANCES WEATHER MODEL VARIABLES
No analytic formula available for such a conversion
Data assimilation: forward operator + adjoint-sensitivity analysis
Deep learning can potentially obtain inverse operator numerically
34
MODEL TRANSLATION
BY CONDITIONAL GAN
Adversarial model outputs a
physically plausible state
Both forward and inverse maps
For data assimilation and forecast
verification
Physically plausible state
from incomplete data
OBSERVATION GOES-15 band 3
MODEL VAR GFS Precipitable water
Training 2014-2016
Test 2013
INPUT: GOES-15 GENERATED TARGET: GFS
INPUT: GFS GENERATED TARGET: GOES-15
35
DEEP LEARNING FOR MODEL CREATION: MIIDAPS-AI
Multi-Instrument Inversion and Data Assimilation Preprocessing System
Sid Boukabara NOAA/NESDIS Eric Maddy, Adam Neiss Riverside Technology Inc
MIIDAPS-AI TPW
Inverse operator for multiple IR and microwave satellites.
Iteratively uses CRTM radiative transfer model
5 seconds vs 2 hrs to process one day
1400x speedup.
36
SLOW MOTION
SATELLITE LOOP
David Hall NVIDIA
INPUT GOES-15 band 3, GFS winds
OUTPUT Interpolated GOES-15
INPUT FREQ 1 every 3 hours
OUTPUT FREQ 1 every 18 minutes
Applications:
• Visualization
• Data Augmentation
• Replace dropped frames
• Reduce storage requirements
11 input images
110 output frames
37
Resolving physics at sub-grid dimensions
DL enabling faster, more accurate climate modeling and predictions
DEEP LEARNING FOR CLIMATE MODELING
38
Automating Extreme Weather Detection in Climate Model Output
2018 Gordon Bell Award winner – 1.13 EFLOPS (training at mixed precision)
DEEP LEARNING FOR CLIMATE ANALYTICS
“Exascale Deep Learning For Climate Analytics”, Jaideep Pathak, Brian Hunt, Michelle Thorsten Kurth, Sean Treichler, Joshua Romero, Mayur
Mudigonda, Nathan Luehr, Everett Phillips, Ankur Mahesh, Michael Matheson, Jack Deslippe, Massimiliano Fatica, Prabhat, Mike Houston
SC 2018
39
Physics Informed Neural Networks
39
Mass conservation:
Momentum conservation:
Transport:
RESPECTING PHYSICS
Deep Learning for CFD
40
DISCOVERING HIDDEN PHYSICS
Learned vs. Ground Truth
Training set: CFD Simulation of an External
Flow over a Cylinder with OpenFOAM
41
BLOOD FLOW IN AN ARTERY
42
THE PROMISE OF HPC+AI
Better stability/accuracy -> Higher-quality simulations
Resists Moore’s Law fade -> Continuing future simulation progress
Lower-precision data requirements -> Larger/finer grids, better science
Trained models run at peak speed -> Perfectly accelerate existing applications
Inference perf function of DNN arch -> Better science at low incremental cost
Reduced code optimization requirement -> Scientists can code simply, naturally
Reduced maintenance -> More science/dollar
Orders-of-magnitude speed-up/lower energy -> Bigger/longer/cheaper simulations
43
3 ORDERS OF MAGNITUDE
.6 MPH 6 MPH 60 MPH 600 MPH
1000x
44
WHAT IF…?
(Warning: CTO math)
REINVENTING THE LI-ION BATTERY
3M Node Hours | 7 Days on Titan
UNDERSTANDING HIV’S STRUCTURE
10M node Hours |16 Days on BlueWaters
CLOUD RESOLVING CLIMATE
SIMULATIONS
100M Node Hours | 840 Days on Piz Daint
10x: 17 hours
100x: 100 minutes
1000x: 86 seconds
10x: 84 days
100x: 8.4 days
1000x: 20 hours
10x: 1.6 days
100x: 4 hours
1000x: 23 minutes
45
THIS IS ONLY THE BEGINNING…
46
THANK YOU

Más contenido relacionado

La actualidad más candente

Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning TutorialAmr Rashed
 
初探人工智慧
初探人工智慧初探人工智慧
初探人工智慧Paul Yang
 
Basics of Deep learning
Basics of Deep learningBasics of Deep learning
Basics of Deep learningRamesh Kumar
 
Big data and AI presentation slides
Big data and AI presentation slidesBig data and AI presentation slides
Big data and AI presentation slidesCloudxLab
 
What if Things Start to Think - Artificial Intelligence in IoT
What if Things Start to Think - Artificial Intelligence in IoTWhat if Things Start to Think - Artificial Intelligence in IoT
What if Things Start to Think - Artificial Intelligence in IoTMuralidhar Somisetty
 
3D OPTICAL DATA STORAGE
3D OPTICAL DATA STORAGE3D OPTICAL DATA STORAGE
3D OPTICAL DATA STORAGERishikese MR
 
What is Deep Learning?
What is Deep Learning?What is Deep Learning?
What is Deep Learning?NVIDIA
 
Artificial intelligency & robotics
Artificial intelligency & roboticsArtificial intelligency & robotics
Artificial intelligency & roboticsSneh Raval
 
Internet of things
Internet of thingsInternet of things
Internet of thingsNaiyer Khan
 
artificial Intelligence
artificial Intelligence artificial Intelligence
artificial Intelligence Ramya SK
 
Artificial Intelligence Machine Learning Deep Learning Ppt Powerpoint Present...
Artificial Intelligence Machine Learning Deep Learning Ppt Powerpoint Present...Artificial Intelligence Machine Learning Deep Learning Ppt Powerpoint Present...
Artificial Intelligence Machine Learning Deep Learning Ppt Powerpoint Present...SlideTeam
 
Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial IntelligenceLuca Bianchi
 
ARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCEARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCEMidhuti
 

La actualidad más candente (20)

Deep Learning Tutorial
Deep Learning TutorialDeep Learning Tutorial
Deep Learning Tutorial
 
初探人工智慧
初探人工智慧初探人工智慧
初探人工智慧
 
Basics of Deep learning
Basics of Deep learningBasics of Deep learning
Basics of Deep learning
 
認識 AI
認識 AI認識 AI
認識 AI
 
Big data and AI presentation slides
Big data and AI presentation slidesBig data and AI presentation slides
Big data and AI presentation slides
 
What if Things Start to Think - Artificial Intelligence in IoT
What if Things Start to Think - Artificial Intelligence in IoTWhat if Things Start to Think - Artificial Intelligence in IoT
What if Things Start to Think - Artificial Intelligence in IoT
 
3D OPTICAL DATA STORAGE
3D OPTICAL DATA STORAGE3D OPTICAL DATA STORAGE
3D OPTICAL DATA STORAGE
 
What is Deep Learning?
What is Deep Learning?What is Deep Learning?
What is Deep Learning?
 
AI
AIAI
AI
 
Artificial intelligency & robotics
Artificial intelligency & roboticsArtificial intelligency & robotics
Artificial intelligency & robotics
 
chameleon chip
chameleon chipchameleon chip
chameleon chip
 
Artifical intelligence (a.i)
Artifical intelligence (a.i)Artifical intelligence (a.i)
Artifical intelligence (a.i)
 
Internet of things
Internet of thingsInternet of things
Internet of things
 
1.Introduction to deep learning
1.Introduction to deep learning1.Introduction to deep learning
1.Introduction to deep learning
 
ARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCEARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCE
 
artificial Intelligence
artificial Intelligence artificial Intelligence
artificial Intelligence
 
Artificial Intelligence Machine Learning Deep Learning Ppt Powerpoint Present...
Artificial Intelligence Machine Learning Deep Learning Ppt Powerpoint Present...Artificial Intelligence Machine Learning Deep Learning Ppt Powerpoint Present...
Artificial Intelligence Machine Learning Deep Learning Ppt Powerpoint Present...
 
Introduction to Artificial Intelligence
Introduction to Artificial IntelligenceIntroduction to Artificial Intelligence
Introduction to Artificial Intelligence
 
ARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCEARTIFICIAL INTELLIGENCE
ARTIFICIAL INTELLIGENCE
 
Artificial intelligence
Artificial intelligenceArtificial intelligence
Artificial intelligence
 

Similar a HPC + Ai: Machine Learning Models in Scientific Computing

Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...Ian Foster
 
TeraGrid Communication and Computation
TeraGrid Communication and ComputationTeraGrid Communication and Computation
TeraGrid Communication and ComputationTal Lavian Ph.D.
 
OpenACC Monthly Highlights: May 2020
OpenACC Monthly Highlights: May 2020OpenACC Monthly Highlights: May 2020
OpenACC Monthly Highlights: May 2020OpenACC
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Larry Smarr
 
How HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceHow HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceinside-BigData.com
 
OpenPOWER Academia and Research team's webinar - Presentations from Oak Ridg...
OpenPOWER Academia and Research team's webinar  - Presentations from Oak Ridg...OpenPOWER Academia and Research team's webinar  - Presentations from Oak Ridg...
OpenPOWER Academia and Research team's webinar - Presentations from Oak Ridg...Ganesan Narayanasamy
 
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
AI optimizing HPC simulations (presentation from  6th EULAG Workshop)AI optimizing HPC simulations (presentation from  6th EULAG Workshop)
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)byteLAKE
 
OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC
 
Early Application experiences on Summit
Early Application experiences on Summit Early Application experiences on Summit
Early Application experiences on Summit Ganesan Narayanasamy
 
How to Scale from Workstation through Cloud to HPC in Cryo-EM Processing
How to Scale from Workstation through Cloud to HPC in Cryo-EM ProcessingHow to Scale from Workstation through Cloud to HPC in Cryo-EM Processing
How to Scale from Workstation through Cloud to HPC in Cryo-EM Processinginside-BigData.com
 
FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...
FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...
FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...Christian Plessl
 
OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019OpenACC
 
The Influence of the Java Collection Framework on Overall Energy Consumption
The Influence of the Java Collection Framework on Overall Energy ConsumptionThe Influence of the Java Collection Framework on Overall Energy Consumption
The Influence of the Java Collection Framework on Overall Energy ConsumptionGreenLabAtDI
 
Selective and incremental re-computation in reaction to changes: an exercise ...
Selective and incremental re-computation in reaction to changes: an exercise ...Selective and incremental re-computation in reaction to changes: an exercise ...
Selective and incremental re-computation in reaction to changes: an exercise ...Paolo Missier
 
230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptx230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptxArthur240715
 
Many Task Applications for Grids and Supercomputers
Many Task Applications for Grids and SupercomputersMany Task Applications for Grids and Supercomputers
Many Task Applications for Grids and SupercomputersIan Foster
 
05 Preparing for Extreme Geterogeneity in HPC
05 Preparing for Extreme Geterogeneity in HPC05 Preparing for Extreme Geterogeneity in HPC
05 Preparing for Extreme Geterogeneity in HPCRCCSRENKEI
 

Similar a HPC + Ai: Machine Learning Models in Scientific Computing (20)

ECP Application Development
ECP Application DevelopmentECP Application Development
ECP Application Development
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
 
TeraGrid Communication and Computation
TeraGrid Communication and ComputationTeraGrid Communication and Computation
TeraGrid Communication and Computation
 
OpenACC Monthly Highlights: May 2020
OpenACC Monthly Highlights: May 2020OpenACC Monthly Highlights: May 2020
OpenACC Monthly Highlights: May 2020
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​
 
How HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental scienceHow HPC and large-scale data analytics are transforming experimental science
How HPC and large-scale data analytics are transforming experimental science
 
OpenPOWER Academia and Research team's webinar - Presentations from Oak Ridg...
OpenPOWER Academia and Research team's webinar  - Presentations from Oak Ridg...OpenPOWER Academia and Research team's webinar  - Presentations from Oak Ridg...
OpenPOWER Academia and Research team's webinar - Presentations from Oak Ridg...
 
Deep Learning for Fast Simulation
Deep Learning for Fast SimulationDeep Learning for Fast Simulation
Deep Learning for Fast Simulation
 
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
AI optimizing HPC simulations (presentation from  6th EULAG Workshop)AI optimizing HPC simulations (presentation from  6th EULAG Workshop)
AI optimizing HPC simulations (presentation from 6th EULAG Workshop)
 
OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020OpenACC Monthly Highlights: October2020
OpenACC Monthly Highlights: October2020
 
Early Application experiences on Summit
Early Application experiences on Summit Early Application experiences on Summit
Early Application experiences on Summit
 
How to Scale from Workstation through Cloud to HPC in Cryo-EM Processing
How to Scale from Workstation through Cloud to HPC in Cryo-EM ProcessingHow to Scale from Workstation through Cloud to HPC in Cryo-EM Processing
How to Scale from Workstation through Cloud to HPC in Cryo-EM Processing
 
FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...
FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...
FPGA-accelerated High-Performance Computing – Close to Breakthrough or Pipedr...
 
OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019OpenACC Monthly Highlights: May 2019
OpenACC Monthly Highlights: May 2019
 
cug2011-praveen
cug2011-praveencug2011-praveen
cug2011-praveen
 
The Influence of the Java Collection Framework on Overall Energy Consumption
The Influence of the Java Collection Framework on Overall Energy ConsumptionThe Influence of the Java Collection Framework on Overall Energy Consumption
The Influence of the Java Collection Framework on Overall Energy Consumption
 
Selective and incremental re-computation in reaction to changes: an exercise ...
Selective and incremental re-computation in reaction to changes: an exercise ...Selective and incremental re-computation in reaction to changes: an exercise ...
Selective and incremental re-computation in reaction to changes: an exercise ...
 
230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptx230208 MLOps Getting from Good to Great.pptx
230208 MLOps Getting from Good to Great.pptx
 
Many Task Applications for Grids and Supercomputers
Many Task Applications for Grids and SupercomputersMany Task Applications for Grids and Supercomputers
Many Task Applications for Grids and Supercomputers
 
05 Preparing for Extreme Geterogeneity in HPC
05 Preparing for Extreme Geterogeneity in HPC05 Preparing for Extreme Geterogeneity in HPC
05 Preparing for Extreme Geterogeneity in HPC
 

Más de inside-BigData.com

Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networksinside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networksinside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoringinside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecastsinside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Updateinside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuninginside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODinside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Accelerationinside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficientlyinside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Erainside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computinginside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Clusterinside-BigData.com
 

Más de inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Último

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Último (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

HPC + Ai: Machine Learning Models in Scientific Computing

  • 1. Steve Oberlin, CTO Accelerated Computing HPC + AI: MACHINE LEARNING MODELS IN SCIENTIFIC COMPUTING
  • 2. 2 GRAND CHALLENGES REQUIRE MASSIVE COMPUTING REINVENTING THE LI-ION BATTERY 3M Node Hours | 7 Days on Titan UNDERSTANDING HIV’S STRUCTURE 10M node Hours |16 Days on BlueWaters CLOUD RESOLVING CLIMATE SIMULATIONS 100M Node Hours | 840 Days on Piz Daint
  • 3. 3 BAD TIMING THE SLOW DEATH OF MOORE’S LAW
  • 5. 6 SOMETHING NEW: AI + HPC = REVOLUTION
  • 6. 7 INGREDIENTS: BIG DATA Cloud Apps (We are the sensors for our cloud service providers)
  • 9. 10 NOW, JUST ADD HPC AND STIR… 4 60 110 0 20 40 60 80 100 120 2010 2011 2012 2013 2014 GPU Entries Classification Error Rates 28% 26% 16% 12% 7% 0% 5% 10% 15% 20% 25% 30% 2010 2011 2012 2013 2014 Team Date Top-5 Test Error GoogLeNet 2014 6.66% Baidu Deep Image 01/12/2015 5.98% Baidu Deep Image 02/05/2015 5.33% Microsoft 02/05/2015 4.94% Google 03/02/2015 4.82% Baidu Deep Image 03/17/2015 4.83% Classification Task: 1.2M images • 1000 object categoriesEnter Deep Learning Trained Human Performance: 5.1%
  • 10. 11 ALGORITHMS + BIG DATA + GPUS = THE BIG BANG OF MODERN AI Auto Encoders GANLSTM IDSIA CNN on GPU Stanford & NVIDIA Large-scale DNN on GPU U Toronto AlexNet on GPU CaptioningNVIDIA BB8 Style TransferBRETTImageNet Google Photo Arterys FDA Approved AlphaGo Super Resolution Deep Voice Baidu DuLight NMT Superhuman ASR Reinforcement Learning Transfer Learning recognition/classification -> recursion/time series -> generative
  • 11. 12 BEYOND RECOGNITION DNNs Go Generative Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, and Bryan Catanzaro. "High- Resolution Image Synthesis and Semantic Manipulation with Conditional GANs", in CVPR, 2018.
  • 13. 14 BEYOND RECOGNITION DNNs Go Generative “WaveNet: A Generative Model for Raw Audio”, https://arxiv.org/pdf/1609.03499.pdf, {avdnoord, sedielem, heigazen, simonyan, vinyals, gravesa, nalk, andrewsenior, korayk}@google.com, Google DeepMind, London, UK
  • 14. 15 WHAT DOES THIS HAVE TO DO WITH SCIENCE? (HPC + AI = ?)
  • 15. 16 AI on a super-Moore’s Law progression 0 10 20 30 40 50 60 K40 (2014) K80 (2015) P100 (2016) V100 (2017) AMBER Performance (ns/ day) AMBER 14 CUDA 4 AMBER 14 CUDA 6 AMBER 16 CUDA 8 AMBER 16 CUDA 9 0 2400 4800 7200 9600 12000 8X K80 (2014) 8X MAXWELL (2015) DGX-1 (2016) DGX-1V (2017) GoogleNet Performance (i/s) cuDNN 2 CUDA 6 cuDNN 4 CUDA 7 cuDNN 6 CUDA 8 NCCL 1.6 cuDNN 7 CUDA 9 NCCL 2 Amber dataset: Cellulose NVE; GoogLeNet dataset: Imagenet 4x in 3 years 12x in 3 years (65x in 5 years) AI: A NEW COMPUTING PARADIGM
  • 16. 17 2018: 10X AI GAIN IN ONE YEAR DGX-1, SEP’17 DGX-2, Q3‘18 PyTorch Stack: Time to Train FAIRSEQ software improvements across the stack including NCCL, cuDNN, etc. 0 5 10 15 DGX-1V DGX-2 15 days 1.5 days
  • 17. 18NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE. Mix freely with conventional software and algorithms SOFTWARE, BY EXAMPLE Deep Learning builds functions from examples of desired behavior Functions are the building blocks of software. DL can approximate any function. Some functions are too complex to code by hand. Generate complex functions by example. Hurricane Not a hurricane HURRICANE DETECTOR Neural network !" = $(obs) Optimizer
  • 18. 19 THE POWER OF LEARNING FROM DATA Predicting Chaos “Model-Free Prediction of Large Spatiotemporally Chaotic Systems from Data: A Reservoir Computing Approach”, Jaideep Pathak, Brian Hunt, Michelle Girvan, Zhixin Lu, and Edward Ott Phys. Rev. Lett. 120, 024102 – 12 January 2018
  • 19. 20NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE. BIG DATA IN SCIENCE Big Science ingests/outputs Big Data Large Hadron Collider Square Kilometer Array Johns Hopkins Turbulence Database
  • 20. 21 RECOGNITION/CLASSIFICATION Heterogeneous event selection at CMS experiment DNN to reconstruct higher rate of events at lower power Heterogeneous Event Selection at the CMS experiment http://drive.google.com/file/d/0B596cb8D9K9kZjJzdzBRdGY0NFk/preview
  • 21. 22 RECOGNITION/CLASSIFICATION -> CONTROL DL for plasma fusion stability DL enabling better accuracy –- ~95% TP vs. old ~80% -- promising control of live ITER Tokomac Accelerated Deep Learning Discovery in Fusion Energy Science http://on-demand-gtc.gputechconf.com/gtc-quicklink/7zGB7j
  • 22. 23 DL for adaptive optics DL enabling clearer views from the world’s largest ground-based telescopes RECOGNITION/CLASSIFICATION -> CONTROL Helping the Discovery of New Galaxies on the World's Largest Telescopes Using a Large GPU Cluster http://on-demand-gtc.gputechconf.com/gtc-quicklink/ewiELDW
  • 23. 24 RECOGNITION/CLASSIFICATION -> FILTER De-noising gravitational waves DL enabling 5000x faster filtering for real-time multi-messenger astronomy
  • 24. 25 2015: USING NUMERIC SIMULATIONS TO TRAIN AI “Data-driven Fluid Simulations using Regression Forests” http://people.inf.ethz.ch/ladickyl/fluid_sigasia15.pdf
  • 25. 26 2015: USING NUMERIC SIMULATIONS TO TRAIN AI “Data-driven Fluid Simulations using Regression Forests” http://people.inf.ethz.ch/ladickyl/fluid_sigasia15.pdf
  • 26. 27 TRAINING A DEEP LEARNING HPC MODEL ERRORS REGRESSION TESTING (FP16/INT8) INFERENCE (FP16/INT8) TRAINING (FP32/FP16) SIMULATION (FP64/FP32) DATA REGRESSION SET NEW DATA TRAINING SET
  • 27. 28 IS A ML MODEL USEFUL FOR SCIENCE?
  • 28. 29 Background Developing a new drug costs $2.5B and takes 10-15 years. Quantum chemistry (QC) simulations are important to accurately screen millions of potential drugs to a few most promising drug candidates. Challenge QC simulation is computationally expensive so researchers use approximations, compromising on accuracy. To screen 10M drug candidates, it takes 5 years to compute on CPUs. Solution Researchers at the University of Florida and the University of North Carolina leveraged GPU deep learning to develop ANAKIN-ME, to reproduce molecular energy surfaces with super speed (microseconds versus several minutes), extremely high (DFT) accuracy, and at 5-6 orders of magnitude lower cost. Impact Faster, more accurate screening at far lower cost DEEP LEARNING FOR QUANTUM CHEMISTRY
  • 29. 30 NEURAL NETWORK MODEL APPROACH Training set: ~20M DFT data points. Molecules with 1 to 8 atoms from GDB database
  • 32. 33 SATELLITE TO MODEL TRANSLATION Automatically generate inverse map from radiances to model variables SATELLITE RADIANCES WEATHER MODEL VARIABLES No analytic formula available for such a conversion Data assimilation: forward operator + adjoint-sensitivity analysis Deep learning can potentially obtain inverse operator numerically
  • 33. 34 MODEL TRANSLATION BY CONDITIONAL GAN Adversarial model outputs a physically plausible state Both forward and inverse maps For data assimilation and forecast verification Physically plausible state from incomplete data OBSERVATION GOES-15 band 3 MODEL VAR GFS Precipitable water Training 2014-2016 Test 2013 INPUT: GOES-15 GENERATED TARGET: GFS INPUT: GFS GENERATED TARGET: GOES-15
  • 34. 35 DEEP LEARNING FOR MODEL CREATION: MIIDAPS-AI Multi-Instrument Inversion and Data Assimilation Preprocessing System Sid Boukabara NOAA/NESDIS Eric Maddy, Adam Neiss Riverside Technology Inc MIIDAPS-AI TPW Inverse operator for multiple IR and microwave satellites. Iteratively uses CRTM radiative transfer model 5 seconds vs 2 hrs to process one day 1400x speedup.
  • 35. 36 SLOW MOTION SATELLITE LOOP David Hall NVIDIA INPUT GOES-15 band 3, GFS winds OUTPUT Interpolated GOES-15 INPUT FREQ 1 every 3 hours OUTPUT FREQ 1 every 18 minutes Applications: • Visualization • Data Augmentation • Replace dropped frames • Reduce storage requirements 11 input images 110 output frames
  • 36. 37 Resolving physics at sub-grid dimensions DL enabling faster, more accurate climate modeling and predictions DEEP LEARNING FOR CLIMATE MODELING
  • 37. 38 Automating Extreme Weather Detection in Climate Model Output 2018 Gordon Bell Award winner – 1.13 EFLOPS (training at mixed precision) DEEP LEARNING FOR CLIMATE ANALYTICS “Exascale Deep Learning For Climate Analytics”, Jaideep Pathak, Brian Hunt, Michelle Thorsten Kurth, Sean Treichler, Joshua Romero, Mayur Mudigonda, Nathan Luehr, Everett Phillips, Ankur Mahesh, Michael Matheson, Jack Deslippe, Massimiliano Fatica, Prabhat, Mike Houston SC 2018
  • 38. 39 Physics Informed Neural Networks 39 Mass conservation: Momentum conservation: Transport: RESPECTING PHYSICS Deep Learning for CFD
  • 39. 40 DISCOVERING HIDDEN PHYSICS Learned vs. Ground Truth Training set: CFD Simulation of an External Flow over a Cylinder with OpenFOAM
  • 40. 41 BLOOD FLOW IN AN ARTERY
  • 41. 42 THE PROMISE OF HPC+AI Better stability/accuracy -> Higher-quality simulations Resists Moore’s Law fade -> Continuing future simulation progress Lower-precision data requirements -> Larger/finer grids, better science Trained models run at peak speed -> Perfectly accelerate existing applications Inference perf function of DNN arch -> Better science at low incremental cost Reduced code optimization requirement -> Scientists can code simply, naturally Reduced maintenance -> More science/dollar Orders-of-magnitude speed-up/lower energy -> Bigger/longer/cheaper simulations
  • 42. 43 3 ORDERS OF MAGNITUDE .6 MPH 6 MPH 60 MPH 600 MPH 1000x
  • 43. 44 WHAT IF…? (Warning: CTO math) REINVENTING THE LI-ION BATTERY 3M Node Hours | 7 Days on Titan UNDERSTANDING HIV’S STRUCTURE 10M node Hours |16 Days on BlueWaters CLOUD RESOLVING CLIMATE SIMULATIONS 100M Node Hours | 840 Days on Piz Daint 10x: 17 hours 100x: 100 minutes 1000x: 86 seconds 10x: 84 days 100x: 8.4 days 1000x: 20 hours 10x: 1.6 days 100x: 4 hours 1000x: 23 minutes
  • 44. 45 THIS IS ONLY THE BEGINNING…