SlideShare una empresa de Scribd logo
1 de 13
HOW DO I UNDERSTAND DEEP LEARNING
PERFORMANCE?
THERE’S A LOT OF CONFUSION AROUND DEEP
LEARNING PERFORMANCE.
HOW DO YOU MEASURE IT?
WHAT SHOULD YOU MEASURE?
JUST REMEMBER “PLASTER”
A FRAMEWORK FOR DEEP LEARNING
PERFORMANCE
WHAT DOES PLASTER STAND FOR?
Programmability
Latency
Accuracy
Size of Model
Throughput
Energy Efficiency
Rate of Learning
PROGRAMMABILITY
Machine learning is experiencing explosive growth
not only in the size and complexity of the models
but also the burgeoning diversity of neural network
architectures. It is difficult even for experts to
understand the model choices and then choose the
appropriate model to solve their AI business
problems.
After a deep learning model is coded and trained,
it is then optimized for a specific runtime inference
environment. NVIDIA addresses training and
inference challenges with two key tools– CUDA and
TensorRT, NVIDIA’s programmable inference
accelerator. In addition, NVIDIA’s deep learning
platform accelerates ALL deep learning
frameworks, both for training and inference.
LATENCY
Humans and machines need a response to make
decisions and take action. Latency is the time
between requesting something and receiving a
response.
With most human-facing software systems, not just
AI, the time is often measured in milliseconds. Voice
recognition is a commonly understood application,
thanks to Siri, Alexa, and similar voice interfaces.
There is wide demand for digital assistants in both
consumer and customer service applications. But when
humans try to interface with digital assistants, a lag
of even a few seconds starts to feel unnatural.
ACCURACY
While accuracy is important in every industry,
healthcare needs especially high accuracy. Medical
imaging has advanced significantly in the last couple
of decades, increasing usage and requiring more
analysis to identify medical issues. Medical imaging
advancements and usage also mean that large
volumes of data must be transmitted from medical
machines to medical specialists to analyze.
Options to address the data volume issue have been
either to transmit the full information with long
delays or to sample the data and reconstruct it using
techniques that can lead to inaccurate reconstruction
and diagnostics.
An advantage of deep learning is that it can be
trained at high precision and implemented at lower
precision.
SIZE OF MODEL
The size of a deep learning model and the capacity of
the physical network between processors have impacts
on performance, especially in the latency and
throughput aspects of PLASTER. Deep learning network
models are exploding in numbers.
Their size and complexity are also increasing, enabling
far more detailed analysis and driving the need for
more powerful systems for training. In a deep learning
model, the drivers of compute power and physical
network expansion are:
• Number of layers
• Number of nodes (neurons) per layer
• Complexity of computation per layer
• Number of connections between a node at one layer
and the nodes of neighboring layers
THROUGHPUT
Developers are increasingly optimizing inference within a
specified latency threshold. While the latency limit
ensures good customer experience, maximizing
throughput within that limit is critical to maximizing
data center efficiency as well as revenue.
There has been a tendency to use throughput as the only
performance metric, as more computations-per-second
generally leads to better performance across other areas.
However, if a system cannot deliver adequate throughput
within a specified latency requirement, power budget, or
server node count, then the system will not ultimately
serve an application’s inference needs well. Without the
appropriate balance of throughput and latency, the
result can be poor customer service, missing service level
agreements (SLAs), and potentially a failed service.
ENERGY EFFICIENCY
As DL accelerator performance improves, DL accelerator
power consumption escalates. Providing ROI for deep
learning solutions involves more than looking at just the
inference performance of a system. Power consumption
can quickly increase costs of delivering a service, driving a
need to focus on energy efficiency in both devices and
systems.
Speech processing is a good example of a solution that
needs heavy processing to provide an intelligent response
in a natural voice. Data center inference providing real-
time processing for speech can easily involve large racks
of machines that can impact a company’s total cost of
ownership (TCO). Therefore, the industry measures
operational success in inferences-per-watt (higher is
better). Hyperscale data centers seek to maximize energy
efficiency for as many inferences as they can deliver with
a fixed power budget.
RATE OF LEARNING
As one of the two words in “AI” is intelligence, users
will want the neural networks to learn and adapt in a
reasonable timeframe. For complex DL systems to gain
traction in business, software tool developers must
support the DevOps movement.
As organizations continue to experiment with deep
learning and neural networks, they are learning how to
more effectively build and implement DL systems. DL
models must be retrained periodically as inferencing
services gather new data and as services grow and
change. Therefore, IT organizations and software
developers must increase the rate at which they can
retrain models as new data arrives.
HOW CAN PLASTER IMPACT YOUR BUSINESS?
DOWNLOAD WHITEPAPER

Más contenido relacionado

La actualidad más candente

HPC Top 5 Stories: April 26, 2018
HPC Top 5 Stories: April 26, 2018HPC Top 5 Stories: April 26, 2018
HPC Top 5 Stories: April 26, 2018NVIDIA
 
GTC Europe 2017 Keynote
GTC Europe 2017 KeynoteGTC Europe 2017 Keynote
GTC Europe 2017 KeynoteNVIDIA
 
NVIDIA Corporation Brochure: Who We Are
NVIDIA Corporation Brochure: Who We AreNVIDIA Corporation Brochure: Who We Are
NVIDIA Corporation Brochure: Who We AreNVIDIA
 
HPC Top 5 Stories: July, 21, 2017
 HPC Top 5 Stories: July, 21, 2017 HPC Top 5 Stories: July, 21, 2017
HPC Top 5 Stories: July, 21, 2017NVIDIA
 
HPC Top 5 Stories: Nov. 21, 2016
HPC Top 5 Stories: Nov. 21, 2016HPC Top 5 Stories: Nov. 21, 2016
HPC Top 5 Stories: Nov. 21, 2016NVIDIA
 
GTC 2015 Highlights
GTC 2015 HighlightsGTC 2015 Highlights
GTC 2015 HighlightsNVIDIA
 
The Convergence of HPC and Deep Learning
The Convergence of HPC and Deep LearningThe Convergence of HPC and Deep Learning
The Convergence of HPC and Deep LearningNVIDIA
 
Harness the Power of AI and Deep Learning for Business
Harness the Power of AI and Deep Learning for BusinessHarness the Power of AI and Deep Learning for Business
Harness the Power of AI and Deep Learning for BusinessNVIDIA
 
AI For Enterprise
AI For EnterpriseAI For Enterprise
AI For EnterpriseNVIDIA
 
Fueling the Next Wave of AI Discovery - CVPR 2018
Fueling the Next Wave of AI Discovery - CVPR 2018Fueling the Next Wave of AI Discovery - CVPR 2018
Fueling the Next Wave of AI Discovery - CVPR 2018NVIDIA
 
NVIDIA 2017 Overview
NVIDIA 2017 OverviewNVIDIA 2017 Overview
NVIDIA 2017 OverviewNVIDIA
 
HPC Top 5 Stories: Nov. 11, 2016
HPC Top 5 Stories: Nov. 11, 2016HPC Top 5 Stories: Nov. 11, 2016
HPC Top 5 Stories: Nov. 11, 2016NVIDIA
 
HPC Top 5 Stories: September 29, 2017
HPC Top 5 Stories: September 29, 2017HPC Top 5 Stories: September 29, 2017
HPC Top 5 Stories: September 29, 2017NVIDIA
 
ISC 2016 Day 3 Recap
ISC 2016 Day 3 RecapISC 2016 Day 3 Recap
ISC 2016 Day 3 RecapOpenPOWERorg
 
HPC Top 5 Stories: Dec. 7, 2016
HPC Top 5 Stories: Dec. 7, 2016HPC Top 5 Stories: Dec. 7, 2016
HPC Top 5 Stories: Dec. 7, 2016NVIDIA
 
ISC 2016 Day 2 Recap
ISC 2016 Day 2 RecapISC 2016 Day 2 Recap
ISC 2016 Day 2 RecapOpenPOWERorg
 
ISC 2016 Day 1 Recap
ISC 2016 Day 1 RecapISC 2016 Day 1 Recap
ISC 2016 Day 1 RecapOpenPOWERorg
 
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA
 
5 biggest hpc trends 2021
5 biggest hpc trends 20215 biggest hpc trends 2021
5 biggest hpc trends 2021Sandeep Mishra
 
GTC 2018: A New AI Era Dawns
GTC 2018: A New AI Era DawnsGTC 2018: A New AI Era Dawns
GTC 2018: A New AI Era DawnsNVIDIA
 

La actualidad más candente (20)

HPC Top 5 Stories: April 26, 2018
HPC Top 5 Stories: April 26, 2018HPC Top 5 Stories: April 26, 2018
HPC Top 5 Stories: April 26, 2018
 
GTC Europe 2017 Keynote
GTC Europe 2017 KeynoteGTC Europe 2017 Keynote
GTC Europe 2017 Keynote
 
NVIDIA Corporation Brochure: Who We Are
NVIDIA Corporation Brochure: Who We AreNVIDIA Corporation Brochure: Who We Are
NVIDIA Corporation Brochure: Who We Are
 
HPC Top 5 Stories: July, 21, 2017
 HPC Top 5 Stories: July, 21, 2017 HPC Top 5 Stories: July, 21, 2017
HPC Top 5 Stories: July, 21, 2017
 
HPC Top 5 Stories: Nov. 21, 2016
HPC Top 5 Stories: Nov. 21, 2016HPC Top 5 Stories: Nov. 21, 2016
HPC Top 5 Stories: Nov. 21, 2016
 
GTC 2015 Highlights
GTC 2015 HighlightsGTC 2015 Highlights
GTC 2015 Highlights
 
The Convergence of HPC and Deep Learning
The Convergence of HPC and Deep LearningThe Convergence of HPC and Deep Learning
The Convergence of HPC and Deep Learning
 
Harness the Power of AI and Deep Learning for Business
Harness the Power of AI and Deep Learning for BusinessHarness the Power of AI and Deep Learning for Business
Harness the Power of AI and Deep Learning for Business
 
AI For Enterprise
AI For EnterpriseAI For Enterprise
AI For Enterprise
 
Fueling the Next Wave of AI Discovery - CVPR 2018
Fueling the Next Wave of AI Discovery - CVPR 2018Fueling the Next Wave of AI Discovery - CVPR 2018
Fueling the Next Wave of AI Discovery - CVPR 2018
 
NVIDIA 2017 Overview
NVIDIA 2017 OverviewNVIDIA 2017 Overview
NVIDIA 2017 Overview
 
HPC Top 5 Stories: Nov. 11, 2016
HPC Top 5 Stories: Nov. 11, 2016HPC Top 5 Stories: Nov. 11, 2016
HPC Top 5 Stories: Nov. 11, 2016
 
HPC Top 5 Stories: September 29, 2017
HPC Top 5 Stories: September 29, 2017HPC Top 5 Stories: September 29, 2017
HPC Top 5 Stories: September 29, 2017
 
ISC 2016 Day 3 Recap
ISC 2016 Day 3 RecapISC 2016 Day 3 Recap
ISC 2016 Day 3 Recap
 
HPC Top 5 Stories: Dec. 7, 2016
HPC Top 5 Stories: Dec. 7, 2016HPC Top 5 Stories: Dec. 7, 2016
HPC Top 5 Stories: Dec. 7, 2016
 
ISC 2016 Day 2 Recap
ISC 2016 Day 2 RecapISC 2016 Day 2 Recap
ISC 2016 Day 2 Recap
 
ISC 2016 Day 1 Recap
ISC 2016 Day 1 RecapISC 2016 Day 1 Recap
ISC 2016 Day 1 Recap
 
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
 
5 biggest hpc trends 2021
5 biggest hpc trends 20215 biggest hpc trends 2021
5 biggest hpc trends 2021
 
GTC 2018: A New AI Era Dawns
GTC 2018: A New AI Era DawnsGTC 2018: A New AI Era Dawns
GTC 2018: A New AI Era Dawns
 

Similar a How Do I Understand Deep Learning Performance?

TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...
TECHNICAL OVERVIEW NVIDIA DEEP  LEARNING PLATFORM Giant Leaps in Performance ...TECHNICAL OVERVIEW NVIDIA DEEP  LEARNING PLATFORM Giant Leaps in Performance ...
TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...Willy Marroquin (WillyDevNET)
 
Whitepaper: Simplifying Data Center Network Management Leveraging SDN - Happi...
Whitepaper: Simplifying Data Center Network Management Leveraging SDN - Happi...Whitepaper: Simplifying Data Center Network Management Leveraging SDN - Happi...
Whitepaper: Simplifying Data Center Network Management Leveraging SDN - Happi...Happiest Minds Technologies
 
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING mlaij
 
Performance Comparison between Pytorch and Mindspore
Performance Comparison between Pytorch and MindsporePerformance Comparison between Pytorch and Mindspore
Performance Comparison between Pytorch and Mindsporeijdms
 
Migration services (DB2 to Teradata)
Migration services (DB2  to Teradata)Migration services (DB2  to Teradata)
Migration services (DB2 to Teradata)ModakAnalytics
 
Network performance - skilled craft to hard science
Network performance - skilled craft to hard scienceNetwork performance - skilled craft to hard science
Network performance - skilled craft to hard scienceMartin Geddes
 
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...inside-BigData.com
 
Data center-terminology photostory-
Data center-terminology photostory-Data center-terminology photostory-
Data center-terminology photostory-VenkatRamana242
 
Electi Deep Learning Optimization
Electi  Deep Learning OptimizationElecti  Deep Learning Optimization
Electi Deep Learning OptimizationNikolas Markou
 
Machine Learning-Based Prefetch Optimization for Data Center ...
Machine Learning-Based Prefetch Optimization for Data Center ...Machine Learning-Based Prefetch Optimization for Data Center ...
Machine Learning-Based Prefetch Optimization for Data Center ...butest
 
Building Data Ecosystems for Accelerated Discovery
Building Data Ecosystems for Accelerated DiscoveryBuilding Data Ecosystems for Accelerated Discovery
Building Data Ecosystems for Accelerated Discoveryadamkraut
 
STUDY OF VARIOUS FACTORS AFFECTING PERFORMANCE OF MULTI-CORE PROCESSORS
STUDY OF VARIOUS FACTORS AFFECTING PERFORMANCE OF MULTI-CORE PROCESSORSSTUDY OF VARIOUS FACTORS AFFECTING PERFORMANCE OF MULTI-CORE PROCESSORS
STUDY OF VARIOUS FACTORS AFFECTING PERFORMANCE OF MULTI-CORE PROCESSORSijdpsjournal
 

Similar a How Do I Understand Deep Learning Performance? (20)

TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...
TECHNICAL OVERVIEW NVIDIA DEEP  LEARNING PLATFORM Giant Leaps in Performance ...TECHNICAL OVERVIEW NVIDIA DEEP  LEARNING PLATFORM Giant Leaps in Performance ...
TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...
 
Whitepaper: Simplifying Data Center Network Management Leveraging SDN - Happi...
Whitepaper: Simplifying Data Center Network Management Leveraging SDN - Happi...Whitepaper: Simplifying Data Center Network Management Leveraging SDN - Happi...
Whitepaper: Simplifying Data Center Network Management Leveraging SDN - Happi...
 
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
A SURVEY OF NEURAL NETWORK HARDWARE ACCELERATORS IN MACHINE LEARNING
 
1605.08695.pdf
1605.08695.pdf1605.08695.pdf
1605.08695.pdf
 
Performance Comparison between Pytorch and Mindspore
Performance Comparison between Pytorch and MindsporePerformance Comparison between Pytorch and Mindspore
Performance Comparison between Pytorch and Mindspore
 
Migration services (DB2 to Teradata)
Migration services (DB2  to Teradata)Migration services (DB2  to Teradata)
Migration services (DB2 to Teradata)
 
Priorities Shift In IC Design
Priorities Shift In IC DesignPriorities Shift In IC Design
Priorities Shift In IC Design
 
Network performance - skilled craft to hard science
Network performance - skilled craft to hard scienceNetwork performance - skilled craft to hard science
Network performance - skilled craft to hard science
 
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
 
Data center-terminology photostory-
Data center-terminology photostory-Data center-terminology photostory-
Data center-terminology photostory-
 
Data center terminology photostory
Data center terminology photostoryData center terminology photostory
Data center terminology photostory
 
Electi Deep Learning Optimization
Electi  Deep Learning OptimizationElecti  Deep Learning Optimization
Electi Deep Learning Optimization
 
FINAL VER - 2015_09
FINAL VER - 2015_09FINAL VER - 2015_09
FINAL VER - 2015_09
 
Technovision
TechnovisionTechnovision
Technovision
 
FC Brochure & Insert
FC Brochure & InsertFC Brochure & Insert
FC Brochure & Insert
 
Machine Learning-Based Prefetch Optimization for Data Center ...
Machine Learning-Based Prefetch Optimization for Data Center ...Machine Learning-Based Prefetch Optimization for Data Center ...
Machine Learning-Based Prefetch Optimization for Data Center ...
 
Building Data Ecosystems for Accelerated Discovery
Building Data Ecosystems for Accelerated DiscoveryBuilding Data Ecosystems for Accelerated Discovery
Building Data Ecosystems for Accelerated Discovery
 
STUDY OF VARIOUS FACTORS AFFECTING PERFORMANCE OF MULTI-CORE PROCESSORS
STUDY OF VARIOUS FACTORS AFFECTING PERFORMANCE OF MULTI-CORE PROCESSORSSTUDY OF VARIOUS FACTORS AFFECTING PERFORMANCE OF MULTI-CORE PROCESSORS
STUDY OF VARIOUS FACTORS AFFECTING PERFORMANCE OF MULTI-CORE PROCESSORS
 
accelerate-intelligent-solutions-with-machine-learning-platform-brief
accelerate-intelligent-solutions-with-machine-learning-platform-briefaccelerate-intelligent-solutions-with-machine-learning-platform-brief
accelerate-intelligent-solutions-with-machine-learning-platform-brief
 
Value Stories - 7th Issue
Value Stories - 7th Issue Value Stories - 7th Issue
Value Stories - 7th Issue
 

Más de NVIDIA

NVIDIA Story 2023.pdf
NVIDIA Story 2023.pdfNVIDIA Story 2023.pdf
NVIDIA Story 2023.pdfNVIDIA
 
NVIDIA GTC2022 Spring Highlights
NVIDIA GTC2022 Spring HighlightsNVIDIA GTC2022 Spring Highlights
NVIDIA GTC2022 Spring HighlightsNVIDIA
 
NVIDIA Brochure 2021 Company Overview
NVIDIA Brochure 2021 Company OverviewNVIDIA Brochure 2021 Company Overview
NVIDIA Brochure 2021 Company OverviewNVIDIA
 
NVIDIA GTC 2020 October Summary
NVIDIA GTC 2020 October SummaryNVIDIA GTC 2020 October Summary
NVIDIA GTC 2020 October SummaryNVIDIA
 
The Best of AI and HPC in Healthcare and Life Sciences
The Best of AI and HPC in Healthcare and Life SciencesThe Best of AI and HPC in Healthcare and Life Sciences
The Best of AI and HPC in Healthcare and Life SciencesNVIDIA
 
NLP for Biomedical Applications
NLP for Biomedical ApplicationsNLP for Biomedical Applications
NLP for Biomedical ApplicationsNVIDIA
 
Top 5 Deep Learning and AI Stories - August 30, 2019
Top 5 Deep Learning and AI Stories - August 30, 2019Top 5 Deep Learning and AI Stories - August 30, 2019
Top 5 Deep Learning and AI Stories - August 30, 2019NVIDIA
 
Seven Ways to Boost Artificial Intelligence Research
Seven Ways to Boost Artificial Intelligence ResearchSeven Ways to Boost Artificial Intelligence Research
Seven Ways to Boost Artificial Intelligence ResearchNVIDIA
 
NVIDIA Developer Program Overview
NVIDIA Developer Program OverviewNVIDIA Developer Program Overview
NVIDIA Developer Program OverviewNVIDIA
 
NVIDIA at Computex 2019
NVIDIA at Computex 2019 NVIDIA at Computex 2019
NVIDIA at Computex 2019 NVIDIA
 
Top 5 DGX Sessions From GTC 2019
Top 5 DGX Sessions From GTC 2019Top 5 DGX Sessions From GTC 2019
Top 5 DGX Sessions From GTC 2019NVIDIA
 
Top 5 Data Science Sessions from GTC 2019
Top 5 Data Science Sessions from GTC 2019Top 5 Data Science Sessions from GTC 2019
Top 5 Data Science Sessions from GTC 2019NVIDIA
 
This Week in Data Science - Top 5 News - April 26, 2019
This Week in Data Science - Top 5 News - April 26, 2019This Week in Data Science - Top 5 News - April 26, 2019
This Week in Data Science - Top 5 News - April 26, 2019NVIDIA
 
GTC 2019 Keynote in Silicon Valley
GTC 2019 Keynote in Silicon ValleyGTC 2019 Keynote in Silicon Valley
GTC 2019 Keynote in Silicon ValleyNVIDIA
 
CUDA DLI Training Courses at GTC 2019
CUDA DLI Training Courses at GTC 2019CUDA DLI Training Courses at GTC 2019
CUDA DLI Training Courses at GTC 2019NVIDIA
 
DGX Sessions You Won't Want to Miss at GTC 2019
DGX Sessions You Won't Want to Miss at GTC 2019DGX Sessions You Won't Want to Miss at GTC 2019
DGX Sessions You Won't Want to Miss at GTC 2019NVIDIA
 
Transforming Healthcare at GTC Silicon Valley
Transforming Healthcare at GTC Silicon ValleyTransforming Healthcare at GTC Silicon Valley
Transforming Healthcare at GTC Silicon ValleyNVIDIA
 
OpenACC Monthly Highlights February 2019
OpenACC Monthly Highlights February 2019OpenACC Monthly Highlights February 2019
OpenACC Monthly Highlights February 2019NVIDIA
 
CUDA Sessions You Won't Want to Miss at GTC 2019
CUDA Sessions You Won't Want to Miss at GTC 2019CUDA Sessions You Won't Want to Miss at GTC 2019
CUDA Sessions You Won't Want to Miss at GTC 2019NVIDIA
 
Empowering Radiology with AI
Empowering Radiology with AIEmpowering Radiology with AI
Empowering Radiology with AINVIDIA
 

Más de NVIDIA (20)

NVIDIA Story 2023.pdf
NVIDIA Story 2023.pdfNVIDIA Story 2023.pdf
NVIDIA Story 2023.pdf
 
NVIDIA GTC2022 Spring Highlights
NVIDIA GTC2022 Spring HighlightsNVIDIA GTC2022 Spring Highlights
NVIDIA GTC2022 Spring Highlights
 
NVIDIA Brochure 2021 Company Overview
NVIDIA Brochure 2021 Company OverviewNVIDIA Brochure 2021 Company Overview
NVIDIA Brochure 2021 Company Overview
 
NVIDIA GTC 2020 October Summary
NVIDIA GTC 2020 October SummaryNVIDIA GTC 2020 October Summary
NVIDIA GTC 2020 October Summary
 
The Best of AI and HPC in Healthcare and Life Sciences
The Best of AI and HPC in Healthcare and Life SciencesThe Best of AI and HPC in Healthcare and Life Sciences
The Best of AI and HPC in Healthcare and Life Sciences
 
NLP for Biomedical Applications
NLP for Biomedical ApplicationsNLP for Biomedical Applications
NLP for Biomedical Applications
 
Top 5 Deep Learning and AI Stories - August 30, 2019
Top 5 Deep Learning and AI Stories - August 30, 2019Top 5 Deep Learning and AI Stories - August 30, 2019
Top 5 Deep Learning and AI Stories - August 30, 2019
 
Seven Ways to Boost Artificial Intelligence Research
Seven Ways to Boost Artificial Intelligence ResearchSeven Ways to Boost Artificial Intelligence Research
Seven Ways to Boost Artificial Intelligence Research
 
NVIDIA Developer Program Overview
NVIDIA Developer Program OverviewNVIDIA Developer Program Overview
NVIDIA Developer Program Overview
 
NVIDIA at Computex 2019
NVIDIA at Computex 2019 NVIDIA at Computex 2019
NVIDIA at Computex 2019
 
Top 5 DGX Sessions From GTC 2019
Top 5 DGX Sessions From GTC 2019Top 5 DGX Sessions From GTC 2019
Top 5 DGX Sessions From GTC 2019
 
Top 5 Data Science Sessions from GTC 2019
Top 5 Data Science Sessions from GTC 2019Top 5 Data Science Sessions from GTC 2019
Top 5 Data Science Sessions from GTC 2019
 
This Week in Data Science - Top 5 News - April 26, 2019
This Week in Data Science - Top 5 News - April 26, 2019This Week in Data Science - Top 5 News - April 26, 2019
This Week in Data Science - Top 5 News - April 26, 2019
 
GTC 2019 Keynote in Silicon Valley
GTC 2019 Keynote in Silicon ValleyGTC 2019 Keynote in Silicon Valley
GTC 2019 Keynote in Silicon Valley
 
CUDA DLI Training Courses at GTC 2019
CUDA DLI Training Courses at GTC 2019CUDA DLI Training Courses at GTC 2019
CUDA DLI Training Courses at GTC 2019
 
DGX Sessions You Won't Want to Miss at GTC 2019
DGX Sessions You Won't Want to Miss at GTC 2019DGX Sessions You Won't Want to Miss at GTC 2019
DGX Sessions You Won't Want to Miss at GTC 2019
 
Transforming Healthcare at GTC Silicon Valley
Transforming Healthcare at GTC Silicon ValleyTransforming Healthcare at GTC Silicon Valley
Transforming Healthcare at GTC Silicon Valley
 
OpenACC Monthly Highlights February 2019
OpenACC Monthly Highlights February 2019OpenACC Monthly Highlights February 2019
OpenACC Monthly Highlights February 2019
 
CUDA Sessions You Won't Want to Miss at GTC 2019
CUDA Sessions You Won't Want to Miss at GTC 2019CUDA Sessions You Won't Want to Miss at GTC 2019
CUDA Sessions You Won't Want to Miss at GTC 2019
 
Empowering Radiology with AI
Empowering Radiology with AIEmpowering Radiology with AI
Empowering Radiology with AI
 

Último

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 

Último (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 

How Do I Understand Deep Learning Performance?

  • 1. HOW DO I UNDERSTAND DEEP LEARNING PERFORMANCE?
  • 2. THERE’S A LOT OF CONFUSION AROUND DEEP LEARNING PERFORMANCE.
  • 3. HOW DO YOU MEASURE IT? WHAT SHOULD YOU MEASURE?
  • 4. JUST REMEMBER “PLASTER” A FRAMEWORK FOR DEEP LEARNING PERFORMANCE
  • 5. WHAT DOES PLASTER STAND FOR? Programmability Latency Accuracy Size of Model Throughput Energy Efficiency Rate of Learning
  • 6. PROGRAMMABILITY Machine learning is experiencing explosive growth not only in the size and complexity of the models but also the burgeoning diversity of neural network architectures. It is difficult even for experts to understand the model choices and then choose the appropriate model to solve their AI business problems. After a deep learning model is coded and trained, it is then optimized for a specific runtime inference environment. NVIDIA addresses training and inference challenges with two key tools– CUDA and TensorRT, NVIDIA’s programmable inference accelerator. In addition, NVIDIA’s deep learning platform accelerates ALL deep learning frameworks, both for training and inference.
  • 7. LATENCY Humans and machines need a response to make decisions and take action. Latency is the time between requesting something and receiving a response. With most human-facing software systems, not just AI, the time is often measured in milliseconds. Voice recognition is a commonly understood application, thanks to Siri, Alexa, and similar voice interfaces. There is wide demand for digital assistants in both consumer and customer service applications. But when humans try to interface with digital assistants, a lag of even a few seconds starts to feel unnatural.
  • 8. ACCURACY While accuracy is important in every industry, healthcare needs especially high accuracy. Medical imaging has advanced significantly in the last couple of decades, increasing usage and requiring more analysis to identify medical issues. Medical imaging advancements and usage also mean that large volumes of data must be transmitted from medical machines to medical specialists to analyze. Options to address the data volume issue have been either to transmit the full information with long delays or to sample the data and reconstruct it using techniques that can lead to inaccurate reconstruction and diagnostics. An advantage of deep learning is that it can be trained at high precision and implemented at lower precision.
  • 9. SIZE OF MODEL The size of a deep learning model and the capacity of the physical network between processors have impacts on performance, especially in the latency and throughput aspects of PLASTER. Deep learning network models are exploding in numbers. Their size and complexity are also increasing, enabling far more detailed analysis and driving the need for more powerful systems for training. In a deep learning model, the drivers of compute power and physical network expansion are: • Number of layers • Number of nodes (neurons) per layer • Complexity of computation per layer • Number of connections between a node at one layer and the nodes of neighboring layers
  • 10. THROUGHPUT Developers are increasingly optimizing inference within a specified latency threshold. While the latency limit ensures good customer experience, maximizing throughput within that limit is critical to maximizing data center efficiency as well as revenue. There has been a tendency to use throughput as the only performance metric, as more computations-per-second generally leads to better performance across other areas. However, if a system cannot deliver adequate throughput within a specified latency requirement, power budget, or server node count, then the system will not ultimately serve an application’s inference needs well. Without the appropriate balance of throughput and latency, the result can be poor customer service, missing service level agreements (SLAs), and potentially a failed service.
  • 11. ENERGY EFFICIENCY As DL accelerator performance improves, DL accelerator power consumption escalates. Providing ROI for deep learning solutions involves more than looking at just the inference performance of a system. Power consumption can quickly increase costs of delivering a service, driving a need to focus on energy efficiency in both devices and systems. Speech processing is a good example of a solution that needs heavy processing to provide an intelligent response in a natural voice. Data center inference providing real- time processing for speech can easily involve large racks of machines that can impact a company’s total cost of ownership (TCO). Therefore, the industry measures operational success in inferences-per-watt (higher is better). Hyperscale data centers seek to maximize energy efficiency for as many inferences as they can deliver with a fixed power budget.
  • 12. RATE OF LEARNING As one of the two words in “AI” is intelligence, users will want the neural networks to learn and adapt in a reasonable timeframe. For complex DL systems to gain traction in business, software tool developers must support the DevOps movement. As organizations continue to experiment with deep learning and neural networks, they are learning how to more effectively build and implement DL systems. DL models must be retrained periodically as inferencing services gather new data and as services grow and change. Therefore, IT organizations and software developers must increase the rate at which they can retrain models as new data arrives.
  • 13. HOW CAN PLASTER IMPACT YOUR BUSINESS? DOWNLOAD WHITEPAPER