SlideShare una empresa de Scribd logo
1 de 36
Descargar para leer sin conexión
October 28, 2021 @QCOMResearch
Qualcomm Technologies, Inc.
Enabling on-device
learning at scale
2
Today’s agenda
• What is on-device learning and why
is it crucial for scaling intelligence?
• Our latest on-device learning research
and results
• Conclusions and future directions
• Questions?
3
3
Video monitoring
Extended reality Smart cities
Smart factories
Autonomous vehicles
Video conferencing
Smart homes
Smartphone
The need for intelligent, personalized
experiences powered by AI is ever-growing
How do we maintain privacy and deal
with all the data from edge devices?
4
On-device
intelligence is
paramount
Process data closest to the
source, complement the cloud
Privacy
Reliability
Low latency
Efficient use of
network bandwidth
5
Public network
Private networks
Transformation of the
Connected Intelligent
Edge has begun at scale
Edge cloud
Cloud On-device
Past
Cloud-centric AI
AI training and
inference in the
central cloud
Future
Fully-distributed AI
With lifelong on-
device learning
Today
Partially-distributed AI
Power-efficient
on-device AI inference
Processing data closer to devices at the edge derives
new system values (e.g., lower latency, enhanced privacy)
6
Object
classification
Voice
activation
Text
recognition
Gesture/
hand tracking
Computational
photography
Contextual
awareness
Face
detection
On-device
security
Voice
recognition
Fingerprint
Local network
analytics
Low-latency
interactive content
Boundless XR
On-demand
computing
Industrial
automation
and control
Enterprise data
+
Edge cloud
AI
On-device
AI
Connected
Intelligent
Edge
brings new and
enhanced services
7
Data and
labels
Training
Offline training
A model is trained
in the cloud with
data reflecting the
target application
On-device learning
Modifying model
after deployment
based on the
test environment
Deploy
What is
on-device
learning? Test
data
Adapt
model
Inference
8
Data and
labels
Training
With offline training,
the test data can differ
from training data
(domain shift, distribution
shift, anomalies) and may
even change continuously
Test
data
On-device learning can
help to improve and
maintain accuracy when
original pre-trained model
cannot generalize well
Adapt
model
Inference
Deploy
On-device
learning
offers several
benefits
• Continuous learning
• Personalization
• Data privacy
• Scale
9
Overcoming challenges to achieve on-device ML benefits
Important considerations for on-device learning to achieve benefits for different use cases
Challenges
Benefits
• Better examples
than training dataset
• Ability to run with smaller
models that adapt to the
target data
• Preservation of privacy
during model development
• Local data can be limited,
e.g., noisy labels and
class imbalance
• Overfitting or catastrophic
forgetting
• Limited compute, storage,
and/or power
• Adversarial attacks
to training
• Federated learning
communication
overhead
Test
data
Adapt
model
9
On-device
learning
10
Our AI
research
areas address
the key
deployment
challenges of
on-device
learning
10
How to adapt
the model to
a few labeled
samples
Few-shot
learning
How to use
unlabeled
data to do
unsupervised
learning
Continuous
learning with
unlabeled
data
How to implement
federate learning
at scale
and address
deployment
challenges
Federated
learning
for global
adaptation
Low-complexity
on-device
learning
How to implement
on-device learning to
improve efficiency
11
11
Improve the target user’s model using the
initial collected data, such as enrollment
Learning from limited labeled data is crucial
Few-shot
learning
Offline learning
11
Deploy global
model
On-device learning
Target user labeled data (e.g., from enrollment)
Limited samples with
user-assisted labeling
May be from cleaner
environment
Cloud data
and training
Enroll
12
12
Qualcomm AI Research internal results .
Few-shot learning for increased personalization
Improving keyword spotting (KWS) performance of outlier users through on-device learning
Detection rate for outlier users is over 30% worse, on average
Identify when a keyword
is spoken using always-on ML
Keyword
spotting
• In practice, it is hard to collect
all types of accented utterance
• The KWS model may not be
sensitive to users’ accents and
have poor performance for outliers
Keyword spotting
challenge
• Locally adapt the model
to user enrollments
• Personalize the model
at enrollment time
Keyword spotting
solution
13
Local
adaptation
• Collect enrollment data from target user
• Adapt the global model on local data
Personalized
Model
Local training
Global
model
Keyword
Non-keyword
Dataset from local
enrollment
Device
How to locally adapt keyword spotting for personalization
Server
Train a global
KWS model
• Global train/validation dataset
Global training dataset
Global
model
Validation
Train
Global validation dataset
14
Qualcomm AI Research internal results. For the few-shot results,
performance is the average of all the locally adapted models.
Personalization
improvements
across the board
but particularly
for outliers
Few-shot
learning
for KWS
improves
performance
Average detection rate improvement
Few-shot vs baseline model
0% 5% 10% 15% 20% 25% 30%
Inlier
Outlier
Percentage improvement (%)
30%
Improvement in
detection rate*
Up to
15
15
Improve the target user’s model based on data
from continuous operation, often unlabeled data
Leveraging user data throughout deployment
Continuous
learning
Offline learning
Deploy global
model
On-device learning
Target user data through continuous operation
Plentiful samples of
unlabeled user data
Can be from various
environment: clean, noisy
Cloud data
and training
Enroll
Day1, 2, …
16
Solving the challenges for continuous learning
Employ pseudo labeling and regularization to reduce impact from forgetting
Challenge
Training data are
collected on the
device without labels
Unlabeled
collected data
Solution
Assign pseudo labels
to training data through
the verification process
Overfitting to
small data
Challenge
Number of collected
data is small
Solution
Exploit regularization loss
that maintains some metrics
from pre-trained model
17
17
Global adaptation
Local adaptation
Federated learning brings on-device learning to new level
Adaptation on the device, once or continuously, locally and/or globally for continuous model enhancement
Federated learning for global adaptation
17
Offline learning
Data
On-device learning
Locally adapt once to a few samples
(e.g., few shot learning) or continuously
(e.g., unsupervised learning)
Adapt model
based on
local data
Offline training prior to deployment
Federated learning
Aggregate model updates across
multiple users to globally improve
model from more diverse data
18
18
Each device
collects
personal data
and performs
on-device
training
Federated learning over 5G is the way to scale intelligence
Cloud
A state-of-art global
AI model is distributed
to devices
D1
D2
DN
2 Scale
Personalization
Privacy
Processing is spread
over many devices
Model customized based
on your personal data
Raw data stays
on the device
1
19
19
Federated learning over 5G is the way to scale intelligence
1.
Adds perturbation
to the parameters
of the AI model
Cloud
The cloud updates the
AI model and then sends
it to the devices again
4
D1
D2
DN
3
Each device:
2.
Compresses the
parameters
3.
Encrypts them
4.
Sends the update
to the cloud
Scale
Privacy
Network bandwidth
is conserved
Only noisy and encrypted
weights sent to the cloud
20
20
“Federated Learning of User verification Models Without Sharing Embeddings” (ICML 2021)
Federated learning can be a powerful tool for user verification
User verification
The authentication problem needs big data
to get a powerful verification model
E.g., typical speaker verification system needs
data from more than 600k different speakers
Challenge
How can we learn this model
while keeping all data private?
We do not want to compromise the sensitive biometric data
of training participants
Personal data available for authentication Deep learning approach for authentication
Face
Fingerprint
Voice
Medical
Data
Client 𝐾
resource
𝑥
𝑔(𝑥)
𝑧
Data User score
𝑔 𝑊
User A
User B
User C
Embedding vector W
Output 𝑔 𝑥
21
“Federated Learning of User verification
Models Without Sharing Embeddings” (ICML 2021)
User embeddings
need to be shared
for training
Traditional
design of neural
networks for
user verification
do not preserve
privacy
𝑥
𝑔(𝑥)
𝑧
Data User score
𝑔 𝑊
User A
User B
User C
Embedding vector W
Output 𝑔 𝑥
For user verification, neural network 𝒈 𝒙 should be trained to:
Minimize the loss to the target user
A smaller loss means a higher user score
Maximize the loss to the other users
In traditional (one-hot) approaches, users share
embeddings to calculate this loss (not private)
22
“Federated Learning of User verification
Models Without Sharing Embeddings” (ICML 2021)
Generate user
embeddings using
error-correcting
codes (ECC)
We enable
federated
learning for
user verification
without users
sharing their
embeddings
𝑥
𝑔(𝑥)
𝑧
Data User score
𝑔 𝑊
User A
User B
User C
Our method (FedUV) accomplishes this by using embeddings
that are codewords of error correcting codes (ECC) and
optimizes network 𝒈 𝒙 using only positive loss function
Each user minimizes their own loss
ECC ensures user embeddings
are maximally spaced to reduce
score to other users
Embedding vector W
Output 𝑔 𝑥
23
“Federated Learning of User verification
Models Without Sharing Embeddings” (ICML 2021)
FedUV
achieves
state-of-the-art
verification
performance
without users
sharing their
embeddings
FedUV (127)
FedUV (255)
FedUV (511)
FedAws
Softmax
1.0
0.8
0.6
0.4
0.2
0.00 0.02 0.04 0.06 0.08 0.10
False positive rate
True
positive
rate
FedUV is better than existing
method, which does not share
user embeddings (FedAWS)
FedUV is comparable to the
best method, which shares
user embeddings (softmax)
24
24
RPC: remote procedure call.
Snapdragon is a product of Qualcomm Technologies, Inc. and/or its subsidiaries.
FL framework for research and
application development on mobile
FL demo of
speaker
verification
• Enrollment from
1000 clients
• Leverage PyTorch
model & training pipeline
from research framework
Samsung Galaxy S21
device powered by
Snapdragon® 888 Platform
Python
control
ML experts Coordinator server Mobile devices
Android app
Pipe
Worker
host
gRPC
Torch
host
LibTorch
C++
Android app
Pipe
Worker
host
gRPC
Torch
host
LibTorch
C++
Android app
Pipe
Worker
host
gRPC
Torch
host
LibTorch
C++
Android app
Pipe
Worker
host
gRPC
Torch
host
LibTorch
C++
Controller
gRPC
Worker
manager
PyTorch / TensorFlow code
FL trainer
gRPC
TCP/IP
network
TCP/IP
network
Worker
25
Low-complexity
on-device learning
26
Learning with backprop is computationally demanding
Updating the model weights using backprop can be expensive, especially on power-constrained devices
Data and
labels
Training
Backward pass with gradient computation = “backpropagation”
Layer 1
In Out
Grad
Layer 2
In Out
Grad
Layer L
In
Target
Out
Grad
𝐿𝑖
𝑔𝑖
(𝐿)
𝑔𝑖
(2)
𝑔𝑖
(3)
Forward pass = inference
Layer 1
In Out
Layer 2
In Out
Layer L
In Out
𝑥𝑖 𝑦𝑖
𝑎𝑖
(1)
𝑎𝑖
(2)
𝑎𝑖
(𝐿)
AI model
27
Overcoming challenges to efficiently adapt a neural net on a device
27
Backprop training
requirements
Large memory
DRAM
Training runtime
High precision
Support for quantized
inference
Adapt AI model
on the device
1. Reduce complexity
of backprop with
quantized training
2. Efficient models for
backpropagation
3. Adapt model
using inference
28
“In-Hindsight Quantization Range Estimation for
Quantized Training” (CVPR ECV Workshop 2021)
Challenge
Maintain accuracy and
reduce compute and memory
using quantized gradients
and activations
Solution
Quantization with
In-Hindsight Range
Estimation
NN quantization is very
effective for NN inference:
low energy with high
accuracy
Can we use quantization
in backpropagation to make
NN training more efficient?
Reduce
backprop
complexity
with quantized
training
Backward pass with gradient computation = “backpropagation”
Layer 1
In Out
Grad
Layer 2
In Out
Grad
Layer L
In
Target
Out
Grad
𝐿𝑖
𝑔𝑖
(𝐿)
𝑔𝑖
(2)
𝑔𝑖
(3)
29
“In-Hindsight Quantization Range Estimation for
Quantized Training” (CVPR ECV Workshop 2021)
Estimating range with
dynamic quantization
• Uses statistics from
the current feature
map to quantize it
• Requires writing the 32-bit
feature map to memory
before quantization
• Is expensive to implement
due to high memory transfers
Existing
quantized
training
techniques
are too
complex
MAC Array
Accumulator
Quantization
෩
GY
෩
𝑊
Memory
Statistics
෩
GX
GX
0
GX
M
GX
1
Dynamic
quantization
Static
Memory
Memory
𝑞min , 𝑞max
Gradient method Activation method ResNet18 Memory transfer
FP32 FP32 69.75 High
Current min-max Current min-max 69.21 +/- 0.06 High
Running min-max Running min-max 69.35 +/- 0.16 High
30
“In-Hindsight Quantization Range Estimation for
Quantized Training” (CVPR ECV Workshop 2021)
Use pre-computed
quantization parameters
to quantize current tensor
Extract statistics from current
tensor for quantization parameters
on next iteration
Much lower complexity
and data movement
In-Hindsight
Range Estimation
reduces quantize
training complexity
while maintaining
accuracy
Gradient method Activation method ResNet18 Memory transfer
FP32 FP32 69.75 High
Current min-max Current min-max 69.21 +/- 0.06 High
Running min-max Running min-max 69.35 +/- 0.16 High
In-hindsight min-max In-hindsight min-max 69.37 +/- 0.11 Low
Memory movement cost comparison between static and dynamic quantization
MAC Array
Accumulator
Quantization
෩
GY
෩
𝑊
Memory
𝑞min
𝑡+1
, 𝑞max
𝑡+1
෩
GX
GX
0
GX
M
GX
1
In-hindsight
Memory
Memory
𝑞min
𝑡
, 𝑞max
𝑡
79%
Reduction
in memory
transfer*
31
Typically, the
backpropagation
calculation
requires large
memory to
store activations
𝑊
𝜎
𝑥
𝑧
𝑦
𝜕ℓ
𝜕𝑧
=
𝜕ℓ
𝜕𝑦
⋅ 𝜎′
𝑧
For backpropagation,
activations need to be
stored at every layer
to calculate the gradient
of the loss function
Typical layer of a neural network
Linear weights
Non-linear
activation function
ℓ is the loss function
to be minimized
Input to layer
Output to next layer
32
Qualcomm AI Research internal results.
Using invertible
layers reduces
memory
requirements for
backpropagation
Typical layer of a neural network
𝑊
𝜎
𝑥
𝑧
𝑦
𝜕ℓ
𝜕𝑧
=
𝜕ℓ
𝜕𝑦
⋅ 𝜎′ 𝑧
For backpropagation,
activations need to be
stored at every layer
to calculate the gradient
of the loss function
Activations of each layer can be
reconstructed exactly from next layer
𝑧 can be computed
for each layer in
backpropagation,
so no need to
store activations
Invertible layer
𝑧 = 𝑦2𝑊
𝑊
𝑥1 𝑥2
𝑦1 𝑦2
𝜎
𝑧 𝑥1 = 𝑦2
𝑥2 = 𝑦1 − 𝐹 𝑦2
#params (M) / #MACs (B) Top 1 / Top 5 Activation mem. per image
MobileNet-V2 3.4 / 0.3 72.0 / 91.0 43 MB
Invertible network 3.24 / 0.3 72.5 / 90.7 3.7 MB
11x
Reduction in
activation
memory
33
Broad range
of research directions
for on-device learning
Test
data
Adapt
model
Privacy, security,
and robustness
Privacy guarantees, adversarial
attack, anomaly detection
Tackling statistical
heterogeneity
in data
Smartly combining
model updates from
a broad distribution
Personalization
and labeling
Meta learning, active learning,
learning with noisy labels
Optimizing
communication in FL
Compressing information
sent on the
uplink and downlink
On-device
training efficiency
Light-weight models, low
complexity training, quantization
Advanced
topologies for FL
Peer-to-peer, multi-cloud,
and hierarchical privacy
34
34
On-device learning is crucial
for providing intelligent,
personalized experiences
without sacrificing privacy
We are conducting leading
research and development
in on-device learning
We are solving system
and feasibility challenges
to move from research
to commercialization
34
35
www.qualcomm.com/ai
@QCOMResearch
www.qualcomm.com/news/onq
https://www.youtube.com/qualcomm?
http://www.slideshare.net/qualcommwirelessevolution
Connect with Us
Questions?
Follow us on:
For more information, visit us at:
www.qualcomm.com & www.qualcomm.com/blog
Thank you
Nothing in these materials is an offer to sell any of the
components or devices referenced herein.
©2018-2021 Qualcomm Technologies, Inc. and/or its
affiliated companies. All Rights Reserved.
Qualcomm and Snapdragon are trademarks or registered
trademark of Qualcomm Incorporated. Other products and
brand names may be trademarks or registered trademarks
of their respective owners.
References in this presentation to “Qualcomm” may mean Qualcomm
Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries
or business units within the Qualcomm corporate structure, as
applicable. Qualcomm Incorporated includes our licensing business,
QTL, and the vast majority of our patent portfolio. Qualcomm
Technologies, Inc., a subsidiary of Qualcomm Incorporated, operates,
along with its subsidiaries, substantially all of our engineering,
research and development functions, and substantially all of our
products and services businesses, including our QCT semiconductor
business.

Más contenido relacionado

La actualidad más candente

Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Artificial Intelligence Introduction & Business usecases
Artificial Intelligence Introduction & Business usecasesArtificial Intelligence Introduction & Business usecases
Artificial Intelligence Introduction & Business usecasesVikas Jain
 
Cassandra's Odyssey @ Netflix
Cassandra's Odyssey @ NetflixCassandra's Odyssey @ Netflix
Cassandra's Odyssey @ NetflixRoopa Tangirala
 
Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AICMassociates
 
AI Chip Trends and Forecast
AI Chip Trends and ForecastAI Chip Trends and Forecast
AI Chip Trends and ForecastCastLabKAIST
 
Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep LearningOswald Campesato
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trendsAlan Morrison
 
Deep neural networks
Deep neural networksDeep neural networks
Deep neural networksSi Haem
 
Automotive Technology Vision 2019
Automotive Technology Vision 2019Automotive Technology Vision 2019
Automotive Technology Vision 2019accenture
 
Eni: fourth quarter and full year 2023 results
Eni: fourth quarter and full year 2023 resultsEni: fourth quarter and full year 2023 results
Eni: fourth quarter and full year 2023 resultsEni
 
Privacy preserving machine learning
Privacy preserving machine learningPrivacy preserving machine learning
Privacy preserving machine learningMichał Kuźba
 
Notes from Coursera Deep Learning courses by Andrew Ng
Notes from Coursera Deep Learning courses by Andrew NgNotes from Coursera Deep Learning courses by Andrew Ng
Notes from Coursera Deep Learning courses by Andrew NgdataHacker. rs
 
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...Edge AI and Vision Alliance
 
Fine tune and deploy Hugging Face NLP models
Fine tune and deploy Hugging Face NLP modelsFine tune and deploy Hugging Face NLP models
Fine tune and deploy Hugging Face NLP modelsOVHcloud
 
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...Edge AI and Vision Alliance
 
SMART HEALTH AND Internet of Things (IoT) - RESEARCH Opportunities
SMART HEALTH AND Internet of Things (IoT) 	-  RESEARCH  OpportunitiesSMART HEALTH AND Internet of Things (IoT) 	-  RESEARCH  Opportunities
SMART HEALTH AND Internet of Things (IoT) - RESEARCH Opportunities Tauseef Naquishbandi
 

La actualidad más candente (20)

Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Artificial Intelligence Introduction & Business usecases
Artificial Intelligence Introduction & Business usecasesArtificial Intelligence Introduction & Business usecases
Artificial Intelligence Introduction & Business usecases
 
Cassandra's Odyssey @ Netflix
Cassandra's Odyssey @ NetflixCassandra's Odyssey @ Netflix
Cassandra's Odyssey @ Netflix
 
Big Data and Advanced Analytics
Big Data and Advanced AnalyticsBig Data and Advanced Analytics
Big Data and Advanced Analytics
 
Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AI
 
AI Chip Trends and Forecast
AI Chip Trends and ForecastAI Chip Trends and Forecast
AI Chip Trends and Forecast
 
FP7 iCore project presentation
FP7 iCore project presentation FP7 iCore project presentation
FP7 iCore project presentation
 
Marv Wexler - Transform Your with AI.pdf
Marv Wexler - Transform Your with AI.pdfMarv Wexler - Transform Your with AI.pdf
Marv Wexler - Transform Your with AI.pdf
 
Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep Learning
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trends
 
Deep neural networks
Deep neural networksDeep neural networks
Deep neural networks
 
Automotive Technology Vision 2019
Automotive Technology Vision 2019Automotive Technology Vision 2019
Automotive Technology Vision 2019
 
Eni: fourth quarter and full year 2023 results
Eni: fourth quarter and full year 2023 resultsEni: fourth quarter and full year 2023 results
Eni: fourth quarter and full year 2023 results
 
Privacy preserving machine learning
Privacy preserving machine learningPrivacy preserving machine learning
Privacy preserving machine learning
 
Is AI generation the next platform shift?
Is AI generation the next platform shift?Is AI generation the next platform shift?
Is AI generation the next platform shift?
 
Notes from Coursera Deep Learning courses by Andrew Ng
Notes from Coursera Deep Learning courses by Andrew NgNotes from Coursera Deep Learning courses by Andrew Ng
Notes from Coursera Deep Learning courses by Andrew Ng
 
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...
“Accelerating Newer ML Models Using the Qualcomm AI Stack,” a Presentation fr...
 
Fine tune and deploy Hugging Face NLP models
Fine tune and deploy Hugging Face NLP modelsFine tune and deploy Hugging Face NLP models
Fine tune and deploy Hugging Face NLP models
 
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
 
SMART HEALTH AND Internet of Things (IoT) - RESEARCH Opportunities
SMART HEALTH AND Internet of Things (IoT) 	-  RESEARCH  OpportunitiesSMART HEALTH AND Internet of Things (IoT) 	-  RESEARCH  Opportunities
SMART HEALTH AND Internet of Things (IoT) - RESEARCH Opportunities
 

Similar a Enabling on-device learning at scale

AI firsts: Leading from research to proof-of-concept
AI firsts: Leading from research to proof-of-conceptAI firsts: Leading from research to proof-of-concept
AI firsts: Leading from research to proof-of-conceptQualcomm Research
 
Towards Effective Device-Aware Federated Learning
Towards Effective Device-Aware Federated LearningTowards Effective Device-Aware Federated Learning
Towards Effective Device-Aware Federated LearningAntonio Ferrara
 
Federated learning of deep networks using model averaging
Federated learning of deep networks using model averagingFederated learning of deep networks using model averaging
Federated learning of deep networks using model averagingNguyễn Nhương
 
Multi Layer Federated Learning.pptx
Multi Layer Federated Learning.pptxMulti Layer Federated Learning.pptx
Multi Layer Federated Learning.pptxTimePass43152
 
leewayhertz.com-Federated learning Unlocking the potential of secure distribu...
leewayhertz.com-Federated learning Unlocking the potential of secure distribu...leewayhertz.com-Federated learning Unlocking the potential of secure distribu...
leewayhertz.com-Federated learning Unlocking the potential of secure distribu...KristiLBurns
 
A Web-based Attendance System Using Face Recognition
A Web-based Attendance System Using Face RecognitionA Web-based Attendance System Using Face Recognition
A Web-based Attendance System Using Face RecognitionIRJET Journal
 
Machine_Learning_with_MATLAB_Seminar_Latest.pdf
Machine_Learning_with_MATLAB_Seminar_Latest.pdfMachine_Learning_with_MATLAB_Seminar_Latest.pdf
Machine_Learning_with_MATLAB_Seminar_Latest.pdfCarlos Paredes
 
Faster deep learning solutions from training to inference - Amitai Armon & Ni...
Faster deep learning solutions from training to inference - Amitai Armon & Ni...Faster deep learning solutions from training to inference - Amitai Armon & Ni...
Faster deep learning solutions from training to inference - Amitai Armon & Ni...Codemotion Tel Aviv
 
“Powering the Connected Intelligent Edge and the Future of On-Device AI,” a P...
“Powering the Connected Intelligent Edge and the Future of On-Device AI,” a P...“Powering the Connected Intelligent Edge and the Future of On-Device AI,” a P...
“Powering the Connected Intelligent Edge and the Future of On-Device AI,” a P...Edge AI and Vision Alliance
 
Machine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWERMachine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWERGanesan Narayanasamy
 
A Fresh, New Look for CMD+CTRL Cyber Range
A Fresh, New Look for CMD+CTRL Cyber RangeA Fresh, New Look for CMD+CTRL Cyber Range
A Fresh, New Look for CMD+CTRL Cyber RangeSecurity Innovation
 
Managing Data Science Projects
Managing Data Science ProjectsManaging Data Science Projects
Managing Data Science ProjectsDanielle Dean
 
Federated learning in brief
Federated learning in briefFederated learning in brief
Federated learning in briefShashi Perera
 
Iaetsd future polling system using cloud computing in support with smart clie...
Iaetsd future polling system using cloud computing in support with smart clie...Iaetsd future polling system using cloud computing in support with smart clie...
Iaetsd future polling system using cloud computing in support with smart clie...Iaetsd Iaetsd
 
Building AI with Security and Privacy in mind
Building AI with Security and Privacy in mindBuilding AI with Security and Privacy in mind
Building AI with Security and Privacy in mindgeetachauhan
 
Building AI with Security Privacy in Mind
Building AI with Security Privacy in MindBuilding AI with Security Privacy in Mind
Building AI with Security Privacy in Mindgeetachauhan
 
Donald De Witte | Cisco Sistems - Conferencia Internet of Things Smash Tech B...
Donald De Witte | Cisco Sistems - Conferencia Internet of Things Smash Tech B...Donald De Witte | Cisco Sistems - Conferencia Internet of Things Smash Tech B...
Donald De Witte | Cisco Sistems - Conferencia Internet of Things Smash Tech B...Smash Tech
 

Similar a Enabling on-device learning at scale (20)

AI firsts: Leading from research to proof-of-concept
AI firsts: Leading from research to proof-of-conceptAI firsts: Leading from research to proof-of-concept
AI firsts: Leading from research to proof-of-concept
 
Chapter 1
Chapter 1Chapter 1
Chapter 1
 
Towards Effective Device-Aware Federated Learning
Towards Effective Device-Aware Federated LearningTowards Effective Device-Aware Federated Learning
Towards Effective Device-Aware Federated Learning
 
Federated learning of deep networks using model averaging
Federated learning of deep networks using model averagingFederated learning of deep networks using model averaging
Federated learning of deep networks using model averaging
 
Multi Layer Federated Learning.pptx
Multi Layer Federated Learning.pptxMulti Layer Federated Learning.pptx
Multi Layer Federated Learning.pptx
 
leewayhertz.com-Federated learning Unlocking the potential of secure distribu...
leewayhertz.com-Federated learning Unlocking the potential of secure distribu...leewayhertz.com-Federated learning Unlocking the potential of secure distribu...
leewayhertz.com-Federated learning Unlocking the potential of secure distribu...
 
A Web-based Attendance System Using Face Recognition
A Web-based Attendance System Using Face RecognitionA Web-based Attendance System Using Face Recognition
A Web-based Attendance System Using Face Recognition
 
Machine_Learning_with_MATLAB_Seminar_Latest.pdf
Machine_Learning_with_MATLAB_Seminar_Latest.pdfMachine_Learning_with_MATLAB_Seminar_Latest.pdf
Machine_Learning_with_MATLAB_Seminar_Latest.pdf
 
Project based learning methodologies for Embedded Systems and Intelligent Sys...
Project based learning methodologies for Embedded Systems and Intelligent Sys...Project based learning methodologies for Embedded Systems and Intelligent Sys...
Project based learning methodologies for Embedded Systems and Intelligent Sys...
 
Faster deep learning solutions from training to inference - Amitai Armon & Ni...
Faster deep learning solutions from training to inference - Amitai Armon & Ni...Faster deep learning solutions from training to inference - Amitai Armon & Ni...
Faster deep learning solutions from training to inference - Amitai Armon & Ni...
 
“Powering the Connected Intelligent Edge and the Future of On-Device AI,” a P...
“Powering the Connected Intelligent Edge and the Future of On-Device AI,” a P...“Powering the Connected Intelligent Edge and the Future of On-Device AI,” a P...
“Powering the Connected Intelligent Edge and the Future of On-Device AI,” a P...
 
Machine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWERMachine Learning AND Deep Learning for OpenPOWER
Machine Learning AND Deep Learning for OpenPOWER
 
A Fresh, New Look for CMD+CTRL Cyber Range
A Fresh, New Look for CMD+CTRL Cyber RangeA Fresh, New Look for CMD+CTRL Cyber Range
A Fresh, New Look for CMD+CTRL Cyber Range
 
Managing Data Science Projects
Managing Data Science ProjectsManaging Data Science Projects
Managing Data Science Projects
 
Federated learning in brief
Federated learning in briefFederated learning in brief
Federated learning in brief
 
Iaetsd future polling system using cloud computing in support with smart clie...
Iaetsd future polling system using cloud computing in support with smart clie...Iaetsd future polling system using cloud computing in support with smart clie...
Iaetsd future polling system using cloud computing in support with smart clie...
 
Machine learning
Machine learningMachine learning
Machine learning
 
Building AI with Security and Privacy in mind
Building AI with Security and Privacy in mindBuilding AI with Security and Privacy in mind
Building AI with Security and Privacy in mind
 
Building AI with Security Privacy in Mind
Building AI with Security Privacy in MindBuilding AI with Security Privacy in Mind
Building AI with Security Privacy in Mind
 
Donald De Witte | Cisco Sistems - Conferencia Internet of Things Smash Tech B...
Donald De Witte | Cisco Sistems - Conferencia Internet of Things Smash Tech B...Donald De Witte | Cisco Sistems - Conferencia Internet of Things Smash Tech B...
Donald De Witte | Cisco Sistems - Conferencia Internet of Things Smash Tech B...
 

Más de Qualcomm Research

Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdfQualcomm Research
 
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AI
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AIQualcomm Webinar: Solving Unsolvable Combinatorial Problems with AI
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AIQualcomm Research
 
Why and what you need to know about 6G in 2022
Why and what you need to know about 6G in 2022Why and what you need to know about 6G in 2022
Why and what you need to know about 6G in 2022Qualcomm Research
 
Understanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdfUnderstanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdfQualcomm Research
 
Enabling the metaverse with 5G- web.pdf
Enabling the metaverse with 5G- web.pdfEnabling the metaverse with 5G- web.pdf
Enabling the metaverse with 5G- web.pdfQualcomm Research
 
Bringing AI research to wireless communication and sensing
Bringing AI research to wireless communication and sensingBringing AI research to wireless communication and sensing
Bringing AI research to wireless communication and sensingQualcomm Research
 
How will sidelink bring a new level of 5G versatility.pdf
How will sidelink bring a new level of 5G versatility.pdfHow will sidelink bring a new level of 5G versatility.pdf
How will sidelink bring a new level of 5G versatility.pdfQualcomm Research
 
Scaling 5G to new frontiers with NR-Light (RedCap)
Scaling 5G to new frontiers with NR-Light (RedCap)Scaling 5G to new frontiers with NR-Light (RedCap)
Scaling 5G to new frontiers with NR-Light (RedCap)Qualcomm Research
 
Realizing mission-critical industrial automation with 5G
Realizing mission-critical industrial automation with 5GRealizing mission-critical industrial automation with 5G
Realizing mission-critical industrial automation with 5GQualcomm Research
 
3GPP Release 17: Completing the first phase of 5G evolution
3GPP Release 17: Completing the first phase of 5G evolution3GPP Release 17: Completing the first phase of 5G evolution
3GPP Release 17: Completing the first phase of 5G evolutionQualcomm Research
 
Setting off the 5G Advanced evolution with 3GPP Release 18
Setting off the 5G Advanced evolution with 3GPP Release 18Setting off the 5G Advanced evolution with 3GPP Release 18
Setting off the 5G Advanced evolution with 3GPP Release 18Qualcomm Research
 
5G positioning for the connected intelligent edge
5G positioning for the connected intelligent edge5G positioning for the connected intelligent edge
5G positioning for the connected intelligent edgeQualcomm Research
 
The essential role of AI in the 5G future
The essential role of AI in the 5G futureThe essential role of AI in the 5G future
The essential role of AI in the 5G futureQualcomm Research
 
How AI research is enabling next-gen codecs
How AI research is enabling next-gen codecsHow AI research is enabling next-gen codecs
How AI research is enabling next-gen codecsQualcomm Research
 
Role of localization and environment perception in autonomous driving
Role of localization and environment perception in autonomous drivingRole of localization and environment perception in autonomous driving
Role of localization and environment perception in autonomous drivingQualcomm Research
 
How to build high performance 5G networks with vRAN and O-RAN
How to build high performance 5G networks with vRAN and O-RANHow to build high performance 5G networks with vRAN and O-RAN
How to build high performance 5G networks with vRAN and O-RANQualcomm Research
 
What's in the future of 5G millimeter wave?
What's in the future of 5G millimeter wave? What's in the future of 5G millimeter wave?
What's in the future of 5G millimeter wave? Qualcomm Research
 
Efficient video perception through AI
Efficient video perception through AIEfficient video perception through AI
Efficient video perception through AIQualcomm Research
 
Enabling the rise of the smartphone: Chronicling the developmental history at...
Enabling the rise of the smartphone: Chronicling the developmental history at...Enabling the rise of the smartphone: Chronicling the developmental history at...
Enabling the rise of the smartphone: Chronicling the developmental history at...Qualcomm Research
 

Más de Qualcomm Research (20)

Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdf
 
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AI
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AIQualcomm Webinar: Solving Unsolvable Combinatorial Problems with AI
Qualcomm Webinar: Solving Unsolvable Combinatorial Problems with AI
 
Why and what you need to know about 6G in 2022
Why and what you need to know about 6G in 2022Why and what you need to know about 6G in 2022
Why and what you need to know about 6G in 2022
 
Understanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdfUnderstanding the world in 3D with AI.pdf
Understanding the world in 3D with AI.pdf
 
Enabling the metaverse with 5G- web.pdf
Enabling the metaverse with 5G- web.pdfEnabling the metaverse with 5G- web.pdf
Enabling the metaverse with 5G- web.pdf
 
Bringing AI research to wireless communication and sensing
Bringing AI research to wireless communication and sensingBringing AI research to wireless communication and sensing
Bringing AI research to wireless communication and sensing
 
How will sidelink bring a new level of 5G versatility.pdf
How will sidelink bring a new level of 5G versatility.pdfHow will sidelink bring a new level of 5G versatility.pdf
How will sidelink bring a new level of 5G versatility.pdf
 
Scaling 5G to new frontiers with NR-Light (RedCap)
Scaling 5G to new frontiers with NR-Light (RedCap)Scaling 5G to new frontiers with NR-Light (RedCap)
Scaling 5G to new frontiers with NR-Light (RedCap)
 
Realizing mission-critical industrial automation with 5G
Realizing mission-critical industrial automation with 5GRealizing mission-critical industrial automation with 5G
Realizing mission-critical industrial automation with 5G
 
3GPP Release 17: Completing the first phase of 5G evolution
3GPP Release 17: Completing the first phase of 5G evolution3GPP Release 17: Completing the first phase of 5G evolution
3GPP Release 17: Completing the first phase of 5G evolution
 
Setting off the 5G Advanced evolution with 3GPP Release 18
Setting off the 5G Advanced evolution with 3GPP Release 18Setting off the 5G Advanced evolution with 3GPP Release 18
Setting off the 5G Advanced evolution with 3GPP Release 18
 
5G positioning for the connected intelligent edge
5G positioning for the connected intelligent edge5G positioning for the connected intelligent edge
5G positioning for the connected intelligent edge
 
The essential role of AI in the 5G future
The essential role of AI in the 5G futureThe essential role of AI in the 5G future
The essential role of AI in the 5G future
 
How AI research is enabling next-gen codecs
How AI research is enabling next-gen codecsHow AI research is enabling next-gen codecs
How AI research is enabling next-gen codecs
 
Role of localization and environment perception in autonomous driving
Role of localization and environment perception in autonomous drivingRole of localization and environment perception in autonomous driving
Role of localization and environment perception in autonomous driving
 
Pioneering 5G broadcast
Pioneering 5G broadcastPioneering 5G broadcast
Pioneering 5G broadcast
 
How to build high performance 5G networks with vRAN and O-RAN
How to build high performance 5G networks with vRAN and O-RANHow to build high performance 5G networks with vRAN and O-RAN
How to build high performance 5G networks with vRAN and O-RAN
 
What's in the future of 5G millimeter wave?
What's in the future of 5G millimeter wave? What's in the future of 5G millimeter wave?
What's in the future of 5G millimeter wave?
 
Efficient video perception through AI
Efficient video perception through AIEfficient video perception through AI
Efficient video perception through AI
 
Enabling the rise of the smartphone: Chronicling the developmental history at...
Enabling the rise of the smartphone: Chronicling the developmental history at...Enabling the rise of the smartphone: Chronicling the developmental history at...
Enabling the rise of the smartphone: Chronicling the developmental history at...
 

Último

CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 

Último (20)

CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 

Enabling on-device learning at scale

  • 1. October 28, 2021 @QCOMResearch Qualcomm Technologies, Inc. Enabling on-device learning at scale
  • 2. 2 Today’s agenda • What is on-device learning and why is it crucial for scaling intelligence? • Our latest on-device learning research and results • Conclusions and future directions • Questions?
  • 3. 3 3 Video monitoring Extended reality Smart cities Smart factories Autonomous vehicles Video conferencing Smart homes Smartphone The need for intelligent, personalized experiences powered by AI is ever-growing How do we maintain privacy and deal with all the data from edge devices?
  • 4. 4 On-device intelligence is paramount Process data closest to the source, complement the cloud Privacy Reliability Low latency Efficient use of network bandwidth
  • 5. 5 Public network Private networks Transformation of the Connected Intelligent Edge has begun at scale Edge cloud Cloud On-device Past Cloud-centric AI AI training and inference in the central cloud Future Fully-distributed AI With lifelong on- device learning Today Partially-distributed AI Power-efficient on-device AI inference Processing data closer to devices at the edge derives new system values (e.g., lower latency, enhanced privacy)
  • 6. 6 Object classification Voice activation Text recognition Gesture/ hand tracking Computational photography Contextual awareness Face detection On-device security Voice recognition Fingerprint Local network analytics Low-latency interactive content Boundless XR On-demand computing Industrial automation and control Enterprise data + Edge cloud AI On-device AI Connected Intelligent Edge brings new and enhanced services
  • 7. 7 Data and labels Training Offline training A model is trained in the cloud with data reflecting the target application On-device learning Modifying model after deployment based on the test environment Deploy What is on-device learning? Test data Adapt model Inference
  • 8. 8 Data and labels Training With offline training, the test data can differ from training data (domain shift, distribution shift, anomalies) and may even change continuously Test data On-device learning can help to improve and maintain accuracy when original pre-trained model cannot generalize well Adapt model Inference Deploy On-device learning offers several benefits • Continuous learning • Personalization • Data privacy • Scale
  • 9. 9 Overcoming challenges to achieve on-device ML benefits Important considerations for on-device learning to achieve benefits for different use cases Challenges Benefits • Better examples than training dataset • Ability to run with smaller models that adapt to the target data • Preservation of privacy during model development • Local data can be limited, e.g., noisy labels and class imbalance • Overfitting or catastrophic forgetting • Limited compute, storage, and/or power • Adversarial attacks to training • Federated learning communication overhead Test data Adapt model 9 On-device learning
  • 10. 10 Our AI research areas address the key deployment challenges of on-device learning 10 How to adapt the model to a few labeled samples Few-shot learning How to use unlabeled data to do unsupervised learning Continuous learning with unlabeled data How to implement federate learning at scale and address deployment challenges Federated learning for global adaptation Low-complexity on-device learning How to implement on-device learning to improve efficiency
  • 11. 11 11 Improve the target user’s model using the initial collected data, such as enrollment Learning from limited labeled data is crucial Few-shot learning Offline learning 11 Deploy global model On-device learning Target user labeled data (e.g., from enrollment) Limited samples with user-assisted labeling May be from cleaner environment Cloud data and training Enroll
  • 12. 12 12 Qualcomm AI Research internal results . Few-shot learning for increased personalization Improving keyword spotting (KWS) performance of outlier users through on-device learning Detection rate for outlier users is over 30% worse, on average Identify when a keyword is spoken using always-on ML Keyword spotting • In practice, it is hard to collect all types of accented utterance • The KWS model may not be sensitive to users’ accents and have poor performance for outliers Keyword spotting challenge • Locally adapt the model to user enrollments • Personalize the model at enrollment time Keyword spotting solution
  • 13. 13 Local adaptation • Collect enrollment data from target user • Adapt the global model on local data Personalized Model Local training Global model Keyword Non-keyword Dataset from local enrollment Device How to locally adapt keyword spotting for personalization Server Train a global KWS model • Global train/validation dataset Global training dataset Global model Validation Train Global validation dataset
  • 14. 14 Qualcomm AI Research internal results. For the few-shot results, performance is the average of all the locally adapted models. Personalization improvements across the board but particularly for outliers Few-shot learning for KWS improves performance Average detection rate improvement Few-shot vs baseline model 0% 5% 10% 15% 20% 25% 30% Inlier Outlier Percentage improvement (%) 30% Improvement in detection rate* Up to
  • 15. 15 15 Improve the target user’s model based on data from continuous operation, often unlabeled data Leveraging user data throughout deployment Continuous learning Offline learning Deploy global model On-device learning Target user data through continuous operation Plentiful samples of unlabeled user data Can be from various environment: clean, noisy Cloud data and training Enroll Day1, 2, …
  • 16. 16 Solving the challenges for continuous learning Employ pseudo labeling and regularization to reduce impact from forgetting Challenge Training data are collected on the device without labels Unlabeled collected data Solution Assign pseudo labels to training data through the verification process Overfitting to small data Challenge Number of collected data is small Solution Exploit regularization loss that maintains some metrics from pre-trained model
  • 17. 17 17 Global adaptation Local adaptation Federated learning brings on-device learning to new level Adaptation on the device, once or continuously, locally and/or globally for continuous model enhancement Federated learning for global adaptation 17 Offline learning Data On-device learning Locally adapt once to a few samples (e.g., few shot learning) or continuously (e.g., unsupervised learning) Adapt model based on local data Offline training prior to deployment Federated learning Aggregate model updates across multiple users to globally improve model from more diverse data
  • 18. 18 18 Each device collects personal data and performs on-device training Federated learning over 5G is the way to scale intelligence Cloud A state-of-art global AI model is distributed to devices D1 D2 DN 2 Scale Personalization Privacy Processing is spread over many devices Model customized based on your personal data Raw data stays on the device 1
  • 19. 19 19 Federated learning over 5G is the way to scale intelligence 1. Adds perturbation to the parameters of the AI model Cloud The cloud updates the AI model and then sends it to the devices again 4 D1 D2 DN 3 Each device: 2. Compresses the parameters 3. Encrypts them 4. Sends the update to the cloud Scale Privacy Network bandwidth is conserved Only noisy and encrypted weights sent to the cloud
  • 20. 20 20 “Federated Learning of User verification Models Without Sharing Embeddings” (ICML 2021) Federated learning can be a powerful tool for user verification User verification The authentication problem needs big data to get a powerful verification model E.g., typical speaker verification system needs data from more than 600k different speakers Challenge How can we learn this model while keeping all data private? We do not want to compromise the sensitive biometric data of training participants Personal data available for authentication Deep learning approach for authentication Face Fingerprint Voice Medical Data Client 𝐾 resource 𝑥 𝑔(𝑥) 𝑧 Data User score 𝑔 𝑊 User A User B User C Embedding vector W Output 𝑔 𝑥
  • 21. 21 “Federated Learning of User verification Models Without Sharing Embeddings” (ICML 2021) User embeddings need to be shared for training Traditional design of neural networks for user verification do not preserve privacy 𝑥 𝑔(𝑥) 𝑧 Data User score 𝑔 𝑊 User A User B User C Embedding vector W Output 𝑔 𝑥 For user verification, neural network 𝒈 𝒙 should be trained to: Minimize the loss to the target user A smaller loss means a higher user score Maximize the loss to the other users In traditional (one-hot) approaches, users share embeddings to calculate this loss (not private)
  • 22. 22 “Federated Learning of User verification Models Without Sharing Embeddings” (ICML 2021) Generate user embeddings using error-correcting codes (ECC) We enable federated learning for user verification without users sharing their embeddings 𝑥 𝑔(𝑥) 𝑧 Data User score 𝑔 𝑊 User A User B User C Our method (FedUV) accomplishes this by using embeddings that are codewords of error correcting codes (ECC) and optimizes network 𝒈 𝒙 using only positive loss function Each user minimizes their own loss ECC ensures user embeddings are maximally spaced to reduce score to other users Embedding vector W Output 𝑔 𝑥
  • 23. 23 “Federated Learning of User verification Models Without Sharing Embeddings” (ICML 2021) FedUV achieves state-of-the-art verification performance without users sharing their embeddings FedUV (127) FedUV (255) FedUV (511) FedAws Softmax 1.0 0.8 0.6 0.4 0.2 0.00 0.02 0.04 0.06 0.08 0.10 False positive rate True positive rate FedUV is better than existing method, which does not share user embeddings (FedAWS) FedUV is comparable to the best method, which shares user embeddings (softmax)
  • 24. 24 24 RPC: remote procedure call. Snapdragon is a product of Qualcomm Technologies, Inc. and/or its subsidiaries. FL framework for research and application development on mobile FL demo of speaker verification • Enrollment from 1000 clients • Leverage PyTorch model & training pipeline from research framework Samsung Galaxy S21 device powered by Snapdragon® 888 Platform Python control ML experts Coordinator server Mobile devices Android app Pipe Worker host gRPC Torch host LibTorch C++ Android app Pipe Worker host gRPC Torch host LibTorch C++ Android app Pipe Worker host gRPC Torch host LibTorch C++ Android app Pipe Worker host gRPC Torch host LibTorch C++ Controller gRPC Worker manager PyTorch / TensorFlow code FL trainer gRPC TCP/IP network TCP/IP network Worker
  • 26. 26 Learning with backprop is computationally demanding Updating the model weights using backprop can be expensive, especially on power-constrained devices Data and labels Training Backward pass with gradient computation = “backpropagation” Layer 1 In Out Grad Layer 2 In Out Grad Layer L In Target Out Grad 𝐿𝑖 𝑔𝑖 (𝐿) 𝑔𝑖 (2) 𝑔𝑖 (3) Forward pass = inference Layer 1 In Out Layer 2 In Out Layer L In Out 𝑥𝑖 𝑦𝑖 𝑎𝑖 (1) 𝑎𝑖 (2) 𝑎𝑖 (𝐿) AI model
  • 27. 27 Overcoming challenges to efficiently adapt a neural net on a device 27 Backprop training requirements Large memory DRAM Training runtime High precision Support for quantized inference Adapt AI model on the device 1. Reduce complexity of backprop with quantized training 2. Efficient models for backpropagation 3. Adapt model using inference
  • 28. 28 “In-Hindsight Quantization Range Estimation for Quantized Training” (CVPR ECV Workshop 2021) Challenge Maintain accuracy and reduce compute and memory using quantized gradients and activations Solution Quantization with In-Hindsight Range Estimation NN quantization is very effective for NN inference: low energy with high accuracy Can we use quantization in backpropagation to make NN training more efficient? Reduce backprop complexity with quantized training Backward pass with gradient computation = “backpropagation” Layer 1 In Out Grad Layer 2 In Out Grad Layer L In Target Out Grad 𝐿𝑖 𝑔𝑖 (𝐿) 𝑔𝑖 (2) 𝑔𝑖 (3)
  • 29. 29 “In-Hindsight Quantization Range Estimation for Quantized Training” (CVPR ECV Workshop 2021) Estimating range with dynamic quantization • Uses statistics from the current feature map to quantize it • Requires writing the 32-bit feature map to memory before quantization • Is expensive to implement due to high memory transfers Existing quantized training techniques are too complex MAC Array Accumulator Quantization ෩ GY ෩ 𝑊 Memory Statistics ෩ GX GX 0 GX M GX 1 Dynamic quantization Static Memory Memory 𝑞min , 𝑞max Gradient method Activation method ResNet18 Memory transfer FP32 FP32 69.75 High Current min-max Current min-max 69.21 +/- 0.06 High Running min-max Running min-max 69.35 +/- 0.16 High
  • 30. 30 “In-Hindsight Quantization Range Estimation for Quantized Training” (CVPR ECV Workshop 2021) Use pre-computed quantization parameters to quantize current tensor Extract statistics from current tensor for quantization parameters on next iteration Much lower complexity and data movement In-Hindsight Range Estimation reduces quantize training complexity while maintaining accuracy Gradient method Activation method ResNet18 Memory transfer FP32 FP32 69.75 High Current min-max Current min-max 69.21 +/- 0.06 High Running min-max Running min-max 69.35 +/- 0.16 High In-hindsight min-max In-hindsight min-max 69.37 +/- 0.11 Low Memory movement cost comparison between static and dynamic quantization MAC Array Accumulator Quantization ෩ GY ෩ 𝑊 Memory 𝑞min 𝑡+1 , 𝑞max 𝑡+1 ෩ GX GX 0 GX M GX 1 In-hindsight Memory Memory 𝑞min 𝑡 , 𝑞max 𝑡 79% Reduction in memory transfer*
  • 31. 31 Typically, the backpropagation calculation requires large memory to store activations 𝑊 𝜎 𝑥 𝑧 𝑦 𝜕ℓ 𝜕𝑧 = 𝜕ℓ 𝜕𝑦 ⋅ 𝜎′ 𝑧 For backpropagation, activations need to be stored at every layer to calculate the gradient of the loss function Typical layer of a neural network Linear weights Non-linear activation function ℓ is the loss function to be minimized Input to layer Output to next layer
  • 32. 32 Qualcomm AI Research internal results. Using invertible layers reduces memory requirements for backpropagation Typical layer of a neural network 𝑊 𝜎 𝑥 𝑧 𝑦 𝜕ℓ 𝜕𝑧 = 𝜕ℓ 𝜕𝑦 ⋅ 𝜎′ 𝑧 For backpropagation, activations need to be stored at every layer to calculate the gradient of the loss function Activations of each layer can be reconstructed exactly from next layer 𝑧 can be computed for each layer in backpropagation, so no need to store activations Invertible layer 𝑧 = 𝑦2𝑊 𝑊 𝑥1 𝑥2 𝑦1 𝑦2 𝜎 𝑧 𝑥1 = 𝑦2 𝑥2 = 𝑦1 − 𝐹 𝑦2 #params (M) / #MACs (B) Top 1 / Top 5 Activation mem. per image MobileNet-V2 3.4 / 0.3 72.0 / 91.0 43 MB Invertible network 3.24 / 0.3 72.5 / 90.7 3.7 MB 11x Reduction in activation memory
  • 33. 33 Broad range of research directions for on-device learning Test data Adapt model Privacy, security, and robustness Privacy guarantees, adversarial attack, anomaly detection Tackling statistical heterogeneity in data Smartly combining model updates from a broad distribution Personalization and labeling Meta learning, active learning, learning with noisy labels Optimizing communication in FL Compressing information sent on the uplink and downlink On-device training efficiency Light-weight models, low complexity training, quantization Advanced topologies for FL Peer-to-peer, multi-cloud, and hierarchical privacy
  • 34. 34 34 On-device learning is crucial for providing intelligent, personalized experiences without sacrificing privacy We are conducting leading research and development in on-device learning We are solving system and feasibility challenges to move from research to commercialization 34
  • 36. Follow us on: For more information, visit us at: www.qualcomm.com & www.qualcomm.com/blog Thank you Nothing in these materials is an offer to sell any of the components or devices referenced herein. ©2018-2021 Qualcomm Technologies, Inc. and/or its affiliated companies. All Rights Reserved. Qualcomm and Snapdragon are trademarks or registered trademark of Qualcomm Incorporated. Other products and brand names may be trademarks or registered trademarks of their respective owners. References in this presentation to “Qualcomm” may mean Qualcomm Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries or business units within the Qualcomm corporate structure, as applicable. Qualcomm Incorporated includes our licensing business, QTL, and the vast majority of our patent portfolio. Qualcomm Technologies, Inc., a subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of our engineering, research and development functions, and substantially all of our products and services businesses, including our QCT semiconductor business.