DSDT Meetup April 2021

Data Science | Design | Technology
https://www.meetup.com/DSDTMTL
April 28, 2021

Please don't forget to mute yourself.

JL Maréchaux, DSDT Co-Organizer (Google Montreal)
Simon Dagenais, Lead Data Scientist, Snitch AI

Agenda
3:45 - 4:00 Arrival & Networking
4:00 - 4:15 News & Intro
4:15 - 5:15 How to QA your ML models
5:15 - 5:30 Virtual Snack & Networking
DSDT Meetup - April 28, 2021

A special thanks to our contributors…
Thanks / Merci
The (virtual) venue sponsor & snacks
The brains

Virtual Meetups
Until we can do in-person events again in Montreal…
Past (and future) presentations available on Slideshare:
http://www.slideshare.net/DSDT_MTL

Survey: http://bit.ly/DSDTsurvey2021
Which topics should be considered for 2021 meetups? (Select all that apply.)

What is coming in 2021
Monthly cadence, on Wednesdays.
Incredible sessions already planned for May, June and July.
Contact us with your expectations & ideas.
Dates: April 28, May 26, June 16, July 21
Topics: ML Validation, Reinforcement Learning, Explainable AI, RNN & Time Series
Your ideas, your meetup.
http://bit.ly/DSDTsurvey2021

May 26, 4:00 pm - 5:30 pm
"Autonomous navigation of stratospheric balloons using reinforcement learning"
Google Brain
Based on a paper published in Nature in December 2020.

“It's time to start a new collaboration and give back to the community. Our donations will help fight against poverty and social exclusion. Let's build a stronger Greater Montreal together.”
Data Science. Design. Technology.
More information soon…

How to QA your ML models
Simon Dagenais

The genesis of an AI system

The failure of an AI system

The end of an AI system
How could we have prevented that?
● The model’s performance would not degrade once in production
● Trust and willingness to pursue efforts would come from management

Why are there no systematic QA approaches in ML?
After all, ML models are:
● Subject to unexpected inputs
● Built in relationship with other software components
● Expected to be consistent, reliable and usable

How should we perform QA on ML models?
● We should uncover and understand the model's core functions
● We should gain insight into how the model responds to altered inputs
● We should also constantly validate the input to our model

An efficient framework for validation
● Deriving feature explainability
● Testing robustness to random and targeted alterations of the data
● Detecting data drift
● Other tests

Feature explainability related tests (1)
Risk
Errors due to a complex data pipeline, with data coming from multiple sources and APIs.
Test
Check whether many features are unimportant in creating the prediction.
Action
Prune the model and the dataset.
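
One way to run this kind of test is permutation importance: shuffle one feature at a time and measure how much the model's accuracy drops. The sketch below is a minimal illustration using only the Python standard library. The toy model, the synthetic data, and the 0.01 pruning threshold are assumptions made for the example, not something taken from the talk.

```python
import random

def accuracy(model, rows, labels):
    """Fraction of rows for which the model predicts the right label."""
    hits = sum(1 for row, y in zip(rows, labels) if model(row) == y)
    return hits / len(rows)

def permutation_importance(model, rows, labels, n_repeats=5, seed=0):
    """Mean drop in accuracy when a single feature column is shuffled."""
    rng = random.Random(seed)
    baseline = accuracy(model, rows, labels)
    importances = []
    for j in range(len(rows[0])):
        drops = []
        for _ in range(n_repeats):
            column = [row[j] for row in rows]
            rng.shuffle(column)
            # Rebuild each row with only feature j replaced by a shuffled value.
            shuffled = [row[:j] + [v] + row[j + 1:] for row, v in zip(rows, column)]
            drops.append(baseline - accuracy(model, shuffled, labels))
        importances.append(sum(drops) / n_repeats)
    return importances

# Hypothetical toy model: it only looks at feature 0 and ignores features 1 and 2.
def toy_model(row):
    return 1 if row[0] > 0.5 else 0

rng = random.Random(42)
rows = [[rng.random(), rng.random(), rng.random()] for _ in range(500)]
labels = [toy_model(row) for row in rows]

importances = permutation_importance(toy_model, rows, labels)
# Assumed threshold: features whose shuffling costs under 1% accuracy are pruning candidates.
prunable = [j for j, imp in enumerate(importances) if imp < 0.01]
```

Here `prunable` lists the feature indices that could be dropped from the dataset before re-fitting a simpler model. With a real model, the same loop would wrap its prediction function.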

Feature explainability related tests (2)
Risk
The model learned erroneous and non-replicable patterns.
Test
Check whether weakly correlated features, or features with a non-causal relationship to the target, have a strong contribution to the output.
Action
● Adversarial training
● Data augmentation
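
A rough way to automate part of this test is to compare each feature's correlation with the target against its share of the model's total importance. The sketch below flags features that are nearly uncorrelated with the target yet carry a large share of importance. The feature names, the importance values, and both thresholds are hypothetical, and a low correlation is only a hint: judging causality still requires domain knowledge.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 if either input is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y) if sd_x and sd_y else 0.0

def suspicious_features(columns, target, importances, max_corr=0.1, min_share=0.3):
    """Features with weak correlation to the target but a large importance share."""
    total = sum(importances.values())
    flagged = []
    for name, values in columns.items():
        share = importances[name] / total if total else 0.0
        if abs(pearson(values, target)) < max_corr and share > min_share:
            flagged.append(name)
    return flagged

# Hypothetical toy data: "signal" tracks the target, "batch_code" does not.
target = [0, 1, 0, 1, 0, 1, 0, 1]
columns = {
    "signal": [0.1, 0.9, 0.2, 0.8, 0.1, 0.9, 0.3, 0.7],
    "batch_code": [1, 1, 2, 2, 1, 1, 2, 2],
}
# Hypothetical importances, e.g. as reported by an explainability tool.
importances = {"signal": 0.4, "batch_code": 0.6}

flagged = suspicious_features(columns, target, importances)
```

In this toy case `batch_code` is flagged: it has zero correlation with the target but 60% of the importance, which suggests the model has latched onto a pattern that may not replicate.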

Feature explainability related tests (3)
Risk
Concept drift
Test
Check for changes in feature importance through time.
Action
● Model re-training
● Learning changes
● Pre-processing
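
A change in feature importance through time can be tracked by computing importances on two time windows and comparing each feature's normalized share. The sketch below assumes the importances have already been computed for a reference window and a recent window. The feature names, the values, and the 0.15 threshold are hypothetical.

```python
def normalize(importances):
    """Scale absolute importances so that they sum to 1."""
    total = sum(abs(v) for v in importances.values())
    if not total:
        return {name: 0.0 for name in importances}
    return {name: abs(v) / total for name, v in importances.items()}

def importance_shift(reference, current, threshold=0.15):
    """Features whose share of total importance moved by more than `threshold`."""
    ref, cur = normalize(reference), normalize(current)
    shifts = {}
    for name, ref_share in ref.items():
        delta = cur.get(name, 0.0) - ref_share
        if abs(delta) > threshold:
            shifts[name] = delta
    return shifts

# Hypothetical importances measured on two time windows.
reference_window = {"price": 50, "season": 30, "region": 20}
recent_window = {"price": 20, "season": 60, "region": 20}

shifted = importance_shift(reference_window, recent_window)
```

Here `price` lost about 30 points of importance share and `season` gained about 30, while `region` is unchanged. A shift of that size would be a reason to investigate concept drift and consider re-training.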

Robustness to random and targeted noise
Risk
The model’s output varies widely in response to slight variations in input.
Test
Evaluate the model’s performance under random or targeted transformations of the input.
Action
Data augmentation, adversarial training
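
For the random case, the test can be sketched by adding Gaussian noise of increasing scale to every feature and recording the accuracy at each scale. The example below covers random noise only, not targeted (adversarial) perturbations. The toy model, the synthetic data, and the noise scales are assumptions made for illustration.

```python
import random

def accuracy(model, rows, labels):
    """Fraction of rows for which the model predicts the right label."""
    hits = sum(1 for row, y in zip(rows, labels) if model(row) == y)
    return hits / len(rows)

def noise_robustness(model, rows, labels, scales, seed=0):
    """Accuracy of the model after adding zero-mean Gaussian noise of each scale."""
    rng = random.Random(seed)
    results = {}
    for scale in scales:
        noisy = [[x + rng.gauss(0.0, scale) for x in row] for row in rows]
        results[scale] = accuracy(model, noisy, labels)
    return results

# Hypothetical toy model: predicts 1 when the two features sum to more than 1.
def toy_model(row):
    return 1 if row[0] + row[1] > 1.0 else 0

rng = random.Random(7)
rows = [[rng.random(), rng.random()] for _ in range(500)]
labels = [toy_model(row) for row in rows]

results = noise_robustness(toy_model, rows, labels, scales=[0.0, 0.05, 0.5])
for scale, acc in results.items():
    print(f"noise scale {scale}: accuracy {acc:.3f}")
```

A model whose accuracy collapses at small noise scales is a candidate for data augmentation or adversarial training. What counts as a "small" scale depends on the units and range of each feature.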

Data drift
Risk
The distribution of incoming data differs from that of the training data.
Test
Evaluate whether the distribution of each feature is similar in the training data and the production data.
Action
Re-train on non-drifting features; use the data most similar to in-production input (the most recent) for training.
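
One common way to compare a feature's distribution between training and production is the two-sample Kolmogorov-Smirnov statistic, the largest gap between the two empirical cumulative distribution functions. The talk does not name a specific test, so this choice is an assumption. The sketch below uses the standard large-sample approximation for the critical value, and the synthetic samples are illustrative only.

```python
import bisect
import math
import random

def ks_statistic(sample_a, sample_b):
    """Largest absolute gap between the two empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    d = 0.0
    for x in a + b:
        cdf_a = bisect.bisect_right(a, x) / len(a)
        cdf_b = bisect.bisect_right(b, x) / len(b)
        d = max(d, abs(cdf_a - cdf_b))
    return d

def drifted(train, prod, alpha=0.05):
    """True if the KS statistic exceeds the approximate critical value at level alpha."""
    n, m = len(train), len(prod)
    critical = math.sqrt(-math.log(alpha / 2) / 2) * math.sqrt((n + m) / (n * m))
    return ks_statistic(train, prod) > critical

# Synthetic feature values: production is shifted by one standard deviation.
rng = random.Random(0)
train = [rng.gauss(0.0, 1.0) for _ in range(1000)]
prod_shifted = [rng.gauss(1.0, 1.0) for _ in range(1000)]

shift_detected = drifted(train, prod_shifted)
```

In practice the check would run once per feature, and features that drift repeatedly are the ones to exclude or to re-train on with more recent data. Running many tests at once increases false alarms, so the `alpha` level may need adjusting.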

Other tests
● Data Leakage
● Model Simplification
● Overfitting

The alternate fate of an AI system
The data science team builds a robust model.
On top of that, stakeholders understand:
● On what basis the model makes its predictions
● The risks associated with using the model
● That proper due diligence was conducted by the team

Automated scientific validation for your ML models in a few clicks, without the need to become an expert.

Questions?
P.S.: We’re hiring data scientists!
Simon Dagenais

Merci / Thank You
@DsdtMtl
(Check for next DSDT meetup at https://www.meetup.com/DSDTmtl)
http://bit.ly/dsdtmtl-in
