SlideShare a Scribd company logo
1 of 80
www.prmia.org© PRMIA 2020
Model Risk Management for Machine Learning Models
Sri Krishnamurthy, CFA, CAP
Founder & CEO
www.QuantUniversity.com
www.prmia.org© PRMIA 2020
Thought Leadership Webinar
www.prmia.org© PRMIA 2020
Presenter
Sri Krishnamurthy, CFA, CAP
Founder & CEO, QuantUniversity
• Advisory and Consultancy for Financial Analytics
• Prior experience at MathWorks, Citigroup, and Endeca and
25+ years in financial services and energy
• Columnist for the Wilmott Magazine
• Teaches Analytics in the Babson College MBA program and at
Northeastern University, Boston
• Reviewer: Journal of Asset Management
www.prmia.org© PRMIA 2020
About www.QuantUniversity.com
• Boston-based Data Science, Quant
Finance and Machine Learning
training and consulting advisory
• Trained more than 5,000 students
in Quantitative methods, Data
Science and Big Data Technologies
using MATLAB, Python and R
• Building a platform for AI
and Machine Learning Enablement
in the Enterprise
www.prmia.org© PRMIA 2020
Agenda
Considerations for MRM
for Machine Learning
models
Case Study
Machine Learning
www.prmia.org© PRMIA 2020
Machine Learning in FinancePart 1
www.prmia.org© PRMIA 2020
The world as we know has changed!
www.prmia.org© PRMIA 2020
Machine Learning and AI Have Revolutionized Finance
www.prmia.org© PRMIA 2020
Machine Learning & AI in Finance: A Paradigm Shift
Stochastic
Models
Factor Models Optimization
Risk Factors P/Q Quants
Derivative
pricing
Trading
Strategies
Simulations
Distribution
fitting
Real-time
analytics
Predictive
analytics
Machine
Learning
RPA NLP
Deep
Learning
Computer
Vision
Graph
Analytics
Chatbots
Sentiment
Analysis
Alternative
Data
Quant Data Scientist/ML
Engineer
www.prmia.org© PRMIA 2020
Machine Learning
1. https://en.wikipedia.org/wiki/Machine_learning
Figure Source: http://www.fsb.org/wp-content/uploads/P011117.pdf
AI
• Artificial intelligence is
intelligence demonstrated by
machines, in contrast to the
natural intelligence displayed by
humans and animals1.
Definitions: Machine Learning and AI
• Machine learning is the scientific
study of algorithms and statistical
models that computer systems use
to effectively perform a specific
task without using explicit
instructions, relying on patterns
and inference instead1.
1. https://en.wikipedia.org/wiki/Machine_learning
2. Figure Source: http://www.fsb.org/wp-content/uploads/P011117.pdf
www.prmia.org© PRMIA 2020
Polling Question 1
• Question: Have you deployed machine learning models in your
organization?
a) Considering it
b) Will be rolled out soon
c) In Production
d) Not yet
www.prmia.org© PRMIA 2020
Considerations for MRM for Machine
Learning models
Part 2
www.prmia.org© PRMIA 2020
The Basics
www.prmia.org© PRMIA 2020
Model Risk Defined
www.prmia.org© PRMIA 2020
The Machine Learning and AI Workflow
Data Scraping/
Ingestion
Data
Exploration
Data Cleansing
and Processing
Feature
Engineering
Model
Evaluation
& Tuning
Model
Selection
Model
Deployment/
Inference
Supervised
Unsupervised
Modeling
Data Engineer, Dev Ops Engineer
• Auto ML
• Model Validation
• Interpretability
Robotic Process Automation (RPA) (Microservices, Pipelines )
• SW: Web/ Rest API
• HW: GPU, Cloud
• Monitoring
• Regression
• KNN
• Decision Trees
• Naive Bayes
• Neural Networks
• Ensembles
• Clustering
• PCA
• Autoencoder
• RMS
• MAPS
• MAE
• Confusion Matrix
• Precision/Recall
• ROC
• Hyper-parameter
tuning
• Parameter Grids
Risk Management/ Compliance(All stages)
Software / Web Engineer Data Scientist/Quants
Analysts&
DecisionMakers
www.prmia.org© PRMIA 2020
Elements of Model Risk Management
www.prmia.org© PRMIA 2020
Model Governance Structure
www.prmia.org© PRMIA 2020
• Components that needs to be tracked
What constitutes an ML model?
• Interdependencies
• Lineage/Provenance
of individual
components
• Model params
• Hyper parameters
• Pipeline specifications
• Model specific
• Tests
• Data versions
Data Model
EnvironmentProcess
• Programming environment
• Execution environment
• Hardware specs
• Cloud
• GPU
www.prmia.org© PRMIA 2020
Elements of a Machine Learning System
Source: Sculley et al., 2015 "Hidden Technical Debt in Machine Learning Systems"
www.prmia.org© PRMIA 2020 19
AI Governance Is Gaining Focus
https://legalinstruments.oecd.org/en/instruments/OECD-LEGAL-0449
www.prmia.org© PRMIA 2020 20
Theory to Practice: How to cross the chasm ?
• Theory
• Regulations
• Local Laws
• Practical ML systems
• Company Expertise
• Company culture and Best
practices
www.prmia.org© PRMIA 2020 21
1. ML Life cycle management
2. Tracking
3. Metadata management
4. Scaling
5. Reproducibility
6. Interpretability
7. Testing
8. Measurement
Themes We Will Discuss Today
www.prmia.org© PRMIA 2020
Polling Question 2
• Which is the most challenging aspect in your organization ?
a) ML Life cycle management
b) Tracking & Metadata management
c) Scaling
d) Reproducibility & Interpretability
e) Testing & Measurement
www.prmia.org© PRMIA 2020
Up Next
www.prmia.org© PRMIA 2020
24
Model Lifecycle Management
www.prmia.org© PRMIA 2020
Source: T. van derWeide, O. Smirnov, M. Zielinski, D. Papadopoulos, and T. van Kasteren. Versioned machine learning pipelines for batch experimentation. In ML Systems, Workshop NIPS 2016, 2016.
Provenance and Lineage of Pipelines
www.prmia.org© PRMIA 2020 26
Versioning
www.prmia.org© PRMIA 2020
Schemas proposed
Sebastian Schelter, Joos-Hendrik Boese, Johannes Kirschnick, Thoralf Klein, and Stephan Seufert. Automatically Tracking Metadata and Provenance of Machine Learning Experiments. NIPS Workshop on
Machine Learning Systems, 2017.
www.prmia.org© PRMIA 2020
Schemas proposed
G. C. Publio, D. Esteves, and H. Zafar, “ML-Schema : Exposing the Semantics of Machine Learning with Schemas and Ontologies,” in Reproducibility in ML Workshop, ICML’18, 2018.
www.prmia.org© PRMIA 2020
MLFlow
www.prmia.org© PRMIA 2020
DVC
Source: https://dvc.org/
www.prmia.org© PRMIA 2020
31
Sample Project Structure
REF: Harvard Computefest 2020 demo example
www.prmia.org© PRMIA 2020
GoCD
Source: https://www.gocd.org/
www.prmia.org© PRMIA 2020
Up Next
www.prmia.org© PRMIA 2020
I. Altintas, O. Barney, and E. Jaeger-Frank. Provenance collection support in the Kepler scientific workflow system. In Provenance and annotation of data, pages 118–132.
Current Approaches
www.prmia.org© PRMIA 2020
Miao, Hui & Chavan, Amit & Deshpande, Amol. (2016). ProvDB: A System for Lifecycle Management of Collaborative Analysis Workflows.
Current Approaches
www.prmia.org© PRMIA 2020
Related Work
Xueping Liang, Sachin Shetty, Deepak Tosh, Charles Kamhoua, Kevin Kwiat, and Laurent Njilla. 2017. ProvChain: A Blockchain-based Data Provenance Architecture in Cloud Environment with Enhanced
Privacy and Availability. In Proceedings of the 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid '17). IEEE Press, Piscataway, NJ, USA, 468-477. DOI:
https://doi.org/10.1109/CCGRID.2017.8
Focus on Cloud data
provenance using Blockchain
www.prmia.org© PRMIA 2020
Related Work
Ramachandran, Aravind & Kantarcioglu, Dr. (2017). Using Blockchain and smart contracts for secure data provenance management.
DataProv: Built on top of
Ethereum, the platform
utilizes smart contracts and
open provenance model
(OPM) to record immutable
data trails.
www.prmia.org© PRMIA 2020
Related Work
Sarpatwar, Kanthi & Vaculín, Roman & Min, Hong & Su, Gong & Heath, Terry & Ganapavarapu, Giridhar & Dillenberger, Donna. (2019). Towards Enabling Trusted Artificial Intelligence via Blockchain.
10.1007/978-3-030-17277-0_8.
Trusted AI and provenance of
AI models
www.prmia.org© PRMIA 2020
Model Inference Standards
www.prmia.org© PRMIA 2020
Up Next
www.prmia.org© PRMIA 2020
Meta Data Management
www.prmia.org© PRMIA 2020
Meta Data Management
1. Add people to Amundsen’s data graph, by integrating with
integration with HR systems like Workday. Show commonly
used and bookmarked data assets.
2. Add dashboards and reports (e.g. Tableau, Looker, Apache
Superset) to Amundsen.
3. Add support for lineage across disparate data assets like
dashboards and tables.
4. Add events/schemas (e.g. schema registry) to Amundsen.
5. Add streams (e.g. Apache Kafka, AWS Kinesis) to Amundsen.
https://eng.lyft.com/amundsen-lyfts-data-discovery-metadata-engine-62d27254fbb9
www.prmia.org© PRMIA 2020
43
• Machine learning applications fail is due to the lack of rich, diverse and
clean datasets needed to build models.
• Historical datasets may be hard to acquire or may be skewed towards the
majority class.
• All plausible scenarios of the future haven’t happened yet!
• Synthetic data used to enrich and augment existing datasets to provide
comprehensive samples while training machine learning problems.
Role of Data Augmentation
www.prmia.org© PRMIA 2020
Up Next
www.prmia.org© PRMIA 2020
GPUs for Scaling
REF : NVIDIA DLI Multi-GPU course slide deck
www.prmia.org© PRMIA 2020
GPUs for Scaling
REF : NVIDIA DLI Multi-GPU course slide deck
www.prmia.org© PRMIA 2020
“TSNE Optimizations
There are four optimizations used to improve the performance of TSNE on GPUs:
1. calculating higher dimensional probabilities with less GPU memory,
2. approximating higher dimensional probabilities,
3. reducing arithmetic operations, and
4. broadcasting along rows.”
Ref: https://medium.com/rapids-ai/tsne-with-gpus-hours-to-seconds-9d9c17c941db
Using GPUs requires GPU compatible code changes
www.prmia.org© PRMIA 2020
Polling Question 3
• What kinds of ML tools do you use in your organization?
a) None
b) On-prem - Enterprise
c) Cloud - Enterprise
d) On-prem – Open Source
e) Cloud – Open Source
www.prmia.org© PRMIA 2020
Up Next
www.prmia.org© PRMIA 2020
The Reproducibility Challenge
https://www.nature.com/news/1-500-scientists-lift-the-lid-on-reproducibility-1.19970
www.prmia.org© PRMIA 2020
• Repeatability (Same team, same experimental setup)
— The measurement can be obtained with stated precision by the same team using the same
measurement procedure, the same measuring system, under the same operating conditions, in
the same location on multiple trials. For computational experiments, this means that a
researcher can reliably repeat her own computation.
• Replicability (Different team, same experimental setup)
— The measurement can be obtained with stated precision by a different team using the
same measurement procedure, the same measuring system, under the same operating
conditions, in the same or a different location on multiple trials. For computational
experiments, this means that an independent group can obtain the same result using the
author’s own artifacts.
• Reproducibility (Different team, different experimental setup)
— The measurement can be obtained with stated precision by a different team, a different
measuring system, in a different location on multiple trials. For computational
experiments, this means that an independent group can obtain the same result using
artifacts which they develop completely independently.
Repeatable or Reproducible or Replicable
https://www.acm.org/publications/policies/artifact-review-badging
www.prmia.org© PRMIA 2020
Up Next
www.prmia.org© PRMIA 2020
“Interpretability is the degree to which a human can
consistently predict the model's result”1
What is the objective?2
• Simply be to get more useful information from the mode
• Uncover causal structure in observational data
• Transparency? Convergence?
• Model complexity?
• Culture?
The Interpretability Challenge
1. https://christophm.github.io/interpretable-ml-book/interpretability.html
2. https://arxiv.org/abs/1606.03490
www.prmia.org© PRMIA 2020
• Partial dependence plots (PDP)
• Shapley Values
• Lime (Local Interpretable Model-Agnostic Explanations)
• SHAP (SHapley Additive exPlanations)
Reference: https://christophm.github.io/interpretable-ml-book/
Shapley Values
www.prmia.org© PRMIA 2020
• Partial dependence plots (PDP) show the dependence between the target
response and a set of ‘target’ features, marginalizing over the values of all
other features (the ‘complement’ features).
• Intuitively, we can interpret the partial dependence as the expected target
response as a function of the ‘target’ features.
https://scikit-learn.org/stable/modules/partial_dependence.html
The Interpretability Challenge
www.prmia.org© PRMIA 2020
Which model to choose?
Client Objective:
• Build the best forecasting model that has a
MAPE of 5% or less
Result:
· Regression – 7% MAPE
· Neural Networks – 4% MAPE
· Random Forest – 5% MAPE
Client choice:
· Regression despite being the worst of the
top-3 models
· “I won’t deploy anything that I don’t
understand”
Source: http://engineering.electrical-equipment.org/electrical-distribution/electric-load-forecasting-advantages-challenges.html
www.prmia.org© PRMIA 2020
Up Next
www.prmia.org© PRMIA 2020
Testing for Machine Learning Models
Figureref: http://www.actuaries.org/CTTEES_SOLV/Documents/StressTestingPaper.pdf
www.prmia.org© PRMIA 2020
59
Comprehensive Testing Is Important
www.prmia.org© PRMIA 2020 60
Can Machine Learning algorithms be gamed?
https://www.youtube.com/watch?time_continue=36&v=MIbFvK2S9g8
https://arxiv.org/abs/1904.08653
84
www.prmia.org© PRMIA 2020
Up Next
www.prmia.org© PRMIA 2020
Model Risk Assessment Framework
www.prmia.org© PRMIA 2020
Quantifying Model Risk Is Important
www.prmia.org© PRMIA 2020
RISKGRADING
RiskScores
Impact
5 5 10 15 20 25
4 4 8 12 16 20
3 3 6 9 12 15
2 2 4 6 8 10
1 1 2 3 4 5
1 2 3 4 5
Likelihood of occurrence
Red High Risk
Yellow Moderate Risk
Green LowRisk
High Impact- High likelihood of occurrence: Needs adequate model risk
controlmeasures to mitigate risk
High Impact – Lowlikelihood of occurrence:Address through model risk
control measures
and contingency plans
Low Impact – High likelihood of occurrence : Lower priority model risk
control measures
LowImpact – Lowlikelihood of occurrence:Least prioritymodel risk control
measures
www.prmia.org© PRMIA 2020
Summary
1. ML Life cycle management
2. Tracking
3. Metadata management
4. Scaling
5. Reproducibility
6. Interpretability
7. Testing
8. Measurement
www.prmia.org© PRMIA 2020
Up Next Case study:
Using Synthetic Data for Model Validation
www.prmia.org© PRMIA 2020
Polling Question 4
• Have you considered using Synthetic/Simulated data for testing
and validating models?
a) No
b) Considering it
c) Yes
d) Tried it and decided not to use it
www.prmia.org© PRMIA 2020
Synthetic Data
• Synthetic data is "any production data applicable to a given situation that
are not obtained by direct measurement.”1
• In finance, Synthetic data has been used in stress and scenario analysis for
many years now.
• Example: Montecarlo simulations have been used to generate future
scenarios.
• In Machine Learning, Synthetic Data plays an important role to prevent
overfitting, handle imbalance class problems, and to accommodate
plausible scenarios.
1 https://en.wikipedia.org/wiki/Synthetic_data
www.prmia.org© PRMIA 2020
Challenges with Real Datasets
All scenarios haven’t played out
• Stress scenarios
• What-if scenarios
Figureref:http://www.actuaries.org/CTTEES_SOLV/Documents/StressTestingPaper.pdf
www.prmia.org© PRMIA 2020
Access
• Hard to find
• Rare class problems
• Privacy concerns making it
difficult to share
Challenges with Real Datasets
Picture source: www.pixabay.com
www.prmia.org© PRMIA 2020
Imbalanced
• Need more samples of rare class
• Need proxies for data points that
were not observed or recorded
Challenges with Real Datasets
Picture source: www.pixabay.com
www.prmia.org© PRMIA 2020
Synthetic Data in Finance
Ref: Machine Learning for Asset Managers, Marcos M. López de Prado,,CAMBRIDGE UNIVERSITY PRESS 2020
www.prmia.org© PRMIA 2020
73
www.prmia.org© PRMIA 2020
MRM Use Cases
• Data Anonymization
— Anonymize training and test data sets for internal and external model
validation
• Data Augmentation
— Augment sparse datasets with realistic datasets
• Handling Imbalanced data classes
— Handle Algorithmic bias and to test efficacy of model for rare-class
problems
• Stress and Scenario testing
— Simulate test scenarios for extreme but plausible scenarios to test
model behavior
www.prmia.org© PRMIA 2020
VIX Characteristics
REF: https://www.investopedia.com/terms/v/vix.asp
www.prmia.org© PRMIA 2020
Demo: Synthetic VIX Generation
www.prmia.org© PRMIA 2020
Up Next Demo
If you would like access to the demo and the QuSandbox,
please contact us at info@qusandbox.com.
www.prmia.org© PRMIA 2020
Use Code MRMPRMIA for $100 off!
Register here
www.prmia.org© PRMIA 2020
QuantUniversity’s Model Risk related papers
Email me at sri@quantuniversity.com for a copy
www.prmia.org© PRMIA 2020
Q&A Sri Krishnamurthy, CFA, CAP
Founder and CEO
Information, data and drawings embodied in this presentation are strictly a property of QuantUniversity LLC. and shall not
be distributed or used in any other publication without the prior written consent of QuantUniversity LLC.

More Related Content

What's hot

Model governance in the age of data science & AI
Model governance in the age of data science & AIModel governance in the age of data science & AI
Model governance in the age of data science & AIQuantUniversity
 
Adopting Data Science and Machine Learning in the financial enterprise
Adopting Data Science and Machine Learning in the financial enterpriseAdopting Data Science and Machine Learning in the financial enterprise
Adopting Data Science and Machine Learning in the financial enterpriseQuantUniversity
 
achine Learning and Model Risk
achine Learning and Model Riskachine Learning and Model Risk
achine Learning and Model RiskQuantUniversity
 
CFA-NY Workshop - Final slides
CFA-NY Workshop - Final slidesCFA-NY Workshop - Final slides
CFA-NY Workshop - Final slidesQuantUniversity
 
Ml master class cfa poland
Ml master class   cfa polandMl master class   cfa poland
Ml master class cfa polandQuantUniversity
 
Machine Learning and AI in Risk Management
Machine Learning and AI in Risk ManagementMachine Learning and AI in Risk Management
Machine Learning and AI in Risk ManagementQuantUniversity
 
QuantUniversity Machine Learning in Finance Course
QuantUniversity Machine Learning in Finance CourseQuantUniversity Machine Learning in Finance Course
QuantUniversity Machine Learning in Finance CourseQuantUniversity
 
Data science in 10 steps
Data science in 10 stepsData science in 10 steps
Data science in 10 stepsQuantUniversity
 
Ml and AI for financial professionals
Ml and AI for financial professionalsMl and AI for financial professionals
Ml and AI for financial professionalsQuantUniversity
 
Time series analysis : Refresher and Innovations
Time series analysis : Refresher and InnovationsTime series analysis : Refresher and Innovations
Time series analysis : Refresher and InnovationsQuantUniversity
 
Machine learning for factor investing
Machine learning for factor investingMachine learning for factor investing
Machine learning for factor investingQuantUniversity
 
Rapid prototyping quant research ml models using the qu sandbox
Rapid prototyping quant research ml models using the qu sandboxRapid prototyping quant research ml models using the qu sandbox
Rapid prototyping quant research ml models using the qu sandboxQuantUniversity
 
No, you don't need to learn python
No, you don't need to learn pythonNo, you don't need to learn python
No, you don't need to learn pythonQuantUniversity
 
Synthetic data generation for machine learning
Synthetic data generation for machine learningSynthetic data generation for machine learning
Synthetic data generation for machine learningQuantUniversity
 
Ml master class northeastern university
Ml master class   northeastern universityMl master class   northeastern university
Ml master class northeastern universityQuantUniversity
 
Modular Machine Learning for Model Validation
Modular Machine Learning for Model ValidationModular Machine Learning for Model Validation
Modular Machine Learning for Model ValidationQuantUniversity
 

What's hot (20)

Model governance in the age of data science & AI
Model governance in the age of data science & AIModel governance in the age of data science & AI
Model governance in the age of data science & AI
 
QCon conference 2019
QCon conference 2019QCon conference 2019
QCon conference 2019
 
Adopting Data Science and Machine Learning in the financial enterprise
Adopting Data Science and Machine Learning in the financial enterpriseAdopting Data Science and Machine Learning in the financial enterprise
Adopting Data Science and Machine Learning in the financial enterprise
 
achine Learning and Model Risk
achine Learning and Model Riskachine Learning and Model Risk
achine Learning and Model Risk
 
CFA-NY Workshop - Final slides
CFA-NY Workshop - Final slidesCFA-NY Workshop - Final slides
CFA-NY Workshop - Final slides
 
Python for Data science
Python for Data sciencePython for Data science
Python for Data science
 
Ml master class cfa poland
Ml master class   cfa polandMl master class   cfa poland
Ml master class cfa poland
 
Machine Learning and AI in Risk Management
Machine Learning and AI in Risk ManagementMachine Learning and AI in Risk Management
Machine Learning and AI in Risk Management
 
QuantUniversity Machine Learning in Finance Course
QuantUniversity Machine Learning in Finance CourseQuantUniversity Machine Learning in Finance Course
QuantUniversity Machine Learning in Finance Course
 
Data science in 10 steps
Data science in 10 stepsData science in 10 steps
Data science in 10 steps
 
Ml and AI for financial professionals
Ml and AI for financial professionalsMl and AI for financial professionals
Ml and AI for financial professionals
 
ML master class
ML master classML master class
ML master class
 
Time series analysis : Refresher and Innovations
Time series analysis : Refresher and InnovationsTime series analysis : Refresher and Innovations
Time series analysis : Refresher and Innovations
 
Machine learning for factor investing
Machine learning for factor investingMachine learning for factor investing
Machine learning for factor investing
 
Ml master class
Ml master classMl master class
Ml master class
 
Rapid prototyping quant research ml models using the qu sandbox
Rapid prototyping quant research ml models using the qu sandboxRapid prototyping quant research ml models using the qu sandbox
Rapid prototyping quant research ml models using the qu sandbox
 
No, you don't need to learn python
No, you don't need to learn pythonNo, you don't need to learn python
No, you don't need to learn python
 
Synthetic data generation for machine learning
Synthetic data generation for machine learningSynthetic data generation for machine learning
Synthetic data generation for machine learning
 
Ml master class northeastern university
Ml master class   northeastern universityMl master class   northeastern university
Ml master class northeastern university
 
Modular Machine Learning for Model Validation
Modular Machine Learning for Model ValidationModular Machine Learning for Model Validation
Modular Machine Learning for Model Validation
 

Similar to Model Risk Management for Machine Learning Models

Model Experiments Tracking and Registration using MLflow on Databricks
Model Experiments Tracking and Registration using MLflow on DatabricksModel Experiments Tracking and Registration using MLflow on Databricks
Model Experiments Tracking and Registration using MLflow on DatabricksDatabricks
 
Automatic machine learning (AutoML) 101
Automatic machine learning (AutoML) 101Automatic machine learning (AutoML) 101
Automatic machine learning (AutoML) 101QuantUniversity
 
Synthetic VIX Data Generation Using ML Techniques
Synthetic VIX Data Generation Using ML TechniquesSynthetic VIX Data Generation Using ML Techniques
Synthetic VIX Data Generation Using ML TechniquesQuantUniversity
 
2022-09-14-MATLABDay_SREC.pptx
2022-09-14-MATLABDay_SREC.pptx2022-09-14-MATLABDay_SREC.pptx
2022-09-14-MATLABDay_SREC.pptxAnjanMayra1
 
QuantUniversity Fintech Bootcamp Day- 3
QuantUniversity Fintech Bootcamp Day- 3QuantUniversity Fintech Bootcamp Day- 3
QuantUniversity Fintech Bootcamp Day- 3QuantUniversity
 
DataOps - Production ML
DataOps - Production MLDataOps - Production ML
DataOps - Production MLAl Zindiq
 
ML and AI in Finance: Master Class
ML and AI in Finance: Master ClassML and AI in Finance: Master Class
ML and AI in Finance: Master ClassQuantUniversity
 
Ml conference slides boston june 2019
Ml conference slides boston june 2019Ml conference slides boston june 2019
Ml conference slides boston june 2019QuantUniversity
 
Synthetic data in finance
Synthetic data in financeSynthetic data in finance
Synthetic data in financeQuantUniversity
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...DATAVERSITY
 
Engineering Machine Learning Data Pipelines Series: Tracking Data Lineage fro...
Engineering Machine Learning Data Pipelines Series: Tracking Data Lineage fro...Engineering Machine Learning Data Pipelines Series: Tracking Data Lineage fro...
Engineering Machine Learning Data Pipelines Series: Tracking Data Lineage fro...Precisely
 
2020 09-16-ai-engineering challanges
2020 09-16-ai-engineering challanges2020 09-16-ai-engineering challanges
2020 09-16-ai-engineering challangesIvica Crnkovic
 
Aastha Grover Resume (2)
Aastha Grover Resume (2)Aastha Grover Resume (2)
Aastha Grover Resume (2)Aastha Grover
 
Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Sri Ambati
 
Qu for India - QuantUniversity FundRaiser
Qu for India  - QuantUniversity FundRaiserQu for India  - QuantUniversity FundRaiser
Qu for India - QuantUniversity FundRaiserQuantUniversity
 
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...QuantUniversity
 

Similar to Model Risk Management for Machine Learning Models (20)

Model Experiments Tracking and Registration using MLflow on Databricks
Model Experiments Tracking and Registration using MLflow on DatabricksModel Experiments Tracking and Registration using MLflow on Databricks
Model Experiments Tracking and Registration using MLflow on Databricks
 
Automatic machine learning (AutoML) 101
Automatic machine learning (AutoML) 101Automatic machine learning (AutoML) 101
Automatic machine learning (AutoML) 101
 
Synthetic VIX Data Generation Using ML Techniques
Synthetic VIX Data Generation Using ML TechniquesSynthetic VIX Data Generation Using ML Techniques
Synthetic VIX Data Generation Using ML Techniques
 
2022-09-14-MATLABDay_SREC.pptx
2022-09-14-MATLABDay_SREC.pptx2022-09-14-MATLABDay_SREC.pptx
2022-09-14-MATLABDay_SREC.pptx
 
QuantUniversity Fintech Bootcamp Day- 3
QuantUniversity Fintech Bootcamp Day- 3QuantUniversity Fintech Bootcamp Day- 3
QuantUniversity Fintech Bootcamp Day- 3
 
An Analytics Platform for Connected Vehicles
An Analytics Platform for Connected VehiclesAn Analytics Platform for Connected Vehicles
An Analytics Platform for Connected Vehicles
 
DataOps - Production ML
DataOps - Production MLDataOps - Production ML
DataOps - Production ML
 
ML and AI in Finance: Master Class
ML and AI in Finance: Master ClassML and AI in Finance: Master Class
ML and AI in Finance: Master Class
 
Ml conference slides boston june 2019
Ml conference slides boston june 2019Ml conference slides boston june 2019
Ml conference slides boston june 2019
 
Introducing MLOps.pdf
Introducing MLOps.pdfIntroducing MLOps.pdf
Introducing MLOps.pdf
 
Resume jay
Resume jayResume jay
Resume jay
 
Synthetic data in finance
Synthetic data in financeSynthetic data in finance
Synthetic data in finance
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
Engineering Machine Learning Data Pipelines Series: Tracking Data Lineage fro...
Engineering Machine Learning Data Pipelines Series: Tracking Data Lineage fro...Engineering Machine Learning Data Pipelines Series: Tracking Data Lineage fro...
Engineering Machine Learning Data Pipelines Series: Tracking Data Lineage fro...
 
2020 09-16-ai-engineering challanges
2020 09-16-ai-engineering challanges2020 09-16-ai-engineering challanges
2020 09-16-ai-engineering challanges
 
Aastha Grover Resume (2)
Aastha Grover Resume (2)Aastha Grover Resume (2)
Aastha Grover Resume (2)
 
Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...Building a Real-Time Security Application Using Log Data and Machine Learning...
Building a Real-Time Security Application Using Log Data and Machine Learning...
 
Qu for India - QuantUniversity FundRaiser
Qu for India  - QuantUniversity FundRaiserQu for India  - QuantUniversity FundRaiser
Qu for India - QuantUniversity FundRaiser
 
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
 
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
 

More from QuantUniversity

EU Artificial Intelligence Act 2024 passed !
EU Artificial Intelligence Act 2024 passed !EU Artificial Intelligence Act 2024 passed !
EU Artificial Intelligence Act 2024 passed !QuantUniversity
 
Managing-the-Risks-of-LLMs-in-FS-Industry-Roundtable-TruEra-QuantU.pdf
Managing-the-Risks-of-LLMs-in-FS-Industry-Roundtable-TruEra-QuantU.pdfManaging-the-Risks-of-LLMs-in-FS-Industry-Roundtable-TruEra-QuantU.pdf
Managing-the-Risks-of-LLMs-in-FS-Industry-Roundtable-TruEra-QuantU.pdfQuantUniversity
 
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALSPYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALSQuantUniversity
 
Ml master class for CFA Dallas
Ml master class for CFA DallasMl master class for CFA Dallas
Ml master class for CFA DallasQuantUniversity
 
Algorithmic auditing 1.0
Algorithmic auditing 1.0Algorithmic auditing 1.0
Algorithmic auditing 1.0QuantUniversity
 
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...QuantUniversity
 
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...Machine Learning: Considerations for Fairly and Transparently Expanding Acces...
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...QuantUniversity
 
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewQuantUniversity
 
AI Explainability and Model Risk Management
AI Explainability and Model Risk ManagementAI Explainability and Model Risk Management
AI Explainability and Model Risk ManagementQuantUniversity
 
Algorithmic auditing 1.0
Algorithmic auditing 1.0Algorithmic auditing 1.0
Algorithmic auditing 1.0QuantUniversity
 
Machine Learning in Finance: 10 Things You Need to Know in 2021
Machine Learning in Finance: 10 Things You Need to Know in 2021Machine Learning in Finance: 10 Things You Need to Know in 2021
Machine Learning in Finance: 10 Things You Need to Know in 2021QuantUniversity
 
Bayesian Portfolio Allocation
Bayesian Portfolio AllocationBayesian Portfolio Allocation
Bayesian Portfolio AllocationQuantUniversity
 
Constructing Private Asset Benchmarks
Constructing Private Asset BenchmarksConstructing Private Asset Benchmarks
Constructing Private Asset BenchmarksQuantUniversity
 
Machine Learning Interpretability
Machine Learning InterpretabilityMachine Learning Interpretability
Machine Learning InterpretabilityQuantUniversity
 
Responsible AI in Action
Responsible AI in ActionResponsible AI in Action
Responsible AI in ActionQuantUniversity
 
Qu speaker series 14: Synthetic Data Generation in Finance
Qu speaker series 14: Synthetic Data Generation in FinanceQu speaker series 14: Synthetic Data Generation in Finance
Qu speaker series 14: Synthetic Data Generation in FinanceQuantUniversity
 
Qu speaker series:Ethical Use of AI in Financial Markets
Qu speaker series:Ethical Use of AI in Financial MarketsQu speaker series:Ethical Use of AI in Financial Markets
Qu speaker series:Ethical Use of AI in Financial MarketsQuantUniversity
 

More from QuantUniversity (20)

EU Artificial Intelligence Act 2024 passed !
EU Artificial Intelligence Act 2024 passed !EU Artificial Intelligence Act 2024 passed !
EU Artificial Intelligence Act 2024 passed !
 
Managing-the-Risks-of-LLMs-in-FS-Industry-Roundtable-TruEra-QuantU.pdf
Managing-the-Risks-of-LLMs-in-FS-Industry-Roundtable-TruEra-QuantU.pdfManaging-the-Risks-of-LLMs-in-FS-Industry-Roundtable-TruEra-QuantU.pdf
Managing-the-Risks-of-LLMs-in-FS-Industry-Roundtable-TruEra-QuantU.pdf
 
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALSPYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
PYTHON AND DATA SCIENCE FOR INVESTMENT PROFESSIONALS
 
Ml master class for CFA Dallas
Ml master class for CFA DallasMl master class for CFA Dallas
Ml master class for CFA Dallas
 
Algorithmic auditing 1.0
Algorithmic auditing 1.0Algorithmic auditing 1.0
Algorithmic auditing 1.0
 
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...
Towards Fairer Datasets: Filtering and Balancing the Distribution of the Peop...
 
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...Machine Learning: Considerations for Fairly and Transparently Expanding Acces...
Machine Learning: Considerations for Fairly and Transparently Expanding Acces...
 
Seeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper reviewSeeing what a gan cannot generate: paper review
Seeing what a gan cannot generate: paper review
 
AI Explainability and Model Risk Management
AI Explainability and Model Risk ManagementAI Explainability and Model Risk Management
AI Explainability and Model Risk Management
 
Algorithmic auditing 1.0
Algorithmic auditing 1.0Algorithmic auditing 1.0
Algorithmic auditing 1.0
 
Machine Learning in Finance: 10 Things You Need to Know in 2021
Machine Learning in Finance: 10 Things You Need to Know in 2021Machine Learning in Finance: 10 Things You Need to Know in 2021
Machine Learning in Finance: 10 Things You Need to Know in 2021
 
Bayesian Portfolio Allocation
Bayesian Portfolio AllocationBayesian Portfolio Allocation
Bayesian Portfolio Allocation
 
The API Jungle
The API JungleThe API Jungle
The API Jungle
 
Explainable AI Workshop
Explainable AI WorkshopExplainable AI Workshop
Explainable AI Workshop
 
Constructing Private Asset Benchmarks
Constructing Private Asset BenchmarksConstructing Private Asset Benchmarks
Constructing Private Asset Benchmarks
 
Machine Learning Interpretability
Machine Learning InterpretabilityMachine Learning Interpretability
Machine Learning Interpretability
 
Responsible AI in Action
Responsible AI in ActionResponsible AI in Action
Responsible AI in Action
 
Qu speaker series 14: Synthetic Data Generation in Finance
Qu speaker series 14: Synthetic Data Generation in FinanceQu speaker series 14: Synthetic Data Generation in Finance
Qu speaker series 14: Synthetic Data Generation in Finance
 
Qwafafew meeting 5
Qwafafew meeting 5Qwafafew meeting 5
Qwafafew meeting 5
 
Qu speaker series:Ethical Use of AI in Financial Markets
Qu speaker series:Ethical Use of AI in Financial MarketsQu speaker series:Ethical Use of AI in Financial Markets
Qu speaker series:Ethical Use of AI in Financial Markets
 

Recently uploaded

What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfrahulyadav957181
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfsimulationsindia
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 

Recently uploaded (20)

What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdf
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 

Model Risk Management for Machine Learning Models

  • 1. www.prmia.org© PRMIA 2020 Model Risk Management for Machine Learning Models Sri Krishnamurthy, CFA, CAP Founder & CEO www.QuantUniversity.com www.prmia.org© PRMIA 2020 Thought Leadership Webinar
  • 2. www.prmia.org© PRMIA 2020 Presenter Sri Krishnamurthy, CFA, CAP Founder & CEO, QuantUniversity • Advisory and Consultancy for Financial Analytics • Prior experience at MathWorks, Citigroup, and Endeca and 25+ years in financial services and energy • Columnist for the Wilmott Magazine • Teaches Analytics in the Babson College MBA program and at Northeastern University, Boston • Reviewer: Journal of Asset Management
  • 3. www.prmia.org© PRMIA 2020 About www.QuantUniversity.com • Boston-based Data Science, Quant Finance and Machine Learning training and consulting advisory • Trained more than 5,000 students in Quantitative methods, Data Science and Big Data Technologies using MATLAB, Python and R • Building a platform for AI and Machine Learning Enablement in the Enterprise
  • 4. www.prmia.org© PRMIA 2020 Agenda Considerations for MRM for Machine Learning models Case Study Machine Learning
  • 5. www.prmia.org© PRMIA 2020 Machine Learning in FinancePart 1
  • 6. www.prmia.org© PRMIA 2020 The world as we know has changed!
  • 7. www.prmia.org© PRMIA 2020 Machine Learning and AI Have Revolutionized Finance
  • 8. www.prmia.org© PRMIA 2020 Machine Learning & AI in Finance: A Paradigm Shift Stochastic Models Factor Models Optimization Risk Factors P/Q Quants Derivative pricing Trading Strategies Simulations Distribution fitting Real-time analytics Predictive analytics Machine Learning RPA NLP Deep Learning Computer Vision Graph Analytics Chatbots Sentiment Analysis Alternative Data Quant Data Scientist/ML Engineer
  • 9. www.prmia.org© PRMIA 2020 Machine Learning 1. https://en.wikipedia.org/wiki/Machine_learning Figure Source: http://www.fsb.org/wp-content/uploads/P011117.pdf AI • Artificial intelligence is intelligence demonstrated by machines, in contrast to the natural intelligence displayed by humans and animals1. Definitions: Machine Learning and AI • Machine learning is the scientific study of algorithms and statistical models that computer systems use to effectively perform a specific task without using explicit instructions, relying on patterns and inference instead1. 1. https://en.wikipedia.org/wiki/Machine_learning 2. Figure Source: http://www.fsb.org/wp-content/uploads/P011117.pdf
  • 10. www.prmia.org© PRMIA 2020 Polling Question 1 • Question: Have you deployed machine learning models in your organization? a) Considering it b) Will be rolled out soon c) In Production d) Not yet
  • 11. www.prmia.org© PRMIA 2020 Considerations for MRM for Machine Learning models Part 2
  • 14. www.prmia.org© PRMIA 2020 The Machine Learning and AI Workflow Data Scraping/ Ingestion Data Exploration Data Cleansing and Processing Feature Engineering Model Evaluation & Tuning Model Selection Model Deployment/ Inference Supervised Unsupervised Modeling Data Engineer, Dev Ops Engineer • Auto ML • Model Validation • Interpretability Robotic Process Automation (RPA) (Microservices, Pipelines ) • SW: Web/ Rest API • HW: GPU, Cloud • Monitoring • Regression • KNN • Decision Trees • Naive Bayes • Neural Networks • Ensembles • Clustering • PCA • Autoencoder • RMS • MAPS • MAE • Confusion Matrix • Precision/Recall • ROC • Hyper-parameter tuning • Parameter Grids Risk Management/ Compliance(All stages) Software / Web Engineer Data Scientist/Quants Analysts& DecisionMakers
  • 15. www.prmia.org© PRMIA 2020 Elements of Model Risk Management
  • 16. www.prmia.org© PRMIA 2020 Model Governance Structure
  • 17. www.prmia.org© PRMIA 2020 • Components that needs to be tracked What constitutes an ML model? • Interdependencies • Lineage/Provenance of individual components • Model params • Hyper parameters • Pipeline specifications • Model specific • Tests • Data versions Data Model EnvironmentProcess • Programming environment • Execution environment • Hardware specs • Cloud • GPU
  • 18. www.prmia.org© PRMIA 2020 Elements of a Machine Learning System Source: Sculley et al., 2015 "Hidden Technical Debt in Machine Learning Systems"
  • 19. www.prmia.org© PRMIA 2020 19 AI Governance Is Gaining Focus https://legalinstruments.oecd.org/en/instruments/OECD-LEGAL-0449
  • 20. www.prmia.org© PRMIA 2020 20 Theory to Practice: How to cross the chasm ? • Theory • Regulations • Local Laws • Practical ML systems • Company Expertise • Company culture and Best practices
  • 21. www.prmia.org© PRMIA 2020 21 1. ML Life cycle management 2. Tracking 3. Metadata management 4. Scaling 5. Reproducibility 6. Interpretability 7. Testing 8. Measurement Themes We Will Discuss Today
  • 22. www.prmia.org© PRMIA 2020 Polling Question 2 • Which is the most challenging aspect in your organization ? a) ML Life cycle management b) Tracking & Metadata management c) Scaling d) Reproducibility & Interpretability e) Testing & Measurement
  • 24. www.prmia.org© PRMIA 2020 24 Model Lifecycle Management
  • 25. www.prmia.org© PRMIA 2020 Source: T. van derWeide, O. Smirnov, M. Zielinski, D. Papadopoulos, and T. van Kasteren. Versioned machine learning pipelines for batch experimentation. In ML Systems, Workshop NIPS 2016, 2016. Provenance and Lineage of Pipelines
  • 26. www.prmia.org© PRMIA 2020 26 Versioning
  • 27. www.prmia.org© PRMIA 2020 Schemas proposed Sebastian Schelter, Joos-Hendrik Boese, Johannes Kirschnick, Thoralf Klein, and Stephan Seufert. Automatically Tracking Metadata and Provenance of Machine Learning Experiments. NIPS Workshop on Machine Learning Systems, 2017.
  • 28. www.prmia.org© PRMIA 2020 Schemas proposed G. C. Publio, D. Esteves, and H. Zafar, “ML-Schema : Exposing the Semantics of Machine Learning with Schemas and Ontologies,” in Reproducibility in ML Workshop, ICML’18, 2018.
  • 31. www.prmia.org© PRMIA 2020 31 Sample Project Structure REF: Harvard Computefest 2020 demo example
  • 34. www.prmia.org© PRMIA 2020 I. Altintas, O. Barney, and E. Jaeger-Frank. Provenance collection support in the Kepler scientific workflow system. In Provenance and annotation of data, pages 118–132. Current Approaches
  • 35. www.prmia.org© PRMIA 2020 Miao, Hui & Chavan, Amit & Deshpande, Amol. (2016). ProvDB: A System for Lifecycle Management of Collaborative Analysis Workflows. Current Approaches
  • 36. www.prmia.org© PRMIA 2020 Related Work Xueping Liang, Sachin Shetty, Deepak Tosh, Charles Kamhoua, Kevin Kwiat, and Laurent Njilla. 2017. ProvChain: A Blockchain-based Data Provenance Architecture in Cloud Environment with Enhanced Privacy and Availability. In Proceedings of the 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid '17). IEEE Press, Piscataway, NJ, USA, 468-477. DOI: https://doi.org/10.1109/CCGRID.2017.8 Focus on Cloud data provenance using Blockchain
  • 37. www.prmia.org© PRMIA 2020 Related Work Ramachandran, Aravind & Kantarcioglu, Dr. (2017). Using Blockchain and smart contracts for secure data provenance management. DataProv: Built on top of Ethereum, the platform utilizes smart contracts and open provenance model (OPM) to record immutable data trails.
  • 38. www.prmia.org© PRMIA 2020 Related Work Sarpatwar, Kanthi & Vaculín, Roman & Min, Hong & Su, Gong & Heath, Terry & Ganapavarapu, Giridhar & Dillenberger, Donna. (2019). Towards Enabling Trusted Artificial Intelligence via Blockchain. 10.1007/978-3-030-17277-0_8. Trusted AI and provenance of AI models
  • 39. www.prmia.org© PRMIA 2020 Model Inference Standards
  • 42. www.prmia.org© PRMIA 2020 Meta Data Management 1. Add people to Amundsen’s data graph, by integrating with integration with HR systems like Workday. Show commonly used and bookmarked data assets. 2. Add dashboards and reports (e.g. Tableau, Looker, Apache Superset) to Amundsen. 3. Add support for lineage across disparate data assets like dashboards and tables. 4. Add events/schemas (e.g. schema registry) to Amundsen. 5. Add streams (e.g. Apache Kafka, AWS Kinesis) to Amundsen. https://eng.lyft.com/amundsen-lyfts-data-discovery-metadata-engine-62d27254fbb9
  • 43. www.prmia.org© PRMIA 2020 43 • Machine learning applications fail is due to the lack of rich, diverse and clean datasets needed to build models. • Historical datasets may be hard to acquire or may be skewed towards the majority class. • All plausible scenarios of the future haven’t happened yet! • Synthetic data used to enrich and augment existing datasets to provide comprehensive samples while training machine learning problems. Role of Data Augmentation
  • 45. www.prmia.org© PRMIA 2020 GPUs for Scaling REF : NVIDIA DLI Multi-GPU course slide deck
  • 46. www.prmia.org© PRMIA 2020 GPUs for Scaling REF : NVIDIA DLI Multi-GPU course slide deck
  • 47. www.prmia.org© PRMIA 2020 “TSNE Optimizations There are four optimizations used to improve the performance of TSNE on GPUs: 1. calculating higher dimensional probabilities with less GPU memory, 2. approximating higher dimensional probabilities, 3. reducing arithmetic operations, and 4. broadcasting along rows.” Ref: https://medium.com/rapids-ai/tsne-with-gpus-hours-to-seconds-9d9c17c941db Using GPUs requires GPU compatible code changes
  • 48. www.prmia.org© PRMIA 2020 Polling Question 3 • What kinds of ML tools do you use in your organization? a) None b) On-prem - Enterprise c) Cloud - Enterprise d) On-prem – Open Source e) Cloud – Open Source
  • 50. www.prmia.org© PRMIA 2020 The Reproducibility Challenge https://www.nature.com/news/1-500-scientists-lift-the-lid-on-reproducibility-1.19970
  • 51. www.prmia.org© PRMIA 2020 • Repeatability (Same team, same experimental setup) — The measurement can be obtained with stated precision by the same team using the same measurement procedure, the same measuring system, under the same operating conditions, in the same location on multiple trials. For computational experiments, this means that a researcher can reliably repeat her own computation. • Replicability (Different team, same experimental setup) — The measurement can be obtained with stated precision by a different team using the same measurement procedure, the same measuring system, under the same operating conditions, in the same or a different location on multiple trials. For computational experiments, this means that an independent group can obtain the same result using the author’s own artifacts. • Reproducibility (Different team, different experimental setup) — The measurement can be obtained with stated precision by a different team, a different measuring system, in a different location on multiple trials. For computational experiments, this means that an independent group can obtain the same result using artifacts which they develop completely independently. Repeatable or Reproducible or Replicable https://www.acm.org/publications/policies/artifact-review-badging
  • 53. www.prmia.org© PRMIA 2020 “Interpretability is the degree to which a human can consistently predict the model's result”1 What is the objective?2 • Simply be to get more useful information from the mode • Uncover causal structure in observational data • Transparency? Convergence? • Model complexity? • Culture? The Interpretability Challenge 1. https://christophm.github.io/interpretable-ml-book/interpretability.html 2. https://arxiv.org/abs/1606.03490
  • 54. www.prmia.org© PRMIA 2020 • Partial dependence plots (PDP) • Shapley Values • Lime (Local Interpretable Model-Agnostic Explanations) • SHAP (SHapley Additive exPlanations) Reference: https://christophm.github.io/interpretable-ml-book/ Shapley Values
  • 55. www.prmia.org© PRMIA 2020 • Partial dependence plots (PDP) show the dependence between the target response and a set of ‘target’ features, marginalizing over the values of all other features (the ‘complement’ features). • Intuitively, we can interpret the partial dependence as the expected target response as a function of the ‘target’ features. https://scikit-learn.org/stable/modules/partial_dependence.html The Interpretability Challenge
  • 56. www.prmia.org© PRMIA 2020 Which model to choose? Client Objective: • Build the best forecasting model that has a MAPE of 5% or less Result: · Regression – 7% MAPE · Neural Networks – 4% MAPE · Random Forest – 5% MAPE Client choice: · Regression despite being the worst of the top-3 models · “I won’t deploy anything that I don’t understand” Source: http://engineering.electrical-equipment.org/electrical-distribution/electric-load-forecasting-advantages-challenges.html
  • 58. www.prmia.org© PRMIA 2020 Testing for Machine Learning Models Figureref: http://www.actuaries.org/CTTEES_SOLV/Documents/StressTestingPaper.pdf
  • 60. www.prmia.org© PRMIA 2020 60 Can Machine Learning algorithms be gamed? https://www.youtube.com/watch?time_continue=36&v=MIbFvK2S9g8 https://arxiv.org/abs/1904.08653 84
  • 62. www.prmia.org© PRMIA 2020 Model Risk Assessment Framework
  • 63. www.prmia.org© PRMIA 2020 Quantifying Model Risk Is Important
  • 64. www.prmia.org© PRMIA 2020 RISKGRADING RiskScores Impact 5 5 10 15 20 25 4 4 8 12 16 20 3 3 6 9 12 15 2 2 4 6 8 10 1 1 2 3 4 5 1 2 3 4 5 Likelihood of occurrence Red High Risk Yellow Moderate Risk Green LowRisk High Impact- High likelihood of occurrence: Needs adequate model risk controlmeasures to mitigate risk High Impact – Lowlikelihood of occurrence:Address through model risk control measures and contingency plans Low Impact – High likelihood of occurrence : Lower priority model risk control measures LowImpact – Lowlikelihood of occurrence:Least prioritymodel risk control measures
  • 65. www.prmia.org© PRMIA 2020 Summary 1. ML Life cycle management 2. Tracking 3. Metadata management 4. Scaling 5. Reproducibility 6. Interpretability 7. Testing 8. Measurement
  • 66. www.prmia.org© PRMIA 2020 Up Next Case study: Using Synthetic Data for Model Validation
  • 67. www.prmia.org© PRMIA 2020 Polling Question 4 • Have you considered using Synthetic/Simulated data for testing and validating models? a) No b) Considering it c) Yes d) Tried it and decided not to use it
  • 68. www.prmia.org© PRMIA 2020 Synthetic Data • Synthetic data is "any production data applicable to a given situation that are not obtained by direct measurement.”1 • In finance, Synthetic data has been used in stress and scenario analysis for many years now. • Example: Montecarlo simulations have been used to generate future scenarios. • In Machine Learning, Synthetic Data plays an important role to prevent overfitting, handle imbalance class problems, and to accommodate plausible scenarios. 1 https://en.wikipedia.org/wiki/Synthetic_data
  • 69. www.prmia.org© PRMIA 2020 Challenges with Real Datasets All scenarios haven’t played out • Stress scenarios • What-if scenarios Figureref:http://www.actuaries.org/CTTEES_SOLV/Documents/StressTestingPaper.pdf
  • 70. www.prmia.org© PRMIA 2020 Access • Hard to find • Rare class problems • Privacy concerns making it difficult to share Challenges with Real Datasets Picture source: www.pixabay.com
  • 71. www.prmia.org© PRMIA 2020 Imbalanced • Need more samples of rare class • Need proxies for data points that were not observed or recorded Challenges with Real Datasets Picture source: www.pixabay.com
  • 72. www.prmia.org© PRMIA 2020 Synthetic Data in Finance Ref: Machine Learning for Asset Managers, Marcos M. López de Prado,,CAMBRIDGE UNIVERSITY PRESS 2020
  • 74. www.prmia.org© PRMIA 2020 MRM Use Cases • Data Anonymization — Anonymize training and test data sets for internal and external model validation • Data Augmentation — Augment sparse datasets with realistic datasets • Handling Imbalanced data classes — Handle Algorithmic bias and to test efficacy of model for rare-class problems • Stress and Scenario testing — Simulate test scenarios for extreme but plausible scenarios to test model behavior
  • 75. www.prmia.org© PRMIA 2020 VIX Characteristics REF: https://www.investopedia.com/terms/v/vix.asp
  • 76. www.prmia.org© PRMIA 2020 Demo: Synthetic VIX Generation
  • 77. www.prmia.org© PRMIA 2020 Up Next Demo If you would like access to the demo and the QuSandbox, please contact us at info@qusandbox.com.
  • 78. www.prmia.org© PRMIA 2020 Use Code MRMPRMIA for $100 off! Register here
  • 79. www.prmia.org© PRMIA 2020 QuantUniversity’s Model Risk related papers Email me at sri@quantuniversity.com for a copy
  • 80. www.prmia.org© PRMIA 2020 Q&A Sri Krishnamurthy, CFA, CAP Founder and CEO Information, data and drawings embodied in this presentation are strictly a property of QuantUniversity LLC. and shall not be distributed or used in any other publication without the prior written consent of QuantUniversity LLC.