SlideShare una empresa de Scribd logo
1 de 26
Descargar para leer sin conexión
WIFI SSID:SparkAISummit | Password: UnifiedAnalytics
Patrick Hall, H2O.ai
Beyond Reason Codes
A Blueprint for Human-Centered, Low-Risk ML
#UnifiedAnalytics #SparkAISummit
Contents
• Blueprint
• EDA
• Benchmark
• Training
• Post-Hoc Analysis
• Review
• Deployment
• Appeal
• Iterate
• Questions
3#UnifiedAnalytics #SparkAISummit
4#UnifiedAnalytics #SparkAISummit
Blueprint
5#UnifiedAnalytics #SparkAISummit
EDA and Data Visualization
• OSS: H2O-3 Aggregator
• References: Visualizing Big Data
Outliers through Distributed
Aggregation; The Grammar of
Graphics
Establish Benchmarks
6#UnifiedAnalytics #SparkAISummit
Establishing a benchmark from which to gauge improvements in accuracy, fairness, interpretability, privacy, or
security is crucial for good (“data”) science and for compliance.
7#UnifiedAnalytics #SparkAISummit
Manual, Private, Sparse or Straightforward Feature Engineering
• OSS: Pandas Profiler, Feature Tools
• References: Deep Feature
Synthesis: Towards Automating Data
Science Endeavors; Label, Segment,
Featurize: A Cross Domain
Framework for Prediction Engineering
Preprocessing for Fairness, Privacy or Security
8#UnifiedAnalytics #SparkAISummit
• OSS: IBM AI360
• References: Data Preprocessing
Techniques for Classification Without
Discrimination; Certifying and
Removing Disparate Impact;
Optimized Pre-processing for
Discrimination Prevention; Privacy-
Preserving Data Mining
Constrained, Fair, Interpretable, Private or Simple Models
9#UnifiedAnalytics #SparkAISummit
References: Explainable Neural
Networks Based on Additive Index
Models (XNN); Learning Fair
Representations (LFR); Locally
Interpretable Models and Effects Based
on Supervised Partitioning (LIME-SUP);
Private Aggregation of Teacher
Ensembles (PATE); Scalable Bayesian
Rule Lists (SBRL)
10#UnifiedAnalytics #SparkAISummit
Traditional Model Assessment and Diagnostics
• Residual analysis, Q-Q plots, AUC
and lift curves confirm model is
accurate and meets assumption
criteria.
• Implemented as model diagnostics in
Driverless AI.
11#UnifiedAnalytics #SparkAISummit
Post-hoc Explanations
• OSS: lime, shap
• References: Why Should I Trust
You?: Explaining the Predictions of
Any Classifier; A Unified Approach to
Interpreting Model Predictions;
Please Stop Explaining Black Box
Models for High Stakes Decisions
(criticism)
12#UnifiedAnalytics #SparkAISummit
Interlude: The Time–Tested Shapley Value
1. In the beginning: A Value for N-Person Games, 1953
2. Nobel-worthy contributions: The Shapley Value: Essays in Honor of Lloyd S.
Shapley, 1988
3. Shapley regression: Analysis of Regression in Game Theory Approach, 2001
4. First reference in ML? Fair Attribution of Functional Contribution in Artificial
and Biological Networks, 2004
5. Into the ML research mainstream, i.e. JMLR: An Efficient Explanation of
Individual Classifications Using Game Theory, 2010
6. Into the real-world data mining workflow ... finally: Consistent
Individualized Feature Attribution for Tree Ensembles, 2017
7. Unification: A Unified Approach to Interpreting Model Predictions, 2017
Model Debugging for Accuracy, Privacy or Security
13#UnifiedAnalytics #SparkAISummit
OSS: cleverhans, pdpbox, what-if tool
References: A Marauder’s Map of
Security and Privacy in Machine
Learning: An overview of current and
future research directions for making
machine learning secure and private;
Modeltracker: Redesigning Performance
Analysis Tools for Machine Learning;
The Security of Machine Learning
14#UnifiedAnalytics #SparkAISummit
Post-hoc Disparate Impact Assessment and Remediation
OSS: aequitas, IBM AI360 , themis
References: Equality of Opportunity in
Supervised Learning; Certifying and
Removing Disparate Impact
Human Review and Documentation
15#UnifiedAnalytics #SparkAISummit
References: Model Cards for Model
Reporting
16#UnifiedAnalytics #SparkAISummit
Deployment, Management and Monitoring
OSS: mlflow, modeldb, awesome-
machine-learning-ops metalist
Reference: Model DB: A System for
Machine Learning Model Management
Human Appeal
17#UnifiedAnalytics #SparkAISummit
Very important, may require custom implementation for each deployment environment?
Iterate: Use Gained Knowledge to Improve Accuracy, Fairness, Interpretability,
Privacy or Security
18#UnifiedAnalytics #SparkAISummit
Improvements, KPIs should not be restricted to accuracy alone.
Open Conceptual Questions
• How much automation is appropriate, 100%?
• How to automate learning by iteration,
reinforcement learning?
• How to implement human appeals, is it
productizable? Mobile Apps?
19#UnifiedAnalytics #SparkAISummit
Resources
In-Depth Open Source Interpretability Technique Examples:
https://github.com/jphall663/interpretable_machine_learning_with_python
"Awesome" Machine Learning Interpretability Resource List:
https://github.com/jphall663/awesome-machine-learning-interpretability
20#UnifiedAnalytics #SparkAISummit
References
Agrawal, Rakesh and Ramakrishnan Srikant (2000). “Privacy-Preserving Data Mining.” In: ACM Sigmod Record. Vol.
29. 2. URL: http://alme1.almaden.ibm.com/cs/projects/iis/hdb/Publications/papers/sigmod00_privacy.pdf. ACM, pp.
439–450.
Amershi, Saleema et al. (2015). “Modeltracker: Redesigning Performance Analysis Tools for Machine Learning.” In:
Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. URL:
https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/amershi.CHI2015.ModelTracker.pdf. ACM, pp.
337–346.
Barreno, Marco et al. (2010). “The Security of Machine Learning.” In: Machine Learning 81.2. URL:
https://people.eecs.berkeley.edu/~adj/publications/paper-files/SecML-MLJ2010.pdf, pp. 121–148.
Calmon, Flavio et al. (2017). “Optimized Pre-processing for Discrimination Prevention.” In: Advances in Neural
Information Processing Systems. URL: http://papers.nips.cc/paper/6988-optimized-pre-processing-for-discrimination-
prevention.pdf, pp. 3992–4001.
Feldman, Michael et al. (2015). “Certifying and Removing Disparate Impact.” In: Proceedings of the 21th ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining. URL: https://arxiv.org/pdf/1412.3756.pdf. ACM, pp.
259–268.
21#UnifiedAnalytics #SparkAISummit
References
Hardt, Moritz, Eric Price, Nati Srebro, et al. (2016). “Equality of Opportunity in Supervised Learning.” In: Advances in
neural information processing systems. URL:
http://papers.nips.cc/paper/6374-equality-of-opportunity-in-supervised-learning.pdf, pp. 3315–3323.
Hu, Linwei et al. (2018). “Locally Interpretable Models and Effects Based on Supervised Partitioning (LIME-SUP).” In:
arXiv preprint arXiv:1806.00663. URL: https://arxiv.org/ftp/arxiv/papers/1806/1806.00663.pdf.
Kamiran, Faisal and Toon Calders (2012). “Data Preprocessing Techniques for Classification Without Discrimination.”
In: Knowledge and Information Systems 33.1. URL: https://link.springer.com/content/pdf/10.1007/s10115-011-0463-
8.pdf, pp. 1–33.
Kanter, James Max, Owen Gillespie, and Kalyan Veeramachaneni (2016). “Label, Segment, Featurize: A Cross Domain
Framework for Prediction Engineering.” In: Data Science and Advanced Analytics (DSAA), 2016 IEEE International
Conference on. URL: http://www.jmaxkanter.com/static/papers/DSAA_LSF_2016.pdf. IEEE, pp. 430–439.
Kanter, James Max and Kalyan Veeramachaneni (2015). “Deep Feature Synthesis: Towards Automating Data Science
Endeavors.” In: Data Science and Advanced Analytics (DSAA), 2015. 36678 2015. IEEE International Conference on.
URL: https://groups.csail.mit.edu/EVO-DesignOpt/groupWebSite/uploads/Site/DSAA_DSM_2015.pdf. IEEE, pp. 1–10.
22#UnifiedAnalytics #SparkAISummit
References
Keinan, Alon et al. (2004). “Fair Attribution of Functional Contribution in Artificial and Biological Networks.” In: Neural
Computation 16.9. URL: https://www.researchgate.net/profile/Isaac_Meilijson/
publication/2474580_Fair_Attribution_of_Functional_Contribution_in_Artificial_and_
Biological_Networks/links/09e415146df8289373000000/Fair-Attribution-of-Functional-Contribution-in-Artificial-and-
Biolo, pp. 1887–1915.
Kononenko, Igor et al. (2010). “An Efficient Explanation of Individual Classifications Using Game Theory.” In: Journal of
Machine Learning Research 11.Jan. URL: http://www.jmlr.org/papers/volume11/strumbelj10a/strumbelj10a.pdf, pp. 1–
18.
Lipovetsky, Stan and Michael Conklin (2001). “Analysis of Regression in Game Theory Approach.” In: Applied
Stochastic Models in Business and Industry 17.4, pp. 319–330.
Lundberg, Scott M., Gabriel G. Erion, and Su-In Lee (2017). “Consistent Individualized Feature Attribution for Tree
Ensembles.” In: Proceedings of the 2017 ICML Workshop on Human Interpretability in Machine Learning (WHI 2017).
Ed. by Been Kim et al. URL: https://openreview.net/pdf?id=ByTKSo-m-. ICML WHI 2017, pp. 15–21.
Lundberg, Scott M and Su-In Lee (2017). “A Unified Approach to Interpreting Model Predictions.” In: Advances in Neural
Information Processing Systems 30. Ed. by I. Guyon et al. URL: http://papers.nips.cc/paper/7062-a-unified-approach-
to-interpreting-model-predictions.pdf. Curran Associates, Inc., pp. 4765–4774.
23#UnifiedAnalytics #SparkAISummit
References
Mitchell, Margaret et al. (2019). “Model Cards for Model Reporting.” In: Proceedings of the Conference on
Fairness, Accountability, and Transparency. URL: https://arxiv.org/pdf/1810.03993.pdf. ACM, pp. 220–229.
Papernot, Nicolas (2018). “A Marauder’s Map of Security and Privacy in Machine Learning: An overview of
current and future research directions for making machine learning secure and private.” In: Proceedings of the
11th ACM Workshop on Artificial Intelligence and Security. URL: https://arxiv.org/pdf/1811.01134.pdf. ACM.
Papernot, Nicolas et al. (2018). “Scalable Private Learning with PATE.” In: arXiv preprint arXiv:1802.08908.
URL: https://arxiv.org/pdf/1802.08908.pdf.
Ribeiro, Marco Tulio, Sameer Singh, and Carlos Guestrin (2016). “Why Should I Trust You?: Explaining the
Predictions of Any Classifier.” In: Proceedings of the 22nd ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining. URL: http://www.kdd.org/kdd2016/papers/files/rfp0573-ribeiroA.pdf.
ACM, pp. 1135–1144.
Rudin, Cynthia (2018). “Please Stop Explaining Black Box Models for High Stakes Decisions.” In: arXiv
preprint arXiv:1811.10154. URL: https://arxiv.org/pdf/1811.10154.pdf.
Shapley, Lloyd S (1953). “A Value for N-Person Games.” In: Contributions to the Theory of Games 2.28. URL:
http://www.library.fa.ru/files/Roth2.pdf#page=39, pp. 307–317.
24#UnifiedAnalytics #SparkAISummit
References
Shapley, Lloyd S, Alvin E Roth, et al. (1988). The Shapley Value: Essays in Honor of Lloyd S. Shapley. URL:
http://www.library.fa.ru/files/Roth2.pdf. Cambridge University Press.
Vartak, Manasi et al. (2016). “Model DB: A System for Machine Learning Model Management.” In:
Proceedings of the Workshop on Human-In-the-Loop Data Analytics. URL: https://www-
cs.stanford.edu/~matei/papers/2016/hilda_modeldb.pdf. ACM, p. 14.
Vaughan, Joel et al. (2018). “Explainable Neural Networks Based on Additive Index Models.” In: arXiv preprint
arXiv:1806.01933. URL: https://arxiv.org/pdf/1806.01933.pdf.
Wilkinson, Leland (2006). The Grammar of Graphics.
Wilkinson, Leland (2018). “Visualizing Big Data Outliers through Distributed Aggregation.” In: IEEE
Transactions on Visualization & Computer Graphics. URL:
https://www.cs.uic.edu/~wilkinson/Publications/outliers.pdf.
Yang, Hongyu, Cynthia Rudin, and Margo Seltzer (2017). “Scalable Bayesian Rule Lists.” In: Proceedings of
the 34th International Conference on Machine Learning (ICML). URL: https://arxiv.org/pdf/1602.08610.pdf.
Zemel, Rich et al. (2013). “Learning Fair Representations.” In: International Conference on Machine Learning.
URL: http://proceedings.mlr.press/v28/zemel13.pdf, pp. 325–333.
25#UnifiedAnalytics #SparkAISummit
DON’T FORGET TO RATE
AND REVIEW THE SESSIONS
SEARCH SPARK + AI SUMMIT

Más contenido relacionado

La actualidad más candente

Using Mask R CNN to Isolate PV Panels from Background Object in Images
Using Mask R CNN to Isolate PV Panels from Background Object in ImagesUsing Mask R CNN to Isolate PV Panels from Background Object in Images
Using Mask R CNN to Isolate PV Panels from Background Object in Images
ijtsrd
 

La actualidad más candente (20)

Deep Learning for Structure-from-Motion (SfM)
Deep Learning for Structure-from-Motion (SfM)Deep Learning for Structure-from-Motion (SfM)
Deep Learning for Structure-from-Motion (SfM)
 
Using Mask R CNN to Isolate PV Panels from Background Object in Images
Using Mask R CNN to Isolate PV Panels from Background Object in ImagesUsing Mask R CNN to Isolate PV Panels from Background Object in Images
Using Mask R CNN to Isolate PV Panels from Background Object in Images
 
A new image steganography algorithm based
A new image steganography algorithm basedA new image steganography algorithm based
A new image steganography algorithm based
 
Machine Learning @ NECST
Machine Learning @ NECSTMachine Learning @ NECST
Machine Learning @ NECST
 
Visual cryptography for hybrid approach
Visual cryptography for hybrid approachVisual cryptography for hybrid approach
Visual cryptography for hybrid approach
 
Geometric Deep Learning
Geometric Deep Learning Geometric Deep Learning
Geometric Deep Learning
 
Computational decision making
Computational decision makingComputational decision making
Computational decision making
 
PointNet
PointNetPointNet
PointNet
 
IRJET- Comparative Study of Different Techniques for Text as Well as Object D...
IRJET- Comparative Study of Different Techniques for Text as Well as Object D...IRJET- Comparative Study of Different Techniques for Text as Well as Object D...
IRJET- Comparative Study of Different Techniques for Text as Well as Object D...
 
Emerging 3D Scanning Technologies for PropTech
Emerging 3D Scanning Technologies for PropTechEmerging 3D Scanning Technologies for PropTech
Emerging 3D Scanning Technologies for PropTech
 
“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient
“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient
“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient
 
Mechanical
MechanicalMechanical
Mechanical
 
Image recognition
Image recognitionImage recognition
Image recognition
 
Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018
Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018
Medical Imaging at UPC - Elisa Sayrol - UPC Barcelona 2018
 
IRJET - Object Detection using Deep Learning with OpenCV and Python
IRJET - Object Detection using Deep Learning with OpenCV and PythonIRJET - Object Detection using Deep Learning with OpenCV and Python
IRJET - Object Detection using Deep Learning with OpenCV and Python
 
Applying Deep Learning with Weak and Noisy labels
Applying Deep Learning with Weak and Noisy labelsApplying Deep Learning with Weak and Noisy labels
Applying Deep Learning with Weak and Noisy labels
 
“Explainability in Computer Vision: A Machine Learning Engineer’s Overview,” ...
“Explainability in Computer Vision: A Machine Learning Engineer’s Overview,” ...“Explainability in Computer Vision: A Machine Learning Engineer’s Overview,” ...
“Explainability in Computer Vision: A Machine Learning Engineer’s Overview,” ...
 
Efficient Deep Learning in Communications
Efficient Deep Learning in CommunicationsEfficient Deep Learning in Communications
Efficient Deep Learning in Communications
 
Bhadale group of companies ai neural networks and algorithms catalogue
Bhadale group of companies ai neural networks and algorithms catalogueBhadale group of companies ai neural networks and algorithms catalogue
Bhadale group of companies ai neural networks and algorithms catalogue
 
Secure IoT Systems Monitor Framework using Probabilistic Image Encryption
Secure IoT Systems Monitor Framework using Probabilistic Image EncryptionSecure IoT Systems Monitor Framework using Probabilistic Image Encryption
Secure IoT Systems Monitor Framework using Probabilistic Image Encryption
 

Similar a Interpretable AI: Not Just For Regulators

Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
Marcel Kurovski
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
inovex GmbH
 
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
icwe2015
 

Similar a Interpretable AI: Not Just For Regulators (20)

202212APSEC.pptx.pdf
202212APSEC.pptx.pdf202212APSEC.pptx.pdf
202212APSEC.pptx.pdf
 
BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...
BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...
BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...
 
New Research Articles - 2018 November Issue-International Journal of Artifici...
New Research Articles - 2018 November Issue-International Journal of Artifici...New Research Articles - 2018 November Issue-International Journal of Artifici...
New Research Articles - 2018 November Issue-International Journal of Artifici...
 
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...
March 2024 - Top 10 Read Articles in Artificial Intelligence and Applications...
 
Classifier Model using Artificial Neural Network
Classifier Model using Artificial Neural NetworkClassifier Model using Artificial Neural Network
Classifier Model using Artificial Neural Network
 
January 2024 - Top 10 Read Articles in International Journal of Artificial In...
January 2024 - Top 10 Read Articles in International Journal of Artificial In...January 2024 - Top 10 Read Articles in International Journal of Artificial In...
January 2024 - Top 10 Read Articles in International Journal of Artificial In...
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
Synergy of Human and Artificial Intelligence in Software Engineering
Synergy of Human and Artificial Intelligence in Software EngineeringSynergy of Human and Artificial Intelligence in Software Engineering
Synergy of Human and Artificial Intelligence in Software Engineering
 
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
(Web User Interfaces track) "Getting the Query Right: User Interface Design o...
 
Quant university MRM and machine learning
Quant university MRM and machine learningQuant university MRM and machine learning
Quant university MRM and machine learning
 
AI meets Big Data
AI meets Big DataAI meets Big Data
AI meets Big Data
 
Improve ML Predictions using Connected Feature Extraction
Improve ML Predictions using Connected Feature ExtractionImprove ML Predictions using Connected Feature Extraction
Improve ML Predictions using Connected Feature Extraction
 
TUW-ASE Summer 2015 - Quality of Result-aware data analytics
TUW-ASE Summer 2015 - Quality of Result-aware data analyticsTUW-ASE Summer 2015 - Quality of Result-aware data analytics
TUW-ASE Summer 2015 - Quality of Result-aware data analytics
 
DEF CON 27 - BRENT STONE - reverse enginerring 17 cars
DEF CON 27 - BRENT STONE - reverse enginerring 17 carsDEF CON 27 - BRENT STONE - reverse enginerring 17 cars
DEF CON 27 - BRENT STONE - reverse enginerring 17 cars
 
POTENTIAL IMPACT OF GENERATIVE ARTIFICIAL INTELLIGENCE(AI) ON THE FINANCIAL I...
POTENTIAL IMPACT OF GENERATIVE ARTIFICIAL INTELLIGENCE(AI) ON THE FINANCIAL I...POTENTIAL IMPACT OF GENERATIVE ARTIFICIAL INTELLIGENCE(AI) ON THE FINANCIAL I...
POTENTIAL IMPACT OF GENERATIVE ARTIFICIAL INTELLIGENCE(AI) ON THE FINANCIAL I...
 
UNCOVERING FAKE NEWS BY MEANS OF SOCIAL NETWORK ANALYSIS
UNCOVERING FAKE NEWS BY MEANS OF SOCIAL NETWORK ANALYSISUNCOVERING FAKE NEWS BY MEANS OF SOCIAL NETWORK ANALYSIS
UNCOVERING FAKE NEWS BY MEANS OF SOCIAL NETWORK ANALYSIS
 
UNCOVERING FAKE NEWS BY MEANS OF SOCIAL NETWORK ANALYSIS
UNCOVERING FAKE NEWS BY MEANS OF SOCIAL NETWORK ANALYSISUNCOVERING FAKE NEWS BY MEANS OF SOCIAL NETWORK ANALYSIS
UNCOVERING FAKE NEWS BY MEANS OF SOCIAL NETWORK ANALYSIS
 
SBQS 2013 Keynote: Cooperative Testing and Analysis
SBQS 2013 Keynote: Cooperative Testing and AnalysisSBQS 2013 Keynote: Cooperative Testing and Analysis
SBQS 2013 Keynote: Cooperative Testing and Analysis
 
Regression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms ExcelRegression with Microsoft Azure & Ms Excel
Regression with Microsoft Azure & Ms Excel
 

Más de Databricks

Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
Databricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 

Más de Databricks (20)

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on Kubernetes
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature Aggregations
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta Lake
 

Último

Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
ranjankumarbehera14
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
q6pzkpark
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
vexqp
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
 

Último (20)

Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
一比一原版(曼大毕业证书)曼尼托巴大学毕业证成绩单留信学历认证一手价格
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptx
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxThe-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 

Interpretable AI: Not Just For Regulators

  • 1. WIFI SSID:SparkAISummit | Password: UnifiedAnalytics
  • 2. Patrick Hall, H2O.ai Beyond Reason Codes A Blueprint for Human-Centered, Low-Risk ML #UnifiedAnalytics #SparkAISummit
  • 3. Contents • Blueprint • EDA • Benchmark • Training • Post-Hoc Analysis • Review • Deployment • Appeal • Iterate • Questions 3#UnifiedAnalytics #SparkAISummit
  • 5. 5#UnifiedAnalytics #SparkAISummit EDA and Data Visualization • OSS: H2O-3 Aggregator • References: Visualizing Big Data Outliers through Distributed Aggregation; The Grammar of Graphics
  • 6. Establish Benchmarks 6#UnifiedAnalytics #SparkAISummit Establishing a benchmark from which to gauge improvements in accuracy, fairness, interpretability, privacy, or security is crucial for good (“data”) science and for compliance.
  • 7. 7#UnifiedAnalytics #SparkAISummit Manual, Private, Sparse or Straightforward Feature Engineering • OSS: Pandas Profiler, Feature Tools • References: Deep Feature Synthesis: Towards Automating Data Science Endeavors; Label, Segment, Featurize: A Cross Domain Framework for Prediction Engineering
  • 8. Preprocessing for Fairness, Privacy or Security 8#UnifiedAnalytics #SparkAISummit • OSS: IBM AI360 • References: Data Preprocessing Techniques for Classification Without Discrimination; Certifying and Removing Disparate Impact; Optimized Pre-processing for Discrimination Prevention; Privacy- Preserving Data Mining
  • 9. Constrained, Fair, Interpretable, Private or Simple Models 9#UnifiedAnalytics #SparkAISummit References: Explainable Neural Networks Based on Additive Index Models (XNN); Learning Fair Representations (LFR); Locally Interpretable Models and Effects Based on Supervised Partitioning (LIME-SUP); Private Aggregation of Teacher Ensembles (PATE); Scalable Bayesian Rule Lists (SBRL)
  • 10. 10#UnifiedAnalytics #SparkAISummit Traditional Model Assessment and Diagnostics • Residual analysis, Q-Q plots, AUC and lift curves confirm model is accurate and meets assumption criteria. • Implemented as model diagnostics in Driverless AI.
  • 11. 11#UnifiedAnalytics #SparkAISummit Post-hoc Explanations • OSS: lime, shap • References: Why Should I Trust You?: Explaining the Predictions of Any Classifier; A Unified Approach to Interpreting Model Predictions; Please Stop Explaining Black Box Models for High Stakes Decisions (criticism)
  • 12. 12#UnifiedAnalytics #SparkAISummit Interlude: The Time–Tested Shapley Value 1. In the beginning: A Value for N-Person Games, 1953 2. Nobel-worthy contributions: The Shapley Value: Essays in Honor of Lloyd S. Shapley, 1988 3. Shapley regression: Analysis of Regression in Game Theory Approach, 2001 4. First reference in ML? Fair Attribution of Functional Contribution in Artificial and Biological Networks, 2004 5. Into the ML research mainstream, i.e. JMLR: An Efficient Explanation of Individual Classifications Using Game Theory, 2010 6. Into the real-world data mining workflow ... finally: Consistent Individualized Feature Attribution for Tree Ensembles, 2017 7. Unification: A Unified Approach to Interpreting Model Predictions, 2017
  • 13. Model Debugging for Accuracy, Privacy or Security 13#UnifiedAnalytics #SparkAISummit OSS: cleverhans, pdpbox, what-if tool References: A Marauder’s Map of Security and Privacy in Machine Learning: An overview of current and future research directions for making machine learning secure and private; Modeltracker: Redesigning Performance Analysis Tools for Machine Learning; The Security of Machine Learning
  • 14. 14#UnifiedAnalytics #SparkAISummit Post-hoc Disparate Impact Assessment and Remediation OSS: aequitas, IBM AI360 , themis References: Equality of Opportunity in Supervised Learning; Certifying and Removing Disparate Impact
  • 15. Human Review and Documentation 15#UnifiedAnalytics #SparkAISummit References: Model Cards for Model Reporting
  • 16. 16#UnifiedAnalytics #SparkAISummit Deployment, Management and Monitoring OSS: mlflow, modeldb, awesome- machine-learning-ops metalist Reference: Model DB: A System for Machine Learning Model Management
  • 17. Human Appeal 17#UnifiedAnalytics #SparkAISummit Very important, may require custom implementation for each deployment environment?
  • 18. Iterate: Use Gained Knowledge to Improve Accuracy, Fairness, Interpretability, Privacy or Security 18#UnifiedAnalytics #SparkAISummit Improvements, KPIs should not be restricted to accuracy alone.
  • 19. Open Conceptual Questions • How much automation is appropriate, 100%? • How to automate learning by iteration, reinforcement learning? • How to implement human appeals, is it productizable? Mobile Apps? 19#UnifiedAnalytics #SparkAISummit
  • 20. Resources In-Depth Open Source Interpretability Technique Examples: https://github.com/jphall663/interpretable_machine_learning_with_python "Awesome" Machine Learning Interpretability Resource List: https://github.com/jphall663/awesome-machine-learning-interpretability 20#UnifiedAnalytics #SparkAISummit
  • 21. References Agrawal, Rakesh and Ramakrishnan Srikant (2000). “Privacy-Preserving Data Mining.” In: ACM Sigmod Record. Vol. 29. 2. URL: http://alme1.almaden.ibm.com/cs/projects/iis/hdb/Publications/papers/sigmod00_privacy.pdf. ACM, pp. 439–450. Amershi, Saleema et al. (2015). “Modeltracker: Redesigning Performance Analysis Tools for Machine Learning.” In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. URL: https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/amershi.CHI2015.ModelTracker.pdf. ACM, pp. 337–346. Barreno, Marco et al. (2010). “The Security of Machine Learning.” In: Machine Learning 81.2. URL: https://people.eecs.berkeley.edu/~adj/publications/paper-files/SecML-MLJ2010.pdf, pp. 121–148. Calmon, Flavio et al. (2017). “Optimized Pre-processing for Discrimination Prevention.” In: Advances in Neural Information Processing Systems. URL: http://papers.nips.cc/paper/6988-optimized-pre-processing-for-discrimination- prevention.pdf, pp. 3992–4001. Feldman, Michael et al. (2015). “Certifying and Removing Disparate Impact.” In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. URL: https://arxiv.org/pdf/1412.3756.pdf. ACM, pp. 259–268. 21#UnifiedAnalytics #SparkAISummit
  • 22. References Hardt, Moritz, Eric Price, Nati Srebro, et al. (2016). “Equality of Opportunity in Supervised Learning.” In: Advances in neural information processing systems. URL: http://papers.nips.cc/paper/6374-equality-of-opportunity-in-supervised-learning.pdf, pp. 3315–3323. Hu, Linwei et al. (2018). “Locally Interpretable Models and Effects Based on Supervised Partitioning (LIME-SUP).” In: arXiv preprint arXiv:1806.00663. URL: https://arxiv.org/ftp/arxiv/papers/1806/1806.00663.pdf. Kamiran, Faisal and Toon Calders (2012). “Data Preprocessing Techniques for Classification Without Discrimination.” In: Knowledge and Information Systems 33.1. URL: https://link.springer.com/content/pdf/10.1007/s10115-011-0463- 8.pdf, pp. 1–33. Kanter, James Max, Owen Gillespie, and Kalyan Veeramachaneni (2016). “Label, Segment, Featurize: A Cross Domain Framework for Prediction Engineering.” In: Data Science and Advanced Analytics (DSAA), 2016 IEEE International Conference on. URL: http://www.jmaxkanter.com/static/papers/DSAA_LSF_2016.pdf. IEEE, pp. 430–439. Kanter, James Max and Kalyan Veeramachaneni (2015). “Deep Feature Synthesis: Towards Automating Data Science Endeavors.” In: Data Science and Advanced Analytics (DSAA), 2015. 36678 2015. IEEE International Conference on. URL: https://groups.csail.mit.edu/EVO-DesignOpt/groupWebSite/uploads/Site/DSAA_DSM_2015.pdf. IEEE, pp. 1–10. 22#UnifiedAnalytics #SparkAISummit
  • 23. References Keinan, Alon et al. (2004). “Fair Attribution of Functional Contribution in Artificial and Biological Networks.” In: Neural Computation 16.9. URL: https://www.researchgate.net/profile/Isaac_Meilijson/ publication/2474580_Fair_Attribution_of_Functional_Contribution_in_Artificial_and_ Biological_Networks/links/09e415146df8289373000000/Fair-Attribution-of-Functional-Contribution-in-Artificial-and- Biolo, pp. 1887–1915. Kononenko, Igor et al. (2010). “An Efficient Explanation of Individual Classifications Using Game Theory.” In: Journal of Machine Learning Research 11.Jan. URL: http://www.jmlr.org/papers/volume11/strumbelj10a/strumbelj10a.pdf, pp. 1– 18. Lipovetsky, Stan and Michael Conklin (2001). “Analysis of Regression in Game Theory Approach.” In: Applied Stochastic Models in Business and Industry 17.4, pp. 319–330. Lundberg, Scott M., Gabriel G. Erion, and Su-In Lee (2017). “Consistent Individualized Feature Attribution for Tree Ensembles.” In: Proceedings of the 2017 ICML Workshop on Human Interpretability in Machine Learning (WHI 2017). Ed. by Been Kim et al. URL: https://openreview.net/pdf?id=ByTKSo-m-. ICML WHI 2017, pp. 15–21. Lundberg, Scott M and Su-In Lee (2017). “A Unified Approach to Interpreting Model Predictions.” In: Advances in Neural Information Processing Systems 30. Ed. by I. Guyon et al. URL: http://papers.nips.cc/paper/7062-a-unified-approach- to-interpreting-model-predictions.pdf. Curran Associates, Inc., pp. 4765–4774. 23#UnifiedAnalytics #SparkAISummit
  • 24. References Mitchell, Margaret et al. (2019). “Model Cards for Model Reporting.” In: Proceedings of the Conference on Fairness, Accountability, and Transparency. URL: https://arxiv.org/pdf/1810.03993.pdf. ACM, pp. 220–229. Papernot, Nicolas (2018). “A Marauder’s Map of Security and Privacy in Machine Learning: An overview of current and future research directions for making machine learning secure and private.” In: Proceedings of the 11th ACM Workshop on Artificial Intelligence and Security. URL: https://arxiv.org/pdf/1811.01134.pdf. ACM. Papernot, Nicolas et al. (2018). “Scalable Private Learning with PATE.” In: arXiv preprint arXiv:1802.08908. URL: https://arxiv.org/pdf/1802.08908.pdf. Ribeiro, Marco Tulio, Sameer Singh, and Carlos Guestrin (2016). “Why Should I Trust You?: Explaining the Predictions of Any Classifier.” In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. URL: http://www.kdd.org/kdd2016/papers/files/rfp0573-ribeiroA.pdf. ACM, pp. 1135–1144. Rudin, Cynthia (2018). “Please Stop Explaining Black Box Models for High Stakes Decisions.” In: arXiv preprint arXiv:1811.10154. URL: https://arxiv.org/pdf/1811.10154.pdf. Shapley, Lloyd S (1953). “A Value for N-Person Games.” In: Contributions to the Theory of Games 2.28. URL: http://www.library.fa.ru/files/Roth2.pdf#page=39, pp. 307–317. 24#UnifiedAnalytics #SparkAISummit
  • 25. References Shapley, Lloyd S, Alvin E Roth, et al. (1988). The Shapley Value: Essays in Honor of Lloyd S. Shapley. URL: http://www.library.fa.ru/files/Roth2.pdf. Cambridge University Press. Vartak, Manasi et al. (2016). “Model DB: A System for Machine Learning Model Management.” In: Proceedings of the Workshop on Human-In-the-Loop Data Analytics. URL: https://www- cs.stanford.edu/~matei/papers/2016/hilda_modeldb.pdf. ACM, p. 14. Vaughan, Joel et al. (2018). “Explainable Neural Networks Based on Additive Index Models.” In: arXiv preprint arXiv:1806.01933. URL: https://arxiv.org/pdf/1806.01933.pdf. Wilkinson, Leland (2006). The Grammar of Graphics. Wilkinson, Leland (2018). “Visualizing Big Data Outliers through Distributed Aggregation.” In: IEEE Transactions on Visualization & Computer Graphics. URL: https://www.cs.uic.edu/~wilkinson/Publications/outliers.pdf. Yang, Hongyu, Cynthia Rudin, and Margo Seltzer (2017). “Scalable Bayesian Rule Lists.” In: Proceedings of the 34th International Conference on Machine Learning (ICML). URL: https://arxiv.org/pdf/1602.08610.pdf. Zemel, Rich et al. (2013). “Learning Fair Representations.” In: International Conference on Machine Learning. URL: http://proceedings.mlr.press/v28/zemel13.pdf, pp. 325–333. 25#UnifiedAnalytics #SparkAISummit
  • 26. DON’T FORGET TO RATE AND REVIEW THE SESSIONS SEARCH SPARK + AI SUMMIT