SlideShare una empresa de Scribd logo
1 de 20
Development of	Explainable NLP	
Models:	"You	show	me	the	man	and	
I	will show	you the	rule"
SVITLANA GALESHCHUK
About	me
vData	Scientist:	6	years
vIn	NLP:	3	years
vFulbright Scholar in	2015-2016,	USA
vVisiting Associate Prof.	at	University of	Grenoble,	France	2017
vData	Scientist,	Lecturer and	Researcher at	PSL/University of	Paris	
Dauphine,	France,	since 2017
vData	Scientist at	Starclay Consulting,	France,	since 2019
vEmail:	svitlana.galeshchuk@gmail.com
Nov	5,	2020																																																																																																						UA	Online	Data	Science	Marathon
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
NLP — Natural	Language	“Processing” =	
NLU — Natural	Language	“Understanding” (Sentiment	Analysis,	Topic	
Classification,	Entity	Detection)	 +
NLG — Natural	Language	“Generation”	(textual	summaries,	etc)	
I.	What is NLP	?
Nov	5,	2020																																																																																																						UA	Online	Data	Science	Marathon
Word	Embedding
NLP	:	Natural	Language	Processing
• 2001 : Neural language models: word embedding
> converting the words into vectors
Bengio, Y., Ducharme, R. & Vincent, P. A. Neural probabilistic language model.
Proc. Advances. Neural Information Processing Systems 13. 932–938 (2001)
• 2013 : Model Word2vec : Linguistic Contextualisation of words
> Predict the word based on the context
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. Distributed representations of words and phrases
and their compositionality. Advances in neural information processing systems. 3111-3119 (2013)
• 2018 : Le modèle révolutionnaire BERT de Google
> Bidirectional Encoder Representations from Transformers
[1]	Vaswani,	A.,	Shazeer,	N.,	Parmar,	N.,	Uszkoreit,	J.,	Jones,	L.,	Gomez,	A.	N.,	...	&	Polosukhin,	I.	Attention	is all	you need.	
Advances in	neural	information	processing systems.	5998-6008.	(2017)
[2]	Devlin,	Jacob,	et	al.	Bert:	Pre-training	of	deep bidirectional transformers for	language understanding.	arXiv preprint
arXiv:1810.04805 (2018)
BERT
Nov	5,	2020																																																																																																						UA	Online	Data	Science	Marathon
ØText is a	set	of	words;
ØWords are	discrete values,	hence the	curse of	dimensionality;
ØEmbedding (converting words into vectors)	is the	way to	use	text in	
ML;
ØAutoregressive nature	of	natural language makes ML	practitioners to	
often use	LSTM	in	NLP	tasks;
ØBERT	being a	major	breakthrough since 2017	is difficult to	put	into
production;	it is good	for	texts less than 512	tokens.
To	retain:	
Nov	5,	2020																																																																																																						UA	Online	Data	Science	Marathon
II.	Explainable AI
Nov	5,	2020																																																																																																						UA	Online	Data	Science	Marathon
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
Explainable AI	Methods
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
Ribeiro,	M.	T.,	Singh,	S.,	&	Guestrin,	C.	"	Why should i	trust	you?"	Explaining the	predictions of	any classifier.	
In Proceedings of	the	22nd	ACM	SIGKDD	international	conference on	knowledge discovery and	data	mining.
1135-1144.	(2016)
LIME	Intuition
LIME
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
+	LIME	delivers only
local	explanations
LIME	Disadvantages
Medical records
(14.000	patients)
Comments of	the	clinical history
of	patient,	lifestyle and	the	
symptomes
(features)
Motive of	
hospitalisation
(features)
Principal	diagnosis
(target)
Used Data
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
Use	Case:	Hospital Data
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
Ingelsson, E., Lundholm, C., Johansson, A. L., & Altman, D. Hysterectomy and risk of cardiovascular disease: a
population-based cohort study. European heart journal, 32(6), 745-750. (2011)
Laughlin-Tommaso, S. K., Khan, Z., Weaver, A. L., Smith, C. Y., Rocca, W. A., & Stewart, E. A. Cardiovascular and
metabolic morbidity after hysterectomy with ovarian conservation: a cohort study. Menopause (New York,
NY), 25(5), 483. (2018)
« Women who have had a hysterectomy, especially before the age of 35, have a higher
risk of having a stroke. About 70,000 hysterectomies are performed each year in France »
Stroke (863 patients) :
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
Coumbaras, M., A. Duval, P. Le Hir, N. Jomaah, L. Arrivé, and J. M. Tubiana. "Fibrolipome du filum terminal." J Radiol 84.
721-7222 (2003)
«When the lipoma is located in the thoracic region, it can be responsible for chronic back pain
and sometimes headaches»
Low back pain (1040 observations) :
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
« the	Shapley	value:	It	is the	average of	the	marginal	contributions	across all	
permutations »
« What Shapley	does is quantifying the	contribution	that each player brings to	the	game.	
What SHAP	does is quantifying the	contribution	that each feature brings to	the	prediction
made	by	the	model »
SHAP:	both local	and	global	explainability
Lundberg,	Scott	M.,	and	Su-In	Lee.	"A	unified approach to	interpreting model	predictions." Advances
in	neural	information	processing systems.	2017.
Shap Local	Results
SHAP
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
LIME	and	SHAP	into generating innocuous explanations which do	not	reflect the	
underlying biases
Takes a	long	time	to	compute.	For	large	datasets,	it is computationally expensive
to	use	the	entire dataset and	we have	to	rely on	approximations	(e.g.,	
subsample the	data).	This	has	implications	for	the	accuracy of	the	explanation.
Original	SHAP	implementation has	issues	with visualization when more	than 20	
words are	in	the	text:
Slack,	Dylan,	et	al.	"Fooling lime	and	shap:	Adversarial attacks on	post	hoc	explanation methods." Proceedings of	the	
AAAI/ACM	Conference on	AI,	Ethics,	and	Society.	2020.
SHAP	Disadvantages
Sundararajan,	M.,	Taly,	A.,	&	Yan,	Q.	(2017).	Axiomatic attribution	for	deep networks. arXiv preprint
arXiv:1703.01365.	
Mudrakarta,	Pramod Kaushik,	et	al.	"Did the	model	understand the	question?." arXiv preprint
arXiv:1805.05492 (2018).:
« As	the	input	varies	along the	straight	line	path between the	baseline and	the	
input	at	hand,	the	prediction moves	along a	trajectory from uncertainty to	
certainty (the	final	prediction probability).	At	each point	on	this trajectory,	one	
can use	the	gradient	with respect	to	the	input	features to	attribute the	change	in	
the	prediction probability back	to	the	input	features.	IG	aggregates these
gradients	along the	trajectory using a	path integral »
Øapt for	all	differentiable models;
Øeasy to	implement;
Øcomputationally scalable to	massive	deep networks;
Ømuch faster than a	naive Shapley-value-based method
INTEGRATED	GRADIENTS
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
§ Not	working with non-differentiable model	types	(random forest,	etc);
§ Some counterintuitive explanations
IG	Disadvantages
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
DeepLIFT proceeds in	a	backward fashion.	Each unit	is assigned an	
attribution	that represents the	relative	effect of	the	unit	activated at	the	
original	network	input	x	compared to	the	activation	at	some reference
input.	Reference	values	for	all	hidden units are	determined running	a	
forward pass through the	network,	using the	baseline as	input,	and	
recording the	activation	of	each unit
Pros:	very fast
Cons:	picking	the	baseline inputs
Gabriel	Tseng
Nov	5,	2020																																																																																																																		 UA	Online	Data	Science	Marathon
DeepLIFT
Integrated	Gradients: Mukund Sundararajan,	Ankur Taly,	Qiqi
Yan, Axiomatic	Attribution	for	Deep	Networks,	2017
DeepLIFT: Avanti Shrikumar,	Peyton Greenside,	Anshul
Kundaje, Learning	Important	Features	Through	Propagating	Activation	
Differences,	2017
SHAP	values: Scott	M.	Lundberg,	Su-In	Lee, A	Unified	Approach	to	
Interpreting	Model	Predictions,	2017
LIME:	Ribeiro,	M.	T.,	Singh,	S.,	&	Guestrin,	C. Why should i	trust	you?								
Explaining the	predictions of	any classifier	(2016)
Literature
Nov	5,	2020																																																																																																						UA	Online	Data	Science	Marathon
•Explanation of	the	black-box	models’	outputs	is an	important step towards
making the	bridge	between the	model	and	its end-user;
•Explainable AI	methods may deliver global	or/and	local	interpretaions;
•Most	of	the	current approaches are	based on	the	cooperative game theory;
•Validation of	interpretations is usually provided by	field experts.	Kullback-
Leibler divergence	are	sometimes used to	assess the	interpretations.
•Python	implementation:
Shap:	https://github.com/slundberg/shap
LIME:	https://eli5.readthedocs.io/en/latest/overview.html
IG,	DeepLIFT:	https://captum.ai/
To	retain:
Nov	5,	2020																																																																																																						UA	Online	Data	Science	Marathon

Más contenido relacionado

Similar a Svitlana Galeshchuk Development of Explainable NLP Models: "You show me the man and I will show you the rule"

Talk on reproducibility in EEG research
Talk on reproducibility in EEG researchTalk on reproducibility in EEG research
Talk on reproducibility in EEG researchDorothy Bishop
 
Data and Knowledge as Commodities
Data and Knowledge as CommoditiesData and Knowledge as Commodities
Data and Knowledge as CommoditiesMathieu d'Aquin
 
(I’ll GO OVER STEP BY STEP IN CLASS TOMORROW)Part OneP.docx
(I’ll GO OVER STEP BY STEP IN CLASS TOMORROW)Part OneP.docx(I’ll GO OVER STEP BY STEP IN CLASS TOMORROW)Part OneP.docx
(I’ll GO OVER STEP BY STEP IN CLASS TOMORROW)Part OneP.docxgertrudebellgrove
 
Depression Analysis of Bangla Social Media Data using Gated Recurrent Neural ...
Depression Analysis of Bangla Social Media Data using Gated Recurrent Neural ...Depression Analysis of Bangla Social Media Data using Gated Recurrent Neural ...
Depression Analysis of Bangla Social Media Data using Gated Recurrent Neural ...A. Hasib Uddin
 
If only access were our only infrastructure problem!
If only access were our only infrastructure problem!If only access were our only infrastructure problem!
If only access were our only infrastructure problem!Björn Brembs
 
Towards reproducibility and maximally-open data
Towards reproducibility and maximally-open dataTowards reproducibility and maximally-open data
Towards reproducibility and maximally-open dataPablo Bernabeu
 
Normative Modeling & Patients Stratifications: Dealing with Dimensions & Cat...
Normative Modeling & Patients Stratifications:  Dealing with Dimensions & Cat...Normative Modeling & Patients Stratifications:  Dealing with Dimensions & Cat...
Normative Modeling & Patients Stratifications: Dealing with Dimensions & Cat...Guillaume Dumas
 
Cognitive Computing at University Osnabrück
Cognitive Computing at University OsnabrückCognitive Computing at University Osnabrück
Cognitive Computing at University OsnabrückSteven Miller
 
The future of scholarly publishing
The future of scholarly publishingThe future of scholarly publishing
The future of scholarly publishingBjörn Brembs
 
Lariviere - Unraveling gender disparities in science
Lariviere - Unraveling gender disparities in scienceLariviere - Unraveling gender disparities in science
Lariviere - Unraveling gender disparities in scienceinnovationoecd
 
Keynote at VR in Science and Industry
Keynote at VR in Science and Industry Keynote at VR in Science and Industry
Keynote at VR in Science and Industry Christian Sandor
 
CV_JNorris_Oct2016
CV_JNorris_Oct2016CV_JNorris_Oct2016
CV_JNorris_Oct2016Jade Norris
 
BIOSKETCH
BIOSKETCHBIOSKETCH
BIOSKETCHbutest
 
Humanizing bioinformatics
Humanizing bioinformaticsHumanizing bioinformatics
Humanizing bioinformaticsJan Aerts
 

Similar a Svitlana Galeshchuk Development of Explainable NLP Models: "You show me the man and I will show you the rule" (20)

Haladjian CV
Haladjian CVHaladjian CV
Haladjian CV
 
Talk on reproducibility in EEG research
Talk on reproducibility in EEG researchTalk on reproducibility in EEG research
Talk on reproducibility in EEG research
 
Data and Knowledge as Commodities
Data and Knowledge as CommoditiesData and Knowledge as Commodities
Data and Knowledge as Commodities
 
(I’ll GO OVER STEP BY STEP IN CLASS TOMORROW)Part OneP.docx
(I’ll GO OVER STEP BY STEP IN CLASS TOMORROW)Part OneP.docx(I’ll GO OVER STEP BY STEP IN CLASS TOMORROW)Part OneP.docx
(I’ll GO OVER STEP BY STEP IN CLASS TOMORROW)Part OneP.docx
 
2014 mmg-talk
2014 mmg-talk2014 mmg-talk
2014 mmg-talk
 
Depression Analysis of Bangla Social Media Data using Gated Recurrent Neural ...
Depression Analysis of Bangla Social Media Data using Gated Recurrent Neural ...Depression Analysis of Bangla Social Media Data using Gated Recurrent Neural ...
Depression Analysis of Bangla Social Media Data using Gated Recurrent Neural ...
 
thesis_final.pdf
thesis_final.pdfthesis_final.pdf
thesis_final.pdf
 
If only access were our only infrastructure problem!
If only access were our only infrastructure problem!If only access were our only infrastructure problem!
If only access were our only infrastructure problem!
 
Towards reproducibility and maximally-open data
Towards reproducibility and maximally-open dataTowards reproducibility and maximally-open data
Towards reproducibility and maximally-open data
 
Normative Modeling & Patients Stratifications: Dealing with Dimensions & Cat...
Normative Modeling & Patients Stratifications:  Dealing with Dimensions & Cat...Normative Modeling & Patients Stratifications:  Dealing with Dimensions & Cat...
Normative Modeling & Patients Stratifications: Dealing with Dimensions & Cat...
 
Cognitive Computing at University Osnabrück
Cognitive Computing at University OsnabrückCognitive Computing at University Osnabrück
Cognitive Computing at University Osnabrück
 
The future of scholarly publishing
The future of scholarly publishingThe future of scholarly publishing
The future of scholarly publishing
 
Lariviere - Unraveling gender disparities in science
Lariviere - Unraveling gender disparities in scienceLariviere - Unraveling gender disparities in science
Lariviere - Unraveling gender disparities in science
 
One Perceptron to Rule Them All: Language and Vision
One Perceptron to Rule Them All: Language and VisionOne Perceptron to Rule Them All: Language and Vision
One Perceptron to Rule Them All: Language and Vision
 
Keynote at VR in Science and Industry
Keynote at VR in Science and Industry Keynote at VR in Science and Industry
Keynote at VR in Science and Industry
 
CV_JNorris_Oct2016
CV_JNorris_Oct2016CV_JNorris_Oct2016
CV_JNorris_Oct2016
 
Measuring Research Impact
Measuring Research ImpactMeasuring Research Impact
Measuring Research Impact
 
Quality of Life Technologies: From Cure to Care
Quality of Life Technologies: From Cure to CareQuality of Life Technologies: From Cure to Care
Quality of Life Technologies: From Cure to Care
 
BIOSKETCH
BIOSKETCHBIOSKETCH
BIOSKETCH
 
Humanizing bioinformatics
Humanizing bioinformaticsHumanizing bioinformatics
Humanizing bioinformatics
 

Más de Lviv Startup Club

Artem Bykovets: 4 Вершники апокаліпсису робочих стосунків (+антидоти до них) ...
Artem Bykovets: 4 Вершники апокаліпсису робочих стосунків (+антидоти до них) ...Artem Bykovets: 4 Вершники апокаліпсису робочих стосунків (+антидоти до них) ...
Artem Bykovets: 4 Вершники апокаліпсису робочих стосунків (+антидоти до них) ...Lviv Startup Club
 
Dmytro Khudenko: Challenges of implementing task managers in the corporate an...
Dmytro Khudenko: Challenges of implementing task managers in the corporate an...Dmytro Khudenko: Challenges of implementing task managers in the corporate an...
Dmytro Khudenko: Challenges of implementing task managers in the corporate an...Lviv Startup Club
 
Sergii Melnichenko: Лідерство в Agile командах: ТОП-5 основних психологічних ...
Sergii Melnichenko: Лідерство в Agile командах: ТОП-5 основних психологічних ...Sergii Melnichenko: Лідерство в Agile командах: ТОП-5 основних психологічних ...
Sergii Melnichenko: Лідерство в Agile командах: ТОП-5 основних психологічних ...Lviv Startup Club
 
Mariia Rashkevych: Підвищення ефективності розроблення та реалізації освітніх...
Mariia Rashkevych: Підвищення ефективності розроблення та реалізації освітніх...Mariia Rashkevych: Підвищення ефективності розроблення та реалізації освітніх...
Mariia Rashkevych: Підвищення ефективності розроблення та реалізації освітніх...Lviv Startup Club
 
Mykhailo Hryhorash: What can be good in a "bad" project? (UA)
Mykhailo Hryhorash: What can be good in a "bad" project? (UA)Mykhailo Hryhorash: What can be good in a "bad" project? (UA)
Mykhailo Hryhorash: What can be good in a "bad" project? (UA)Lviv Startup Club
 
Oleksii Kyselov: Що заважає ПМу зростати? Розбір практичних кейсів (UA)
Oleksii Kyselov: Що заважає ПМу зростати? Розбір практичних кейсів (UA)Oleksii Kyselov: Що заважає ПМу зростати? Розбір практичних кейсів (UA)
Oleksii Kyselov: Що заважає ПМу зростати? Розбір практичних кейсів (UA)Lviv Startup Club
 
Yaroslav Osolikhin: «Неідеальний» проєктний менеджер: People Management під ч...
Yaroslav Osolikhin: «Неідеальний» проєктний менеджер: People Management під ч...Yaroslav Osolikhin: «Неідеальний» проєктний менеджер: People Management під ч...
Yaroslav Osolikhin: «Неідеальний» проєктний менеджер: People Management під ч...Lviv Startup Club
 
Mariya Yeremenko: Вплив Генеративного ШІ на сучасний світ та на особисту ефек...
Mariya Yeremenko: Вплив Генеративного ШІ на сучасний світ та на особисту ефек...Mariya Yeremenko: Вплив Генеративного ШІ на сучасний світ та на особисту ефек...
Mariya Yeremenko: Вплив Генеративного ШІ на сучасний світ та на особисту ефек...Lviv Startup Club
 
Petro Nikolaiev & Dmytro Kisov: ТОП-5 методів дослідження клієнтів для успіху...
Petro Nikolaiev & Dmytro Kisov: ТОП-5 методів дослідження клієнтів для успіху...Petro Nikolaiev & Dmytro Kisov: ТОП-5 методів дослідження клієнтів для успіху...
Petro Nikolaiev & Dmytro Kisov: ТОП-5 методів дослідження клієнтів для успіху...Lviv Startup Club
 
Maksym Stelmakh : Державні електронні послуги та сервіси: чому бізнесу варто ...
Maksym Stelmakh : Державні електронні послуги та сервіси: чому бізнесу варто ...Maksym Stelmakh : Державні електронні послуги та сервіси: чому бізнесу варто ...
Maksym Stelmakh : Державні електронні послуги та сервіси: чому бізнесу варто ...Lviv Startup Club
 
Alexander Marchenko: Проблеми росту продуктової екосистеми (UA)
Alexander Marchenko: Проблеми росту продуктової екосистеми (UA)Alexander Marchenko: Проблеми росту продуктової екосистеми (UA)
Alexander Marchenko: Проблеми росту продуктової екосистеми (UA)Lviv Startup Club
 
Oleksandr Grytsenko: Save your Job або прокачай скіли до Engineering Manageme...
Oleksandr Grytsenko: Save your Job або прокачай скіли до Engineering Manageme...Oleksandr Grytsenko: Save your Job або прокачай скіли до Engineering Manageme...
Oleksandr Grytsenko: Save your Job або прокачай скіли до Engineering Manageme...Lviv Startup Club
 
Yuliia Pieskova: Фідбек: не лише "як", але й "коли" і "навіщо" (UA)
Yuliia Pieskova: Фідбек: не лише "як", але й "коли" і "навіщо" (UA)Yuliia Pieskova: Фідбек: не лише "як", але й "коли" і "навіщо" (UA)
Yuliia Pieskova: Фідбек: не лише "як", але й "коли" і "навіщо" (UA)Lviv Startup Club
 
Nataliya Kryvonis: Essential soft skills to lead your team (UA)
Nataliya Kryvonis: Essential soft skills to lead your team (UA)Nataliya Kryvonis: Essential soft skills to lead your team (UA)
Nataliya Kryvonis: Essential soft skills to lead your team (UA)Lviv Startup Club
 
Volodymyr Salyha: Stakeholder Alchemy: Transforming Analysis into Meaningful ...
Volodymyr Salyha: Stakeholder Alchemy: Transforming Analysis into Meaningful ...Volodymyr Salyha: Stakeholder Alchemy: Transforming Analysis into Meaningful ...
Volodymyr Salyha: Stakeholder Alchemy: Transforming Analysis into Meaningful ...Lviv Startup Club
 
Anna Chalyuk: 7 інструментів та принципів, які допоможуть зробити вашу команд...
Anna Chalyuk: 7 інструментів та принципів, які допоможуть зробити вашу команд...Anna Chalyuk: 7 інструментів та принципів, які допоможуть зробити вашу команд...
Anna Chalyuk: 7 інструментів та принципів, які допоможуть зробити вашу команд...Lviv Startup Club
 
Oksana Smilka: Цінності, цілі та (де) мотивація (UA)
Oksana Smilka: Цінності, цілі та (де) мотивація (UA)Oksana Smilka: Цінності, цілі та (де) мотивація (UA)
Oksana Smilka: Цінності, цілі та (де) мотивація (UA)Lviv Startup Club
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Lviv Startup Club
 
Andrii Skoromnyi: Чому не працює методика "5 Чому?" – і яка є альтернатива? (UA)
Andrii Skoromnyi: Чому не працює методика "5 Чому?" – і яка є альтернатива? (UA)Andrii Skoromnyi: Чому не працює методика "5 Чому?" – і яка є альтернатива? (UA)
Andrii Skoromnyi: Чому не працює методика "5 Чому?" – і яка є альтернатива? (UA)Lviv Startup Club
 
Maryna Sokyrko & Oleksandr Chugui: Building Product Passion: Developing AI ch...
Maryna Sokyrko & Oleksandr Chugui: Building Product Passion: Developing AI ch...Maryna Sokyrko & Oleksandr Chugui: Building Product Passion: Developing AI ch...
Maryna Sokyrko & Oleksandr Chugui: Building Product Passion: Developing AI ch...Lviv Startup Club
 

Más de Lviv Startup Club (20)

Artem Bykovets: 4 Вершники апокаліпсису робочих стосунків (+антидоти до них) ...
Artem Bykovets: 4 Вершники апокаліпсису робочих стосунків (+антидоти до них) ...Artem Bykovets: 4 Вершники апокаліпсису робочих стосунків (+антидоти до них) ...
Artem Bykovets: 4 Вершники апокаліпсису робочих стосунків (+антидоти до них) ...
 
Dmytro Khudenko: Challenges of implementing task managers in the corporate an...
Dmytro Khudenko: Challenges of implementing task managers in the corporate an...Dmytro Khudenko: Challenges of implementing task managers in the corporate an...
Dmytro Khudenko: Challenges of implementing task managers in the corporate an...
 
Sergii Melnichenko: Лідерство в Agile командах: ТОП-5 основних психологічних ...
Sergii Melnichenko: Лідерство в Agile командах: ТОП-5 основних психологічних ...Sergii Melnichenko: Лідерство в Agile командах: ТОП-5 основних психологічних ...
Sergii Melnichenko: Лідерство в Agile командах: ТОП-5 основних психологічних ...
 
Mariia Rashkevych: Підвищення ефективності розроблення та реалізації освітніх...
Mariia Rashkevych: Підвищення ефективності розроблення та реалізації освітніх...Mariia Rashkevych: Підвищення ефективності розроблення та реалізації освітніх...
Mariia Rashkevych: Підвищення ефективності розроблення та реалізації освітніх...
 
Mykhailo Hryhorash: What can be good in a "bad" project? (UA)
Mykhailo Hryhorash: What can be good in a "bad" project? (UA)Mykhailo Hryhorash: What can be good in a "bad" project? (UA)
Mykhailo Hryhorash: What can be good in a "bad" project? (UA)
 
Oleksii Kyselov: Що заважає ПМу зростати? Розбір практичних кейсів (UA)
Oleksii Kyselov: Що заважає ПМу зростати? Розбір практичних кейсів (UA)Oleksii Kyselov: Що заважає ПМу зростати? Розбір практичних кейсів (UA)
Oleksii Kyselov: Що заважає ПМу зростати? Розбір практичних кейсів (UA)
 
Yaroslav Osolikhin: «Неідеальний» проєктний менеджер: People Management під ч...
Yaroslav Osolikhin: «Неідеальний» проєктний менеджер: People Management під ч...Yaroslav Osolikhin: «Неідеальний» проєктний менеджер: People Management під ч...
Yaroslav Osolikhin: «Неідеальний» проєктний менеджер: People Management під ч...
 
Mariya Yeremenko: Вплив Генеративного ШІ на сучасний світ та на особисту ефек...
Mariya Yeremenko: Вплив Генеративного ШІ на сучасний світ та на особисту ефек...Mariya Yeremenko: Вплив Генеративного ШІ на сучасний світ та на особисту ефек...
Mariya Yeremenko: Вплив Генеративного ШІ на сучасний світ та на особисту ефек...
 
Petro Nikolaiev & Dmytro Kisov: ТОП-5 методів дослідження клієнтів для успіху...
Petro Nikolaiev & Dmytro Kisov: ТОП-5 методів дослідження клієнтів для успіху...Petro Nikolaiev & Dmytro Kisov: ТОП-5 методів дослідження клієнтів для успіху...
Petro Nikolaiev & Dmytro Kisov: ТОП-5 методів дослідження клієнтів для успіху...
 
Maksym Stelmakh : Державні електронні послуги та сервіси: чому бізнесу варто ...
Maksym Stelmakh : Державні електронні послуги та сервіси: чому бізнесу варто ...Maksym Stelmakh : Державні електронні послуги та сервіси: чому бізнесу варто ...
Maksym Stelmakh : Державні електронні послуги та сервіси: чому бізнесу варто ...
 
Alexander Marchenko: Проблеми росту продуктової екосистеми (UA)
Alexander Marchenko: Проблеми росту продуктової екосистеми (UA)Alexander Marchenko: Проблеми росту продуктової екосистеми (UA)
Alexander Marchenko: Проблеми росту продуктової екосистеми (UA)
 
Oleksandr Grytsenko: Save your Job або прокачай скіли до Engineering Manageme...
Oleksandr Grytsenko: Save your Job або прокачай скіли до Engineering Manageme...Oleksandr Grytsenko: Save your Job або прокачай скіли до Engineering Manageme...
Oleksandr Grytsenko: Save your Job або прокачай скіли до Engineering Manageme...
 
Yuliia Pieskova: Фідбек: не лише "як", але й "коли" і "навіщо" (UA)
Yuliia Pieskova: Фідбек: не лише "як", але й "коли" і "навіщо" (UA)Yuliia Pieskova: Фідбек: не лише "як", але й "коли" і "навіщо" (UA)
Yuliia Pieskova: Фідбек: не лише "як", але й "коли" і "навіщо" (UA)
 
Nataliya Kryvonis: Essential soft skills to lead your team (UA)
Nataliya Kryvonis: Essential soft skills to lead your team (UA)Nataliya Kryvonis: Essential soft skills to lead your team (UA)
Nataliya Kryvonis: Essential soft skills to lead your team (UA)
 
Volodymyr Salyha: Stakeholder Alchemy: Transforming Analysis into Meaningful ...
Volodymyr Salyha: Stakeholder Alchemy: Transforming Analysis into Meaningful ...Volodymyr Salyha: Stakeholder Alchemy: Transforming Analysis into Meaningful ...
Volodymyr Salyha: Stakeholder Alchemy: Transforming Analysis into Meaningful ...
 
Anna Chalyuk: 7 інструментів та принципів, які допоможуть зробити вашу команд...
Anna Chalyuk: 7 інструментів та принципів, які допоможуть зробити вашу команд...Anna Chalyuk: 7 інструментів та принципів, які допоможуть зробити вашу команд...
Anna Chalyuk: 7 інструментів та принципів, які допоможуть зробити вашу команд...
 
Oksana Smilka: Цінності, цілі та (де) мотивація (UA)
Oksana Smilka: Цінності, цілі та (де) мотивація (UA)Oksana Smilka: Цінності, цілі та (де) мотивація (UA)
Oksana Smilka: Цінності, цілі та (де) мотивація (UA)
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
 
Andrii Skoromnyi: Чому не працює методика "5 Чому?" – і яка є альтернатива? (UA)
Andrii Skoromnyi: Чому не працює методика "5 Чому?" – і яка є альтернатива? (UA)Andrii Skoromnyi: Чому не працює методика "5 Чому?" – і яка є альтернатива? (UA)
Andrii Skoromnyi: Чому не працює методика "5 Чому?" – і яка є альтернатива? (UA)
 
Maryna Sokyrko & Oleksandr Chugui: Building Product Passion: Developing AI ch...
Maryna Sokyrko & Oleksandr Chugui: Building Product Passion: Developing AI ch...Maryna Sokyrko & Oleksandr Chugui: Building Product Passion: Developing AI ch...
Maryna Sokyrko & Oleksandr Chugui: Building Product Passion: Developing AI ch...
 

Último

URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 

Último (20)

URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 

Svitlana Galeshchuk Development of Explainable NLP Models: "You show me the man and I will show you the rule"

  • 2. About me vData Scientist: 6 years vIn NLP: 3 years vFulbright Scholar in 2015-2016, USA vVisiting Associate Prof. at University of Grenoble, France 2017 vData Scientist, Lecturer and Researcher at PSL/University of Paris Dauphine, France, since 2017 vData Scientist at Starclay Consulting, France, since 2019 vEmail: svitlana.galeshchuk@gmail.com Nov 5, 2020 UA Online Data Science Marathon
  • 3. Nov 5, 2020 UA Online Data Science Marathon NLP — Natural Language “Processing” = NLU — Natural Language “Understanding” (Sentiment Analysis, Topic Classification, Entity Detection) + NLG — Natural Language “Generation” (textual summaries, etc) I. What is NLP ?
  • 4. Nov 5, 2020 UA Online Data Science Marathon Word Embedding NLP : Natural Language Processing • 2001 : Neural language models: word embedding > converting the words into vectors Bengio, Y., Ducharme, R. & Vincent, P. A. Neural probabilistic language model. Proc. Advances. Neural Information Processing Systems 13. 932–938 (2001) • 2013 : Model Word2vec : Linguistic Contextualisation of words > Predict the word based on the context Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. Distributed representations of words and phrases and their compositionality. Advances in neural information processing systems. 3111-3119 (2013)
  • 5. • 2018 : Le modèle révolutionnaire BERT de Google > Bidirectional Encoder Representations from Transformers [1] Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. Attention is all you need. Advances in neural information processing systems. 5998-6008. (2017) [2] Devlin, Jacob, et al. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018) BERT Nov 5, 2020 UA Online Data Science Marathon
  • 6. ØText is a set of words; ØWords are discrete values, hence the curse of dimensionality; ØEmbedding (converting words into vectors) is the way to use text in ML; ØAutoregressive nature of natural language makes ML practitioners to often use LSTM in NLP tasks; ØBERT being a major breakthrough since 2017 is difficult to put into production; it is good for texts less than 512 tokens. To retain: Nov 5, 2020 UA Online Data Science Marathon
  • 9. Nov 5, 2020 UA Online Data Science Marathon Ribeiro, M. T., Singh, S., & Guestrin, C. " Why should i trust you?" Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. 1135-1144. (2016) LIME Intuition LIME
  • 11. Medical records (14.000 patients) Comments of the clinical history of patient, lifestyle and the symptomes (features) Motive of hospitalisation (features) Principal diagnosis (target) Used Data Nov 5, 2020 UA Online Data Science Marathon Use Case: Hospital Data
  • 12. Nov 5, 2020 UA Online Data Science Marathon Ingelsson, E., Lundholm, C., Johansson, A. L., & Altman, D. Hysterectomy and risk of cardiovascular disease: a population-based cohort study. European heart journal, 32(6), 745-750. (2011) Laughlin-Tommaso, S. K., Khan, Z., Weaver, A. L., Smith, C. Y., Rocca, W. A., & Stewart, E. A. Cardiovascular and metabolic morbidity after hysterectomy with ovarian conservation: a cohort study. Menopause (New York, NY), 25(5), 483. (2018) « Women who have had a hysterectomy, especially before the age of 35, have a higher risk of having a stroke. About 70,000 hysterectomies are performed each year in France » Stroke (863 patients) :
  • 13. Nov 5, 2020 UA Online Data Science Marathon Coumbaras, M., A. Duval, P. Le Hir, N. Jomaah, L. Arrivé, and J. M. Tubiana. "Fibrolipome du filum terminal." J Radiol 84. 721-7222 (2003) «When the lipoma is located in the thoracic region, it can be responsible for chronic back pain and sometimes headaches» Low back pain (1040 observations) :
  • 14. Nov 5, 2020 UA Online Data Science Marathon « the Shapley value: It is the average of the marginal contributions across all permutations » « What Shapley does is quantifying the contribution that each player brings to the game. What SHAP does is quantifying the contribution that each feature brings to the prediction made by the model » SHAP: both local and global explainability Lundberg, Scott M., and Su-In Lee. "A unified approach to interpreting model predictions." Advances in neural information processing systems. 2017. Shap Local Results SHAP
  • 15. Nov 5, 2020 UA Online Data Science Marathon LIME and SHAP into generating innocuous explanations which do not reflect the underlying biases Takes a long time to compute. For large datasets, it is computationally expensive to use the entire dataset and we have to rely on approximations (e.g., subsample the data). This has implications for the accuracy of the explanation. Original SHAP implementation has issues with visualization when more than 20 words are in the text: Slack, Dylan, et al. "Fooling lime and shap: Adversarial attacks on post hoc explanation methods." Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society. 2020. SHAP Disadvantages
  • 16. Sundararajan, M., Taly, A., & Yan, Q. (2017). Axiomatic attribution for deep networks. arXiv preprint arXiv:1703.01365. Mudrakarta, Pramod Kaushik, et al. "Did the model understand the question?." arXiv preprint arXiv:1805.05492 (2018).: « As the input varies along the straight line path between the baseline and the input at hand, the prediction moves along a trajectory from uncertainty to certainty (the final prediction probability). At each point on this trajectory, one can use the gradient with respect to the input features to attribute the change in the prediction probability back to the input features. IG aggregates these gradients along the trajectory using a path integral » Øapt for all differentiable models; Øeasy to implement; Øcomputationally scalable to massive deep networks; Ømuch faster than a naive Shapley-value-based method INTEGRATED GRADIENTS Nov 5, 2020 UA Online Data Science Marathon
  • 17. § Not working with non-differentiable model types (random forest, etc); § Some counterintuitive explanations IG Disadvantages Nov 5, 2020 UA Online Data Science Marathon
  • 18. DeepLIFT proceeds in a backward fashion. Each unit is assigned an attribution that represents the relative effect of the unit activated at the original network input x compared to the activation at some reference input. Reference values for all hidden units are determined running a forward pass through the network, using the baseline as input, and recording the activation of each unit Pros: very fast Cons: picking the baseline inputs Gabriel Tseng Nov 5, 2020 UA Online Data Science Marathon DeepLIFT
  • 19. Integrated Gradients: Mukund Sundararajan, Ankur Taly, Qiqi Yan, Axiomatic Attribution for Deep Networks, 2017 DeepLIFT: Avanti Shrikumar, Peyton Greenside, Anshul Kundaje, Learning Important Features Through Propagating Activation Differences, 2017 SHAP values: Scott M. Lundberg, Su-In Lee, A Unified Approach to Interpreting Model Predictions, 2017 LIME: Ribeiro, M. T., Singh, S., & Guestrin, C. Why should i trust you? Explaining the predictions of any classifier (2016) Literature Nov 5, 2020 UA Online Data Science Marathon
  • 20. •Explanation of the black-box models’ outputs is an important step towards making the bridge between the model and its end-user; •Explainable AI methods may deliver global or/and local interpretaions; •Most of the current approaches are based on the cooperative game theory; •Validation of interpretations is usually provided by field experts. Kullback- Leibler divergence are sometimes used to assess the interpretations. •Python implementation: Shap: https://github.com/slundberg/shap LIME: https://eli5.readthedocs.io/en/latest/overview.html IG, DeepLIFT: https://captum.ai/ To retain: Nov 5, 2020 UA Online Data Science Marathon