Rasa Developer Summit - Nouha Dziri, PhD Student/Google AI - Evaluating Coherence in Dialogue Systems Using ENtailment

•

1 like•454 views

Evaluating open-domain dialogue systems is difficult due to the diversity of possible correct answers. Automatic metrics such as BLEU correlate weakly with human annotations, resulting in a significant bias across different models and datasets. Some researchers resort to human judgment experimentation for assessing response quality, which is expensive, time consuming, and not scalable. Moreover, judges tend to evaluate a small number of dialogues, meaning that minor differences in evaluation configuration may lead to dissimilar results. In this talk, I will present interpretable metrics for evaluating topic coherence by making use of distributed sentence representations. Furthermore, I will introduce calculable approximations of human judgment based on conversational coherence by adopting state-of-the-art entailment techniques. I will show that the introduced metrics can be used as a surrogate for human judgment, making it easy to evaluate dialogue systems on large-scale datasets and allowing an unbiased estimate for the quality of the responses. WHAT YOU'LL LEARN The task of evaluating dialogue systems is far from being solved, researchers are still on the quest for a strong and reliable metric that highly conforms with human judgment. Consistency is key in evaluating dialog systems Entailment techniques lay the foundations of future works to evaluate better the consistency in dialogues Deep learning and reinforcement enable new research Nouha Dziri is a Ph.D. student at the University of Alberta working within the Alberta Machine Intelligence Institute. Her research interests revolves around generative deep learning models and conversational dialogue systems. In particular, her work focuses on modelling an intelligent agent which can have open-ended conversations indistinguishable from human ones. Before her Ph.D, she completed a MSc degree in Computer Science at the University of Alberta where she worked on dialogue modeling and quality evaluation. She has interned at Google AI in New York city where she investigated dialogue quality modeling and persuasiveness.

Technology

Evaluating Coherence in Dialogue Systems Using
Entailment
Nouha Dziri
PhD student at University of Alberta / Google AI
Rasa Developer Summit - 2019

hdbfdj
Dialogue
history
Generated
response

Rasa Developer Summit - Nouha Dziri, PhD Student/Google AI - Evaluating Coherence in Dialogue Systems Using ENtailment

Recently uploaded

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

Slack Application Development 101 Slidespraypatel2

TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc

Real Time Object Detection Using Open CVKhem

Histor y of HAM Radio presentation slidevu2urc

Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software

🐬 The future of MySQL is Postgres 🐘RTylerCroy

Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech

Artificial Intelligence: Facts and MythsJoaquim Jorge

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

Boost PC performance: How more available memory can improve productivityPrincipled Technologies

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge

A Year of the Servo Reboot: Where Are We Now?Igalia

Scaling API-first – The story of a global engineering organizationRadu Cotescu

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies

Recently uploaded (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men

Slack Application Development 101 Slides

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

Real Time Object Detection Using Open CV

Histor y of HAM Radio presentation slide

Tata AIG General Insurance Company - Insurer Innovation Award 2024

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

🐬 The future of MySQL is Postgres 🐘

Advantages of Hiring UIUX Design Service Providers for Your Business

Artificial Intelligence: Facts and Myths

Handwritten Text Recognition for manuscripts and early printed texts

Boost PC performance: How more available memory can improve productivity

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Driving Behavioral Change for Information Management through Data-Driven Gree...

A Year of the Servo Reboot: Where Are We Now?

Scaling API-first – The story of a global engineering organization

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Factors to Consider When Choosing Accounts Payable Services Providers.pptx

Rasa Developer Summit - Nouha Dziri, PhD Student/Google AI - Evaluating Coherence in Dialogue Systems Using ENtailment

1. Evaluating Coherence in Dialogue Systems Using Entailment Nouha Dziri PhD student at University of Alberta / Google AI Rasa Developer Summit - 2019

8. 1.

9. 1. 2.

10. 1. 2. 3.

11. 1. 2. 3. 4.

12. 1. 2. 3. 4. 🤨

13. 1. 2. 3. 4.

14.