Evorus: A Crowd-Powered Conversational Assistant Built to Automate Itself Over Time.
Ting-Hao K. Huang, Joseph Chee Chang, Jeffrey P. Bigham.
In Proceedings of Conference on Human Factors in Computing Systems 2018 (CHI 2018), 2018, Montréal, Canada.
Advantages of Hiring UIUX Design Service Providers for Your Business
Evorus: A Crowd-Powered Conversational Assistant Built to Automate Itself Over Time
1. 1
Live Note/QA: http://tinyurl.com/Evorus
Evorus:
A Crowd-Powered
Conversational Assistant
Built to Automate Itself Over Time
Ting-Hao (Kenneth) Huang*
Joseph Chee Chang
Jeffrey P. Bigham
[ Question / Feedback: http://tinyurl.com/Evorus ]
*Kenneth will join in Fall 2018
7. 7
Live Note/QA: http://tinyurl.com/Evorus
What just
happened?
• Open Conversation
• Multi-turn interaction
• Multiple domains
• Personalized
• Coherent dialog
• Mix of task-oriented
and social conversation
9. 9
Live Note/QA: http://tinyurl.com/Evorus
Existing Approaches to
Open Conversation
• Combining multiple task-oriented dialog systems
• DialPort (Zhao, et al., 2016)
• End-to-end framework for dialogue systems
• Serban, et al. 2016; Li, et al. 2017
• Adapting a model to many other domains
• Walker, et al., 2007; Sun, et al., 2016
• Chit-chat systems (social bot)
• Hold social conversations (Banchs, et al., 2012)
• Still a very hard problem…
10. 10
Live Note/QA: http://tinyurl.com/Evorus
Existing Approaches to
Open Conversation
• Combining multiple task-oriented dialog systems
• DialPort (Zhao, et al., 2016)
• End-to-end framework for dialogue systems
• Serban, et al. 2016; Li, et al. 2017
• Adapting a model to many other domains
• Walker, et al., 2007; Sun, et al., 2016
• Chit-chat systems (social bot)
• Hold social conversations (Banchs, et al., 2012)
• Still a very hard problem…
MIT Technology Review
Feb 27, 2018
16. 16
Live Note/QA: http://tinyurl.com/Evorus
•Crowd workers collectively hold a
conversation by:
1. Propose Responses
2. Vote Responses
3. Take Notes
Reward points for
each action
"Is there anything else I can help you with?": Challenges in Deploying an On-Demand
Crowd-Powered Conversational Agent. Huang, et al. HCOMP 2016.
Chorus: A Crowd-Powered
Conversation Assistant
30. 30
Live Note/QA: http://tinyurl.com/Evorus
Ranking Chatbots: Performance & Topic
Topic Similarity
User Message
Domain of
the Chatbot
Hey what should
I eat in Montreal?
Find me some
good restaurants !
Where can I get
Chinese food?
Example
Triggering
Message
31. 31
Live Note/QA: http://tinyurl.com/Evorus
Ranking Chatbots: Performance & Topic
Topic Similarity
User Message
Domain of
the Chatbot
Hey what should
I eat in Montreal?
Example
Triggering
Message
Find me some
good restaurants !
Where can I get
Chinese food?
Topic
Similarity
36. 36
Live Note/QA: http://tinyurl.com/Evorus
Find the Best Confidence Threshold
• High Threshold
• Only vote when pretty sure
• High precision, but little benefit
• Low Threshold
• Nearly always vote
• Grant agreement bonus by mistake
• Damage conversation quality
39. 39
Live Note/QA: http://tinyurl.com/Evorus
Automating Open Conversations
• Setup
• A 5-month-long deployment, 80 Users
• 4 chatbots + 1 voting bot
• Result
• Automated responses were chosen 12.44% of the time.
• Human upvotes were reduced by 13.81%.
• The cost of each message is reduced by 32.76%.
• Conversation quality and user
satisfaction level remains.
• Conversation Quality: Satisfaction,
Clarity, Responsiveness, Comfort
(Liu, et al., 2010)
40. 40
Live Note/QA: http://tinyurl.com/Evorus
Conclusion & Future Work
• Automating some part of an open conversation is
possible.
• Future Work
• 1000+ Chatbots!
• Deploy Chorus as an Open Research Platform
• Chorus on Smart Devices
41. 41
Live Note/QA: http://tinyurl.com/Evorus
Conclusion & Future Work
• Automating some part of an open conversation is
possible.
• Future Work
• 1000+ Chatbots!
• Deploy Chorus as an Open Research Platform
• Chorus on Smart Devices
@windx0303
KennethHuang.cc
Ting-Hao (Kenneth) Huang
QA: http://tinyurl.com/Evorus