Check my presentation about challenges we're facing with voice and how to successfully getting started. Also viewable online: https://pasteapp.com/p/ANdFfwH2si9
2. About me
Creative Technologist at CX Company
Finding magic where left brain meets
right brain
Innovator and early-adopter
Connecting technology, business and
people
3. Some cool facts about CX Company
• Founded in 2004, meanwhile 15 years of experience
• AI-driven, enterprise-ready conversational platform
DigitalCX
• Content-first developer-friendly platform
• Support for over 20 languages
• Over 4 million digital interactions per day
• Leading provider for chatbots and digital assistants in NL
• ISO 27001 certified. Your data is safe with us
• Pan European operations with offices in NL, D and UK
5. We also offer easy integrations for backend systems
and devices. Available through NPM
6. Let's embark on a high-level journey
through the adoption of smart
speakers and voice assistants
What is the current state? What can we do better? What new challenges do we face?
And what can we do to make the experience better?
9. Smart speakers
are a hot market
Of the 30% planning to purchase, 73% of
them want to do so in the next six months.
There are more speakers available and
cheaper in price.
10. 75% of households will have at least
one smart speaker by 2020
Source: Gartner
11. Also by 2020, customers will manage
85% of their relationship with the
enterprise without interacting with a
human
Source: Gartner
14. What are we talking
about?
Jargon, technical phrases, definitions etc.
15. Smart speakers try to
understand users' intents by
analyzing their utterances.
Smart speakers are called smart because
they use Artificial Intelligence (AI) to do
so.
16. Do you feel like this at
the moment?
Let's explain
17. Intent vs. utterance
Intent
What does a user want to achieve?
Utterance
How does a user express that?
The same intent can have multiple utterances: "Book a cab", "Get me a taxi"
19. Machine Learning vs.
Natural Language Processing
Machine Learning
A computer tries to make decisions by discovering patterns in large amounts of data. It
will typically use neural networks or deep learning models to achieve that.
Requires lots of data and expensive computing power
Sounds really cool. Ma-chi-ne-Lear-ning, awesome!
Can potentially predict what a user wants to do
20. Can you
imagine?
"Hey Maarten, your favorite beer is on
discount now at your local supermarket. I
also noticed you're running out of stock.
Since your friends are coming over
tomorrow to watch the football you might
want to have some at home. Shall I go
ahead and place a new order?"
21. Machine Learning vs.
Natural Language Processing
Natural Language Processing/Understanding
A computer tries to make decisions based on patterns and topology. It will typically use
grammar analysis with lexicons, dictionaries, lemmatization, word weights etc. to achieve
that.
Requires a bit more work to maintain
Can be setup without lots of data. Early results possible
You don't need to train your phrases
Can be fine-grained, for example Mercedes SLK vs. car
Can distinct grammar rules, for example: book a ship vs. ship a book
23. AI technology is improving
rapidly, but we still have a way
to go to Super Intelligence
Some even say we're in an AI-winter.
Because the industry is currently
overpromising but underperforming
24. How is CX Company using AI?
We use business rules for the things we
know and AI for the things we do not know
and want to improve
25. How is CX Company using AI?
• Make life easier for content engineers
(GAP-analysis, pre-trained modules,
AutoAnswers, AutoDialogs)
→ Supervised Machine Learning
• Better understand what a clients wants
(context, spellcheck, synonyms etc.)
→ NLP
33. A growing majority
now views our online
privacy as a crisis
Source: Axios
The privacy threat is
a crisis, and we need
to force companies
to change.
vs.
Online services are
essential, and we all
have to accept
some risk.
39. Also, integrating smart
speakers into existing
ecosystems like smart homes is
not frictionless. In fact, it can
be quite hard for non-techies.
40. "Voice search is
one of the most
often talked about,
but least
understood topics
confronting
businesses today."
The average VSR score
Uberall calculated was
44.12%, meaning the
majority of businesses
have not been optimized
to an acceptable
standard for consumer
voice search queries.
We'll see why this is
important later on
42. Some random facts
about screenagers
• 95 percent of teens
have a phone
• Teens spend 11
hours on their
devices each day
• Kids spend twice as
long playing on
screens than they do
outside
• 1 in 3 people would
rather give up sex
than their phone
Sources: Pew Research
Center, KFF, Market
Watch, Elephant Journal
43. So why should they give up
their smartphone for
something that's currently
working less interactive, less
efficient and less personal?
47. The way we interact with new Voice User
Interfaces forces us to rethink a consumer's
decision-making process.
48. "Conversational AI simplifies
computer usage like never
before as it flips the user
dynamic – instead of humans
learning computer code,
computers are learning our
language."
49. #2
A Voice User Interface is more of a blank canvas
than a Graphical User Interface
50. Similar to a rails put up on a bowling track
to make the ball reach its target, we have
to provide handles for a user to complete
a journey through voice.
51. We have to take more efforts into
steering the conversation. (1/2)
By creating exceptionally good dialogs.
Conversational copywriting is already playing a huge roll in this. This
will eventually be - or already is - a job on its own!
52. For example by catering for non-scripted scenarios
"Alright I found two accommodations, one is a hotel the other a camping. Which one
would you like?"
"The first one"
"The last one"
"Neither, I want an AirB&B"
"Definitely the hotel!"
53. Or by managing expectations
"We're at question 3 of 10"
vs.
"Thanks for your patience. I only need 3 more answers and then we're done"
54. We have to take more efforts into
steering the conversation. (2/2)
Or by using the hardware's capabilities. Do you have a screen
available? Use it! Quick replies? Show them! Anything that can help
moving the user forward into the dialog.
You can get those parameters from a request
55. Using the hardware's capability
"Take a look at the Nike shoes below I've found for you in size 8. Just tap one to get
more information"
57. Invocation vs. deep-linking
• Explicit invocation
"Hey Google, talk to Domino's pizza"
• Deep linking
Extension on the explicit invocation where the user already has an intent
"Hey Google, ask Domino's pizza about their opening hours"
"Hey Google, ask Domino's pizza about their opening hours this Friday"
• Implicit invocation
Google uses an "action directory" to lookup which actions can fulfill this intent.
"Hey Google, order a pizza"
58. So..
There are two
important things
to keep in mind
1. First mover's
advantage because
of the invocation
2. There's something
like "SEO for Voice"
59. It brings back memories to the
early days of optimizing our
websites for search engines
(SEO).
No-one really knew how it worked
64. We're moving on from the mobile-first
into the voice-first era. Your brand should also have
a voice-search strategy
65. So avoid copying your
chatbot's conversation 1-on-1
to voice..
I know it's very tempting, however it's far
more difficult to adjust screen content to
voice than the other way around.
66. But how do you get started
then?
Focus on the high-frequency and low-
breadth journey.
In other words: a journey that is triggered
often without too many steps to complete.
You do have analytics for this, right?
67. Or run a workshop.
At CX Company we can
potentially run several
workshops like kickoff-
or prototyping
workshops.
Let me know if you want
to know more about
this!
70. Leverage the power of
Design Thinking to get
your insights fast and
test them in an early
stage already.
71. Setup a Minimal Viable
Product to be able to
enable more
experimentation across
teams
72. Setup lo-fi prototyping, for example:
• Wizard of Oz testing
Simulate the machine responding
• Alexa's one-breath test
If you need to take a breath in between responses, it's probably too long..
• Read the script out loud!
Release your inner Shakespeare. Really. Write down your script. Set aside some time
and start "playing" your script with two or more people.
75. "We respond to voice technologies as we respond
to actual people and behave as we would in any
social situation"
76. The principle behind this is called the Cooperative Principle and can
be understood in terms of four rules: Maxim of Quality, Maxim of
Quantity, Maxim of Relevance and Maxim of Manner
77. • Maxim of Quality
Share the right quality of information, i.e. info you believe is true
• Maxim of Quantity
Share the right amount of information
• Maxim of Relevance
Share only relevant information
• Maxim of Manner
Share information as briefly and orderly as possible, trying to avoid obscurity and
ambiguity
78. For example, maxim of quantity:
"Hey Google, do you know the time?"
"Of course I do"
vs.
"Hey Google, do you know the time?"
"In The Hague it's currently twenty to twelve"
80. "People attribute human characteristics to spoken
dialogue systems for reasons related to human
evolution."
81.
82. Keep this in mind when creating a Persona around your voice identity.
A Persona will make your assistant also appear
more consistent, more likable and more reliable
and should at least:
Define personality traits like friendly, helpful, witty, charming etc.
Follow a style guide (if existent)
Be considered as a real employee of your customer service center