SlideShare una empresa de Scribd logo
1 de 20
Descargar para leer sin conexión
Interfaces de voz avanzadas
con VoiceXML
© 2015 Interactive Powers | www.ivrpowers.com
Digital Revolution is coming…
© 2015 Interactive Powers | www.ivrpowers.com
Self driving cars … Self voice services
DTMF keypad
Speaker
Automatic Speech Recognition
ASR
Micro
Text-to-Speech
TTS or Wav
Autopilot for
Interactive Voice Response
CAR
IVR
SIP / VoIP or TDM
© 2015 Interactive Powers | www.ivrpowers.com
What is IVR?
In telephony, Interactive Voice Response, or IVR, is a phone
technology that allows a computer to detect voice and touch
tones using a normal phone call. The IVR system can respond with
pre-recorded or dynamically generated audio to further direct
callers on how to proceed. IVR systems can be used to control
almost any function where the interface can be broken down into a
series of simple menu choices. Once constructed IVR systems
scale well to handle large call volumes.
© 2015 Interactive Powers | www.ivrpowers.com
IVR: Simple definition
ApplicationsPhone
IVR
Voice API
Persons Machines
© 2015 Interactive Powers | www.ivrpowers.com
IVR: Human-Machine Dialogue
Listen
Speak
Process
ASR / SIV
TTS / WAV
VoiceXML
IVR
Listen
Speak
Process
Machine Human
© 2015 Interactive Powers | www.ivrpowers.com
What is VoiceXML?
VoiceXML is a language for creating voice-user interfaces, particularly
for the telephone. It uses speech recognition (ASR) and touchtone
(DTMF keypad) for input, and pre-recorded audio and text-to-speech
synthesis (TTS) for output. It is based on the Worldwide Web
Consortium’s (W3C’s) Extensible Markup Language (XML), and
leverages the web paradigm for application development and
deployment. By having a common language, application developers,
platform vendors, and tool providers all can benefit from code
portability and reuse.
© 2015 Interactive Powers | www.ivrpowers.com
VoiceXML: History
201520101998 1999 2000 2001 2002 2006
VoiceXML 2.1
VoiceXML 3.0 draft
VoiceXML 1.0
VoiceXML 2.0
Natural Language Under.
NLU
Motorola
VoxML
IBM
SpeechML
Lucent
Teleportal
W3C
VoiceXML 0.9
AT&T
Labs
(Meta-languages)
© 2015 Interactive Powers | www.ivrpowers.com
W3C VoiceXML Open Standard
• W3C VoiceXML 2.0
Recommendation March 2004
• W3C VoiceXML 2.1 (Recommendation)
Recommendation June 2007
• W3C VoiceXML 3.0 (Draft)
Early Stage of development January 2006
© 2015 Interactive Powers | www.ivrpowers.com
Voice Browser or Web Browser
<vxml>
Internet
<html>
Web
Navegador Web
(Web Browser)
Navegador de Voz
(Voice Browser)
http://
Servidor Web
(web Server)
© 2015 Interactive Powers | www.ivrpowers.com
HTML versus VXML
HTML VXML
Mouse + Display Phone + Keypad
HTML layout VXML layout
images, video files audio, grammar files
Text Text (TTS)
Scripts Scripts
HTTP / HTTPS HTTP / HTTPS
RTP - SOAP - WSDL RTP - SOAP - WSDL - SIP
© 2015 Interactive Powers | www.ivrpowers.com
PBX versus IVR
Features PBX IVR
Connect Phones / Extensions Phones / Applications
Call Routing Person-to-Person Person-to-Machine
Configuration Static (Dialplan) Dynamic (VoiceXML)
Interaction DTMF DTMF | TTS | ASR | NLU | SIV
© 2015 Interactive Powers | www.ivrpowers.com
3 Niveles de dialogo: IVR … NLU
Key Tones (DTMF)
Direct Dialog (ASR/TTS)
Natural Language Understanding (NLU)
Dialogo determinista:
“Ventas, Comercial, Soporte…”
Teclas del teléfono:
“0…9 # *”
Dialogo indeterminista:
“Quiero contactar con un comercial”
1
2
3
© 2015 Interactive Powers | www.ivrpowers.com
Diagrama IVR… extendido
NLU
IVR
TTS ASR
MRCP
HTTP
MRCP
API
HTTP VoiceXML + GRXML + BNF
Voice Browser
Business
Applications
VOIP | TDM
HTTP | API
VXML
Speech Servers
HTTPS
Phone
SIP / TDM
PBX
Natural Language Understanding Voice Biometrics
Telephony Private Branch Exchange
IVR+
SIV
© 2015 Interactive Powers | www.ivrpowers.com
Hello World!
<?xml version="1.0"?>
<vxml version = "2.0" xmlns="http://www.w3.org/2001/vxml" xml:lang="en-US">
<form>
<block>Hello world!</block>
</form>
</vxml>
<?xml version="1.0"?>
<vxml version="2.0" xmlns="http://www.w3.org/2001/vxml">
<form>
<block>
<prompt>
<audio src="helloworld.wav"/>
</prompt>
</block>
</form>
</vxml>
© 2015 Interactive Powers | www.ivrpowers.com
Menu DTMF
<?xml version="1.0"?>
<vxml version = "2.1">
<menu>
<prompt>
Hello. Choose among the following option:
<enumerate> <value expr=“_dtmf”/> for <value expr=“_prompt”/>
</prompt>
<choice dtmf=“1” next=page1.vxml> Hotel </choice>
<choice dtmf=“2” next=page2.vxml> Wheather </choice>
<choice dtmf=“3” next=page3.vxml> News </choice>
</menu>
© 2015 Interactive Powers | www.ivrpowers.com
Reconocimiento de voz (ASR)
<?xml version=“1.0" encoding=“ISO-8859-1”?>
<vxml version = “2.0" lang=“en”>
<form> <field name=“city”>
<prompt>Where to you want to travel to?</prompt>
<option>New York</option>
<option>Paris</option>
<option>Berlin</option>
<option>Madrid</option>
<option>London</option>
</field>
<field name=“travellers” type=“number”>
<prompt>How many are traveling to <value expr=“city”/>? </prompt>
</field>
<block>
<submit next=“http://localhost/handler” namelist=“city travelers”/>
</block>
</form>
</vxml>
© 2015 Interactive Powers | www.ivrpowers.com
Ventajas de VoiceXML
• VoiceXML es un estándar abierto para los sistemas IVR
• Lenguaje basado en el paradigma XML / HTTP
• Inclusión de gramáticas de diálogo compiladas o dinámicas GRXML, ABNF,…
• Integración y gestión de los motores del habla TTS / ASR
• Compatible con todos lenguajes de programación web PHP / JSP / ASP /...
• Acceso universal a bases de datos y sistemas externos (también para NLU)
• Permite una gestión de los eventos en tiempo real
© 2015 Interactive Powers | www.ivrpowers.com
Live Demo
Sales Contact:
Ivan Sixto
ivan.sixto@ivrpowers.com
+34 699435965

Más contenido relacionado

La actualidad más candente

Final Twilio Under the Radar 2009 Cloud
Final Twilio Under the Radar 2009 CloudFinal Twilio Under the Radar 2009 Cloud
Final Twilio Under the Radar 2009 Cloud
Dealmaker Media
 
Integrate WebRTC Voice / Video Call App Using Contus Fly
Integrate WebRTC Voice / Video Call App Using Contus FlyIntegrate WebRTC Voice / Video Call App Using Contus Fly
Integrate WebRTC Voice / Video Call App Using Contus Fly
CONTUS TECH
 
Mobicents Summit 2012 - François Joseph Levée - Orange Labs' Mobicents Sip Se...
Mobicents Summit 2012 - François Joseph Levée - Orange Labs' Mobicents Sip Se...Mobicents Summit 2012 - François Joseph Levée - Orange Labs' Mobicents Sip Se...
Mobicents Summit 2012 - François Joseph Levée - Orange Labs' Mobicents Sip Se...
telestax
 

La actualidad más candente (20)

Twilio Signal 2016 Real-time Communications Overview
Twilio Signal 2016 Real-time Communications OverviewTwilio Signal 2016 Real-time Communications Overview
Twilio Signal 2016 Real-time Communications Overview
 
Twilio Signal 2016 Keynote
Twilio Signal 2016 Keynote Twilio Signal 2016 Keynote
Twilio Signal 2016 Keynote
 
Twilio Signal 2016 Build Your Own Cloud Enterprise VoIP
Twilio Signal 2016 Build Your Own Cloud Enterprise VoIPTwilio Signal 2016 Build Your Own Cloud Enterprise VoIP
Twilio Signal 2016 Build Your Own Cloud Enterprise VoIP
 
Building Blocks for Next Generation Contact Centers
Building Blocks for Next Generation Contact CentersBuilding Blocks for Next Generation Contact Centers
Building Blocks for Next Generation Contact Centers
 
Smarter SIP Trunks: 6 Ways You Can Save Time and Get More Features
Smarter SIP Trunks: 6 Ways You Can Save Time and Get More FeaturesSmarter SIP Trunks: 6 Ways You Can Save Time and Get More Features
Smarter SIP Trunks: 6 Ways You Can Save Time and Get More Features
 
TringMe Overview
TringMe OverviewTringMe Overview
TringMe Overview
 
TWILIO SMS Introduction
TWILIO SMS IntroductionTWILIO SMS Introduction
TWILIO SMS Introduction
 
Final Twilio Under the Radar 2009 Cloud
Final Twilio Under the Radar 2009 CloudFinal Twilio Under the Radar 2009 Cloud
Final Twilio Under the Radar 2009 Cloud
 
Twiliobot at Google Wave Meetup 2009-09-14
Twiliobot at Google Wave Meetup 2009-09-14Twiliobot at Google Wave Meetup 2009-09-14
Twiliobot at Google Wave Meetup 2009-09-14
 
Twilio Contact Center Overview
Twilio Contact Center OverviewTwilio Contact Center Overview
Twilio Contact Center Overview
 
Integrate WebRTC Voice / Video Call App Using Contus Fly
Integrate WebRTC Voice / Video Call App Using Contus FlyIntegrate WebRTC Voice / Video Call App Using Contus Fly
Integrate WebRTC Voice / Video Call App Using Contus Fly
 
Open Source Software Business Model
Open Source Software Business Model Open Source Software Business Model
Open Source Software Business Model
 
The Momentum of Collaboration and the Customer Experience
The Momentum of Collaboration and the Customer ExperienceThe Momentum of Collaboration and the Customer Experience
The Momentum of Collaboration and the Customer Experience
 
Yealink business ip video phone vp530
Yealink business ip video phone vp530Yealink business ip video phone vp530
Yealink business ip video phone vp530
 
Avaya Video Everywhere
Avaya Video EverywhereAvaya Video Everywhere
Avaya Video Everywhere
 
Mobicents Summit 2012 - François Joseph Levée - Orange Labs' Mobicents Sip Se...
Mobicents Summit 2012 - François Joseph Levée - Orange Labs' Mobicents Sip Se...Mobicents Summit 2012 - François Joseph Levée - Orange Labs' Mobicents Sip Se...
Mobicents Summit 2012 - François Joseph Levée - Orange Labs' Mobicents Sip Se...
 
Presentation1
Presentation1Presentation1
Presentation1
 
Icareus Ltd company presentation 20170301
Icareus Ltd company presentation 20170301Icareus Ltd company presentation 20170301
Icareus Ltd company presentation 20170301
 
Twilio Product Overview
Twilio Product OverviewTwilio Product Overview
Twilio Product Overview
 
Icareus Solutions TV Everywhere Presentation
Icareus Solutions TV Everywhere PresentationIcareus Solutions TV Everywhere Presentation
Icareus Solutions TV Everywhere Presentation
 

Similar a Interfaces de Voz avanzadas con VoiceXML - Iván Sixto | VoIP2DAY 2015

Ken Rehor's presentation at eComm 2008
Ken Rehor's presentation at eComm 2008Ken Rehor's presentation at eComm 2008
Ken Rehor's presentation at eComm 2008
eComm2008
 
IBM Mobile portal experience
IBM Mobile portal experienceIBM Mobile portal experience
IBM Mobile portal experience
Vincent Perrin
 
Hello Soft Corporate Overview (2)
Hello Soft Corporate Overview (2)Hello Soft Corporate Overview (2)
Hello Soft Corporate Overview (2)
satyapraveent
 
Transcend/ShoreTel Overview
Transcend/ShoreTel OverviewTranscend/ShoreTel Overview
Transcend/ShoreTel Overview
anders56
 
Tringme Overview
Tringme OverviewTringme Overview
Tringme Overview
gueste32f19
 

Similar a Interfaces de Voz avanzadas con VoiceXML - Iván Sixto | VoIP2DAY 2015 (20)

Ken Rehor's presentation at eComm 2008
Ken Rehor's presentation at eComm 2008Ken Rehor's presentation at eComm 2008
Ken Rehor's presentation at eComm 2008
 
CCXML For Advanced Communications Applications
CCXML For Advanced Communications ApplicationsCCXML For Advanced Communications Applications
CCXML For Advanced Communications Applications
 
Genesys voice portal whitepaper
Genesys voice portal whitepaperGenesys voice portal whitepaper
Genesys voice portal whitepaper
 
Current trends in open source and automotive
Current trends in open source and automotiveCurrent trends in open source and automotive
Current trends in open source and automotive
 
Greach 2011 - Building Telephony Apps with Groovy and Grails
Greach 2011 - Building Telephony Apps with Groovy and GrailsGreach 2011 - Building Telephony Apps with Groovy and Grails
Greach 2011 - Building Telephony Apps with Groovy and Grails
 
Phonologies @ Cluecon
Phonologies @ ClueconPhonologies @ Cluecon
Phonologies @ Cluecon
 
IBM Mobile portal experience
IBM Mobile portal experienceIBM Mobile portal experience
IBM Mobile portal experience
 
Otra forma de hacer aplicaciones de telefonía
Otra forma de hacer aplicaciones de telefoníaOtra forma de hacer aplicaciones de telefonía
Otra forma de hacer aplicaciones de telefonía
 
10.1.1.510.6198
10.1.1.510.619810.1.1.510.6198
10.1.1.510.6198
 
Hello Soft Corporate Overview (2)
Hello Soft Corporate Overview (2)Hello Soft Corporate Overview (2)
Hello Soft Corporate Overview (2)
 
ibm språkbanken websphere
ibm språkbanken websphereibm språkbanken websphere
ibm språkbanken websphere
 
Wildix X Convention 2018 - Slides
Wildix X Convention 2018 - SlidesWildix X Convention 2018 - Slides
Wildix X Convention 2018 - Slides
 
Far South Networks Vision
Far South Networks VisionFar South Networks Vision
Far South Networks Vision
 
Transcend/ShoreTel Overview
Transcend/ShoreTel OverviewTranscend/ShoreTel Overview
Transcend/ShoreTel Overview
 
Voicexml
VoicexmlVoicexml
Voicexml
 
Tringme Overview
Tringme OverviewTringme Overview
Tringme Overview
 
Voip mobile dialer
Voip mobile dialerVoip mobile dialer
Voip mobile dialer
 
Open Source Telecom Software Landscape by Alan Quayle
Open Source Telecom Software Landscape by Alan QuayleOpen Source Telecom Software Landscape by Alan Quayle
Open Source Telecom Software Landscape by Alan Quayle
 
Webrtc workshop from Alan Quayle
Webrtc workshop from Alan QuayleWebrtc workshop from Alan Quayle
Webrtc workshop from Alan Quayle
 
Rick Fant Novedades Firefox OS en Firefox Update 2.0
Rick Fant Novedades Firefox OS en Firefox Update 2.0Rick Fant Novedades Firefox OS en Firefox Update 2.0
Rick Fant Novedades Firefox OS en Firefox Update 2.0
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 

Interfaces de Voz avanzadas con VoiceXML - Iván Sixto | VoIP2DAY 2015

  • 1. Interfaces de voz avanzadas con VoiceXML
  • 2. © 2015 Interactive Powers | www.ivrpowers.com Digital Revolution is coming…
  • 3. © 2015 Interactive Powers | www.ivrpowers.com Self driving cars … Self voice services DTMF keypad Speaker Automatic Speech Recognition ASR Micro Text-to-Speech TTS or Wav Autopilot for Interactive Voice Response CAR IVR SIP / VoIP or TDM
  • 4. © 2015 Interactive Powers | www.ivrpowers.com What is IVR? In telephony, Interactive Voice Response, or IVR, is a phone technology that allows a computer to detect voice and touch tones using a normal phone call. The IVR system can respond with pre-recorded or dynamically generated audio to further direct callers on how to proceed. IVR systems can be used to control almost any function where the interface can be broken down into a series of simple menu choices. Once constructed IVR systems scale well to handle large call volumes.
  • 5. © 2015 Interactive Powers | www.ivrpowers.com IVR: Simple definition ApplicationsPhone IVR Voice API Persons Machines
  • 6. © 2015 Interactive Powers | www.ivrpowers.com IVR: Human-Machine Dialogue Listen Speak Process ASR / SIV TTS / WAV VoiceXML IVR Listen Speak Process Machine Human
  • 7. © 2015 Interactive Powers | www.ivrpowers.com What is VoiceXML? VoiceXML is a language for creating voice-user interfaces, particularly for the telephone. It uses speech recognition (ASR) and touchtone (DTMF keypad) for input, and pre-recorded audio and text-to-speech synthesis (TTS) for output. It is based on the Worldwide Web Consortium’s (W3C’s) Extensible Markup Language (XML), and leverages the web paradigm for application development and deployment. By having a common language, application developers, platform vendors, and tool providers all can benefit from code portability and reuse.
  • 8. © 2015 Interactive Powers | www.ivrpowers.com VoiceXML: History 201520101998 1999 2000 2001 2002 2006 VoiceXML 2.1 VoiceXML 3.0 draft VoiceXML 1.0 VoiceXML 2.0 Natural Language Under. NLU Motorola VoxML IBM SpeechML Lucent Teleportal W3C VoiceXML 0.9 AT&T Labs (Meta-languages)
  • 9. © 2015 Interactive Powers | www.ivrpowers.com W3C VoiceXML Open Standard • W3C VoiceXML 2.0 Recommendation March 2004 • W3C VoiceXML 2.1 (Recommendation) Recommendation June 2007 • W3C VoiceXML 3.0 (Draft) Early Stage of development January 2006
  • 10. © 2015 Interactive Powers | www.ivrpowers.com Voice Browser or Web Browser <vxml> Internet <html> Web Navegador Web (Web Browser) Navegador de Voz (Voice Browser) http:// Servidor Web (web Server)
  • 11. © 2015 Interactive Powers | www.ivrpowers.com HTML versus VXML HTML VXML Mouse + Display Phone + Keypad HTML layout VXML layout images, video files audio, grammar files Text Text (TTS) Scripts Scripts HTTP / HTTPS HTTP / HTTPS RTP - SOAP - WSDL RTP - SOAP - WSDL - SIP
  • 12. © 2015 Interactive Powers | www.ivrpowers.com PBX versus IVR Features PBX IVR Connect Phones / Extensions Phones / Applications Call Routing Person-to-Person Person-to-Machine Configuration Static (Dialplan) Dynamic (VoiceXML) Interaction DTMF DTMF | TTS | ASR | NLU | SIV
  • 13. © 2015 Interactive Powers | www.ivrpowers.com 3 Niveles de dialogo: IVR … NLU Key Tones (DTMF) Direct Dialog (ASR/TTS) Natural Language Understanding (NLU) Dialogo determinista: “Ventas, Comercial, Soporte…” Teclas del teléfono: “0…9 # *” Dialogo indeterminista: “Quiero contactar con un comercial” 1 2 3
  • 14. © 2015 Interactive Powers | www.ivrpowers.com Diagrama IVR… extendido NLU IVR TTS ASR MRCP HTTP MRCP API HTTP VoiceXML + GRXML + BNF Voice Browser Business Applications VOIP | TDM HTTP | API VXML Speech Servers HTTPS Phone SIP / TDM PBX Natural Language Understanding Voice Biometrics Telephony Private Branch Exchange IVR+ SIV
  • 15. © 2015 Interactive Powers | www.ivrpowers.com Hello World! <?xml version="1.0"?> <vxml version = "2.0" xmlns="http://www.w3.org/2001/vxml" xml:lang="en-US"> <form> <block>Hello world!</block> </form> </vxml> <?xml version="1.0"?> <vxml version="2.0" xmlns="http://www.w3.org/2001/vxml"> <form> <block> <prompt> <audio src="helloworld.wav"/> </prompt> </block> </form> </vxml>
  • 16. © 2015 Interactive Powers | www.ivrpowers.com Menu DTMF <?xml version="1.0"?> <vxml version = "2.1"> <menu> <prompt> Hello. Choose among the following option: <enumerate> <value expr=“_dtmf”/> for <value expr=“_prompt”/> </prompt> <choice dtmf=“1” next=page1.vxml> Hotel </choice> <choice dtmf=“2” next=page2.vxml> Wheather </choice> <choice dtmf=“3” next=page3.vxml> News </choice> </menu>
  • 17. © 2015 Interactive Powers | www.ivrpowers.com Reconocimiento de voz (ASR) <?xml version=“1.0" encoding=“ISO-8859-1”?> <vxml version = “2.0" lang=“en”> <form> <field name=“city”> <prompt>Where to you want to travel to?</prompt> <option>New York</option> <option>Paris</option> <option>Berlin</option> <option>Madrid</option> <option>London</option> </field> <field name=“travellers” type=“number”> <prompt>How many are traveling to <value expr=“city”/>? </prompt> </field> <block> <submit next=“http://localhost/handler” namelist=“city travelers”/> </block> </form> </vxml>
  • 18. © 2015 Interactive Powers | www.ivrpowers.com Ventajas de VoiceXML • VoiceXML es un estándar abierto para los sistemas IVR • Lenguaje basado en el paradigma XML / HTTP • Inclusión de gramáticas de diálogo compiladas o dinámicas GRXML, ABNF,… • Integración y gestión de los motores del habla TTS / ASR • Compatible con todos lenguajes de programación web PHP / JSP / ASP /... • Acceso universal a bases de datos y sistemas externos (también para NLU) • Permite una gestión de los eventos en tiempo real
  • 19. © 2015 Interactive Powers | www.ivrpowers.com Live Demo