SlideShare una empresa de Scribd logo
1 de 17
Descargar para leer sin conexión
Voice Control combined with Speech-To-Text
           and NLU resulting in Smart UI
                                                     Reimund Schmald, Nuance

                                                        Stefan Seide, T-Systems
1   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.   MOBILE SOLUTIONS
This is what we are working on!




2   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.                      MOBILE SOLUTIONS
                                                                                  Scene from Star Trek IV: The Voyage Home (1986)
Agenda

    • Multi-Modal Input UE: Status and Trends in Mobile


    • Voice enabled NLU: Requirements + Demo


    • Hybrid Architecture, Programming Example




3   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.   MOBILE SOLUTIONS
Starting with Keyboard




            Type


            Write


           Speak


           Swype




4   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.   MOBILE SOLUTIONS
Multi-Modality in Apps
    Example: amazon and iTranslate




5   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.   MOBILE SOLUTIONS
Security and Personalization

                                                                                  From completing a financial
                                                                                  transaction to accessing sensitive
                                                                                  content Voice Biometrics offers
                                                                                  security so you can proceed with
                                                                                  confidence.

                                                                                  Through speaker identification
                                                                                  Voice Biometrics delivers a
                                                                                  personalized experience where
                                                         “My voice is my          various users profiles are
                                                           password”              available, e.g. shared devices such
                                                                                  as the TV or tablets. Simply speak,
                                                                                  and your personal settings are
                                                                                  loaded.




6   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.             MOBILE SOLUTIONS
Personalization – Across Devices

               Text Dictionary (Local)




                                   Speech Dictionary (Cloud)




7   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.   MOBILE SOLUTIONS
Just the Mic Button

            Requirements: High Quality SpeechToText + NLU




8   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.   MOBILE SOLUTIONS
High Quality SpeechToText on NDEV




                        The Industry’s FIRST developer program to offer
                        Speech To Text and Text to Speech integration for
                                                                   any mobile app

                                        8000+ developers registered to date

                                                            iOS, Android, WP 7

                                                        www.ndevmobile.com
9   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.     MOBILE SOLUTIONS
NDEV Mobile: Client SDK Technical Aspects
              SDK Components
                    –     Recognizer object
                    –     Audio engine                                                          Client Application
                    –     End of Speech Detection                                      1                                       8
                    –     Encoding (compresses request to conserve bandwidth)
                    –     Network Transport
                                                                                                Dragon SDK
                                                                                       Recogniser        Audio        End Of

              Server Components                                                    2     Object          Engine       Speech
                                                                                                                                   7
                    –     Authentication                                                    Network
                                                                                                                  Encoding
                                                                                           Transport
                    –     Recognizer
                    –     Vocalizer TTS
                                                                                       3
           1. Client application invokes SDK
           2. SDK captures request and encodes it
                                                                                                                               6
                    •    Might use End of Speech, if enabled                                         Authentication
           1. SDK Network Transport sends utterance to NVC Servers
           2. NVC Server authenticates Client app
                                                                                   4
                                                                                            MREC                   Vocalizer
           3. Recognizer/TTS processes request                                     5
           4. NVC Server redirects response to Client                                  Search       Dictation
           5. SDK processes response and sends to Client app
           6. Client app plays/shows response                                            NVC Hosted Server

10   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.             MOBILE SOLUTIONS
NDev mobile Service Levels



                                              FREE




11   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.   MOBILE SOLUTIONS
Feature Comparison: Gold, Silver, Emerald
                                                                   Silver              Gold              Emerald
            Features
            ASR Dictation & Search Models
                                                                       ü                 ü                  ü
            for 18 Languages
            Network TTS for 30+ Languages                              ü                 ü                  ü
            Bluetooth Support (8 KHz)                                  ü                 ü                  ü
            SSL                                                                          ü                  ü
            Customized Features                                                                             ü
            Flexibility & Customization
            UI                                                         ü                 ü                  ü
            Platforms
             Android, iOS, W P7                                        ü                 ü                  ü
             HTTP                                                                        ü                  ü
            Consulting Services                                                                          Available
            Availability & Support
            Centralized Speech Resource &
                                                                       ü                 ü                  ü
            Support Forums
            W eb Ticketing                                                               ü                  ü
            SLA                                                                          ü                  ü
            Dedicated Support Contact                                                                    Available
            Cost
            Development                                      Free for 90 days   Payment Options           Custom
            Production                                         Free w/ cap    $0.009 trx or $0.24 flat    Custom



12   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.                          MOBILE SOLUTIONS
NUANCE PROPRIETARY NON-DISCLOSURE INFORMATION


     Different Levels of NLU
                      Structured NLU                                                       Unstructured
                                                                                               NLU
            Embedded & Connected                                                   Server-side natural language
            speech systems working                                                 understanding platform that
            together to determine what                                             supports open-ended queries
            specific phone-related task the                                        and intent classification.
            user is looking to complete.                                           “Is it raining in Berlin?”
            “Send text to John Call me                                             “What movies are playing near
            shortly”                                                               me?”
            “Search for New York Yankees”                                          “Make a reservation to Capital
            “Update Facebook I am today in                                         Grille in Burlington for 8 pm on
            Berlin”                                                                Friday for 2 people”




13   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.                     MOBILE SOLUTIONS
NUANCE PROPRIETARY NON-DISCLOSURE INFORMATION

     Deploying a Comprehensive Speech Solution
                   Both NLU systems can be combined to offer a
                        comprehensive speech experience

                     Structured NLU                                                     Unstructured NLU
                   NVC Hybrid allows users to
                    complete core phone
                      functions (dialing,
                                                                                   +   DragonGO! allows for intelligent
                                                                                          Web and Content access

                      messaging, etc…)




                                                                All Web and media
                                                              related queries can be
                                                              passed to unstructured
                                                                NLU system
                                                                 (e.g. DragonGo!)



14   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.                      MOBILE SOLUTIONS
Dragon Go! Directed Search
                                                                                   •   A specific site is referenced
                                                                                       in the query.

                                                                                   •   Today we support 180+
                                                                                       content providers
                                                                                       including…

                                                                                   •   CNN

                                                                                   •   eBay

                                                                                   •   Engadget

                                                                                   •   Facebook

                                                                                   •   New York Times

                                                                                   •   TechCrunch

                                                                                   •   USA Today

                                                                                   •   Regional Newspapers




15   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.          MOBILE SOLUTIONS
Dragon Go! Intent Search

                                                                                   • CALL a
                                                                                     business

                                                                                   • GET directions
                                                                                   • MAKE
                                                                                     reservations

                                                                                   • PLAY music
                                                                                   • BUY tickets,
                                                                                     products,
                                                                                     music

                                                                                   • More…


16   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.    MOBILE SOLUTIONS
Dragon Go! Category Search

                                                                                     • Music
                                                                                     • Movies
                                                                                     • Businesses
                                                                                     • Restaurants
                                                                                     • Sports
                                                                                     • News
                                                                                     • Shopping
                                                                                     • Weather
                                                                                     • More…
17   CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved.   MOBILE SOLUTIONS

Más contenido relacionado

La actualidad más candente

Cisco UCM Mobility Services
Cisco UCM Mobility ServicesCisco UCM Mobility Services
Cisco UCM Mobility Services
Cisco Russia
 
Scenarios for-context-aware-sip-07-a t kishore.pdf
Scenarios for-context-aware-sip-07-a t kishore.pdfScenarios for-context-aware-sip-07-a t kishore.pdf
Scenarios for-context-aware-sip-07-a t kishore.pdf
AT Kishore
 
Explore Microsoft Lync & Exchange 2013 Webinar
Explore Microsoft Lync & Exchange 2013  WebinarExplore Microsoft Lync & Exchange 2013  Webinar
Explore Microsoft Lync & Exchange 2013 Webinar
Sentri
 
Taller Redes Emergentes
Taller Redes EmergentesTaller Redes Emergentes
Taller Redes Emergentes
Mundo Contact
 
3. FOMS_ IMS services_Shane_Dempsey
3. FOMS_ IMS services_Shane_Dempsey3. FOMS_ IMS services_Shane_Dempsey
3. FOMS_ IMS services_Shane_Dempsey
FOMS011
 
Videoconferencing in heterogeneous environments
Videoconferencing in heterogeneous environmentsVideoconferencing in heterogeneous environments
Videoconferencing in heterogeneous environments
Videoguy
 

La actualidad más candente (17)

Tele dna mobile applications v 1.4
Tele dna mobile applications v 1.4Tele dna mobile applications v 1.4
Tele dna mobile applications v 1.4
 
Cisco UCM Mobility Services
Cisco UCM Mobility ServicesCisco UCM Mobility Services
Cisco UCM Mobility Services
 
WebSphere Connectivity & Integration: What's New in the Messaging Family?
WebSphere Connectivity & Integration: What's New in the Messaging Family?WebSphere Connectivity & Integration: What's New in the Messaging Family?
WebSphere Connectivity & Integration: What's New in the Messaging Family?
 
Junos Space SDK - Imagination, Ideas, Innovation
Junos Space SDK - Imagination, Ideas, InnovationJunos Space SDK - Imagination, Ideas, Innovation
Junos Space SDK - Imagination, Ideas, Innovation
 
Embrace Change
Embrace ChangeEmbrace Change
Embrace Change
 
Scenarios for-context-aware-sip-07-a t kishore.pdf
Scenarios for-context-aware-sip-07-a t kishore.pdfScenarios for-context-aware-sip-07-a t kishore.pdf
Scenarios for-context-aware-sip-07-a t kishore.pdf
 
07 a t kishore.pdf
07 a t kishore.pdf07 a t kishore.pdf
07 a t kishore.pdf
 
Lync to the Future: Skype, Mobile, Meetings & Video
Lync to the Future: Skype, Mobile, Meetings & VideoLync to the Future: Skype, Mobile, Meetings & Video
Lync to the Future: Skype, Mobile, Meetings & Video
 
Explore Microsoft Lync & Exchange 2013 Webinar
Explore Microsoft Lync & Exchange 2013  WebinarExplore Microsoft Lync & Exchange 2013  Webinar
Explore Microsoft Lync & Exchange 2013 Webinar
 
Taller Redes Emergentes
Taller Redes EmergentesTaller Redes Emergentes
Taller Redes Emergentes
 
3. FOMS_ IMS services_Shane_Dempsey
3. FOMS_ IMS services_Shane_Dempsey3. FOMS_ IMS services_Shane_Dempsey
3. FOMS_ IMS services_Shane_Dempsey
 
Next Generation UC Clients and Endpoints
Next Generation UC Clients and EndpointsNext Generation UC Clients and Endpoints
Next Generation UC Clients and Endpoints
 
Videoconferencing in heterogeneous environments
Videoconferencing in heterogeneous environmentsVideoconferencing in heterogeneous environments
Videoconferencing in heterogeneous environments
 
The Open Splice.Org Community
The Open Splice.Org CommunityThe Open Splice.Org Community
The Open Splice.Org Community
 
IEEE_multimedia_2000
IEEE_multimedia_2000IEEE_multimedia_2000
IEEE_multimedia_2000
 
Technology Development and Innovation at Cisco
Technology Development and Innovation at CiscoTechnology Development and Innovation at Cisco
Technology Development and Innovation at Cisco
 
The NGN Test Centre Infrastructure & Services - Shane Dempsey (NGN Test Centre)
The NGN Test Centre Infrastructure & Services - Shane Dempsey (NGN Test Centre)The NGN Test Centre Infrastructure & Services - Shane Dempsey (NGN Test Centre)
The NGN Test Centre Infrastructure & Services - Shane Dempsey (NGN Test Centre)
 

Destacado

Destacado (20)

Digital delight: customer service in digital age
Digital delight: customer service in digital ageDigital delight: customer service in digital age
Digital delight: customer service in digital age
 
Trends Reshaping the Future of Customer Service
Trends Reshaping the Future of Customer Service Trends Reshaping the Future of Customer Service
Trends Reshaping the Future of Customer Service
 
Remarkable Customer Journeys
Remarkable Customer JourneysRemarkable Customer Journeys
Remarkable Customer Journeys
 
Tendencias 2016 17 Virtual Agents
Tendencias 2016 17   Virtual AgentsTendencias 2016 17   Virtual Agents
Tendencias 2016 17 Virtual Agents
 
Nuance at sabio Transforming Customer Contact 2015
Nuance at sabio Transforming Customer Contact 2015Nuance at sabio Transforming Customer Contact 2015
Nuance at sabio Transforming Customer Contact 2015
 
Multichannel Customer Journeys
Multichannel Customer JourneysMultichannel Customer Journeys
Multichannel Customer Journeys
 
Botego - Sanal müşteri temsilcileri
Botego - Sanal müşteri temsilcileriBotego - Sanal müşteri temsilcileri
Botego - Sanal müşteri temsilcileri
 
The Autonomous Customer: Trends shaping the future of customer service
The Autonomous Customer: Trends shaping the future of customer serviceThe Autonomous Customer: Trends shaping the future of customer service
The Autonomous Customer: Trends shaping the future of customer service
 
Chris ezekiel Building customer relationships with intelligent virtual agents...
Chris ezekiel Building customer relationships with intelligent virtual agents...Chris ezekiel Building customer relationships with intelligent virtual agents...
Chris ezekiel Building customer relationships with intelligent virtual agents...
 
Aspect voxeo enhancing your self service environment for today's mobile customer
Aspect voxeo enhancing your self service environment for today's mobile customerAspect voxeo enhancing your self service environment for today's mobile customer
Aspect voxeo enhancing your self service environment for today's mobile customer
 
Present & Future: Customer Service
Present & Future: Customer ServicePresent & Future: Customer Service
Present & Future: Customer Service
 
Back to the future of customer service / Part 1: Peer to Peer customer support
Back to the future of customer service / Part 1: Peer to Peer customer supportBack to the future of customer service / Part 1: Peer to Peer customer support
Back to the future of customer service / Part 1: Peer to Peer customer support
 
Digital transformation at EMC Forum 2014
Digital transformation at EMC Forum 2014Digital transformation at EMC Forum 2014
Digital transformation at EMC Forum 2014
 
The future of customer service (is here)
The future of customer service (is here)The future of customer service (is here)
The future of customer service (is here)
 
On the future of customer service
On the future of customer serviceOn the future of customer service
On the future of customer service
 
Reinventing finance and accounting through automation
Reinventing finance and accounting through automationReinventing finance and accounting through automation
Reinventing finance and accounting through automation
 
trendwatching.com's THE FUTURE OF CUSTOMER SERVICE
trendwatching.com's THE FUTURE OF CUSTOMER SERVICEtrendwatching.com's THE FUTURE OF CUSTOMER SERVICE
trendwatching.com's THE FUTURE OF CUSTOMER SERVICE
 
The Blended Customer Experience
The Blended Customer ExperienceThe Blended Customer Experience
The Blended Customer Experience
 
Routing Your Way to Service Nirvana with Omni-Channel
Routing Your Way to Service Nirvana with Omni-ChannelRouting Your Way to Service Nirvana with Omni-Channel
Routing Your Way to Service Nirvana with Omni-Channel
 
Self Services Trends
Self Services TrendsSelf Services Trends
Self Services Trends
 

Similar a Nuance

El video en un mundo de colaboración
El video en un mundo de colaboraciónEl video en un mundo de colaboración
El video en un mundo de colaboración
Mundo Contact
 
Nathan Winters What’s New And Cool In Ocs 2007 R2
Nathan Winters   What’s New And Cool In Ocs 2007 R2Nathan Winters   What’s New And Cool In Ocs 2007 R2
Nathan Winters What’s New And Cool In Ocs 2007 R2
Nathan Winters
 
Signify Software Tokens
Signify Software TokensSignify Software Tokens
Signify Software Tokens
pjpallen
 
Signify Software Tokens
Signify Software TokensSignify Software Tokens
Signify Software Tokens
kate_holden
 
Hd Connect Spec Sheet V1 6
Hd Connect Spec Sheet V1 6Hd Connect Spec Sheet V1 6
Hd Connect Spec Sheet V1 6
Tom Luketich
 

Similar a Nuance (20)

An Overview of All Ericsson Labs APIs
An Overview of All Ericsson Labs APIsAn Overview of All Ericsson Labs APIs
An Overview of All Ericsson Labs APIs
 
Mwc wip jam jabber sdk final
Mwc wip jam jabber sdk finalMwc wip jam jabber sdk final
Mwc wip jam jabber sdk final
 
01 introduction
01 introduction01 introduction
01 introduction
 
El video en un mundo de colaboración
El video en un mundo de colaboraciónEl video en un mundo de colaboración
El video en un mundo de colaboración
 
Radisys - Engage Digital - TADSummit Nov 2022
Radisys - Engage Digital - TADSummit Nov 2022Radisys - Engage Digital - TADSummit Nov 2022
Radisys - Engage Digital - TADSummit Nov 2022
 
Streaming Multimedia content distribution system using mobile application by...
Streaming  Multimedia content distribution system using mobile application by...Streaming  Multimedia content distribution system using mobile application by...
Streaming Multimedia content distribution system using mobile application by...
 
Mobile Web Security Bootstrap on Ericsson Labs
Mobile Web Security Bootstrap on Ericsson LabsMobile Web Security Bootstrap on Ericsson Labs
Mobile Web Security Bootstrap on Ericsson Labs
 
Eyeball Messenger SDK V10.0 Developer Reference Guide
Eyeball Messenger SDK V10.0 Developer Reference GuideEyeball Messenger SDK V10.0 Developer Reference Guide
Eyeball Messenger SDK V10.0 Developer Reference Guide
 
Nathan Winters What’s New And Cool In Ocs 2007 R2
Nathan Winters   What’s New And Cool In Ocs 2007 R2Nathan Winters   What’s New And Cool In Ocs 2007 R2
Nathan Winters What’s New And Cool In Ocs 2007 R2
 
VoLTE & RCS Revolutionizing Enterprise UC
VoLTE & RCS Revolutionizing Enterprise UCVoLTE & RCS Revolutionizing Enterprise UC
VoLTE & RCS Revolutionizing Enterprise UC
 
Cidway Banking 02 2011
Cidway Banking 02 2011Cidway Banking 02 2011
Cidway Banking 02 2011
 
Signify Software Tokens
Signify Software TokensSignify Software Tokens
Signify Software Tokens
 
Signify Software Tokens
Signify Software TokensSignify Software Tokens
Signify Software Tokens
 
Hd Connect Spec Sheet V1 6
Hd Connect Spec Sheet V1 6Hd Connect Spec Sheet V1 6
Hd Connect Spec Sheet V1 6
 
Next Generation Video Services Fundamentals
Next Generation Video Services FundamentalsNext Generation Video Services Fundamentals
Next Generation Video Services Fundamentals
 
COLLABORATION
COLLABORATIONCOLLABORATION
COLLABORATION
 
Monetizing the Enterprise: Borderless Networks
Monetizing the Enterprise: Borderless NetworksMonetizing the Enterprise: Borderless Networks
Monetizing the Enterprise: Borderless Networks
 
Distributed Shared Memory on Ericsson Labs
Distributed Shared Memory on Ericsson LabsDistributed Shared Memory on Ericsson Labs
Distributed Shared Memory on Ericsson Labs
 
Market Study on Mobile Authentication
Market Study on Mobile AuthenticationMarket Study on Mobile Authentication
Market Study on Mobile Authentication
 
Macadamian And Junos SDK
Macadamian And Junos SDKMacadamian And Junos SDK
Macadamian And Junos SDK
 

Más de Droidcon Berlin

Droidcon de 2014 google cast
Droidcon de 2014   google castDroidcon de 2014   google cast
Droidcon de 2014 google cast
Droidcon Berlin
 
Android programming -_pushing_the_limits
Android programming -_pushing_the_limitsAndroid programming -_pushing_the_limits
Android programming -_pushing_the_limits
Droidcon Berlin
 
Android industrial mobility
Android industrial mobility Android industrial mobility
Android industrial mobility
Droidcon Berlin
 
From sensor data_to_android_and_back
From sensor data_to_android_and_backFrom sensor data_to_android_and_back
From sensor data_to_android_and_back
Droidcon Berlin
 
new_age_graphics_android_x86
new_age_graphics_android_x86new_age_graphics_android_x86
new_age_graphics_android_x86
Droidcon Berlin
 
Testing and Building Android
Testing and Building AndroidTesting and Building Android
Testing and Building Android
Droidcon Berlin
 
Matchinguu droidcon presentation
Matchinguu droidcon presentationMatchinguu droidcon presentation
Matchinguu droidcon presentation
Droidcon Berlin
 
Cgm life sdk_droidcon_2014_v3
Cgm life sdk_droidcon_2014_v3Cgm life sdk_droidcon_2014_v3
Cgm life sdk_droidcon_2014_v3
Droidcon Berlin
 
The artofcalabash peterkrauss
The artofcalabash peterkraussThe artofcalabash peterkrauss
The artofcalabash peterkrauss
Droidcon Berlin
 
Raesch, gries droidcon 2014
Raesch, gries   droidcon 2014Raesch, gries   droidcon 2014
Raesch, gries droidcon 2014
Droidcon Berlin
 
Android open gl2_droidcon_2014
Android open gl2_droidcon_2014Android open gl2_droidcon_2014
Android open gl2_droidcon_2014
Droidcon Berlin
 
20140508 quantified self droidcon
20140508 quantified self droidcon20140508 quantified self droidcon
20140508 quantified self droidcon
Droidcon Berlin
 
Tuning android for low ram devices
Tuning android for low ram devicesTuning android for low ram devices
Tuning android for low ram devices
Droidcon Berlin
 
Froyo to kit kat two years developing & maintaining deliradio
Froyo to kit kat   two years developing & maintaining deliradioFroyo to kit kat   two years developing & maintaining deliradio
Froyo to kit kat two years developing & maintaining deliradio
Droidcon Berlin
 
Droidcon2013 security genes_trendmicro
Droidcon2013 security genes_trendmicroDroidcon2013 security genes_trendmicro
Droidcon2013 security genes_trendmicro
Droidcon Berlin
 

Más de Droidcon Berlin (20)

Droidcon de 2014 google cast
Droidcon de 2014   google castDroidcon de 2014   google cast
Droidcon de 2014 google cast
 
Android programming -_pushing_the_limits
Android programming -_pushing_the_limitsAndroid programming -_pushing_the_limits
Android programming -_pushing_the_limits
 
crashing in style
crashing in stylecrashing in style
crashing in style
 
Raspberry Pi
Raspberry PiRaspberry Pi
Raspberry Pi
 
Android industrial mobility
Android industrial mobility Android industrial mobility
Android industrial mobility
 
Details matter in ux
Details matter in uxDetails matter in ux
Details matter in ux
 
From sensor data_to_android_and_back
From sensor data_to_android_and_backFrom sensor data_to_android_and_back
From sensor data_to_android_and_back
 
droidparts
droidpartsdroidparts
droidparts
 
new_age_graphics_android_x86
new_age_graphics_android_x86new_age_graphics_android_x86
new_age_graphics_android_x86
 
5 tips of monetization
5 tips of monetization5 tips of monetization
5 tips of monetization
 
Testing and Building Android
Testing and Building AndroidTesting and Building Android
Testing and Building Android
 
Matchinguu droidcon presentation
Matchinguu droidcon presentationMatchinguu droidcon presentation
Matchinguu droidcon presentation
 
Cgm life sdk_droidcon_2014_v3
Cgm life sdk_droidcon_2014_v3Cgm life sdk_droidcon_2014_v3
Cgm life sdk_droidcon_2014_v3
 
The artofcalabash peterkrauss
The artofcalabash peterkraussThe artofcalabash peterkrauss
The artofcalabash peterkrauss
 
Raesch, gries droidcon 2014
Raesch, gries   droidcon 2014Raesch, gries   droidcon 2014
Raesch, gries droidcon 2014
 
Android open gl2_droidcon_2014
Android open gl2_droidcon_2014Android open gl2_droidcon_2014
Android open gl2_droidcon_2014
 
20140508 quantified self droidcon
20140508 quantified self droidcon20140508 quantified self droidcon
20140508 quantified self droidcon
 
Tuning android for low ram devices
Tuning android for low ram devicesTuning android for low ram devices
Tuning android for low ram devices
 
Froyo to kit kat two years developing & maintaining deliradio
Froyo to kit kat   two years developing & maintaining deliradioFroyo to kit kat   two years developing & maintaining deliradio
Froyo to kit kat two years developing & maintaining deliradio
 
Droidcon2013 security genes_trendmicro
Droidcon2013 security genes_trendmicroDroidcon2013 security genes_trendmicro
Droidcon2013 security genes_trendmicro
 

Último

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Último (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 

Nuance

  • 1. Voice Control combined with Speech-To-Text and NLU resulting in Smart UI Reimund Schmald, Nuance Stefan Seide, T-Systems 1 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 2. This is what we are working on! 2 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS Scene from Star Trek IV: The Voyage Home (1986)
  • 3. Agenda • Multi-Modal Input UE: Status and Trends in Mobile • Voice enabled NLU: Requirements + Demo • Hybrid Architecture, Programming Example 3 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 4. Starting with Keyboard Type Write Speak Swype 4 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 5. Multi-Modality in Apps Example: amazon and iTranslate 5 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 6. Security and Personalization From completing a financial transaction to accessing sensitive content Voice Biometrics offers security so you can proceed with confidence. Through speaker identification Voice Biometrics delivers a personalized experience where “My voice is my various users profiles are password” available, e.g. shared devices such as the TV or tablets. Simply speak, and your personal settings are loaded. 6 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 7. Personalization – Across Devices Text Dictionary (Local) Speech Dictionary (Cloud) 7 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 8. Just the Mic Button Requirements: High Quality SpeechToText + NLU 8 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 9. High Quality SpeechToText on NDEV The Industry’s FIRST developer program to offer Speech To Text and Text to Speech integration for any mobile app 8000+ developers registered to date iOS, Android, WP 7 www.ndevmobile.com 9 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 10. NDEV Mobile: Client SDK Technical Aspects SDK Components – Recognizer object – Audio engine Client Application – End of Speech Detection 1 8 – Encoding (compresses request to conserve bandwidth) – Network Transport Dragon SDK Recogniser Audio End Of Server Components 2 Object Engine Speech 7 – Authentication Network Encoding Transport – Recognizer – Vocalizer TTS 3 1. Client application invokes SDK 2. SDK captures request and encodes it 6 • Might use End of Speech, if enabled Authentication 1. SDK Network Transport sends utterance to NVC Servers 2. NVC Server authenticates Client app 4 MREC Vocalizer 3. Recognizer/TTS processes request 5 4. NVC Server redirects response to Client Search Dictation 5. SDK processes response and sends to Client app 6. Client app plays/shows response NVC Hosted Server 10 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 11. NDev mobile Service Levels FREE 11 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 12. Feature Comparison: Gold, Silver, Emerald Silver Gold Emerald Features ASR Dictation & Search Models ü ü ü for 18 Languages Network TTS for 30+ Languages ü ü ü Bluetooth Support (8 KHz) ü ü ü SSL ü ü Customized Features ü Flexibility & Customization UI ü ü ü Platforms Android, iOS, W P7 ü ü ü HTTP ü ü Consulting Services Available Availability & Support Centralized Speech Resource & ü ü ü Support Forums W eb Ticketing ü ü SLA ü ü Dedicated Support Contact Available Cost Development Free for 90 days Payment Options Custom Production Free w/ cap $0.009 trx or $0.24 flat Custom 12 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 13. NUANCE PROPRIETARY NON-DISCLOSURE INFORMATION Different Levels of NLU Structured NLU Unstructured NLU Embedded & Connected Server-side natural language speech systems working understanding platform that together to determine what supports open-ended queries specific phone-related task the and intent classification. user is looking to complete. “Is it raining in Berlin?” “Send text to John Call me “What movies are playing near shortly” me?” “Search for New York Yankees” “Make a reservation to Capital “Update Facebook I am today in Grille in Burlington for 8 pm on Berlin” Friday for 2 people” 13 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 14. NUANCE PROPRIETARY NON-DISCLOSURE INFORMATION Deploying a Comprehensive Speech Solution Both NLU systems can be combined to offer a comprehensive speech experience Structured NLU Unstructured NLU NVC Hybrid allows users to complete core phone functions (dialing, + DragonGO! allows for intelligent Web and Content access messaging, etc…) All Web and media related queries can be passed to unstructured NLU system (e.g. DragonGo!) 14 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 15. Dragon Go! Directed Search • A specific site is referenced in the query. • Today we support 180+ content providers including… • CNN • eBay • Engadget • Facebook • New York Times • TechCrunch • USA Today • Regional Newspapers 15 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 16. Dragon Go! Intent Search • CALL a business • GET directions • MAKE reservations • PLAY music • BUY tickets, products, music • More… 16 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS
  • 17. Dragon Go! Category Search • Music • Movies • Businesses • Restaurants • Sports • News • Shopping • Weather • More… 17 CONFIDENTIAL | © 2002-2011 Nuance Communications, Inc. All rights reserved. MOBILE SOLUTIONS