SpeeG
A Multimodal Speech- and Gesture-based Text Input Solution
Lode Hoste, Bruno Dumas and Beat Signer
Text-input for set-top boxes




Vrije Universiteit Brussel   SpeeG - Lode Hoste   2
1D Keyboard for Kinect   Chatpad Controller   Virtual Keyboard for Xbox
SwiftKey                 8Pen                 EdgeWriter
Dasher                   Speech Dasher        SpeeG
Virtual keyboard




Kinect 1D keyboard




Dasher


 Continuous input
 Joystick / Gaze / ...
 Open vocabulary
 Allows imprecise navigation
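
The bullets above can be illustrated with a minimal sketch of how a Dasher-style interface lays out its next-symbol boxes. Dasher sizes each box in proportion to the probability a language model assigns to that symbol, so likely letters are large targets and a rough pointer position is usually enough. The function names `allocate_intervals` and `select`, and the example probabilities, are hypothetical and not taken from the slides or from the Dasher code base.

```python
from typing import Dict, List, Tuple

Interval = Tuple[str, float, float]


def allocate_intervals(probs: Dict[str, float],
                       low: float = 0.0,
                       high: float = 1.0) -> List[Interval]:
    """Split [low, high) into one box per symbol, sized by probability.

    Symbols are laid out in alphabetical order, as in Dasher.
    """
    total = sum(probs.values())
    if total <= 0:
        raise ValueError("probabilities must sum to a positive value")
    span = high - low
    intervals: List[Interval] = []
    start = low
    for symbol in sorted(probs):
        size = span * probs[symbol] / total
        intervals.append((symbol, start, start + size))
        start += size
    return intervals


def select(intervals: List[Interval], pointer_y: float) -> str:
    """Return the symbol whose box contains the pointer position."""
    for symbol, start, end in intervals:
        if start <= pointer_y < end:
            return symbol
    # Pointer at or beyond the upper edge: fall back to the last box.
    return intervals[-1][0]


# Illustrative probabilities only (not measured data).
boxes = allocate_intervals({"a": 0.5, "b": 0.25, "c": 0.25})
print(boxes)
print(select(boxes, 0.6))
```

Because every symbol with a non-zero probability keeps a box, any string can still be written (open vocabulary); unlikely symbols simply need more precise pointing.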




Goals:                 Used technologies:
  Controller-free        Kinect
  Text input             CMU Sphinx
  Without training       Dasher
SpeeG




SpeeG Architecture

[Figure: architecture diagram with four components: User, GUI (JDasher), Speech Recogniser (CMU Sphinx 4) and Hand Tracking (Microsoft Kinect and NITE); steps numbered 1 to 5]
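
The slide names the components but the transcript does not preserve how data flows between them. As a rough sketch under that caveat, one plausible way for a speech recogniser to feed a character-level interface is to turn its scored word hypotheses into a next-character distribution. Everything here is an assumption for illustration: the function `next_char_distribution`, the `smoothing` parameter, the alphabet and the example hypotheses are hypothetical, and nothing below uses the CMU Sphinx 4 or JDasher APIs.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def next_char_distribution(hypotheses: List[Tuple[str, float]],
                           prefix: str,
                           smoothing: float = 0.01,
                           alphabet: str = "abcdefghijklmnopqrstuvwxyz "
                           ) -> Dict[str, float]:
    """Estimate P(next character) from scored word hypotheses.

    hypotheses: (word, score) pairs, e.g. from a recogniser's n-best list.
    prefix: characters of the current word already entered by the user.
    smoothing: small weight given to every character so that words the
               recogniser missed can still be spelled out.
    """
    weights: Dict[str, float] = defaultdict(float)
    for word, score in hypotheses:
        if word.startswith(prefix):
            # A fully typed word is followed by a space.
            nxt = word[len(prefix)] if len(word) > len(prefix) else " "
            weights[nxt] += score
    # Characters outside the alphabet are ignored in this sketch.
    dist = {ch: weights.get(ch, 0.0) + smoothing for ch in alphabet}
    total = sum(dist.values())
    return {ch: w / total for ch, w in dist.items()}


# Hypothetical n-best list for one spoken word.
hyps = [("water", 0.6), ("waiter", 0.3), ("later", 0.1)]
dist = next_char_distribution(hyps, "wa")
print(max(dist, key=dist.get))
```

In this sketch the hand-tracking side would only need to supply a pointer position; the distribution decides how much screen space each candidate character receives.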
Evaluation

Virtual Keyboard        Kinect Keyboard
Speech-only             SpeeG
Evaluation

7 (male) users: 23-31y

Sentences 1-3: DARPA's TIMIT
  "this was easy for us"
  "he will allow a rare lie"
  "did you eat yet"

Sentences 4-6: MacKenzie and Soukoreff
  "my watch fell in the water"
  "the world is a stage"
  "peek out the window"

Performed a quantitative (words per minute and number of errors)
and qualitative (feedback and preference) evaluation
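
The two quantitative measures can be sketched as follows. The slides do not say exactly how words per minute or errors were computed, so this assumes the common text-entry convention of five characters per word (some definitions subtract one character from the length first) and counts errors as a word-level edit distance. The function names and the timing value are hypothetical.

```python
def words_per_minute(transcribed: str, seconds: float,
                     chars_per_word: int = 5) -> float:
    """WPM with the usual 'one word = five characters' convention."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return (len(transcribed) / chars_per_word) * (60.0 / seconds)


def word_errors(reference: str, transcribed: str) -> int:
    """Word-level edit distance (insertions, deletions, substitutions).

    This is one possible definition of 'number of errors'; the study's
    own definition is not given on the slide.
    """
    ref, hyp = reference.split(), transcribed.split()
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1,              # word missing from hyp
                           cur[j - 1] + 1,           # extra word in hyp
                           prev[j - 1] + (r != h)))  # match or substitution
        prev = cur
    return prev[-1]


# Sentence from the evaluation set; the 60 s duration is made up.
print(words_per_minute("my watch fell in the water", 60.0))
print(word_errors("the world is a stage", "the word is stage"))
```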
Virtual keyboard
6.3 WPM

[Chart: WPM (axis 0 to 10) per sentence S1-S6, one series per user, Users 1-7]
Kinect Keyboard
1.83 WPM

[Chart: WPM (axis 0.00 to 3.50) per sentence S1-S6, one series per user, Users 1-7; User 7 is marked with an asterisk in the legend]
Speech-only
11 WPM

[Chart: WPM (axis 0 to 40) per sentence S1-S6, one series per user, Users 1-7]
SpeeG
5.8 WPM

[Chart: WPM (axis 0 to 10) per sentence S1-S6, one series per user, Users 1-7]
SpeeG
2.6 7.8 WPM

[Chart: WPM (axis 0 to 10) per sentence S1-S6, one series per user, Users 1-7]
Mean WPM per sentence and input device

[Chart: mean WPM (axis 0 to 25) per sentence S1-S6; series: Controller, Speech only, Kinect only, SpeeG]
Errors per sentence and input device

[Chart: mean number of errors (axis 0 to 10) per sentence S1-S6; series: Controller, Speech only, Kinect only, SpeeG]
Future work
 Other visualisations
 Smaller gestures
 Dedicated commands (gesture / voice)
SpeeG
A Multimodal Speech- and Gesture-based Text Input Solution
Lode Hoste, Bruno Dumas, Beat Signer

   Kinect                                 Speech

   - Controller-free text input           - Non-native speakers
   - Real-time correction                 - Untrained voice recogniser
   - Dasher, zoomable interface           - 6-12 WPM
     - probabilities                      - Perceived fastest
     - alphabetic order                   - Game-like character
     - character-level                    - Novice and experts

Special thanks to Jorn De Baerdenmaeker and Keith Vertaenen

dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
dkNET Webinar "Texera: A Scalable Cloud Computing Platform for Sharing Data a...
 
IDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicineIDENTIFICATION OF THE LIVING- forensic medicine
IDENTIFICATION OF THE LIVING- forensic medicine
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 

SpeeG - A Multimodal Speech- and Gesture-based Text Input Solution

  • 1. SpeeG: A Multimodal Speech- and Gesture-based Text Input Solution. Lode Hoste, Bruno Dumas and Beat Signer
  • 2. Text-input for set-top boxes Vrije Universiteit Brussel SpeeG - Lode Hoste 2
  • 5. Text-input for set-top boxes
  • 6. 1D Keyboard for Kinect, Chatpad Controller, Virtual Keyboard for Xbox, SwiftKey, 8Pen, EdgeWriter, Dasher, Speech Dasher, SpeeG
  • 7. Virtual keyboard
  • 8. Kinect 1D keyboard
  • 9. Kinect 1D keyboard
  • 10. 1D Keyboard for Kinect, Chatpad Controller, Virtual Keyboard for Xbox, SwiftKey, 8Pen, EdgeWriter, Dasher, Speech Dasher, SpeeG
  • 11. 1D Keyboard for Kinect, Chatpad Controller, Virtual Keyboard for Xbox, SwiftKey, 8Pen, EdgeWriter, Dasher, Speech Dasher, SpeeG
  • 12. Dasher: continuous input; joystick / gaze / ...; open vocabulary; allows imprecise navigation
  • 13. Dasher
  • 14. Goals: controller-free, text input, without training. Used technologies: Kinect, CMU Sphinx, Dasher
  • 15. SpeeG
  • 17. SpeeG Architecture (diagram, steps numbered 1 to 5): User; GUI (JDasher); Speech Recogniser (CMU Sphinx 4); Hand Tracking (Microsoft Kinect and NITE)
  • 18. Evaluation: Virtual Keyboard, Kinect Keyboard, Speech-only, SpeeG
  • 19. Evaluation: 7 (male) users, aged 23-31. Sentences 1-3 (DARPA's TIMIT): "this was easy for us", "he will allow a rare lie", "did you eat yet". Sentences 4-6 (MacKenzie and Soukoreff): "my watch fell in the water", "the world is a stage", "peek out the window". Performed a quantitative (words per minute and number of errors) and qualitative (feedback and preference) evaluation
  • 20. Virtual keyboard: 6.3 WPM (chart: WPM per sentence, S1 to S6, for Users 1 to 7)
  • 21. Kinect Keyboard: 1.83 WPM (chart: WPM per sentence, S1 to S6, for Users 1 to 7; User 7 marked with an asterisk)
  • 22. Speech-only: 11 WPM (chart: WPM per sentence, S1 to S6, for Users 1 to 7)
  • 23. SpeeG: 5.8 WPM (chart: WPM per sentence, S1 to S6, for Users 1 to 7)
  • 24. SpeeG: 2.6 7.8 WPM (chart: WPM per sentence, S1 to S6, for Users 1 to 7)
  • 25. Mean WPM per sentence and input device (chart: WPM for sentences S1 to S6; series: Controller, Speech only, Kinect only, SpeeG)
  • 26. Errors per sentence and input device (chart: mean number of errors for sentences S1 to S6; series: Controller, Speech only, Kinect only, SpeeG)
  • 28. Future work: other visualisations; smaller gestures; dedicated commands (gesture / voice)
  • 30. SpeeG: A Multimodal Speech- and Gesture-based Text Input Solution. Lode Hoste, Bruno Dumas, Beat Signer. Kinect / Speech: controller-free text input; non-native speakers; real-time correction; untrained voice recogniser; Dasher, zoomable interface; 6-12 WPM; probabilities; perceived fastest; alphabetic order; game-like character; character-level; novice and experts. Special thanks to Jorn De Baerdenmaeker and Keith Vertanen
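
The evaluation slides report typing speed in words per minute (WPM) but do not say how it was computed. As a minimal sketch, assuming the common text-entry convention that one "word" is five characters (including spaces), WPM for a single sentence could be calculated as follows. The function name `words_per_minute` and the 40-second timing are hypothetical and not taken from the slides.

```python
def words_per_minute(transcribed: str, seconds: float, chars_per_word: int = 5) -> float:
    """Return typing speed in words per minute.

    Assumption (not from the slides): a "word" is `chars_per_word`
    characters, spaces included, as is common in text-entry studies.
    """
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    words = len(transcribed) / chars_per_word
    return words * 60.0 / seconds


# Hypothetical timing: one of the test sentences entered in 40 seconds.
sentence = "the world is a stage"  # 20 characters, i.e. 4 five-character words
print(words_per_minute(sentence, 40.0))
```

Some studies subtract one character from the length before dividing; the exact variant used for the reported figures is not stated in the slides.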