Speech Enhanced Gesture Based Navigation System for Google Maps
An exploration in Multimodal HCI
Under the Guidance of: Asst. Professor Manoj Majhi
Vikas Luthra | Himanshu Bansal | Maulishree Pandey
Goal of Our Journey
Abstract
• The conventional way to use the features of Google Maps on touch-based devices relies on
the touch gestures defined for those devices.
• For certain touch-based devices, such as public kiosks and large touch-screens, it is possible to define
in-air (3D) gestures.
• Coupled with basic speech commands, these gestures form a new group of interactions for
accessing Google Maps.
• However, it is important to measure the usability of this new group of interactions against the
conventional touch-based gestures before substitution is considered.
Final Destination: Aim
• Define the gestures and speech commands for the features of Google Maps, and evaluate them
against the existing interactions
• Compare and evaluate the usability of 3D gestures and speech against touch-based gestures
for using Google Maps on a large touchscreen
The Route to follow for our Journey: Methodology
Literature Research (Aug 1st week – Sept 1st week)
Background of the technologies
Multimodal HCI theory
Similar Works
System Definition and Design (Sept 2nd week – Oct 1st week)
Decide the case-study features of Google Maps
Use-case scenarios
Feature-wise gesture definition
Addition of voice commands where gesture control is not applicable
Prototype Development (Oct 2nd week – Nov 4th week)
Skeleton-Based Gesture Tracking System Development
Speech Recognition System Development
Debugging and Refinement
Comparative Study (Next Semester)
Experiments comparing the two solutions, which use different gestures and voice
commands
Statistical analysis
Conclusion (Next Semester)
Inferences and Guidelines
Mode of Transportation : Microsoft Kinect
Microsoft Kinect
• The Kinect sensor can build a 'depth map' of the area in front of it.
• This depth map is used to determine the distance of objects in front of the Kinect.
• One popular use is recognizing and tracking people standing in front of the sensor.
• The Kinect has four microphones to pick up audio.
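As a rough illustration of how a depth map yields distances, the sketch below treats a depth frame as a small grid of millimetre readings and finds the nearest valid point. The grid values, the convention that 0 means "no reading", and the function name are assumptions made for this example; none of them come from the Kinect SDK or from the project itself.

```python
from typing import List, Optional, Tuple

# Hypothetical depth frame: each cell is a distance in millimetres.
# Assumption for this sketch: 0 marks a pixel with no valid reading.
DepthFrame = List[List[int]]


def nearest_point(frame: DepthFrame) -> Optional[Tuple[int, int, int]]:
    """Return (row, col, depth_mm) of the closest valid pixel, or None."""
    best = None
    for r, row in enumerate(frame):
        for c, depth in enumerate(row):
            if depth <= 0:
                continue  # skip pixels without a reading
            if best is None or depth < best[2]:
                best = (r, c, depth)
    return best


# Made-up 3x3 frame used only to exercise the function.
frame = [
    [0, 2400, 2390],
    [2100, 1250, 2380],
    [2150, 1300, 0],
]
print(nearest_point(frame))  # (1, 1, 1250)
```

A person standing in front of a wall would show up as a cluster of such near readings, which is the kind of cue that people tracking can build on.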
Kinect for Windows SDK
• Microsoft provides this SDK free of charge for use and experimentation, but without
permission for commercial distribution. The SDK contains APIs that track people
in front of the Kinect and provide the coordinates of their body joints.
• There are APIs that recognize basic, common hand gestures such as grip and release.
• Speech APIs are provided to capture audio and use it programmatically.
“We will use the Kinect for Windows SDK with a Kinect for Xbox 360 sensor to design gestures
and to recognize certain speech commands. Development will take place in Microsoft
Visual Studio 2010, using the C# programming language.”
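To make the idea of defining a gesture from joint coordinates concrete, here is a minimal sketch of a horizontal swipe detector. It is written in Python for brevity rather than the C# the project plans to use, it does not call the Kinect SDK, and the sample format, thresholds, and function name are all hypothetical choices for illustration.

```python
from typing import List, Optional, Tuple

# Hypothetical sample: (time in seconds, hand x position in metres).
Sample = Tuple[float, float]


def detect_swipe(samples: List[Sample],
                 min_distance: float = 0.30,
                 max_duration: float = 0.50) -> Optional[str]:
    """Return 'right' or 'left' if the hand moved far enough, fast enough."""
    if len(samples) < 2:
        return None
    t0, x0 = samples[0]
    t1, x1 = samples[-1]
    if t1 - t0 > max_duration:
        return None  # movement too slow to count as a swipe
    dx = x1 - x0
    if dx >= min_distance:
        return "right"
    if dx <= -min_distance:
        return "left"
    return None  # hand did not travel far enough


# Made-up hand track: 0.4 m to the right in 0.3 s.
track = [(0.0, -0.2), (0.1, -0.05), (0.2, 0.1), (0.3, 0.2)]
print(detect_swipe(track))  # right
```

In a map application, a result such as "right" could be bound to panning the map; the thresholds would need tuning with real users, which is part of what the comparative study is meant to inform.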
Mode of Transportation : Speech Recognition
What is needed
1. Acoustic Model
Probabilistic models that try to connect voice utterances with their
transcriptions in the training data
2. Language Model
Unigram, bigram, and trigram (n-gram) models
Not much is needed in our case
3. Mapping Dictionary
Grapheme-to-phoneme mapping
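The language model item above can be illustrated with a tiny bigram model. The sketch below estimates next-word probabilities from a handful of command phrases; the phrases are hypothetical, since the deck does not list its command set, and the code stands in for the concept only, not for any of the toolkits named in this deck.

```python
from collections import Counter, defaultdict
from typing import Dict, List

# Hypothetical command phrases, invented for this example.
COMMANDS = ["zoom in", "zoom out", "show traffic", "show satellite"]


def train_bigrams(phrases: List[str]) -> Dict[str, Dict[str, float]]:
    """Estimate P(next word | previous word) by relative frequency."""
    counts = defaultdict(Counter)
    for phrase in phrases:
        # <s> and </s> mark the start and end of each phrase.
        words = ["<s>"] + phrase.split() + ["</s>"]
        for prev, cur in zip(words, words[1:]):
            counts[prev][cur] += 1
    model = {}
    for prev, nxt in counts.items():
        total = sum(nxt.values())
        model[prev] = {w: n / total for w, n in nxt.items()}
    return model


model = train_bigrams(COMMANDS)
print(model["zoom"])  # {'in': 0.5, 'out': 0.5}
print(model["<s>"])   # {'zoom': 0.5, 'show': 0.5}
```

With a small, fixed command vocabulary the whole table stays this small, which is one way to read the note that not much language modelling is needed here.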
Current Challenges
1. Large variability in accents
2. Variability in speaker gender
3. Surrounding noise
4. Very large number of city and place names
Development Tools
1. Microsoft Speech SDK 5.1
Preferable for working with Microsoft Kinect
2. CMU Sphinx 0.8
Open-source toolkit for speech recognition
3. Dragon SDKs - Nuance
Discussions & Conclusion
1. Speech input is about 4 times faster than typing
2. Touch interaction on a vertical screen can cause the Gorilla Arm effect
3. Free-hand gestures have previously been used for navigation systems
4. Assumption: integrating these two modalities improves ease of use
5. Need a training corpus of Indian-accented speech for the ASR system
6. Need to define variables
Thank You for Listening
Picture abhi baaki hai mere dost (our journey still continues)……
