Gesture-controlled applications are typically tied to specific gestures, as well as to specific recognition methods and specific gesture-detection devices. We propose a concern-separation architecture that mediates three concerns: gesture acquisition, gesture recognition, and gestural control. It enables application developers to respond to gesture-independent commands, recognized by plug-in gesture-recognition modules that process gesture data through both device-dependent and device-independent data formats and callbacks. Its feasibility is demonstrated with a sample implementation.
Separating Gesture Detection and Application Control Concerns with a Multimodal Architecture
1. Separating Gesture Detection and Application Control Concerns with a Multimodal Architecture
Luís Fernandes et al.
INESC TEC & UTAD - PORTUGAL
2. Problem
“Most gestures are neither natural nor easy to learn or
remember. (…) Even the simple headshake is puzzling when
cultures intermix”
D.A. Norman (2010)
8. Final thoughts
Results are very preliminary
Open issues and topics for reflection:
Access to low-level data
Diverse set of data formats
9. Bringing User Experience empirical data to gesture-control
and somatic interaction in virtual reality videogames:
an Exploratory Study with a multimodal interaction prototype
google it in December!
Subsequent paper – accepted for publication this November
Good morning everyone,
My name is Luís Fernandes and I’m here to present a Multimodal Architecture to separate gesture detection and application control concerns.
The proposal herein was developed in the context of a corporate-funded innovation project called InMERSE.
In so-called ‘natural’ user interfaces, gestural interaction with the user’s environment is a central element.
They purport to be natural by leveraging users’ pre-existing skills. However, that is not quite the case, since the meaning associated with gestures varies across cultures, social groups, and sometimes even from person to person.
So current gestural command methods for applications will likely become obsolete.
To prevent this from happening, our contribution is an architecture proposal that separates three concerns: gesture acquisition, gesture recognition, and gestural control.
Today, more and more low-cost computational devices and sensors are available, and gestural interaction devices are one such case.
But they differ considerably in detection method and data structure: some use image processing, while others, such as the Myo armband, use electromyography.
To produce a concrete implementation of the proposed architecture, we employed two distinct gesture-acquisition devices: Leap Motion and the recent Kinect 2. Leap Motion samples the space above it at regular intervals to detect the positions of forearms, hands, and fingers.
Kinect 2, although it operates in a similar way, targets the full body. This means the two devices produce different data structures, and each device’s API or SDK ends up being different.
The core of this architecture is therefore the separation of three concerns: gesture acquisition, gesture recognition, and application commands.
In this picture we can see that the different gesture-acquisition devices (Leap Motion and Kinect 2) are interfaced by device-specific Adapter modules;
multiple Decoder modules are plugged in to provide gesture-recognition services tuned to different requirements;
and the core Framework module provides the intermediation and abstraction services, enabling Application modules to react to abstract Commands rather than to the specific gestures that elicited them.
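As a rough illustration of this plug-in structure, the sketch below uses hypothetical interface and type names; they are assumptions for this example, not the actual InMERSE API.

```java
// Illustrative sketch of the plug-in structure (names are assumptions).

// Common abstraction over device-dependent frame formats.
abstract class FrameData {
    final long timestampMillis;
    FrameData(long timestampMillis) { this.timestampMillis = timestampMillis; }
}

// Framework-side sink that Adapters feed with raw frames.
interface FrameSink {
    void onFrame(FrameData frame);
}

// Device-specific Adapter: wraps one acquisition device (e.g. Leap Motion,
// Kinect 2) and pushes its frames into the framework.
interface DeviceAdapter {
    void start(FrameSink sink);  // begin streaming frames to the framework
    void stop();
}

// Plug-in Decoder: inspects recent frames and reports recognized gestures.
interface GestureDecoder {
    // Returns the name of a recognized gesture, or null if none was detected.
    String decode(java.util.List<FrameData> recentFrames);
}
```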
This architecture provides applications with access to three different kinds of data (a sketch of the corresponding listener follows this list):
Commands – gesture-independent information;
Gestures – transparent access to the Decoders’ output;
Basic Data – transparent access to the framework data structures containing the gestural data.
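A minimal sketch of how these three access levels could be exposed to an application, reusing the hypothetical FrameData type from the sketch above; the listener and method names are also assumptions.

```java
// Hypothetical application-facing listener exposing the three access levels.
interface ApplicationListener {
    // Commands: gesture-independent information (highest abstraction level).
    void onCommand(String command);

    // Gestures: transparent access to a Decoder's output.
    void onGesture(String gestureName, String decoderId);

    // Basic Data: transparent access to the framework's gestural data frames.
    void onFrame(FrameData frame);
}
```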
We have developed a prototype implementation of the InMERSE Architecture to ascertain its feasibility.
We store the basic data coming from each device in a Frame Buffer.
Decoders will then try to detect gestures based on that data.
And finally, based on a set of configurations, gestures are mapped to commands, which applications can listen to.
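Putting those three steps together, a simplified sketch of the pipeline might look as follows; it builds on the hypothetical types from the earlier sketches, and the buffer size and configuration entries are assumptions.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Simplified sketch: raw frames enter a bounded buffer, Decoders inspect the
// buffer, and recognized gestures are mapped to commands via configuration.
class GesturePipeline implements FrameSink {
    private static final int BUFFER_SIZE = 120;  // frames kept for decoding
    private final Deque<FrameData> frameBuffer = new ArrayDeque<>();
    private final List<GestureDecoder> decoders = new ArrayList<>();
    private final List<ApplicationListener> listeners = new ArrayList<>();
    private final Map<String, String> gestureToCommand = new HashMap<>();

    GesturePipeline() {
        // Example configuration entries (hypothetical gesture and command names).
        gestureToCommand.put("swipe_left", "PAN_LEFT");
        gestureToCommand.put("swipe_right", "PAN_RIGHT");
        gestureToCommand.put("pinch_in", "ZOOM_OUT");
        gestureToCommand.put("pinch_out", "ZOOM_IN");
    }

    @Override
    public void onFrame(FrameData frame) {
        if (frameBuffer.size() == BUFFER_SIZE) {
            frameBuffer.removeFirst();           // drop the oldest frame
        }
        frameBuffer.addLast(frame);

        List<FrameData> snapshot = new ArrayList<>(frameBuffer);
        for (GestureDecoder decoder : decoders) {
            String gesture = decoder.decode(snapshot);
            if (gesture == null) {
                continue;                        // nothing recognized yet
            }
            String command = gestureToCommand.get(gesture);
            for (ApplicationListener listener : listeners) {
                listener.onGesture(gesture, decoder.getClass().getSimpleName());
                if (command != null) {
                    listener.onCommand(command); // configured mapping
                }
            }
        }
    }
}
```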
We also included an extra scenario: the need for an application to perform continuous hand-tracking operations, such as dragging virtual items or pointing at virtual elements. Thus, the framework can operate in two different modes: Acquisition mode or Detection mode.
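A tiny sketch of how these two modes might be represented; the mode names follow the talk, but the API itself is an assumption.

```java
// Two operating modes of the framework (the API shown here is an assumption).
enum FrameworkMode {
    ACQUISITION,  // stream continuous tracking data, e.g. for dragging or pointing
    DETECTION     // recognize discrete gestures and emit abstract commands
}

class FrameworkCore {
    private FrameworkMode mode = FrameworkMode.DETECTION;

    void setMode(FrameworkMode newMode) { this.mode = newMode; }
    FrameworkMode getMode() { return mode; }
}
```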
To test the operation of the implemented prototype, we developed a digital signage application, which can be used seamlessly with either the Leap Motion or the Kinect 2 gesture-acquisition device. The user can pan and zoom in and out using hand gestures.
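To illustrate, the signage application could react to abstract commands roughly as below, independently of which device produced them; this reuses the hypothetical ApplicationListener interface from the earlier sketch, and the command names are assumptions.

```java
// Sketch of a device-independent signage application reacting to commands.
class SignageApp implements ApplicationListener {
    @Override
    public void onCommand(String command) {
        switch (command) {
            case "PAN_LEFT":  panContent(-50);   break;
            case "PAN_RIGHT": panContent(50);    break;
            case "ZOOM_IN":   zoomContent(1.25); break;
            case "ZOOM_OUT":  zoomContent(0.8);  break;
            default:          /* ignore unmapped commands */ break;
        }
    }

    // Gesture and frame callbacks are unused: the app works at command level.
    @Override public void onGesture(String gestureName, String decoderId) { }
    @Override public void onFrame(FrameData frame) { }

    private void panContent(int pixels)     { /* move the displayed content */ }
    private void zoomContent(double factor) { /* scale the displayed content */ }
}
```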
The usability and efficiency of this prototype architecture still need to be tested in depth, as the results are very preliminary.
There are still open issues and topics for reflection, requiring further work.
Access to low-level data seems to defeat the purpose of separation of concerns – and it does.
However, it means an application can use the command-based approach whenever possible, and only be tied to lower-level data when that is unavoidable.
While at the moment we only consider skeleton data with joint positions, using traditional polymorphism to accommodate the differences between devices, in the future we should support a more diverse set of data formats, using techniques such as ontology-based representation.
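A small sketch of how traditional polymorphism can cover the current skeleton-based formats, extending the hypothetical FrameData type from earlier; the class and field names are illustrative only.

```java
import java.util.Map;

// Device-independent view of skeleton data: joint name -> 3D position.
abstract class SkeletonFrame extends FrameData {
    SkeletonFrame(long timestampMillis) { super(timestampMillis); }
    abstract Map<String, double[]> jointPositions();
}

// Leap Motion: forearms, hands and fingers only.
class LeapMotionFrame extends SkeletonFrame {
    private final Map<String, double[]> handJoints;
    LeapMotionFrame(long timestampMillis, Map<String, double[]> handJoints) {
        super(timestampMillis);
        this.handJoints = handJoints;
    }
    @Override Map<String, double[]> jointPositions() { return handJoints; }
}

// Kinect 2: full-body skeleton.
class KinectFrame extends SkeletonFrame {
    private final Map<String, double[]> bodyJoints;
    KinectFrame(long timestampMillis, Map<String, double[]> bodyJoints) {
        super(timestampMillis);
        this.bodyJoints = bodyJoints;
    }
    @Override Map<String, double[]> jointPositions() { return bodyJoints; }
}
```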