SlideShare a Scribd company logo
1 of 22
Download to read offline
Distributed Multimodal Information Processing Group                                   Technische Universität München




                      MobiMed:
       Comparing Object Identification Techniques
                  on Smartphones

               Andreas Möller1, Stefan Diewald1, Luis Roalter1, Matthias Kranz2

                                   1Technische
                                      Universität München, Germany
             2Luleå University of Technology, Department of Computer Science,

                     Electrical and Space Engineering, Luleå, Sweden

                                                             October 15, 2012
                                                      NordiCHI, Copenhagen, Denmark
Distributed Multimodal Information Processing Group            Technische Universität München



Outline


                   Background and Motivation



                           Scenario and Prototype



                           User Study



                   Discussion and Conclusion



Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    2
Distributed Multimodal Information Processing Group            Technische Universität München



Background and Motivation

•     Idea of bridging the gap between the physical and the virtual world
      for easier interaction and additional functionality
       –  Connect physical objects with virtual representations by tags
          (Want et al., 1999)
       –  Physical mobile interaction (Rukzio, 2006)

•     Investigation and comparison of different interaction techniques done earlier,
      BUT:
       –  meanwhile outdated technologies (e.g. IR)
       –  older comparisons based on (nowadays) limited hardware
          (VGA cameras, small screens, slow mobile CPUs)
       –  new technologies have emerged (e.g. vision-based approaches)
       –  user knowledge and experience has changed

 Suggesting a new comparison of (state-of-the-art) interaction techniques

Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    3
Distributed Multimodal Information Processing Group            Technische Universität München



Outline


                   Background and Motivation



                           Scenario and Prototype



                           User Study



                   Discussion and Conclusion



Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    4
Distributed Multimodal Information Processing Group            Technische Universität München



Scenario for Physical Mobile Interaction

•     MobiMed:
      identifying medication packages
      with the smartphone

•     Target groups: active people
      pursuing a healthy lifestyle, elderly
      people

•     Physical mobile interaction to get
      information on drugs
       –  package insert
       –  side effects
       –  active ingredients
       –  cross-correlations


Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    5
Distributed Multimodal Information Processing Group                        Technische Universität München



Investigated Interaction Types




           Touching                                                                Scanning
           (radio tags, e.g. NFC or RFID)                      (visual tags, e.g. bar codes)




           Pointing                                                              Text Input
           (tag-less vision-based identification)                       (e.g. name, ID, …)

Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                                6
Distributed Multimodal Information Processing Group            Technische Universität München



Excursus: Pointing (Vision-based Recognition)

•     Image processing is used to detect
      visual features of an image
•     A query in feature space returns
      similar images from a reference
      database
•     Good choice of feature type allows
      very reliable results (e.g. MSER)
       –  High distinctiveness (e.g. by
          using text-related features)
       –  Scale invariance (works at
          different distances)
       –  Rotation invariance (works at
          different angles)
•     Enabled by rise in mobile CPU
      performance (multi-core...)

Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    7
Distributed Multimodal Information Processing Group            Technische Universität München



Prototype

•     Implementation as Android application
•     47,000 drugs in query database
•     100,000 reference images




Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    8
Distributed Multimodal Information Processing Group            Technische Universität München



Outline


                   Background and Motivation



                           Scenario and Prototype



                           User Study



                   Discussion and Conclusion



Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    9
Distributed Multimodal Information Processing Group            Technische Universität München



Research Questions

•     RQ1: What advantages and disadvantages of identification techniques,
             as presented in MobiMed, can be determined?
       –  ...in terms of effectiveness? large-scale, online
       –  ...in terms of efficiency? lab

•     RQ2: Which method is preferred by users?
       –  ...a priori? large-scale, online
       –  ...after practical use? lab

•     RQ3: What potential do people see for MobiMed as a whole?
       –  ...a priori? large-scale, online
       –  ...after practical use? lab




Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    10
Distributed Multimodal Information Processing Group                     Technische Universität München



Methodology

•     Online study
       –  Human Intelligence Task at Amazon mTurk
       –  149 participants
                 •  74 females, 75 males
                 •  17-79 years (average: 31, standard deviation: 11)
        –  Questionnaire survey

•     Lab study
       –  16 participants
                 •  6 females, 10 males
                 •  22-69 years (average: 31, standard deviation: 12)
        –  Experimental task + Questionnaire survey
            •  Identification of 10 packages
               with each of four methods
            •  Within-subjects design, permuted order
Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                             11
Distributed Multimodal Information Processing Group                                        Technische Universität München



Results: RQ1 (Individual Method Comparison)

Method
                                               Advantages
                      Disadvantages
Scanning
                                             Quick, precise, high             Visual code + camera
                                                      familiarity
                     required, need to find and
                                                                                       focus on code
Touching
                                             Hassle-free, fool-proof,         NFC augmentation and
                                                      quick
                           NFC-capable phone
                                                                                       required, privacy skepticism
Pointing
                                             Intuitive to use, „most          Computational demand,
                                                      human form“ of interaction,      ambiguous results possible
                                                      works from any angle, works
                                                      also with catalog/website
                                                      images, no product tagging
                                                      required
Text
                                                 Highest familiarity, accurate,   High amount of typing,
                                                      search term flexibility
          misspelling, slow, difficult



Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                                                12
Distributed Multimodal Information Processing Group            Technische Universität München



Results: RQ1 (Efficiency)




Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    13
Distributed Multimodal Information Processing Group                                      Technische Universität München



Results: RQ2 (User Preferences)




                                                               -3 = strongly disagree, +3 =strongly agree



Observations/interpretations:
•  Touching was only #3 in online survey, but rated best in lab study
•  Possible explanation: low familiarity (as soon as people used it, they liked it)


Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                                              14
Distributed Multimodal Information Processing Group                              Technische Universität München



Results: RQ3 (Utility of Tool in Scenario)

•     Information sources on drugs:                            •    Suggestions for additional
       –  Doctor/pharmacist (75%)                                   features
       –  Package insert (69%)                                       –  Price comparison
       –  Books/internet (56%)                                       –  Active ingredient analysis
                                                                     –  Self-diagnose
•     Would you be interested in                                     –  Personalized medication
      MobiMed as alternative source for                                 management
      drug information? 88%

•     Would you use a system such as
      MobiMed? 82%

•     Average amount of money subjects
      would spend: $8.40 (aged >25:
      $14.01)

Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                                      15
Distributed Multimodal Information Processing Group            Technische Universität München



Results: RQ3 (Usability of Prototype)




Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    16
Distributed Multimodal Information Processing Group            Technische Universität München



Outline


                   Background and Motivation



                           Scenario and Prototype



                           User Study



                   Discussion and Conclusion



Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    17
Distributed Multimodal Information Processing Group            Technische Universität München



Discussion and Conclusion

•     Physical Mobile Interaction is popular and efficient
       –  Was preferred over conventional (text) search
       –  Was faster than text search
•     Touching and Scanning evaluated best
       –  Fastest and most popular physical mobile interaction methods
       –  Touching faster and more popular than scanning in lab study
       –  Scanning more popular in online survey (familiarity)
•     Vision-based Search (pointing) as future alternative?
       –  Natural; works for any object (no augmentation needed)
       –  Reliability/speed improvement needed, but almost as fast as scanning
•     Best method depends on intended scenario
•     General demand for medical apps




Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    18
Distributed Multimodal Information Processing Group                   Technische Universität München




                                          Thank you for your attention!
                                                 Questions?




                                                               ?
                                                               ?
                                andreas.moeller@tum.de
                       www.vmi.ei.tum.de/team/andreas-moeller.html

Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                           19
Distributed Multimodal Information Processing Group                           Technische Universität München



References

•     Slide 3:
        –  Rukzio, E. Physical mobile interactions: Mobile devices as pervasive mediators for interactions
           with the real world. PhD thesis, 2006
        –  Want, R., Fishkin, K., Gujar, A., and Harrison, B. Bridging physical and virtual worlds with
           electronic tags. In Proceedings of the SIGCHI conference on Human factors in computing
           systems: the CHI is the limit, ACM (1999), 370–377.
•     Slide 10: https://www.mturk.com/mturk/welcome

•     All other images: Microsoft ClipArt 2012




Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                                   20
Distributed Multimodal Information Processing Group            Technische Universität München



Paper Reference

•     Please find the associated paper at:
      http://dx.doi.org/10.1145/2399016.2399022

•     Please cite this work as follows:
•     Andreas Möller, Stefan Diewald, Luis Roalter, and Matthias Kranz. 2012.
      MobiMed: comparing object identification techniques on smartphones. In
      Proceedings of the 7th Nordic Conference on Human-Computer Interaction:
      Making Sense Through Design (NordiCHI '12). ACM, New York, NY, USA,
      31-40. DOI=10.1145/2399016.2399022 http://doi.acm.org/
      10.1145/2399016.2399022




Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                    21
Distributed Multimodal Information Processing Group                                 Technische Universität München



If you use BibTex, please use the following entry
to cite this work:



 @inproceedings{Moller:2012:MCO:2399016.2399022,
  author = {M"{o}ller, Andreas and Diewald, Stefan and Roalter, Luis and Kranz, Matthias},
  title = {MobiMed: comparing object identification techniques on smartphones},
  booktitle = {Proceedings of the 7th Nordic Conference on Human-Computer Interaction: Making Sense Through Design},
  series = {NordiCHI '12},
  year = {2012},
  isbn = {978-1-4503-1482-4},
  location = {Copenhagen, Denmark},
  pages = {31--40},
  numpages = {10},
  url = {http://doi.acm.org/10.1145/2399016.2399022},
  doi = {10.1145/2399016.2399022},
  acmid = {2399022},
  publisher = {ACM},
  address = {New York, NY, USA},
  keywords = {object identification, physical mobile interaction, pointing, scanning, touching},
 }




Oct 15, 2012     A. Möller, S. Diewald, L. Roalter, M. Kranz                                                         22

More Related Content

Similar to MobiMed: Comparing Object Identification Techniques on Smartphones

Ambient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application AreasAmbient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application AreasFulvio Corno
 
Mobility&Udi 2011
Mobility&Udi 2011Mobility&Udi 2011
Mobility&Udi 2011TingRay Chang
 
Ci2004-10.doc
Ci2004-10.docCi2004-10.doc
Ci2004-10.docbutest
 
The Art and Science of Analyzing Software Data
The Art and Science of Analyzing Software DataThe Art and Science of Analyzing Software Data
The Art and Science of Analyzing Software DataCS, NcState
 
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...The Research Council of Norway, IKTPLUSS
 
UMOBILE: Universal, mobile-centric and opportunistic communications architecture
UMOBILE: Universal, mobile-centric and opportunistic communications architectureUMOBILE: Universal, mobile-centric and opportunistic communications architecture
UMOBILE: Universal, mobile-centric and opportunistic communications architecturePaulo Milheiro Mendes
 
AudrisMockus_MSR22.pdf
AudrisMockus_MSR22.pdfAudrisMockus_MSR22.pdf
AudrisMockus_MSR22.pdfTapajitDey1
 
Methods and tools for human centered ICT: from human values to real-life inno...
Methods and tools for human centered ICT: from human values to real-life inno...Methods and tools for human centered ICT: from human values to real-life inno...
Methods and tools for human centered ICT: from human values to real-life inno...Human Centered ICT
 
Multimodal Tutor - Adaptive feedback from multimodal experience capturing
Multimodal Tutor - Adaptive feedback from multimodal experience capturingMultimodal Tutor - Adaptive feedback from multimodal experience capturing
Multimodal Tutor - Adaptive feedback from multimodal experience capturingDaniele Di Mitri
 
The Internet of Things: What's next?
The Internet of Things: What's next? The Internet of Things: What's next?
The Internet of Things: What's next? PayamBarnaghi
 

Similar to MobiMed: Comparing Object Identification Techniques on Smartphones (20)

Decision-Point Panorama-Based Indoor Navigation
Decision-Point Panorama-Based Indoor NavigationDecision-Point Panorama-Based Indoor Navigation
Decision-Point Panorama-Based Indoor Navigation
 
The Smartphone as Mobile Authorization Proxy
The Smartphone as Mobile Authorization ProxyThe Smartphone as Mobile Authorization Proxy
The Smartphone as Mobile Authorization Proxy
 
A Mobile Indoor Navigation System Interface Adapted to Vision-Based Localization
A Mobile Indoor Navigation System Interface Adapted to Vision-Based LocalizationA Mobile Indoor Navigation System Interface Adapted to Vision-Based Localization
A Mobile Indoor Navigation System Interface Adapted to Vision-Based Localization
 
Tool Support for Prototyping Interfaces
Tool Support for Prototyping InterfacesTool Support for Prototyping Interfaces
Tool Support for Prototyping Interfaces
 
Ambient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application AreasAmbient Intelligence: Definitions and Application Areas
Ambient Intelligence: Definitions and Application Areas
 
Mobility&Udi 2011
Mobility&Udi 2011Mobility&Udi 2011
Mobility&Udi 2011
 
Gamification-supported Exploration of Natural User Interfaces
Gamification-supported Exploration of Natural User InterfacesGamification-supported Exploration of Natural User Interfaces
Gamification-supported Exploration of Natural User Interfaces
 
Ci2004-10.doc
Ci2004-10.docCi2004-10.doc
Ci2004-10.doc
 
Medical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructuresMedical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructures
 
MobiliNet: A Social Network for Optimized Mobility
MobiliNet: A Social Network for Optimized MobilityMobiliNet: A Social Network for Optimized Mobility
MobiliNet: A Social Network for Optimized Mobility
 
Update Behavior in App Markets and Security Implications: A Case Study in Goo...
Update Behavior in App Markets and Security Implications: A Case Study in Goo...Update Behavior in App Markets and Security Implications: A Case Study in Goo...
Update Behavior in App Markets and Security Implications: A Case Study in Goo...
 
The Art and Science of Analyzing Software Data
The Art and Science of Analyzing Software DataThe Art and Science of Analyzing Software Data
The Art and Science of Analyzing Software Data
 
Nessos
NessosNessos
Nessos
 
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...
Research Opportunities - Interactive Visual Representations, Otto J. Anshus, ...
 
Smart homes
Smart homesSmart homes
Smart homes
 
UMOBILE: Universal, mobile-centric and opportunistic communications architecture
UMOBILE: Universal, mobile-centric and opportunistic communications architectureUMOBILE: Universal, mobile-centric and opportunistic communications architecture
UMOBILE: Universal, mobile-centric and opportunistic communications architecture
 
AudrisMockus_MSR22.pdf
AudrisMockus_MSR22.pdfAudrisMockus_MSR22.pdf
AudrisMockus_MSR22.pdf
 
Methods and tools for human centered ICT: from human values to real-life inno...
Methods and tools for human centered ICT: from human values to real-life inno...Methods and tools for human centered ICT: from human values to real-life inno...
Methods and tools for human centered ICT: from human values to real-life inno...
 
Multimodal Tutor - Adaptive feedback from multimodal experience capturing
Multimodal Tutor - Adaptive feedback from multimodal experience capturingMultimodal Tutor - Adaptive feedback from multimodal experience capturing
Multimodal Tutor - Adaptive feedback from multimodal experience capturing
 
The Internet of Things: What's next?
The Internet of Things: What's next? The Internet of Things: What's next?
The Internet of Things: What's next?
 

More from Distributed Multimodal Information Processing Group

More from Distributed Multimodal Information Processing Group (9)

Experimental Evaluation of User Interfaces for Visual Indoor Navigation
Experimental Evaluation of User Interfaces for Visual Indoor NavigationExperimental Evaluation of User Interfaces for Visual Indoor Navigation
Experimental Evaluation of User Interfaces for Visual Indoor Navigation
 
Visual Authentication - A Secure Single Step Authentication for User Authoriz...
Visual Authentication - A Secure Single Step Authentication for User Authoriz...Visual Authentication - A Secure Single Step Authentication for User Authoriz...
Visual Authentication - A Secure Single Step Authentication for User Authoriz...
 
Mit mobilem Lernen zur erweiterten Methodenkompetenz
Mit mobilem Lernen zur erweiterten MethodenkompetenzMit mobilem Lernen zur erweiterten Methodenkompetenz
Mit mobilem Lernen zur erweiterten Methodenkompetenz
 
Investigating Self-Reporting Behavior in Long-Term Studies
Investigating Self-Reporting Behavior in Long-Term StudiesInvestigating Self-Reporting Behavior in Long-Term Studies
Investigating Self-Reporting Behavior in Long-Term Studies
 
GymSkill - A Personal Trainer for Physical Exercises
GymSkill - A Personal Trainer for Physical ExercisesGymSkill - A Personal Trainer for Physical Exercises
GymSkill - A Personal Trainer for Physical Exercises
 
The Healthcare and Motivation Seat - A Survey with the GewoS Chair
The Healthcare and Motivation Seat - A Survey with the GewoS ChairThe Healthcare and Motivation Seat - A Survey with the GewoS Chair
The Healthcare and Motivation Seat - A Survey with the GewoS Chair
 
DriveAssist – A V2X-Based Driver Assistance System for Android
DriveAssist – A V2X-Based Driver Assistance System for Android DriveAssist – A V2X-Based Driver Assistance System for Android
DriveAssist – A V2X-Based Driver Assistance System for Android
 
Distributed Networks within ROS: Challenges and Possibilities
Distributed Networks within ROS: Challenges and PossibilitiesDistributed Networks within ROS: Challenges and Possibilities
Distributed Networks within ROS: Challenges and Possibilities
 
MobiDics: Cooperative Mobile e-Learning for Teachers
MobiDics: Cooperative Mobile e-Learning for TeachersMobiDics: Cooperative Mobile e-Learning for Teachers
MobiDics: Cooperative Mobile e-Learning for Teachers
 

Recently uploaded

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Recently uploaded (20)

Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

MobiMed: Comparing Object Identification Techniques on Smartphones

  • 1. Distributed Multimodal Information Processing Group Technische Universität München MobiMed: Comparing Object Identification Techniques on Smartphones Andreas Möller1, Stefan Diewald1, Luis Roalter1, Matthias Kranz2 1Technische Universität München, Germany 2Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Luleå, Sweden October 15, 2012 NordiCHI, Copenhagen, Denmark
  • 2. Distributed Multimodal Information Processing Group Technische Universität München Outline Background and Motivation Scenario and Prototype User Study Discussion and Conclusion Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 2
  • 3. Distributed Multimodal Information Processing Group Technische Universität München Background and Motivation •  Idea of bridging the gap between the physical and the virtual world for easier interaction and additional functionality –  Connect physical objects with virtual representations by tags (Want et al., 1999) –  Physical mobile interaction (Rukzio, 2006) •  Investigation and comparison of different interaction techniques done earlier, BUT: –  meanwhile outdated technologies (e.g. IR) –  older comparisons based on (nowadays) limited hardware (VGA cameras, small screens, slow mobile CPUs) –  new technologies have emerged (e.g. vision-based approaches) –  user knowledge and experience has changed  Suggesting a new comparison of (state-of-the-art) interaction techniques Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 3
  • 4. Distributed Multimodal Information Processing Group Technische Universität München Outline Background and Motivation Scenario and Prototype User Study Discussion and Conclusion Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 4
  • 5. Distributed Multimodal Information Processing Group Technische Universität München Scenario for Physical Mobile Interaction •  MobiMed: identifying medication packages with the smartphone •  Target groups: active people pursuing a healthy lifestyle, elderly people •  Physical mobile interaction to get information on drugs –  package insert –  side effects –  active ingredients –  cross-correlations Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 5
  • 6. Distributed Multimodal Information Processing Group Technische Universität München Investigated Interaction Types Touching Scanning (radio tags, e.g. NFC or RFID) (visual tags, e.g. bar codes) Pointing Text Input (tag-less vision-based identification) (e.g. name, ID, …) Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 6
  • 7. Distributed Multimodal Information Processing Group Technische Universität München Excursus: Pointing (Vision-based Recognition) •  Image processing is used to detect visual features of an image •  A query in feature space returns similar images from a reference database •  Good choice of feature type allows very reliable results (e.g. MSER) –  High distinctiveness (e.g. by using text-related features) –  Scale invariance (works at different distances) –  Rotation invariance (works at different angles) •  Enabled by rise in mobile CPU performance (multi-core...) Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 7
  • 8. Distributed Multimodal Information Processing Group Technische Universität München Prototype •  Implementation as Android application •  47,000 drugs in query database •  100,000 reference images Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 8
  • 9. Distributed Multimodal Information Processing Group Technische Universität München Outline Background and Motivation Scenario and Prototype User Study Discussion and Conclusion Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 9
  • 10. Distributed Multimodal Information Processing Group Technische Universität München Research Questions •  RQ1: What advantages and disadvantages of identification techniques, as presented in MobiMed, can be determined? –  ...in terms of effectiveness? large-scale, online –  ...in terms of efficiency? lab •  RQ2: Which method is preferred by users? –  ...a priori? large-scale, online –  ...after practical use? lab •  RQ3: What potential do people see for MobiMed as a whole? –  ...a priori? large-scale, online –  ...after practical use? lab Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 10
  • 11. Distributed Multimodal Information Processing Group Technische Universität München Methodology •  Online study –  Human Intelligence Task at Amazon mTurk –  149 participants •  74 females, 75 males •  17-79 years (average: 31, standard deviation: 11) –  Questionnaire survey •  Lab study –  16 participants •  6 females, 10 males •  22-69 years (average: 31, standard deviation: 12) –  Experimental task + Questionnaire survey •  Identification of 10 packages with each of four methods •  Within-subjects design, permuted order Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 11
  • 12. Distributed Multimodal Information Processing Group Technische Universität München Results: RQ1 (Individual Method Comparison) Method Advantages Disadvantages Scanning Quick, precise, high Visual code + camera familiarity required, need to find and focus on code Touching Hassle-free, fool-proof, NFC augmentation and quick NFC-capable phone required, privacy skepticism Pointing Intuitive to use, „most Computational demand, human form“ of interaction, ambiguous results possible works from any angle, works also with catalog/website images, no product tagging required Text Highest familiarity, accurate, High amount of typing, search term flexibility misspelling, slow, difficult Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 12
  • 13. Distributed Multimodal Information Processing Group Technische Universität München Results: RQ1 (Efficiency) Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 13
  • 14. Distributed Multimodal Information Processing Group Technische Universität München Results: RQ2 (User Preferences) -3 = strongly disagree, +3 =strongly agree Observations/interpretations: •  Touching was only #3 in online survey, but rated best in lab study •  Possible explanation: low familiarity (as soon as people used it, they liked it) Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 14
  • 15. Distributed Multimodal Information Processing Group Technische Universität München Results: RQ3 (Utility of Tool in Scenario) •  Information sources on drugs: •  Suggestions for additional –  Doctor/pharmacist (75%) features –  Package insert (69%) –  Price comparison –  Books/internet (56%) –  Active ingredient analysis –  Self-diagnose •  Would you be interested in –  Personalized medication MobiMed as alternative source for management drug information? 88% •  Would you use a system such as MobiMed? 82% •  Average amount of money subjects would spend: $8.40 (aged >25: $14.01) Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 15
  • 16. Distributed Multimodal Information Processing Group Technische Universität München Results: RQ3 (Usability of Prototype) Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 16
  • 17. Distributed Multimodal Information Processing Group Technische Universität München Outline Background and Motivation Scenario and Prototype User Study Discussion and Conclusion Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 17
  • 18. Distributed Multimodal Information Processing Group Technische Universität München Discussion and Conclusion •  Physical Mobile Interaction is popular and efficient –  Was preferred over conventional (text) search –  Was faster than text search •  Touching and Scanning evaluated best –  Fastest and most popular physical mobile interaction methods –  Touching faster and more popular than scanning in lab study –  Scanning more popular in online survey (familiarity) •  Vision-based Search (pointing) as future alternative? –  Natural; works for any object (no augmentation needed) –  Reliability/speed improvement needed, but almost as fast as scanning •  Best method depends on intended scenario •  General demand for medical apps Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 18
  • 19. Distributed Multimodal Information Processing Group Technische Universität München Thank you for your attention! Questions? ? ? andreas.moeller@tum.de www.vmi.ei.tum.de/team/andreas-moeller.html Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 19
  • 20. Distributed Multimodal Information Processing Group Technische Universität München References •  Slide 3: –  Rukzio, E. Physical mobile interactions: Mobile devices as pervasive mediators for interactions with the real world. PhD thesis, 2006 –  Want, R., Fishkin, K., Gujar, A., and Harrison, B. Bridging physical and virtual worlds with electronic tags. In Proceedings of the SIGCHI conference on Human factors in computing systems: the CHI is the limit, ACM (1999), 370–377. •  Slide 10: https://www.mturk.com/mturk/welcome •  All other images: Microsoft ClipArt 2012 Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 20
  • 21. Distributed Multimodal Information Processing Group Technische Universität München Paper Reference •  Please find the associated paper at: http://dx.doi.org/10.1145/2399016.2399022 •  Please cite this work as follows: •  Andreas Möller, Stefan Diewald, Luis Roalter, and Matthias Kranz. 2012. MobiMed: comparing object identification techniques on smartphones. In Proceedings of the 7th Nordic Conference on Human-Computer Interaction: Making Sense Through Design (NordiCHI '12). ACM, New York, NY, USA, 31-40. DOI=10.1145/2399016.2399022 http://doi.acm.org/ 10.1145/2399016.2399022 Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 21
  • 22. Distributed Multimodal Information Processing Group Technische Universität München If you use BibTex, please use the following entry to cite this work: @inproceedings{Moller:2012:MCO:2399016.2399022, author = {M"{o}ller, Andreas and Diewald, Stefan and Roalter, Luis and Kranz, Matthias}, title = {MobiMed: comparing object identification techniques on smartphones}, booktitle = {Proceedings of the 7th Nordic Conference on Human-Computer Interaction: Making Sense Through Design}, series = {NordiCHI '12}, year = {2012}, isbn = {978-1-4503-1482-4}, location = {Copenhagen, Denmark}, pages = {31--40}, numpages = {10}, url = {http://doi.acm.org/10.1145/2399016.2399022}, doi = {10.1145/2399016.2399022}, acmid = {2399022}, publisher = {ACM}, address = {New York, NY, USA}, keywords = {object identification, physical mobile interaction, pointing, scanning, touching}, } Oct 15, 2012 A. Möller, S. Diewald, L. Roalter, M. Kranz 22