Evaluation & User Study
Byungkyu Kang
FourEyes Lab, Dept. of Computer Science, UC Santa Barbara

User Study
• What is a study?
  - Empirically testing a hypothesis
• Why run a study?
  - Determine ‘truth’
  - Evaluate if a statement is true
• User Study on Different Platforms
  - Online / Offline

Purpose of User Study
• Evaluate New Interface
• Find the Ground Truth
• Verify a Hypothesis
• Discover errors and areas of improvement

What to Measure?
• Usability Testing (User Study in HCI)
  - Efficiency: time and steps in a given task
  - Accuracy: mistakes (fatal or recoverable?)
  - Recall: how much does the person remember?
  - Emotional response: feeling about the task (confident? stressed? recommendable?)
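
The four measures above can be computed from per-trial logs. The following is a minimal sketch, not a standard instrument: the `Trial` record, its field names, and the 1-to-5 confidence scale are all assumptions made for illustration.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Trial:
    """One participant attempting one task (hypothetical log record)."""
    seconds: float           # efficiency: time on task
    steps: int               # efficiency: number of actions taken
    fatal_errors: int        # accuracy: mistakes that could not be recovered
    recoverable_errors: int  # accuracy: mistakes the participant fixed
    recalled: int            # recall: items remembered afterwards
    shown: int               # recall: items presented during the task
    confidence: int          # emotional response: self-rating, 1 (low) to 5 (high)


def summarize(trials):
    """Aggregate efficiency, accuracy, recall, and emotional response."""
    return {
        "mean_seconds": mean(t.seconds for t in trials),
        "mean_steps": mean(t.steps for t in trials),
        # Share of trials that hit at least one fatal error.
        "fatal_error_rate": sum(t.fatal_errors > 0 for t in trials) / len(trials),
        "mean_recoverable_errors": mean(t.recoverable_errors for t in trials),
        # Pooled recall: all remembered items over all presented items.
        "recall": sum(t.recalled for t in trials) / sum(t.shown for t in trials),
        "mean_confidence": mean(t.confidence for t in trials),
    }


# Made-up example data, only to show the call.
trials = [
    Trial(42.0, 7, 0, 1, 4, 5, 4),
    Trial(58.0, 9, 1, 0, 3, 5, 2),
]
summary = summarize(trials)
print(summary)
```

Keeping fatal and recoverable errors in separate fields mirrors the distinction the slide draws under Accuracy.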

Crowdsourcing User Study
• Online User Study using Crowdsource Platform
  - General Usability Test
  - Ground Truth Annotation
• Micro-tasks on the Internet
  ‣ Amazon Mechanical Turk, CrowdFlower
  ‣ Large sample, fast and low cost
Kittur et al., Crowdsourcing user studies with Mechanical Turk. (CHI '08)
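
For ground truth annotation, several workers usually label the same item and their answers have to be combined. The slides do not say how; majority voting is one common choice, sketched below as an assumption. The function name, item ids, and labels are hypothetical.

```python
from collections import Counter


def majority_label(labels):
    """Return (label, agreement) for the most common label.

    A tie between the top labels returns (None, agreement) so the item
    can be sent back for more annotations instead of being guessed.
    Assumes `labels` is non-empty.
    """
    counts = Counter(labels).most_common()
    agreement = counts[0][1] / len(labels)
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return None, agreement
    return counts[0][0], agreement


# Hypothetical worker answers per item.
annotations = {
    "item1": ["relevant", "relevant", "not_relevant"],
    "item2": ["relevant", "not_relevant"],
}
ground_truth = {item: majority_label(votes) for item, votes in annotations.items()}
print(ground_truth)
```

The agreement ratio is kept next to each label so that low-agreement items can be reviewed by hand.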

Crowdsourcing User Study
[Figure: study flow diagram. Labels: A Type, B Type; Interface A, Interface B; Screening Task, Task A, Questionnaire, Task B.]
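
One possible reading of the A Type / B Type labels is a between-subjects design in which workers who pass the screening task are split across Interface A and Interface B. The sketch below assumes that reading; the worker ids, the screening results, and the `assign_conditions` helper are hypothetical.

```python
import random


def assign_conditions(participant_ids, conditions=("A", "B"), seed=0):
    """Shuffle participants, then deal them round-robin into conditions.

    Round-robin after a shuffle keeps group sizes within one of each other.
    The fixed seed makes the assignment reproducible.
    """
    rng = random.Random(seed)
    shuffled = list(participant_ids)
    rng.shuffle(shuffled)
    return {pid: conditions[i % len(conditions)] for i, pid in enumerate(shuffled)}


# Hypothetical screening results: worker id -> passed the screening task?
screening = {"w1": True, "w2": True, "w3": False, "w4": True, "w5": True}
eligible = [worker for worker, passed in screening.items() if passed]
assignment = assign_conditions(eligible)
print(assignment)
```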

Examples of User Study
• User Interface Design
  - System, Application, Web Search
• User Experience Evaluation
• Virtual or Augmented Reality
• Ground Truth Annotation
• Visualization
  - Scientific Visualization
  - Information Visualization

What does User Study do?
[Figure: diagram. Labels: When, Why, How; System; User Study; Measure; Qualitative Performance; Quantitative Performance; Evaluate; Overall Performance.]

User Study in InfoVis
“User studies offer a scientifically sound method to measure a visualization’s performance”
“to evaluate the strengths and weaknesses of different visualization techniques”
Kosara, Robert, et al. "Thoughts on user studies: Why, how, and when." IEEE Computer Graphics and Applications 23.4 (2003): 20-25.
IEEE VIS 2013: Panel
Evaluation: How Much Evaluation is Enough?
Panelists: Min Chen, David Ebert, Brian Fisher, Tamara Munzner
Methodologies
Visual Analytics & Cognitive Science
Does every paper need an empirical evaluation?

D-Cog Study
Brian Fisher
• To define an analytics that underpins analysis
• Pair analytics: student "drives", expert "navigates"
  - Student visual analyst & trained domain expert collaborate on analytic task
• Research Snapshot
  - How much of this can we do at VIS?
  - How can we help others do the rest?
  - How can we interact with them?
  - What organization coordinates the whole process?

Evaluation, When and How
Tamara Munzner
• How to pick the right evaluation method
  - A Nested Model for Visualization Design and Validation. Munzner. TVCG 15(6):921-928, 2009 [InfoVis 09]
  - Remaining question: do you need a study if you’re proposing a new idea?

Evaluation: broadly interpreted
[A Nested Model for Visualization Design and Validation. Munzner. TVCG 15(6):921-928, 2009 (Proc. InfoVis 09).]
problem domain: observe target users using existing tools
  data/task abstraction:
    encoding/interaction technique: justify design wrt alternatives
      algorithm: measure system time
      analyze computational complexity
    analyze results qualitatively
    measure human time with lab experiment (“user study”)
  observe target users post-deployment (“field study”)
measure adoption
http://www.cs.ubc.ca/labs/imager/tr/2009/NestedModel/
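
The nested model pairs each design level with its own validation methods. The lookup table below is a sketch of one reading of the slide: which method belongs to which level is inferred from the nesting in the cited paper, and the `VALIDATION` and `levels_using` names are hypothetical.

```python
from typing import Dict, List

# Levels ordered from outermost to innermost, with the validation
# methods listed on the slide (assignment to levels is an assumption).
VALIDATION: Dict[str, List[str]] = {
    "problem domain": [
        "observe target users using existing tools",
        "measure adoption",
    ],
    "data/task abstraction": [
        "observe target users post-deployment (field study)",
    ],
    "encoding/interaction technique": [
        "justify design wrt alternatives",
        "analyze results qualitatively",
        "measure human time with lab experiment (user study)",
    ],
    "algorithm": [
        "measure system time",
        "analyze computational complexity",
    ],
}


def levels_using(keyword: str) -> List[str]:
    """Return the levels whose validation methods mention the keyword."""
    return [
        level
        for level, methods in VALIDATION.items()
        if any(keyword in method for method in methods)
    ]


print(levels_using("user study"))
```

Under this reading, a lab user study validates only one of the four levels, which is the sense in which evaluation is "broadly interpreted".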
Evaluation: broadly interpreted

Threats and validation in the nested model.

Others
• David Ebert
  - It all depends on context (how much eval?)
  - Answer important questions
    ‣ Better than previous contributions?
    ‣ Is the system effective and useful?
• Wrong scientific approach
• Statistically significant performance with a toy study does not work!
• Publishing without user studies is fine and sometimes better!

Discussions
• Hypothesis “Blinded” vs “Opened”
• Bias-free Design?
• Avoid W.E.I.R.D. (Western, Educated, Industrialized, Rich and Democratic) Society!
• How Many Subjects Required?
• Is the Questionnaire Clear or Ambiguous?
• Hawthorne effect (Observer Effect)?
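
For the "How Many Subjects Required?" question, a rough first estimate can come from a power calculation. The sketch below uses the normal approximation for comparing two independent groups, n per group ≈ 2((z_{1-α/2} + z_{power}) / d)², where d is the standardized effect size (Cohen's d). The defaults of α = 0.05 and 80% power are conventional assumptions, not values from the slides, and an exact t-test calculation gives slightly larger numbers.

```python
from math import ceil
from statistics import NormalDist


def subjects_per_group(effect_size, alpha=0.05, power=0.80):
    """Approximate participants per group for a two-sided, two-group comparison.

    effect_size is Cohen's d (mean difference divided by the standard deviation).
    Uses the normal approximation, so treat the result as a lower bound.
    """
    standard_normal = NormalDist()
    z_alpha = standard_normal.inv_cdf(1 - alpha / 2)
    z_power = standard_normal.inv_cdf(power)
    return ceil(2 * ((z_alpha + z_power) / effect_size) ** 2)


n_medium = subjects_per_group(0.5)  # a "medium" effect by Cohen's convention
n_large = subjects_per_group(0.8)   # a "large" effect
print(n_medium, n_large)
```

Smaller expected effects drive the required sample up quickly, which is one reason crowdsourcing platforms are attractive for studies that need many participants.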
