SlideShare a Scribd company logo
1 of 20
Download to read offline
An Empirical Study on End-users Productivity Using
             Model-based Spreadsheets

Laura Beckwith           J´come Cunha
                          a                  Jo˜o P. Fernandes
                                               a                       Jo˜o Saraiva
                                                                         a

                 HCIResearcher, Denmark, beckwith@hciResearcher.com
           Universidade do Minho, Portugal, {jacome,jpaulo,jas}@di.uminho.pt
                    Universidade do Porto, Portugal, jpaulo@fe.up.pt


                                  EuSpRIG 2011

                                   July 14 – 15




 J´come Cunha (UMinho)
  a                                                                     EuSpRIG 2011   1 / 19
Agenda

1   Introduction

2   Model-based Spreadsheets
     Inferring Models
     Refactored Spreadsheet
     Visual Spreadsheet

3   Study Design

4   Results
      Effectiveness
      Efficiency

5   Conclusions


    J´come Cunha (UMinho)
     a                         EuSpRIG 2011   2 / 19
Introduction



    Much research has been done to prevent spreadsheet end-user errors
    Model-Driven Engineering (MDE) is a promising approach
    We have created a set of different model-based spreadsheets




  J´come Cunha (UMinho)
   a                                                     EuSpRIG 2011   3 / 19
Introduction



    Much research has been done to prevent spreadsheet end-user errors
    Model-Driven Engineering (MDE) is a promising approach
    We have created a set of different model-based spreadsheets




                          Need to evaluate their impact




  J´come Cunha (UMinho)
   a                                                      EuSpRIG 2011   3 / 19
Original Spreadsheet - original
A project management system




Tabular spreadsheet (database like spreadsheet)
   J´come Cunha (UMinho)
    a                                             EuSpRIG 2011   4 / 19
Functional Dependencies


     A     B means that the values in column A uniquely determine values
     in column B
     If two values two rows in column A are equal, then the corresponding
     values in B are also equal


For example, suppose A = {A1 , A2 } and B = {B1 , B2 }:

               A1     A2   B1      B2        A1    A2    B1    B2
               a1     a2   b1      b2        a1    a2    b1    b2
               a1     a3   b1      b1        a1    a3    b1    b1
               a1     a2   b1      b2        a1    a2    b2    b2
            (a) Table inducing A    B.   (b) Table not inducing A      B.



   J´come Cunha (UMinho)
    a                                                               EuSpRIG 2011   5 / 19
Relational Model


    From the spreadsheet data (also from formulas) we can infer
    functional dependencies
    From these dependencies, we can compute a relational database
    schema

                                  Reasoning
                                  about FDs


        Original SS
                                   A⇀B
                          Infer               Infer RDB    RDB
                                   CD⇀E
                          FDs                  Schema     Schema
                                   ...




  J´come Cunha (UMinho)
   a                                                       EuSpRIG 2011   6 / 19
Refactored Spreadsheet - refactored
    We use the relational model to generate a new refactored spreadsheet
    (respecting the relational model)
    Each entity separated in a different table
    Combo boxes used to list allowed values (from other tables)




  J´come Cunha (UMinho)
   a                                                      EuSpRIG 2011   7 / 19
Visual Spreadsheet - visual
    From functional dependencies we can generate edit assistance
    A    B: A columns become green combo boxes and B red
    Similar to original
    Combo boxes help user introducing data




  J´come Cunha (UMinho)
   a                                                      EuSpRIG 2011   8 / 19
The Entire Picture

                        Reasoning                                               Visual SS
                        about FDs


Original SS                                     Generate
                                                Visual SS
                            A⇀B
               Infer
                            CD⇀E
               FDs
                            ...     Infer RDB
                                     Schema
                                                  RDB
                                                 Schema                       Refactored SS
                                                              Generate
                                                            Refactored SS




Goal: compare efficiency and effectiveness of original, refactored and visual



    J´come Cunha (UMinho)
     a                                                                  EuSpRIG 2011   9 / 19
Research Questions




RQ1 Do end users introduce fewer errors when they use one of the
model-based spreadsheets versus the original unmodified one?

RQ2 Are end users more efficient using the model-based spreadsheets?

RQ3 Do particular models lead to fewer errors in particular tasks?




   J´come Cunha (UMinho)
    a                                                      EuSpRIG 2011   10 / 19
Study Design


    Background questionnaire, followed by a list of tasks and a post
    questionnaire
    No initial tutorial was given to the participants
    38 participants: 25 females and 11 males, all students
    Tasks for inserting and editing data and computing statistics
    3 different spreadsheet applications (projects, dishes and
    properties)
    3 different models (original, refactored, visual)
    Each participant received 3 spreadsheets, each of a different
    application/model




  J´come Cunha (UMinho)
   a                                                         EuSpRIG 2011   11 / 19
Global Effectiveness

                                       original   refactored   visual
                          dishes        86%         76%        78%
                          projects      73%         68%        78%
                          properties    75%         64%        62%




    Globally, models do not seem very competitive
    Further analyses are necessary


  J´come Cunha (UMinho)
   a                                                                    EuSpRIG 2011   12 / 19
Effectiveness: Inserting Data

                                       original   refactored   visual
                          dishes        91%         90%        81%
                          projects      76%         60%        75%
                          properties    86%         67%        68%




    Models always lose
    But very similar to original


  J´come Cunha (UMinho)
   a                                                                    EuSpRIG 2011   13 / 19
Effectiveness: Editing Data

                                       original   refactored   visual
                          dishes         91%        82%        82%
                          projects       54%        62%        50%
                          properties     65%        98%        48%




    refactored in general was better
    Because there is less data to edit


  J´come Cunha (UMinho)
   a                                                                    EuSpRIG 2011   14 / 19
Effectiveness: Computing Statistics

                                       original   refactored   visual
                          dishes        52%         37%        57%
                          projects      19%         76%        13%
                          properties    44%         57%        51%




    Again, refactored was better
    And again, less data do analyze


  J´come Cunha (UMinho)
   a                                                                    EuSpRIG 2011   15 / 19
Efficiency
                                       original   refactored   visual
                          dishes         35          32         28
                          projects       39          40         41
                          properties     37          36         40




    Models are competitive
    In further analyzes we discovered that users took more time understanding model,
    specially refactored

  J´come Cunha (UMinho)
   a                                                                    EuSpRIG 2011   16 / 19
Answers to Research Questions

RQ1 Do end users introduce fewer errors when they use one of the
model-based spreadsheets versus the original unmodified one?
     We believe there is potential for model-based spreadsheets
     But a deep knowledge seems to be necessary to be effective

RQ2 Are end users more efficient using model-based spreadsheets?
     Model-based spreadsheet can help in some cases
     We believe better knowledge about models could improve this result

RQ3 Do particular models lead to fewer errors in particular tasks?
     For inserting data, this was not the case
     The models helped editing data and in computing statistics


   J´come Cunha (UMinho)
    a                                                      EuSpRIG 2011   17 / 19
Conclusions




    We did not give a tutorial to participants
    We believe better results could be achieve with initial training
    Nevertheless, our model-based spreadsheet are competitive
    Better for editing and statistics
    We plan to run a new study similar to this, but with an initial tutorial




  J´come Cunha (UMinho)
   a                                                        EuSpRIG 2011   18 / 19
Thank you for your attention!




J´come Cunha (UMinho)
 a                         EuSpRIG 2011   19 / 19

More Related Content

Similar to Talk at EUSPRIG '11

Similar to Talk at EUSPRIG '11 (10)

Model-driven Spreadsheets
Model-driven SpreadsheetsModel-driven Spreadsheets
Model-driven Spreadsheets
 
Introduction to computer graphics
Introduction to computer graphicsIntroduction to computer graphics
Introduction to computer graphics
 
Model-Driven Spreadsheet Development
Model-Driven Spreadsheet DevelopmentModel-Driven Spreadsheet Development
Model-Driven Spreadsheet Development
 
GEOFRAME: a system for doing hydrology by computer
GEOFRAME: a system for doing hydrology by computerGEOFRAME: a system for doing hydrology by computer
GEOFRAME: a system for doing hydrology by computer
 
Sticky Notes - a tool for supporting collaborative activities in a 3D virtual...
Sticky Notes - a tool for supporting collaborative activities in a 3D virtual...Sticky Notes - a tool for supporting collaborative activities in a 3D virtual...
Sticky Notes - a tool for supporting collaborative activities in a 3D virtual...
 
COMPUTER AIDED DESIGN
COMPUTER AIDED DESIGNCOMPUTER AIDED DESIGN
COMPUTER AIDED DESIGN
 
CADM-Unit 1/INTRODUCTION
CADM-Unit 1/INTRODUCTIONCADM-Unit 1/INTRODUCTION
CADM-Unit 1/INTRODUCTION
 
Stream processors texture generation model for 3d virtual worlds learning too...
Stream processors texture generation model for 3d virtual worlds learning too...Stream processors texture generation model for 3d virtual worlds learning too...
Stream processors texture generation model for 3d virtual worlds learning too...
 
Crude-Oil Scheduling Technology: moving from simulation to optimization
Crude-Oil Scheduling Technology: moving from simulation to optimizationCrude-Oil Scheduling Technology: moving from simulation to optimization
Crude-Oil Scheduling Technology: moving from simulation to optimization
 
Introduction of Computer aided Design
Introduction of Computer aided DesignIntroduction of Computer aided Design
Introduction of Computer aided Design
 

More from Jácome Cunha

Type-Safe Evolution of 
Web Services
Type-Safe Evolution of 
Web ServicesType-Safe Evolution of 
Web Services
Type-Safe Evolution of 
Web Services
Jácome Cunha
 

More from Jácome Cunha (14)

Spreadsheet Engineering
Spreadsheet EngineeringSpreadsheet Engineering
Spreadsheet Engineering
 
Energy Efficiency Across 
Programming Languages
Energy Efficiency Across 
Programming LanguagesEnergy Efficiency Across 
Programming Languages
Energy Efficiency Across 
Programming Languages
 
LMCC - 30 Anos
LMCC - 30 AnosLMCC - 30 Anos
LMCC - 30 Anos
 
Explaining Spreadsheets with Spreadsheets
Explaining Spreadsheets with SpreadsheetsExplaining Spreadsheets with Spreadsheets
Explaining Spreadsheets with Spreadsheets
 
Automatically Inferring ClassSheet Models from Spreadsheets
Automatically Inferring ClassSheet Models from SpreadsheetsAutomatically Inferring ClassSheet Models from Spreadsheets
Automatically Inferring ClassSheet Models from Spreadsheets
 
On Understanding Data Scientists
On Understanding  Data ScientistsOn Understanding  Data Scientists
On Understanding Data Scientists
 
Systematic Spreadsheet Construction Processes @ VL/HCC 2017
Systematic Spreadsheet Construction Processes @ VL/HCC 2017Systematic Spreadsheet Construction Processes @ VL/HCC 2017
Systematic Spreadsheet Construction Processes @ VL/HCC 2017
 
jStanley: Placing a Green Thumb on Java Collections
jStanley: Placing a Green Thumb on  Java CollectionsjStanley: Placing a Green Thumb on  Java Collections
jStanley: Placing a Green Thumb on Java Collections
 
Type-Safe Evolution of 
Web Services
Type-Safe Evolution of 
Web ServicesType-Safe Evolution of 
Web Services
Type-Safe Evolution of 
Web Services
 
MDSheet – Model-Driven Spreadsheets
MDSheet – Model-Driven SpreadsheetsMDSheet – Model-Driven Spreadsheets
MDSheet – Model-Driven Spreadsheets
 
Summer School DSL 2013 - SpreadSheet Engineering
Summer School DSL 2013 - SpreadSheet EngineeringSummer School DSL 2013 - SpreadSheet Engineering
Summer School DSL 2013 - SpreadSheet Engineering
 
Talk
TalkTalk
Talk
 
Talk at VL/HCC '11
Talk at VL/HCC '11Talk at VL/HCC '11
Talk at VL/HCC '11
 
Talk at VL/HCC '11
Talk at VL/HCC '11Talk at VL/HCC '11
Talk at VL/HCC '11
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 

Talk at EUSPRIG '11

  • 1. An Empirical Study on End-users Productivity Using Model-based Spreadsheets Laura Beckwith J´come Cunha a Jo˜o P. Fernandes a Jo˜o Saraiva a HCIResearcher, Denmark, beckwith@hciResearcher.com Universidade do Minho, Portugal, {jacome,jpaulo,jas}@di.uminho.pt Universidade do Porto, Portugal, jpaulo@fe.up.pt EuSpRIG 2011 July 14 – 15 J´come Cunha (UMinho) a EuSpRIG 2011 1 / 19
  • 2. Agenda 1 Introduction 2 Model-based Spreadsheets Inferring Models Refactored Spreadsheet Visual Spreadsheet 3 Study Design 4 Results Effectiveness Efficiency 5 Conclusions J´come Cunha (UMinho) a EuSpRIG 2011 2 / 19
  • 3. Introduction Much research has been done to prevent spreadsheet end-user errors Model-Driven Engineering (MDE) is a promising approach We have created a set of different model-based spreadsheets J´come Cunha (UMinho) a EuSpRIG 2011 3 / 19
  • 4. Introduction Much research has been done to prevent spreadsheet end-user errors Model-Driven Engineering (MDE) is a promising approach We have created a set of different model-based spreadsheets Need to evaluate their impact J´come Cunha (UMinho) a EuSpRIG 2011 3 / 19
  • 5. Original Spreadsheet - original A project management system Tabular spreadsheet (database like spreadsheet) J´come Cunha (UMinho) a EuSpRIG 2011 4 / 19
  • 6. Functional Dependencies A B means that the values in column A uniquely determine values in column B If two values two rows in column A are equal, then the corresponding values in B are also equal For example, suppose A = {A1 , A2 } and B = {B1 , B2 }: A1 A2 B1 B2 A1 A2 B1 B2 a1 a2 b1 b2 a1 a2 b1 b2 a1 a3 b1 b1 a1 a3 b1 b1 a1 a2 b1 b2 a1 a2 b2 b2 (a) Table inducing A B. (b) Table not inducing A B. J´come Cunha (UMinho) a EuSpRIG 2011 5 / 19
  • 7. Relational Model From the spreadsheet data (also from formulas) we can infer functional dependencies From these dependencies, we can compute a relational database schema Reasoning about FDs Original SS A⇀B Infer Infer RDB RDB CD⇀E FDs Schema Schema ... J´come Cunha (UMinho) a EuSpRIG 2011 6 / 19
  • 8. Refactored Spreadsheet - refactored We use the relational model to generate a new refactored spreadsheet (respecting the relational model) Each entity separated in a different table Combo boxes used to list allowed values (from other tables) J´come Cunha (UMinho) a EuSpRIG 2011 7 / 19
  • 9. Visual Spreadsheet - visual From functional dependencies we can generate edit assistance A B: A columns become green combo boxes and B red Similar to original Combo boxes help user introducing data J´come Cunha (UMinho) a EuSpRIG 2011 8 / 19
  • 10. The Entire Picture Reasoning Visual SS about FDs Original SS Generate Visual SS A⇀B Infer CD⇀E FDs ... Infer RDB Schema RDB Schema Refactored SS Generate Refactored SS Goal: compare efficiency and effectiveness of original, refactored and visual J´come Cunha (UMinho) a EuSpRIG 2011 9 / 19
  • 11. Research Questions RQ1 Do end users introduce fewer errors when they use one of the model-based spreadsheets versus the original unmodified one? RQ2 Are end users more efficient using the model-based spreadsheets? RQ3 Do particular models lead to fewer errors in particular tasks? J´come Cunha (UMinho) a EuSpRIG 2011 10 / 19
  • 12. Study Design Background questionnaire, followed by a list of tasks and a post questionnaire No initial tutorial was given to the participants 38 participants: 25 females and 11 males, all students Tasks for inserting and editing data and computing statistics 3 different spreadsheet applications (projects, dishes and properties) 3 different models (original, refactored, visual) Each participant received 3 spreadsheets, each of a different application/model J´come Cunha (UMinho) a EuSpRIG 2011 11 / 19
  • 13. Global Effectiveness original refactored visual dishes 86% 76% 78% projects 73% 68% 78% properties 75% 64% 62% Globally, models do not seem very competitive Further analyses are necessary J´come Cunha (UMinho) a EuSpRIG 2011 12 / 19
  • 14. Effectiveness: Inserting Data original refactored visual dishes 91% 90% 81% projects 76% 60% 75% properties 86% 67% 68% Models always lose But very similar to original J´come Cunha (UMinho) a EuSpRIG 2011 13 / 19
  • 15. Effectiveness: Editing Data original refactored visual dishes 91% 82% 82% projects 54% 62% 50% properties 65% 98% 48% refactored in general was better Because there is less data to edit J´come Cunha (UMinho) a EuSpRIG 2011 14 / 19
  • 16. Effectiveness: Computing Statistics original refactored visual dishes 52% 37% 57% projects 19% 76% 13% properties 44% 57% 51% Again, refactored was better And again, less data do analyze J´come Cunha (UMinho) a EuSpRIG 2011 15 / 19
  • 17. Efficiency original refactored visual dishes 35 32 28 projects 39 40 41 properties 37 36 40 Models are competitive In further analyzes we discovered that users took more time understanding model, specially refactored J´come Cunha (UMinho) a EuSpRIG 2011 16 / 19
  • 18. Answers to Research Questions RQ1 Do end users introduce fewer errors when they use one of the model-based spreadsheets versus the original unmodified one? We believe there is potential for model-based spreadsheets But a deep knowledge seems to be necessary to be effective RQ2 Are end users more efficient using model-based spreadsheets? Model-based spreadsheet can help in some cases We believe better knowledge about models could improve this result RQ3 Do particular models lead to fewer errors in particular tasks? For inserting data, this was not the case The models helped editing data and in computing statistics J´come Cunha (UMinho) a EuSpRIG 2011 17 / 19
  • 19. Conclusions We did not give a tutorial to participants We believe better results could be achieve with initial training Nevertheless, our model-based spreadsheet are competitive Better for editing and statistics We plan to run a new study similar to this, but with an initial tutorial J´come Cunha (UMinho) a EuSpRIG 2011 18 / 19
  • 20. Thank you for your attention! J´come Cunha (UMinho) a EuSpRIG 2011 19 / 19