SlideShare a Scribd company logo
1 of 29
Reproducible
Research Workshop
SICSA PhD Conference
June 27, 2016
Ian Gent
University of St Andrews
http://ian.gent
http://www.slideshare.net/turingfan/sicsaphd2016
Image: © Columbia Pictures
Joint Work with Graham Mcdonald
• http://www.gla.ac.uk/schools/computing/researchstudents/grahammcdonald/
• Graham has put far more work into this session than me!
Two things from the title slide I want to highlight
Yes this really is my
URL
SICSA PhD Conference
June 27, 2016
Ian Gent
University of St Andrews
http://ian.gent
http://www.slideshare.net/turingfan/sicsa-phd2016
Image: © Columbia Pictures
And it’s all about doing
the same thing over
and over again
SICSA PhD Conference
June 27, 2016
Ian Gent
University of St Andrews
http://ian.gent
http://www.slideshare.net/turingfan/sicsa-phd2016
Image: © Columbia Pictures
Outline of Talk
• Part I: Reproducibility in (Computer) Science
• Part II: What we are doing for the rest of the session
Reproducibility in Science
• Officially, reproducibility is key to science
• If you reproduce my experiment it’s win-win-win
• You win because you have done a good thing for Science
• I win because my experiment is validated
• Science wins because it knows that my conclusions are valid
• Computer Science is at a huge advantage
• Many of our experiments are software runs
• With no human intervention or automatable intervention
• We can then rerun them very easily anytime we wish
• Even if our experiments need major human intervention
• We can automate the parts that don’t
• Plus data analysis etc
7
Reproducibility in Science
• Officially, reproducibility is key to science
• If you reproduce my experiment it’s win-win-win
• You win because you have done a good thing for Science
• I win because my experiment is validated
• Science wins because it knows that my conclusions are valid
• Computer Science is at a huge advantage
• Unofficially?
8
#overlyhonestmethods
https://twitter.com/ianholmes/status/288689712636493824
#overlyhonestmethods (still going strong!)
https://twitter.com/DRG_physics/status/745927096850087936
xkcd/PhD/Dilbert Compliance
https://www.phdcomics.com/comics/archive.php?comicid=1569
Galileo’s Telescopes
• Imagine if we could look through Galileo’s telescopes
• And we hadn’t bothered to keep them
• Or threw away the only postdoc …
• This has happened in computer science
• Many many times
Galileo’s Telescopes
• SHRDLU is a famous early AI program
• We have the source code
• But we can’t run it!
Image AI Lab MIT
Interlude: Can you do experiments in CS?
Real Experiments that helped me get my first AI
Journal paper
Yes it was 22 years ago!
What has this got to do with reproducibility?
How I almost didn’t get an AI Journal paper
• I ran experiments overnight
• Since “all” instances were easy they should just take a few hours
• When I came in the next morning the output was stuck
• No file writes for several hours
• I assumed that some file output had got lost somehow
• Killed the job and deleted all the relevant files
• And the files had random seeds in them
• If I had just not deleted them I could have reproduced the experiments
• If this effect was a one in a million event I might never have seen it again
• I am extremely lucky that it occurred once or twice in a thousand events
This is a fun story but with a bigger message
• Reproducibility is not about the worthy principle of science
• Reproducibility is about being able to reproduce your own work
• Or else you can lose journal papers!
• And make the next paper much harder to write
• And it’s about building on the great work of others
• The easier it is to reproduce somebody else’s work
• The easier it is to improve on their work and contribute more to science
• “It's not really for the benefit of other people. Experience shows the
principal beneficiary of reproducible research is you the author yourself.”
Jon Claerbout
Part II: The rest of this session
• 11.55 Group Discussions
• 12.30 Report back session
• 12.55 Interactive Polling
• 13.05 End (but something will slip so it will be 13.10)
Groups and papers
• Graham has pre-allocated you to groups
• Mailed you about your allocation on Friday
• Graham allocated a paper to each group
• Papers are (very broadly) related to your research field
• You should now be sitting at the table allocated to your group
• there are printed copies of the papers at your table.
• If you are unsure of your group raise your hand NOW
• Somebody from the committee will help you
The Papers
• Algorithms
• Learning Expressive Linkage Rules using GeneticProgramming
• Robert Isele, Christian Bizer
• Computer/Complex Systems
• Experimental demonstration of associative memory with memristive neuralnetworks
• YV Pershin, M Di Ventra
• Human Computer Interaction [2 groups]
• I did that! Measuring users' experience of agency in their own actions
• D Coyle, J Moore, PO Kristensson, P Fletcher, A Blackwell
• Machine Learning [2 groups]
• A Model for Learning the Semantics of Pictures
• V. Lavrenko, R. Manmatha, J. Jeon
• Networks
• Modeling and performance analysis of Bit Torrent like peer to peer networks
• D Qiu, R Srikant
Some Issues to Think about
• We have flagged up 5 issues for you to think about
• With a number of suggested questions in each one
• Some of these issues may not be relevant to your paper
• So don’t feel you have to discuss each one to the same depth
• Please use some of your time to discuss Issue 5: General Issues
• And you may have more issues
• Which would be great
• And you may have more questions within each issue
• Which would also be great
• Some of these issues may not be relevant to your paper -
1. Accuracy of Reproduction
• How accurately would you expect to be able to reproduce the results
of the paper?
• If you were to try reproduce this paper, what parts of the experiment
would you expect there to be variations or differences between the
original experiment and your reproduction?
2. Failure of Reproduction
• If your reproduction had more variations than you expected, what
variants in experimental design or results would result in you
concluding that the reproduction failed?
• What would make you decide that the experiments are flawed
enough to invalidate the conclusions?
3. Difficulty and Costs
• What aspects of the reported experiment would you expect to be
most challenging to reproduce?
• What specialist or specific resources do you need access to for
reproducing the experiment?
• Are these resources publicly available?
• What costs are attached to doing the reproduction?
• Would this be a good use of this money/time?
4. Legal and Ethical
• Are there legal or ethical issues around reproducing this experiment?
• If so, what are they?
• If not, why aren’t there any?
5. General Issues
• What are the main difficulties and costs, or legal and ethical issues
relating to reproducibility in your field of computing science?
• What practical steps are generally taken in your field of computing
science to achieve reproducibility?
• Why should we be striving for reproducibility in computing science?
• How much should paper reviewers focus on reproducibility?
• What about PhD examiners?
I’ve talked too long, Go!
11.55 Group Discussions
12.30 Report back session
12.55 Interactive Polling
13.05 End (but something will slip so it will be 13.10)

More Related Content

What's hot

Youth Trajectories: Pathways Design Charrette Presentation
Youth Trajectories: Pathways Design Charrette PresentationYouth Trajectories: Pathways Design Charrette Presentation
Youth Trajectories: Pathways Design Charrette Presentation
hivelearningnyc
 
Reu13 orientation
Reu13 orientationReu13 orientation
Reu13 orientation
gestrine
 
GeneralAssemb.ly Summer Program: Tech from the Ground Up
GeneralAssemb.ly Summer Program: Tech from the Ground UpGeneralAssemb.ly Summer Program: Tech from the Ground Up
GeneralAssemb.ly Summer Program: Tech from the Ground Up
Daniel Doubrovkine
 
Anatomical Atlas Keynote
Anatomical Atlas KeynoteAnatomical Atlas Keynote
Anatomical Atlas Keynote
Ráfagan Abreu
 
Crowdsourcing using MTurk for HCI research
Crowdsourcing using MTurk for HCI researchCrowdsourcing using MTurk for HCI research
Crowdsourcing using MTurk for HCI research
Ed Chi
 

What's hot (17)

Two Solitudes
Two SolitudesTwo Solitudes
Two Solitudes
 
Accomplishment Report Sysanal Itethic 1t20082009
Accomplishment Report Sysanal Itethic 1t20082009Accomplishment Report Sysanal Itethic 1t20082009
Accomplishment Report Sysanal Itethic 1t20082009
 
Youth Trajectories: Pathways Design Charrette Presentation
Youth Trajectories: Pathways Design Charrette PresentationYouth Trajectories: Pathways Design Charrette Presentation
Youth Trajectories: Pathways Design Charrette Presentation
 
We Know Less Than You Think (But We Do Know Something)
We Know Less Than You Think (But We Do Know Something)We Know Less Than You Think (But We Do Know Something)
We Know Less Than You Think (But We Do Know Something)
 
It's Not Your Fault - Blameless Post-mortems
It's Not Your Fault - Blameless Post-mortemsIt's Not Your Fault - Blameless Post-mortems
It's Not Your Fault - Blameless Post-mortems
 
Sylvia MacDonald - How the Three Amigos made us more effective - EuroSTAR 2017
Sylvia MacDonald - How the Three Amigos made us more effective - EuroSTAR  2017Sylvia MacDonald - How the Three Amigos made us more effective - EuroSTAR  2017
Sylvia MacDonald - How the Three Amigos made us more effective - EuroSTAR 2017
 
Reu13 orientation
Reu13 orientationReu13 orientation
Reu13 orientation
 
GeneralAssemb.ly Summer Program: Tech from the Ground Up
GeneralAssemb.ly Summer Program: Tech from the Ground UpGeneralAssemb.ly Summer Program: Tech from the Ground Up
GeneralAssemb.ly Summer Program: Tech from the Ground Up
 
How to avoid research debt
How to avoid research debtHow to avoid research debt
How to avoid research debt
 
What We've Learned From Building Basie
What We've Learned From Building BasieWhat We've Learned From Building Basie
What We've Learned From Building Basie
 
Open Source, Open Science, & Citizen Science
Open Source, Open Science, & Citizen ScienceOpen Source, Open Science, & Citizen Science
Open Source, Open Science, & Citizen Science
 
Twelve tips on how to prepare an ERC grant proposal
Twelve tips on how to prepare an ERC grant proposalTwelve tips on how to prepare an ERC grant proposal
Twelve tips on how to prepare an ERC grant proposal
 
Webquest
Webquest Webquest
Webquest
 
The Well Connected Facility
The Well Connected FacilityThe Well Connected Facility
The Well Connected Facility
 
Anatomical Atlas Keynote
Anatomical Atlas KeynoteAnatomical Atlas Keynote
Anatomical Atlas Keynote
 
How to get what you really want from Testing' with Michael Bolton
How to get what you really want from Testing' with Michael BoltonHow to get what you really want from Testing' with Michael Bolton
How to get what you really want from Testing' with Michael Bolton
 
Crowdsourcing using MTurk for HCI research
Crowdsourcing using MTurk for HCI researchCrowdsourcing using MTurk for HCI research
Crowdsourcing using MTurk for HCI research
 

Viewers also liked

Women and Computer Science
Women and Computer ScienceWomen and Computer Science
Women and Computer Science
Brittany_Lane
 
Illegal Immigration
Illegal ImmigrationIllegal Immigration
Illegal Immigration
MrG
 

Viewers also liked (6)

Women in Science 2015: The Computer Scientist and the Cleaner
Women in Science 2015: The Computer Scientist and the CleanerWomen in Science 2015: The Computer Scientist and the Cleaner
Women in Science 2015: The Computer Scientist and the Cleaner
 
Women and Computer Science
Women and Computer ScienceWomen and Computer Science
Women and Computer Science
 
Women in computer_science
Women in computer_scienceWomen in computer_science
Women in computer_science
 
Illegal Immigration
Illegal ImmigrationIllegal Immigration
Illegal Immigration
 
Illegal Immigration
Illegal ImmigrationIllegal Immigration
Illegal Immigration
 
Top 10 Women in the History of Tech
Top 10 Women in the History of TechTop 10 Women in the History of Tech
Top 10 Women in the History of Tech
 

Similar to sicsa-phd2016

Techniques d’etudes et de recherche
Techniques d’etudes et de rechercheTechniques d’etudes et de recherche
Techniques d’etudes et de recherche
Mohamed Diallo
 
Research Challenges – Am I Doing “Real” Research?
Research Challenges – Am I Doing “Real” Research?Research Challenges – Am I Doing “Real” Research?
Research Challenges – Am I Doing “Real” Research?
Dr. Mazlan Abbas
 
Five selfish reasons to work reproducibly
Five selfish reasons to work reproduciblyFive selfish reasons to work reproducibly
Five selfish reasons to work reproducibly
Florian Markowetz
 

Similar to sicsa-phd2016 (20)

Techniques d’etudes et de recherche
Techniques d’etudes et de rechercheTechniques d’etudes et de recherche
Techniques d’etudes et de recherche
 
Research Challenges – Am I Doing “Real” Research?
Research Challenges – Am I Doing “Real” Research?Research Challenges – Am I Doing “Real” Research?
Research Challenges – Am I Doing “Real” Research?
 
The science and art of methodology
The science and art of methodologyThe science and art of methodology
The science and art of methodology
 
10 tips for a better UX survey
10 tips for a better UX survey10 tips for a better UX survey
10 tips for a better UX survey
 
SY 7034 Week8
SY 7034 Week8SY 7034 Week8
SY 7034 Week8
 
Your PhD and You
Your PhD and YouYour PhD and You
Your PhD and You
 
Digital scholarship - all day workshop
Digital scholarship - all day workshopDigital scholarship - all day workshop
Digital scholarship - all day workshop
 
Waves keynote2c
Waves keynote2cWaves keynote2c
Waves keynote2c
 
Five selfish reasons to work reproducibly
Five selfish reasons to work reproduciblyFive selfish reasons to work reproducibly
Five selfish reasons to work reproducibly
 
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchFive Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
 
Information symposium
Information symposiumInformation symposium
Information symposium
 
6_2019_10_31!10_52_47_PM.PPT
6_2019_10_31!10_52_47_PM.PPT6_2019_10_31!10_52_47_PM.PPT
6_2019_10_31!10_52_47_PM.PPT
 
Pikas casci talk 11262013 final
Pikas casci talk 11262013 finalPikas casci talk 11262013 final
Pikas casci talk 11262013 final
 
Core Methods In Educational Data Mining
Core Methods In Educational Data MiningCore Methods In Educational Data Mining
Core Methods In Educational Data Mining
 
Independent investigation
Independent investigationIndependent investigation
Independent investigation
 
How Not to Kill Thinking (PCTM)
How Not to Kill Thinking (PCTM)How Not to Kill Thinking (PCTM)
How Not to Kill Thinking (PCTM)
 
Learn Learning + Prototype Testing
Learn Learning + Prototype TestingLearn Learning + Prototype Testing
Learn Learning + Prototype Testing
 
Action research: moving research forward + how it fits with dissertation or...
Action research:  moving research  forward + how it fits with dissertation or...Action research:  moving research  forward + how it fits with dissertation or...
Action research: moving research forward + how it fits with dissertation or...
 
Social Graphs for Better Drug Development
Social Graphs for Better Drug DevelopmentSocial Graphs for Better Drug Development
Social Graphs for Better Drug Development
 
Escaping Datageddon
Escaping DatageddonEscaping Datageddon
Escaping Datageddon
 

More from turingfan

Computer Scientist And the Cleaner, V6
Computer Scientist And the Cleaner, V6Computer Scientist And the Cleaner, V6
Computer Scientist And the Cleaner, V6
turingfan
 
The Computer Scientist and the Cleaner v5
The Computer Scientist and the Cleaner v5The Computer Scientist and the Cleaner v5
The Computer Scientist and the Cleaner v5
turingfan
 

More from turingfan (11)

Standrews3
Standrews3Standrews3
Standrews3
 
Why Computer Science is a Great Choice
Why Computer Science is a Great ChoiceWhy Computer Science is a Great Choice
Why Computer Science is a Great Choice
 
Sicsa phd2015
Sicsa phd2015Sicsa phd2015
Sicsa phd2015
 
Computer Scientist And the Cleaner, V6
Computer Scientist And the Cleaner, V6Computer Scientist And the Cleaner, V6
Computer Scientist And the Cleaner, V6
 
The Computer Scientist and the Cleaner v5
The Computer Scientist and the Cleaner v5The Computer Scientist and the Cleaner v5
The Computer Scientist and the Cleaner v5
 
The Computer Scientist and the Cleaner v4
The Computer Scientist and the Cleaner v4The Computer Scientist and the Cleaner v4
The Computer Scientist and the Cleaner v4
 
The Computer Scientist and the Cleaner v3
The Computer Scientist and the Cleaner v3The Computer Scientist and the Cleaner v3
The Computer Scientist and the Cleaner v3
 
V2 of The Computer Scientist and the Cleaner
V2 of The Computer Scientist and the CleanerV2 of The Computer Scientist and the Cleaner
V2 of The Computer Scientist and the Cleaner
 
V1 of The Computer Scientist and The Cleaner
V1 of The Computer Scientist and The CleanerV1 of The Computer Scientist and The Cleaner
V1 of The Computer Scientist and The Cleaner
 
The Computer Scientist and The Cleaner
The Computer Scientist and The CleanerThe Computer Scientist and The Cleaner
The Computer Scientist and The Cleaner
 
Optimal Implementation of Watched Literals and More General Techniques
Optimal Implementation of  Watched Literals and More General TechniquesOptimal Implementation of  Watched Literals and More General Techniques
Optimal Implementation of Watched Literals and More General Techniques
 

Recently uploaded

Recently uploaded (20)

Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdfUnit 3 Emotional Intelligence and Spiritual Intelligence.pdf
Unit 3 Emotional Intelligence and Spiritual Intelligence.pdf
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 

sicsa-phd2016

  • 1. Reproducible Research Workshop SICSA PhD Conference June 27, 2016 Ian Gent University of St Andrews http://ian.gent http://www.slideshare.net/turingfan/sicsaphd2016 Image: © Columbia Pictures
  • 2. Joint Work with Graham Mcdonald • http://www.gla.ac.uk/schools/computing/researchstudents/grahammcdonald/ • Graham has put far more work into this session than me!
  • 3. Two things from the title slide I want to highlight
  • 4. Yes this really is my URL SICSA PhD Conference June 27, 2016 Ian Gent University of St Andrews http://ian.gent http://www.slideshare.net/turingfan/sicsa-phd2016 Image: © Columbia Pictures
  • 5. And it’s all about doing the same thing over and over again SICSA PhD Conference June 27, 2016 Ian Gent University of St Andrews http://ian.gent http://www.slideshare.net/turingfan/sicsa-phd2016 Image: © Columbia Pictures
  • 6. Outline of Talk • Part I: Reproducibility in (Computer) Science • Part II: What we are doing for the rest of the session
  • 7. Reproducibility in Science • Officially, reproducibility is key to science • If you reproduce my experiment it’s win-win-win • You win because you have done a good thing for Science • I win because my experiment is validated • Science wins because it knows that my conclusions are valid • Computer Science is at a huge advantage • Many of our experiments are software runs • With no human intervention or automatable intervention • We can then rerun them very easily anytime we wish • Even if our experiments need major human intervention • We can automate the parts that don’t • Plus data analysis etc 7
  • 8. Reproducibility in Science • Officially, reproducibility is key to science • If you reproduce my experiment it’s win-win-win • You win because you have done a good thing for Science • I win because my experiment is validated • Science wins because it knows that my conclusions are valid • Computer Science is at a huge advantage • Unofficially? 8
  • 10. #overlyhonestmethods (still going strong!) https://twitter.com/DRG_physics/status/745927096850087936
  • 12. Galileo’s Telescopes • Imagine if we could look through Galileo’s telescopes • And we hadn’t bothered to keep them • Or threw away the only postdoc … • This has happened in computer science • Many many times
  • 13. Galileo’s Telescopes • SHRDLU is a famous early AI program • We have the source code • But we can’t run it! Image AI Lab MIT
  • 14. Interlude: Can you do experiments in CS?
  • 15. Real Experiments that helped me get my first AI Journal paper
  • 16. Yes it was 22 years ago!
  • 17. What has this got to do with reproducibility?
  • 18. How I almost didn’t get an AI Journal paper • I ran experiments overnight • Since “all” instances were easy they should just take a few hours • When I came in the next morning the output was stuck • No file writes for several hours • I assumed that some file output had got lost somehow • Killed the job and deleted all the relevant files • And the files had random seeds in them • If I had just not deleted them I could have reproduced the experiments • If this effect was a one in a million event I might never have seen it again • I am extremely lucky that it occurred once or twice in a thousand events
  • 19. This is a fun story but with a bigger message • Reproducibility is not about the worthy principle of science • Reproducibility is about being able to reproduce your own work • Or else you can lose journal papers! • And make the next paper much harder to write • And it’s about building on the great work of others • The easier it is to reproduce somebody else’s work • The easier it is to improve on their work and contribute more to science • “It's not really for the benefit of other people. Experience shows the principal beneficiary of reproducible research is you the author yourself.” Jon Claerbout
  • 20. Part II: The rest of this session • 11.55 Group Discussions • 12.30 Report back session • 12.55 Interactive Polling • 13.05 End (but something will slip so it will be 13.10)
  • 21. Groups and papers • Graham has pre-allocated you to groups • Mailed you about your allocation on Friday • Graham allocated a paper to each group • Papers are (very broadly) related to your research field • You should now be sitting at the table allocated to your group • there are printed copies of the papers at your table. • If you are unsure of your group raise your hand NOW • Somebody from the committee will help you
  • 22. The Papers • Algorithms • Learning Expressive Linkage Rules using GeneticProgramming • Robert Isele, Christian Bizer • Computer/Complex Systems • Experimental demonstration of associative memory with memristive neuralnetworks • YV Pershin, M Di Ventra • Human Computer Interaction [2 groups] • I did that! Measuring users' experience of agency in their own actions • D Coyle, J Moore, PO Kristensson, P Fletcher, A Blackwell • Machine Learning [2 groups] • A Model for Learning the Semantics of Pictures • V. Lavrenko, R. Manmatha, J. Jeon • Networks • Modeling and performance analysis of Bit Torrent like peer to peer networks • D Qiu, R Srikant
  • 23. Some Issues to Think about • We have flagged up 5 issues for you to think about • With a number of suggested questions in each one • Some of these issues may not be relevant to your paper • So don’t feel you have to discuss each one to the same depth • Please use some of your time to discuss Issue 5: General Issues • And you may have more issues • Which would be great • And you may have more questions within each issue • Which would also be great • Some of these issues may not be relevant to your paper -
  • 24. 1. Accuracy of Reproduction • How accurately would you expect to be able to reproduce the results of the paper? • If you were to try reproduce this paper, what parts of the experiment would you expect there to be variations or differences between the original experiment and your reproduction?
  • 25. 2. Failure of Reproduction • If your reproduction had more variations than you expected, what variants in experimental design or results would result in you concluding that the reproduction failed? • What would make you decide that the experiments are flawed enough to invalidate the conclusions?
  • 26. 3. Difficulty and Costs • What aspects of the reported experiment would you expect to be most challenging to reproduce? • What specialist or specific resources do you need access to for reproducing the experiment? • Are these resources publicly available? • What costs are attached to doing the reproduction? • Would this be a good use of this money/time?
  • 27. 4. Legal and Ethical • Are there legal or ethical issues around reproducing this experiment? • If so, what are they? • If not, why aren’t there any?
  • 28. 5. General Issues • What are the main difficulties and costs, or legal and ethical issues relating to reproducibility in your field of computing science? • What practical steps are generally taken in your field of computing science to achieve reproducibility? • Why should we be striving for reproducibility in computing science? • How much should paper reviewers focus on reproducibility? • What about PhD examiners?
  • 29. I’ve talked too long, Go! 11.55 Group Discussions 12.30 Report back session 12.55 Interactive Polling 13.05 End (but something will slip so it will be 13.10)