Findings from GitHub. Methods, Datasets and Limitations

Javier Canovas, Associate Professor at IN3 UOC
Valerio Cosentino, Javier L. Cánovas Izquierdo, Jordi Cabot
Flickr/BenNuttall
Motivation
Empirical Methods Employed
Dataset Used
Limitations Reported
Methodology
Discussion
Methodology
Title || Abstract || Keywords || IndexTerms
INCLUDES
“GitHub” OR “Git hub” OR “github”
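The search string above can be read as a filter over paper metadata. The following is a minimal sketch of that reading, not the authors' actual tooling: the `Paper` record, its field names, and the case-insensitive substring matching are all assumptions made for illustration, since the slides do not say how the digital libraries evaluated the query.

```python
from dataclasses import dataclass, field
from typing import List

# Terms taken from the search string on the slide.
SEARCH_TERMS = ("GitHub", "Git hub", "github")


@dataclass
class Paper:
    """Hypothetical metadata record; field names are assumptions."""
    title: str = ""
    abstract: str = ""
    keywords: List[str] = field(default_factory=list)
    index_terms: List[str] = field(default_factory=list)


def matches_query(paper: Paper) -> bool:
    """Return True if any searched field includes any of the search terms.

    Matching is case-insensitive substring matching (an assumption).
    """
    searchable = [paper.title, paper.abstract, *paper.keywords, *paper.index_terms]
    lowered_terms = [term.lower() for term in SEARCH_TERMS]
    return any(term in text.lower() for text in searchable for term in lowered_terms)
```

With case-insensitive matching, "GitHub" and "github" collapse into one term; the slide lists both, which may mean the original search engines were case-sensitive.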
Results
Flickr/mararle
Empirical Methods Employed
Datasets Used
Limitations Reported
Discussion
Flickr/KristinaAlexanderson
Data Collection: Freshness vs. Curation
Dataset Size: Small-medium size
Replicability: > 2/3 not providing dataset access
Sampling: Most use non-probability sampling
Longitudinal Studies: Scarcely used
Variety of methodologies: Replication? Comparisons?
Flickr/JimRafferty
Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution 4.0 International license.
Thanks!
http://tinyurl.com/GitHub-SystRev-Papers
Some works might have been ignored
Subjectivity issues