SlideShare una empresa de Scribd logo
1 de 20
RESEARCH DATA
SUPPORT FOR
RESEARCHERS:
METADATA.
CHALLENGES AND
OPPORTUNITIES
Clara Llebot Lorente NISO, September 2021
OREGON STATE UNIVERSITY 1
Fix old ones…!!! <2000
Abbreviations of
phytoplankton species
Blanks and zeros
Two sets of dates
Methods?
OREGON STATE UNIVERSITY 2
Why this pattern?
OREGON STATE UNIVERSITY 3
Who am I talking to?
OREGON STATE UNIVERSITY 4
Grad students
and early
career in
classes and
workshops
Consultations
Data
management
plans
Deposit of
datasets in
institutional
repository
How is their data?
OREGON STATE UNIVERSITY 5
Small datasets Disciplines without well
established standards
for metadata,
interdisciplinary
Challenges
OREGON STATE UNIVERSITY 6
Kirby Lee-USA TODAY Sports
Challenges
Enough metadata to ensure a
robust scientific process
OREGON STATE UNIVERSITY 7
Reproducibility and reuse
1 2
3
1. Metadata for a robust scientific
process
OREGON STATE UNIVERSITY 8
• Concept vs application.
• Now vs later.
• Intentionally, thoroughly,
systematically
readme templates
2. Reproducibility and reusability
OREGON STATE UNIVERSITY 9
1. Context: premise of study
We asked researchers to tell
us about how they interpret
datasets through a peer-
review like process
Peer reviewers and
Librarians evaluate dataset -
how different are the
interpretations of quality?
Does/should this lead to a
revision of our curation
methods and best practices?
Flickr/AJ Cann, CC BY-SA
2. Reproducibility and reusability
2. Reproducibility and reusability
● Datasets from ScholarsArchive@OSU,
institutional repository
● All datasets go through a review
process. Documentation is mandatory
● 8 datasets reviewed by 11 reviewers
11
2. Reproducibility and reusability
● Is the record
sufficiently
descriptive?
Title,
abstract,
keywords.
● Are there
other
elements that
could be
added?
● Are the data easily
readable? E.g.
community formats
● Are the data of high
quality?
● Are the values
physically possible
and plausible?
● Are there missing
data?
● Contact information
● Contextual information?
● Comprehensive
description of all the data
that is there?
● Methods well described
and reproducible
● Internal references
available
● Rights to use the dataset
RECORD DATA DOCUMENTATION
3. Results
● Descriptive information is critical
to a user’s ability to
understand what the data is
and whether it is potentially
useful
● Deficiencies limit the potential
reusability of the dataset.
● Areas of description work
together to create a more
complete description of the
dataset.
● Information often provided via
links to other sources: articles,
dissertations.
● Researchers are comfortable
using related articles. Librarians
value the presence of dataset
specific documentation higher
than most reviewers.
● Librarians took into consideration
whether links were accessible
and open.
INSUFFICIENT DESCRIPTION LINKS
3. Results
● We ask for the same
information in multiple
documentation locations (record
metadata, documentation, and
dataset). Sometimes is in
articles too.
● Not clear how this duplication of
effort impacts data submission
quality, as the combination
typically was enough to allow the
reviewer or librarian to
understand the dataset in
detail
● Domain expertise was important
across all areas of review for
datasets. The curating librarians do
not have sufficient domain
expertise to properly evaluate the
quality of the data, or metadata.
● Reviewers confused in the areas of
licensing, rights statements,
persistent identifiers, and where
specific types of information belong -
librarian’s expertise.
DUPLICATION OF EFFORT DOMAIN EXPERTISE
3. FAIR data
• F2. Data are described with rich metadata
• A2. Metadata are accessible, even when the data are no longer
available
• I1. (Meta)data use a formal, accessible, shared, and broadly
applicable language for knowledge representation.
• R1.3. (Meta)data meet domain-relevant community standards
OREGON STATE UNIVERSITY 15
3. FAIR data
OREGON STATE UNIVERSITY 16
Greatest disconnect between researchers and metadata
Tools, tools, tools
Most standards are
made for metadata
specialists, not for
researchers
Support
3. FAIR data
• FAIR principles are aspirational
• Disciplines are at different points in their development of
standards and tools. What for some are choices, for others are
challenges. (Jacobsen et al., 2020)
• There is a lot that is being done, but convergence may take
time.
OREGON STATE UNIVERSITY 17
Conclusions
OREGON STATE UNIVERSITY 18
Training and
teaching that can
be done with
support (e.g.
libraries)
Basics of metadata Tools and
translation of
concepts
Organizations and
communities that
maintain
specifications and
standards
Convergence of
standards
Organizations and
researchers talking
about metadata
Clara Llebot Lorente | Data Management Specialist
clara.llebot@oregonstate.edu
ResearchDataServices@oregonstate.edu
http://bit.ly/OSUData
This presentation is licensed under a CC0 license.
OREGON STATE UNIVERSITY 19

Más contenido relacionado

La actualidad más candente

RDAP14: University-wide Research Data Management Policy
RDAP14: University-wide Research Data Management PolicyRDAP14: University-wide Research Data Management Policy
RDAP14: University-wide Research Data Management PolicyASIS&T
 
RDAP14: Comparing disciplinary repositories: tDAR vs. Open Context
RDAP14: Comparing disciplinary repositories: tDAR vs. Open ContextRDAP14: Comparing disciplinary repositories: tDAR vs. Open Context
RDAP14: Comparing disciplinary repositories: tDAR vs. Open ContextASIS&T
 
Research Data Overview
Research Data OverviewResearch Data Overview
Research Data Overviewntunmg
 
RDAP14: Emerging role of UC Libraries in research data management education
RDAP14: Emerging role of UC Libraries in research data management educationRDAP14: Emerging role of UC Libraries in research data management education
RDAP14: Emerging role of UC Libraries in research data management educationASIS&T
 
RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel ASIS&T
 
RDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for EarthRDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for EarthASIS&T
 
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...ASIS&T
 
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectRDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectASIS&T
 
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...ARDC
 

La actualidad más candente (20)

Henderson "Institutional Identifiers"
Henderson "Institutional Identifiers"Henderson "Institutional Identifiers"
Henderson "Institutional Identifiers"
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
RDAP14: University-wide Research Data Management Policy
RDAP14: University-wide Research Data Management PolicyRDAP14: University-wide Research Data Management Policy
RDAP14: University-wide Research Data Management Policy
 
Holmes "Institutional Infrastructure for Data Sharing"
Holmes "Institutional Infrastructure for Data Sharing"Holmes "Institutional Infrastructure for Data Sharing"
Holmes "Institutional Infrastructure for Data Sharing"
 
RDAP14: Comparing disciplinary repositories: tDAR vs. Open Context
RDAP14: Comparing disciplinary repositories: tDAR vs. Open ContextRDAP14: Comparing disciplinary repositories: tDAR vs. Open Context
RDAP14: Comparing disciplinary repositories: tDAR vs. Open Context
 
Valen Metadata and the [Data] Repository
Valen Metadata and the [Data] RepositoryValen Metadata and the [Data] Repository
Valen Metadata and the [Data] Repository
 
Research Data Overview
Research Data OverviewResearch Data Overview
Research Data Overview
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
RDAP14: Emerging role of UC Libraries in research data management education
RDAP14: Emerging role of UC Libraries in research data management educationRDAP14: Emerging role of UC Libraries in research data management education
RDAP14: Emerging role of UC Libraries in research data management education
 
RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel RDAP14: Learning to Curate Panel
RDAP14: Learning to Curate Panel
 
RDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for EarthRDAP14: DataONE: Data Observation Network for Earth
RDAP14: DataONE: Data Observation Network for Earth
 
Lee "Supporting Research Data is a Group Effort"
Lee "Supporting Research Data is a Group Effort"Lee "Supporting Research Data is a Group Effort"
Lee "Supporting Research Data is a Group Effort"
 
NISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management PlanNISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management Plan
 
Putnam Data Quality and the IR
Putnam Data Quality and the IRPutnam Data Quality and the IR
Putnam Data Quality and the IR
 
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
RDAP14: Policy Recommendations for Institutions to Serve as Trustworthy Stewa...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectRDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
 
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Chilton "Collaborative Collection Assessment"
Chilton "Collaborative Collection Assessment"Chilton "Collaborative Collection Assessment"
Chilton "Collaborative Collection Assessment"
 

Similar a Llebot "Research Data Support for Researchers: Metadata, Challenges, and Opportunities"

Data sharing as part of the research workflow
Data sharing as part of the research workflowData sharing as part of the research workflow
Data sharing as part of the research workflowVarsha Khodiyar
 
Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...Riccardo Albertoni
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystemVarsha Khodiyar
 
The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpVarsha Khodiyar
 
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceNC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceSusanna-Assunta Sansone
 
FSCI Data Discovery
FSCI Data DiscoveryFSCI Data Discovery
FSCI Data DiscoveryARDC
 
Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015 Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015 Susanna-Assunta Sansone
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
 
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...ASIS&T
 
Doing research better: The role of meta‐data
Doing research better: The role of meta‐dataDoing research better: The role of meta‐data
Doing research better: The role of meta‐dataGarethKnight
 
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...The University of Edinburgh
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Lucy McKenna
 
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...University of California Curation Center
 
Open from beginning to end: addressing barriers to open research - a personal...
Open from beginning to end: addressing barriers to open research - a personal...Open from beginning to end: addressing barriers to open research - a personal...
Open from beginning to end: addressing barriers to open research - a personal...UoLResearchSupport
 
Transparency and reproducibility in research
Transparency and reproducibility in researchTransparency and reproducibility in research
Transparency and reproducibility in researchLouise Corti
 
How and Why to Share Your Data
How and Why to Share Your DataHow and Why to Share Your Data
How and Why to Share Your Datakfear
 
The blessing and the curse: handshaking between general and specialist data r...
The blessing and the curse: handshaking between general and specialist data r...The blessing and the curse: handshaking between general and specialist data r...
The blessing and the curse: handshaking between general and specialist data r...Hilmar Lapp
 
Effective research data management
Effective research data managementEffective research data management
Effective research data managementCatherine Gold
 
A Framework for improving the effectiveness of the Openness in OER Repositori...
A Framework for improving the effectiveness of the Openness in OER Repositori...A Framework for improving the effectiveness of the Openness in OER Repositori...
A Framework for improving the effectiveness of the Openness in OER Repositori...Open Education Consortium
 

Similar a Llebot "Research Data Support for Researchers: Metadata, Challenges, and Opportunities" (20)

Data sharing as part of the research workflow
Data sharing as part of the research workflowData sharing as part of the research workflow
Data sharing as part of the research workflow
 
Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...Semantic Similarity and Selection of Resources Published According to Linked ...
Semantic Similarity and Selection of Resources Published According to Linked ...
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystem
 
The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can help
 
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better ScienceNC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
NC3Rs Publication Bias workshop - Sansone - Better Data = Better Science
 
FSCI Data Discovery
FSCI Data DiscoveryFSCI Data Discovery
FSCI Data Discovery
 
ROER4D Open Data Initiative
ROER4D Open Data InitiativeROER4D Open Data Initiative
ROER4D Open Data Initiative
 
Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015 Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
 
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
 
Doing research better: The role of meta‐data
Doing research better: The role of meta‐dataDoing research better: The role of meta‐data
Doing research better: The role of meta‐data
 
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
 
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
DMPTool Webinar 8: Data Curation Profiles and the DMPTool (presented by Jake ...
 
Open from beginning to end: addressing barriers to open research - a personal...
Open from beginning to end: addressing barriers to open research - a personal...Open from beginning to end: addressing barriers to open research - a personal...
Open from beginning to end: addressing barriers to open research - a personal...
 
Transparency and reproducibility in research
Transparency and reproducibility in researchTransparency and reproducibility in research
Transparency and reproducibility in research
 
How and Why to Share Your Data
How and Why to Share Your DataHow and Why to Share Your Data
How and Why to Share Your Data
 
The blessing and the curse: handshaking between general and specialist data r...
The blessing and the curse: handshaking between general and specialist data r...The blessing and the curse: handshaking between general and specialist data r...
The blessing and the curse: handshaking between general and specialist data r...
 
Effective research data management
Effective research data managementEffective research data management
Effective research data management
 
A Framework for improving the effectiveness of the Openness in OER Repositori...
A Framework for improving the effectiveness of the Openness in OER Repositori...A Framework for improving the effectiveness of the Openness in OER Repositori...
A Framework for improving the effectiveness of the Openness in OER Repositori...
 

Más de National Information Standards Organization (NISO)

Más de National Information Standards Organization (NISO) (20)

Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"Bazargan "NISO Webinar, Sustainability in Publishing"
Bazargan "NISO Webinar, Sustainability in Publishing"
 
Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"Rapple "Scholarly Communications and the Sustainable Development Goals"
Rapple "Scholarly Communications and the Sustainable Development Goals"
 
Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"Compton "NISO Webinar, Sustainability in Publishing"
Compton "NISO Webinar, Sustainability in Publishing"
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
Hazen, Morse, and Varnum "Spring 2024 ODI Conformance Statement Workshop for ...
 
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
Mattingly "AI & Prompt Design" - Introduction to Machine Learning"
 
Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"Mattingly "Text and Data Mining: Building Data Driven Applications"
Mattingly "Text and Data Mining: Building Data Driven Applications"
 
Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"Mattingly "Text and Data Mining: Searching Vectors"
Mattingly "Text and Data Mining: Searching Vectors"
 
Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"Mattingly "Text Mining Techniques"
Mattingly "Text Mining Techniques"
 
Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"Mattingly "Text Processing for Library Data: Representing Text as Data"
Mattingly "Text Processing for Library Data: Representing Text as Data"
 
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
Carpenter "Designing NISO's New Strategic Plan: 2023-2026"
 
Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"Ross and Clark "Strategic Planning"
Ross and Clark "Strategic Planning"
 
Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"Mattingly "Data Mining Techniques: Classification and Clustering"
Mattingly "Data Mining Techniques: Classification and Clustering"
 
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...Straza "Global collaboration towards equitable and open science: UNESCO Recom...
Straza "Global collaboration towards equitable and open science: UNESCO Recom...
 
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
Lippincott "Beyond access: Accelerating discovery and increasing trust throug...
 
Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"Kriegsman "Integrating Open and Equitable Research into Open Science"
Kriegsman "Integrating Open and Equitable Research into Open Science"
 
Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"Mattingly "Ethics and Cleaning Data"
Mattingly "Ethics and Cleaning Data"
 
Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"Mercado-Lara "Open & Equitable Program"
Mercado-Lara "Open & Equitable Program"
 

Último

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...PsychoTech Services
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 

Último (20)

Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 

Llebot "Research Data Support for Researchers: Metadata, Challenges, and Opportunities"

  • 1. RESEARCH DATA SUPPORT FOR RESEARCHERS: METADATA. CHALLENGES AND OPPORTUNITIES Clara Llebot Lorente NISO, September 2021
  • 2. OREGON STATE UNIVERSITY 1 Fix old ones…!!! <2000 Abbreviations of phytoplankton species Blanks and zeros Two sets of dates Methods?
  • 3. OREGON STATE UNIVERSITY 2 Why this pattern?
  • 5. Who am I talking to? OREGON STATE UNIVERSITY 4 Grad students and early career in classes and workshops Consultations Data management plans Deposit of datasets in institutional repository
  • 6. How is their data? OREGON STATE UNIVERSITY 5 Small datasets Disciplines without well established standards for metadata, interdisciplinary
  • 7. Challenges OREGON STATE UNIVERSITY 6 Kirby Lee-USA TODAY Sports
  • 8. Challenges Enough metadata to ensure a robust scientific process OREGON STATE UNIVERSITY 7 Reproducibility and reuse 1 2 3
  • 9. 1. Metadata for a robust scientific process OREGON STATE UNIVERSITY 8 • Concept vs application. • Now vs later. • Intentionally, thoroughly, systematically readme templates
  • 10. 2. Reproducibility and reusability OREGON STATE UNIVERSITY 9
  • 11. 1. Context: premise of study We asked researchers to tell us about how they interpret datasets through a peer- review like process Peer reviewers and Librarians evaluate dataset - how different are the interpretations of quality? Does/should this lead to a revision of our curation methods and best practices? Flickr/AJ Cann, CC BY-SA 2. Reproducibility and reusability
  • 12. 2. Reproducibility and reusability ● Datasets from ScholarsArchive@OSU, institutional repository ● All datasets go through a review process. Documentation is mandatory ● 8 datasets reviewed by 11 reviewers 11
  • 13. 2. Reproducibility and reusability ● Is the record sufficiently descriptive? Title, abstract, keywords. ● Are there other elements that could be added? ● Are the data easily readable? E.g. community formats ● Are the data of high quality? ● Are the values physically possible and plausible? ● Are there missing data? ● Contact information ● Contextual information? ● Comprehensive description of all the data that is there? ● Methods well described and reproducible ● Internal references available ● Rights to use the dataset RECORD DATA DOCUMENTATION
  • 14. 3. Results ● Descriptive information is critical to a user’s ability to understand what the data is and whether it is potentially useful ● Deficiencies limit the potential reusability of the dataset. ● Areas of description work together to create a more complete description of the dataset. ● Information often provided via links to other sources: articles, dissertations. ● Researchers are comfortable using related articles. Librarians value the presence of dataset specific documentation higher than most reviewers. ● Librarians took into consideration whether links were accessible and open. INSUFFICIENT DESCRIPTION LINKS
  • 15. 3. Results ● We ask for the same information in multiple documentation locations (record metadata, documentation, and dataset). Sometimes is in articles too. ● Not clear how this duplication of effort impacts data submission quality, as the combination typically was enough to allow the reviewer or librarian to understand the dataset in detail ● Domain expertise was important across all areas of review for datasets. The curating librarians do not have sufficient domain expertise to properly evaluate the quality of the data, or metadata. ● Reviewers confused in the areas of licensing, rights statements, persistent identifiers, and where specific types of information belong - librarian’s expertise. DUPLICATION OF EFFORT DOMAIN EXPERTISE
  • 16. 3. FAIR data • F2. Data are described with rich metadata • A2. Metadata are accessible, even when the data are no longer available • I1. (Meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. • R1.3. (Meta)data meet domain-relevant community standards OREGON STATE UNIVERSITY 15
  • 17. 3. FAIR data OREGON STATE UNIVERSITY 16 Greatest disconnect between researchers and metadata Tools, tools, tools Most standards are made for metadata specialists, not for researchers Support
  • 18. 3. FAIR data • FAIR principles are aspirational • Disciplines are at different points in their development of standards and tools. What for some are choices, for others are challenges. (Jacobsen et al., 2020) • There is a lot that is being done, but convergence may take time. OREGON STATE UNIVERSITY 17
  • 19. Conclusions OREGON STATE UNIVERSITY 18 Training and teaching that can be done with support (e.g. libraries) Basics of metadata Tools and translation of concepts Organizations and communities that maintain specifications and standards Convergence of standards Organizations and researchers talking about metadata
  • 20. Clara Llebot Lorente | Data Management Specialist clara.llebot@oregonstate.edu ResearchDataServices@oregonstate.edu http://bit.ly/OSUData This presentation is licensed under a CC0 license. OREGON STATE UNIVERSITY 19

Notas del editor

  1. Must be in Slide Master mode to swap out photos.
  2. Statistical tool that converts a set of variables that are interrelated to another set of variables that are independent and that account for as much as the variability of the sample as possible.
  3. Research intensive university
  4. I will talk about my perception of challenges experimented by researchers, and I just want to acknowledge that many are probably just doing a wonderful job, and I never interact with them because of that! Kirby Lee-USA TODAY Sports
  5. Low hanging fruit Metadata during the research process Concept vs application. They understand well what metadata is, and why we should record it. But when you ask them what metadata they will collect, they will say that their project does not need metadata. Researchers writing DMP leave the metadata section blank, because they do not know what to write.
  6. Image source: Flickr/AJ Cann, CC BY-SA in http://theconversation.com/explainer-what-is-peer-review-27797
  7. This is a summary of the questions we asked
  8. Reviewers reported missing methodology, information about the authors and their contact information, about licenses, and url about the dataset.
  9. Reviewers reported missing methodology, information about the authors and their contact information, about licenses, and url about the dataset.
  10. The FAIR principles add a step, because now we are considering not only reusability by humans, but by machines The FAIR principles talk about metadata pretty much everywhere. I chose four subprinciples, one of each principle, to talk about in this presentation. I think that the interoperability criteria is the most challenging, and also the one that really makes a difference. For metadata what this means is the use of standards, which I haven’t talked about.
  11. Giving support is challenging from the perspective of a