SlideShare una empresa de Scribd logo
1 de 40
Descargar para leer sin conexión
How To Make
Your Data Count
Patricia Cruse, Exec Director, DataCite
Daniella Lowenberg, Project Lead, MDC
November 26, 2018
Agenda
❏ What is our initiative about?
❏ Milestones in 2018
❏ Our first release
❏ Implementation at your repository
❏ Code of Practice
❏ Log Processing
❏ Sending Usage Logs
❏ Pulling & Displaying Usage & Citation Metrics
❏ Questions
What is MDC?
Imagine a world where
data are considered a
first-class research
output and are valued as
such...
Making Data
Count
2014 -
2015
▪2014 - 2015
5
“
6
1. Formal recommendation for measuring data usage
2. Develop Hub for all Data Level Metrics (DLM)
3. Make usage tracking easier
4. Drive adoption by showing how it can be done
(easily)
5. Engage across all research communities
6. Iterate!
8
Make Data
Count
2017 -
2019
9
DevelopNewDataLevelMetricsHub
101110
01101001
10100101
00111001
10100110
Data Citations
Repository
Usage Metrics
Future Metrics
Leverage Existing
Initiatives
Develop New
Recommendation
ServerLog
Processing
Drive Adoption - Engage Communities
Display
Data Metrics
Milestones thus far
We created a data usage metrics standard
We gave MDC a narrative
We built out an open hub for usage metrics
https://api.datacite.org/events
We implemented at our own repositories
We began to address citations
What does it look like?
How does this relate to
Scholix?
Scholix is not a thing - it is a change initiative
MDC & Scholix work hand in hand to advocate for best data
citation practices
● Scholix is an information framework for submitting data
citations
● MDC allows for displaying data citations back at the
repository
Implementation at your
repository
Why it is important
Community has long
grappled with the problem
of assessing and tracking
the results of scholarship
● Researchers
● Repositories
● Funders
● Publishers
Five simple steps to Make YOUR Data Count
1. Read the data usage metrics standard “Code of Practice for
Research Data”
2. Process your usage logs against this standard
3. Send processed and standardized usage logs to an open hub
4. Pull usage and citation metrics from an open hub
5. Display standardized usage and citation metrics on your
repository interface
Getting Started
We have built a “Getting Started” guide walking through
these steps as implemented in CDL’s Dash
https://github.com/CDLUC3/Make-Data-Count/blob/master/g
etting-started.md
1. Code of Practice for
Research Data
2. Log Processing
Standardized Logs
● Specialized logs that are processed against Code of Practice
● Views
● Downloads
● Users: at the country level, access during a session
● Session: de-duplicate access to page within 30 seconds
3. Sending Usage Reports
JSON Report - HeaderJSON Report - Body
The Usage Metrics Hub
Usage Metrics Hub (hosted by DataCite)
● Aggregator of research data usage reports
● Usage reports are made available via API (in original JSON format) and soon
web interface and CSV
● Usage reports are broken down by dataset (and request method), and can
then be aggregated over time
● Information in usage reports can be combined with data citations and dataset
metadata
4. Pulling Usage and
Citations
Pulling Usage and Citations
● Data usage metrics and citations are made available via public API, with one
“event” for each data citation or monthly usage count.
● Data citations are provided by DataCite metadata (i.e. come from data
repositories) and Crossref, with more to come
● Currently separate APIs for usage and citations, and a third API for dataset
metadata, will be combined into single API for easier retrieval of information
What’s next?
Looking Ahead
● Outreach and Adoption
○ Repository & Publishers
● Iterating on our implementation
○ Adding volume and usage by regions
○ Provide aggregation through DataCite hub
○ Beyond the DOI: metrics for other types of identifiers
○ Possible: altmetrics
Questions?
www.makedatacount.org
@makedatacount

Más contenido relacionado

La actualidad más candente

Bridging to a hybrid cloud data services architecture
Bridging to a hybrid cloud data services architectureBridging to a hybrid cloud data services architecture
Bridging to a hybrid cloud data services architecture
IBM Analytics
 

La actualidad más candente (20)

Denodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and RoadmapDenodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
 
CLOUD COMPUTING AND ITS APPLICATIONS IN DIGITAL LIBRARY SERVICES
CLOUD COMPUTING AND ITS APPLICATIONS IN  DIGITAL LIBRARY SERVICESCLOUD COMPUTING AND ITS APPLICATIONS IN  DIGITAL LIBRARY SERVICES
CLOUD COMPUTING AND ITS APPLICATIONS IN DIGITAL LIBRARY SERVICES
 
Software reuse, repurposing and reproducibility
Software reuse, repurposing and reproducibilitySoftware reuse, repurposing and reproducibility
Software reuse, repurposing and reproducibility
 
Accelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO WayAccelerating Delivery of Data Products - The EBSCO Way
Accelerating Delivery of Data Products - The EBSCO Way
 
Tracking research data footprints - slides
Tracking research data footprints - slidesTracking research data footprints - slides
Tracking research data footprints - slides
 
Joint Information and Preservation Management (WP5 ForgetIT 1st year review)
Joint Information and Preservation Management (WP5 ForgetIT 1st year review)Joint Information and Preservation Management (WP5 ForgetIT 1st year review)
Joint Information and Preservation Management (WP5 ForgetIT 1st year review)
 
Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...
Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...
Polyglot Persistence and Database Deployment by Sandeep Khuperkar CTO and Dir...
 
10 benefits to thinking inside Box
10 benefits to thinking inside Box10 benefits to thinking inside Box
10 benefits to thinking inside Box
 
Data Virtualization Reference Architectures: Correctly Architecting your Solu...
Data Virtualization Reference Architectures: Correctly Architecting your Solu...Data Virtualization Reference Architectures: Correctly Architecting your Solu...
Data Virtualization Reference Architectures: Correctly Architecting your Solu...
 
Psicquic applications
Psicquic applicationsPsicquic applications
Psicquic applications
 
Bridging to a hybrid cloud data services architecture
Bridging to a hybrid cloud data services architectureBridging to a hybrid cloud data services architecture
Bridging to a hybrid cloud data services architecture
 
Obiee content
Obiee contentObiee content
Obiee content
 
Affinomics Bioinformatics Meeting
Affinomics Bioinformatics MeetingAffinomics Bioinformatics Meeting
Affinomics Bioinformatics Meeting
 
HNSciCloud update @ the World LHC Computing Grid deployment board
HNSciCloud update @ the World LHC Computing Grid deployment board  HNSciCloud update @ the World LHC Computing Grid deployment board
HNSciCloud update @ the World LHC Computing Grid deployment board
 
Data vault - a platform for archiving research data
Data vault - a platform for archiving research dataData vault - a platform for archiving research data
Data vault - a platform for archiving research data
 
SQL Server: BI
SQL Server: BISQL Server: BI
SQL Server: BI
 
A4 r overview deck_1.7
A4 r overview deck_1.7A4 r overview deck_1.7
A4 r overview deck_1.7
 
RAGLD - Rapid Assembly of Geo-Centred Linked Data Applications
RAGLD - Rapid Assembly of Geo-Centred Linked Data ApplicationsRAGLD - Rapid Assembly of Geo-Centred Linked Data Applications
RAGLD - Rapid Assembly of Geo-Centred Linked Data Applications
 
The Science Cloud Users: Challenges and Needs
The Science Cloud Users: Challenges and NeedsThe Science Cloud Users: Challenges and Needs
The Science Cloud Users: Challenges and Needs
 
Sharing Big Data - Bob Jones
Sharing Big Data - Bob JonesSharing Big Data - Bob Jones
Sharing Big Data - Bob Jones
 

Similar a How to make your data count webinar, 26 Nov 2018

Similar a How to make your data count webinar, 26 Nov 2018 (20)

Team Data Science Process Presentation (TDSP), Aug 29, 2017
Team Data Science Process Presentation (TDSP), Aug 29, 2017Team Data Science Process Presentation (TDSP), Aug 29, 2017
Team Data Science Process Presentation (TDSP), Aug 29, 2017
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
rough-work.pptx
rough-work.pptxrough-work.pptx
rough-work.pptx
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
 
COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
 
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-PremiseWebinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
 
OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)
OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)
OpenAIRE Metrics Service: Usage Statistics (webinar for repository managers)
 
Webinar: Slippery Slope of SharePoint Migrations
Webinar: Slippery Slope of SharePoint Migrations Webinar: Slippery Slope of SharePoint Migrations
Webinar: Slippery Slope of SharePoint Migrations
 
Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
Efficient & effective data management for research projects : ILRI's Data Ma...
Efficient & effective  data management for research projects : ILRI's Data Ma...Efficient & effective  data management for research projects : ILRI's Data Ma...
Efficient & effective data management for research projects : ILRI's Data Ma...
 
Towards Generating Policy-compliant Datasets (poster)
Towards GeneratingPolicy-compliant Datasets (poster)Towards GeneratingPolicy-compliant Datasets (poster)
Towards Generating Policy-compliant Datasets (poster)
 
Shikha fdp 62_14july2017
Shikha fdp 62_14july2017Shikha fdp 62_14july2017
Shikha fdp 62_14july2017
 
How a Time Series Database Contributes to a Decentralized Cloud Object Storag...
How a Time Series Database Contributes to a Decentralized Cloud Object Storag...How a Time Series Database Contributes to a Decentralized Cloud Object Storag...
How a Time Series Database Contributes to a Decentralized Cloud Object Storag...
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Data Con LA 2018 - Enabling real-time exploration and analytics at scale at H...
Data Con LA 2018 - Enabling real-time exploration and analytics at scale at H...Data Con LA 2018 - Enabling real-time exploration and analytics at scale at H...
Data Con LA 2018 - Enabling real-time exploration and analytics at scale at H...
 

Más de ARDC

Más de ARDC (20)

Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADA
 
Architecture and Standards
Architecture and StandardsArchitecture and Standards
Architecture and Standards
 
Data Sharing and Release Legislation
Data Sharing and Release Legislation   Data Sharing and Release Legislation
Data Sharing and Release Legislation
 
Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)
 
Investigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveInvestigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspective
 
NCRIS and the health domain
NCRIS and the health domainNCRIS and the health domain
NCRIS and the health domain
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research data
 
Clinical trials data sharing
Clinical trials data sharingClinical trials data sharing
Clinical trials data sharing
 
Clinical trials and cohort studies
Clinical trials and cohort studiesClinical trials and cohort studies
Clinical trials and cohort studies
 
Introduction to vision and scope
Introduction to vision and scopeIntroduction to vision and scope
Introduction to vision and scope
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
 
Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128
 
Research data management and sharing of medical data
Research data management and sharing of medical dataResearch data management and sharing of medical data
Research data management and sharing of medical data
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) data
 
Applying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesApplying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and Challenges
 
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintReady, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
 
How FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataHow FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of data
 
Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018
 
Connected DMPs at UoA - we have a dream
Connected DMPs at UoA - we have a dreamConnected DMPs at UoA - we have a dream
Connected DMPs at UoA - we have a dream
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 

How to make your data count webinar, 26 Nov 2018

  • 1. How To Make Your Data Count Patricia Cruse, Exec Director, DataCite Daniella Lowenberg, Project Lead, MDC November 26, 2018
  • 2. Agenda ❏ What is our initiative about? ❏ Milestones in 2018 ❏ Our first release ❏ Implementation at your repository ❏ Code of Practice ❏ Log Processing ❏ Sending Usage Logs ❏ Pulling & Displaying Usage & Citation Metrics ❏ Questions
  • 4. Imagine a world where data are considered a first-class research output and are valued as such...
  • 7.
  • 8. 1. Formal recommendation for measuring data usage 2. Develop Hub for all Data Level Metrics (DLM) 3. Make usage tracking easier 4. Drive adoption by showing how it can be done (easily) 5. Engage across all research communities 6. Iterate! 8 Make Data Count 2017 - 2019
  • 9. 9 DevelopNewDataLevelMetricsHub 101110 01101001 10100101 00111001 10100110 Data Citations Repository Usage Metrics Future Metrics Leverage Existing Initiatives Develop New Recommendation ServerLog Processing Drive Adoption - Engage Communities Display Data Metrics
  • 11. We created a data usage metrics standard
  • 12. We gave MDC a narrative
  • 13. We built out an open hub for usage metrics https://api.datacite.org/events
  • 14. We implemented at our own repositories
  • 15. We began to address citations
  • 16. What does it look like?
  • 17.
  • 18.
  • 19.
  • 20.
  • 21. How does this relate to Scholix?
  • 22. Scholix is not a thing - it is a change initiative MDC & Scholix work hand in hand to advocate for best data citation practices ● Scholix is an information framework for submitting data citations ● MDC allows for displaying data citations back at the repository
  • 24. Why it is important Community has long grappled with the problem of assessing and tracking the results of scholarship ● Researchers ● Repositories ● Funders ● Publishers
  • 25. Five simple steps to Make YOUR Data Count 1. Read the data usage metrics standard “Code of Practice for Research Data” 2. Process your usage logs against this standard 3. Send processed and standardized usage logs to an open hub 4. Pull usage and citation metrics from an open hub 5. Display standardized usage and citation metrics on your repository interface
  • 26. Getting Started We have built a “Getting Started” guide walking through these steps as implemented in CDL’s Dash https://github.com/CDLUC3/Make-Data-Count/blob/master/g etting-started.md
  • 27. 1. Code of Practice for Research Data
  • 28.
  • 30. Standardized Logs ● Specialized logs that are processed against Code of Practice ● Views ● Downloads ● Users: at the country level, access during a session ● Session: de-duplicate access to page within 30 seconds
  • 31. 3. Sending Usage Reports
  • 32.
  • 33. JSON Report - HeaderJSON Report - Body
  • 35. Usage Metrics Hub (hosted by DataCite) ● Aggregator of research data usage reports ● Usage reports are made available via API (in original JSON format) and soon web interface and CSV ● Usage reports are broken down by dataset (and request method), and can then be aggregated over time ● Information in usage reports can be combined with data citations and dataset metadata
  • 36. 4. Pulling Usage and Citations
  • 37. Pulling Usage and Citations ● Data usage metrics and citations are made available via public API, with one “event” for each data citation or monthly usage count. ● Data citations are provided by DataCite metadata (i.e. come from data repositories) and Crossref, with more to come ● Currently separate APIs for usage and citations, and a third API for dataset metadata, will be combined into single API for easier retrieval of information
  • 39. Looking Ahead ● Outreach and Adoption ○ Repository & Publishers ● Iterating on our implementation ○ Adding volume and usage by regions ○ Provide aggregation through DataCite hub ○ Beyond the DOI: metrics for other types of identifiers ○ Possible: altmetrics