UK Digital Curation Centre: enabling research data management at the coalface
Dr Liz Lyon, Associate Director DCC / Director UKOLN, University of Bath, UK
Overview
http://www.ukoln.ac.uk/projects/I2S2/
Structural Sciences Infrastructure
Facility | Diamond Light Source Synchrotron | National Crystallography Service, University of Southampton | Local Earth Sciences Lab, University of Cambridge
Function | International service, multiple communities | UK service, multiple institutions. Also uses Diamond | Lone researcher at institution; uses NCS and ISIS large-scale facility
Administration | Peer-reviewed proposal required | Vetted applications. Electronic & paper-based records: experiments, safety ERA, instrument time | Multiple proposals, multiple forms
Workflow | Formulaic and bespoke | Formulaic | Complex, unrecorded
Software | In-house scripts | In-house scripts + open-source suite | In-house scripts + open-source suite
Raw data storage | In-house GDA store | ATLAS data-store | Laptop / local server
Derived data storage | Taken offsite on laptop / USB stick | eCrystals repository | Laptop / local server / USB stick
Metadata | Core Scientific MetaData Model | eBank/eCrystals schema | ?
Identifiers | Beam-line number | DOI, InChI | ?
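The metadata and identifier entries in the comparison above can be written down as a small data structure, which makes the gaps at the local-lab end easy to list. The following is only a minimal Python sketch: the `Facility` class, the `curation_gaps` helper, and the choice to treat a "?" on the slide as "nothing recorded" are all assumptions made for illustration, not part of the I2S2 project's own tooling.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Facility:
    """One column of the comparison (hypothetical structure)."""
    name: str
    # None stands in for a "?" on the slide (assumed to mean "none recorded").
    metadata_schema: Optional[str]
    identifiers: List[str] = field(default_factory=list)


FACILITIES = [
    Facility("Diamond Light Source", "Core Scientific MetaData Model",
             ["Beam-line number"]),
    Facility("National Crystallography Service", "eBank/eCrystals schema",
             ["DOI", "InChI"]),
    Facility("Local Earth Sciences Lab", None, []),
]


def curation_gaps(facilities: List[Facility]) -> Dict[str, List[str]]:
    """Map each facility with gaps to the list of fields it lacks."""
    gaps: Dict[str, List[str]] = {}
    for facility in facilities:
        missing = []
        if facility.metadata_schema is None:
            missing.append("metadata schema")
        if not facility.identifiers:
            missing.append("identifiers")
        if missing:
            gaps[facility.name] = missing
    return gaps


if __name__ == "__main__":
    for name, missing in curation_gaps(FACILITIES).items():
        print(f"{name}: no {', '.join(missing)}")
```

Under these assumptions, only the lone-researcher laboratory is reported as lacking both a metadata schema and identifiers.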
An Idealised Scientific Research Activity Lifecycle Model
[Diagram. Key: Research Activity, Administrative Activity, Curation Activity, Publication Activity, Information Flow.
Activities: Research Concept and/or Experiment Design; Write Proposal (include DMP); Peer-review Proposal; Start Project; Acquire Sample; Conduct Experiment; Generate, Create & Collect Raw Data; Check & Clean Raw Data; Process & Analyse Derived Data; Interpret & Analyse Results Data; Documentation, Metadata & Storage (Reference, Provenance, Context, Calibration etc.); Appraisal & Quality Control; Data Archive, Preservation & Curation (OAIS conformant; Representation Information etc.); IPR, Embargo & Access Control; Discover, Access, Validate, Reuse & Repurpose Data; Prepare Manuscript; Prepare Supplementary Data; Peer Review; Publish Research; Write Usage Report.
Data and outputs: Raw Data; Processed Data; Derived Data; Results Data; Research Outputs; Citations, References; User registration data; Instrument allocation data etc.; Comments, annotations, ratings etc.; Risk assessment data; other sample data; Programs (generate customised software); Papers, articles, presentations, reports; Publications Database; Scholarly Knowledge.]
Existing work: mappings and gaps
Data Management and Provenance (CSMD, OPM?)
Bibliographic records (FRBR, SWAP)
Curation (OAIS, PREMIS?)
DC, Ontologies
Software descriptions (??)
PROCESS
Slide: Brian Matthews, STFC
Integrated Information Model
Requirements Analysis Report
“… it is apparent that the greatest need is for a robust data management infrastructure which supports each researcher in capturing, storing, managing and working with all the data generated during an experiment. Internal sharing of research data amongst collaborating scientists … is also a primary concern as is a requirement for access to research data in the long run so that a researcher … can return to and validate the results well into the future.”
INCREMENTAL Project
Incremental Project Report, June 2010
“While many researchers are positive about sharing data in principle, they are almost universally reluctant in practice. … using these data to publish results before anyone else is the primary way of gaining prestige in nearly all disciplines.”
The majority of people felt that some form of policy or guidance was needed…
http://www.flickr.com/photos/mattimattila/3003324844/
Emerging funder requirements
DMP Online
http://www.dcc.ac.uk/dmponline
Currently updating Version 2.0
Version 3.0 summer 2010
Making DMPs work: the start of a long process…
[Diagram repeated: An Idealised Scientific Research Activity Lifecycle Model]
Data citation, credit, metrics, attribution
Incentives?
Complexity: what are we citing?
Attribution granularity: Macro; Micro / Nano
Large-scale predictive network models of disease
Functionality? How do we cite?
Take homes...
Chicago Mart Plaza, 6-8 December 2010 Thank you… www.dcc.ac.uk
 

 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 

UK Digital Curation Centre: enabling research data management at the coalface

  • 1.  
  • 2. UK Digital Curation Centre: enabling research data management at the coalface. Dr Liz Lyon, Associate Director DCC / Director UKOLN, University of Bath, UK
  • 3.
  • 4.
  • 6. Sites: (1) Diamond Light Source Synchrotron; (2) National Crystallography Service (NCS), University of Southampton; (3) Local Earth Sciences Lab, University of Cambridge
      Function: (1) International service, multiple communities; (2) UK service, multiple institutions. Also uses Diamond; (3) Lone researcher at institution, uses NCS and ISIS large-scale facility
      Administration: (1) Peer-reviewed proposal required; (2) Vetted applications. Electronic & paper-based records: experiments, safety ERA, instrument time; (3) Multiple proposals, multiple forms
      Workflow: (1) Formulaic and bespoke; (2) Formulaic; (3) Complex, unrecorded
      Software: (1) In-house scripts; (2) In-house scripts + open-source suite; (3) In-house scripts + open-source suite
      Raw data storage: (1) In-house GDA store; (2) ATLAS data-store; (3) Laptop / local server
      Derived data storage: (1) Taken offsite on laptop / USB stick; (2) eCrystals repository; (3) Laptop / local server / USB stick
      Metadata: (1) Core Scientific MetaData Model; (2) eBank/eCrystals schema; (3) ?
      Identifiers: (1) Beam-line number; (2) DOI, InChI; (3) ?
  • 7. [Diagram] An Idealised Scientific Research Activity Lifecycle Model
      Key: Research Activity; Administrative Activity; Curation Activity; Publication Activity; Information Flow
      Activities: Research Concept and/or Experiment Design; Write Proposal (include DMP); Peer-review Proposal; Start Project; Acquire Sample; Conduct Experiment; Generate, Create, & Collect Raw Data; Check & Clean Raw Data; Process & Analyse Derived Data; Interpret & Analyse Results Data; Documentation, Metadata & Storage (Reference, Provenance, Context, Calibration etc.); Appraisal & Quality Control; Archive, Preservation & Curation (OAIS conformant; Representation Information etc.); IPR, Embargo & Access Control; Discover, Access, Validate, Reuse & Repurpose Data; Prepare Manuscript; Prepare Supplementary Data; Peer Review; Publish Research; Write Usage Report
      Information objects: Raw Data; Processed Data; Derived Data; Results Data; Research Outputs; Citations, References; User registration data; Instrument allocation data etc.; Comments, annotations, ratings etc.; Risk assessment data; other sample data; Programs (generate customised software); Papers, articles, presentations, reports; Publications Database; Scholarly Knowledge
  • 8. Existing work: mappings and gaps. Data Management and Provenance (CSMD, OPM?); Bibliographic records (FRBR, SWAP); Curation (OAIS, PREMIS?); DC, Ontologies; Software descriptions (??); PROCESS. Slide: Brian Matthews, STFC
  • 9.
  • 10. Requirements Analysis Report: “… it is apparent that the greatest need is for a robust data management infrastructure which supports each researcher in capturing, storing, managing and working with all the data generated during an experiment. Internal sharing of research data amongst collaborating scientists … is also a primary concern as is a requirement for access to research data in the long run so that a researcher … can return to and validate the results well into the future.”
  • 11.
  • 12. Incremental Project Report, June 2010: “While many researchers are positive about sharing data in principle, they are almost universally reluctant in practice. … using these data to publish results before anyone else is the primary way of gaining prestige in nearly all disciplines.” The majority of people felt that some form of policy or guidance was needed… http://www.flickr.com/photos/mattimattila/3003324844/
  • 13. Emerging funder requirements
  • 14.
  • 15. DMP Online: http://www.dcc.ac.uk/dmponline. Currently updating Version 2.0; Version 3.0 summer 2010
  • 16.
  • 17. [Diagram, repeated from slide 7] An Idealised Scientific Research Activity Lifecycle Model
  • 18. Data citation, credit, metrics, attribution. Incentives?
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24. Chicago Mart Plaza, 6-8 December 2010. Thank you… www.dcc.ac.uk
  • 25.  

Editor's Notes

  1. Microsoft Research Faculty Summit 2010 © 2010 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION
  3. The I2S2 project aims to understand and identify the requirements for a data-driven research infrastructure in the Structural Sciences. The work is focused on the exemplar domain of Chemistry, but with a view towards inter-disciplinary application. The Idealised Scientific Research Data Lifecycle Model produced by the I2S2 project seeks to extend and adapt the Keeping Research Data Safe (KRDS) Activity Model from a “researcher perspective”. It adapts KRDS from an archive-centric to a researcher-centric view by: (a) defining and emphasising more of the activities in the research (KRDS “Pre-Archive”) phase, where research data is created; (b) adding a “Publication” set of activities; (c) concatenating the KRDS “Archive” phase activities in the centre of the model for simplification and presentational purposes; (d) adding some specific local research administration activities. In addition, for the purposes of the project, it adds some selective detail of information flows and information objects between the activities. Note that this is an idealised model, and several activities, such as peer review or conduct experiment, may have multiple instances or repetitions. It also represents a project view as of June 2010 and may be subject to further changes.