Supporting
the Research Data Life Cycle

                    Joan Starr
                     @joan_starr
    University of California Curation Center
           California Digital Library



                Columbia Research Data Symposium
Partnership between CDL | 10 UC campuses | Peer institutions
Provide solutions, services, resources for digital assets

Pool & distribute diverse experience, expertise, & resources



A life cycle approach
Stages: plan, collect, manage, share

• Create, edit, share, and save data management plans
• Create and manage long-term identifiers
• Open source add-in & Web app for Microsoft Excel as a data collection tool
• Collect, manage, preserve and publish websites and documents
• Curation repository: store, manage, and share research data
• Open Access publishing services / dynamic research platform

A life cycle approach: plan

DMPTool
 Meeting funding agencies' data management plan requirements


• Connect researchers to resources
  to create a data management plan
• NSF and directorates, NIH, NEH,
  IMLS, foundations, and more
• Customizable



             Primary Functions
             1. Step-by-step “wizard”
             2. Templates and examples
             3. Links to institutional resources and agency information
             4. Plan publication and sharing

                             Data Curation for Practitioners Workshop
DMP Tool: https://dmp.cdlib.org/



[Figure: Usage, Oct-11 to Aug-12. Left axis: number of plans (solid) and unique users (dashed), 0 to 3500. Right axis: number of institutions, 0 to 600. Series: Unique Users, Plans, Institutions.]

@ezidCDL
                                              EZID
                           Long-term identifiers made easy
•    Precise identification of a dataset
     (DOI or ARK)
•    Credit to data producers and data
     publishers
•    A link from the traditional literature
     to the data
•    Exposure and research metrics for
     datasets
     (Web of Knowledge, Google)

                        Primary Functions
                        1. Create long-term identifiers
                        2. Manage identifiers (and associated
                           metadata) over time
                        3. Resolve identifiers
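
The third function, resolving identifiers, can be illustrated with a minimal sketch. It assumes that a DOI or ARK written with its scheme prefix can be appended to the resolver host that appears in the EZID address (n2t.net). The function name `resolver_url` and the sample identifiers are hypothetical placeholders, not real datasets, and this is not EZID's own code.

```python
def resolver_url(identifier: str, resolver: str = "https://n2t.net") -> str:
    """Build a resolver URL for a DOI or ARK identifier string.

    Assumption: the resolver accepts '<base>/<scheme>:<value>' paths.
    """
    ident = identifier.strip()
    # The scheme is the text before the first colon, e.g. "doi" or "ark".
    scheme = ident.split(":", 1)[0].lower()
    if scheme not in ("doi", "ark"):
        raise ValueError(f"unsupported identifier scheme: {identifier!r}")
    return f"{resolver.rstrip('/')}/{ident}"


if __name__ == "__main__":
    # Placeholder identifiers for illustration only.
    print(resolver_url("ark:/99999/fk4example"))
    print(resolver_url("doi:10.5072/FK2EXAMPLE"))
```

Keeping the scheme prefix in the identifier string lets one function serve both identifier types named on the slide (DOI or ARK).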

EZID: http://n2t.net/ezid


                            EZID in action




A life cycle approach: collect

@DataUpCDL
                                    DataUp
                         Collect, share, archive, publish data


Primary Functions
1. Available as (1) an Excel add-in and
   (2) a cloud application
2. Document data
3. Check for good data practices
4. Obtain identifier and citation
5. Archive and share

DataUp: http://dataup.cdlib.org/

[Figure: "Researchers: How Frequently Do You Use Excel?" Percentage of respondents (0% to 100%) by group: Undergrad, Masters grad stud, PhD grad stud, postdoc, Masters sci, PhD sci. Responses on a scale from 1 (Rarely) to 5 (Every day). 55 respondents. Source: Carly Strasser, CDL.]

Web Archiving Service (WAS)
          •   ARCHIVE institution websites
          •   BUILD collections for research
          •   CAPTURE political and social events
          •   SAVE at-risk government websites


              Primary Functions
              1. Capture
              2. Manage
              3. Preserve
              4. Publish



WAS: http://webarchives.cdlib.org/


                       WAS Snapshot

                                                             54 public archives
                                                             120+ archives total
                                                             7,500+ sites
                                                             50+ TB
                                                             23 institutions


A life cycle approach

                                           plan

                                          collect

                                          manage

                                          share



For more information
UC3 Data Management Planning Resources
   http://www.cdlib.org/services/uc3/dmp/index.html
Twitter: @ezidCDL and @DataUpCDL
Email: uc3@ucop.edu; washelp@ucop.edu

How to find me:
Twitter: @joan_starr
Email: joan.starr@ucop.edu



Editor's notes

  1. But first, a very brief context setting. Serving the 10 UC campuses: 226,000 students; 134,000 faculty and staff.
  2. What is a data management plan? A document that describes what you will do with your data during and after you complete your research. The DMPTool “walks” scientists through the process of developing a concise but comprehensive data management plan that could enable good stewardship of data and meet requirements of sponsors and home institutions. Partners: University of Virginia Library, University of Illinois at Urbana-Champaign Library, DataONE, UCLA, and UCSD. The California Digital Library and its partners were awarded a $590,000 grant from the Alfred P. Sloan Foundation to fund further development of the popular Data Management Planning Tool in 2013. The bulk of the grant will go to the UC Curation Center (UC3) at the CDL to fund improvements to the DMPTool, including expanded functionality, training modules, documentation, and the creation of an open-source community to sustain the DMPTool in the future. Project partners are the University of Virginia Library, University of Illinois at Urbana-Champaign Library, and DataONE.
  3. Supplemental grant in the works. Received 2012 Digital Preservation Award recognition from the Library of Congress.
  4. My colleague Carly Strasser, the Service Manager for DataUp, spoke to about 200 researchers, and these trends held. Key findings: no data preservation; unaware of archives; resistant to sharing; poor data documentation.