SlideShare a Scribd company logo
1 of 48
Data Literacy For the Arctic and Below:
        Help your data help you
           (and satisfy NSF requirements in the process!)




        Lynn Yarmey and Liz Schlagel – National Snow and Ice Data Center
Where we are going today:

 • Why care about data management?
 • “What the heck is metadata?” and other jargon
          Data Types
          Data Stages
          Storage
          Versioning
          Naming Conventions
          Metadata and Standards
          Data Sharing and Access
          Archiving and Preservation


 • Pulling this all together – Data Management Plans
 • From the lab to ACADIS (and beyond)
  EVERYTHING we talk about will be able to go into your Data Management Plan (DMP)
Why Care – Big Picture




          Photo from: http://www.mediafuturist.com/2010/11/gues-ddb-blog-future-of-marketing-media-data-is-new-oil.html
Why Care – Big Picture




 These days, Dr. Hodes said, “the old model in
 which researchers jealously guarded their data is
 no longer applicable.”
 http://www.nytimes.com/2011/04/04/health/04alzheimer.html




                                                             Image courtesy of:http://www.sciencemag.org/content/331/6018.cover-expansion
Why Care – Your work




              http://www.phdcomics.com/comics/archive.php?comicid=382




         You are a Data Manager
Data Management is Important! Because……

  … Reproducibility is the foundation of science
  … Journals are starting to require data deposit
  … You want to get credit for producing data (data citations)
  … Others can use and build on your work (data reuse)
Data Management is Important! Because……

  … Reproducibility is the foundation of science
  … Journals are starting to require data deposit
  … You want to get credit for producing it (data citations)
  … Others can use and build on your work (data reuse)
  … Your new instruments collect a LOT more data than older ones
  … Recreating a figure from a 2006 paper shouldn’t be painful
  … Funders tell us so (See NSF, NIH, NOAA, etc)
  … Students graduate!
Where we are:

 • Why care about data management?
 • “What the heck is metadata?” and other jargon
       Data Types
       Data Stages
       Storage
       Versioning
       Naming Conventions
       Metadata and Standards
       Data Sharing and Access
       Archiving and Preservation


 • Pulling this all together – Data Management Plans
 • From the lab to ACADIS (and beyond)
Data Types



         What types of data do
        you collect or generate?
Where we are:

 • Why care about data management?
 • “What the heck is metadata?” and other jargon
       Data Types
       Data Stages
       Storage
       Versioning
       Naming Conventions
       Metadata and Standards
       Data Sharing and Access
       Archiving and Preservation


 • Pulling this all together – Data Management Plans
 • From the lab to ACADIS (and beyond)
Data Stages

  Raw

  Organized

  Standardized
      Transformed

  Processed
      Quality Controlled

  Analyzed

  Summarized

  Presented/Published                         Photo courtesy of Zillow Database gurus:
                           http://www.zillow.com/blog/2007-11-02/we-know-how-to-celebrate-halloween/
Where we are:

 • Why care about data management?
 • “What the heck is metadata?” and other jargon
       Data Types
       Data Stages
       Storage
       Versioning
       Naming Conventions
       Metadata and Standards
       Data Sharing and Access
       Archiving and Preservation


 • Pulling this all together – Data Management Plans
 • From the lab to ACADIS (and beyond)
Data Storage




  http://chronicle.texterity.com/chronicle/20110318a?pg=16#pg16
Data Storage




               Tips:
                 - 1 working copy on your computer
                 - 1 copy on infrastructure near you
                 - 1 copy on infrastructure far away
                 - ‘Final’ copy with a data center/archive
                 - Get help! (CSS, CSU Libraries, etc.)
               (Note: These won’t work well in all cases, ex. For Very Large
                 Data, but are a good start for coming up with a storage
                                          plan)
Where we are:

 • Why care about data management?
 • “What the heck is metadata?” and other jargon
       Data Types
       Data Stages
       Storage
       Versioning
       Naming Conventions
       Metadata and Standards
       Data Sharing and Access
       Archiving and Preservation


 • Pulling this all together – Data Management Plans
 • From the lab to ACADIS (and beyond)
Versioning




  Tips:
    - communicate with your lab/research group and agree on
      a versioning system (file names, what makes a new version)
    - WRITE IT DOWN and post/save to a shared space.
Where we are:

 • Why care about data management?
 • “What the heck is metadata?” and other jargon
       Data Types
       Data Stages
       Storage
       Versioning
       Naming Conventions
       Metadata and Standards
       Data Sharing and Access
       Archiving and Preservation


 • Pulling this all together – Data Management Plans
 • From the lab to ACADIS (and beyond)
File naming conventions – Discuss
File naming conventions – Better example

 Make names unique!
 Include (as appropriate):
   - Project name or acronym
   - Study title
   - Location
   - Data type
   - Researcher initials
   - Date
   - Data stage
   - Version number
   - File type

   DO – Use_underscores-or-dashes       DO NOT – Use spaces &/or special characters!

     For more info - https://www.dataone.org/content/assign-descriptive-file-names
Where we are:

 • Why care about data management?
 • “What the heck is metadata?” and other jargon
       Data Types
       Data Stages
       Storage
       Versioning
       Naming Conventions
       Metadata and Standards
       Data Sharing and Access
       Archiving and Preservation


 • Pulling this all together – Data Management Plans
 • From the lab to ACADIS (and beyond)
Metadata

           “Data about Data”




                 But what does that MEAN?!
Metadata – The bottom line

 What would someone* unfamiliar with your
 data (and possibly your research) need in order
 to find, evaluate, understand, and reuse them?

 *How about someone:
   - who works in your lab?
   - from a different lab in your field?
   - who is in a related interdisciplinary field?
   - who researches a completely different area?
   - who works for a newspaper? Congress?
Metadata – Example




          Temperature
             31.5
Metadata – Example

           Temperature
              31.5
             For what purpose?
                                     Instrument precision/accuracy?




                  When was the sensor
                last cleaned/calibrated?



                          AKA – T, Temp, degC, C, oF… lots of different names!
Metadata


  Just like file names, metadata
  does it’s job best when it is:
    - consistent
    - documented
    - for people
    - such that computers are happy


           Enter Metadata Standards
Metadata Standards – Examples


  Local (people -> people)
     Naming Conventions
     Standard Operating Procedures

  Beyond (people -> computers -> people)
      ISO 19115           (http://www.fgdc.gov/metadata/geospatial-metadata-standards#nap)



      GCMD DIF            (http://gcmd.nasa.gov/User/difguide/difman.html)



      EML (http://knb.ecoinformatics.org/software/eml/)
Metadata Standards – Example




Scripps Institution Of Oceanography Pier Water Temperature - Station Dataset (CCELTER). [Online]. Scripps Institution
of Oceanography Shore Station Program [Producer]. Oceaninformatics Datazoo [Distributor]. (February 28, 2011).
http://oceaninformatics.ucsd.edu/datazoo/data/ccelter/datasets?action=summary&id=15
Metadata Standards – Example (XML)
      <attributeName>Sea Surface Temperature</attributeName>
      <attributeDefinition>temperature measurement</attributeDefinition>
      <measurementScale>
          <unit>celsius</unit>
          <numericDomain><numberType>real</numberType></numericDomain>
      </measurementScale>
      <missingValueCode><code>-99</code>
             <codeExplanation>missing value</codeExplanation>
      </missingValueCode>
      <missingValueCode><code>-999</code>
           <codeExplanation>missing value</codeExplanation>
      </missingValueCode>
      <missingValueCode><code>-99999</code>
           <codeExplanation>missing value</codeExplanation>
      </missingValueCode>
      <methods><description> subject { seaSurface } </description>
           <description> calculationType { calculated }; calculationTypeDetail { average };
      calculationInterval { day }; </description></methods>
Scripps Institution Of Oceanography Pier Water Temperature - Station Dataset (CCELTER). [Online]. Scripps Institution
of Oceanography Shore Station Program [Producer]. Oceaninformatics Datazoo [Distributor]. (February 28, 2011).
http://oceaninformatics.ucsd.edu/datazoo/data/ccelter/datasets?action=summary&id=15
Metadata Standards – Example (XML)
      <attributeName>Sea Surface Temperature</attributeName>
      <attributeDefinition>temperature measurement</attributeDefinition>
      <measurementScale>
          <unit>celsius</unit>
          <numericDomain><numberType>real</numberType></numericDomain>
      </measurementScale>
      <missingValueCode><code>-99</code>
             <codeExplanation>missing value</codeExplanation>
      </missingValueCode>
      <missingValueCode><code>-999</code>
           <codeExplanation>missing value</codeExplanation>
      </missingValueCode>
      <missingValueCode><code>-99999</code>
           <codeExplanation>missing value</codeExplanation>
      </missingValueCode>
      <methods><description> subject { seaSurface } </description>
           <description> calculationType { calculated }; calculationTypeDetail { average };
      calculationInterval { day }; </description></methods>
Scripps Institution Of Oceanography Pier Water Temperature - Station Dataset (CCELTER). [Online]. Scripps Institution
of Oceanography Shore Station Program [Producer]. Oceaninformatics Datazoo [Distributor]. (February 28, 2011).
http://oceaninformatics.ucsd.edu/datazoo/data/ccelter/datasets?action=summary&id=15
Metadata – Standards

 They exist!

 If everyone used them, you could do very cool
     science!

 Compliance is often a lot of work

 There are lots

 HOWEVER, there are baby steps to get started
Metadata – Yikes and/or Yay!
  Tips for the short-term:
   - Get help!
          - support@aoncadis.org, librarians, standards groups,
          data centers, domain communities, tools
   - Get your own house in order
          - use common date formats, codes, smart file names
          - WRITE EVERYTHING DOWN! (keep good readme files)
    - Put in the time early on to implement a standard
          - most have minimum compliance levels with options
          to get more detailed
    - Stay flexible

  Tips for the long-term:
    - Get help!
    - Watch for Best Practices and standards in your field
Where we are:

 • Why care about data management?
 • “What the heck is metadata?” and other jargon
       Data Types
       Data Stages
       Storage
       Versioning
       Naming Conventions
       Metadata and Standards
       Data Sharing and Access
       Archiving and Preservation


 • Pulling this all together – Data Management Plans
 • From the lab to ACADIS (and beyond)
Sharing and Access

                      Levels:
                   Low    - Not sharing your data (note: appropriate in a few cases)
                          - Emailing your data to a researcher who asks for it
                          - Posting your data on your project or lab website
Funder Happiness




                          - Posting your data AND METADATA on your website
                         - Submitting your metadata to an online catalog (ex.
                      ACADIS)
                         - Submitting your data and metadata to an appropriate
                      repository and getting a permanent ID (DOI, EZID, etc)
                                - Data Repositories (ex. ACADIS, GenBank, Dryad)
                                - CSU Digital Repository

                   High
Where we are:

 • Why care about data management?
 • “What the heck is metadata?” and other jargon
       Data Types
       Data Stages
       Storage
       Versioning
       Naming Conventions
       Metadata and Standards
       Data Sharing and Access
       Archiving and Preservation


 • Pulling this all together – Data Management Plans
 • From the lab to ACADIS (and beyond)
Archiving

              Terminology Fuzziness in the data world:

             Archival = Preservation (close enough)
                      Archival ≠ Storage!
  Tips for the short-term:

    - Leave yourself time at the end of a project to clean up
    - Choose open source formats when you can (ex. CSV > XLS)

  Tips for the long-term:
    - Work with NREL data experts: IBIS team, LTER, Computer
      Systems Support
Where we are:

 • Why care about data management?
 • “What the heck is metadata?” and other jargon
       Data Types
       Data Stages
       Storage
       Versioning
       Naming Conventions
       Metadata and Standards
       Data Sharing and Access
       Archiving and Preservation


 • Pulling this all together – Data Management Plans
 • From the lab to ACADIS (and beyond)
Data Management Plans (DMPs)

        NSF Data Management Plan - General Requirement (as of 2011-10-10)
1. the types of data, samples, physical collections, software, curriculum materials, and
other materials to be produced in the course of the project;

2. the standards to be used for data and metadata format and content (where existing
standards are absent or deemed inadequate, this should be documented along with any
proposed solutions or remedies);

3. policies for access and sharing including provisions for appropriate protection of
privacy, confidentiality, security, intellectual property, or other rights or requirements;

4. policies and provisions for re-use, re-distribution, and the production of derivatives;
and

5. plans for archiving data, samples, and other research products, and for preservation of
access to them.

              From http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/gpg_2.jsp#dmp
Data Management Plans (DMPs)

        NSF Data Management Plan - General Requirement (as of 2011-10-10)
1. the types of data, samples, physical collections, software, curriculum materials, and
other materials to be produced in the course of the project;

2. the standards to be used for data and metadata format and content (where existing
standards are absent or deemed inadequate, this should be documented along with any
proposed solutions or remedies);

3. policies for access and sharing including provisions for appropriate protection of
privacy, confidentiality, security, intellectual property, or other rights or requirements;

4. policies and provisions for re-use, re-distribution, and the production of derivatives;
and

5. plans for archiving data, samples, and other research products, and for preservation of
access to them.

              From http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/gpg_2.jsp#dmp
Data Management Plans (DMPs)


  Tips for the short-term:

    - Check your Directorate/Agency policy before every
      proposal
    - Keep it real(istic), you will need to include your actions in
      your project report and next proposal.


  Tips for the long-term:
    - Keep working on implementing metadata standards
    - Watch out for emerging trends, repositories, tools
    - Partner with data people (data centers, libraries, etc)
Where we are:

 • Why care about data management?
 • “What the heck is metadata?” and other jargon
       Data Types
       Data Stages
       Storage
       Versioning
       Naming Conventions
       Metadata and Standards
       Data Sharing and Access
       Archiving and Preservation


 • Pulling this all together – Data Management Plans
 • From the lab to ACADIS (and beyond)
CADIS - Data Support for NSF-Arctic Program

  • Cooperative Arctic Data and Information System

  • The Mandate:
    – Develop advanced data management system for the
      Arctic Observing Network (AON)
    – Preserve metadata and data
    – Serve NSF-funded AON investigators
Transition to Advanced CADIS
                             NSF Arctic
•   A new mandate            Field Sites
    – For all NSF programs that collect Arctic data
    – Serve NSF-funded Arctic investigators by archiving
      data from many field programs
• Other changes:
    –   An advisory group
    –   Value-added products
    –   Two full time Data Curators
    –   New data types – biological, social, terrestrial,
        ecological
Transition to Advanced CADIS (ACADIS)

  • A new mandate
    – For all NSF/ARC programs that collect Arctic data
    – Serve NSF-funded Arctic investigators by archiving
      data from field programs and individual investigators
  • Other changes:
    – An advisory group
    – Value-added products
    – Two full time Data Curators
    – New data types – biological, social, terrestrial,
      ecological
    – Expanded metadata tool for diverse disciplines
ACADIS – Metadata and Standards




 Metadata Profile

 Supports established
 standards

 Based on IPY-DIS profile.
 Compatible with GCMD,
 FGDC, ISO…

 Profile driven interface
 validates fields

 NASA GCMD vocabulary
 used where possible
ACADIS Data Management Plan Template
                  Example guidance from the ACADIS DMP template:

• Assists
  investigators
  in developing
  the DMP now
  required for
  all NSF
  proposals

• Linked from
  aoncadis.org
Beyond ACADIS - Other Resources

    • IBIS         Local: NREL

    • CSU Digital Repository              Local: CSU


    •   Knowledge Network for Biocomplexity
    •   ESA Ecological Archives      Remote:
    •   DAAC at ORNL                 Centralized and
                                     Domain Specific
    •   Advanced Cooperative
    •   Arctic Data and Information Service

    • Data Conservancy
    • DataONE                          Federated and distributed
Beyond ACADIS – Other Resources
 General Info and help -
    Earth Science Information Partners (ESIP): http://wiki.esipfed.org/
    UVA Libraries: http://www2.lib.virginia.edu/brown/data/

 Data Management Plan and other tools –
     DMP Tool: https://dmp.cdlib.org/
     DataOne: https://www.dataone.org/cattools/Data%20and%20Metadata%20Management

 Metadata -
    Excel Plug-in tool (in development):
          http://www.cdlib.org/cdlinfo/2011/09/01/facilitating-data-management-dcxl/
     Lists of Standards (not complete!)
          for bio, climate, ecology, oceanography - http://marinemetadata.org/conventions
          Stanford-based portal for medical/bio - http://bioportal.bioontology.org/resources
Questions?


                                                             Contact me:    Lynn.yarmey@nsidc.org

                                                             For questions, help, or to submit Arctic data:
                                                                            support@aoncadis.org

                                                             Visit ACADIS: www.aoncadis.org



Special thanks for pilfered slides and content approaches: Florence Fetterer, Carly Strasser, and Dorothea Salo

More Related Content

What's hot

Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...The University of Edinburgh
 
Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing Mojtaba Lotfaliany
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management PlanKristin Briney
 
Going Full Circle: Research Data Management @ University of Pretoria
Going Full Circle: Research Data Management @ University of PretoriaGoing Full Circle: Research Data Management @ University of Pretoria
Going Full Circle: Research Data Management @ University of PretoriaJohann van Wyk
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersJez Cope
 
Research Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsResearch Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsAaron Collie
 
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...Natsuko Nicholls
 
NIH BD2K DataMed data index - DATS model
NIH BD2K DataMed data index - DATS modelNIH BD2K DataMed data index - DATS model
NIH BD2K DataMed data index - DATS modelSusanna-Assunta Sansone
 
Research Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities ClassResearch Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities ClassAaron Collie
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycleMarieke Guy
 
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...Amanda Whitmire
 
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar SeriesKatina Toufexis
 
Dats nih-dccpc-kc7-april2018-prs-uoxf
Dats  nih-dccpc-kc7-april2018-prs-uoxfDats  nih-dccpc-kc7-april2018-prs-uoxf
Dats nih-dccpc-kc7-april2018-prs-uoxfPhilippe Rocca-Serra
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto UniversityStephanie Simms
 
Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Robert Oostenveld
 

What's hot (20)

Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
Perspectives on the Role of Trustworthy Repository Standards in Data Journal ...
 
Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing Best Practice in Data Management and Sharing
Best Practice in Data Management and Sharing
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management Plan
 
Going Full Circle: Research Data Management @ University of Pretoria
Going Full Circle: Research Data Management @ University of PretoriaGoing Full Circle: Research Data Management @ University of Pretoria
Going Full Circle: Research Data Management @ University of Pretoria
 
METRO RDM Webinar
METRO RDM WebinarMETRO RDM Webinar
METRO RDM Webinar
 
University of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchersUniversity of Bath Research Data Management training for researchers
University of Bath Research Data Management training for researchers
 
Research Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsResearch Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering Students
 
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
 
NIH BD2K DataMed data index - DATS model
NIH BD2K DataMed data index - DATS modelNIH BD2K DataMed data index - DATS model
NIH BD2K DataMed data index - DATS model
 
Research Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities ClassResearch Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities Class
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
 
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
IDCC Workshop: Analysing DMPs to inform research data services: lessons from ...
 
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
 
Dats nih-dccpc-kc7-april2018-prs-uoxf
Dats  nih-dccpc-kc7-april2018-prs-uoxfDats  nih-dccpc-kc7-april2018-prs-uoxf
Dats nih-dccpc-kc7-april2018-prs-uoxf
 
The Donders Repository
The Donders RepositoryThe Donders Repository
The Donders Repository
 
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto University
 
Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data Using Open Science to advance science - advancing open data
Using Open Science to advance science - advancing open data
 
Introduction to Data Management and Sharing
Introduction to Data Management and SharingIntroduction to Data Management and Sharing
Introduction to Data Management and Sharing
 

Viewers also liked

Stanik swarajyni sanstha
Stanik swarajyni sansthaStanik swarajyni sanstha
Stanik swarajyni sansthaanishakansagra
 
Ucar data citationworkshop_yarmey_20120405
Ucar data citationworkshop_yarmey_20120405Ucar data citationworkshop_yarmey_20120405
Ucar data citationworkshop_yarmey_20120405lyarmey
 
Perfect square
Perfect squarePerfect square
Perfect squaredian7045
 
Term Paper - Managing Anger
Term Paper - Managing AngerTerm Paper - Managing Anger
Term Paper - Managing Angeramaalee
 

Viewers also liked (9)

Our national symbols
Our national symbolsOur national symbols
Our national symbols
 
Shravan
ShravanShravan
Shravan
 
Stanik swarajyni sanstha
Stanik swarajyni sansthaStanik swarajyni sanstha
Stanik swarajyni sanstha
 
Ucar data citationworkshop_yarmey_20120405
Ucar data citationworkshop_yarmey_20120405Ucar data citationworkshop_yarmey_20120405
Ucar data citationworkshop_yarmey_20120405
 
Shravan
ShravanShravan
Shravan
 
Perfect square
Perfect squarePerfect square
Perfect square
 
Mental health sagun
Mental health sagunMental health sagun
Mental health sagun
 
Term Paper - Managing Anger
Term Paper - Managing AngerTerm Paper - Managing Anger
Term Paper - Managing Anger
 
Sociology presentation
Sociology presentationSociology presentation
Sociology presentation
 

Similar to CSU-ACADIS_dataManagement101-20120217

The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDMMarieke Guy
 
Data Management for Graduate Students
Data Management for Graduate StudentsData Management for Graduate Students
Data Management for Graduate StudentsRebekah Cummings
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA'saaroncollie
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data LocallyErin D. Foster
 
Managing your data paget
Managing your data pagetManaging your data paget
Managing your data pagetTERN Australia
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersRebekah Cummings
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Jeroen Rombouts
 
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal DatabaseDebunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal DatabaseStavros Papadopoulos
 
Responsible conduct of research: Data Management
Responsible conduct of research: Data ManagementResponsible conduct of research: Data Management
Responsible conduct of research: Data ManagementC. Tobin Magle
 
No Free Lunch: Metadata in the life sciences
No Free Lunch:  Metadata in the life sciencesNo Free Lunch:  Metadata in the life sciences
No Free Lunch: Metadata in the life sciencesChris Dwan
 
eScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodeScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodDuncan Hull
 
Documentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM BootcampDocumentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM BootcampSherry Lake
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Sarah Anna Stewart
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Managementdancrane_open
 
Planning for Research Data Managment
Planning for Research Data ManagmentPlanning for Research Data Managment
Planning for Research Data ManagmentDaniel Crane
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Managementdancrane_open
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolkfear
 

Similar to CSU-ACADIS_dataManagement101-20120217 (20)

The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDM
 
Good Practice in Research Data Management
Good Practice in Research Data ManagementGood Practice in Research Data Management
Good Practice in Research Data Management
 
Data Management for Graduate Students
Data Management for Graduate StudentsData Management for Graduate Students
Data Management for Graduate Students
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA's
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data Locally
 
Managing your data paget
Managing your data pagetManaging your data paget
Managing your data paget
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10
 
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal DatabaseDebunking "Purpose-Built Data Systems:": Enter the Universal Database
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
 
Responsible conduct of research: Data Management
Responsible conduct of research: Data ManagementResponsible conduct of research: Data Management
Responsible conduct of research: Data Management
 
No Free Lunch: Metadata in the life sciences
No Free Lunch:  Metadata in the life sciencesNo Free Lunch:  Metadata in the life sciences
No Free Lunch: Metadata in the life sciences
 
eScience: A Transformed Scientific Method
eScience: A Transformed Scientific MethodeScience: A Transformed Scientific Method
eScience: A Transformed Scientific Method
 
Documentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM BootcampDocumentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM Bootcamp
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 
Planning for Research Data Managment
Planning for Research Data ManagmentPlanning for Research Data Managment
Planning for Research Data Managment
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPTool
 

Recently uploaded

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 

Recently uploaded (20)

Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 

CSU-ACADIS_dataManagement101-20120217

  • 1. Data Literacy For the Arctic and Below: Help your data help you (and satisfy NSF requirements in the process!) Lynn Yarmey and Liz Schlagel – National Snow and Ice Data Center
  • 2. Where we are going today: • Why care about data management? • “What the heck is metadata?” and other jargon Data Types Data Stages Storage Versioning Naming Conventions Metadata and Standards Data Sharing and Access Archiving and Preservation • Pulling this all together – Data Management Plans • From the lab to ACADIS (and beyond) EVERYTHING we talk about will be able to go into your Data Management Plan (DMP)
  • 3. Why Care – Big Picture Photo from: http://www.mediafuturist.com/2010/11/gues-ddb-blog-future-of-marketing-media-data-is-new-oil.html
  • 4. Why Care – Big Picture These days, Dr. Hodes said, “the old model in which researchers jealously guarded their data is no longer applicable.” http://www.nytimes.com/2011/04/04/health/04alzheimer.html Image courtesy of:http://www.sciencemag.org/content/331/6018.cover-expansion
  • 5. Why Care – Your work http://www.phdcomics.com/comics/archive.php?comicid=382 You are a Data Manager
  • 6. Data Management is Important! Because…… … Reproducibility is the foundation of science … Journals are starting to require data deposit … You want to get credit for producing data (data citations) … Others can use and build on your work (data reuse)
  • 7. Data Management is Important! Because…… … Reproducibility is the foundation of science … Journals are starting to require data deposit … You want to get credit for producing it (data citations) … Others can use and build on your work (data reuse) … Your new instruments collect a LOT more data than older ones … Recreating a figure from a 2006 paper shouldn’t be painful … Funders tell us so (See NSF, NIH, NOAA, etc) … Students graduate!
  • 8. Where we are: • Why care about data management? • “What the heck is metadata?” and other jargon Data Types Data Stages Storage Versioning Naming Conventions Metadata and Standards Data Sharing and Access Archiving and Preservation • Pulling this all together – Data Management Plans • From the lab to ACADIS (and beyond)
  • 9. Data Types What types of data do you collect or generate?
  • 10. Where we are: • Why care about data management? • “What the heck is metadata?” and other jargon Data Types Data Stages Storage Versioning Naming Conventions Metadata and Standards Data Sharing and Access Archiving and Preservation • Pulling this all together – Data Management Plans • From the lab to ACADIS (and beyond)
  • 11. Data Stages Raw Organized Standardized Transformed Processed Quality Controlled Analyzed Summarized Presented/Published Photo courtesy of Zillow Database gurus: http://www.zillow.com/blog/2007-11-02/we-know-how-to-celebrate-halloween/
  • 12. Where we are: • Why care about data management? • “What the heck is metadata?” and other jargon Data Types Data Stages Storage Versioning Naming Conventions Metadata and Standards Data Sharing and Access Archiving and Preservation • Pulling this all together – Data Management Plans • From the lab to ACADIS (and beyond)
  • 13. Data Storage http://chronicle.texterity.com/chronicle/20110318a?pg=16#pg16
  • 14. Data Storage Tips: - 1 working copy on your computer - 1 copy on infrastructure near you - 1 copy on infrastructure far away - ‘Final’ copy with a data center/archive - Get help! (CSS, CSU Libraries, etc.) (Note: These won’t work well in all cases, ex. For Very Large Data, but are a good start for coming up with a storage plan)
  • 15. Where we are: • Why care about data management? • “What the heck is metadata?” and other jargon Data Types Data Stages Storage Versioning Naming Conventions Metadata and Standards Data Sharing and Access Archiving and Preservation • Pulling this all together – Data Management Plans • From the lab to ACADIS (and beyond)
  • 16. Versioning Tips: - communicate with your lab/research group and agree on a versioning system (file names, what makes a new version) - WRITE IT DOWN and post/save to a shared space.
  • 17. Where we are: • Why care about data management? • “What the heck is metadata?” and other jargon Data Types Data Stages Storage Versioning Naming Conventions Metadata and Standards Data Sharing and Access Archiving and Preservation • Pulling this all together – Data Management Plans • From the lab to ACADIS (and beyond)
  • 18. File naming conventions – Discuss
  • 19. File naming conventions – Better example Make names unique! Include (as appropriate): - Project name or acronym - Study title - Location - Data type - Researcher initials - Date - Data stage - Version number - File type DO – Use_underscores-or-dashes DO NOT – Use spaces &/or special characters! For more info - https://www.dataone.org/content/assign-descriptive-file-names
  • 20. Where we are: • Why care about data management? • “What the heck is metadata?” and other jargon Data Types Data Stages Storage Versioning Naming Conventions Metadata and Standards Data Sharing and Access Archiving and Preservation • Pulling this all together – Data Management Plans • From the lab to ACADIS (and beyond)
  • 21. Metadata “Data about Data” But what does that MEAN?!
  • 22. Metadata – The bottom line What would someone* unfamiliar with your data (and possibly your research) need in order to find, evaluate, understand, and reuse them? *How about someone: - who works in your lab? - from a different lab in your field? - who is in a related interdisciplinary field? - who researches a completely different area? - who works for a newspaper? Congress?
  • 23. Metadata – Example Temperature 31.5
  • 24. Metadata – Example Temperature 31.5 For what purpose? Instrument precision/accuracy? When was the sensor last cleaned/calibrated? AKA – T, Temp, degC, C, oF… lots of different names!
  • 25. Metadata Just like file names, metadata does it’s job best when it is: - consistent - documented - for people - such that computers are happy Enter Metadata Standards
  • 26. Metadata Standards – Examples Local (people -> people) Naming Conventions Standard Operating Procedures Beyond (people -> computers -> people) ISO 19115 (http://www.fgdc.gov/metadata/geospatial-metadata-standards#nap) GCMD DIF (http://gcmd.nasa.gov/User/difguide/difman.html) EML (http://knb.ecoinformatics.org/software/eml/)
  • 27. Metadata Standards – Example Scripps Institution Of Oceanography Pier Water Temperature - Station Dataset (CCELTER). [Online]. Scripps Institution of Oceanography Shore Station Program [Producer]. Oceaninformatics Datazoo [Distributor]. (February 28, 2011). http://oceaninformatics.ucsd.edu/datazoo/data/ccelter/datasets?action=summary&id=15
  • 28. Metadata Standards – Example (XML) <attributeName>Sea Surface Temperature</attributeName> <attributeDefinition>temperature measurement</attributeDefinition> <measurementScale> <unit>celsius</unit> <numericDomain><numberType>real</numberType></numericDomain> </measurementScale> <missingValueCode><code>-99</code> <codeExplanation>missing value</codeExplanation> </missingValueCode> <missingValueCode><code>-999</code> <codeExplanation>missing value</codeExplanation> </missingValueCode> <missingValueCode><code>-99999</code> <codeExplanation>missing value</codeExplanation> </missingValueCode> <methods><description> subject { seaSurface } </description> <description> calculationType { calculated }; calculationTypeDetail { average }; calculationInterval { day }; </description></methods> Scripps Institution Of Oceanography Pier Water Temperature - Station Dataset (CCELTER). [Online]. Scripps Institution of Oceanography Shore Station Program [Producer]. Oceaninformatics Datazoo [Distributor]. (February 28, 2011). http://oceaninformatics.ucsd.edu/datazoo/data/ccelter/datasets?action=summary&id=15
  • 29. Metadata Standards – Example (XML) <attributeName>Sea Surface Temperature</attributeName> <attributeDefinition>temperature measurement</attributeDefinition> <measurementScale> <unit>celsius</unit> <numericDomain><numberType>real</numberType></numericDomain> </measurementScale> <missingValueCode><code>-99</code> <codeExplanation>missing value</codeExplanation> </missingValueCode> <missingValueCode><code>-999</code> <codeExplanation>missing value</codeExplanation> </missingValueCode> <missingValueCode><code>-99999</code> <codeExplanation>missing value</codeExplanation> </missingValueCode> <methods><description> subject { seaSurface } </description> <description> calculationType { calculated }; calculationTypeDetail { average }; calculationInterval { day }; </description></methods> Scripps Institution Of Oceanography Pier Water Temperature - Station Dataset (CCELTER). [Online]. Scripps Institution of Oceanography Shore Station Program [Producer]. Oceaninformatics Datazoo [Distributor]. (February 28, 2011). http://oceaninformatics.ucsd.edu/datazoo/data/ccelter/datasets?action=summary&id=15
  • 30. Metadata – Standards They exist! If everyone used them, you could do very cool science! Compliance is often a lot of work There are lots HOWEVER, there are baby steps to get started
  • 31. Metadata – Yikes and/or Yay! Tips for the short-term: - Get help! - support@aoncadis.org, librarians, standards groups, data centers, domain communities, tools - Get your own house in order - use common date formats, codes, smart file names - WRITE EVERYTHING DOWN! (keep good readme files) - Put in the time early on to implement a standard - most have minimum compliance levels with options to get more detailed - Stay flexible Tips for the long-term: - Get help! - Watch for Best Practices and standards in your field
  • 32. Where we are: • Why care about data management? • “What the heck is metadata?” and other jargon Data Types Data Stages Storage Versioning Naming Conventions Metadata and Standards Data Sharing and Access Archiving and Preservation • Pulling this all together – Data Management Plans • From the lab to ACADIS (and beyond)
  • 33. Sharing and Access Levels: Low - Not sharing your data (note: appropriate in a few cases) - Emailing your data to a researcher who asks for it - Posting your data on your project or lab website Funder Happiness - Posting your data AND METADATA on your website - Submitting your metadata to an online catalog (ex. ACADIS) - Submitting your data and metadata to an appropriate repository and getting a permanent ID (DOI, EZID, etc) - Data Repositories (ex. ACADIS, GenBank, Dryad) - CSU Digital Repository High
  • 34. Where we are: • Why care about data management? • “What the heck is metadata?” and other jargon Data Types Data Stages Storage Versioning Naming Conventions Metadata and Standards Data Sharing and Access Archiving and Preservation • Pulling this all together – Data Management Plans • From the lab to ACADIS (and beyond)
  • 35. Archiving Terminology Fuzziness in the data world: Archival = Preservation (close enough) Archival ≠ Storage! Tips for the short-term: - Leave yourself time at the end of a project to clean up - Choose open source formats when you can (ex. CSV > XLS) Tips for the long-term: - Work with NREL data experts: IBIS team, LTER, Computer Systems Support
  • 36. Where we are: • Why care about data management? • “What the heck is metadata?” and other jargon Data Types Data Stages Storage Versioning Naming Conventions Metadata and Standards Data Sharing and Access Archiving and Preservation • Pulling this all together – Data Management Plans • From the lab to ACADIS (and beyond)
  • 37. Data Management Plans (DMPs) NSF Data Management Plan - General Requirement (as of 2011-10-10) 1. the types of data, samples, physical collections, software, curriculum materials, and other materials to be produced in the course of the project; 2. the standards to be used for data and metadata format and content (where existing standards are absent or deemed inadequate, this should be documented along with any proposed solutions or remedies); 3. policies for access and sharing including provisions for appropriate protection of privacy, confidentiality, security, intellectual property, or other rights or requirements; 4. policies and provisions for re-use, re-distribution, and the production of derivatives; and 5. plans for archiving data, samples, and other research products, and for preservation of access to them. From http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/gpg_2.jsp#dmp
  • 38. Data Management Plans (DMPs) NSF Data Management Plan - General Requirement (as of 2011-10-10) 1. the types of data, samples, physical collections, software, curriculum materials, and other materials to be produced in the course of the project; 2. the standards to be used for data and metadata format and content (where existing standards are absent or deemed inadequate, this should be documented along with any proposed solutions or remedies); 3. policies for access and sharing including provisions for appropriate protection of privacy, confidentiality, security, intellectual property, or other rights or requirements; 4. policies and provisions for re-use, re-distribution, and the production of derivatives; and 5. plans for archiving data, samples, and other research products, and for preservation of access to them. From http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/gpg_2.jsp#dmp
  • 39. Data Management Plans (DMPs) Tips for the short-term: - Check your Directorate/Agency policy before every proposal - Keep it real(istic), you will need to include your actions in your project report and next proposal. Tips for the long-term: - Keep working on implementing metadata standards - Watch out for emerging trends, repositories, tools - Partner with data people (data centers, libraries, etc)
  • 40. Where we are: • Why care about data management? • “What the heck is metadata?” and other jargon Data Types Data Stages Storage Versioning Naming Conventions Metadata and Standards Data Sharing and Access Archiving and Preservation • Pulling this all together – Data Management Plans • From the lab to ACADIS (and beyond)
  • 41. CADIS - Data Support for NSF-Arctic Program • Cooperative Arctic Data and Information System • The Mandate: – Develop advanced data management system for the Arctic Observing Network (AON) – Preserve metadata and data – Serve NSF-funded AON investigators
  • 42. Transition to Advanced CADIS NSF Arctic • A new mandate Field Sites – For all NSF programs that collect Arctic data – Serve NSF-funded Arctic investigators by archiving data from many field programs • Other changes: – An advisory group – Value-added products – Two full time Data Curators – New data types – biological, social, terrestrial, ecological
  • 43. Transition to Advanced CADIS (ACADIS) • A new mandate – For all NSF/ARC programs that collect Arctic data – Serve NSF-funded Arctic investigators by archiving data from field programs and individual investigators • Other changes: – An advisory group – Value-added products – Two full time Data Curators – New data types – biological, social, terrestrial, ecological – Expanded metadata tool for diverse disciplines
  • 44. ACADIS – Metadata and Standards Metadata Profile Supports established standards Based on IPY-DIS profile. Compatible with GCMD, FGDC, ISO… Profile driven interface validates fields NASA GCMD vocabulary used where possible
  • 45. ACADIS Data Management Plan Template Example guidance from the ACADIS DMP template: • Assists investigators in developing the DMP now required for all NSF proposals • Linked from aoncadis.org
  • 46. Beyond ACADIS - Other Resources • IBIS Local: NREL • CSU Digital Repository Local: CSU • Knowledge Network for Biocomplexity • ESA Ecological Archives Remote: • DAAC at ORNL Centralized and Domain Specific • Advanced Cooperative • Arctic Data and Information Service • Data Conservancy • DataONE Federated and distributed
  • 47. Beyond ACADIS – Other Resources General Info and help - Earth Science Information Partners (ESIP): http://wiki.esipfed.org/ UVA Libraries: http://www2.lib.virginia.edu/brown/data/ Data Management Plan and other tools – DMP Tool: https://dmp.cdlib.org/ DataOne: https://www.dataone.org/cattools/Data%20and%20Metadata%20Management Metadata - Excel Plug-in tool (in development): http://www.cdlib.org/cdlinfo/2011/09/01/facilitating-data-management-dcxl/ Lists of Standards (not complete!) for bio, climate, ecology, oceanography - http://marinemetadata.org/conventions Stanford-based portal for medical/bio - http://bioportal.bioontology.org/resources
  • 48. Questions? Contact me: Lynn.yarmey@nsidc.org For questions, help, or to submit Arctic data: support@aoncadis.org Visit ACADIS: www.aoncadis.org Special thanks for pilfered slides and content approaches: Florence Fetterer, Carly Strasser, and Dorothea Salo

Editor's Notes

  1. How about physical samples?Read your DMP guidelines carefully!
  2. NSF OPP meeting – open to including programmer time in budget requests to help with this kind of work
  3. CADIS was funded initially in 2007, and this is the 3rd AMS IIPS presentation I’ve given on it. For the Arctic Observing Network Mostly field observationsServe NSF-funded AON investigators by archiving AON dataNot so much the wider communityAssumptions starting outThe AON data portal would support full integration of a diverse collection - scientists could archive their data AND find all data relevant to a location or processInformatics and cyberinfrastructure would play a large role Implications were that …all data have browse imagery and complete documentation; …time series or fields can be plotted online;…and all metadata are in a relational database
  4. For all NSF programs that collect Arctic dataOffice of Polar Programs (OPP) Division of Arctic Sciences (ARC)AON, Arctic System Sciences (ARCSS), Arctic Natural Sciences (ANS) and the Arctic Social Sciences Program (ASSP) within OPP/ARCServe NSF-funded Arctic investigators by archiving data from many field programsStill little or no remote sensing dataEmphasis is still on serving those contributing data first, but will begin to shift to making the ACADIS portal more useful for those who need to use the data held or cataloged by ACADIS