SlideShare una empresa de Scribd logo
1 de 19
Descargar para leer sin conexión
Rescue of Long-Tail Data 

from the Ocean Bottom to the Moon!
!

Leslie Hsu, Kerstin Lehnert, Suzanne Carbotte, Vicki Ferrini,!
1
2
3!
! John Delano , James B. Gill , Maurice Tivey
!
Lamont-Doherty Earth Observatory, Columbia University,!
! 1University of Albany, 2University of California, Santa Cruz, 3Woods Hole Oceanographic Institution!
!

!

IN12A. Data Curation, Credibility, Preservation Implementation, and Data Rescue to Enable Multi-source Science!
Fall AGU 2013!

IEDA

iedadata.org
Data at Risk!
¤  "Data at Risk" is scientific data that are !
¤  not in formats that permit full electronic access to the information they
contain. !

¤  Data at Risk may be !
¤  non-digital (e.g., handwritten or photographic), !
¤  on near-obsolete digital media (such as floppy disks), !
¤  or insufficiently described (lacking metadata). !

¤  Some born-digital data are considered "at risk" if they cannot be
ingested into managed databases because they lack adequate
formatting or metadata.!
!
Definition from the ICSU CODATA Data at Risk Task Group (DARTG)!

IEDA

iedadata.org
Data Rescue!
¤  A “Data Rescue Mission” is any effort to preserve data at risk. Rescue
missions can come in the form of digitization, format migration, treating
damaged materials (e.g., water or mold), adding metadata or any action
taken to make data accessible in the long term.!

M. Tivey
Definition from ICSU CODATA Data at Risk Task Group (DARTG)

IEDA

iedadata.org
Long Tail Data are often Data at Risk!
The Head:

Long Tail Characteristics!

Astronomy,
Climate,
High Energy
Physics,
Genomics

q 
q 
q 
q 
q 
q 

Long Tail:

Environmental and
Earth sciences

http://juliegood.wordpress.com/tag/long-tail/

L. Wyborn

More specialised!
Low volume!
On C drives!
Hard to find!
Heterogeneous!
Collected by many
people!
q  Citizen science!
q  Etc!
q  Etc!

IEDA

iedadata.org
IEDA Data Rescue Mini-Awards!
¤ Established to preserve valuable legacy data sets that
are in danger by impending retirement or degradation!
¤  Evaluated by highest impact on future research by quality, size,
rarity, unique location or data type!
¤  Made accessible to the community for re-use by inclusion in the IEDA
data collections (EarthChem, MGDS, SESAR)!
¤  $7000 award to support proper compilation, documentation, transfer!

¤  3 awardees chosen from 11 entries over a wide range of geochemical
and geophysical data!
!

IEDA

iedadata.org
1: Geologic samples and geochemistry!
¤  WHAT: Compilation of sample
metadata and geochemical
analyses from three areas – Fiji,
Izu Arc, and Endeavour segment.
(James B. Gill)!

Maps made with GeoMapApp

¤  WHY: study of intra-ocean arcs
and spreading centers!
¤  HOW: Check and add incomplete
data, digitize data, add persistent
identifiers. Link between related
resources!
¤  Major challenge: Physical sample
management!

IEDA

iedadata.org
The importance of Sample identification!
¤  Individual samples can play a large role in scientific conclusions, so
accurate documentation of sample metadata is critical.!
¤  The key measurement was the one backarc basalt called "PPTUW”...
Subsequent efforts to confirm the observation ran into problems. The
apparently-same sample was variously called PPTU, PPTUW/5,
PPTUW-1, and TVZ19 in four other papers. None of those papers gave
its latitude and longitude… (J. Gill and E. Todd)!

IEDA

iedadata.org
2: Near-bottom magnetics!
¤  WHAT: Compilation of near-bottom
magnetometer data, including raw,
merged, processed, and navigation
metadata (Maurice Tivey)!
¤  WHY: study of magnetic reversals,
effect of tectonics on magnetic field!
¤  HOW: gather data from different
formats, add complete metadata
and workflow!
¤  Challenge: over three decades of
technology and file formats!

IEDA

iedadata.org
Evolution of equipment: 1985, 1992, 2004, 2011 !

IEDA

iedadata.org
Evolution of storage media!

M. Tivey

IEDA

iedadata.org
Addition of “sufficient” metadata!

IEDA

iedadata.org
3: Lunar sample geochemistry!
¤  WHAT: Compilation of lunar
sample geochemistry (John W.
Delano et al.)!
¤  WHY: composition of the Moon!
¤  HOW: Digitize photos, label
specific grains, compile
geochemistry in data templates!
¤  Challenge: nothing was digital!

!

LPI

IEDA

iedadata.org
Use of IEDA EarthChem templates!

IEDA

iedadata.org
Common needs addressed!
¤ Accessibility – web access, links between systems!
¤ Documentation – README files, additional descriptions!
¤ Standardization – IEDA EarthChem geochemical templates !
¤ Persistent links – DOIs and IGSNs!
¤ Citability – DOIs, example citations!
¤ Guidance/Training – calls and emails with disciplinary repository
staff!

IEDA

iedadata.org
IEDA

iedadata.org
Lessons learned: investigator!
¤ Take ownership of your own legacy!
¤  Data curation by others may not be complete or correct!

¤ Data rescue of an entire career does not need to be
overwhelming !
¤  Start with small steps!
¤  Disciplinary repositories will help and guide you to what is needed!

¤ Despite the time investment, data rescue is worth it!
¤  Others will now be able to re-use the data!
¤  Notes taken years ago actually explain anomalies!
!

IEDA

iedadata.org
Lessons learned: repository!
¤ For Long Tail Data, every project is different !
¤  There is not an established workflow – just past experience!
¤  Time commitment from staff is nontrivial!

¤ Disciplinary training helps a great deal!
¤  Investigators need help determining the best products!

¤ A small incentive will motivate investigators!
¤ Data Rescue missions help the repository determine
next steps for development of tools and services!

IEDA

iedadata.org
Summary of Long-tail Data Rescue!
¤ Three Data Rescue efforts this past year by IEDA have
made data that were at risk!
¤  digitized from analog data and near-obsolete media!
¤  sufficiently described for reuse!
¤  in formats that permit full electronic access!
¤  Citable, with persistent identifiers, and ready for reuse!

¤ The projects also helped IEDA identify improvements in
data rescue workflow, and future tools and services!

IEDA

iedadata.org
More Data Rescue Activities!

¤ Elsevier-IEDA Data Rescue Process Study!
¤  A data entry tool for lunar geochemistry: MoonDB!

¤ Elsevier-IEDA International Data Rescue Award!
¤  Winner announced at reception tonight, Monday Dec 9th, 2013!
¤  Intercontinental Hotel, Twin Peaks Room, 7:00-8:30pm!

IEDA

iedadata.org

Más contenido relacionado

Destacado

Diş ti̇carette kullanilan tesli̇m şeki̇lleri̇
Diş ti̇carette kullanilan tesli̇m şeki̇lleri̇Diş ti̇carette kullanilan tesli̇m şeki̇lleri̇
Diş ti̇carette kullanilan tesli̇m şeki̇lleri̇
Burcu BuRcu
 
What do you_know_about_the_usa
What do you_know_about_the_usaWhat do you_know_about_the_usa
What do you_know_about_the_usa
Antshil
 
Clusters - Quayside Clothing Case Study
Clusters - Quayside Clothing Case StudyClusters - Quayside Clothing Case Study
Clusters - Quayside Clothing Case Study
Clusters Ltd
 
Мой класс
Мой класс Мой класс
Мой класс
Antshil
 
A profile of a famous person
A profile of a famous personA profile of a famous person
A profile of a famous person
Egiptodiaz12
 
коледа
коледаколеда
коледа
Sinapova
 
Laporan teknologi pupukdan pemupukan
Laporan teknologi pupukdan pemupukanLaporan teknologi pupukdan pemupukan
Laporan teknologi pupukdan pemupukan
fahmiganteng
 
Laporan praktikum fistanklorofil
Laporan praktikum fistanklorofilLaporan praktikum fistanklorofil
Laporan praktikum fistanklorofil
fahmiganteng
 
Presentation bus - international business
Presentation bus - international businessPresentation bus - international business
Presentation bus - international business
Triệu Minh Nguyễn
 

Destacado (17)

Memoire desiu médecine subaquatique et hyperbare dr soualhi .dr naamani.dr ba...
Memoire desiu médecine subaquatique et hyperbare dr soualhi .dr naamani.dr ba...Memoire desiu médecine subaquatique et hyperbare dr soualhi .dr naamani.dr ba...
Memoire desiu médecine subaquatique et hyperbare dr soualhi .dr naamani.dr ba...
 
Diş ti̇carette kullanilan tesli̇m şeki̇lleri̇
Diş ti̇carette kullanilan tesli̇m şeki̇lleri̇Diş ti̇carette kullanilan tesli̇m şeki̇lleri̇
Diş ti̇carette kullanilan tesli̇m şeki̇lleri̇
 
What do you_know_about_the_usa
What do you_know_about_the_usaWhat do you_know_about_the_usa
What do you_know_about_the_usa
 
INFLOW-2014-NVM-Compression
INFLOW-2014-NVM-CompressionINFLOW-2014-NVM-Compression
INFLOW-2014-NVM-Compression
 
Clusters - Quayside Clothing Case Study
Clusters - Quayside Clothing Case StudyClusters - Quayside Clothing Case Study
Clusters - Quayside Clothing Case Study
 
Мой класс
Мой класс Мой класс
Мой класс
 
Facebook education, Facebook Marketing
Facebook education, Facebook MarketingFacebook education, Facebook Marketing
Facebook education, Facebook Marketing
 
A profile of a famous person
A profile of a famous personA profile of a famous person
A profile of a famous person
 
коледа
коледаколеда
коледа
 
Laporan teknologi pupukdan pemupukan
Laporan teknologi pupukdan pemupukanLaporan teknologi pupukdan pemupukan
Laporan teknologi pupukdan pemupukan
 
portfolio web design
portfolio web designportfolio web design
portfolio web design
 
Laporan praktikum fistanklorofil
Laporan praktikum fistanklorofilLaporan praktikum fistanklorofil
Laporan praktikum fistanklorofil
 
I-Lappy- the Future Laptop
I-Lappy- the Future LaptopI-Lappy- the Future Laptop
I-Lappy- the Future Laptop
 
Presentation bus - international business
Presentation bus - international businessPresentation bus - international business
Presentation bus - international business
 
Ignatius termes i condicions
Ignatius termes i condicionsIgnatius termes i condicions
Ignatius termes i condicions
 
American Golf Courses Of Note
American Golf Courses Of NoteAmerican Golf Courses Of Note
American Golf Courses Of Note
 
Schedule
ScheduleSchedule
Schedule
 

Similar a Rescue of Long-Tail Data from the Ocean Bottom to the Moon

Similar a Rescue of Long-Tail Data from the Ocean Bottom to the Moon (20)

Talk at OHSU, September 25, 2013
Talk at OHSU, September 25, 2013Talk at OHSU, September 25, 2013
Talk at OHSU, September 25, 2013
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
METRO RDM Webinar
METRO RDM WebinarMETRO RDM Webinar
METRO RDM Webinar
 
IEDA Overview & Updates, March 2014
IEDA Overview & Updates, March 2014IEDA Overview & Updates, March 2014
IEDA Overview & Updates, March 2014
 
Goldschmidt2019 Samples Workshop
Goldschmidt2019 Samples WorkshopGoldschmidt2019 Samples Workshop
Goldschmidt2019 Samples Workshop
 
GBIF BIFA mentoring, Day 5a Data management, July 2016
GBIF BIFA mentoring, Day 5a Data management, July 2016GBIF BIFA mentoring, Day 5a Data management, July 2016
GBIF BIFA mentoring, Day 5a Data management, July 2016
 
Managing Social Science Data from the Arctic with ELOKA, ACADIS, NSIDC, and (...
Managing Social Science Data from the Arctic with ELOKA, ACADIS, NSIDC, and (...Managing Social Science Data from the Arctic with ELOKA, ACADIS, NSIDC, and (...
Managing Social Science Data from the Arctic with ELOKA, ACADIS, NSIDC, and (...
 
Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521Introduction to research data management; Lecture 01 for GRAD521
Introduction to research data management; Lecture 01 for GRAD521
 
GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)GBIF and reuse of research data, Bergen (2016-12-14)
GBIF and reuse of research data, Bergen (2016-12-14)
 
Sla2009 D Curation Heidorn
Sla2009 D Curation HeidornSla2009 D Curation Heidorn
Sla2009 D Curation Heidorn
 
Disciplinary and institutional perspectives on digital curation
Disciplinary and institutional perspectives on digital curationDisciplinary and institutional perspectives on digital curation
Disciplinary and institutional perspectives on digital curation
 
NISO Forum, Denver, Sept. 24, 2012: Data Equivalence
NISO Forum, Denver, Sept. 24, 2012: Data EquivalenceNISO Forum, Denver, Sept. 24, 2012: Data Equivalence
NISO Forum, Denver, Sept. 24, 2012: Data Equivalence
 
Use of persistent identifiers to link heterogeneous data systems in the Integ...
Use of persistent identifiers to link heterogeneous data systems in the Integ...Use of persistent identifiers to link heterogeneous data systems in the Integ...
Use of persistent identifiers to link heterogeneous data systems in the Integ...
 
Sarah Jones RDM from a disciplinary perspective
Sarah Jones RDM from a disciplinary perspectiveSarah Jones RDM from a disciplinary perspective
Sarah Jones RDM from a disciplinary perspective
 
Module 1 - Data Around Us .pptx
Module 1 - Data Around Us .pptxModule 1 - Data Around Us .pptx
Module 1 - Data Around Us .pptx
 
IEEE_BigData2014-Lee.pdf
IEEE_BigData2014-Lee.pdfIEEE_BigData2014-Lee.pdf
IEEE_BigData2014-Lee.pdf
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
 
Guy avoiding-dat apocalypse
Guy avoiding-dat apocalypseGuy avoiding-dat apocalypse
Guy avoiding-dat apocalypse
 
Looking for Data: Finding New Science
Looking for Data: Finding New ScienceLooking for Data: Finding New Science
Looking for Data: Finding New Science
 
E research overview gahegan bioinformatics workshop 2010
E research overview gahegan bioinformatics workshop 2010E research overview gahegan bioinformatics workshop 2010
E research overview gahegan bioinformatics workshop 2010
 

Último

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Último (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Rescue of Long-Tail Data from the Ocean Bottom to the Moon

  • 1. Rescue of Long-Tail Data 
 from the Ocean Bottom to the Moon! ! Leslie Hsu, Kerstin Lehnert, Suzanne Carbotte, Vicki Ferrini,! 1 2 3! ! John Delano , James B. Gill , Maurice Tivey ! Lamont-Doherty Earth Observatory, Columbia University,! ! 1University of Albany, 2University of California, Santa Cruz, 3Woods Hole Oceanographic Institution! ! ! IN12A. Data Curation, Credibility, Preservation Implementation, and Data Rescue to Enable Multi-source Science! Fall AGU 2013! IEDA iedadata.org
  • 2. Data at Risk! ¤  "Data at Risk" is scientific data that are ! ¤  not in formats that permit full electronic access to the information they contain. ! ¤  Data at Risk may be ! ¤  non-digital (e.g., handwritten or photographic), ! ¤  on near-obsolete digital media (such as floppy disks), ! ¤  or insufficiently described (lacking metadata). ! ¤  Some born-digital data are considered "at risk" if they cannot be ingested into managed databases because they lack adequate formatting or metadata.! ! Definition from the ICSU CODATA Data at Risk Task Group (DARTG)! IEDA iedadata.org
  • 3. Data Rescue! ¤  A “Data Rescue Mission” is any effort to preserve data at risk. Rescue missions can come in the form of digitization, format migration, treating damaged materials (e.g., water or mold), adding metadata or any action taken to make data accessible in the long term.! M. Tivey Definition from ICSU CODATA Data at Risk Task Group (DARTG) IEDA iedadata.org
  • 4. Long Tail Data are often Data at Risk! The Head: Long Tail Characteristics! Astronomy, Climate, High Energy Physics, Genomics q  q  q  q  q  q  Long Tail: Environmental and Earth sciences http://juliegood.wordpress.com/tag/long-tail/ L. Wyborn More specialised! Low volume! On C drives! Hard to find! Heterogeneous! Collected by many people! q  Citizen science! q  Etc! q  Etc! IEDA iedadata.org
  • 5. IEDA Data Rescue Mini-Awards! ¤ Established to preserve valuable legacy data sets that are in danger by impending retirement or degradation! ¤  Evaluated by highest impact on future research by quality, size, rarity, unique location or data type! ¤  Made accessible to the community for re-use by inclusion in the IEDA data collections (EarthChem, MGDS, SESAR)! ¤  $7000 award to support proper compilation, documentation, transfer! ¤  3 awardees chosen from 11 entries over a wide range of geochemical and geophysical data! ! IEDA iedadata.org
  • 6. 1: Geologic samples and geochemistry! ¤  WHAT: Compilation of sample metadata and geochemical analyses from three areas – Fiji, Izu Arc, and Endeavour segment. (James B. Gill)! Maps made with GeoMapApp ¤  WHY: study of intra-ocean arcs and spreading centers! ¤  HOW: Check and add incomplete data, digitize data, add persistent identifiers. Link between related resources! ¤  Major challenge: Physical sample management! IEDA iedadata.org
  • 7. The importance of Sample identification! ¤  Individual samples can play a large role in scientific conclusions, so accurate documentation of sample metadata is critical.! ¤  The key measurement was the one backarc basalt called "PPTUW”... Subsequent efforts to confirm the observation ran into problems. The apparently-same sample was variously called PPTU, PPTUW/5, PPTUW-1, and TVZ19 in four other papers. None of those papers gave its latitude and longitude… (J. Gill and E. Todd)! IEDA iedadata.org
  • 8. 2: Near-bottom magnetics! ¤  WHAT: Compilation of near-bottom magnetometer data, including raw, merged, processed, and navigation metadata (Maurice Tivey)! ¤  WHY: study of magnetic reversals, effect of tectonics on magnetic field! ¤  HOW: gather data from different formats, add complete metadata and workflow! ¤  Challenge: over three decades of technology and file formats! IEDA iedadata.org
  • 9. Evolution of equipment: 1985, 1992, 2004, 2011 ! IEDA iedadata.org
  • 10. Evolution of storage media! M. Tivey IEDA iedadata.org
  • 11. Addition of “sufficient” metadata! IEDA iedadata.org
  • 12. 3: Lunar sample geochemistry! ¤  WHAT: Compilation of lunar sample geochemistry (John W. Delano et al.)! ¤  WHY: composition of the Moon! ¤  HOW: Digitize photos, label specific grains, compile geochemistry in data templates! ¤  Challenge: nothing was digital! ! LPI IEDA iedadata.org
  • 13. Use of IEDA EarthChem templates! IEDA iedadata.org
  • 14. Common needs addressed! ¤ Accessibility – web access, links between systems! ¤ Documentation – README files, additional descriptions! ¤ Standardization – IEDA EarthChem geochemical templates ! ¤ Persistent links – DOIs and IGSNs! ¤ Citability – DOIs, example citations! ¤ Guidance/Training – calls and emails with disciplinary repository staff! IEDA iedadata.org
  • 16. Lessons learned: investigator! ¤ Take ownership of your own legacy! ¤  Data curation by others may not be complete or correct! ¤ Data rescue of an entire career does not need to be overwhelming ! ¤  Start with small steps! ¤  Disciplinary repositories will help and guide you to what is needed! ¤ Despite the time investment, data rescue is worth it! ¤  Others will now be able to re-use the data! ¤  Notes taken years ago actually explain anomalies! ! IEDA iedadata.org
  • 17. Lessons learned: repository! ¤ For Long Tail Data, every project is different ! ¤  There is not an established workflow – just past experience! ¤  Time commitment from staff is nontrivial! ¤ Disciplinary training helps a great deal! ¤  Investigators need help determining the best products! ¤ A small incentive will motivate investigators! ¤ Data Rescue missions help the repository determine next steps for development of tools and services! IEDA iedadata.org
  • 18. Summary of Long-tail Data Rescue! ¤ Three Data Rescue efforts this past year by IEDA have made data that were at risk! ¤  digitized from analog data and near-obsolete media! ¤  sufficiently described for reuse! ¤  in formats that permit full electronic access! ¤  Citable, with persistent identifiers, and ready for reuse! ¤ The projects also helped IEDA identify improvements in data rescue workflow, and future tools and services! IEDA iedadata.org
  • 19. More Data Rescue Activities! ¤ Elsevier-IEDA Data Rescue Process Study! ¤  A data entry tool for lunar geochemistry: MoonDB! ¤ Elsevier-IEDA International Data Rescue Award! ¤  Winner announced at reception tonight, Monday Dec 9th, 2013! ¤  Intercontinental Hotel, Twin Peaks Room, 7:00-8:30pm! IEDA iedadata.org