Archival Information Packages for
NASA HDF-EOS Data
R. Duerr, Kent Yang, Azhar Sikander
Outline
• What is an Archival Information Package?
  - HDF-AIP
• Standards? What Standards?
  - METS
  - DIF/FGDC/ISO 19115-2
  - PREMIS
• Results
• Next Steps

Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF
and HDF-EOS Workshop XIII
OAIS Reference Model¹
Figure: Archive Information Package

¹ Reference Model for an Open Archival Information System (OAIS), CCSDS 650.0-B-1, Blue Book, January 2002.

Archival Information Package Contents
• Content Information
  - The data object to be preserved
  - Information that describes the data object
    o Typically interpreted as the syntax and semantics of the file structure
• Preservation Description Information
  - Provenance: origin or source of the data, any changes that have taken place since, and who has had custody of it
  - Fixity: the authentication mechanisms (with keys) needed to ensure that the data object has not been altered in an undocumented manner
  - Reference: identification mechanisms and values
  - Context: relation of the object to its environment
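
As a rough illustration of the Fixity item above, a checksum recorded at ingest and recomputed later is one common way to detect undocumented change. The sketch below is only an assumption about how that could look: the slides do not name a digest algorithm, and the function names `sha256_of_file` and `verify_fixity` are hypothetical. A plain SHA-256 digest is unkeyed; a keyed mechanism such as an HMAC would be needed where the slide's "with keys" requirement applies.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks.

    Hypothetical helper: the algorithm choice is an assumption,
    not something specified in the presentation.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large data files need not fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_fixity(path, recorded_hex):
    """Compare a freshly computed digest with the one stored in the AIP."""
    return sha256_of_file(path) == recorded_hex.lower()
```

A repository would typically store the recorded digest in the package metadata at ingest and call something like `verify_fixity` during periodic audits.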

HDF-Archive Information Packages
• The HDF Group was funded to investigate and propose a design for a complete archival information package for HDF data files
• The result was a METS metadata file to accompany the HDF data file
http://www.hdfgroup.org/projects/hdf5_aip/hdf5_aip_wp.html
Metadata Standards - METS
• Metadata Encoding and Transmission Standard
• An initiative of the Digital Library Federation
• Provides the means to convey the metadata necessary for
  - management of digital objects within a repository
  - exchange of objects between repositories (or between repositories and their users)
• Designed to facilitate
  - shared development of information management tools/services
  - interoperable exchange of digital materials

METS - A very brief overview
• Describes the METS document itself
  - e.g., creator or editor
• Describes the object using some external standard
  - e.g., MARC, FGDC, Dublin Core
• Describes object creation, storage, intellectual property rights, source info, provenance, etc.
  - e.g., PREMIS
• Provides an inventory of all of the files that are part of the object described
• A physical or logical map of the organization of the materials described
• Allows specification of hyperlinks between parts of the map (mostly useful when preserving websites)
• Used to associate executable code with parts of the content

Metadata Standards - Descriptive Metadata

Derived from

• Discovery, Assess and Access Metadata
  - GCMD DIF
  - FGDC CSDGM
  - ISO 19115

Metadata Standards - ISO 19115:2003
• The international equivalent of the FGDC standard
• Most fields can be mapped or generated from
FGDC metadata
• The exception is the Dataset Topic Keywords
• Allows for national profiles

Metadata Standards - ISO 19115:2003

Is there a metadata standard for AIP information?
Figure: Archive Information Package (OAIS Reference Model, CCSDS 650.0-B-1)

Preservation Metadata Implementation Strategies (PREMIS)
• Provide a core preservation metadata set with broad applicability across the digital preservation community
• Developed by an OCLC and RLG sponsored international working group
  - Representatives from libraries, museums, archives, government, and the private sector
• Maintained by the Library of Congress
• Based on the OAIS reference model

PREMIS - Entity-Relationship Diagram
• Intellectual Entities: “a coherent set of content that is reasonably described as a unit”
  - For example, a web site, data set or collection of data sets
• Objects: “a discrete unit of information in digital form”
  - For example, a data file
• Events: “an action that involves at least one object or agent known to the preservation repository”
  - e.g., created, archived, migrated
• Agents: “a person, organization, or software program associated with preservation events in the life of an object”
  - e.g., Dr. Spock donated it
• Rights: “assertions of one or more rights or permissions pertaining to an object or an agent”
  - e.g., copyright notice, legal statute, deposit agreement
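
To make the relationships between these entities concrete, here is a minimal sketch in Python. It is not the PREMIS schema or any official binding: the class and field names (`DigitalObject`, `Event`, `event_type`, and so on) are hypothetical and chosen only for illustration. The one rule it enforces comes from the Event definition, which requires at least one object or agent.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Agent:
    """A person, organization, or software program (hypothetical fields)."""
    identifier: str
    name: str


@dataclass
class Rights:
    """An assertion of rights or permissions, e.g. a deposit agreement."""
    statement: str


@dataclass
class DigitalObject:
    """A discrete unit of information in digital form, e.g. a data file."""
    identifier: str
    intellectual_entity: str  # the data set or collection it belongs to
    rights: List[Rights] = field(default_factory=list)


@dataclass
class Event:
    """An action involving at least one object or agent."""
    event_type: str  # e.g. "created", "archived", "migrated"
    date: str
    objects: List[DigitalObject] = field(default_factory=list)
    agents: List[Agent] = field(default_factory=list)

    def __post_init__(self):
        # Enforce the "at least one object or agent" part of the definition.
        if not self.objects and not self.agents:
            raise ValueError("an event must involve at least one object or agent")
```

For example, a migration could be recorded as `Event("migrated", "2009-11-04", objects=[granule], agents=[converter])`, where `granule` and `converter` are instances created by the caller.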
Is there a metadata standard for AIP information?
Figure: Archive Information Package (OAIS Reference Model, CCSDS 650.0-B-1), labeled with PREMIS and ISO 19115

NOAA Data Stewardship Prototype
• NSIDC and THG (The HDF Group) demonstrated the feasibility of migrating NASA data to a standard HDF-AIP format
• Motivation:
  - Technologies change regularly, organizations come and go, but data must survive
  - But preserving data takes more than just preserving the bits; all the components of an AIP are critical
Project Goals
• Prototype development of Archive Information Packages for HDF data:
  - For entire data sets
  - For individual “granules”
• Test usability of digital library standards with geospatial data

Program Plan (Modified)
Figure: flow diagram. Labels recovered from the diagram:
• NSIDC/ECS HDF4-data
• H4toH5
• NetCDF4/HDF5-data
• NSIDC/ECS Metadata
• ECS to METS (Data Set)
• ECS to METS (Granule)
• METS
• ISO-19115
• CDM/NetCDF4
• HDF5-AIP (NetCDF4/HDF5 Data, METS)

HDF5 Granule Level Archive Information Packages
HDF5 AIP Components
• Data file: HDF5
• Metadata file: METS
  - Primary Schema: METS
  - Extension Schema: ISO 19115, PREMIS

|<mets>
|---<dmdSec>--------------<ISO 19115>
|---<amdSec>------------|--<techMD>
|                       |--<rightsMD>
|                       |--<sourceMD>
|---<fileGrp>
|---<structMap>

http://www.hdfgroup.uiuc.edu/papers/papers/AIP/HDF5_AIP_White_Paper.pdf
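
The component layout above can be sketched with Python's standard `xml.etree.ElementTree`. This is only an illustrative skeleton, not the project's prototype software and not a schema-valid METS document: the metadata payloads are omitted, and the IDs, the `application/x-hdf5` MIME type, the use of `MDTYPE="PREMIS"` for all three administrative subsections, and the function name `build_mets_skeleton` are assumptions. The METS namespace and the `dmdSec`, `amdSec`, `fileSec`/`fileGrp` and `structMap` element names come from the METS standard.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)


def m(tag):
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"


def build_mets_skeleton(data_href, checksum):
    """Build an empty METS wrapper for one HDF5 granule (hypothetical helper)."""
    root = ET.Element(m("mets"))

    # Descriptive metadata: the slide places ISO 19115 here.
    dmd = ET.SubElement(root, m("dmdSec"), ID="DMD1")
    ET.SubElement(dmd, m("mdWrap"), MDTYPE="OTHER", OTHERMDTYPE="ISO19115")

    # Administrative metadata; PREMIS for every subsection is an assumption.
    amd = ET.SubElement(root, m("amdSec"), ID="AMD1")
    for i, name in enumerate(("techMD", "rightsMD", "sourceMD"), start=1):
        sec = ET.SubElement(amd, m(name), ID=f"ADM{i}")
        ET.SubElement(sec, m("mdWrap"), MDTYPE="PREMIS")

    # File inventory: a single pointer to the HDF5 data file.
    grp = ET.SubElement(ET.SubElement(root, m("fileSec")), m("fileGrp"))
    f = ET.SubElement(grp, m("file"), ID="FILE1", MIMETYPE="application/x-hdf5",
                      CHECKSUM=checksum, CHECKSUMTYPE="SHA-256")
    ET.SubElement(f, m("FLocat"),
                  {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": data_href})

    # Structural map: one division that points at the data file.
    div = ET.SubElement(ET.SubElement(root, m("structMap")), m("div"),
                        TYPE="granule")
    ET.SubElement(div, m("fptr"), FILEID="FILE1")
    return root
```

Calling `ET.tostring(build_mets_skeleton("granule.h5", "<hex digest>"), encoding="unicode")` yields the XML text; a real package would embed ISO 19115 and PREMIS records inside each `mdWrap`.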

File Level AIP Activity Status
• Developed a map from NSIDC/ECS metadata to METS/PREMIS/ISO 19115 components
• Prototype software completed
• Issues
  - What goes in PREMIS vs ISO 19115?
  - Auxiliary file handling: own AIP or not?
    o E.g., browse files, processing history, PGEs
  - Granules vs files

Issues and Questions
• Inconsistent use of terminology between standards; for example, what is a data set?
• Many of the standards care about distribution formats
  - Are these even relevant concepts any more?
  - Do you really want to have to update the metadata record just because a new distribution format was added?
  - What about new access services?

Next Steps
• NSIDC is updating our non-ECS data systems' handling of metadata, including support for PREMIS and other metadata on all holdings
• Work is underway to upgrade granule-level metadata for NSIDC flagship sea ice products (PREMIS/METS/ISO AIP packages)
• Work to improve the archivability of data stored in HDF formats is ongoing; NASA is implementing a standard XML description of contents across its archives
Acknowledgement
This work was supported under NOAA Scientific
Stewardship Program grant number
NA07OAR4310286. Any opinions, findings,
and conclusions or recommendations
expressed in this material are those of the
author(s) and do not necessarily reflect the
views of NOAA.

Archive Information Packages for NASA HDF-EOS Data

  • 1. Archival Information Packages for NASA HDF-EOS Data R. Duerr, Kent Yang, Azhar Sikander
  • 2. Outline • What is an Archival Information Package?  HDF-AIP • Standards? What Standards?  METS  DIF/FGDC/ISO 19115-2  PREMIS • Results • Next Steps Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 3. OAIS Reference Model1 Archive Information Package 1 Reference Model for an Open Archival Information System (OAIS), CCSDS 650.0-B-1, Blue Book, January 2002. Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 4. Archival Information Package Contents • Content Information  The data object to be preserved  Information that describes the data object o Typically interpreted as the syntax and semantics of the file structure • Preservation Description Information  Provenance – Origin or source of the data, any changes that have taken place since, and who has had custody of it  Fixity – the authentication mechanisms (with keys) needed to ensure that the data object has not been altered in an undocumented manner  Reference – identification mechanisms and values  Context – relation of the object to its environment Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 5. HDF-Archive Information Packages • The HDF group was funded to investigate and propose a design for a complete archival information package for HDF data files • The result was a METS metadata file to accompany the HDF data file http://www.hdfgroup.org/projects/hdf5_aip/hdf5_aip_wp.html Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 6. Metadata Standards - METS • Metadata Encoding and Transmission Standard • An initiative of the Digital Library Federation • Provides the means to convey the metadata necessary for  management of digital objects within a repository  exchange of objects between repositories (or between repositories and their users) • Designed to facilitate  shared development of information management tools/services  interoperable exchange of digital materials Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 7. METS - A very brief overview Describes the METS document itself Describes the editor e.g., creator orobject using some external standard Describes object creation, storage, e.g., MARC, FGDC, Dublin Core intellectual property rights, source info, provenance, etc. Provides an inventory of all of the e.g., PREMIS files that are part of the object described A physical or logical map of the organization of the materials described Allows specification of hyperlinks between parts of the map (mostly useful when preserving websites) Used to associate executable code with parts of the content Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 8. Metadata Standards - Descriptive Metadata Derived from • Discovery, Assess and Access Metadata  GCMD DIF  FGDC CSDGM  ISO 19115 Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 9. Metadata Standards - ISO 19115:2003 • The international equivalent of the FGDC standard • Most fields can be mapped or generated from FGDC metadata • The exception is the Dataset Topic Keywords • Allows for national profiles Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 10. Metadata Standards - ISO 19115:2003 Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 11. Is there a metadata standard for AIP information? Archive Information Package 1 Reference Model for an Open Archival Information System (OAIS), CCSDS 650.0-B-1, Blue Book, January 2002. Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 12. Preservation Metadata Implementation Strategies (PREMIS) • Provide a core preservation metadata set with broad applicability across the digital preservation community • Developed by an OCLC and RLG sponsored international working group  Representatives from libraries, museums, archives, government, and the private sector. • Maintained by the Library of Congress • Based on the OAIS reference model Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 13. PREMIS - Entity-Relationship Diagram Intellectual Entities Objects “an action that involves at least organization, or Rights “a“a coherent set of content person,one object or agent known to the of information software program associated “a discrete unitpreservation that is reasonably repository” with described as a unit” in preservation events in digital form” thee.g.,example,archived, For created, a data file life of a web site, For example, an object” data migrated or more e.g., Dr. Spockofof data it “assertions donated sets set or collection one rights or permissions pertaining to an object or an agent” e.g., copywrite notice, legal Events statute, deposit agreement Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII Agents
  • 14. Is there a metadata standard for AIP information? PREMIS ISO 19115 1 Reference Model for an Open Archival Information System (OAIS), CCSDS 650.0-B-1, Blue Book, January 2002. Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 15. NOAA Data Stewardship Prototype • NSIDC and THG demonstrated the feasibility of migrating NASA data to a standard HDF-AIP format • Motivation: Technologies change regularly, organizations come and go, but data must survive But preserving data takes more than just preserving the bits, all the components of an AIP are critical Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 16. Project Goals • Prototype development of Archive Information Packages for HDF data:  For entire data sets  For individual “granules” • Test usability of digital library standards with geospatial data Archival Information Packages for NASA HDF-EOS Data, presented 11/4/09 by R. Duerr HDF and HDF-EOS Workshop XIII
  • 17. Program Plan (Modified)
      [Flow diagram. Labels: NSIDC/ECS HDF4 data; H4toH5; NetCDF4/HDF5 data; NSIDC/ECS metadata; ECS to METS (Granule); ECS to METS (Data Set); ISO 19115; CDM/NetCDF4; METS; HDF5-AIP]
  • 18. HDF5 Granule Level Archive Information Packages
      HDF5 AIP components: data file (HDF5) and metadata file (METS)
      Primary schema (METS)       Extension schema
      <mets>
      |---<dmdSec> -------------- <ISO 19115>
      |---<amdSec> --------------|--<techMD>
      |                          |--<rightsMD>
      |                          |--<sourceMD>
      |---<fileGrp>
      |---<structMap>
      Extension schemas shown: ISO 19115, PREMIS
      http://www.hdfgroup.uiuc.edu/papers/papers/AIP/HDF5_AIP_White_Paper.pdf
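A minimal sketch of the METS skeleton outlined on this slide, built with Python's standard `xml.etree.ElementTree`. It is an illustration under stated assumptions, not the project's prototype software: the function names, the `ID` values, the `TYPE="granule"` label, and the `MDTYPE` choices are hypothetical, and the metadata wrappers are left empty. The slide lists `<fileGrp>` directly under `<mets>`; the sketch places it inside `<fileSec>`, as the METS schema requires.

```python
import xml.etree.ElementTree as ET

METS_NS = "http://www.loc.gov/METS/"
XLINK_NS = "http://www.w3.org/1999/xlink"
ET.register_namespace("mets", METS_NS)
ET.register_namespace("xlink", XLINK_NS)


def q(tag: str) -> str:
    """Qualify a tag name with the METS namespace."""
    return f"{{{METS_NS}}}{tag}"


def wrapped_md(parent, tag, md_id, mdtype, other=None):
    """Add a metadata section holding an empty mdWrap/xmlData placeholder."""
    sec = ET.SubElement(parent, q(tag), ID=md_id)
    attrs = {"MDTYPE": mdtype}
    if other is not None:
        attrs["OTHERMDTYPE"] = other
    wrap = ET.SubElement(sec, q("mdWrap"), attrs)
    ET.SubElement(wrap, q("xmlData"))  # extension-schema content would go here
    return sec


def build_granule_mets(data_file_name: str) -> ET.Element:
    """Build a METS skeleton that points at one HDF5 data file."""
    root = ET.Element(q("mets"))

    # Descriptive metadata: placeholder for an ISO 19115 record (assumed typing).
    wrapped_md(root, "dmdSec", "DMD1", "OTHER", other="ISO 19115")

    # Administrative metadata: placeholders typed as PREMIS (assumption).
    amd = ET.SubElement(root, q("amdSec"), ID="AMD1")
    for tag in ("techMD", "rightsMD", "sourceMD"):
        wrapped_md(amd, tag, f"{tag}1", "PREMIS")

    # File inventory: one entry referencing the data file by relative URL.
    file_sec = ET.SubElement(root, q("fileSec"))
    grp = ET.SubElement(file_sec, q("fileGrp"))
    entry = ET.SubElement(grp, q("file"), ID="FILE1")
    ET.SubElement(entry, q("FLocat"),
                  {"LOCTYPE": "URL", f"{{{XLINK_NS}}}href": data_file_name})

    # Structural map: a single division pointing at the file entry.
    smap = ET.SubElement(root, q("structMap"))
    div = ET.SubElement(smap, q("div"), TYPE="granule")
    ET.SubElement(div, q("fptr"), FILEID="FILE1")
    return root
```

For example, `ET.tostring(build_granule_mets("granule.h5"), encoding="unicode")` serializes the skeleton so it can be written next to the HDF5 file it describes.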
  • 19. File Level AIP Activity Status
      • Developed a map from NSIDC/ECS metadata to METS/PREMIS/ISO 19115 components
      • Prototype software completed
      • Issues
          o What goes in PREMIS vs ISO 19115?
          o Auxiliary file handling: own AIP or not? E.g., browse files, processing history, PGEs
          o Granules vs files
  • 20. Issues and Questions
      • Inconsistent use of terminology between standards. For example, what is a data set?
      • Many of the standards care about distribution formats
          o Are these even relevant concepts any more?
          o Do you really want to have to update the metadata record just because a new distribution format was added?
          o What about new access services?
  • 21. Next Steps
      • NSIDC is updating its non-ECS data systems' handling of metadata, including support for PREMIS and related metadata on all holdings
      • Work is underway to upgrade granule-level metadata for NSIDC flagship sea ice products (PREMIS/METS/ISO AIP packages)
      • Work to improve the archivability of data stored in HDF formats is ongoing: NASA is implementing a standard XML description of contents across its archives
  • 22. Acknowledgement
      This work was supported under NOAA Scientific Stewardship Program grant number NA07OAR4310286. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of NOAA.

Editor's notes

  1. Lots of background material that I won’t really discuss – indicated
  2. Syntax: XFDU, DFDL, ESML. Semantics?
  3. A couple of interesting and useful things about METS: it is deliberately designed to handle objects at a wide variety of scales (single files, complex web sites). Rather than attempting to define descriptive and administrative metadata needs for all kinds of objects, its designers built the standard to incorporate a variety of other standards (e.g., FGDC for geospatial metadata).
  4. When you talk to a geoscientist or data scientist who deals with geospatial data, these are the standards they know and care about:
      • GCMD: because it is the oldest and is internationally accepted; because NASA/NOAA/NSF require it for data set descriptions; because the Global Change Master Directory is the data equivalent of WorldCat
      • FGDC: Content Standard for Digital Geospatial Metadata; derived from DIF; mandated for all federally funded data by Executive Order
      • ISO 19115: the most recent standard, replacing FGDC; adopted by NOAA and likely NASA
  5. But more than just descriptive metadata is needed. It is equally important to know what has happened to the data since its creation, to know its provenance.
  6. The PREMIS entity-relationship diagram.
      • Representation: “the set of files needed for a complete and reasonable rendition of an Intellectual Entity”
      • File
      • Bitstream: “contiguous or non-contiguous data within a file that has meaningful common properties for preservation purposes”
      So how does this apply to science data?
  7. Kept track of events in the digital library world for a few years. Noticed that they’ve come up with standards to deal with a wide variety of information types. NOAA and USGS were to be the ultimate home of much of NASA’s EOS data. THG, with funding ultimately from the National Archives and Records Administration, had written a white paper defining an HDF-AIP using a digital library standard: a standard called METS.
  8. Primary schema, extension schema. National Digital Geospatial Archive: LOC NDIIPP (National Digital Information Infrastructure and Preservation Program). Recommendation by Nancy Hoebelheinrich of Stanford.
  9. Data sets differ: some have 1 file per granule, others have many; some have a browse file for each granule, while in others the mapping is 1 to many, many to 1, or many to many.
  10. In ISO 19115 parlance, a dataset is an “identifiable collection of data,” where a dataset may reside in a larger dataset, can be as small as a single feature, and could even be a single map or chart (see ISO 19115:2003(E), page 3). This is in contrast to a data series, which is a “collection of datasets sharing the same product specification,” where the phrase “product specification” is totally undefined. In NASA, NOAA, and NSF parlance, a data set is the collection of all of the files for a particular project, from a particular instrument, etc., preferably all of the same type. A data set is composed of data files or data granules. In HDF parlance, a Science Data Set is the unit within a file that contains a particular data array.