SlideShare una empresa de Scribd logo
1 de 18
DM_PPT_NP_v02
Hierarchical Data Formats (HDF)
Update
Latest HDF releases and more
The HDF Group
Elena Pourmal (epourmal@hdfgroup.org)
This work was supported by NASA/GSFC under
Raytheon Co. contract number NNG15HZ39C
DM_PPT_NP_v02
2
Outline
• The HDF Group Website changes
• Update on HDF5 1.8.19, 1.10.1 and HDF 4.2.13
• Compatibility issues
• Updates on HDF-Java, HDFView 3.0 and other
tools
• Supported compilers and systems
• Compression library for interoperability with
h5py and Pandas
• Tell us about your needs!
DM_PPT_NP_v02
3
Where to find us on the Web?
• New Website (https://hdfgroup.org)
– Info about organization
– Latest 1.10 releases and HDFview 3.0
– New commercial tools by The HDF Group
• ODBC (Excel connector to HDF5)
– Registration
– Links to The HDF Group Support Website
(https://support.hdfgroup.org)
• Documentation
• Old releases
• Misc. information about projects
– We are working on the new Support Portal (launch by the
end of 2017)
• Send us your feedback!
DM_PPT_NP_v02
4
Latest HDF releases
• Release cycle – once a year
• HDF 4.2.13 (June 30, 2017)
– Memory leak fixes
– Support for Mac OS 10.12
– Support for the latest GNU, PGI an dIntel
compilers
• We do not plan any major work (i.e.,
performance improvements, new features,
etc.) for HDF4
• Encourage to move to HDF5
DM_PPT_NP_v02
5
HDF5
• Two versions
– HDF5 1.8.19 (May 16, 2017)
• Bug fixes, new APIs
– HDF5 1.10.1 (April 27, 2017)
• New features, extensions to HDF5 file format
DM_PPT_NP_v02
6
Dropping Support for HDF5 1.8
• Last release by June 30, 2019
– 4 more HDF5 1.8 releases
• We encourage you to move to HDF5 1.10
during the next year
– Recompile your application with the new
version of HDF5
• Contact help@hdfgroup.org if you
encounter any problems
DM_PPT_NP_v02
7
Issues you may encounter when
moving applications to 1.10
• C, Fortran, C++, Python application that
worked with HDF5 1.8 may create HDF5 file
incompatible with HDF5 1.8 file format
– When specifying latest file format while calling
H5Pset_libver_bounds function
– The HDF Group will provide a fix before dropping
support for HDF5 1.8
• Small update to the function call is required
• HDF5 Java applications
– HDF5 JNI supports 64-bit objects identifiers; code
based on the previous versions of HDF5 JNI
need to be updated
DM_PPT_NP_v02
8
Compatibility Issues
1.8 1.10
1.8 Yes No
Use H5Pset_libver_bounds
with appropriate parameters;
don’t use features new in
1.10.0, 1.10.1
1.10 Yes Yes
File is created by HDF5
FileisreadbyHDF5
DM_PPT_NP_v02
9
HDF5 1.8.19 New Features
• H5DOread_chunk
– Function to read compressed data without
uncompressing it (see H5DOwrite_chunk)
H5DOread_chunk
H5Dread
DM_PPT_NP_v02
10
HDF5 1.10.1 (Performance)
• “Evict on close” feature
– Reduces memory footprint when iterating
through many HDF5 objects (i.e, files, groups,
datasets)
• I/O improvements
– Paged Aggregation
– Page Buffering
https://support.hdfgroup.org/HDF5/docNewFeatures/
DM_PPT_NP_v02
11
HDF-JAVA Update
• HDF4 and HDF5 JNI are part of the HDF4
and HDF5 1.10 source distribution
– HDF5 JNI supports 64-bit objects identifiers;
code based on the previous versions of HDF5
JNI need
DM_PPT_NP_v02
12
HDFView 3.0 (beta)
• HDFView 3.0-beta release (May 31, 2017)
– The Graphical User Interface (GUI) framework that HDFView
uses was migrated from Swing (GUI widget toolkit for Java; part
of Oracle’s Java Foundation Classes ) to Standard Widget
Toolkit (http://www.eclipse.org/swt/ ), which provides a more
native application look and feel and advanced support for tables.
– The data views have been separated from the main HDFView
window. The main HDFView window still displays open files and
their structures on the left side of the window, and it now displays
any metadata on the right side.
– This release includes improved support for various datatypes
(compound, array of compound, and opaque).
• HDFView 3.0 planned for December 2017
DM_PPT_NP_v02
13
HDF Tools
• Command-line tools in HDF4 and HDF5
– Display content
– Copy data from one file to another
– Diff two files
• Maintenance mode (bug fixing)
• Which tools are missing?
– HDF4 and HDF5 diff
– ?
DM_PPT_NP_v02
14
Supported Compilers
• GNU
• PGI
• Intel
• We test with two latest compiler versions
available
• Other?
DM_PPT_NP_v02
15
Supported OSs
• Linux 2.6, 2.7 and 3.10
• Mac OS X 10.(8,9,10,11) and moving to 10.12
• Windows 10 (32 and 64-bit)
– VS 2015 and Intel Fortran v.16
• Windows 7 (32 and 64-bit)
– VS 2013 and Intel Fortran v.15
• Cygwin 32-bit
• SunOS 5.11 (32 and 64-bit)
• PowerPC 64
• Different Linux distributions (Fedora, Suse, Debian)
• Anything missing?
DM_PPT_NP_v02
16
Compression Library
• HDF5 compression filters (plugins)
• Dynamically loaded at run-time
– BZIP2 (PyTables, Pandas)
– MAFISC
– BLOSC (PyTables, Pandas)
– LZ4 (h5py)
– More filters are coming….
• Contact help@hdfgroup.org if interested to
try
DM_PPT_NP_v02
17
Open Discussion
• Tell us about your needs
DM_PPT_NP_v02
18
This work was supported by
NASA/GSFC under Raytheon Co.
contract number NNG15HZ39C

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

HDF Product Designer
HDF Product DesignerHDF Product Designer
HDF Product Designer
 
MODIS Land and HDF-EOS
MODIS Land and HDF-EOSMODIS Land and HDF-EOS
MODIS Land and HDF-EOS
 
GDAL Enhancement for ESDIS Project
GDAL Enhancement for ESDIS ProjectGDAL Enhancement for ESDIS Project
GDAL Enhancement for ESDIS Project
 
Scientific Computing and Visualization using HDF
Scientific Computing and Visualization using HDFScientific Computing and Visualization using HDF
Scientific Computing and Visualization using HDF
 
Geospatial Data Abstraction Library (GDAL) Enhancement for ESDIS (GEE)
Geospatial Data Abstraction Library (GDAL) Enhancement for ESDIS (GEE)Geospatial Data Abstraction Library (GDAL) Enhancement for ESDIS (GEE)
Geospatial Data Abstraction Library (GDAL) Enhancement for ESDIS (GEE)
 
HDF Project Update
HDF Project UpdateHDF Project Update
HDF Project Update
 
Incorporating ISO Metadata Using HDF Product Designer
Incorporating ISO Metadata Using HDF Product DesignerIncorporating ISO Metadata Using HDF Product Designer
Incorporating ISO Metadata Using HDF Product Designer
 
MATLAB and Scientific Data: New Features and Capabilities
MATLAB and Scientific Data: New Features and CapabilitiesMATLAB and Scientific Data: New Features and Capabilities
MATLAB and Scientific Data: New Features and Capabilities
 
NEON HDF5
NEON HDF5NEON HDF5
NEON HDF5
 
Utilizing HDF4 File Content Maps for the Cloud Computing
Utilizing HDF4 File Content Maps for the Cloud ComputingUtilizing HDF4 File Content Maps for the Cloud Computing
Utilizing HDF4 File Content Maps for the Cloud Computing
 
Improved Methods for Accessing Scientific Data for the Masses
Improved Methods for Accessing Scientific Data for the MassesImproved Methods for Accessing Scientific Data for the Masses
Improved Methods for Accessing Scientific Data for the Masses
 
HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?
 
HDF Cloud Services
HDF Cloud ServicesHDF Cloud Services
HDF Cloud Services
 
HDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and FutureHDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and Future
 
HDF-EOS 2/5 to netCDF Converter
HDF-EOS 2/5 to netCDF ConverterHDF-EOS 2/5 to netCDF Converter
HDF-EOS 2/5 to netCDF Converter
 
Data Analytics using MATLAB and HDF5
Data Analytics using MATLAB and HDF5Data Analytics using MATLAB and HDF5
Data Analytics using MATLAB and HDF5
 
HDF5 Performance Enhancements with the Elimination of Unlimited Dimension
HDF5 Performance Enhancements with the Elimination of Unlimited DimensionHDF5 Performance Enhancements with the Elimination of Unlimited Dimension
HDF5 Performance Enhancements with the Elimination of Unlimited Dimension
 
Data Are from Mars, Tools Are from Venus
Data Are from Mars, Tools Are from VenusData Are from Mars, Tools Are from Venus
Data Are from Mars, Tools Are from Venus
 
Putting some Spark into HDF5
Putting some Spark into HDF5Putting some Spark into HDF5
Putting some Spark into HDF5
 
Indexing HDF5: A Survey
Indexing HDF5: A SurveyIndexing HDF5: A Survey
Indexing HDF5: A Survey
 

Similar a Hierarchical Data Formats (HDF) Update

Similar a Hierarchical Data Formats (HDF) Update (20)

HDF - Current status and Future Directions
HDF - Current status and Future Directions HDF - Current status and Future Directions
HDF - Current status and Future Directions
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
HDF Updae
HDF UpdaeHDF Updae
HDF Updae
 
The State of HDF
The State of HDFThe State of HDF
The State of HDF
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
HDF Status and Development
HDF Status and DevelopmentHDF Status and Development
HDF Status and Development
 
Parallel HDF5 Developments
Parallel HDF5 DevelopmentsParallel HDF5 Developments
Parallel HDF5 Developments
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020
 
Moving applications to HDF5 1.8
Moving applications to HDF5 1.8Moving applications to HDF5 1.8
Moving applications to HDF5 1.8
 
HDF Project Status and Plans
HDF Project Status and PlansHDF Project Status and Plans
HDF Project Status and Plans
 
HDF Status Update
HDF Status UpdateHDF Status Update
HDF Status Update
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
Support for NPP/NPOESS by The HDF Group
Support for NPP/NPOESS by The HDF GroupSupport for NPP/NPOESS by The HDF Group
Support for NPP/NPOESS by The HDF Group
 
HDF Update
HDF UpdateHDF Update
HDF Update
 
HDF5 Backward and Forward Compatibility Issues
HDF5 Backward and Forward Compatibility IssuesHDF5 Backward and Forward Compatibility Issues
HDF5 Backward and Forward Compatibility Issues
 
Transition from HDF4 to HDF5
Transition from HDF4 to HDF5 Transition from HDF4 to HDF5
Transition from HDF4 to HDF5
 
Introduction to HDF5 Data and Programming Models
Introduction to HDF5 Data and Programming ModelsIntroduction to HDF5 Data and Programming Models
Introduction to HDF5 Data and Programming Models
 
HDF Update
HDF UpdateHDF Update
HDF Update
 

Más de The HDF-EOS Tools and Information Center

STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...The HDF-EOS Tools and Information Center
 

Más de The HDF-EOS Tools and Information Center (20)

Cloud-Optimized HDF5 Files
Cloud-Optimized HDF5 FilesCloud-Optimized HDF5 Files
Cloud-Optimized HDF5 Files
 
Accessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDSAccessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDS
 
Highly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance FeaturesHighly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance Features
 
Creating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 FilesCreating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 Files
 
HDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance DiscussionHDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance Discussion
 
Hyrax: Serving Data from S3
Hyrax: Serving Data from S3Hyrax: Serving Data from S3
Hyrax: Serving Data from S3
 
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLABAccessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
 
HDF - Current status and Future Directions
HDF - Current status and Future DirectionsHDF - Current status and Future Directions
HDF - Current status and Future Directions
 
H5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only LibraryH5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only Library
 
MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10
 
HDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDFHDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDF
 
HDF5 <-> Zarr
HDF5 <-> ZarrHDF5 <-> Zarr
HDF5 <-> Zarr
 
HDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server FeaturesHDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server Features
 
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
 
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
 
Leveraging the Cloud for HDF Software Testing
Leveraging the Cloud for HDF Software TestingLeveraging the Cloud for HDF Software Testing
Leveraging the Cloud for HDF Software Testing
 
Google Colaboratory for HDF-EOS
Google Colaboratory for HDF-EOSGoogle Colaboratory for HDF-EOS
Google Colaboratory for HDF-EOS
 
Parallel Computing with HDF Server
Parallel Computing with HDF ServerParallel Computing with HDF Server
Parallel Computing with HDF Server
 
HDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's GuideHDF-EOS Data Product Developer's Guide
HDF-EOS Data Product Developer's Guide
 
NASA Terra Data Fusion
NASA Terra Data FusionNASA Terra Data Fusion
NASA Terra Data Fusion
 

Último

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 

Último (20)

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 

Hierarchical Data Formats (HDF) Update

  • 1. DM_PPT_NP_v02 Hierarchical Data Formats (HDF) Update Latest HDF releases and more The HDF Group Elena Pourmal (epourmal@hdfgroup.org) This work was supported by NASA/GSFC under Raytheon Co. contract number NNG15HZ39C
  • 2. DM_PPT_NP_v02 2 Outline • The HDF Group Website changes • Update on HDF5 1.8.19, 1.10.1 and HDF 4.2.13 • Compatibility issues • Updates on HDF-Java, HDFView 3.0 and other tools • Supported compilers and systems • Compression library for interoperability with h5py and Pandas • Tell us about your needs!
  • 3. DM_PPT_NP_v02 3 Where to find us on the Web? • New Website (https://hdfgroup.org) – Info about organization – Latest 1.10 releases and HDFview 3.0 – New commercial tools by The HDF Group • ODBC (Excel connector to HDF5) – Registration – Links to The HDF Group Support Website (https://support.hdfgroup.org) • Documentation • Old releases • Misc. information about projects – We are working on the new Support Portal (launch by the end of 2017) • Send us your feedback!
  • 4. DM_PPT_NP_v02 4 Latest HDF releases • Release cycle – once a year • HDF 4.2.13 (June 30, 2017) – Memory leak fixes – Support for Mac OS 10.12 – Support for the latest GNU, PGI an dIntel compilers • We do not plan any major work (i.e., performance improvements, new features, etc.) for HDF4 • Encourage to move to HDF5
  • 5. DM_PPT_NP_v02 5 HDF5 • Two versions – HDF5 1.8.19 (May 16, 2017) • Bug fixes, new APIs – HDF5 1.10.1 (April 27, 2017) • New features, extensions to HDF5 file format
  • 6. DM_PPT_NP_v02 6 Dropping Support for HDF5 1.8 • Last release by June 30, 2019 – 4 more HDF5 1.8 releases • We encourage you to move to HDF5 1.10 during the next year – Recompile your application with the new version of HDF5 • Contact help@hdfgroup.org if you encounter any problems
  • 7. DM_PPT_NP_v02 7 Issues you may encounter when moving applications to 1.10 • C, Fortran, C++, Python application that worked with HDF5 1.8 may create HDF5 file incompatible with HDF5 1.8 file format – When specifying latest file format while calling H5Pset_libver_bounds function – The HDF Group will provide a fix before dropping support for HDF5 1.8 • Small update to the function call is required • HDF5 Java applications – HDF5 JNI supports 64-bit objects identifiers; code based on the previous versions of HDF5 JNI need to be updated
  • 8. DM_PPT_NP_v02 8 Compatibility Issues 1.8 1.10 1.8 Yes No Use H5Pset_libver_bounds with appropriate parameters; don’t use features new in 1.10.0, 1.10.1 1.10 Yes Yes File is created by HDF5 FileisreadbyHDF5
  • 9. DM_PPT_NP_v02 9 HDF5 1.8.19 New Features • H5DOread_chunk – Function to read compressed data without uncompressing it (see H5DOwrite_chunk) H5DOread_chunk H5Dread
  • 10. DM_PPT_NP_v02 10 HDF5 1.10.1 (Performance) • “Evict on close” feature – Reduces memory footprint when iterating through many HDF5 objects (i.e, files, groups, datasets) • I/O improvements – Paged Aggregation – Page Buffering https://support.hdfgroup.org/HDF5/docNewFeatures/
  • 11. DM_PPT_NP_v02 11 HDF-JAVA Update • HDF4 and HDF5 JNI are part of the HDF4 and HDF5 1.10 source distribution – HDF5 JNI supports 64-bit objects identifiers; code based on the previous versions of HDF5 JNI need
  • 12. DM_PPT_NP_v02 12 HDFView 3.0 (beta) • HDFView 3.0-beta release (May 31, 2017) – The Graphical User Interface (GUI) framework that HDFView uses was migrated from Swing (GUI widget toolkit for Java; part of Oracle’s Java Foundation Classes ) to Standard Widget Toolkit (http://www.eclipse.org/swt/ ), which provides a more native application look and feel and advanced support for tables. – The data views have been separated from the main HDFView window. The main HDFView window still displays open files and their structures on the left side of the window, and it now displays any metadata on the right side. – This release includes improved support for various datatypes (compound, array of compound, and opaque). • HDFView 3.0 planned for December 2017
  • 13. DM_PPT_NP_v02 13 HDF Tools • Command-line tools in HDF4 and HDF5 – Display content – Copy data from one file to another – Diff two files • Maintenance mode (bug fixing) • Which tools are missing? – HDF4 and HDF5 diff – ?
  • 14. DM_PPT_NP_v02 14 Supported Compilers • GNU • PGI • Intel • We test with two latest compiler versions available • Other?
  • 15. DM_PPT_NP_v02 15 Supported OSs • Linux 2.6, 2.7 and 3.10 • Mac OS X 10.(8,9,10,11) and moving to 10.12 • Windows 10 (32 and 64-bit) – VS 2015 and Intel Fortran v.16 • Windows 7 (32 and 64-bit) – VS 2013 and Intel Fortran v.15 • Cygwin 32-bit • SunOS 5.11 (32 and 64-bit) • PowerPC 64 • Different Linux distributions (Fedora, Suse, Debian) • Anything missing?
  • 16. DM_PPT_NP_v02 16 Compression Library • HDF5 compression filters (plugins) • Dynamically loaded at run-time – BZIP2 (PyTables, Pandas) – MAFISC – BLOSC (PyTables, Pandas) – LZ4 (h5py) – More filters are coming…. • Contact help@hdfgroup.org if interested to try
  • 18. DM_PPT_NP_v02 18 This work was supported by NASA/GSFC under Raytheon Co. contract number NNG15HZ39C

Notas del editor

  1. HDF – Hierarchical Data Format (Version 4 and Version 5) A free and open source (BSD license) General purpose platform for storing, managing, archiving, and exchanging data Extensive facilities for data and metadata association, hierarchies, and annotation A self describing file format that is portable across operating systems and architectures, and that supports flexible user defined types A software library for high I/O performance, parallel I/O and out of core data access (partial I/O), which supports compression and other custom filters High quality documentation A responsive helpdesk and active users’ forum for community based support The HDF Group is a not for profit corporation whose mission is to ensure the long term accessibility to HDF data through the sustainable development and support of HDF technologies. The HDF Group is dedicated to evolving HDF technologies to serve the needs of users in ever changing computational environments, while at the same time maintaining its commitment to ensure the accessibility of data stored in HDF for the coming decades, even centuries. The HDF project started at NCSA and the University of Illinois in 1987. The HDF Group completed its transition to an independent corporation in mid 2006.
  2. Use when no decoding is necessary, for example, when rewriting the data from one file to another
  3. The HDF5 library's metadata cache is fairly conservative about holding on to HDF5 object metadata (object headers, chunk index structures, etc.), which can cause the cache size to grow, resulting in memory pressure on an application or system. The "evict on close" property will cause all metadata for an object to be evicted from the cache as long as metadata is not referenced from any other open object. See the Fine Tuning the Metadata Cache documentation for information on the APIs. The current HDF5 file space allocation accumulates small pieces of metadata and raw data in aggregator blocks which are not page aligned and vary widely in sizes. The paged aggregation feature was implemented to provide efficient paged access of these small pieces of metadata and raw data. See the RFC for details. Also, see the File Space Management documentation. Small and random I/O accesses on parallel file systems result in poor performance for applications. Page buffering in conjunction with paged aggregation can improve performance by giving an application control of minimizing HDF5 I/O requests to a specific granularity and alignment. See the RFC for details. Also, see the Page Buffering documentation.