SlideShare a Scribd company logo
1 of 18
ISO 19157 – Questions and
Answers
Ted Habermann
Director of Earth Science
The HDF Group
thabermann@hdfgroup.org
1
The ISO Data Quality Metadata standards
are evolving.
Data Quality metadata has moved from
ISO 19115 to ISO 19157.
Evolution is good.
The Big Picture
ISO 19157 is a conceptual model
of data quality that was recently
approved as an international
standard. It combines concepts
from three older standards into a
unified conceptual model for
describing data quality.
Many of the principle elements
of this conceptual model are
abstract, and can be
implemented in several ways.
The Big Picture
ISO 19157 is a conceptual model
of data quality that was recently
approved as an international
standard. It combines concepts
from three older standards into a
unified conceptual model for
describing data quality.
Many of the principle elements
of this conceptual model are
abstract, they can be
implemented in several ways.
When only the abstract concepts
are considered, the model is very
simple.
Data Quality Scope
4
“The quality of my data vary in time and space and different parameters have
different quality measures and results.”
ISO quality reports all include descriptions of temporal and spatial extents and
elements of the data set that they pertain to. You can say things like:
Between 2001 and 2002 the quality of the data in the northern hemisphere …
or
The data collected by this sensor degraded during June 2011 because…
or
Quality information for this parameter is in this variable…
<<DataType>>
DQ_Scope
+ level : MD_ScopeCode
+ extent [0..1] : EX_Extent
+ levelDescription [0..*] :MD_ScopeDescription
Stand Alone Quality Reports
5
“There are papers and web pages that describe the quality of my data.”
Papers and reports that describe data quality are StandAloneReports.
Metadata can include brief descriptions of the results (abstracts) and
references to any number of these (citations).
DQ_StandaloneQualityReportInformation
+ reportReference: CI_Citation
+ abstract : CharacterString
Abstract: The fire training-set may also have been biased
against savanna and savanna woodland fires since their
detection is more difficult than in humid, forest environments
with cool background temperatures [Malingreau, 1990]. There
may, therefore, be an under-sampling of fires in these warmer
background environments. Citation: Malingreau J.P, 1990, The contribution
of remote sensing to the global monitoring of
fires in tropical and subtropical ecosystems. In:
Fire in Tropical Biota, (J.G. Goldammer , editor),
Springer Verlag , Berlin: 337-370.
What is a Data Quality Element?
6
Data
Quality
Element
Measure
Result
Method
QA_PercentMissingData
Number of Pixels with Missing Flags
Total Number of Pixels
15%
What Are Quality Measures?
“My metadata already include data quality measures .”
ECHO includes two types of quality measures.
What Are Quality Measures?
“I use consistent Quality Measures across many products.”
QAStats – Standard measures for all products
QAPercentMissingData - Granule level % missing data. This attribute can be
repeated for individual parameters within a granule.
QAPercentOutOfBoundsData – Granule level % out of bounds data. This attribute
can be repeated for individual parameters within a granule.
QAPercentInterpolatedData – Granule level % interpolated data. This attribute
can be repeated for individual parameters within a granule.
QAPercentCloudCover – This attribute is used to characterize the cloud cover
amount of a granule. This attribute may be repeated for individual parameters
within a granule. (Note - there may be more than one way to define a cloud or it's
effects within a product containing several parameters; i.e. this attribute may be
parameter specific)
ECHO includes two types of quality measures.
What Are Quality Measures?
“I use consistent types of Quality Measure across many products.”
QAFlags – Classes of quality measures with product specific implementations
AutomaticQualityFlag – The granule level flag applying generally to the granule and specifically to
parameters the granule level. When applied to parameter, the flag refers to the quality of that parameter
for the granule (as applicable). The parameters determining whether the flag is set are defined by the
developer and documented in the Quality Flag Explanation.
AutomaticQualityFlagExplanation – A text explanation of the criteria used to set automatic quality flag,
including thresholds or other criteria.
OperationalQualityFlag – The granule level flag applying both generally to a granule and specifically to
parameters at the granule level. When applied to parameter, the flag refers to the quality of that
parameter for the granule (as applicable). The parameters determining whether the flag is set are defined
by the developers and documented in the Operational Quality Flag Explanation.
OperationalQualityFlagExplanation – A text explanation of the criteria used to set operational quality
flag; including thresholds or other criteria.
ScienceQualityFlag – Granule level flag applying to a granule, and specifically to parameters. When
applied to parameter, the flag refers to the quality of that parameter for the granule (as applicable). The
parameters determining whether the flag is set are defined by the developers and documented in the
Science Quality Flag Explanation.
ScienceQualityFlagExplanation – A text explanation of the criteria used to set science quality flag;
including thresholds or other criteria.
<<Abstract>>
DQ_Element
Data Quality Measures
DQM_Measure
+ measureIdentifier : MD_Identifier
+ name : CharacterString
+ alias [0..*] : CharacterString
+ sourceReference [0..*] : CI_Citation
+ elementName [1..*] : TypeName
+ definition : CharacterString
+ description [0..1] : DQM_Description
+ valueType : TypeName
+ valueStructure [0..1] : DQM_ValueStructure
+ example [0..*] : DQM_Description
DQM_BasicMeasure
+ name : CharacterString
+ definition : CharacterString
+ example : DQM_Description [0..1]
+ valueType : TypeName
<<CodeList>>
DQ_ValueStructure
+ bag
+ set
+ sequence
+ table
+ matrix
+ coverage
DQM_Parameter
+ name : CharacterString
+ definition : CharacterString
+ description: DQM_Description [0..1]
+ valueType : TypeName
+ valueStructure [0..1] : DQM_ValueStructure
DQM_Description
+ textDescription: CharacterString
+ extendedDescription [0..1] : MD_BrowseGraphic
DQ_MeasureReference
+ measureIdentification [0..1] : MD_Identifier
+ nameOfMeasure [0..*] : CharacterString
+ measureDescription [0..1] : CharacterString
if measureIdentification is not provided,
then nameOfMeasure shall be provided
+ measure 0..1
ISO 19157 includes a DQ_MeasureReference designed to provide a connection to a
detailed description of the quality measure.
“My data quality measures are consistently described in a database .”
Data Quality Measures
11
“I need to clearly and consistently explain how I measure quality.”
The ISO model for quality measures includes identifiers, definitions, descriptions,
references and illustrations.
Modular Data Quality Information
12
“My data quality information exists in databases or web services.”
Major elements of the 19157 conceptual model are separate components that can be
independently connected to the metadata and reused in multiple records.
Results
Measures
Methods
Data Quality Results
13
“My metadata currently includes descriptions of the quality of my data.”
These descriptions can be included in 19157 metadata as descriptive reports.
<Quality>
Due to the lack of high resolution data available over the region for
1993-94, it has been hard to validate the product. However the maps of
burnt areas correspond well with active fire maps for the
region. Where large [>3km] scars are found, the detection is more
reliable. In areas of small scars more problems are involved. It is
hoped that the 1994-95 data set will cover the whole of the study area
and be calibrated by high resolution data.
</Quality>
DQ_DescriptiveResult
+ statement : CharacterString
<gco:CharacterString>
Due to the lack of high resolution data available over the region for
1993-94, it has been hard to validate the product. However the maps of
burnt areas correspond well with active fire maps for the
region. Where large [>3km] scars are found, the detection is more
reliable. In areas of small scars more problems are involved. It is
hoped that the 1994-95 data set will cover the whole of the study area
and be calibrated by high resolution data.
</gco:CharacterString>
Data Usage
14
“Users increase our understanding of data quality. We need to keep them in
the loop.”
MD_Usage
+ specificUsage : CharacterString
+ usageDateTime [0..1] : DateTime
+ userDeterminedLimitations [0..1] : CharacterString
+ userContactInfo [1..*] : CI_ResponsibleParty
+ response [0..*] : CharacterString
+ additionalDocumentation [0..*] : CI_Citation
+ identifiedIssues [0..1] : CI_Citation
International Standard
15
“Users all over the world need to use and understand the quality of my data.”
Summary
“There are papers and web pages
that describe the quality of my data.”
“My data quality information
exists in databases or web
services.”
“I use consistent Quality
Measures across many
products.”
“I use consistent types of
Quality Measure across
many products.”
“I need to clearly and consistently
explain how I measure quality.”
“My metadata currently includes
descriptions of the quality of my
data.”
“Users increase our
understanding of data quality. We
need to keep them in the loop.”
“The quality of my data vary in time and
space and different parameters have
different quality measures and results.”
“Users all over the world need to use and
understand the quality of my data.”
17
Questions?
tedhabermann@hdfgroup.org
Acknowledgements
This work was partially supported by contract number NNG10HP02C from NASA.
Any opinions, findings, conclusions, or recommendations expressed in this material are
those of the author and do not necessarily reflect the views of NASA or The HDF Group.

More Related Content

Viewers also liked

Spatial data infrastructure in the cloud, 2011
Spatial data infrastructure in the cloud, 2011Spatial data infrastructure in the cloud, 2011
Spatial data infrastructure in the cloud, 2011
Moullet
 
Building Spatial Data Infrastructures for Spatial Planning in Africa: Lagos e...
Building Spatial Data Infrastructures for Spatial Planning in Africa: Lagos e...Building Spatial Data Infrastructures for Spatial Planning in Africa: Lagos e...
Building Spatial Data Infrastructures for Spatial Planning in Africa: Lagos e...
Samuel Dekolo
 
Spatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNodeSpatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNode
Sebastian Benthall
 

Viewers also liked (9)

Prezentare ESRI
Prezentare ESRIPrezentare ESRI
Prezentare ESRI
 
The building of a European Spatial Data Infrastructure , INSPIRE
The building of a European Spatial Data Infrastructure,  INSPIREThe building of a European Spatial Data Infrastructure,  INSPIRE
The building of a European Spatial Data Infrastructure , INSPIRE
 
European Spatial Data Infrastructure - INSPIRE and beyond
European Spatial Data Infrastructure - INSPIRE and beyond European Spatial Data Infrastructure - INSPIRE and beyond
European Spatial Data Infrastructure - INSPIRE and beyond
 
National Spatial Data Infrastructure (NSDI)
National Spatial Data Infrastructure (NSDI)National Spatial Data Infrastructure (NSDI)
National Spatial Data Infrastructure (NSDI)
 
The Loading and Maintenance of Cadastral Data in the Spatial Data Infrastruct...
The Loading and Maintenance of Cadastral Data in the Spatial Data Infrastruct...The Loading and Maintenance of Cadastral Data in the Spatial Data Infrastruct...
The Loading and Maintenance of Cadastral Data in the Spatial Data Infrastruct...
 
“Cadastral Maps for Socio-Economic Data Visualization and Integration for Lan...
“Cadastral Maps for Socio-Economic Data Visualization and Integration for Lan...“Cadastral Maps for Socio-Economic Data Visualization and Integration for Lan...
“Cadastral Maps for Socio-Economic Data Visualization and Integration for Lan...
 
Spatial data infrastructure in the cloud, 2011
Spatial data infrastructure in the cloud, 2011Spatial data infrastructure in the cloud, 2011
Spatial data infrastructure in the cloud, 2011
 
Building Spatial Data Infrastructures for Spatial Planning in Africa: Lagos e...
Building Spatial Data Infrastructures for Spatial Planning in Africa: Lagos e...Building Spatial Data Infrastructures for Spatial Planning in Africa: Lagos e...
Building Spatial Data Infrastructures for Spatial Planning in Africa: Lagos e...
 
Spatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNodeSpatial Data Infrastructure Best Practices with GeoNode
Spatial Data Infrastructure Best Practices with GeoNode
 

Similar to 19157 Questions and Answers

Quality management in software engineering
Quality management in software engineeringQuality management in software engineering
Quality management in software engineering
selinasimpson1201
 
TejGaurThesis
TejGaurThesisTejGaurThesis
TejGaurThesis
Tej Gaur
 
cdiscsdtmtrainingpresentation-12900042423172-phpapp01.ppt
cdiscsdtmtrainingpresentation-12900042423172-phpapp01.pptcdiscsdtmtrainingpresentation-12900042423172-phpapp01.ppt
cdiscsdtmtrainingpresentation-12900042423172-phpapp01.ppt
poonamshukla311
 
Open06
Open06Open06
Open06
butest
 

Similar to 19157 Questions and Answers (20)

Software metrics
Software metricsSoftware metrics
Software metrics
 
Quality key users
Quality key usersQuality key users
Quality key users
 
Software Product Measurement and Analysis in a Continuous Integration Environ...
Software Product Measurement and Analysis in a Continuous Integration Environ...Software Product Measurement and Analysis in a Continuous Integration Environ...
Software Product Measurement and Analysis in a Continuous Integration Environ...
 
IRJET- American Sign Language Classification
IRJET- American Sign Language ClassificationIRJET- American Sign Language Classification
IRJET- American Sign Language Classification
 
Multi-modal sources for predictive modeling using deep learning
Multi-modal sources for predictive modeling using deep learningMulti-modal sources for predictive modeling using deep learning
Multi-modal sources for predictive modeling using deep learning
 
Quality management in software engineering
Quality management in software engineeringQuality management in software engineering
Quality management in software engineering
 
AIRLINE FARE PRICE PREDICTION
AIRLINE FARE PRICE PREDICTIONAIRLINE FARE PRICE PREDICTION
AIRLINE FARE PRICE PREDICTION
 
Slides chapter 15
Slides chapter 15Slides chapter 15
Slides chapter 15
 
software engineering module i & ii.pptx
software engineering module i & ii.pptxsoftware engineering module i & ii.pptx
software engineering module i & ii.pptx
 
A Firefly based improved clustering algorithm
A Firefly based improved clustering algorithmA Firefly based improved clustering algorithm
A Firefly based improved clustering algorithm
 
TejGaurThesis
TejGaurThesisTejGaurThesis
TejGaurThesis
 
Classification on multi label dataset using rule mining technique
Classification on multi label dataset using rule mining techniqueClassification on multi label dataset using rule mining technique
Classification on multi label dataset using rule mining technique
 
churn_detection.pptx
churn_detection.pptxchurn_detection.pptx
churn_detection.pptx
 
Survey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy AlgorithmsSurvey paper on Big Data Imputation and Privacy Algorithms
Survey paper on Big Data Imputation and Privacy Algorithms
 
cdiscsdtmtrainingpresentation-12900042423172-phpapp01.ppt
cdiscsdtmtrainingpresentation-12900042423172-phpapp01.pptcdiscsdtmtrainingpresentation-12900042423172-phpapp01.ppt
cdiscsdtmtrainingpresentation-12900042423172-phpapp01.ppt
 
TERM DEPOSIT SUBSCRIPTION PREDICTION
TERM DEPOSIT SUBSCRIPTION PREDICTIONTERM DEPOSIT SUBSCRIPTION PREDICTION
TERM DEPOSIT SUBSCRIPTION PREDICTION
 
Mind Map Test Data Management Overview
Mind Map Test Data Management OverviewMind Map Test Data Management Overview
Mind Map Test Data Management Overview
 
Open06
Open06Open06
Open06
 
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
IRJET- Empower Syntactic Exploration Based on Conceptual Graph using Searchab...
 
The interoperability challenges of 3D personal data
The interoperability challenges of 3D personal dataThe interoperability challenges of 3D personal data
The interoperability challenges of 3D personal data
 

More from Ted Habermann

The HDF Product Designer – Interoperability in the First Mile
The HDF Product Designer – Interoperability in the First MileThe HDF Product Designer – Interoperability in the First Mile
The HDF Product Designer – Interoperability in the First Mile
Ted Habermann
 
Hdf Augmentation: Interoperability in the Last Mile
Hdf Augmentation: Interoperability in the Last MileHdf Augmentation: Interoperability in the Last Mile
Hdf Augmentation: Interoperability in the Last Mile
Ted Habermann
 

More from Ted Habermann (12)

Translation proofing
Translation proofingTranslation proofing
Translation proofing
 
New ISO Geospatial Metadata Capabilities
New ISO Geospatial Metadata CapabilitiesNew ISO Geospatial Metadata Capabilities
New ISO Geospatial Metadata Capabilities
 
The HDF Product Designer – Interoperability in the First Mile
The HDF Product Designer – Interoperability in the First MileThe HDF Product Designer – Interoperability in the First Mile
The HDF Product Designer – Interoperability in the First Mile
 
Hdf Augmentation: Interoperability in the Last Mile
Hdf Augmentation: Interoperability in the Last MileHdf Augmentation: Interoperability in the Last Mile
Hdf Augmentation: Interoperability in the Last Mile
 
Citations in ISO Metadata
Citations in ISO MetadataCitations in ISO Metadata
Citations in ISO Metadata
 
Translation proofing
Translation proofingTranslation proofing
Translation proofing
 
Wikis, Rubrics and Views: An Integrated Approach to Improving Documentation
Wikis, Rubrics and Views: An Integrated Approach to Improving DocumentationWikis, Rubrics and Views: An Integrated Approach to Improving Documentation
Wikis, Rubrics and Views: An Integrated Approach to Improving Documentation
 
Hdf Inside
Hdf InsideHdf Inside
Hdf Inside
 
Metadata Evaluation and Improvement
Metadata Evaluation and ImprovementMetadata Evaluation and Improvement
Metadata Evaluation and Improvement
 
Metadata For Humans and Machines
Metadata For Humans and MachinesMetadata For Humans and Machines
Metadata For Humans and Machines
 
ESDIS Metadata Archive
ESDIS Metadata ArchiveESDIS Metadata Archive
ESDIS Metadata Archive
 
Earth Science Platform
Earth Science PlatformEarth Science Platform
Earth Science Platform
 

Recently uploaded

Tuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notesTuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notes
jyothisaisri
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
University of Hertfordshire
 

Recently uploaded (20)

Information science research with large language models: between science and ...
Information science research with large language models: between science and ...Information science research with large language models: between science and ...
Information science research with large language models: between science and ...
 
THE GENERAL PROPERTIES OF PROTEOBACTERIA AND ITS TYPES
THE GENERAL PROPERTIES OF PROTEOBACTERIA AND ITS TYPESTHE GENERAL PROPERTIES OF PROTEOBACTERIA AND ITS TYPES
THE GENERAL PROPERTIES OF PROTEOBACTERIA AND ITS TYPES
 
The Scientific names of some important families of Industrial plants .pdf
The Scientific names of some important families of Industrial plants .pdfThe Scientific names of some important families of Industrial plants .pdf
The Scientific names of some important families of Industrial plants .pdf
 
GBSN - Microbiology (Unit 7) Microbiology in Everyday Life
GBSN - Microbiology (Unit 7) Microbiology in Everyday LifeGBSN - Microbiology (Unit 7) Microbiology in Everyday Life
GBSN - Microbiology (Unit 7) Microbiology in Everyday Life
 
POST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptx
POST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptxPOST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptx
POST TRANSCRIPTIONAL GENE SILENCING-AN INTRODUCTION.pptx
 
TEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdfTEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdf
 
Tuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notesTuberculosis (TB)-Notes.pdf microbiology notes
Tuberculosis (TB)-Notes.pdf microbiology notes
 
Heads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdfHeads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdf
 
In-pond Race way systems for Aquaculture (IPRS).pptx
In-pond Race way systems for Aquaculture (IPRS).pptxIn-pond Race way systems for Aquaculture (IPRS).pptx
In-pond Race way systems for Aquaculture (IPRS).pptx
 
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdfFORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
 
ERTHROPOIESIS: Dr. E. Muralinath & R. Gnana Lahari
ERTHROPOIESIS: Dr. E. Muralinath & R. Gnana LahariERTHROPOIESIS: Dr. E. Muralinath & R. Gnana Lahari
ERTHROPOIESIS: Dr. E. Muralinath & R. Gnana Lahari
 
NuGOweek 2024 programme final FLYER short.pdf
NuGOweek 2024 programme final FLYER short.pdfNuGOweek 2024 programme final FLYER short.pdf
NuGOweek 2024 programme final FLYER short.pdf
 
MSCII_ FCT UNIT 5 TOXICOLOGY.pdf
MSCII_              FCT UNIT 5 TOXICOLOGY.pdfMSCII_              FCT UNIT 5 TOXICOLOGY.pdf
MSCII_ FCT UNIT 5 TOXICOLOGY.pdf
 
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
 
EU START PROJECT. START-Newsletter_Issue_4.pdf
EU START PROJECT. START-Newsletter_Issue_4.pdfEU START PROJECT. START-Newsletter_Issue_4.pdf
EU START PROJECT. START-Newsletter_Issue_4.pdf
 
RACEMIzATION AND ISOMERISATION completed.pptx
RACEMIzATION AND ISOMERISATION completed.pptxRACEMIzATION AND ISOMERISATION completed.pptx
RACEMIzATION AND ISOMERISATION completed.pptx
 
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
 
Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
Emergent ribozyme behaviors in oxychlorine brines indicate a unique niche for...
 
Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...
Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...
Extensive Pollution of Uranus and Neptune’s Atmospheres by Upsweep of Icy Mat...
 

19157 Questions and Answers

  • 1. ISO 19157 – Questions and Answers Ted Habermann Director of Earth Science The HDF Group thabermann@hdfgroup.org 1 The ISO Data Quality Metadata standards are evolving. Data Quality metadata has moved from ISO 19115 to ISO 19157. Evolution is good.
  • 2. The Big Picture ISO 19157 is a conceptual model of data quality that was recently approved as an international standard. It combines concepts from three older standards into a unified conceptual model for describing data quality. Many of the principle elements of this conceptual model are abstract, and can be implemented in several ways.
  • 3. The Big Picture ISO 19157 is a conceptual model of data quality that was recently approved as an international standard. It combines concepts from three older standards into a unified conceptual model for describing data quality. Many of the principle elements of this conceptual model are abstract, they can be implemented in several ways. When only the abstract concepts are considered, the model is very simple.
  • 4. Data Quality Scope 4 “The quality of my data vary in time and space and different parameters have different quality measures and results.” ISO quality reports all include descriptions of temporal and spatial extents and elements of the data set that they pertain to. You can say things like: Between 2001 and 2002 the quality of the data in the northern hemisphere … or The data collected by this sensor degraded during June 2011 because… or Quality information for this parameter is in this variable… <<DataType>> DQ_Scope + level : MD_ScopeCode + extent [0..1] : EX_Extent + levelDescription [0..*] :MD_ScopeDescription
  • 5. Stand Alone Quality Reports 5 “There are papers and web pages that describe the quality of my data.” Papers and reports that describe data quality are StandAloneReports. Metadata can include brief descriptions of the results (abstracts) and references to any number of these (citations). DQ_StandaloneQualityReportInformation + reportReference: CI_Citation + abstract : CharacterString Abstract: The fire training-set may also have been biased against savanna and savanna woodland fires since their detection is more difficult than in humid, forest environments with cool background temperatures [Malingreau, 1990]. There may, therefore, be an under-sampling of fires in these warmer background environments. Citation: Malingreau J.P, 1990, The contribution of remote sensing to the global monitoring of fires in tropical and subtropical ecosystems. In: Fire in Tropical Biota, (J.G. Goldammer , editor), Springer Verlag , Berlin: 337-370.
  • 6. What is a Data Quality Element? 6 Data Quality Element Measure Result Method QA_PercentMissingData Number of Pixels with Missing Flags Total Number of Pixels 15%
  • 7. What Are Quality Measures? “My metadata already include data quality measures .” ECHO includes two types of quality measures.
  • 8. What Are Quality Measures? “I use consistent Quality Measures across many products.” QAStats – Standard measures for all products QAPercentMissingData - Granule level % missing data. This attribute can be repeated for individual parameters within a granule. QAPercentOutOfBoundsData – Granule level % out of bounds data. This attribute can be repeated for individual parameters within a granule. QAPercentInterpolatedData – Granule level % interpolated data. This attribute can be repeated for individual parameters within a granule. QAPercentCloudCover – This attribute is used to characterize the cloud cover amount of a granule. This attribute may be repeated for individual parameters within a granule. (Note - there may be more than one way to define a cloud or it's effects within a product containing several parameters; i.e. this attribute may be parameter specific) ECHO includes two types of quality measures.
  • 9. What Are Quality Measures? “I use consistent types of Quality Measure across many products.” QAFlags – Classes of quality measures with product specific implementations AutomaticQualityFlag – The granule level flag applying generally to the granule and specifically to parameters the granule level. When applied to parameter, the flag refers to the quality of that parameter for the granule (as applicable). The parameters determining whether the flag is set are defined by the developer and documented in the Quality Flag Explanation. AutomaticQualityFlagExplanation – A text explanation of the criteria used to set automatic quality flag, including thresholds or other criteria. OperationalQualityFlag – The granule level flag applying both generally to a granule and specifically to parameters at the granule level. When applied to parameter, the flag refers to the quality of that parameter for the granule (as applicable). The parameters determining whether the flag is set are defined by the developers and documented in the Operational Quality Flag Explanation. OperationalQualityFlagExplanation – A text explanation of the criteria used to set operational quality flag; including thresholds or other criteria. ScienceQualityFlag – Granule level flag applying to a granule, and specifically to parameters. When applied to parameter, the flag refers to the quality of that parameter for the granule (as applicable). The parameters determining whether the flag is set are defined by the developers and documented in the Science Quality Flag Explanation. ScienceQualityFlagExplanation – A text explanation of the criteria used to set science quality flag; including thresholds or other criteria.
  • 10. <<Abstract>> DQ_Element Data Quality Measures DQM_Measure + measureIdentifier : MD_Identifier + name : CharacterString + alias [0..*] : CharacterString + sourceReference [0..*] : CI_Citation + elementName [1..*] : TypeName + definition : CharacterString + description [0..1] : DQM_Description + valueType : TypeName + valueStructure [0..1] : DQM_ValueStructure + example [0..*] : DQM_Description DQM_BasicMeasure + name : CharacterString + definition : CharacterString + example : DQM_Description [0..1] + valueType : TypeName <<CodeList>> DQ_ValueStructure + bag + set + sequence + table + matrix + coverage DQM_Parameter + name : CharacterString + definition : CharacterString + description: DQM_Description [0..1] + valueType : TypeName + valueStructure [0..1] : DQM_ValueStructure DQM_Description + textDescription: CharacterString + extendedDescription [0..1] : MD_BrowseGraphic DQ_MeasureReference + measureIdentification [0..1] : MD_Identifier + nameOfMeasure [0..*] : CharacterString + measureDescription [0..1] : CharacterString if measureIdentification is not provided, then nameOfMeasure shall be provided + measure 0..1 ISO 19157 includes a DQ_MeasureReference designed to provide a connection to a detailed description of the quality measure. “My data quality measures are consistently described in a database .”
  • 11. Data Quality Measures 11 “I need to clearly and consistently explain how I measure quality.” The ISO model for quality measures includes identifiers, definitions, descriptions, references and illustrations.
  • 12. Modular Data Quality Information 12 “My data quality information exists in databases or web services.” Major elements of the 19157 conceptual model are separate components that can be independently connected to the metadata and reused in multiple records. Results Measures Methods
  • 13. Data Quality Results 13 “My metadata currently includes descriptions of the quality of my data.” These descriptions can be included in 19157 metadata as descriptive reports. <Quality> Due to the lack of high resolution data available over the region for 1993-94, it has been hard to validate the product. However the maps of burnt areas correspond well with active fire maps for the region. Where large [>3km] scars are found, the detection is more reliable. In areas of small scars more problems are involved. It is hoped that the 1994-95 data set will cover the whole of the study area and be calibrated by high resolution data. </Quality> DQ_DescriptiveResult + statement : CharacterString <gco:CharacterString> Due to the lack of high resolution data available over the region for 1993-94, it has been hard to validate the product. However the maps of burnt areas correspond well with active fire maps for the region. Where large [>3km] scars are found, the detection is more reliable. In areas of small scars more problems are involved. It is hoped that the 1994-95 data set will cover the whole of the study area and be calibrated by high resolution data. </gco:CharacterString>
  • 14. Data Usage 14 “Users increase our understanding of data quality. We need to keep them in the loop.” MD_Usage + specificUsage : CharacterString + usageDateTime [0..1] : DateTime + userDeterminedLimitations [0..1] : CharacterString + userContactInfo [1..*] : CI_ResponsibleParty + response [0..*] : CharacterString + additionalDocumentation [0..*] : CI_Citation + identifiedIssues [0..1] : CI_Citation
  • 15. International Standard 15 “Users all over the world need to use and understand the quality of my data.”
  • 16. Summary “There are papers and web pages that describe the quality of my data.” “My data quality information exists in databases or web services.” “I use consistent Quality Measures across many products.” “I use consistent types of Quality Measure across many products.” “I need to clearly and consistently explain how I measure quality.” “My metadata currently includes descriptions of the quality of my data.” “Users increase our understanding of data quality. We need to keep them in the loop.” “The quality of my data vary in time and space and different parameters have different quality measures and results.” “Users all over the world need to use and understand the quality of my data.”
  • 18. Acknowledgements This work was partially supported by contract number NNG10HP02C from NASA. Any opinions, findings, conclusions, or recommendations expressed in this material are those of the author and do not necessarily reflect the views of NASA or The HDF Group.