SlideShare a Scribd company logo
1 of 54
Download to read offline
Getting Data Quality Right
Good data is like good water: best served fresh, and ideally well-
filtered. Data management strategies can produce tremendous
procedural improvements and increased profit margins across the
board, but only if the data being managed is of high quality.
Determining how data quality should be engineered provides a useful
framework for utilizing data quality management effectively in support
of business strategy. This, in turn, allows for speedy identification of
business problems, delineation between structural and practice-
oriented defects in data management, and proactive prevention of
future issues. Organizations must realize what it means to utilize
data quality engineering in support of business strategy. This webinar
will illustrate how organizations with chronic business challenges
often can trace the root of the problem to poor data quality. Showing
how data quality should be engineered provides a useful framework
in which to develop an effective approach. This in turn allows
organizations to more quickly identify business problems as well as
data problems caused by structural issues versus practice-oriented
defects and prevent these from re-occurring.
Program Learning Objectives
• Help you understand foundational data quality concepts
for improving data quality at your organization
• Demonstrate how chronic business challenges for
organizations are often rooted in poor data quality
• Share case studies illustrating the hallmarks and
benefits of data quality success
© Copyright 2022 by Peter Aiken Slide # 1
https://anythingawesome.com
Date: 13 September 2022 | Time: 19:00 UTC (2:00 PM NYC) | Presented by: Peter Aiken, PhD
© Copyright 2022 by Peter Aiken Slide # 2
https://anythingawesome.com
Collibra | Confidential
Delivering Trusted Data:
Collibra's Modern Approach To Data Quality
2
Consider the following
features as we demonstrate
them in action…
How would you
manage this
company’s
data?
The data is created by your front office…
• Sales representatives record the transactions
• They have flexibility in their own software
The data is in multiple places…
• Your “controls” separated the data
• You have sales data and financial transactions
Everyone up to CEO needs that data…
• What could possibly go wrong?
• How will you discover it?
Human Error
Technical Error
No Error
Collibra | Confidential 3
Is our data
quality getting
better or worse
over time?
How do I identify
sensitive data
before running DQ
checks?
Where are my
data
inconsistencies?
Who owns
this data
issue...
How can I
trust this
report or
dashboard?
Our data producers
and data consumers
operate in silos...
What is the
quality of the
data behind my
models and
reports?
Problem: Managing enterprise-scale data quality is a challenge
BI Analyst/
Data
Scientist
Everyday
Employee
Management/
Leadership
Collibra | Confidential
4
50-70% 15-25%
lost revenue for
most companies
due to bad data.
average time spent
on manual rule
writing and
management.
Sources:
IDC and Gartner 2020, Tech Jury forecast, 2021
Assessing Data Quality: A Managerial Call to Action (May 2020, HBR)
Seizing Opportunity in Data Quality (MIT Sloan Management Review, 2017)
$1.9B
projected data
quality spend
in 2021.
Problem: This leads to real costs across your organization
47%
of recently created
data records have
at least one
critical error.
5
Collibra Data Quality & Observability Joy of automation
• Discover data
quality issues
• “Feels” statistical,
not code
• Scales as easy a
identify, scan,
discover, train
Intelligent
Auto-generate
explainable and
adaptive DQ
rules
Scalable
Scan large and
diverse databases,
files and
streaming data
Self-service
Empower users
with a unified
scoring system and
flexible rule
management
Workflow
Good Notifications
Standardize Rules
Good Scale
Automatically detect change
Good Rule Logic
Profile your data consistently
Good Metadata
Integrate across
multiple data sources
Good Data
minimal viable
data trust
Data Quality &
Observability
'Stack’
Collibra Data Quality &
Observability automates
static observations and
technical rules while
scaling complex business
rules. Consider solutions
in the market that
combine the three
trending features in
modern DQ - advanced
autometrics, rule
templating, and data
discovery/enforcement.
Chief Digital Manager @ Dataversity.Net
© Copyright 2022 by Peter Aiken Slide # 3
https://anythingawesome.com
Shannon Kempe
Two Most Commonly
Asked Questions
© Copyright 2022 by Peter Aiken Slide # 4
https://anythingawesome.com
1) Will I get copies of the
slides after the event?
2) Is this being recorded?
Peter Aiken, Ph.D.
• I've been doing this a long time
• My work is recognized as useful
• Associate Professor of IS (vcu.edu)
• Institute for Defense Analyses (ida.org)
• DAMA International (dama.org)
• MIT CDO Society (iscdo.org)
• Anything Awesome (plusanythingawesome.com)
• Experienced w/ 500+ data
management practices worldwide
• Multi-year immersions
– US DoD (DISA/Army/Marines/DLA)
– Nokia
– Deutsche Bank
– Wells Fargo
– Walmart …
• 12 books and
dozens of articles
© Copyright 2022 by Peter Aiken Slide # 5
https://anythingawesome.com
+
• DAMA International President 2009-2013/2018/2020
• DAMA International Achievement Award 2001
(with Dr. E. F. "Ted" Codd
• DAMA International Community Award 2005
Event Pricing
© Copyright 2022 by Peter Aiken Slide # 6
https://anythingawesome.com
• 20% off
directly from the publisher
on select titles
• My Book Store @
http://anythingawesome.com
• Enter the code
"anythingawesome" at the
Technics bookstore
checkout where it says to
"Apply Coupon"
anythingawesome
Getting Data
Quality Right
© Copyright 2022 by Peter Aiken Slide # 7
peter.aiken@anythingawesome.com +1.804.382.5957 Peter Aiken, PhD
Engineering Business Success Stories
© Copyright 2022 by Peter Aiken Slide # 8
https://anythingawesome.com
Getting Data Quality Right
Engineering Success Stories
Program
• Approaching Data Quality
– Definitions
– Causes can be difficult to discern
– Data quality challenges are the
root cause of most IT and
business failures
– Must be built on leverage
– Requires a programmatic
approach to be most effective
– Early business cases
often have a dual purpose
– High quality data requires
architecture and engineering
• What do we need to get
better at?
– Systems thinking
– Not looking at data
quality in isolation
– Understanding data ROT
– Not underestimating
the role of culture
– Developing repeatable
capabilities/core data
quality expertise
• How do we get better?
– Refocus the request around
business outcomes
– Leadership
– Program focus
– Math (cost or investment?)
– Storytelling/Practice
• Takeaways and Q&A
© Copyright 2022 by Peter Aiken Slide #
https://anythingawesome.com 9
Getting Data Quality Right
Engineering Success Stories
Program
• Approaching Data Quality
– Definitions
– Causes can be difficult to discern
– Data quality challenges are the
root cause of most IT and
business failures
– Must be built on leverage
– Requires a programmatic
approach to be most effective
– Early business cases
often have a dual purpose
– High quality data requires
architecture and engineering
• What do we need to get
better at?
– Systems thinking
– Not looking at data
quality in isolation
– Understanding data ROT
– Not underestimating
the role of culture
– Developing repeatable
capabilities/core data
quality expertise
• How do we get better?
– Refocus the request around
business outcomes
– Leadership
– Program focus
– Math (cost or investment?)
– Storytelling/Practice
• Takeaways and Q&A
Famous 1990's Words?
• Question:
– Why haven't organizations taken a
more proactive approach to data quality?
• Answer:
– Fixing data quality problems is not easy
– It is dangerous -- they'll come after you
– Your efforts are likely to be misunderstood
– You could make things worse
– Now you get to fix it
• A single data quality
issue can grow
into a significant,
unexpected
investment
© Copyright 2022 by Peter Aiken Slide # 10
https://anythingawesome.com
What Does Everyone Want?
© Copyright 2022 by Peter Aiken Slide # 11
https://anythingawesome.com
DIGITAL
Digital Insight
• Subtract data from digital and what do you have?
• Subtract digital from data and you still have data
© Copyright 2022 by Peter Aiken Slide # 12
https://anythingawesome.com
https://www.linkedin.com/in/mark-johnson-518a752/
DIGITAL
DATA
?
DIGITAL
DATA
DATA
© Copyright 2022 by Peter Aiken Slide # 13
https://anythingawesome.com
https://anythingawesome.com/why.html
+
Definitions
• Quality Data
– Fit for purpose meets the requirements of its authors,
users, and administrators (from Martin Eppler)
– Synonymous with information quality, since poor data
quality results in inaccurate information and poor performance
• Data Quality Management
– "Planning, implementation and control activities that
apply quality management techniques to measure,
assess, improve, and ensure data quality"
– Encompasses life cycle activities
– Include supporting processes from change management, etc.
– Continuous improvement process requiring core capabilities
• Data Quality Engineering
– Recognition that data quality solutions cannot not managed
but, instead, must be engineered
– Data quality engineering concepts are generally not
known and understood within IT or business!
© Copyright 2022 by Peter Aiken Slide # 14
https://anythingawesome.com Spinach/Popeye story from: https://medium.com/hurb-labs/data-quality-a-lesson-from-the-myth-behind-popeye-the-sailor-a7bd50b61510
DQ Effort Pattern
© Copyright 2022 by Peter Aiken Slide # 15
https://anythingawesome.com
from The DAMA Guide to the Data Management Body of Knowledge © 2017 by DAMA International
80% time spent preparing
20% analysis time
Data Knowledge Is Too Little and Too Informal
• Data management happens 'pretty well' at
the workgroup level
– Defining characteristic of a workgroup
– Without guidance (strategy), what are the chances that
all workgroups are pulling toward the same objectives?
– Consider the time spent attempting informal practices
• Data chaff becomes sand in the machinery
– Preventing smooth interoperation and exchanges
– Losses due to lots of little data cuts have been
difficult to account for
• Organizations and individuals lack data quality
– Knowledge
– Skills
– Data Engineering (How?)
– Data Strategy (Why this as opposed to that?)
© Copyright 2022 by Peter Aiken Slide # 16
https://anythingawesome.com
Wally Easton Playing Piano
https://www.youtube.com/watch?v=NNbPxSvII-Q
Everyone Wants To Do Better Data Analysis …
• Some data preparation is inevitable
• What would a 'good' ratio be?
• "Everyone knows"
© Copyright 2022 by Peter Aiken Slide # 17
https://anythingawesome.com
0%
25%
50%
75%
100%
Data Analysis Data Preparation
20%
80%
Hidden Data Factories
© Copyright 2022 by Peter Aiken Slide # 18
https://anythingawesome.com
https://hbr.org/2016/09/bad-data-costs-the-u-s-3-trillion-per-year
Work products are
delivered to
Customers
Customers
Knowledge Workers
80% looking for stuff
20% doing useful work
Department B
1. Check A's work
2. Make any corrections
3. Complete B's work
4. Deliver to Department C
Department A
https://en.wikipedia.org/wiki/Theory_of_constraints
Department C
1. Check B's work
2. Make any corrections
3. Complete C's work
4. Deliver to Customer
5. Deal with consequences
Hidden Data Factories
© Copyright 2022 by Peter Aiken Slide # 19
https://anythingawesome.com
IT
Process
Business
Process
Business
Challenge
Root-cause analysis
reveals always
reveals a data
component!
Poor Data Quality Manifests as Multifaceted Organizational Challenges
© Copyright 2022 by Peter Aiken Slide # 20
https://anythingawesome.com
IT
System
Business
Challenge
Business
Challenge
Business
System
Business
Challenge
IT
Process
Business
Challenge
IT
System
Business
Challenge
Business
Process
Business
Challenge
Poor results
Business
Challenge
Consistency Encourages Quality Analysis
© Copyright 2022 by Peter Aiken Slide # 21
https://anythingawesome.com
IT
System
Business
Challenge
Business
Process
Business
Challenge
IT
Process
Business
System
Business
Challenge
IT
Process
Business
Challenge
IT
System
Business
Challenge
Business
Process
Business
Challenge
Data quality requires a
team with specialized
skills deployed to create
a repeatable process
and develop sustained
organizational skillsets
Events Are Not Always Recognized as Data Quality Challenges?
© Copyright 2022 by Peter Aiken Slide # 22
https://anythingawesome.com
• A letter from a bank
• A very expensive, very
small data rounding error
• Health data story
• The chocolate story
• Covid-19
© Copyright 2022 by Peter Aiken Slide # 23
https://anythingawesome.com
A congratulations
letter from another
bank
Problems
• Bank did not know it
made an error
• Tools alone could not
have prevented this
error
• Lost confidence in the
ability of the bank to
manage customer
funds
IEEE Senior Member Status (30+ Years Membership)
© Copyright 2022 by Peter Aiken Slide # 24
https://anythingawesome.com
© Copyright 2022 by Peter Aiken Slide # 25
https://anythingawesome.com
• Needed trench for electrical
cable 2.52" - delivered 2.5"
• $1M required to rent other
facilities while new cable is obtained
• Either rounding or truncation could explain
– We need to get a summary on all of this," he said.
"How did the mistake occur? Who's at fault? What
are the damages? And how is money going to be recovered?"
Port of Seattle
2.52" ➜ 2.5"
© Copyright 2022 by Peter Aiken Slide # 26
https://anythingawesome.com
Areyougonnatellusthechocolatestory-again?
© Copyright 2022 by Peter Aiken Slide # 27
https://anythingawesome.com
This research, published as a letter
this week in the British Medical
Journal, was meant to draw attention
to how much data gets entered
incorrectly in the country’s medical
system. These guys weren’t turning
up at the doctor for pregnancy-
related services. Instead, they were
at their doctor for procedures that
had medical codes similar to those of
midwifery and obstetric services.
With a misplaced keystroke here or
there, an annual physical could
become a consultation with a
midwife.
Why Britain has 17,000 pregnant men
here
keystroke
misplaced
there
© Copyright 2022 by Peter Aiken Slide # 28
https://anythingawesome.com
https://www.bbc.com/news/technology-54423988?es_p=12801491
Why Using Microsoft's Tool Caused Covid-19 Results To Be Lost
Why Using Microsoft's Tool Caused Covid-19 Results To Be Lost
© Copyright 2022 by Peter Aiken Slide # 29
https://anythingawesome.com
https://www.bbc.com/news/technology-54423988?es_p=12801491
• Since 2007 should have
been forced to use .xlsx
(1,000,000+ rows)
• Used .xls (65,000 rows)
• Additional data was
dropped without
notification
Practice-Oriented
• Failure in rigor when capturing/
manipulating data
• Allowing incorrect data to be collected
when requirements specify otherwise
• Presenting data out of sequence
Structure-Oriented
• Data arranged imperfectly
• Street address → GPS coordinates
• Data is captured but inaccessible
• When incorrect data is provided as the
correct response
Practice-oriented
activities focus on the
capture and manipulation
of data
Getting Data Quality Right Depends on Both
© Copyright 2022 by Peter Aiken Slide # 30
https://anythingawesome.com
Structure-oriented
activities focus on the
data implementation
Quality
"Fit for
purpose"
Data
Data Is Not Broadly or Widely Understood
© Copyright 2022 by Peter Aiken Slide # 31
https://anythingawesome.com
adapted from: http://www.dailymirror.lk/print/opinion/editorial-we-need-to-become-channels-of-peace/172-27164
It is like a fan!
It is like a snake!
It is like a wall!
It is like a rope!
It is like a tree! Blind Persons and the Elephant
It is like a story!
It is like a dashboard!
It is like a pipes!
It is like a warehouse!
It is like statistics!
Poor Data Manifests as Multifaceted Organizational Challenges
© Copyright 2022 by Peter Aiken Slide # 32
https://anythingawesome.com
Many DQ Challenges Are Unique and/or Context Specific!
© Copyright 2022 by Peter Aiken Slide # 33
https://anythingawesome.com
• Something bad happened
– Imperfect data was to blame
• Someone needs to fix
– Poor quality data
• You currently have
management's attention
– It is wise to ensure you
also have their understanding
• "Do something" often
leads to "Buy something"
– Mostly technology-based
• Get data quality-ing!
– A fool with a tool is still a fool
• Something is accomplished
– Most often all the project funding
is used up
Burning Bridge
© Copyright 2022 by Peter Aiken Slide # 34
https://anythingawesome.com
• Early cases have a dual
purpose
– Make the case that this will fix
the immediate challenge
– Illustrate why a programmatic
approach is preferable
Leverage Is an Engineering Concept
• Using proper engineering techniques, a human can lift a bulk that
is weighs much more than the human
© Copyright 2022 by Peter Aiken Slide # 35
https://anythingawesome.com
1 kg
10 kg
11 kg
Data Quality Is a Wholistic Approach to Obtaining Data Leverage
© Copyright 2022 by Peter Aiken Slide # 36
https://anythingawesome.com
Organizational
Data
Knowledge workers
supplemented by
data professionals
Process
Guided by strategy
https://www.computerhope.com/jargon/f/framework.htm
People
Technology
Reducing ROT increases data leverage
Data Leverage Is a Multi-Use Concept
• Permits organizations to better manage their data
– Within the organization, and
– With organizational data exchange partners
– In support of the organizational mission
• Leverage
– Obtained by implementation of data-centric
technologies, processes, and human skill sets
– Focus on the non-ROT data
• The bigger the organization, the greater potential leverage exists
• Treating data more asset-like simultaneously
– Lowers organizational IT costs and
– Increases organizational knowledge worker productivity
© Copyright 2022 by Peter Aiken Slide # 37
https://anythingawesome.com
Concrete Example of Data Leverage
• Reference
– Controls accessible data values
• Master
– Controls access to system capabilities
• Transaction
– Instances of values
© Copyright 2022 by Peter Aiken Slide # 38
https://anythingawesome.com
Countries where we do business?
Types of accounts available?
Controlled vocabulary items
Are you a member of our premium club?
Authorizing uses/users?
Common/standard data structures
$5
Authorized
Like !
Example based on: Dr. Christopher Bradley of DMAdvisors–he has more, ping him at chris.bradley@dmadvisors.co.uk
Cannot do business overseas?
Cannot determine product origin?
Cannot add a foreign language to the website?
Cannot select a valid menu item?
Simple Math
• At the beginning of a project,
• Where the parties know the least about each other
• All are expected to agree on the meaning of price, timing, and
functionalities
• Define X (some resources)
• Define Y (cleaning 1 set of data)
• Define Z (that data will be clean)
© Copyright 2022 by Peter Aiken Slide # 39
https://anythingawesome.com
If X is invested in Y then outcome Z will result (Z > X)
Simple Math
• Define X ($100)
• Define Y (cleaning 1 set of data)
• Define Z ($1000)
© Copyright 2022 by Peter Aiken Slide # 40
https://anythingawesome.com
If $100 is invested in cleaning 1 set of data then outcome $1000 will result
What Does It Mean "Data Quality Program"?
• Ongoing commitment
– Permits evolutionary improvement of the approach
• Governance
– Senior level coordination, direction, and control
• Executive leadership capabilities
– Change and risk management
• Data quality approach inherits (above)
– Budget, strategic priorities
– Senior level attention and improving topical facility
– Reasonable timelines/expectations
© Copyright 2022 by Peter Aiken Slide # 41
https://anythingawesome.com
https://blog.ducenit.com/data-quality-management
Data Quality Is Not a Project
• Durable asset
– An asset that has a usable
life more than one year
• Reasonable project
deliverables
– 90 day increments
– Data evolution is measured in years
• Data
– Evolves - it is not created
– Significantly more stable
• Readymade data architectural components
– Prerequisite to agile development
• Only alternative is to create additional data siloes!
© Copyright 2022 by Peter Aiken Slide # 42
https://anythingawesome.com
Differences Between Programs and Projects
• Programs are Ongoing, Projects End
– Managing a program involves long term strategic planning and
continuous process improvement is not required of a project
• Programs are Tied to the Financial Calendar
– Program managers are often responsible for delivering
results tied to the organization's financial calendar
• Program Management is Governance Intensive
– Programs are governed by a senior board that provides direction,
oversight, and control while projects tend to be less governance-intensive
• Programs Have Greater Scope of Financial Management
– Projects typically have a straight-forward budget and project financial
management is focused on spending to budget while program planning,
management and control is significantly more complex
• Program Change Management is an Executive Leadership
Capability
– Projects employ a formal change management process while at the program
level, change management requires executive leadership skills and program
change is driven more by an organization's strategy and is subject to market
conditions and changing business goals
© Copyright 2022 by Peter Aiken Slide #
Adapted from http://top.idownloadnew.com/program_vs_project/ and http://management.simplicable.com/management/new/program-management-vs-project-management
43
https://anythingawesome.com
Your data quality
program must last at
least as long as your
HR program!
© Copyright 2022 by Peter Aiken Slide # 44
https://anythingawesome.com
NYC Bridge Maintenance
•Painting is continuous and optimized
•As is much of the remaining maintenance
•Ensures a knowledgeable workforce
© Copyright 2022 by Peter Aiken Slide # 45
https://anythingawesome.com
Making a Better Quality Data Sandwich
Data supply
Data literacy
Standard data
Standard data
Leverage Point - High Performance Automation
© Copyright 2022 by Peter Aiken Slide #
Data literacy
46
https://anythingawesome.com
Data supply
Leverage Point - High Performance Automation
© Copyright 2022 by Peter Aiken Slide #
Standard data
Data supply
Data literacy
47
https://anythingawesome.com
Leverage Point - High Performance Automation
© Copyright 2022 by Peter Aiken Slide #
This cannot happen without engineering and architecture!
Quality engineering/
architecture work products
do not happen accidentally!
48
https://anythingawesome.com
Data supply
Data literacy
Standard data
Leverage Point - High Performance Automation
© Copyright 2022 by Peter Aiken Slide #
This cannot happen without data engineering and architecture!
49
https://anythingawesome.com
Quality data engineering/
architecture work products
do not happen accidentally!
Data supply
Data literacy
Standard data
USS Midway
& Pancakes
Why Is This an Excellent
Example of Engineering?
• It is tall
• It has a clutch
• It was built in 1942
• It is cemented to the floor
• It is still in regular use!
© Copyright 2022 by Peter Aiken Slide # 50
https://anythingawesome.com
Our Barn Had To Pass a Foundation Inspection
• Before further construction could proceed
• It makes good business sense
• No IT equivalent
© Copyright 2022 by Peter Aiken Slide # 51
https://anythingawesome.com
© Copyright 2022 by Peter Aiken Slide # 52
peter.aiken@anythingawesome.com +1.804.382.5957 Peter Aiken, PhD
Data
Quality
Engineering
© Copyright 2022 by Peter Aiken Slide #
https://anythingawesome.com 53
Getting Data Quality Right
Engineering Success Stories
Program
• Approaching Data Quality
– Definitions
– Causes can be difficult to discern
– Data quality challenges are the
root cause of most IT and
business failures
– Must be built on leverage
– Requires a programmatic
approach to be most effective
– Early business cases
often have a dual purpose
– High quality data requires
architecture and engineering
• What do we need to get
better at?
– Systems thinking
– Not looking at data
quality in isolation
– Understanding data ROT
– Not underestimating
the role of culture
– Developing repeatable
capabilities/core data
quality expertise
• How do we get better?
– Refocus the request around
business outcomes
– Leadership
– Program focus
– Math (cost or investment?)
– Storytelling/Practice
• Takeaways and Q&A
© Copyright 2022 by Peter Aiken Slide #
https://anythingawesome.com 54
Getting Data Quality Right
Engineering Success Stories
Program
• Approaching Data Quality
– Definitions
– Causes can be difficult to discern
– Data quality challenges are the
root cause of most IT and
business failures
– Must be built on leverage
– Requires a programmatic
approach to be most effective
– Early business cases
often have a dual purpose
– High quality data requires
architecture and engineering
• What do we need to get
better at?
– Systems thinking
– Not looking at data
quality in isolation
– Understanding data ROT
– Not underestimating
the role of culture
– Developing repeatable
capabilities/core data
quality expertise
• How do we get better?
– Refocus the request around
business outcomes
– Leadership
– Program focus
– Math (cost or investment?)
– Storytelling/Practice
• Takeaways and Q&A
Systems Thinking
© Copyright 2022 by Peter Aiken Slide # 55
https://anythingawesome.com
http://victorianscandal.wordpress.com/picturesque/rachel-olshausen/
• A framework that is based on
the belief that the component
parts of a system can best
be understood in the
context of relationships
with other systems, rather than
in isolation.
• The only way to fully
understand why a problem or
element occurs and persists is
to understand the part in
relation to the whole.
Capra, F. (1996) The web of life: a new scientific understanding of living
systems (1st Anchor Books ed). New York: Anchor Books. p. 30
Outputs
Process
Input ➜ Process ➜ Output Diagram
© Copyright 2022 by Peter Aiken Slide # 56
https://anythingawesome.com
Inputs
Pizza
Make Pizza
Dough
Water
Pizza
Crust
Make Crust
Make Pizza
Data Steward Quality
Responsibilities
• Inputs
– From where, do each of these my
responsible data items come?
– Why are they produced?
– What level of quality is required by
'my processes?'
• Process
– What business processes use the
data within my fiduciary
responsibility?
– For what business purpose do they
use each data item?
– What role does quality play for my
processes to contribute?
• Output
– What downstream business
processes consume data that was
under my fiduciary care?
– For what purpose are each data
items consumed?
– What quality attribute are required
by each downstream consumer?
© Copyright 2022 by Peter Aiken Slide # 57
https://anythingawesome.com
Interdependencies
© Copyright 2022 by Peter Aiken Slide # 58
https://anythingawesome.com
Data Governance
CRM (SalesForce)
Data Quality
© Copyright 2022 by Peter Aiken Slide # 59
https://anythingawesome.com
Data
Management
Body
of
Knowledge
(DM
BoK
V2)
Practice
Areas
from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International
One Option
© Copyright 2022 by Peter Aiken Slide # 60
https://anythingawesome.com
Data
Strategy
Data
Governance
BI/
Warehouse
Perfecting
operations in 3
data management
practice areas
1X
1X
1X
Metadata
Data
Quality
from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International
© Copyright 2022 by Peter Aiken Slide # 61
https://anythingawesome.com
Separating the Wheat from the Chaff
Separating the Wheat From the Chaff
© Copyright 2022 by Peter Aiken Slide # 62
https://anythingawesome.com
Does organizing data add to its value?
Pre-Information Age Metadata
• Examples of information architecture achievements that happened
well before the information age:
– Page numbering
– Alphabetical order
– Table of contents
– Indexes
– Lexicons
– Maps
– Diagrams
© Copyright 2022 by Peter Aiken Slide # 63
https://anythingawesome.com
Example from: How to make sense of any mess
by Abby Covert (2014) ISBN: 1500615994
"While we can arrange things
with the intent to communicate
certain information, we can't
actually make information. Our
users do that for us."
https://www.youtube.com/watch?v=60oD1TDzAXQ&feature=emb_logo
https://www.youtube.com/watch?v=r10Sod44rME&t=1s
https://www.youtube.com/watch?v=XD2OkDPAl6s
Remove the Structure and Things Fall Apart Rapidly
• Better organized data increases in value
© Copyright 2022 by Peter Aiken Slide # 64
https://anythingawesome.com
Separating the Wheat From the Chaff
• Better organized data increases in value
• Poor data quality practices are costing organizations money/time/
effort
• 80% of organizational data is ROT
– Redundant
– Obsolete
– Trivial
• The question is which data to eliminate?
– Most enterprise data is never analyzed
© Copyright 2022 by Peter Aiken Slide # 65
https://anythingawesome.com
Does cleaning all of it make any sense?
© Copyright 2022 by Peter Aiken Slide # 66
https://anythingawesome.com
Who is best qualified to decide this?
1. Allow the form of the
problem to guide the
form of the solution
2. Provide a means of
decomposing the problem
3. Feature a variety of tools
simplifying system understanding
4. Offer a set of strategies for evolving the
design of a programmatic solution
5. Provide criteria for evaluating the quality of the various solutions
6. Facilitate development of a framework for developing
organizational knowledge
© Copyright 2022 by Peter Aiken Slide # 67
https://anythingawesome.com
Programmatic
Data Quality
Engineering
© Copyright 2022 by Peter Aiken Slide # 68
https://anythingawesome.com
https://en.wikipedia.org/wiki/Theory_of_constraints
(TOC)
• A management paradigm that views any
manageable system as being limited in
achieving more of its goals by a small
number of constraints (Eliyahu M. Goldratt)
• There is always at least one constraint, and
TOC uses a focusing process to identify the
constraint and restructure the rest of the
organization to address it
• TOC adopts the
common idiom "a
chain is no stronger
than its weakest link,"
processes, organizations, etc., are
vulnerable because the weakest
component can damage or break them
or at least adversely affect the outcome
The DQE Cycle
© Copyright 2022 by Peter Aiken Slide # 69
https://anythingawesome.com
• Deming cycle
• "Plan-do-study-act" or
"plan-do-check-act"
– Identifying data issues that are
critical to the achievement of
business objectives
– Defining business requirements
for data quality
– Identifying key data quality
dimensions
– Defining business rules critical
to ensuring high quality data
https://deming.org/explore/pdsa/
The DQE Cycle: (1) Plan
© Copyright 2022 by Peter Aiken Slide # 70
https://anythingawesome.com
• Plan for the assessment of
the current state and
identification of key metrics
for measuring quality
• The data quality engineering
team assesses the scope of
known issues
– Determining cost and impact
– Evaluating alternatives for
addressing them
The DQE Cycle: (2) Deploy
© Copyright 2022 by Peter Aiken Slide # 71
https://anythingawesome.com
• Deploy processes for
measuring and improving
the quality of data:
• Data profiling
– Institute inspections and
monitors to identify data issues
when they occur
– Fix flawed processes that are
the root cause of data errors or
correct errors downstream
– When it is not possible to
correct errors at their source,
correct them at their earliest
point in the data flow
The DQE Cycle: (3) Monitor
© Copyright 2022 by Peter Aiken Slide # 72
https://anythingawesome.com
• Monitor the quality of data
as measured against the
defined business rules
• If data quality meets defined
thresholds for acceptability,
the processes are in control
and the level of data quality
meets the business
requirements
• If data quality falls below
acceptability thresholds,
notify data stewards so they
can take action during the
next stage
The DQE Cycle: (4) Act
© Copyright 2022 by Peter Aiken Slide # 73
https://anythingawesome.com
• Act to resolve any identified
issues to improve data
quality and better meet
business expectations
• New cycles begin as new
data sets come under
investigation or as new data
quality requirements are
identified for existing data
sets
Starting
point
for new
system
development
data performance metadata
data architecture
data
architecture and
data models
shared data updated data
corrected
data
architecture
refinements
facts &
meanings
Metadata &
Data Storage
Starting point
for existing
systems
Metadata Refinement
• Correct Structural Defects
• Update Implementation
Metadata Creation
• Define Data Architecture
• Define Data Model Structures
Metadata Structuring
• Implement Data Model Views
• Populate Data Model Views
Data Refinement
• Correct Data Value Defects
• Re-store Data Values
Data Manipulation
• Manipulate Data
• Updata Data
Data Utilization
• Inspect Data
• Present Data
Data Creation
• Create Data
• Verify Data Values
Data Assessment
• Assess Data Values
• Assess Metadata
Extended Data Life Cycle Model With Metadata Sources and Uses
© Copyright 2022 by Peter Aiken Slide # 74
https://anythingawesome.com
Data Quality Attributes
© Copyright 2022 by Peter Aiken Slide # 75
https://anythingawesome.com
© Copyright 2022 by Peter Aiken Slide #
https://anythingawesome.com 76
Getting Data Quality Right
Engineering Success Stories
Program
• Approaching Data Quality
– Definitions
– Causes can be difficult to discern
– Data quality challenges are the
root cause of most IT and
business failures
– Must be built on leverage
– Requires a programmatic
approach to be most effective
– Early business cases
often have a dual purpose
– High quality data requires
architecture and engineering
• What do we need to get
better at?
– Systems thinking
– Not looking at data
quality in isolation
– Understanding data ROT
– Not underestimating
the role of culture
– Developing repeatable
capabilities/core data
quality expertise
• How do we get better?
– Refocus the request around
business outcomes
– Leadership
– Program focus
– Math (cost or investment?)
– Storytelling/Practice
• Takeaways and Q&A
© Copyright 2022 by Peter Aiken Slide #
https://anythingawesome.com 77
Getting Data Quality Right
Engineering Success Stories
Program
• Approaching Data Quality
– Definitions
– Causes can be difficult to discern
– Data quality challenges are the
root cause of most IT and
business failures
– Must be built on leverage
– Requires a programmatic
approach to be most effective
– Early business cases
often have a dual purpose
– High quality data requires
architecture and engineering
• What do we need to get
better at?
– Systems thinking
– Not looking at data
quality in isolation
– Understanding data ROT
– Not underestimating
the role of culture
– Developing repeatable
capabilities/core data
quality expertise
• How do we get better?
– Refocus the request around
business outcomes
– Leadership
– Program focus
– Math (cost or investment?)
– Storytelling/Practice
• Takeaways and Q&A
Compare the Utility of Data Quality Conversation Topics
© Copyright 2022 by Peter Aiken Slide # 78
https://anythingawesome.com
Engineers say: Business wants to hear:
Clean some data
Decrease the number of
undeliverable targeted marketing
ads
Reorganize the database
Increase the ability of the
salesforce to
perform their own analyses
Develop a taxonomy
Create a common vocabulary for
the organization
Optimize a query
Shaved 1 second off a task that
runs a billion times a day
Reverse engineer the legacy
system
Understand: what was good
about the old system so it can be
formally preserved and, what
was bad so it can be improved
CDO Agenda
Inventory Data -> uncovering
assets & decreasing ROT
Develop the first version of an
organizational data strategy
Monetize your organization's data
© Copyright 2022 by Peter Aiken Slide # 79
https://anythingawesome.com
The CDOs goal is to better manage data as an organizational
asset in support of the organizational mission!
What Is Strategy?
• Current use derived from military
- a pattern in a stream of decisions
[Henry Mintzberg]
© Copyright 2022 by Peter Aiken Slide # 80
https://anythingawesome.com
A thing
Q1
Organizations
without
a formalized
data quality focus
Q4
Data Quality
Focus: both,
simultaneously
Q2
Data Quality
Focus: Increase
organizational
efficiencies/effectiveness
Improve Operations
Innovation
Initially Pick One or the Other but Not Both
© Copyright 2022 by Peter Aiken Slide # 81
https://anythingawesome.com
x
x
Q3
Data Quality
Focus: Use data
to create |
strategic |
opportunities |
12.5
25
37.5
50
Monday Tuesday Wednesday Thursday Friday
48
24
12
6
3
Pandemic Math Question? (A Very Bad Week)
• If demand at a 48-bed hospital facility is doubling-daily …
• … at what point does anyone notice that the hospital beds are
becoming scarce?
- Monday 3 beds occupied
- Tuesday 6
- Wednesday – ¾ of all beds were available
- Yesterday – ½ of all beds were available
- Today – zero beds available
- Tomorrow …???
© Copyright 2022 by Peter Aiken Slide # 82
https://anythingawesome.com
A Musical Analogy That Works for Both Practice and Storytelling
© Copyright 2022 by Peter Aiken Slide # 83
https://anythingawesome.com
+ =
https://www.youtube.com/watch?v=4n1GT-VjjVs&frags=pl%2Cwn
© Copyright 2022 by Peter Aiken Slide # 84
https://anythingawesome.com
Getting Data Quality Right
Engineering Success Stories
Program
• Approaching Data Quality
– Definitions
– Causes can be difficult to discern
– Data quality challenges are the
root cause of most IT and
business failures
– Must be built on leverage
– Requires a programmatic
approach to be most effective
– Early business cases
often have a dual purpose
– High quality data requires
architecture and engineering
• What do we need to get
better at?
– Systems thinking
– Not looking at data
quality in isolation
– Understanding data ROT
– Not underestimating
the role of culture
– Developing repeatable
capabilities/core data
quality expertise
• How do we get better?
– Refocus the request around
business outcomes
– Leadership
– Program focus
– Math (cost or investment?)
– Storytelling/Practice
• Takeaways and Q&A
© Copyright 2022 by Peter Aiken Slide # 85
https://anythingawesome.com
• Information transparency
• Analytics
• Business Intelligence
• Increasing efficiencies
• Decreasing costs
• Driving holistic decision-making
across the organization
• Information transparency
• Analytics
• Business Intelligence
• Increasing efficiencies
• Decreasing costs
• Driving holistic decision-making
across the organization
High
Quality
Data is
Critical
Not Helpful
© Copyright 2022 by Peter Aiken Slide # 86
https://anythingawesome.com
1 The project needs to be small Projects should not be allowed to
begin unless the data
requirements for the entire
project are verified
2 The product Owner or sponsor
must be highly skilled
Few in IT have the requisite data
skills and knowledge
3 The process must be agile-
ready
Agile is a construction technique/
data requires more planning
before construction
4 The team must be highly skilled
in both the data quality
processes and technology
Few teams have requisite levels
of data skills
5 The organization must be highly
skilled at emotional maturity
Few organizations understand
data stuff
Winning Cards for data quality program success
Data
Things
Happen
Organizational
Things
Happen
This Approach Only Works if
• We know where the data that needs to be fixed–resides
• We can communicate precisely and correctly amongst team
members, sponsors, collaborators
• We are adept with the correct technological support
• …
© Copyright 2022 by Peter Aiken Slide # 87
https://anythingawesome.com
≈
≈
≈
≈
≈
≈
X
$
X $
X
$
X $
X $
X
$
X
$
X $
X $
Data Quality Dimensions
© Copyright 2022 by Peter Aiken Slide # 88
https://anythingawesome.com
Data Value Quality
© Copyright 2022 by Peter Aiken Slide # 89
https://anythingawesome.com
Data Representation Quality
© Copyright 2022 by Peter Aiken Slide # 90
https://anythingawesome.com
Data Model Quality
© Copyright 2022 by Peter Aiken Slide # 91
https://anythingawesome.com
Data Architecture Quality
© Copyright 2022 by Peter Aiken Slide # 92
https://anythingawesome.com
Upcoming Events
Where Data Architecture and Data Governance Collide
11 October 2022
What's in Your Data Warehouse?
8 November 2022
Data Management Best Practices
13 December 2022
© Copyright 2022 by Peter Aiken Slide # 93
https://anythingawesome.com
Brought to you by:
Time: 19:00 UTC (2:00 PM NYC) | Presented by: Peter Aiken, PhD
Note: In this .pdf, clicking any webinar title opens the registration link
Event Pricing
© Copyright 2022 by Peter Aiken Slide # 94
https://anythingawesome.com
• 20% off
directly from the publisher
on select titles
• My Book Store @
http://anythingawesome.com
• Enter the code
"anythingawesome" at the
Technics bookstore
checkout where it says to
"Apply Coupon"
anythingawesome
Peter.Aiken@AnythingAwesome.com +1.804.382.5957
Thank You!© Copyright 2022 by Peter Aiken Slide # 95
Book a call with Peter to discuss anything - https://anythingawesome.com/OfficeHours.html
Critical Design Review?
Hiring Assistance?
Reverse Engineering Expertise?
Executive Data
Literacy Training?
Mentoring?
Tool/automation evaluation?
Use your data more strategically?
Independent
Verification
&
Validation
Collaboration?

More Related Content

What's hot

The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of MetadataDATAVERSITY
 
Unlocking Greater Insights with Integrated Data Quality for Collibra
Unlocking Greater Insights with Integrated Data Quality for CollibraUnlocking Greater Insights with Integrated Data Quality for Collibra
Unlocking Greater Insights with Integrated Data Quality for CollibraPrecisely
 
Data Quality & Data Governance
Data Quality & Data GovernanceData Quality & Data Governance
Data Quality & Data GovernanceTuba Yaman Him
 
Data Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working TogetherData Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working TogetherDATAVERSITY
 
Enterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesLars E Martinsson
 
Data Governance
Data GovernanceData Governance
Data GovernanceBoris Otto
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best PracticesDATAVERSITY
 
Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management DATAVERSITY
 
AI Data Acquisition and Governance: Considerations for Success
AI Data Acquisition and Governance: Considerations for SuccessAI Data Acquisition and Governance: Considerations for Success
AI Data Acquisition and Governance: Considerations for SuccessDatabricks
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Data Management is Data Governance
Data Management is Data GovernanceData Management is Data Governance
Data Management is Data GovernanceDATAVERSITY
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationDATAVERSITY
 
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...DATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Data Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityData Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityDATAVERSITY
 
CDMP preparation workshop EDW2016
CDMP preparation workshop EDW2016CDMP preparation workshop EDW2016
CDMP preparation workshop EDW2016Christopher Bradley
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Glossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceGlossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceDATAVERSITY
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best PracticesDATAVERSITY
 

What's hot (20)

The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of Metadata
 
Unlocking Greater Insights with Integrated Data Quality for Collibra
Unlocking Greater Insights with Integrated Data Quality for CollibraUnlocking Greater Insights with Integrated Data Quality for Collibra
Unlocking Greater Insights with Integrated Data Quality for Collibra
 
Data Quality & Data Governance
Data Quality & Data GovernanceData Quality & Data Governance
Data Quality & Data Governance
 
Data Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working TogetherData Management, Metadata Management, and Data Governance – Working Together
Data Management, Metadata Management, and Data Governance – Working Together
 
Enterprise Data Architecture Deliverables
Enterprise Data Architecture DeliverablesEnterprise Data Architecture Deliverables
Enterprise Data Architecture Deliverables
 
Data Governance
Data GovernanceData Governance
Data Governance
 
Data Quality Best Practices
Data Quality Best PracticesData Quality Best Practices
Data Quality Best Practices
 
Data Governance and Metadata Management
Data Governance and Metadata ManagementData Governance and Metadata Management
Data Governance and Metadata Management
 
AI Data Acquisition and Governance: Considerations for Success
AI Data Acquisition and Governance: Considerations for SuccessAI Data Acquisition and Governance: Considerations for Success
AI Data Acquisition and Governance: Considerations for Success
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Data Management is Data Governance
Data Management is Data GovernanceData Management is Data Governance
Data Management is Data Governance
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital Transformation
 
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Data Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityData Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data Quality
 
CDMP preparation workshop EDW2016
CDMP preparation workshop EDW2016CDMP preparation workshop EDW2016
CDMP preparation workshop EDW2016
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Glossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data GovernanceGlossaries, Dictionaries, and Catalogs Result in Data Governance
Glossaries, Dictionaries, and Catalogs Result in Data Governance
 
Data Governance Best Practices
Data Governance Best PracticesData Governance Best Practices
Data Governance Best Practices
 

Similar to Getting Data Quality Right

Data-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityData-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityDATAVERSITY
 
Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data Blueprint
 
Data-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMData-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMDATAVERSITY
 
Data-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringData-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringDATAVERSITY
 
Data-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsData-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsDATAVERSITY
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements  Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements Data Blueprint
 
DataEd Slides: Expressing Data Improvements as Business Outcomes
DataEd Slides: Expressing Data Improvements as Business OutcomesDataEd Slides: Expressing Data Improvements as Business Outcomes
DataEd Slides: Expressing Data Improvements as Business OutcomesDATAVERSITY
 
DataEd Webinar: Reference & Master Data Management - Unlocking Business Value
DataEd Webinar:  Reference & Master Data Management - Unlocking Business ValueDataEd Webinar:  Reference & Master Data Management - Unlocking Business Value
DataEd Webinar: Reference & Master Data Management - Unlocking Business ValueDATAVERSITY
 
Data Preparation Fundamentals
Data Preparation FundamentalsData Preparation Fundamentals
Data Preparation FundamentalsDATAVERSITY
 
Key Elements of a Successful Data Governance Program
Key Elements of a Successful Data Governance ProgramKey Elements of a Successful Data Governance Program
Key Elements of a Successful Data Governance ProgramDATAVERSITY
 
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMAOAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMAAlex Fiteni
 
Data Quality Success Stories
Data Quality Success StoriesData Quality Success Stories
Data Quality Success StoriesDATAVERSITY
 
Fate of the Chief Data Officer
Fate of the Chief Data OfficerFate of the Chief Data Officer
Fate of the Chief Data OfficerTamarah Usher
 
Trends in Enterprise Advanced Analytics
Trends in Enterprise Advanced AnalyticsTrends in Enterprise Advanced Analytics
Trends in Enterprise Advanced AnalyticsDATAVERSITY
 
Five Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyFive Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyPerficient, Inc.
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation Caserta
 
Is Your Agency Data Challenged?
Is Your Agency Data Challenged?Is Your Agency Data Challenged?
Is Your Agency Data Challenged?DLT Solutions
 

Similar to Getting Data Quality Right (20)

Data-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data QualityData-Ed Online: Approaching Data Quality
Data-Ed Online: Approaching Data Quality
 
Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM Data-Ed: Business Value From MDM
Data-Ed: Business Value From MDM
 
Data-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDMData-Ed Online Webinar: Business Value from MDM
Data-Ed Online Webinar: Business Value from MDM
 
Data-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality EngineeringData-Ed Webinar: Data Quality Engineering
Data-Ed Webinar: Data Quality Engineering
 
2014 dqe handouts
2014 dqe handouts2014 dqe handouts
2014 dqe handouts
 
Data-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture RequirementsData-Ed Webinar: Data Architecture Requirements
Data-Ed Webinar: Data Architecture Requirements
 
Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements  Data-Ed: Data Architecture Requirements
Data-Ed: Data Architecture Requirements
 
Data Science and Analytics
Data Science and Analytics Data Science and Analytics
Data Science and Analytics
 
DataEd Slides: Expressing Data Improvements as Business Outcomes
DataEd Slides: Expressing Data Improvements as Business OutcomesDataEd Slides: Expressing Data Improvements as Business Outcomes
DataEd Slides: Expressing Data Improvements as Business Outcomes
 
DataEd Webinar: Reference & Master Data Management - Unlocking Business Value
DataEd Webinar:  Reference & Master Data Management - Unlocking Business ValueDataEd Webinar:  Reference & Master Data Management - Unlocking Business Value
DataEd Webinar: Reference & Master Data Management - Unlocking Business Value
 
Data Preparation Fundamentals
Data Preparation FundamentalsData Preparation Fundamentals
Data Preparation Fundamentals
 
Key Elements of a Successful Data Governance Program
Key Elements of a Successful Data Governance ProgramKey Elements of a Successful Data Governance Program
Key Elements of a Successful Data Governance Program
 
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMAOAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
OAUG 05-2009-MDM-1683-A Fiteni CPA, CMA
 
Why data governance is the new buzz?
Why data governance is the new buzz?Why data governance is the new buzz?
Why data governance is the new buzz?
 
Data Quality Success Stories
Data Quality Success StoriesData Quality Success Stories
Data Quality Success Stories
 
Fate of the Chief Data Officer
Fate of the Chief Data OfficerFate of the Chief Data Officer
Fate of the Chief Data Officer
 
Trends in Enterprise Advanced Analytics
Trends in Enterprise Advanced AnalyticsTrends in Enterprise Advanced Analytics
Trends in Enterprise Advanced Analytics
 
Five Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data StrategyFive Attributes to a Successful Big Data Strategy
Five Attributes to a Successful Big Data Strategy
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation
 
Is Your Agency Data Challenged?
Is Your Agency Data Challenged?Is Your Agency Data Challenged?
Is Your Agency Data Challenged?
 

More from DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...DATAVERSITY
 
Empowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceEmpowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceDATAVERSITY
 

More from DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
 
Empowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business IntelligenceEmpowering the Data Driven Business with Modern Business Intelligence
Empowering the Data Driven Business with Modern Business Intelligence
 

Recently uploaded

Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Pooja Nehwal
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...karishmasinghjnh
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...gajnagarg
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...amitlee9823
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 

Recently uploaded (20)

CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...
Just Call Vip call girls Bellary Escorts ☎️9352988975 Two shot with one girl ...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 

Getting Data Quality Right

  • 1. Getting Data Quality Right Good data is like good water: best served fresh, and ideally well- filtered. Data management strategies can produce tremendous procedural improvements and increased profit margins across the board, but only if the data being managed is of high quality. Determining how data quality should be engineered provides a useful framework for utilizing data quality management effectively in support of business strategy. This, in turn, allows for speedy identification of business problems, delineation between structural and practice- oriented defects in data management, and proactive prevention of future issues. Organizations must realize what it means to utilize data quality engineering in support of business strategy. This webinar will illustrate how organizations with chronic business challenges often can trace the root of the problem to poor data quality. Showing how data quality should be engineered provides a useful framework in which to develop an effective approach. This in turn allows organizations to more quickly identify business problems as well as data problems caused by structural issues versus practice-oriented defects and prevent these from re-occurring. Program Learning Objectives • Help you understand foundational data quality concepts for improving data quality at your organization • Demonstrate how chronic business challenges for organizations are often rooted in poor data quality • Share case studies illustrating the hallmarks and benefits of data quality success © Copyright 2022 by Peter Aiken Slide # 1 https://anythingawesome.com Date: 13 September 2022 | Time: 19:00 UTC (2:00 PM NYC) | Presented by: Peter Aiken, PhD © Copyright 2022 by Peter Aiken Slide # 2 https://anythingawesome.com
  • 2. Collibra | Confidential Delivering Trusted Data: Collibra's Modern Approach To Data Quality
  • 3. 2 Consider the following features as we demonstrate them in action… How would you manage this company’s data? The data is created by your front office… • Sales representatives record the transactions • They have flexibility in their own software The data is in multiple places… • Your “controls” separated the data • You have sales data and financial transactions Everyone up to CEO needs that data… • What could possibly go wrong? • How will you discover it? Human Error Technical Error No Error
  • 4. Collibra | Confidential 3 Is our data quality getting better or worse over time? How do I identify sensitive data before running DQ checks? Where are my data inconsistencies? Who owns this data issue... How can I trust this report or dashboard? Our data producers and data consumers operate in silos... What is the quality of the data behind my models and reports? Problem: Managing enterprise-scale data quality is a challenge BI Analyst/ Data Scientist Everyday Employee Management/ Leadership
  • 5. Collibra | Confidential 4 50-70% 15-25% lost revenue for most companies due to bad data. average time spent on manual rule writing and management. Sources: IDC and Gartner 2020, Tech Jury forecast, 2021 Assessing Data Quality: A Managerial Call to Action (May 2020, HBR) Seizing Opportunity in Data Quality (MIT Sloan Management Review, 2017) $1.9B projected data quality spend in 2021. Problem: This leads to real costs across your organization 47% of recently created data records have at least one critical error.
  • 6. 5 Collibra Data Quality & Observability Joy of automation • Discover data quality issues • “Feels” statistical, not code • Scales as easy a identify, scan, discover, train Intelligent Auto-generate explainable and adaptive DQ rules Scalable Scan large and diverse databases, files and streaming data Self-service Empower users with a unified scoring system and flexible rule management
  • 7. Workflow Good Notifications Standardize Rules Good Scale Automatically detect change Good Rule Logic Profile your data consistently Good Metadata Integrate across multiple data sources Good Data minimal viable data trust Data Quality & Observability 'Stack’ Collibra Data Quality & Observability automates static observations and technical rules while scaling complex business rules. Consider solutions in the market that combine the three trending features in modern DQ - advanced autometrics, rule templating, and data discovery/enforcement.
  • 8. Chief Digital Manager @ Dataversity.Net © Copyright 2022 by Peter Aiken Slide # 3 https://anythingawesome.com Shannon Kempe Two Most Commonly Asked Questions © Copyright 2022 by Peter Aiken Slide # 4 https://anythingawesome.com 1) Will I get copies of the slides after the event? 2) Is this being recorded?
  • 9. Peter Aiken, Ph.D. • I've been doing this a long time • My work is recognized as useful • Associate Professor of IS (vcu.edu) • Institute for Defense Analyses (ida.org) • DAMA International (dama.org) • MIT CDO Society (iscdo.org) • Anything Awesome (plusanythingawesome.com) • Experienced w/ 500+ data management practices worldwide • Multi-year immersions – US DoD (DISA/Army/Marines/DLA) – Nokia – Deutsche Bank – Wells Fargo – Walmart … • 12 books and dozens of articles © Copyright 2022 by Peter Aiken Slide # 5 https://anythingawesome.com + • DAMA International President 2009-2013/2018/2020 • DAMA International Achievement Award 2001 (with Dr. E. F. "Ted" Codd • DAMA International Community Award 2005 Event Pricing © Copyright 2022 by Peter Aiken Slide # 6 https://anythingawesome.com • 20% off directly from the publisher on select titles • My Book Store @ http://anythingawesome.com • Enter the code "anythingawesome" at the Technics bookstore checkout where it says to "Apply Coupon" anythingawesome
  • 10. Getting Data Quality Right © Copyright 2022 by Peter Aiken Slide # 7 peter.aiken@anythingawesome.com +1.804.382.5957 Peter Aiken, PhD Engineering Business Success Stories © Copyright 2022 by Peter Aiken Slide # 8 https://anythingawesome.com Getting Data Quality Right Engineering Success Stories Program • Approaching Data Quality – Definitions – Causes can be difficult to discern – Data quality challenges are the root cause of most IT and business failures – Must be built on leverage – Requires a programmatic approach to be most effective – Early business cases often have a dual purpose – High quality data requires architecture and engineering • What do we need to get better at? – Systems thinking – Not looking at data quality in isolation – Understanding data ROT – Not underestimating the role of culture – Developing repeatable capabilities/core data quality expertise • How do we get better? – Refocus the request around business outcomes – Leadership – Program focus – Math (cost or investment?) – Storytelling/Practice • Takeaways and Q&A
  • 11. © Copyright 2022 by Peter Aiken Slide # https://anythingawesome.com 9 Getting Data Quality Right Engineering Success Stories Program • Approaching Data Quality – Definitions – Causes can be difficult to discern – Data quality challenges are the root cause of most IT and business failures – Must be built on leverage – Requires a programmatic approach to be most effective – Early business cases often have a dual purpose – High quality data requires architecture and engineering • What do we need to get better at? – Systems thinking – Not looking at data quality in isolation – Understanding data ROT – Not underestimating the role of culture – Developing repeatable capabilities/core data quality expertise • How do we get better? – Refocus the request around business outcomes – Leadership – Program focus – Math (cost or investment?) – Storytelling/Practice • Takeaways and Q&A Famous 1990's Words? • Question: – Why haven't organizations taken a more proactive approach to data quality? • Answer: – Fixing data quality problems is not easy – It is dangerous -- they'll come after you – Your efforts are likely to be misunderstood – You could make things worse – Now you get to fix it • A single data quality issue can grow into a significant, unexpected investment © Copyright 2022 by Peter Aiken Slide # 10 https://anythingawesome.com
  • 12. What Does Everyone Want? © Copyright 2022 by Peter Aiken Slide # 11 https://anythingawesome.com DIGITAL Digital Insight • Subtract data from digital and what do you have? • Subtract digital from data and you still have data © Copyright 2022 by Peter Aiken Slide # 12 https://anythingawesome.com https://www.linkedin.com/in/mark-johnson-518a752/ DIGITAL DATA ? DIGITAL DATA DATA
  • 13. © Copyright 2022 by Peter Aiken Slide # 13 https://anythingawesome.com https://anythingawesome.com/why.html + Definitions • Quality Data – Fit for purpose meets the requirements of its authors, users, and administrators (from Martin Eppler) – Synonymous with information quality, since poor data quality results in inaccurate information and poor performance • Data Quality Management – "Planning, implementation and control activities that apply quality management techniques to measure, assess, improve, and ensure data quality" – Encompasses life cycle activities – Include supporting processes from change management, etc. – Continuous improvement process requiring core capabilities • Data Quality Engineering – Recognition that data quality solutions cannot not managed but, instead, must be engineered – Data quality engineering concepts are generally not known and understood within IT or business! © Copyright 2022 by Peter Aiken Slide # 14 https://anythingawesome.com Spinach/Popeye story from: https://medium.com/hurb-labs/data-quality-a-lesson-from-the-myth-behind-popeye-the-sailor-a7bd50b61510
  • 14. DQ Effort Pattern © Copyright 2022 by Peter Aiken Slide # 15 https://anythingawesome.com from The DAMA Guide to the Data Management Body of Knowledge © 2017 by DAMA International 80% time spent preparing 20% analysis time Data Knowledge Is Too Little and Too Informal • Data management happens 'pretty well' at the workgroup level – Defining characteristic of a workgroup – Without guidance (strategy), what are the chances that all workgroups are pulling toward the same objectives? – Consider the time spent attempting informal practices • Data chaff becomes sand in the machinery – Preventing smooth interoperation and exchanges – Losses due to lots of little data cuts have been difficult to account for • Organizations and individuals lack data quality – Knowledge – Skills – Data Engineering (How?) – Data Strategy (Why this as opposed to that?) © Copyright 2022 by Peter Aiken Slide # 16 https://anythingawesome.com Wally Easton Playing Piano https://www.youtube.com/watch?v=NNbPxSvII-Q
  • 15. Everyone Wants To Do Better Data Analysis … • Some data preparation is inevitable • What would a 'good' ratio be? • "Everyone knows" © Copyright 2022 by Peter Aiken Slide # 17 https://anythingawesome.com 0% 25% 50% 75% 100% Data Analysis Data Preparation 20% 80% Hidden Data Factories © Copyright 2022 by Peter Aiken Slide # 18 https://anythingawesome.com https://hbr.org/2016/09/bad-data-costs-the-u-s-3-trillion-per-year Work products are delivered to Customers Customers Knowledge Workers 80% looking for stuff 20% doing useful work Department B 1. Check A's work 2. Make any corrections 3. Complete B's work 4. Deliver to Department C Department A https://en.wikipedia.org/wiki/Theory_of_constraints Department C 1. Check B's work 2. Make any corrections 3. Complete C's work 4. Deliver to Customer 5. Deal with consequences
  • 16. Hidden Data Factories © Copyright 2022 by Peter Aiken Slide # 19 https://anythingawesome.com IT Process Business Process Business Challenge Root-cause analysis reveals always reveals a data component! Poor Data Quality Manifests as Multifaceted Organizational Challenges © Copyright 2022 by Peter Aiken Slide # 20 https://anythingawesome.com IT System Business Challenge Business Challenge Business System Business Challenge IT Process Business Challenge IT System Business Challenge Business Process Business Challenge Poor results
  • 17. Business Challenge Consistency Encourages Quality Analysis © Copyright 2022 by Peter Aiken Slide # 21 https://anythingawesome.com IT System Business Challenge Business Process Business Challenge IT Process Business System Business Challenge IT Process Business Challenge IT System Business Challenge Business Process Business Challenge Data quality requires a team with specialized skills deployed to create a repeatable process and develop sustained organizational skillsets Events Are Not Always Recognized as Data Quality Challenges? © Copyright 2022 by Peter Aiken Slide # 22 https://anythingawesome.com • A letter from a bank • A very expensive, very small data rounding error • Health data story • The chocolate story • Covid-19
  • 18. © Copyright 2022 by Peter Aiken Slide # 23 https://anythingawesome.com A congratulations letter from another bank Problems • Bank did not know it made an error • Tools alone could not have prevented this error • Lost confidence in the ability of the bank to manage customer funds IEEE Senior Member Status (30+ Years Membership) © Copyright 2022 by Peter Aiken Slide # 24 https://anythingawesome.com
  • 19. © Copyright 2022 by Peter Aiken Slide # 25 https://anythingawesome.com • Needed trench for electrical cable 2.52" - delivered 2.5" • $1M required to rent other facilities while new cable is obtained • Either rounding or truncation could explain – We need to get a summary on all of this," he said. "How did the mistake occur? Who's at fault? What are the damages? And how is money going to be recovered?" Port of Seattle 2.52" ➜ 2.5" © Copyright 2022 by Peter Aiken Slide # 26 https://anythingawesome.com Areyougonnatellusthechocolatestory-again?
  • 20. © Copyright 2022 by Peter Aiken Slide # 27 https://anythingawesome.com This research, published as a letter this week in the British Medical Journal, was meant to draw attention to how much data gets entered incorrectly in the country’s medical system. These guys weren’t turning up at the doctor for pregnancy- related services. Instead, they were at their doctor for procedures that had medical codes similar to those of midwifery and obstetric services. With a misplaced keystroke here or there, an annual physical could become a consultation with a midwife. Why Britain has 17,000 pregnant men here keystroke misplaced there © Copyright 2022 by Peter Aiken Slide # 28 https://anythingawesome.com https://www.bbc.com/news/technology-54423988?es_p=12801491 Why Using Microsoft's Tool Caused Covid-19 Results To Be Lost
  • 21. Why Using Microsoft's Tool Caused Covid-19 Results To Be Lost © Copyright 2022 by Peter Aiken Slide # 29 https://anythingawesome.com https://www.bbc.com/news/technology-54423988?es_p=12801491 • Since 2007 should have been forced to use .xlsx (1,000,000+ rows) • Used .xls (65,000 rows) • Additional data was dropped without notification Practice-Oriented • Failure in rigor when capturing/ manipulating data • Allowing incorrect data to be collected when requirements specify otherwise • Presenting data out of sequence Structure-Oriented • Data arranged imperfectly • Street address → GPS coordinates • Data is captured but inaccessible • When incorrect data is provided as the correct response Practice-oriented activities focus on the capture and manipulation of data Getting Data Quality Right Depends on Both © Copyright 2022 by Peter Aiken Slide # 30 https://anythingawesome.com Structure-oriented activities focus on the data implementation Quality "Fit for purpose" Data
  • 22. Data Is Not Broadly or Widely Understood © Copyright 2022 by Peter Aiken Slide # 31 https://anythingawesome.com adapted from: http://www.dailymirror.lk/print/opinion/editorial-we-need-to-become-channels-of-peace/172-27164 It is like a fan! It is like a snake! It is like a wall! It is like a rope! It is like a tree! Blind Persons and the Elephant It is like a story! It is like a dashboard! It is like a pipes! It is like a warehouse! It is like statistics! Poor Data Manifests as Multifaceted Organizational Challenges © Copyright 2022 by Peter Aiken Slide # 32 https://anythingawesome.com
  • 23. Many DQ Challenges Are Unique and/or Context Specific! © Copyright 2022 by Peter Aiken Slide # 33 https://anythingawesome.com • Something bad happened – Imperfect data was to blame • Someone needs to fix – Poor quality data • You currently have management's attention – It is wise to ensure you also have their understanding • "Do something" often leads to "Buy something" – Mostly technology-based • Get data quality-ing! – A fool with a tool is still a fool • Something is accomplished – Most often all the project funding is used up Burning Bridge © Copyright 2022 by Peter Aiken Slide # 34 https://anythingawesome.com • Early cases have a dual purpose – Make the case that this will fix the immediate challenge – Illustrate why a programmatic approach is preferable
  • 24. Leverage Is an Engineering Concept • Using proper engineering techniques, a human can lift a bulk that is weighs much more than the human © Copyright 2022 by Peter Aiken Slide # 35 https://anythingawesome.com 1 kg 10 kg 11 kg Data Quality Is a Wholistic Approach to Obtaining Data Leverage © Copyright 2022 by Peter Aiken Slide # 36 https://anythingawesome.com Organizational Data Knowledge workers supplemented by data professionals Process Guided by strategy https://www.computerhope.com/jargon/f/framework.htm People Technology Reducing ROT increases data leverage
  • 25. Data Leverage Is a Multi-Use Concept • Permits organizations to better manage their data – Within the organization, and – With organizational data exchange partners – In support of the organizational mission • Leverage – Obtained by implementation of data-centric technologies, processes, and human skill sets – Focus on the non-ROT data • The bigger the organization, the greater potential leverage exists • Treating data more asset-like simultaneously – Lowers organizational IT costs and – Increases organizational knowledge worker productivity © Copyright 2022 by Peter Aiken Slide # 37 https://anythingawesome.com Concrete Example of Data Leverage • Reference – Controls accessible data values • Master – Controls access to system capabilities • Transaction – Instances of values © Copyright 2022 by Peter Aiken Slide # 38 https://anythingawesome.com Countries where we do business? Types of accounts available? Controlled vocabulary items Are you a member of our premium club? Authorizing uses/users? Common/standard data structures $5 Authorized Like ! Example based on: Dr. Christopher Bradley of DMAdvisors–he has more, ping him at chris.bradley@dmadvisors.co.uk Cannot do business overseas? Cannot determine product origin? Cannot add a foreign language to the website? Cannot select a valid menu item?
  • 26. Simple Math • At the beginning of a project, • Where the parties know the least about each other • All are expected to agree on the meaning of price, timing, and functionalities • Define X (some resources) • Define Y (cleaning 1 set of data) • Define Z (that data will be clean) © Copyright 2022 by Peter Aiken Slide # 39 https://anythingawesome.com If X is invested in Y then outcome Z will result (Z > X) Simple Math • Define X ($100) • Define Y (cleaning 1 set of data) • Define Z ($1000) © Copyright 2022 by Peter Aiken Slide # 40 https://anythingawesome.com If $100 is invested in cleaning 1 set of data then outcome $1000 will result
  • 27. What Does It Mean "Data Quality Program"? • Ongoing commitment – Permits evolutionary improvement of the approach • Governance – Senior level coordination, direction, and control • Executive leadership capabilities – Change and risk management • Data quality approach inherits (above) – Budget, strategic priorities – Senior level attention and improving topical facility – Reasonable timelines/expectations © Copyright 2022 by Peter Aiken Slide # 41 https://anythingawesome.com https://blog.ducenit.com/data-quality-management Data Quality Is Not a Project • Durable asset – An asset that has a usable life more than one year • Reasonable project deliverables – 90 day increments – Data evolution is measured in years • Data – Evolves - it is not created – Significantly more stable • Readymade data architectural components – Prerequisite to agile development • Only alternative is to create additional data siloes! © Copyright 2022 by Peter Aiken Slide # 42 https://anythingawesome.com
  • 28. Differences Between Programs and Projects • Programs are Ongoing, Projects End – Managing a program involves long term strategic planning and continuous process improvement is not required of a project • Programs are Tied to the Financial Calendar – Program managers are often responsible for delivering results tied to the organization's financial calendar • Program Management is Governance Intensive – Programs are governed by a senior board that provides direction, oversight, and control while projects tend to be less governance-intensive • Programs Have Greater Scope of Financial Management – Projects typically have a straight-forward budget and project financial management is focused on spending to budget while program planning, management and control is significantly more complex • Program Change Management is an Executive Leadership Capability – Projects employ a formal change management process while at the program level, change management requires executive leadership skills and program change is driven more by an organization's strategy and is subject to market conditions and changing business goals © Copyright 2022 by Peter Aiken Slide # Adapted from http://top.idownloadnew.com/program_vs_project/ and http://management.simplicable.com/management/new/program-management-vs-project-management 43 https://anythingawesome.com Your data quality program must last at least as long as your HR program! © Copyright 2022 by Peter Aiken Slide # 44 https://anythingawesome.com NYC Bridge Maintenance •Painting is continuous and optimized •As is much of the remaining maintenance •Ensures a knowledgeable workforce
  • 29. © Copyright 2022 by Peter Aiken Slide # 45 https://anythingawesome.com Making a Better Quality Data Sandwich Data supply Data literacy Standard data Standard data Leverage Point - High Performance Automation © Copyright 2022 by Peter Aiken Slide # Data literacy 46 https://anythingawesome.com Data supply
  • 30. Leverage Point - High Performance Automation © Copyright 2022 by Peter Aiken Slide # Standard data Data supply Data literacy 47 https://anythingawesome.com Leverage Point - High Performance Automation © Copyright 2022 by Peter Aiken Slide # This cannot happen without engineering and architecture! Quality engineering/ architecture work products do not happen accidentally! 48 https://anythingawesome.com Data supply Data literacy Standard data
  • 31. Leverage Point - High Performance Automation © Copyright 2022 by Peter Aiken Slide # This cannot happen without data engineering and architecture! 49 https://anythingawesome.com Quality data engineering/ architecture work products do not happen accidentally! Data supply Data literacy Standard data USS Midway & Pancakes Why Is This an Excellent Example of Engineering? • It is tall • It has a clutch • It was built in 1942 • It is cemented to the floor • It is still in regular use! © Copyright 2022 by Peter Aiken Slide # 50 https://anythingawesome.com
  • 32. Our Barn Had To Pass a Foundation Inspection • Before further construction could proceed • It makes good business sense • No IT equivalent © Copyright 2022 by Peter Aiken Slide # 51 https://anythingawesome.com © Copyright 2022 by Peter Aiken Slide # 52 peter.aiken@anythingawesome.com +1.804.382.5957 Peter Aiken, PhD Data Quality Engineering
  • 33. © Copyright 2022 by Peter Aiken Slide # https://anythingawesome.com 53 Getting Data Quality Right Engineering Success Stories Program • Approaching Data Quality – Definitions – Causes can be difficult to discern – Data quality challenges are the root cause of most IT and business failures – Must be built on leverage – Requires a programmatic approach to be most effective – Early business cases often have a dual purpose – High quality data requires architecture and engineering • What do we need to get better at? – Systems thinking – Not looking at data quality in isolation – Understanding data ROT – Not underestimating the role of culture – Developing repeatable capabilities/core data quality expertise • How do we get better? – Refocus the request around business outcomes – Leadership – Program focus – Math (cost or investment?) – Storytelling/Practice • Takeaways and Q&A © Copyright 2022 by Peter Aiken Slide # https://anythingawesome.com 54 Getting Data Quality Right Engineering Success Stories Program • Approaching Data Quality – Definitions – Causes can be difficult to discern – Data quality challenges are the root cause of most IT and business failures – Must be built on leverage – Requires a programmatic approach to be most effective – Early business cases often have a dual purpose – High quality data requires architecture and engineering • What do we need to get better at? – Systems thinking – Not looking at data quality in isolation – Understanding data ROT – Not underestimating the role of culture – Developing repeatable capabilities/core data quality expertise • How do we get better? – Refocus the request around business outcomes – Leadership – Program focus – Math (cost or investment?) – Storytelling/Practice • Takeaways and Q&A
  • 34. Systems Thinking © Copyright 2022 by Peter Aiken Slide # 55 https://anythingawesome.com http://victorianscandal.wordpress.com/picturesque/rachel-olshausen/ • A framework that is based on the belief that the component parts of a system can best be understood in the context of relationships with other systems, rather than in isolation. • The only way to fully understand why a problem or element occurs and persists is to understand the part in relation to the whole. Capra, F. (1996) The web of life: a new scientific understanding of living systems (1st Anchor Books ed). New York: Anchor Books. p. 30 Outputs Process Input ➜ Process ➜ Output Diagram © Copyright 2022 by Peter Aiken Slide # 56 https://anythingawesome.com Inputs Pizza Make Pizza Dough Water Pizza Crust Make Crust Make Pizza
  • 35. Data Steward Quality Responsibilities • Inputs – From where, do each of these my responsible data items come? – Why are they produced? – What level of quality is required by 'my processes?' • Process – What business processes use the data within my fiduciary responsibility? – For what business purpose do they use each data item? – What role does quality play for my processes to contribute? • Output – What downstream business processes consume data that was under my fiduciary care? – For what purpose are each data items consumed? – What quality attribute are required by each downstream consumer? © Copyright 2022 by Peter Aiken Slide # 57 https://anythingawesome.com Interdependencies © Copyright 2022 by Peter Aiken Slide # 58 https://anythingawesome.com Data Governance CRM (SalesForce) Data Quality
  • 36. © Copyright 2022 by Peter Aiken Slide # 59 https://anythingawesome.com Data Management Body of Knowledge (DM BoK V2) Practice Areas from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International One Option © Copyright 2022 by Peter Aiken Slide # 60 https://anythingawesome.com Data Strategy Data Governance BI/ Warehouse Perfecting operations in 3 data management practice areas 1X 1X 1X Metadata Data Quality from The DAMA Guide to the Data Management Body of Knowledge 2E © 2017 by DAMA International
  • 37. © Copyright 2022 by Peter Aiken Slide # 61 https://anythingawesome.com Separating the Wheat from the Chaff Separating the Wheat From the Chaff © Copyright 2022 by Peter Aiken Slide # 62 https://anythingawesome.com Does organizing data add to its value?
  • 38. Pre-Information Age Metadata • Examples of information architecture achievements that happened well before the information age: – Page numbering – Alphabetical order – Table of contents – Indexes – Lexicons – Maps – Diagrams © Copyright 2022 by Peter Aiken Slide # 63 https://anythingawesome.com Example from: How to make sense of any mess by Abby Covert (2014) ISBN: 1500615994 "While we can arrange things with the intent to communicate certain information, we can't actually make information. Our users do that for us." https://www.youtube.com/watch?v=60oD1TDzAXQ&feature=emb_logo https://www.youtube.com/watch?v=r10Sod44rME&t=1s https://www.youtube.com/watch?v=XD2OkDPAl6s Remove the Structure and Things Fall Apart Rapidly • Better organized data increases in value © Copyright 2022 by Peter Aiken Slide # 64 https://anythingawesome.com
  • 39. Separating the Wheat From the Chaff • Better organized data increases in value • Poor data quality practices are costing organizations money/time/ effort • 80% of organizational data is ROT – Redundant – Obsolete – Trivial • The question is which data to eliminate? – Most enterprise data is never analyzed © Copyright 2022 by Peter Aiken Slide # 65 https://anythingawesome.com Does cleaning all of it make any sense? © Copyright 2022 by Peter Aiken Slide # 66 https://anythingawesome.com Who is best qualified to decide this?
  • 40. 1. Allow the form of the problem to guide the form of the solution 2. Provide a means of decomposing the problem 3. Feature a variety of tools simplifying system understanding 4. Offer a set of strategies for evolving the design of a programmatic solution 5. Provide criteria for evaluating the quality of the various solutions 6. Facilitate development of a framework for developing organizational knowledge © Copyright 2022 by Peter Aiken Slide # 67 https://anythingawesome.com Programmatic Data Quality Engineering © Copyright 2022 by Peter Aiken Slide # 68 https://anythingawesome.com https://en.wikipedia.org/wiki/Theory_of_constraints (TOC) • A management paradigm that views any manageable system as being limited in achieving more of its goals by a small number of constraints (Eliyahu M. Goldratt) • There is always at least one constraint, and TOC uses a focusing process to identify the constraint and restructure the rest of the organization to address it • TOC adopts the common idiom "a chain is no stronger than its weakest link," processes, organizations, etc., are vulnerable because the weakest component can damage or break them or at least adversely affect the outcome
  • 41. The DQE Cycle © Copyright 2022 by Peter Aiken Slide # 69 https://anythingawesome.com • Deming cycle • "Plan-do-study-act" or "plan-do-check-act" – Identifying data issues that are critical to the achievement of business objectives – Defining business requirements for data quality – Identifying key data quality dimensions – Defining business rules critical to ensuring high quality data https://deming.org/explore/pdsa/ The DQE Cycle: (1) Plan © Copyright 2022 by Peter Aiken Slide # 70 https://anythingawesome.com • Plan for the assessment of the current state and identification of key metrics for measuring quality • The data quality engineering team assesses the scope of known issues – Determining cost and impact – Evaluating alternatives for addressing them
  • 42. The DQE Cycle: (2) Deploy © Copyright 2022 by Peter Aiken Slide # 71 https://anythingawesome.com • Deploy processes for measuring and improving the quality of data: • Data profiling – Institute inspections and monitors to identify data issues when they occur – Fix flawed processes that are the root cause of data errors or correct errors downstream – When it is not possible to correct errors at their source, correct them at their earliest point in the data flow The DQE Cycle: (3) Monitor © Copyright 2022 by Peter Aiken Slide # 72 https://anythingawesome.com • Monitor the quality of data as measured against the defined business rules • If data quality meets defined thresholds for acceptability, the processes are in control and the level of data quality meets the business requirements • If data quality falls below acceptability thresholds, notify data stewards so they can take action during the next stage
  • 43. The DQE Cycle: (4) Act © Copyright 2022 by Peter Aiken Slide # 73 https://anythingawesome.com • Act to resolve any identified issues to improve data quality and better meet business expectations • New cycles begin as new data sets come under investigation or as new data quality requirements are identified for existing data sets Starting point for new system development data performance metadata data architecture data architecture and data models shared data updated data corrected data architecture refinements facts & meanings Metadata & Data Storage Starting point for existing systems Metadata Refinement • Correct Structural Defects • Update Implementation Metadata Creation • Define Data Architecture • Define Data Model Structures Metadata Structuring • Implement Data Model Views • Populate Data Model Views Data Refinement • Correct Data Value Defects • Re-store Data Values Data Manipulation • Manipulate Data • Updata Data Data Utilization • Inspect Data • Present Data Data Creation • Create Data • Verify Data Values Data Assessment • Assess Data Values • Assess Metadata Extended Data Life Cycle Model With Metadata Sources and Uses © Copyright 2022 by Peter Aiken Slide # 74 https://anythingawesome.com
  • 44. Data Quality Attributes © Copyright 2022 by Peter Aiken Slide # 75 https://anythingawesome.com © Copyright 2022 by Peter Aiken Slide # https://anythingawesome.com 76 Getting Data Quality Right Engineering Success Stories Program • Approaching Data Quality – Definitions – Causes can be difficult to discern – Data quality challenges are the root cause of most IT and business failures – Must be built on leverage – Requires a programmatic approach to be most effective – Early business cases often have a dual purpose – High quality data requires architecture and engineering • What do we need to get better at? – Systems thinking – Not looking at data quality in isolation – Understanding data ROT – Not underestimating the role of culture – Developing repeatable capabilities/core data quality expertise • How do we get better? – Refocus the request around business outcomes – Leadership – Program focus – Math (cost or investment?) – Storytelling/Practice • Takeaways and Q&A
  • 45. © Copyright 2022 by Peter Aiken Slide # https://anythingawesome.com 77 Getting Data Quality Right Engineering Success Stories Program • Approaching Data Quality – Definitions – Causes can be difficult to discern – Data quality challenges are the root cause of most IT and business failures – Must be built on leverage – Requires a programmatic approach to be most effective – Early business cases often have a dual purpose – High quality data requires architecture and engineering • What do we need to get better at? – Systems thinking – Not looking at data quality in isolation – Understanding data ROT – Not underestimating the role of culture – Developing repeatable capabilities/core data quality expertise • How do we get better? – Refocus the request around business outcomes – Leadership – Program focus – Math (cost or investment?) – Storytelling/Practice • Takeaways and Q&A Compare the Utility of Data Quality Conversation Topics © Copyright 2022 by Peter Aiken Slide # 78 https://anythingawesome.com Engineers say: Business wants to hear: Clean some data Decrease the number of undeliverable targeted marketing ads Reorganize the database Increase the ability of the salesforce to perform their own analyses Develop a taxonomy Create a common vocabulary for the organization Optimize a query Shaved 1 second off a task that runs a billion times a day Reverse engineer the legacy system Understand: what was good about the old system so it can be formally preserved and, what was bad so it can be improved
  • 46. CDO Agenda Inventory Data -> uncovering assets & decreasing ROT Develop the first version of an organizational data strategy Monetize your organization's data © Copyright 2022 by Peter Aiken Slide # 79 https://anythingawesome.com The CDOs goal is to better manage data as an organizational asset in support of the organizational mission! What Is Strategy? • Current use derived from military - a pattern in a stream of decisions [Henry Mintzberg] © Copyright 2022 by Peter Aiken Slide # 80 https://anythingawesome.com A thing
  • 47. Q1 Organizations without a formalized data quality focus Q4 Data Quality Focus: both, simultaneously Q2 Data Quality Focus: Increase organizational efficiencies/effectiveness Improve Operations Innovation Initially Pick One or the Other but Not Both © Copyright 2022 by Peter Aiken Slide # 81 https://anythingawesome.com x x Q3 Data Quality Focus: Use data to create | strategic | opportunities | 12.5 25 37.5 50 Monday Tuesday Wednesday Thursday Friday 48 24 12 6 3 Pandemic Math Question? (A Very Bad Week) • If demand at a 48-bed hospital facility is doubling-daily … • … at what point does anyone notice that the hospital beds are becoming scarce? - Monday 3 beds occupied - Tuesday 6 - Wednesday – ¾ of all beds were available - Yesterday – ½ of all beds were available - Today – zero beds available - Tomorrow …??? © Copyright 2022 by Peter Aiken Slide # 82 https://anythingawesome.com
  • 48. A Musical Analogy That Works for Both Practice and Storytelling © Copyright 2022 by Peter Aiken Slide # 83 https://anythingawesome.com + = https://www.youtube.com/watch?v=4n1GT-VjjVs&frags=pl%2Cwn © Copyright 2022 by Peter Aiken Slide # 84 https://anythingawesome.com Getting Data Quality Right Engineering Success Stories Program • Approaching Data Quality – Definitions – Causes can be difficult to discern – Data quality challenges are the root cause of most IT and business failures – Must be built on leverage – Requires a programmatic approach to be most effective – Early business cases often have a dual purpose – High quality data requires architecture and engineering • What do we need to get better at? – Systems thinking – Not looking at data quality in isolation – Understanding data ROT – Not underestimating the role of culture – Developing repeatable capabilities/core data quality expertise • How do we get better? – Refocus the request around business outcomes – Leadership – Program focus – Math (cost or investment?) – Storytelling/Practice • Takeaways and Q&A
  • 49. © Copyright 2022 by Peter Aiken Slide # 85 https://anythingawesome.com • Information transparency • Analytics • Business Intelligence • Increasing efficiencies • Decreasing costs • Driving holistic decision-making across the organization • Information transparency • Analytics • Business Intelligence • Increasing efficiencies • Decreasing costs • Driving holistic decision-making across the organization High Quality Data is Critical Not Helpful © Copyright 2022 by Peter Aiken Slide # 86 https://anythingawesome.com 1 The project needs to be small Projects should not be allowed to begin unless the data requirements for the entire project are verified 2 The product Owner or sponsor must be highly skilled Few in IT have the requisite data skills and knowledge 3 The process must be agile- ready Agile is a construction technique/ data requires more planning before construction 4 The team must be highly skilled in both the data quality processes and technology Few teams have requisite levels of data skills 5 The organization must be highly skilled at emotional maturity Few organizations understand data stuff Winning Cards for data quality program success
  • 50. Data Things Happen Organizational Things Happen This Approach Only Works if • We know where the data that needs to be fixed–resides • We can communicate precisely and correctly amongst team members, sponsors, collaborators • We are adept with the correct technological support • … © Copyright 2022 by Peter Aiken Slide # 87 https://anythingawesome.com ≈ ≈ ≈ ≈ ≈ ≈ X $ X $ X $ X $ X $ X $ X $ X $ X $ Data Quality Dimensions © Copyright 2022 by Peter Aiken Slide # 88 https://anythingawesome.com
  • 51. Data Value Quality © Copyright 2022 by Peter Aiken Slide # 89 https://anythingawesome.com Data Representation Quality © Copyright 2022 by Peter Aiken Slide # 90 https://anythingawesome.com
  • 52. Data Model Quality © Copyright 2022 by Peter Aiken Slide # 91 https://anythingawesome.com Data Architecture Quality © Copyright 2022 by Peter Aiken Slide # 92 https://anythingawesome.com
  • 53. Upcoming Events Where Data Architecture and Data Governance Collide 11 October 2022 What's in Your Data Warehouse? 8 November 2022 Data Management Best Practices 13 December 2022 © Copyright 2022 by Peter Aiken Slide # 93 https://anythingawesome.com Brought to you by: Time: 19:00 UTC (2:00 PM NYC) | Presented by: Peter Aiken, PhD Note: In this .pdf, clicking any webinar title opens the registration link Event Pricing © Copyright 2022 by Peter Aiken Slide # 94 https://anythingawesome.com • 20% off directly from the publisher on select titles • My Book Store @ http://anythingawesome.com • Enter the code "anythingawesome" at the Technics bookstore checkout where it says to "Apply Coupon" anythingawesome
  • 54. Peter.Aiken@AnythingAwesome.com +1.804.382.5957 Thank You!© Copyright 2022 by Peter Aiken Slide # 95 Book a call with Peter to discuss anything - https://anythingawesome.com/OfficeHours.html Critical Design Review? Hiring Assistance? Reverse Engineering Expertise? Executive Data Literacy Training? Mentoring? Tool/automation evaluation? Use your data more strategically? Independent Verification & Validation Collaboration?