SlideShare una empresa de Scribd logo
1 de 17
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Best Practices in DataOps
How to Create Agile, Automated Data Pipelines
Wayne W. Eckerson
May 8, 2019
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
1. Your data team is flooded with minor request tickets and is burning out.
2. Business users don’t trust the data because it contains too many errors.
3. Source system changes keep breaking your ETL jobs and data pipelines.
4. Business users don’t understand why it takes so long to get data.
5. You have difficulty meeting service level agreements (SLAs).
6. Data analysts write the same jobs and reports with minor variations.
7. Data scientists wait for months for data and computing resources
8. Your company can’t discern the true cost of migrating to the cloud
9. Your data environment is too chaotic to implement predictive analytics
10. Your self-service initiative has spawned hundreds of data silos.
Bonus: Your data lake is more of a data swamp.
Bonus: It’s takes months to deploy a single predictive model.
10 Symptoms You Need DataOps
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
What is DataOps?
LeanTQM
Agile Dev/Ops
• Scrum, Kanban
• Business engagement
• Self-organizing teams
• Retrospectives
• Automation
• Orchestration
• Efficiency
• Simplicity
• Team-based development
• Version control
• Continuous integration/ delivery
• Test-driven development
• Performance management
• Performance metrics
• Continuous monitoring
• Benchmarking
DataOps
“A set of practices, processes, and technologies for building, operationalizing,
automating, and managing data pipelines from source to consumption.”
DataOps = Data Operations
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
DataOps History
DataOps applies rigor of software engineering to the
development and execution of data pipelines.
“Cowboy
Coders”
Team-based
Development
DevOps-based
Development
1960s 1970s 1980s 1990s 2000s 2010s 2020s
First DevOps
event (2009)
Manifesto for Agile Software
Development published (2001)
DevOps DataOps
KEY:
DataOps Manifesto
published (2017)
First DataOps
Event (2019)
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Primary Use Cases
Big Data Data Science Self Service
Data
Warehousing
Standardize and
reuse core data
pipeline components:
ingest, transform,
clean, etc.
Create data science
sandboxes on
demand; deploy
models automatically;
monitor data drift.
Centralize logic and
permissions to
facilitate data access
and analysis while
eliminating data silos
Speed development
by assigning agile
teams to business
groups to build end-
to-end solutions
Agile but ungoverned Governed but not agile
Reuse and
Collaboration
Self Service and
Automation
Governance and
Infrastructure
Speed and
Prioritization
Biggest
Needs
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Adoption
Yes
27%
Somewhat
30%
No
43%
DOES YOUR ORGANIZATION HAVE A
DATAOPS INITIATIVE?
Based on 175 respondents from an Eckerson Group
survey conducted in April, 2019.
32%
29%
10%
9%
7%
6%
5%
2%
1%
0%
IT OR BI DIRE C T OR OR MA NA GE R
IT OR BI A RC H IT E C T ,…
C ONS ULT A NT
BUS INE S S MA NA GE R - A NA LY T IC S
DA T A A NA LY S T OR S C IE NT IS T
BUS INE S S E X E C UT IVE OR …
DA T A E NGINE E R
A C A DE MIC
VE NDOR
DA T A OP S E NGINE E R
RESPONDENT ROLES
18%
15%
11%
29%
26%
VE RY S MA LL < 100 …
S MA LL <500 …
ME DIUM <1, 000 E MP …
LA RGE <10, 000
VE RY LA RGE > …
COMPANY SIZE
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Benefits
Faster cycle time
Fewer data defects
More scalability, reliability
Lower costs
More innovation
Happier customers
Continuous integration/delivery, reuse, automation
Test-driven development and execution
Team-based development, continuous monitoring
Higher development capacity, fewer errors
Focus efforts on value-add solutions and technologies
Get more for less with greater trust and alignment
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Benefits from Survey
60%
55%
50%
50%
48%
47%
47%
42%
FA S TE R CY CLE TI ME S
HA P P I E R B US I NE S S US E RS
DE LI V E R NE W A P P LI CA TI ONS MORE QUI CK LY
FE W E R DE FE CTS A ND E RRORS
I NGE S T NE W DA TA S OURCE S MORE RA P I DLY
FASTER CHANGE REQUESTS
I NCRE A S E D DE V E LOP ME NT CA P A CI TY
I MP ROV E D DA TA GOV E RNA NCE
BENEFITS OF DATAOPS
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Challenges
55%
53%
50%
50%
47%
42%
35%
34%
26%
23%
E S TA B LI S HING FORMA L P ROCE S S E S
ORCHE S TRA TI NG CODE A ND DA TA A CROS S TOOLS
STAFF CAPACI TY
MONI TORI NG THE E ND -T O-E ND E NV I RONME NT
BUI LDI NG RI GOROUS TESTS UPFRONT
LA CK OF A DE QUA TE A UTOMA TI ON TOOLS
GE TTI NG B US I NE S S US E RS TO B UY I NTO THE
P ROCE S S
A DOP TI NG A GI LE ME THODS A ND TE A MS
DA TA I S TOO HA RD TO FI ND
GETTI NG TECHNI CAL USERS TO BUY I N TO THE
P ROCE S S
DATAOPS CHALLENGES
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Components and Tools
58%
54%
53%
50%
50%
46%
46%
41%
38%
32%
28%
A GI LE DE V E LOP ME NT
CONTI NUOUS DE LI V E RY
COLLA B ORA TI ON A ND RE US E
CONTI NUOUS I NTE GRA TI ON
CODE RE P OS I TORY
DATA PI PELI NE ORCHESTRATI ON
P E RFORMA NCE A ND A P P LI CA TI ON MONI TORI NG
CONTI NUOUS TE S TI NG
W ORKFLOW MA NA GE ME NT
CHA NGE MA NA GE ME NT RE QUE S T
CONTA I NE RS A ND ORCHE S TRA TI ON TOOLS
RATE THE IMPORTANCE OF EACH DATAOPS
COMPONENT?
High
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Use Cases
66%
60%
56%
52%
39%
29%
34%
27%
DA T A W A RE H OUS E S A ND MA RT S
RE P ORT ING A ND DA S H BOA RDING
S E LF - S E RVIC E A NA LY S IS
DA T A S C IE NC E A ND MA C H INE LE A RNING
DA T A LA K E
OLA P C UBE S F OR RE P ORT ING A ND A NA LY S IS
C US T OME R- F A CING A P P LIC A T IONS
A UDIT , C OMP LIA NC E , S E C URIT Y
DATAOPS USE CASES
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Best Practices
Form a data department (with a CDO)
Map and assess your data environment
Educate your team about DataOps
Create cross-functional dev teams
Align the teams with business priorities
Continuously review and refine processes
The “Soft
Stuff”
If you don’t have one already Add a CDO for executive clout
Map data flows; assess waste, inefficiencies,
manual processes, error sources, dev capacity.
Expect resistance: ”Data is different!” “Don’t
slow us down!”
Stick with it; “You can’t drive fast w/o brakes.”
Self-organizing, cross-trained; collaborative, agile
teams that build end-to-end solutions
Align agile themes, initiatives, epics, and stories
with business goals; get cross-functional priorities
It’s a journey; benchmark performance and
continuously improve cycle times, capacity,
reuse, and other core objectives.
Pull ”data people” out of IT; unite data engineers,
data scientists, and SW engineers.
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Best Practices (cont)
Start small and build incrementally
Build for reuse
Segregate duties and environments
Test and monitor everything
Use DevOps and DataOps tools
Create a self-service infrastructure
Build for the enterprise
The “Hard
Stuff”
Standardize ingest, transforms, configurations,
code, data sets; use repositories & containers.
Use tools to migrate code from dev to test, to
production environments and segregate duties
Build tests before and after coding; use tests to
monitor and automate data pipelines.
Repositories for data, code, configurations; tools
for agile collaboration, CI/CD, testing, data
catalog, orchestration, data glossary, unification.
Centralize logic; apply permissions for data
access and functionality; automate report and
model deployment; serverless, Kubernetes,
Plan for security, governance, auditability,
scalability, reliability, portability, and continuous
monitoring.
Insist on business representation on the dev
team; get cross-functional priorities monthly
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Summary
DataOps puts
your data on a
solid foundation
• Speeds cycle time,
improves quality,
increases capacity,
reduces cost
Lets your data
team focus on
value-add
• Such as
predictive
analytics,
streaming data,
cloud computing
Increasing
customer
satisfaction and
business value
DataOps is light—
out, automated
data operations.
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Questions?
I’m listening!
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Wayne Eckerson
• 25+ year thought leader in data and analytics
• Sought-after speaker and consultant
• President, Eckerson Group
• Former director of research at TDWI
• Author of hundreds of articles and reports
Performance
Management
BI/Analytics
© Eckerson Group 2019 Twitter: @weckerson www.eckerson.com
Get More Value from
Data and Analytics

Más contenido relacionado

La actualidad más candente

Business Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected ApproachBusiness Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected ApproachDATAVERSITY
 
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...Dr. Arif Wider
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDatabricks
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureDATAVERSITY
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureDATAVERSITY
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureDatabricks
 
DataOps introduction : DataOps is not only DevOps applied to data!
DataOps introduction : DataOps is not only DevOps applied to data!DataOps introduction : DataOps is not only DevOps applied to data!
DataOps introduction : DataOps is not only DevOps applied to data!Adrien Blind
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDATAVERSITY
 
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Tristan Baker
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks DeltaDatabricks
 
How to Use a Semantic Layer to Deliver Actionable Insights at Scale
How to Use a Semantic Layer to Deliver Actionable Insights at ScaleHow to Use a Semantic Layer to Deliver Actionable Insights at Scale
How to Use a Semantic Layer to Deliver Actionable Insights at ScaleDATAVERSITY
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)Adrien Blind
 
The Path to Data and Analytics Modernization
The Path to Data and Analytics ModernizationThe Path to Data and Analytics Modernization
The Path to Data and Analytics ModernizationAnalytics8
 
Getting Started with Databricks SQL Analytics
Getting Started with Databricks SQL AnalyticsGetting Started with Databricks SQL Analytics
Getting Started with Databricks SQL AnalyticsDatabricks
 
The Rise of the DataOps - Dataiku - J On the Beach 2016
The Rise of the DataOps - Dataiku - J On the Beach 2016 The Rise of the DataOps - Dataiku - J On the Beach 2016
The Rise of the DataOps - Dataiku - J On the Beach 2016 Dataiku
 

La actualidad más candente (20)

Business Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected ApproachBusiness Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected Approach
 
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
Data Mesh in Practice - How Europe's Leading Online Platform for Fashion Goes...
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
DataOps with Project Amaterasu
DataOps with Project AmaterasuDataOps with Project Amaterasu
DataOps with Project Amaterasu
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
 
How to Streamline DataOps on AWS
How to Streamline DataOps on AWSHow to Streamline DataOps on AWS
How to Streamline DataOps on AWS
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
DataOps introduction : DataOps is not only DevOps applied to data!
DataOps introduction : DataOps is not only DevOps applied to data!DataOps introduction : DataOps is not only DevOps applied to data!
DataOps introduction : DataOps is not only DevOps applied to data!
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data Architecture
 
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks Delta
 
How to Use a Semantic Layer to Deliver Actionable Insights at Scale
How to Use a Semantic Layer to Deliver Actionable Insights at ScaleHow to Use a Semantic Layer to Deliver Actionable Insights at Scale
How to Use a Semantic Layer to Deliver Actionable Insights at Scale
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
The Path to Data and Analytics Modernization
The Path to Data and Analytics ModernizationThe Path to Data and Analytics Modernization
The Path to Data and Analytics Modernization
 
Getting Started with Databricks SQL Analytics
Getting Started with Databricks SQL AnalyticsGetting Started with Databricks SQL Analytics
Getting Started with Databricks SQL Analytics
 
The Rise of the DataOps - Dataiku - J On the Beach 2016
The Rise of the DataOps - Dataiku - J On the Beach 2016 The Rise of the DataOps - Dataiku - J On the Beach 2016
The Rise of the DataOps - Dataiku - J On the Beach 2016
 

Similar a Best Practices in DataOps: How to Create Agile, Automated Data Pipelines

Predictive and Prescriptive Analytics Expert Session Webinar
Predictive  and Prescriptive Analytics Expert Session Webinar Predictive  and Prescriptive Analytics Expert Session Webinar
Predictive and Prescriptive Analytics Expert Session Webinar ibi
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Creating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use casesCreating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use casesFrank Vullers
 
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...IADSS
 
Making Data Governance Work - Think Big but Start Small
Making Data Governance Work - Think Big but Start SmallMaking Data Governance Work - Think Big but Start Small
Making Data Governance Work - Think Big but Start SmallEarley Information Science
 
Oracle's Modern HR in the Cloud
Oracle's Modern HR in the CloudOracle's Modern HR in the Cloud
Oracle's Modern HR in the CloudJohnHansenHCM
 
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...Precisely
 
Dzr guide to_enterprise_integration
Dzr guide to_enterprise_integrationDzr guide to_enterprise_integration
Dzr guide to_enterprise_integrationHamed Hatami
 
Alluxio Monthly Webinar | Five Disruptive Trends that Every Data & AI Leader...
Alluxio Monthly Webinar | Five Disruptive Trends that Every  Data & AI Leader...Alluxio Monthly Webinar | Five Disruptive Trends that Every  Data & AI Leader...
Alluxio Monthly Webinar | Five Disruptive Trends that Every Data & AI Leader...Alluxio, Inc.
 
2016 Email Marketing Industry Census Webinar
2016 Email Marketing Industry Census Webinar2016 Email Marketing Industry Census Webinar
2016 Email Marketing Industry Census WebinarAdestra
 
DataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best PracticesDataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best PracticesDATAVERSITY
 
Measuring the Business Impact of Learning: Lagging indicators to predictive a...
Measuring the Business Impact of Learning: Lagging indicators to predictive a...Measuring the Business Impact of Learning: Lagging indicators to predictive a...
Measuring the Business Impact of Learning: Lagging indicators to predictive a...Watershed
 
Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...
Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...
Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...Databricks
 
2015 Future of Open Source Survey Results
2015 Future of Open Source Survey Results2015 Future of Open Source Survey Results
2015 Future of Open Source Survey ResultsBlack Duck by Synopsys
 
The Digital Emperor Has No Clothes
The Digital Emperor Has No ClothesThe Digital Emperor Has No Clothes
The Digital Emperor Has No Clothesaccenture
 
Lifecycle Integration with the University of Kentucky
Lifecycle Integration with the University of KentuckyLifecycle Integration with the University of Kentucky
Lifecycle Integration with the University of KentuckySalesforce.org
 
The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of MetadataDATAVERSITY
 

Similar a Best Practices in DataOps: How to Create Agile, Automated Data Pipelines (20)

Best practices in data ops
Best practices in data opsBest practices in data ops
Best practices in data ops
 
Predictive and Prescriptive Analytics Expert Session Webinar
Predictive  and Prescriptive Analytics Expert Session Webinar Predictive  and Prescriptive Analytics Expert Session Webinar
Predictive and Prescriptive Analytics Expert Session Webinar
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Creating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use casesCreating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use cases
 
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
 
Making Data Governance Work - Think Big but Start Small
Making Data Governance Work - Think Big but Start SmallMaking Data Governance Work - Think Big but Start Small
Making Data Governance Work - Think Big but Start Small
 
Oracle's Modern HR in the Cloud
Oracle's Modern HR in the CloudOracle's Modern HR in the Cloud
Oracle's Modern HR in the Cloud
 
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...
Bridging Legacy Systems and Cloud Data Platforms to Unlock Valuable Enterpris...
 
Dzr guide to_enterprise_integration
Dzr guide to_enterprise_integrationDzr guide to_enterprise_integration
Dzr guide to_enterprise_integration
 
Alluxio Monthly Webinar | Five Disruptive Trends that Every Data & AI Leader...
Alluxio Monthly Webinar | Five Disruptive Trends that Every  Data & AI Leader...Alluxio Monthly Webinar | Five Disruptive Trends that Every  Data & AI Leader...
Alluxio Monthly Webinar | Five Disruptive Trends that Every Data & AI Leader...
 
Ba introduction
Ba introductionBa introduction
Ba introduction
 
Ba introduction
Ba introductionBa introduction
Ba introduction
 
2016 Email Marketing Industry Census Webinar
2016 Email Marketing Industry Census Webinar2016 Email Marketing Industry Census Webinar
2016 Email Marketing Industry Census Webinar
 
DataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best PracticesDataEd Slides: Data Management Best Practices
DataEd Slides: Data Management Best Practices
 
Measuring the Business Impact of Learning: Lagging indicators to predictive a...
Measuring the Business Impact of Learning: Lagging indicators to predictive a...Measuring the Business Impact of Learning: Lagging indicators to predictive a...
Measuring the Business Impact of Learning: Lagging indicators to predictive a...
 
Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...
Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...
Transforming Devon’s Data Pipeline with an Open Source Data Hub—Built on Data...
 
2015 Future of Open Source Survey Results
2015 Future of Open Source Survey Results2015 Future of Open Source Survey Results
2015 Future of Open Source Survey Results
 
The Digital Emperor Has No Clothes
The Digital Emperor Has No ClothesThe Digital Emperor Has No Clothes
The Digital Emperor Has No Clothes
 
Lifecycle Integration with the University of Kentucky
Lifecycle Integration with the University of KentuckyLifecycle Integration with the University of Kentucky
Lifecycle Integration with the University of Kentucky
 
The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of Metadata
 

Más de Eric Kavanagh

The Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationThe Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationEric Kavanagh
 
Expediting the Path to Discovery with Multi-Source Analysis
Expediting the Path to Discovery with Multi-Source AnalysisExpediting the Path to Discovery with Multi-Source Analysis
Expediting the Path to Discovery with Multi-Source AnalysisEric Kavanagh
 
Will AI Eliminate Reports and Dashboards
Will AI Eliminate Reports and DashboardsWill AI Eliminate Reports and Dashboards
Will AI Eliminate Reports and DashboardsEric Kavanagh
 
Metadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI ModernizationMetadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI ModernizationEric Kavanagh
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database RoundtableEric Kavanagh
 
Database Survival Guide: Exploratory Webcast
Database Survival Guide: Exploratory WebcastDatabase Survival Guide: Exploratory Webcast
Database Survival Guide: Exploratory WebcastEric Kavanagh
 
Better to Ask Permission? Best Practices for Privacy and Security
Better to Ask Permission? Best Practices for Privacy and SecurityBetter to Ask Permission? Best Practices for Privacy and Security
Better to Ask Permission? Best Practices for Privacy and SecurityEric Kavanagh
 
The Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data GovernanceThe Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data GovernanceEric Kavanagh
 
Best Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal ForecastingBest Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal ForecastingEric Kavanagh
 
A Winning Strategy for the Digital Economy
A Winning Strategy for the Digital EconomyA Winning Strategy for the Digital Economy
A Winning Strategy for the Digital EconomyEric Kavanagh
 
Discovering Big Data in the Fog: Why Catalogs Matter
 Discovering Big Data in the Fog: Why Catalogs Matter Discovering Big Data in the Fog: Why Catalogs Matter
Discovering Big Data in the Fog: Why Catalogs MatterEric Kavanagh
 
Health Check: Maintaining Enterprise BI
Health Check: Maintaining Enterprise BIHealth Check: Maintaining Enterprise BI
Health Check: Maintaining Enterprise BIEric Kavanagh
 
Rapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the RescueRapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the RescueEric Kavanagh
 
Solving the Really Big Tech Problems with IoT
 Solving the Really Big Tech Problems with IoT Solving the Really Big Tech Problems with IoT
Solving the Really Big Tech Problems with IoTEric Kavanagh
 
Beyond the Platform: Enabling Fluid Analysis
Beyond the Platform: Enabling Fluid AnalysisBeyond the Platform: Enabling Fluid Analysis
Beyond the Platform: Enabling Fluid AnalysisEric Kavanagh
 
Protect Your Database: High Availability for High Demand Data
 Protect Your Database: High Availability for High Demand Data Protect Your Database: High Availability for High Demand Data
Protect Your Database: High Availability for High Demand DataEric Kavanagh
 
A Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with DataA Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with DataEric Kavanagh
 
The Key to Effective Analytics: Fast-Returning Queries
The Key to Effective Analytics: Fast-Returning QueriesThe Key to Effective Analytics: Fast-Returning Queries
The Key to Effective Analytics: Fast-Returning QueriesEric Kavanagh
 
A Tight Ship: How Containers and SDS Optimize the Enterprise
 A Tight Ship: How Containers and SDS Optimize the Enterprise A Tight Ship: How Containers and SDS Optimize the Enterprise
A Tight Ship: How Containers and SDS Optimize the EnterpriseEric Kavanagh
 
Application Acceleration: Faster Performance for End Users
Application Acceleration: Faster Performance for End Users	Application Acceleration: Faster Performance for End Users
Application Acceleration: Faster Performance for End Users Eric Kavanagh
 

Más de Eric Kavanagh (20)

The Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data IntegrationThe Future of Data Warehousing and Data Integration
The Future of Data Warehousing and Data Integration
 
Expediting the Path to Discovery with Multi-Source Analysis
Expediting the Path to Discovery with Multi-Source AnalysisExpediting the Path to Discovery with Multi-Source Analysis
Expediting the Path to Discovery with Multi-Source Analysis
 
Will AI Eliminate Reports and Dashboards
Will AI Eliminate Reports and DashboardsWill AI Eliminate Reports and Dashboards
Will AI Eliminate Reports and Dashboards
 
Metadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI ModernizationMetadata Mastery: A Big Step for BI Modernization
Metadata Mastery: A Big Step for BI Modernization
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
Database Survival Guide: Exploratory Webcast
Database Survival Guide: Exploratory WebcastDatabase Survival Guide: Exploratory Webcast
Database Survival Guide: Exploratory Webcast
 
Better to Ask Permission? Best Practices for Privacy and Security
Better to Ask Permission? Best Practices for Privacy and SecurityBetter to Ask Permission? Best Practices for Privacy and Security
Better to Ask Permission? Best Practices for Privacy and Security
 
The Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data GovernanceThe Model Enterprise: A Blueprint for Enterprise Data Governance
The Model Enterprise: A Blueprint for Enterprise Data Governance
 
Best Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal ForecastingBest Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
 
A Winning Strategy for the Digital Economy
A Winning Strategy for the Digital EconomyA Winning Strategy for the Digital Economy
A Winning Strategy for the Digital Economy
 
Discovering Big Data in the Fog: Why Catalogs Matter
 Discovering Big Data in the Fog: Why Catalogs Matter Discovering Big Data in the Fog: Why Catalogs Matter
Discovering Big Data in the Fog: Why Catalogs Matter
 
Health Check: Maintaining Enterprise BI
Health Check: Maintaining Enterprise BIHealth Check: Maintaining Enterprise BI
Health Check: Maintaining Enterprise BI
 
Rapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the RescueRapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the Rescue
 
Solving the Really Big Tech Problems with IoT
 Solving the Really Big Tech Problems with IoT Solving the Really Big Tech Problems with IoT
Solving the Really Big Tech Problems with IoT
 
Beyond the Platform: Enabling Fluid Analysis
Beyond the Platform: Enabling Fluid AnalysisBeyond the Platform: Enabling Fluid Analysis
Beyond the Platform: Enabling Fluid Analysis
 
Protect Your Database: High Availability for High Demand Data
 Protect Your Database: High Availability for High Demand Data Protect Your Database: High Availability for High Demand Data
Protect Your Database: High Availability for High Demand Data
 
A Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with DataA Better Understanding: Solving Business Challenges with Data
A Better Understanding: Solving Business Challenges with Data
 
The Key to Effective Analytics: Fast-Returning Queries
The Key to Effective Analytics: Fast-Returning QueriesThe Key to Effective Analytics: Fast-Returning Queries
The Key to Effective Analytics: Fast-Returning Queries
 
A Tight Ship: How Containers and SDS Optimize the Enterprise
 A Tight Ship: How Containers and SDS Optimize the Enterprise A Tight Ship: How Containers and SDS Optimize the Enterprise
A Tight Ship: How Containers and SDS Optimize the Enterprise
 
Application Acceleration: Faster Performance for End Users
Application Acceleration: Faster Performance for End Users	Application Acceleration: Faster Performance for End Users
Application Acceleration: Faster Performance for End Users
 

Último

Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 

Último (20)

Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

Best Practices in DataOps: How to Create Agile, Automated Data Pipelines

  • 1. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Best Practices in DataOps How to Create Agile, Automated Data Pipelines Wayne W. Eckerson May 8, 2019
  • 2. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com 1. Your data team is flooded with minor request tickets and is burning out. 2. Business users don’t trust the data because it contains too many errors. 3. Source system changes keep breaking your ETL jobs and data pipelines. 4. Business users don’t understand why it takes so long to get data. 5. You have difficulty meeting service level agreements (SLAs). 6. Data analysts write the same jobs and reports with minor variations. 7. Data scientists wait for months for data and computing resources 8. Your company can’t discern the true cost of migrating to the cloud 9. Your data environment is too chaotic to implement predictive analytics 10. Your self-service initiative has spawned hundreds of data silos. Bonus: Your data lake is more of a data swamp. Bonus: It’s takes months to deploy a single predictive model. 10 Symptoms You Need DataOps
  • 3. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com What is DataOps? LeanTQM Agile Dev/Ops • Scrum, Kanban • Business engagement • Self-organizing teams • Retrospectives • Automation • Orchestration • Efficiency • Simplicity • Team-based development • Version control • Continuous integration/ delivery • Test-driven development • Performance management • Performance metrics • Continuous monitoring • Benchmarking DataOps “A set of practices, processes, and technologies for building, operationalizing, automating, and managing data pipelines from source to consumption.” DataOps = Data Operations
  • 4. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com DataOps History DataOps applies rigor of software engineering to the development and execution of data pipelines. “Cowboy Coders” Team-based Development DevOps-based Development 1960s 1970s 1980s 1990s 2000s 2010s 2020s First DevOps event (2009) Manifesto for Agile Software Development published (2001) DevOps DataOps KEY: DataOps Manifesto published (2017) First DataOps Event (2019)
  • 5. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Primary Use Cases Big Data Data Science Self Service Data Warehousing Standardize and reuse core data pipeline components: ingest, transform, clean, etc. Create data science sandboxes on demand; deploy models automatically; monitor data drift. Centralize logic and permissions to facilitate data access and analysis while eliminating data silos Speed development by assigning agile teams to business groups to build end- to-end solutions Agile but ungoverned Governed but not agile Reuse and Collaboration Self Service and Automation Governance and Infrastructure Speed and Prioritization Biggest Needs
  • 6. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Adoption Yes 27% Somewhat 30% No 43% DOES YOUR ORGANIZATION HAVE A DATAOPS INITIATIVE? Based on 175 respondents from an Eckerson Group survey conducted in April, 2019. 32% 29% 10% 9% 7% 6% 5% 2% 1% 0% IT OR BI DIRE C T OR OR MA NA GE R IT OR BI A RC H IT E C T ,… C ONS ULT A NT BUS INE S S MA NA GE R - A NA LY T IC S DA T A A NA LY S T OR S C IE NT IS T BUS INE S S E X E C UT IVE OR … DA T A E NGINE E R A C A DE MIC VE NDOR DA T A OP S E NGINE E R RESPONDENT ROLES 18% 15% 11% 29% 26% VE RY S MA LL < 100 … S MA LL <500 … ME DIUM <1, 000 E MP … LA RGE <10, 000 VE RY LA RGE > … COMPANY SIZE
  • 7. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Benefits Faster cycle time Fewer data defects More scalability, reliability Lower costs More innovation Happier customers Continuous integration/delivery, reuse, automation Test-driven development and execution Team-based development, continuous monitoring Higher development capacity, fewer errors Focus efforts on value-add solutions and technologies Get more for less with greater trust and alignment
  • 8. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Benefits from Survey 60% 55% 50% 50% 48% 47% 47% 42% FA S TE R CY CLE TI ME S HA P P I E R B US I NE S S US E RS DE LI V E R NE W A P P LI CA TI ONS MORE QUI CK LY FE W E R DE FE CTS A ND E RRORS I NGE S T NE W DA TA S OURCE S MORE RA P I DLY FASTER CHANGE REQUESTS I NCRE A S E D DE V E LOP ME NT CA P A CI TY I MP ROV E D DA TA GOV E RNA NCE BENEFITS OF DATAOPS
  • 9. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Challenges 55% 53% 50% 50% 47% 42% 35% 34% 26% 23% E S TA B LI S HING FORMA L P ROCE S S E S ORCHE S TRA TI NG CODE A ND DA TA A CROS S TOOLS STAFF CAPACI TY MONI TORI NG THE E ND -T O-E ND E NV I RONME NT BUI LDI NG RI GOROUS TESTS UPFRONT LA CK OF A DE QUA TE A UTOMA TI ON TOOLS GE TTI NG B US I NE S S US E RS TO B UY I NTO THE P ROCE S S A DOP TI NG A GI LE ME THODS A ND TE A MS DA TA I S TOO HA RD TO FI ND GETTI NG TECHNI CAL USERS TO BUY I N TO THE P ROCE S S DATAOPS CHALLENGES
  • 10. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Components and Tools 58% 54% 53% 50% 50% 46% 46% 41% 38% 32% 28% A GI LE DE V E LOP ME NT CONTI NUOUS DE LI V E RY COLLA B ORA TI ON A ND RE US E CONTI NUOUS I NTE GRA TI ON CODE RE P OS I TORY DATA PI PELI NE ORCHESTRATI ON P E RFORMA NCE A ND A P P LI CA TI ON MONI TORI NG CONTI NUOUS TE S TI NG W ORKFLOW MA NA GE ME NT CHA NGE MA NA GE ME NT RE QUE S T CONTA I NE RS A ND ORCHE S TRA TI ON TOOLS RATE THE IMPORTANCE OF EACH DATAOPS COMPONENT? High
  • 11. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Use Cases 66% 60% 56% 52% 39% 29% 34% 27% DA T A W A RE H OUS E S A ND MA RT S RE P ORT ING A ND DA S H BOA RDING S E LF - S E RVIC E A NA LY S IS DA T A S C IE NC E A ND MA C H INE LE A RNING DA T A LA K E OLA P C UBE S F OR RE P ORT ING A ND A NA LY S IS C US T OME R- F A CING A P P LIC A T IONS A UDIT , C OMP LIA NC E , S E C URIT Y DATAOPS USE CASES
  • 12. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Best Practices Form a data department (with a CDO) Map and assess your data environment Educate your team about DataOps Create cross-functional dev teams Align the teams with business priorities Continuously review and refine processes The “Soft Stuff” If you don’t have one already Add a CDO for executive clout Map data flows; assess waste, inefficiencies, manual processes, error sources, dev capacity. Expect resistance: ”Data is different!” “Don’t slow us down!” Stick with it; “You can’t drive fast w/o brakes.” Self-organizing, cross-trained; collaborative, agile teams that build end-to-end solutions Align agile themes, initiatives, epics, and stories with business goals; get cross-functional priorities It’s a journey; benchmark performance and continuously improve cycle times, capacity, reuse, and other core objectives. Pull ”data people” out of IT; unite data engineers, data scientists, and SW engineers.
  • 13. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Best Practices (cont) Start small and build incrementally Build for reuse Segregate duties and environments Test and monitor everything Use DevOps and DataOps tools Create a self-service infrastructure Build for the enterprise The “Hard Stuff” Standardize ingest, transforms, configurations, code, data sets; use repositories & containers. Use tools to migrate code from dev to test, to production environments and segregate duties Build tests before and after coding; use tests to monitor and automate data pipelines. Repositories for data, code, configurations; tools for agile collaboration, CI/CD, testing, data catalog, orchestration, data glossary, unification. Centralize logic; apply permissions for data access and functionality; automate report and model deployment; serverless, Kubernetes, Plan for security, governance, auditability, scalability, reliability, portability, and continuous monitoring. Insist on business representation on the dev team; get cross-functional priorities monthly
  • 14. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Summary DataOps puts your data on a solid foundation • Speeds cycle time, improves quality, increases capacity, reduces cost Lets your data team focus on value-add • Such as predictive analytics, streaming data, cloud computing Increasing customer satisfaction and business value DataOps is light— out, automated data operations.
  • 15. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Questions? I’m listening!
  • 16. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Wayne Eckerson • 25+ year thought leader in data and analytics • Sought-after speaker and consultant • President, Eckerson Group • Former director of research at TDWI • Author of hundreds of articles and reports Performance Management BI/Analytics
  • 17. © Eckerson Group 2019 Twitter: @weckerson www.eckerson.com Get More Value from Data and Analytics