© 2022 Thoughtworks | Confidential
By Thoughtworks
Building data as a product: the key to unlocking Data Mesh's potential
Harmeet Sokhi
Vishal Srivastava
● Rich experience in implementing data platforms
● Passionate about data mesh
● Member and contributor of the Global Data Mesh guild at Thoughtworks
● Design and implement data engineering platforms, data products and machine learning architectures
● Passionate about helping organizations unleash the potential of data through democratisation
● Co-organizer of the Data Engineering Melbourne meetup
Agenda
● Story of Essence Financials
● K’s new use case
● Data product plot
● Filling the gaps
● Conclusion
Essence Financials
Vision: to keep its market position as a leading financial services provider
Reflection of the past: ambitions and challenges
● Significant investment in data
● The ambition was to become a data-oriented organisation
● Data teams were not able to keep up with demand
● Data trustworthiness was questionable
● Reliance on a central data engineering team
● Siloed and hyper-specialized ownership
Data Mesh to the rescue
● Decentralized
● Socio-technical
● Analytical data
● Scale
Why did Essence Financials move away from “data sets”?
Data Product > Dataset
Data that is:
● Discoverable
● Addressable
● Understandable
● Trustworthy
● Natively accessible
● Interoperable
● Valuable
● Secure
Data Mesh to the rescue
Essence Financials benefited from four interconnected and non-negotiable pillars:
● Domain-oriented ownership
● Data as a product
● Self-serve data platform
● Federated computational governance
Dehghani, Z. (2022). Data Mesh: Delivering Data-Driven Value at Scale (1st ed.). O’Reilly Media.
Logical architecture of “a” Data Product in Essence Financials
● Input port(s)
● Transformation
● Output port(s)
● Governance
● Platform
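The building blocks above can be pictured in code. This is a minimal sketch rather than the Essence Financials implementation: the `DataProduct` class, the in-memory `customer_source` and the `tidy_names` transformation are all hypothetical, and governance and platform concerns are deliberately left out.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Record = Dict[str, object]


@dataclass
class DataProduct:
    """One data product: input ports, a transformation, output ports."""
    name: str
    input_ports: List[Callable[[], List[Record]]]
    transformation: Callable[[List[Record]], List[Record]]
    output_ports: Dict[str, List[Record]] = field(default_factory=dict)

    def run(self) -> None:
        # Read every input port, transform, and publish on one output port.
        records = [r for port in self.input_ports for r in port()]
        self.output_ports["default"] = self.transformation(records)


def customer_source() -> List[Record]:
    # Hypothetical in-memory stand-in for a real source system.
    return [{"id": 1, "name": " Ada "}, {"id": 2, "name": "Lin"}]


def tidy_names(records: List[Record]) -> List[Record]:
    # Hypothetical business logic: trim whitespace around names.
    return [{**r, "name": str(r["name"]).strip()} for r in records]


product = DataProduct("CUSTOMER INFO", [customer_source], tidy_names)
product.run()
print(product.output_ports["default"])
```

Keeping ports as plain callables and dictionaries makes the shape easy to see; a real product would read from and publish to platform-managed connectors.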
Data Product: Categories in Essence Financials
● Source-oriented Data Product
● Fit-for-purpose Data Product
● Customer-oriented Data Product
Data Product: Typical deployment in Essence Financials
● Data Product Owner
● Problem / use case
● Data Product Developers
● Data Product Code (business logic)
● Data Product Model Spec (platform and governance configuration)
● Utility Deploy
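The slides do not show what a Data Product Model Spec looks like. As a sketch only, a deploy utility might check that a spec carries both platform and governance configuration before deploying. Every section and field name below is an assumption made for illustration.

```python
# Hypothetical top-level sections a spec must declare before deployment.
REQUIRED_SECTIONS = {"name", "owner", "platform", "governance"}


def missing_sections(spec: dict) -> set:
    """Return the required top-level sections absent from the spec."""
    return REQUIRED_SECTIONS - set(spec)


# Hypothetical spec; "governance" is left out on purpose to show a failed check.
spec = {
    "name": "promote-prime-loans",
    "owner": "marketing",
    "platform": {"orchestration": "managed-airflow", "storage": "snowflake"},
}

print(missing_sections(spec))
```

A check like this keeps business logic (the code) separate from platform and governance settings (the spec), which is the split the deployment slide describes.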
Process after adoption
Tying it back: Designing a Data Product
Data Strategy: Data Mesh is the right architectural choice
Steps: Use Case, Define, Gap analysis, Build & Measure
K’s new use case
Use Case: Promote prime loans
Elevator Pitch
FOR the Marketing team
WHO want to sell high-value loans
THE “Promote Prime Loans”
IS A recommendation engine
THAT will allow them to spend more time engaging with relevant customers
UNLIKE now, when they send blanket communication to everyone
OUR PRODUCT results in more conversions as well as saving time spent in operations
Define
● Data Landscape
● SLO
● Data Product Map
● CFRs
Data Product: Data Landscape
Understand current state, pains and aspirations
● Data: Customer information, Customer transaction history
● Stages: Acquire, Curate, Model, Productionize & Execute
Data Product: SLOs
Service Level Objectives assist in determining technical design. Each question is answered on a scale, listed here from most to least demanding:
● How often is it used? Every second; every hour; several times a day; daily; weekly; monthly+
● How many people or services use it? Millions; thousands; hundreds; dozens; a couple of teams; just a few people
● How complete does the data need to be? 100% complete; mostly complete; can tolerate missing data
● How often is it updated? Every second; every hour; several times a day; daily; weekly; monthly+
● When does it need to be updated? 24/7; extended workday; during one part of the day
● How accurate does the data need to be? Perfectly accurate; some room for error; as long as it is representative
● How fresh does the data need to be? Real time; within minutes; within hours; within the day; within the week; within the month
PROMOTE PRIME LOANS: records with 80% complete data with 100% accuracy are refreshed daily
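An SLO such as the one for PROMOTE PRIME LOANS can be turned into an automated check. The sketch below is one possible reading, not the authors' implementation: it assumes "80% complete" means the share of non-missing field values and "refreshed daily" means the last refresh is at most one day old. Accuracy is not modelled because it needs a ground truth to compare against. All names and sample records are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Slo:
    min_completeness: float  # minimum share of non-missing field values
    max_age: timedelta       # how stale the last refresh may be


def completeness(records: List[Dict[str, Optional[object]]]) -> float:
    """Fraction of field values that are not None across all records."""
    values = [v for r in records for v in r.values()]
    return sum(v is not None for v in values) / len(values) if values else 0.0


def meets_slo(records, last_refresh: datetime, now: datetime, slo: Slo) -> bool:
    """True when the data is both fresh enough and complete enough."""
    fresh = now - last_refresh <= slo.max_age
    return fresh and completeness(records) >= slo.min_completeness


# Assumed encoding of the slide's SLO: 80% complete, refreshed daily.
promote_prime_loans = Slo(min_completeness=0.80, max_age=timedelta(days=1))

# Made-up sample records; one of six field values is missing.
records = [
    {"customer_id": 1, "income": 90_000, "repayment_status": "good"},
    {"customer_id": 2, "income": None, "repayment_status": "good"},
]
now = datetime(2022, 6, 2, 9, 0)
print(meets_slo(records, datetime(2022, 6, 1, 23, 0), now, promote_prime_loans))
```

Writing the objective as data (`Slo`) rather than as code lets the same check run against every data product with different thresholds.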
Data Product: Integrated Data Product Map
What could be the different types of data products?
The map is laid out under these headings: Sources; Source-oriented Data Product; Fit-for-purpose Data Product; Customer-oriented Data Product; Operational; Reporting/Analytics.
● Start from the customer-oriented data product: PROMOTE PRIME LOANS (Domain: Marketing, Subdomain: Promotion), shown with Salesforce and Snowflake.
● Now let's focus on the source-oriented data products and their attributes: CUSTOMER INFO (source: Customer Information) and CUSTOMER REPAYMENT HISTORY (source: Customer History Table), shown with Domain: Customer, Subdomain: Payment.
● We also need an intermediate data product: CUSTOMER REPAYMENT STATUS.
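The product map is effectively a dependency graph, so a build order can be derived from it. The sketch below uses Python's standard `graphlib` module (Python 3.9+). The edges are an assumed reading of the map and are not stated explicitly on the slides.

```python
from graphlib import TopologicalSorter

# Each data product maps to the upstream sources or products it reads from.
# These edges are an assumption made for illustration.
upstream = {
    "CUSTOMER INFO": {"Customer Information"},
    "CUSTOMER REPAYMENT HISTORY": {"Customer History Table"},
    "CUSTOMER REPAYMENT STATUS": {"CUSTOMER REPAYMENT HISTORY"},
    "PROMOTE PRIME LOANS": {"CUSTOMER INFO", "CUSTOMER REPAYMENT STATUS"},
}

# static_order() yields every node after all of its upstream dependencies.
build_order = list(TopologicalSorter(upstream).static_order())
print(build_order)
```

Working backwards from the customer-oriented product, as the slides do, fills in the dictionary; the topological sort then gives an order in which the products can be built.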
Data Product: Cross-Functional Requirements (CFRs)
● Data access policy
● Data standardisation rules
● Data classification rules
● Data retention policy
● TFN
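Classification rules and an access policy can be applied together when data is served. The sketch below assumes that TFN on the slide refers to a tax file number that is treated as restricted. The labels, roles and sample record are hypothetical, and unclassified columns are masked by default.

```python
# Hypothetical classification rules: column name -> sensitivity label.
CLASSIFICATION = {"customer_id": "internal", "name": "internal", "tfn": "restricted"}

# Hypothetical access policy: role -> labels that role may read in clear text.
ACCESS_POLICY = {"marketing": {"internal"}, "compliance": {"internal", "restricted"}}


def mask(value: str) -> str:
    """Hide all but the last three characters; hide short values entirely."""
    keep = value[-3:] if len(value) > 3 else ""
    return "*" * (len(value) - len(keep)) + keep


def apply_policy(record: dict, role: str) -> dict:
    """Return a copy of the record with disallowed columns masked."""
    allowed = ACCESS_POLICY.get(role, set())
    return {
        col: val if CLASSIFICATION.get(col, "restricted") in allowed else mask(str(val))
        for col, val in record.items()
    }


# Made-up record; the TFN value is not a real identifier.
record = {"customer_id": 42, "name": "Ada", "tfn": "123456789"}
print(apply_policy(record, "marketing"))
print(apply_policy(record, "compliance"))
```

Defaulting unknown columns and unknown roles to the most restrictive treatment means a missing rule hides data instead of exposing it.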
Architecture Gap Analysis
● Data Product Architecture
● Governance
● Platform Capabilities
Data Product: Gap Analysis
PROMOTE PRIME LOANS
● Connectors: Snowflake connector, Salesforce connector, Snowflake connector
● Transformation: data product code
● Governance
● Platform capabilities
● Orchestration: Managed Airflow
● Data storage: Snowflake, Marketing data mart
Data Product: Promote Prime Loans
● Sources: Customer Information, Customer transaction history
● Source-oriented data products: CUSTOMER INFO, CUSTOMER REPAYMENT HISTORY
● Fit-for-purpose data product: CUSTOMER REPAYMENT STATUS
● Customer-oriented data product: PROMOTE PRIME LOANS
● Salesforce, Snowflake
● Governance
● Platform capabilities: Managed Airflow, Snowflake, CI/CD, Observability
Agenda
37
Story of Essence Financials
M’s new use case
Data product plot
Conclusion
Filling the gaps
© 2022 Thoughtworks | Confidential
/Build & Measure
38
Gap analysis
Use Case
Build &
Measure
Define
© 2022 Thoughtworks | Confidential
Build & Measure
39
Gap analysis
Use Case
Build &
Measure
(
Define
Learn
Build
Measure
© 2022 Thoughtworks | Confidential
Build & Measure
40
Gap analysis
Use Case
Build &
Measure
(
Define
Data Product Score
Rule based User feedback
KPIs
SLOs
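The slides name the inputs to a Data Product Score but not how they are combined. One simple option, shown here purely as an assumption, is a weighted average of component scores normalised to the range 0 to 1. The equal weights and sample values are hypothetical.

```python
def data_product_score(components: dict, weights: dict) -> float:
    """Weighted average of component scores, each expected in [0, 1]."""
    total_weight = sum(weights[name] for name in components)
    return sum(components[name] * weights[name] for name in components) / total_weight


# Hypothetical equal weighting of the four inputs named on the slide.
weights = {"rule_based": 0.25, "user_feedback": 0.25, "kpis": 0.25, "slos": 0.25}

# Made-up component scores for one measurement period.
components = {"rule_based": 1.0, "user_feedback": 0.5, "kpis": 0.5, "slos": 1.0}

score = data_product_score(components, weights)
print(score)
```

Tracking the score over time supports the Build, Measure, Learn loop: a falling component points to where the next iteration should focus.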
Happy K
Recap
● Use Case
● Define: Data Landscape, SLO, Data Product Map, CFRs
● Gap analysis: Data Product Architecture, Governance, Platform Capabilities
● Build & Measure: Build, Measure, Learn
“A journey of a thousand miles starts with a single step”
- Lao Tzu
We look forward to working with you
Harmeet Kaur Sokhi
Lead Data Consultant
harmeet.sokhi@thoughtworks.com
Vishal Srivastava
Senior Data Consultant
vishal.srivastava@thoughtworks.com
Please share your feedback