SlideShare una empresa de Scribd logo
1 de 26
Specification-Driven Data Mesh
Sion Smith CTO - oso.sh
Neil Avery CTO - liquidlabs.com
Introducing
The Enterprise Guide to
Building a Data Mesh
About us
2
Introduction
Sion Smith
CTO, OSO
15 years consulting experience solving
complex problems with various cloud and
programming technologies
Neil Avery
CTO, Liquidlabs
Distributed systems, previously
Confluent, Luxoft, startups and
others
Emerging Technology
Current State of Play The Spec Mesh Way
Agenda
3
Developer Tooling
● The Gartner Hype Cycle
● Foundations of Data Mesh
● Evolution of central
nervous system
● Domain mapping out of the
box
● Example specification
● Data Mesh lifecycle
management
● Features of Spec Mesh
● Screenshots
● Developer roadmap
Our Data Mesh Journey
Is the Hype Real?
5
State of Play
Gartner, Hype Cycle for Data Management, 2022, Donald Feinberg, Philip Russom, Nina Showell, 30 June 2022
https://www.denodo.com/en/document/analyst-report/2022-gartner-hype-cycle-data-management
Data Hub
Strategy
Data Integration
Tools
Data Lakes
Data
Engineering
Data Ops
Data
Mesh
Expectations
Time
Innovation trigger
Peak of
inflated
expectations
Trough of
disillusionment
Slope of
enlightenment
Plateau of
productivity
Gartner Hype Cycle for Data Management - 2022
Four pillars of Data Mesh
6
State of Play
Data as a Product
Self-serve data
infrastructure as a
platform
Federated computational
Governance
Domain-oriented
decentralised
data ownership &
architecture
https://martinfowler.com/articles/data-mesh-principles.html
Is Data Mesh really new?
7
State of Play
+ Data mesh incremental evolution of style of architecture we
have been building for several years for event streaming
+ A mature data streaming system adopts a central nervous
system
+ Can we build a data mesh around event streaming principles?
+ A central nervous system models topics with a domain structure
and federated computational governance
Introducing:
+ An agreement / contract for data mesh using a specification
Central
data team
Domain teams
Stakeholders
So where does Data Mesh fit?
8
State of Play
Investment & Time
Value
4
5
Early Interest
Central Nervous
System
Mission critical,
but disparate
LOBs
Identify a project
Mission-critical,
connected LOBs
Projects Platform
Single
solution
Scalable pipeline
Pub/Sub
Clusters of reuse,
real-time
analytics
Platform effect:
reuse of data,
efficiencies of
scale
Enterprise Data-as-a
Product. Event-driven
architecture
3
2
1
This is used
throughout large
enterprise in
production
Confluent maturity model for event driven architecture
https://www.confluent.io/resources/5-stages-streaming-platform-adoption/
Data Mesh should not boil the ocean
9
Introducing SpecMesh
Specification-Driven Data Mesh
CNS Patterns applied to Data Mesh
1. Events and storage comprise data platform fundamentals required to build
almost anything
2. Events and storage already exist and that won’t change
3. Organise data resources hierarchically
4. Enforcing a domain model is need to control complexity and cope with scale
5. Orgs have a data quality function
6. Orgs have a data catalogue function
7. Registry models are required to model data (and support data evolution etc)
8. Most organisations suffer from inconsistent mechanisms for data-as-an-api
11
12
Spec Mesh
Supporting the pillars of Data Mesh
Features
Domain
ownership
Self-serve
Data as a
Product
Federated
Computational
Governance
Spec driven (async-api spec) ✔ ✔
SDLC plugins (unit-test, integration-test) ✔
SDLC plugins (provision - terraform) ✔ ✔ ✔
3rd party data catalogue ✔ ✔ ✔
Big Data London is our data product
Spec Mesh
13
14
Data Model
Spec Mesh
/ london / borough / venue / event
/
retail /
/ transport /
/ accommodation /
/ services /
/ london / hammersmith / olympia / bigdatalondon / public / attendee
/ london / hammersmith / transport / public / tube
/ london / heathrow / transport / public / airport
/ london / hammersmith / olympia / bigdatalondon / vendor / terra / public / visitor
/ london / hammersmith / olympia / bigdatalondon / retailer / subway / public / purchase
asyncapi: '2.4.0'
id: 'urn:london:hammersmith:olympia:bigdatalondon'
Info:
title: BigDataLondon API
version: '1.0.0'
description: Simple model of BigDataLondon as a Data Product
Servers:
test:
url: test.mykafkacluster.org:8092
protocol: kafka-secure
channels:
public/attendee:
Publish:
summary: Humans arriving
Message:
name: Human
"tags": [
"name": "human",
"name": "big data london"]
Payload:
type: object
Properties:
Id:
type: integer
minimum: 0
description: Id of the human
Age:
type: integer
15
Spec Mesh
16
Spec Mesh
Data Mesh as Code: Spec Mesh
Capture Provision Serve
>> >>
17
Spec Mesh
Capture
● Create repo from git template
● Async-api spec
● Apply the domain model/structure
● Specify data event streams & storage
● Model the data entities (schemas)
● Apply tags to data entities
● Permission data as
public/protected/private
● Write tests and build your product
Provision
Spec
Data
Catalogue
Spec Mesh
Capture Provision Serve
What is Spec Mesh?
Provision
18
What is Spec Mesh?
Spec Mesh
Provision
Spec
Data
Catalogue
Spec Mesh
Capture Provision Serve
● Pipeline driven provisioning of data product
based on Specification
● Includes domain structure, governance and
permissions
● Data catalogue automatically updated
(including tags)
● Data schema’s published to registry
19
What is Spec Mesh?
Spec Mesh
Provision
Spec
Data
Catalogue
Spec Mesh
Capture Provision Serve
Public
Private
Restricted
Serve
● Governance integrated via underlying Data
Platform. I.e. Kafka ACLs
● Data catalogue supports discovery (via tags),
and access requests
● Data resources are structured using public,
private and protected scope
20
Spec Mesh conceptual
Spec Mesh
Data
Catalogue
Data
Product A
Spec
Data
Product B
Spec
Serve
Public
Private
Restricted
Developer Tooling
22
Why use Spec Mesh?
Developer Tooling
Developer focused tooling:. This is not some platform
Abstract away complexities: Leverage layers of abstraction to create
repeatable patterns
Unit testing specifications: Java based tooling supports development
lifecycle
Tooling support flexibility: Using configurable guardrails
Increased extensibility: Interoperate with existing Data Management
tooling and modules
Icons: www.streamlinehq.com
23
What do you get out of the box?
Developer Tooling
Spec
Specification as Code
[ ]
Data
Provider
Provisioning Workflow Data Mesh
Data
Product A
Data
Product B
Data
Product Y
Data
Product Z
Build Apply
JSON
Data Product
Owner
24
Demo
Developer Tooling
Provisioning summary
Specification
output (JSON)
Kafka Infrastructure
Add title
Beyond
Community driven
Help us shape the roadmap by
suggesting and contributing to
features!
Q1 - 2023
Extensions
● More Kafka extensions
supported (incl provisioning
support): Quotas, Storage &
ACLs for Public, Private,
Protected resources (basic
governance)
Data Catalogue
● Initial integration to existing
products to support discovery
and basic governance
Storage
● Storage resource mapping
Governance
● Delegation to a Governance API
where protected topics are
specified and requested
Proof of Value - Q4 22
Developer Experience
● Local testing support (Java &
Python) - Test containers via
gradle plugin
● SpecMesh gradle plugin to use
HCL Terraform & state mgmt
Domain model
● Domain modelling built into
Spec.Id field
Build Pipeline
● Executions pushes the
APISpec to apply orthogonal
resource allocation
Governance
● Manual governance of ‘public’
resources through Credential
creation (API Keys + ACLs)
Extensions
● Kafka support for Async API
Q2 - 2023
Data As A Product
● Data Product Open Registry;
simple Data Catalogue with
Discovery and tagging (Spec
resources contain tags - show
example)
Observability
● Topology view (derived from
specs showing
product/consume data flows
and ownership visualisation)
Bolt-ons
● Multi-region
● Cross resource
● SMT
● DQ integration + other simple
additions
25
Developer Tooling
Roadmap
Thank you
github.com/specmesh
@osodevops
@avery_neil

Más contenido relacionado

Similar a The Enterprise Guide to Building a Data Mesh - Introducing SpecMesh

Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo
 
Bigdata.sunil_6+yearsExp
Bigdata.sunil_6+yearsExpBigdata.sunil_6+yearsExp
Bigdata.sunil_6+yearsExp
bigdata sunil
 
Sourav_Giri_Resume_2015
Sourav_Giri_Resume_2015Sourav_Giri_Resume_2015
Sourav_Giri_Resume_2015
sourav giri
 

Similar a The Enterprise Guide to Building a Data Mesh - Introducing SpecMesh (20)

Microservices Patterns with GoldenGate
Microservices Patterns with GoldenGateMicroservices Patterns with GoldenGate
Microservices Patterns with GoldenGate
 
Analytics&IoT
Analytics&IoTAnalytics&IoT
Analytics&IoT
 
DEVNET-1140 InterCloud Mapreduce and Spark Workload Migration and Sharing: Fi...
DEVNET-1140	InterCloud Mapreduce and Spark Workload Migration and Sharing: Fi...DEVNET-1140	InterCloud Mapreduce and Spark Workload Migration and Sharing: Fi...
DEVNET-1140 InterCloud Mapreduce and Spark Workload Migration and Sharing: Fi...
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
GDG Cloud Southlake #16: Priyanka Vergadia: Scalable Data Analytics in Google...
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
 
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
Data Engineer, Patterns & Architecture The future: Deep-dive into Microservic...
 
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data InitiativeBig Data Fabric: A Necessity For Any Successful Big Data Initiative
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
 
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
 
Cisco project ideas
Cisco   project ideasCisco   project ideas
Cisco project ideas
 
Bigdata.sunil_6+yearsExp
Bigdata.sunil_6+yearsExpBigdata.sunil_6+yearsExp
Bigdata.sunil_6+yearsExp
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at Helixa
 
Session 8 - Creating Data Processing Services | Train the Trainers Program
Session 8 - Creating Data Processing Services | Train the Trainers ProgramSession 8 - Creating Data Processing Services | Train the Trainers Program
Session 8 - Creating Data Processing Services | Train the Trainers Program
 
Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)Data Virtualization: Introduction and Business Value (UK)
Data Virtualization: Introduction and Business Value (UK)
 
Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3
 
Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & Bénéfices
 
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Belgium & Luxembourg dedicated online Data Virtualization discovery workshopBelgium & Luxembourg dedicated online Data Virtualization discovery workshop
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
 
Sourav_Giri_Resume_2015
Sourav_Giri_Resume_2015Sourav_Giri_Resume_2015
Sourav_Giri_Resume_2015
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 

The Enterprise Guide to Building a Data Mesh - Introducing SpecMesh

  • 1. Specification-Driven Data Mesh Sion Smith CTO - oso.sh Neil Avery CTO - liquidlabs.com Introducing The Enterprise Guide to Building a Data Mesh
  • 2. About us 2 Introduction Sion Smith CTO, OSO 15 years consulting experience solving complex problems with various cloud and programming technologies Neil Avery CTO, Liquidlabs Distributed systems, previously Confluent, Luxoft, startups and others Emerging Technology
  • 3. Current State of Play The Spec Mesh Way Agenda 3 Developer Tooling ● The Gartner Hype Cycle ● Foundations of Data Mesh ● Evolution of central nervous system ● Domain mapping out of the box ● Example specification ● Data Mesh lifecycle management ● Features of Spec Mesh ● Screenshots ● Developer roadmap
  • 4. Our Data Mesh Journey
  • 5. Is the Hype Real? 5 State of Play Gartner, Hype Cycle for Data Management, 2022, Donald Feinberg, Philip Russom, Nina Showell, 30 June 2022 https://www.denodo.com/en/document/analyst-report/2022-gartner-hype-cycle-data-management Data Hub Strategy Data Integration Tools Data Lakes Data Engineering Data Ops Data Mesh Expectations Time Innovation trigger Peak of inflated expectations Trough of disillusionment Slope of enlightenment Plateau of productivity Gartner Hype Cycle for Data Management - 2022
  • 6. Four pillars of Data Mesh 6 State of Play Data as a Product Self-serve data infrastructure as a platform Federated computational Governance Domain-oriented decentralised data ownership & architecture https://martinfowler.com/articles/data-mesh-principles.html
  • 7. Is Data Mesh really new? 7 State of Play + Data mesh incremental evolution of style of architecture we have been building for several years for event streaming + A mature data streaming system adopts a central nervous system + Can we build a data mesh around event streaming principles? + A central nervous system models topics with a domain structure and federated computational governance Introducing: + An agreement / contract for data mesh using a specification Central data team Domain teams Stakeholders
  • 8. So where does Data Mesh fit? 8 State of Play Investment & Time Value 4 5 Early Interest Central Nervous System Mission critical, but disparate LOBs Identify a project Mission-critical, connected LOBs Projects Platform Single solution Scalable pipeline Pub/Sub Clusters of reuse, real-time analytics Platform effect: reuse of data, efficiencies of scale Enterprise Data-as-a Product. Event-driven architecture 3 2 1 This is used throughout large enterprise in production Confluent maturity model for event driven architecture https://www.confluent.io/resources/5-stages-streaming-platform-adoption/
  • 9. Data Mesh should not boil the ocean 9
  • 11. CNS Patterns applied to Data Mesh 1. Events and storage comprise data platform fundamentals required to build almost anything 2. Events and storage already exist and that won’t change 3. Organise data resources hierarchically 4. Enforcing a domain model is need to control complexity and cope with scale 5. Orgs have a data quality function 6. Orgs have a data catalogue function 7. Registry models are required to model data (and support data evolution etc) 8. Most organisations suffer from inconsistent mechanisms for data-as-an-api 11
  • 12. 12 Spec Mesh Supporting the pillars of Data Mesh Features Domain ownership Self-serve Data as a Product Federated Computational Governance Spec driven (async-api spec) ✔ ✔ SDLC plugins (unit-test, integration-test) ✔ SDLC plugins (provision - terraform) ✔ ✔ ✔ 3rd party data catalogue ✔ ✔ ✔
  • 13. Big Data London is our data product Spec Mesh 13
  • 14. 14 Data Model Spec Mesh / london / borough / venue / event / retail / / transport / / accommodation / / services / / london / hammersmith / olympia / bigdatalondon / public / attendee / london / hammersmith / transport / public / tube / london / heathrow / transport / public / airport / london / hammersmith / olympia / bigdatalondon / vendor / terra / public / visitor / london / hammersmith / olympia / bigdatalondon / retailer / subway / public / purchase
  • 15. asyncapi: '2.4.0' id: 'urn:london:hammersmith:olympia:bigdatalondon' Info: title: BigDataLondon API version: '1.0.0' description: Simple model of BigDataLondon as a Data Product Servers: test: url: test.mykafkacluster.org:8092 protocol: kafka-secure channels: public/attendee: Publish: summary: Humans arriving Message: name: Human "tags": [ "name": "human", "name": "big data london"] Payload: type: object Properties: Id: type: integer minimum: 0 description: Id of the human Age: type: integer 15 Spec Mesh
  • 16. 16 Spec Mesh Data Mesh as Code: Spec Mesh Capture Provision Serve >> >>
  • 17. 17 Spec Mesh Capture ● Create repo from git template ● Async-api spec ● Apply the domain model/structure ● Specify data event streams & storage ● Model the data entities (schemas) ● Apply tags to data entities ● Permission data as public/protected/private ● Write tests and build your product Provision Spec Data Catalogue Spec Mesh Capture Provision Serve What is Spec Mesh?
  • 18. Provision 18 What is Spec Mesh? Spec Mesh Provision Spec Data Catalogue Spec Mesh Capture Provision Serve ● Pipeline driven provisioning of data product based on Specification ● Includes domain structure, governance and permissions ● Data catalogue automatically updated (including tags) ● Data schema’s published to registry
  • 19. 19 What is Spec Mesh? Spec Mesh Provision Spec Data Catalogue Spec Mesh Capture Provision Serve Public Private Restricted Serve ● Governance integrated via underlying Data Platform. I.e. Kafka ACLs ● Data catalogue supports discovery (via tags), and access requests ● Data resources are structured using public, private and protected scope
  • 20. 20 Spec Mesh conceptual Spec Mesh Data Catalogue Data Product A Spec Data Product B Spec Serve Public Private Restricted
  • 22. 22 Why use Spec Mesh? Developer Tooling Developer focused tooling:. This is not some platform Abstract away complexities: Leverage layers of abstraction to create repeatable patterns Unit testing specifications: Java based tooling supports development lifecycle Tooling support flexibility: Using configurable guardrails Increased extensibility: Interoperate with existing Data Management tooling and modules Icons: www.streamlinehq.com
  • 23. 23 What do you get out of the box? Developer Tooling Spec Specification as Code [ ] Data Provider Provisioning Workflow Data Mesh Data Product A Data Product B Data Product Y Data Product Z Build Apply JSON Data Product Owner
  • 25. Add title Beyond Community driven Help us shape the roadmap by suggesting and contributing to features! Q1 - 2023 Extensions ● More Kafka extensions supported (incl provisioning support): Quotas, Storage & ACLs for Public, Private, Protected resources (basic governance) Data Catalogue ● Initial integration to existing products to support discovery and basic governance Storage ● Storage resource mapping Governance ● Delegation to a Governance API where protected topics are specified and requested Proof of Value - Q4 22 Developer Experience ● Local testing support (Java & Python) - Test containers via gradle plugin ● SpecMesh gradle plugin to use HCL Terraform & state mgmt Domain model ● Domain modelling built into Spec.Id field Build Pipeline ● Executions pushes the APISpec to apply orthogonal resource allocation Governance ● Manual governance of ‘public’ resources through Credential creation (API Keys + ACLs) Extensions ● Kafka support for Async API Q2 - 2023 Data As A Product ● Data Product Open Registry; simple Data Catalogue with Discovery and tagging (Spec resources contain tags - show example) Observability ● Topology view (derived from specs showing product/consume data flows and ownership visualisation) Bolt-ons ● Multi-region ● Cross resource ● SMT ● DQ integration + other simple additions 25 Developer Tooling Roadmap