SlideShare a Scribd company logo
1 of 43
Download to read offline
Stream Me Up, Scotty
Transitioning to the cloud
using a streaming data platform
About us
Chrix Finne, Director of Product Management Bob Lehmann, Architect, Data Platform
We’ll Talk About
● Everyone is moving to the cloud
● Common challenges
● Transition design patterns
● How streaming platform fits in
● Monsanto’s data challenges
● Monsanto’s streaming architecture / use cases
● Security
● Lessons learned
● Future plans
How Amazon
thinks you will
move to cloud
You have a lot to worry about
❏ Do I migrate to cloud-native databases?
❏ Will my apps survive random cloud failures?
❏ Do I need a failure injection test framework?
❏ Dynamic configuration of hostnames
❏ Do I break my monolith into microservices?
❏ Do I want to migrate to more than one cloud?
❏ Can I migrate back?
At first,
this is no
big deal….
App
App
AppApp
DWH
DB
KV
App
DB
DC1 AWS
6 month
later...
DC1 AWS
DB
APP
APP
APP
APP
APP
APP
APP
APP
DB
DB
DWH
KV
KV
KV
DWH
Are you
kidding?
● This is expensive
● This is a maintenance
nightmare
● We may need more than
one region!
● We may need more than
one cloud!
We’ve done this before...
This... To this...
There is a
better way
Bridge to Cloud Pattern
Key Components
1. Distributed Log
2. Big ReLiable Buffer
3. Great Replication
4. Connect Anywhere
5. Management & Monitoring
Deeper look at bi-directional replication
Why is this
awesome?
● Proven architecture
● Non-stop low-latency
● One throat to choke
● Cost savings
But wait -
there is
more!
● Future-proof architecture
○ With Connect - get the data everywhere.
○ Try cloud services without fear of lock-in
○ “Kafka is our escape valve”
● Multi-zone availability
● Multi-Region architecture
● Multi-Cloud architectures
● Microservices ready
● Jump to stream processing!
Let’s look at how this worked for...
Monsanto
A sustainable agriculture company
• Bringing a broad range of solutions to help nourish our growing
world
• Headquartered in Saint Louis, Missouri
• >20,000 employees in 66 countries
• A global company with >50% employees based outside of the
United States
• One of the 25 World’s Best Multinational Workplaces by Great
Place to Work Institute
Produce with more
judicious use
of limited natural
resources.
improve the
lives of the
world’s
farmers.
Increase
production
to meet
needs of
a growing
population.
“We succeed when
farmers succeed.”
-Hugh Grant, Monsanto
CEO
Inbred
(Parent 1)
Inbred
(Parent 2)
Hybrid
The Monsanto Corn “Galaxy”
To Boldly Go Where No Man (or
Woman) Has Gone Before...
CAPTAIN’S LOG STARTDATE 41153.7 (early 2015)
Mission: Develop an enterprise information architecture
covering BI to machine learning and everything in between.
Maximize use of cloud based assets.
In What Parallel
Universe Are You
Living?
Existing Challenges
● Data sprawl
● Data consistency
● Data discovery
● Data latency
The Cloud - New Challenges
● Increased data sprawl
(microservices’ dirty little secret)
● Can’t forklift applications overnight
● Cloud apps need on-prem data
● On-prem apps need cloud data
The “Aha” Moment
The Log: What every software engineer should know
about real-time data's unifying abstraction
LinkedIn Engineering Blog post by Jay Kreps
Dec. 6th, 2013
Let’s Clean Up This Mess!
Courtesy of Jay Kreps
Replication Is Your Friend
● Uncontrolled, unsynchronized
replication IS BAD
● Controlled, synchronized
replication IS NECESSARY
AND BENEFICIAL...especially
in distributed environments
The Enterprise DataHub
EDH Plan
- Create Kafka clusters on prem and in AWS
- Establish cross datacenter connection
- Provide replication between the clusters
- Use AVRO schemas
- Only alow Apps To interact with the local
Cluster
IMPORTANT!
Use Schema Registry
VPN?
DIRECT CONNECT?
MIRRORMAKER
Use Cases And
Architecture Evolution
Initial POC - Operations Metrics Collection
Use Case - Customer 360
Use Case - Replication Between Dissimilar Databases
Use Case - Data Warehouse Replication
Use Case - Data Warehouse Migration
Turn this feed off
once cloud solution
is complete
Drives adoption of the platform despite these
common “concerns”:
● Our data will never be used by anybody else
● We don’t need our data in near real time
● Our volumes are too low
● Our volumes are too high
● Do we really have to use schemas?
Cross Datacenter Replication - The Killer App!
Enterprise
DataHub
Simplified
AWS
Architecture
Security
Native Clients
❏ SSL certs used for authentication AND
authorization
❏ Principal is defined by the cert DN
❏ ACLs are defined based on DN
CN=YourProject, OU=YourOrg, O=YourCompany, L=YourLocation, ST=YourState, C=YourCountry
CN and OU are the only attributes
defined by clients
Security
REST Clients
❏ API endpoint defined in Network Director for each
topic
❏ REST client authenticates with SSO platform and
then calls API endpoint
❏ Network Director calls REST Proxy as privileged
user
❏ Access to topics is controlled by restricting access
to API endpoint
Security Architecture
Security Notes
● Some Kafka tools don’t work with SSL and ACLs
● SSL client certs have to be added to truststore on each
broker - requires brokers to be bounced. Patched
broker code to load certs dynamically.
● Long term roadmap
○ LDAP authentication and role based authorization
for native clients.
○ REST Proxy authorization based on OAuth
entitlements
Lessons Learned
❏ Partition By Default
❏ Use “new” consumer - required for SSL anyway
❏ Lock down Zookeeper
❏ Be prepared for AWS instance termination!
❏ Use Associated VPCs in AWS
❏ Use LVM For disks
Future - Make Ingress/Egress Part Of The
Platform
Future - Multi-Cloud Integration
THE END

More Related Content

What's hot

What's hot (20)

Common Patterns of Multi Data-Center Architectures with Apache Kafka
Common Patterns of Multi Data-Center Architectures with Apache KafkaCommon Patterns of Multi Data-Center Architectures with Apache Kafka
Common Patterns of Multi Data-Center Architectures with Apache Kafka
 
Streaming in Practice - Putting Apache Kafka in Production
Streaming in Practice - Putting Apache Kafka in ProductionStreaming in Practice - Putting Apache Kafka in Production
Streaming in Practice - Putting Apache Kafka in Production
 
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...
 
Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails?
Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails? Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails?
Kafka Summit NYC 2017 - Apache Kafka in the Enterprise: What if it Fails?
 
Event Sourcing, Stream Processing and Serverless (Benjamin Stopford, Confluen...
Event Sourcing, Stream Processing and Serverless (Benjamin Stopford, Confluen...Event Sourcing, Stream Processing and Serverless (Benjamin Stopford, Confluen...
Event Sourcing, Stream Processing and Serverless (Benjamin Stopford, Confluen...
 
Kafka at scale facebook israel
Kafka at scale   facebook israelKafka at scale   facebook israel
Kafka at scale facebook israel
 
Metrics Are Not Enough: Monitoring Apache Kafka and Streaming Applications
Metrics Are Not Enough: Monitoring Apache Kafka and Streaming ApplicationsMetrics Are Not Enough: Monitoring Apache Kafka and Streaming Applications
Metrics Are Not Enough: Monitoring Apache Kafka and Streaming Applications
 
Deploying Confluent Platform for Production
Deploying Confluent Platform for ProductionDeploying Confluent Platform for Production
Deploying Confluent Platform for Production
 
Maximizing Audience Engagement in Media Delivery (MED303) | AWS re:Invent 2013
Maximizing Audience Engagement in Media Delivery (MED303) | AWS re:Invent 2013Maximizing Audience Engagement in Media Delivery (MED303) | AWS re:Invent 2013
Maximizing Audience Engagement in Media Delivery (MED303) | AWS re:Invent 2013
 
Kafka Summit NYC 2017 - Cloud Native Data Streaming Microservices with Spring...
Kafka Summit NYC 2017 - Cloud Native Data Streaming Microservices with Spring...Kafka Summit NYC 2017 - Cloud Native Data Streaming Microservices with Spring...
Kafka Summit NYC 2017 - Cloud Native Data Streaming Microservices with Spring...
 
101 ways to configure kafka - badly (Kafka Summit)
101 ways to configure kafka - badly (Kafka Summit)101 ways to configure kafka - badly (Kafka Summit)
101 ways to configure kafka - badly (Kafka Summit)
 
kafka for db as postgres
kafka for db as postgreskafka for db as postgres
kafka for db as postgres
 
Exactly-once Semantics in Apache Kafka
Exactly-once Semantics in Apache KafkaExactly-once Semantics in Apache Kafka
Exactly-once Semantics in Apache Kafka
 
Ingesting Healthcare Data, Micah Whitacre
Ingesting Healthcare Data, Micah WhitacreIngesting Healthcare Data, Micah Whitacre
Ingesting Healthcare Data, Micah Whitacre
 
Building Stream Infrastructure across Multiple Data Centers with Apache Kafka
Building Stream Infrastructure across Multiple Data Centers with Apache KafkaBuilding Stream Infrastructure across Multiple Data Centers with Apache Kafka
Building Stream Infrastructure across Multiple Data Centers with Apache Kafka
 
Running Kafka for Maximum Pain
Running Kafka for Maximum PainRunning Kafka for Maximum Pain
Running Kafka for Maximum Pain
 
Data Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDBData Streaming with Apache Kafka & MongoDB
Data Streaming with Apache Kafka & MongoDB
 
Building Scalable and Extendable Data Pipeline for Call of Duty Games (Yarosl...
Building Scalable and Extendable Data Pipeline for Call of Duty Games (Yarosl...Building Scalable and Extendable Data Pipeline for Call of Duty Games (Yarosl...
Building Scalable and Extendable Data Pipeline for Call of Duty Games (Yarosl...
 
Reducing Microservice Complexity with Kafka and Reactive Streams
Reducing Microservice Complexity with Kafka and Reactive StreamsReducing Microservice Complexity with Kafka and Reactive Streams
Reducing Microservice Complexity with Kafka and Reactive Streams
 
How to over-engineer things and have fun? | Oto Brglez, OPALAB
How to over-engineer things and have fun? | Oto Brglez, OPALABHow to over-engineer things and have fun? | Oto Brglez, OPALAB
How to over-engineer things and have fun? | Oto Brglez, OPALAB
 

Similar to Stream Me Up, Scotty: Transitioning to the Cloud Using a Streaming Data Platform

Ultimate hybrid cloud
Ultimate hybrid cloudUltimate hybrid cloud
Ultimate hybrid cloud
Mirantis
 
From Monoliths to Microservices - A Journey With Confluent With Gayathri Veal...
From Monoliths to Microservices - A Journey With Confluent With Gayathri Veal...From Monoliths to Microservices - A Journey With Confluent With Gayathri Veal...
From Monoliths to Microservices - A Journey With Confluent With Gayathri Veal...
HostedbyConfluent
 

Similar to Stream Me Up, Scotty: Transitioning to the Cloud Using a Streaming Data Platform (20)

Designing a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for DummiesDesigning a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for Dummies
 
Big Data on Cloud Native Platform
Big Data on Cloud Native PlatformBig Data on Cloud Native Platform
Big Data on Cloud Native Platform
 
Big Data on Cloud Native Platform
Big Data on Cloud Native PlatformBig Data on Cloud Native Platform
Big Data on Cloud Native Platform
 
The Cloud Imperative – What, Why, When and How
The Cloud Imperative – What, Why, When and HowThe Cloud Imperative – What, Why, When and How
The Cloud Imperative – What, Why, When and How
 
Ultimate hybrid cloud: World Wide Cloud
Ultimate hybrid cloud: World Wide CloudUltimate hybrid cloud: World Wide Cloud
Ultimate hybrid cloud: World Wide Cloud
 
Ultimate hybrid cloud
Ultimate hybrid cloudUltimate hybrid cloud
Ultimate hybrid cloud
 
Containerizing couchbase with microservice architecture on mesosphere.pptx
Containerizing couchbase with microservice architecture on mesosphere.pptxContainerizing couchbase with microservice architecture on mesosphere.pptx
Containerizing couchbase with microservice architecture on mesosphere.pptx
 
From Monoliths to Microservices - A Journey With Confluent With Gayathri Veal...
From Monoliths to Microservices - A Journey With Confluent With Gayathri Veal...From Monoliths to Microservices - A Journey With Confluent With Gayathri Veal...
From Monoliths to Microservices - A Journey With Confluent With Gayathri Veal...
 
AWS Sydney Summit 2013 - Big Data Analytics
AWS Sydney Summit 2013 - Big Data AnalyticsAWS Sydney Summit 2013 - Big Data Analytics
AWS Sydney Summit 2013 - Big Data Analytics
 
Webinar: Enterprise Cloud Migration - 4 Problems to Solve
Webinar: Enterprise Cloud Migration - 4 Problems to SolveWebinar: Enterprise Cloud Migration - 4 Problems to Solve
Webinar: Enterprise Cloud Migration - 4 Problems to Solve
 
Technology insights: Decision Science Platform
Technology insights: Decision Science PlatformTechnology insights: Decision Science Platform
Technology insights: Decision Science Platform
 
Reblaze Case Study on GCP
Reblaze Case Study on GCPReblaze Case Study on GCP
Reblaze Case Study on GCP
 
AWS re:Invent 2016: From Dial-Up to DevOps - AOL’s Migration to the Cloud (DE...
AWS re:Invent 2016: From Dial-Up to DevOps - AOL’s Migration to the Cloud (DE...AWS re:Invent 2016: From Dial-Up to DevOps - AOL’s Migration to the Cloud (DE...
AWS re:Invent 2016: From Dial-Up to DevOps - AOL’s Migration to the Cloud (DE...
 
Dori Exterman, Considerations for choosing the parallel computing strategy th...
Dori Exterman, Considerations for choosing the parallel computing strategy th...Dori Exterman, Considerations for choosing the parallel computing strategy th...
Dori Exterman, Considerations for choosing the parallel computing strategy th...
 
Running OpenStack in Production
Running OpenStack in ProductionRunning OpenStack in Production
Running OpenStack in Production
 
Reactive Cloud Security | AWS Public Sector Summit 2016
Reactive Cloud Security | AWS Public Sector Summit 2016Reactive Cloud Security | AWS Public Sector Summit 2016
Reactive Cloud Security | AWS Public Sector Summit 2016
 
Cloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native appsCloud-Native Data: What data questions to ask when building cloud-native apps
Cloud-Native Data: What data questions to ask when building cloud-native apps
 
How to grow to a modern workplace in 16 steps with microsoft 365
How to grow to a modern workplace in 16 steps with microsoft 365How to grow to a modern workplace in 16 steps with microsoft 365
How to grow to a modern workplace in 16 steps with microsoft 365
 
Understanding cloud with Google Cloud Platform
Understanding cloud with Google Cloud PlatformUnderstanding cloud with Google Cloud Platform
Understanding cloud with Google Cloud Platform
 
AWS Summit Kuala Lumpur Keynote with Stephen Orban - Head of Enterprise Strategy
AWS Summit Kuala Lumpur Keynote with Stephen Orban - Head of Enterprise StrategyAWS Summit Kuala Lumpur Keynote with Stephen Orban - Head of Enterprise Strategy
AWS Summit Kuala Lumpur Keynote with Stephen Orban - Head of Enterprise Strategy
 

More from confluent

More from confluent (20)

Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Santander Stream Processing with Apache Flink
Santander Stream Processing with Apache FlinkSantander Stream Processing with Apache Flink
Santander Stream Processing with Apache Flink
 
Unlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insightsUnlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insights
 
Workshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con FlinkWorkshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con Flink
 
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
 
AWS Immersion Day Mapfre - Confluent
AWS Immersion Day Mapfre   -   ConfluentAWS Immersion Day Mapfre   -   Confluent
AWS Immersion Day Mapfre - Confluent
 
Eventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalkEventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalk
 
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent CloudQ&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
 
Citi TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep DiveCiti TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep Dive
 
Build real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with ConfluentBuild real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with Confluent
 
Q&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service MeshQ&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service Mesh
 
Citi Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka MicroservicesCiti Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka Microservices
 
Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3
 
Citi Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging ModernizationCiti Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging Modernization
 
Citi Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time dataCiti Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time data
 
Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2
 
Data In Motion Paris 2023
Data In Motion Paris 2023Data In Motion Paris 2023
Data In Motion Paris 2023
 
Confluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with SynthesisConfluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with Synthesis
 
The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023
 
The Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data StreamsThe Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data Streams
 

Recently uploaded

+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 

Recently uploaded (20)

+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
ManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide DeckManageIQ - Sprint 236 Review - Slide Deck
ManageIQ - Sprint 236 Review - Slide Deck
 
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
The Guide to Integrating Generative AI into Unified Continuous Testing Platfo...
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 

Stream Me Up, Scotty: Transitioning to the Cloud Using a Streaming Data Platform

  • 1. Stream Me Up, Scotty Transitioning to the cloud using a streaming data platform
  • 2. About us Chrix Finne, Director of Product Management Bob Lehmann, Architect, Data Platform
  • 3. We’ll Talk About ● Everyone is moving to the cloud ● Common challenges ● Transition design patterns ● How streaming platform fits in ● Monsanto’s data challenges ● Monsanto’s streaming architecture / use cases ● Security ● Lessons learned ● Future plans
  • 4. How Amazon thinks you will move to cloud
  • 5. You have a lot to worry about ❏ Do I migrate to cloud-native databases? ❏ Will my apps survive random cloud failures? ❏ Do I need a failure injection test framework? ❏ Dynamic configuration of hostnames ❏ Do I break my monolith into microservices? ❏ Do I want to migrate to more than one cloud? ❏ Can I migrate back?
  • 6. At first, this is no big deal…. App App AppApp DWH DB KV App DB DC1 AWS
  • 8. Are you kidding? ● This is expensive ● This is a maintenance nightmare ● We may need more than one region! ● We may need more than one cloud!
  • 9. We’ve done this before... This... To this...
  • 11. Bridge to Cloud Pattern
  • 12. Key Components 1. Distributed Log 2. Big ReLiable Buffer 3. Great Replication 4. Connect Anywhere 5. Management & Monitoring
  • 13. Deeper look at bi-directional replication
  • 14. Why is this awesome? ● Proven architecture ● Non-stop low-latency ● One throat to choke ● Cost savings
  • 15. But wait - there is more! ● Future-proof architecture ○ With Connect - get the data everywhere. ○ Try cloud services without fear of lock-in ○ “Kafka is our escape valve” ● Multi-zone availability ● Multi-Region architecture ● Multi-Cloud architectures ● Microservices ready ● Jump to stream processing!
  • 16. Let’s look at how this worked for...
  • 17. Monsanto A sustainable agriculture company • Bringing a broad range of solutions to help nourish our growing world • Headquartered in Saint Louis, Missouri • >20,000 employees in 66 countries • A global company with >50% employees based outside of the United States • One of the 25 World’s Best Multinational Workplaces by Great Place to Work Institute Produce with more judicious use of limited natural resources. improve the lives of the world’s farmers. Increase production to meet needs of a growing population. “We succeed when farmers succeed.” -Hugh Grant, Monsanto CEO
  • 19. The Monsanto Corn “Galaxy”
  • 20. To Boldly Go Where No Man (or Woman) Has Gone Before... CAPTAIN’S LOG STARTDATE 41153.7 (early 2015) Mission: Develop an enterprise information architecture covering BI to machine learning and everything in between. Maximize use of cloud based assets.
  • 21. In What Parallel Universe Are You Living?
  • 22. Existing Challenges ● Data sprawl ● Data consistency ● Data discovery ● Data latency
  • 23. The Cloud - New Challenges ● Increased data sprawl (microservices’ dirty little secret) ● Can’t forklift applications overnight ● Cloud apps need on-prem data ● On-prem apps need cloud data
  • 24. The “Aha” Moment The Log: What every software engineer should know about real-time data's unifying abstraction LinkedIn Engineering Blog post by Jay Kreps Dec. 6th, 2013
  • 25. Let’s Clean Up This Mess! Courtesy of Jay Kreps
  • 26. Replication Is Your Friend ● Uncontrolled, unsynchronized replication IS BAD ● Controlled, synchronized replication IS NECESSARY AND BENEFICIAL...especially in distributed environments
  • 27. The Enterprise DataHub EDH Plan - Create Kafka clusters on prem and in AWS - Establish cross datacenter connection - Provide replication between the clusters - Use AVRO schemas - Only alow Apps To interact with the local Cluster IMPORTANT! Use Schema Registry VPN? DIRECT CONNECT? MIRRORMAKER
  • 29. Initial POC - Operations Metrics Collection
  • 30. Use Case - Customer 360
  • 31. Use Case - Replication Between Dissimilar Databases
  • 32. Use Case - Data Warehouse Replication
  • 33. Use Case - Data Warehouse Migration Turn this feed off once cloud solution is complete
  • 34. Drives adoption of the platform despite these common “concerns”: ● Our data will never be used by anybody else ● We don’t need our data in near real time ● Our volumes are too low ● Our volumes are too high ● Do we really have to use schemas? Cross Datacenter Replication - The Killer App!
  • 36. Security Native Clients ❏ SSL certs used for authentication AND authorization ❏ Principal is defined by the cert DN ❏ ACLs are defined based on DN CN=YourProject, OU=YourOrg, O=YourCompany, L=YourLocation, ST=YourState, C=YourCountry CN and OU are the only attributes defined by clients
  • 37. Security REST Clients ❏ API endpoint defined in Network Director for each topic ❏ REST client authenticates with SSO platform and then calls API endpoint ❏ Network Director calls REST Proxy as privileged user ❏ Access to topics is controlled by restricting access to API endpoint
  • 39. Security Notes ● Some Kafka tools don’t work with SSL and ACLs ● SSL client certs have to be added to truststore on each broker - requires brokers to be bounced. Patched broker code to load certs dynamically. ● Long term roadmap ○ LDAP authentication and role based authorization for native clients. ○ REST Proxy authorization based on OAuth entitlements
  • 40. Lessons Learned ❏ Partition By Default ❏ Use “new” consumer - required for SSL anyway ❏ Lock down Zookeeper ❏ Be prepared for AWS instance termination! ❏ Use Associated VPCs in AWS ❏ Use LVM For disks
  • 41. Future - Make Ingress/Egress Part Of The Platform
  • 42. Future - Multi-Cloud Integration