Walmart.com generates millions of events per second. At WalmartLabs, I work on a team called the Customer Backbone (CBB), where we wanted to move to a platform capable of processing this event volume in real time and storing the state/knowledge of potentially every Walmart customer generated by that processing. Kafka Streams' event-driven architecture seemed like the obvious choice. However, Walmart's scale poses a few challenges:

• The clusters need to be large, with all the operational problems that entails.
• Infinite retention on changelog topics wastes valuable disk space.
• Standby task recovery after a node failure is slow (changelog topics hold GBs of data).
• Kafka Streams has no support for repartitioning.

As part of our event-driven development, and to address the challenges above, I'm going to talk about some bold new ideas we developed as features/patches to Kafka Streams to deal with the scale required at Walmart:

• Cold Bootstrap: When a Kafka Streams node fails, instead of recovering from the changelog topic, we bootstrap the standby directly from the active's RocksDB instance using JSch, with careful offset management guaranteeing zero event loss.
• Dynamic Repartitioning: We added support for repartitioning in Kafka Streams, redistributing state among the new partitions. We can now elastically scale to any number of partitions and any number of nodes.
• Cloud/Rack/AZ-aware task assignment: Active and standby tasks for the same partition are never assigned to the same rack.
• Decreased Partition Assignment Size: With a cluster as large as ours (>400 nodes and 3 stream threads per node), the partition assignment for the Kafka Streams cluster grows to a few hundred MBs, so rebalances take a long time to settle.

Key Takeaways:
• A basic understanding of Kafka Streams.
• Productionizing Kafka Streams at scale.
• Using Kafka Streams as a distributed NoSQL DB.
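To make the rack/AZ-aware placement rule concrete, here is a minimal sketch of the idea. This is not the actual patch to Kafka Streams' task assignor (which is Java and considerably more involved); it is a hypothetical Python model showing only the core invariant: a partition's standby task never lands in the same rack/AZ as its active task. The function name `assign_tasks` and the round-robin placement are assumptions for illustration.

```python
# Hypothetical sketch of rack/AZ-aware assignment (not the real Kafka Streams patch).
# nodes: list of (node_id, rack) pairs; partitions: iterable of partition ids.
def assign_tasks(partitions, nodes):
    """Return {partition: (active_node, standby_node)} such that the standby
    is placed in a different rack/AZ than the active whenever possible."""
    assignment = {}
    for i, partition in enumerate(partitions):
        # Round-robin the active task across all nodes.
        active_id, active_rack = nodes[i % len(nodes)]
        # Prefer standby candidates in a *different* rack than the active.
        candidates = [n for n, rack in nodes if rack != active_rack]
        if not candidates:
            # Degenerate single-rack cluster: fall back to any other node.
            candidates = [n for n, _ in nodes if n != active_id]
        standby_id = candidates[i % len(candidates)]
        assignment[partition] = (active_id, standby_id)
    return assignment
```

With two AZs, e.g. `nodes = [("n1", "az1"), ("n2", "az2"), ("n3", "az1"), ("n4", "az2")]`, every partition's active and standby end up in different AZs, so losing an entire rack/AZ still leaves one full copy of the state available for cold bootstrap.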