SlideShare una empresa de Scribd logo
1 de 25
Descargar para leer sin conexión
Cisco's eCommerce Transformation
using Kafka
Presented By:
Dharmesh Panchmatia (Sr. Director – Cisco Systems)
Gaurav Goyal (Principal Architect – Cisco Systems)
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Agenda
Kafka Architecture2
1 Kafka Use Cases
Kafka Monitoring3
Lessons Learnt4
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Orders booked
$50+B
138 Countries
63 Device types 16 Browsers
185KUsers16 Languages
6M Hits/day
6.9 M Estimates 5.3 M Quotes 1.9 M Orders 85.6% Orders
Orders
Autobook
Portal 71% B2B 29%
Cisco Commerce By The Numbers
© 2018 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Ref Data
REFERENCE DATA SOURCE
DMPRD - RDBMS
Logging
Order Capture
DC1 - Tomcat
Order Capture
Transaction
Data
Downstream
Publish
X-Functional
Services (73)
DC3DC2DC1
TRANSACTION DATA STORE
P S S S S
N1 N2 N3 N4 N5
DC2 - Tomcat
1 2
3
4
Addresses Items
Preferences
Roles
Contacts Logging
DC1 & DC2
Commerce – Cloud Native
Kafka Use Cases
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Kafka – Use Cases
Data push to
downstreams
1. Avoid point to point
integration.
2. Avoid direct
connection to
transactional DB.
Elastic Search Data
Push
1. Reduce load on
transactional DB
2. Eliminates ES out of
sync in multi-DCs
Machine Learning Use
Cases using Spark
1. Recommendation
Engine
2. Most Popular
Configurations
3. Most popular products
for a given category
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Customer who bought X also bought Y.
Identify products which are
mostly bought together so we can create
bundles or promotions accordingly.
1
2
Algorithm: Apriori
ML Use Case
1. Recommendation Engine
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Provide visibility to most popular
configurations for a given product.
Provide visibility to a configuration which
Customer has recently bought for the
given product.
1
2
Allow selection of pre-configured products
instead of starting from scratch.
ML Use Case
2. Popular Product Configuration
Kafka Architecture
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Producer (Capture Order) Producer (Return Order)
Broker 1 Broker 2 Broker 3 Broker 4
ZK - 1 ZK - 2 ZK - 3 ZK - 4 ZK - 5
Consumer (Smart- SW SC)
Kafka Cluster
Zookeeper
DC1 DC2
DC1 DC2 DC3
DC1 – RCDN; DC2 – ALLEN; DC3 – RTP
Coordinates cluster membership
Commit Offset (v 0.10.x.x)
Kafka Architecture
Consumer (EDW)
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
RDBMS
ProducerCustom Code (DC1) Custom Code (DC2)FAULT
TOLERENT
Kafka
DC1 and DC2
Consumer Group - DC1 Consumer Group – DC2
Elastic Search – DC1 Elastic Search – DC2
Kafka Architecture – Elastic Search
Transaction Data
Order,
Estimate
Quote
RDBMS
Reference Data
Click Stream
Data
Data Visualization
Dynamic Querying
Data Science
Primary Analytics Data Store
MQL
Transaction Data
RDBMS
Subscriptions
Invoices
Kafka Architecture – ML & Analytics Use Case
Kafka Monitoring
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Monitoring: Kafka Manager and Kafdrop
Kafka Manager Kafdrop
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Kafka – Custom Scripts
1. Cron job to check Kafka
processes every minute. Restart
Kafka process
(and send email) in case it’s not
running.
2. Always take back up of logs
systematically when Kafka processes
are getting restarted.
3. Have a test topic and push test
message every minute. Trigger a
notification in case of failures.
Best Practices / Lessons Learnt
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Best
Practices
1
Have a mechanism to reset Kafka offsets on
demand.
4 Auto Re-push mechanism in case producer
gets error while pushing data into Kafka
2
Have a mechanism to re-push data to
Kafka topic,
3 Enable SSL for secure access
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
UI – Reset Offsets
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
UI – Re-push data
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Kafka Producer & Consumer Setup with SSL
Below properties are required to enable SSL for both Producer and Consumer
If client authentication is not required in
the broker then below configuration is
suffice, (kafka.client.truststore.jks will
be provided by kafka service host.)
1
If client authentication is required in the
broker then below configuration is
required. (kafka.client.keystore.jks will
be provided by kafka service host. )
2
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Auto Re-Push Mechanism
Failure
Source
Data Push
In case of failures
Offline
Scheduler
Failed records
Re - Push
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Lessons
Learnt
1
Have a while loop while subscribing to any
Kafka Topics instead of creating
consumer every time.
4
Data Size - consumer's
max.partition.fetch.bytes should be greater
or equals to the producers
producer.max.request.size Default is 1MB.
2
Always use key if you want all messages
for a particular key (e.g. order id) always
goes to a particular partition.
3
enable.auto.commit - Default is true. It is
better to set it false to get control over
when to commit the offset.
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Lessons
Learnt
5
Have a custom script deployed to monitor &
restart Kafka nodes in case of
any issues.
8
Reset offset: Make sure there is no active
consumer on this topic for that
consumer group.
6
heartbeat.interval.ms must be smaller
than session.timeout.ms.
session.timeout.ms : it controls the time it
takes to detect a consumer crash and
stop sending heartbeats.
heartbeat.interval.ms :The expected time
between heartbeats to the consumer
7 auto.offset.reset -default latest
Questions and Answers
© 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential
Kafka Architecture – ML Use Case
Quote
Stream
Order
Stream

Más contenido relacionado

La actualidad más candente

Kafka as your Data Lake - is it Feasible?
Kafka as your Data Lake - is it Feasible?Kafka as your Data Lake - is it Feasible?
Kafka as your Data Lake - is it Feasible?
Guido Schmutz
 

La actualidad más candente (20)

현대백화점 리테일테크랩과 AWS Prototyping 팀 개발자가 들려주는 인공 지능 무인 스토어 개발 여정 - 최권열 AWS 프로토타이핑...
현대백화점 리테일테크랩과 AWS Prototyping 팀 개발자가 들려주는 인공 지능 무인 스토어 개발 여정 - 최권열 AWS 프로토타이핑...현대백화점 리테일테크랩과 AWS Prototyping 팀 개발자가 들려주는 인공 지능 무인 스토어 개발 여정 - 최권열 AWS 프로토타이핑...
현대백화점 리테일테크랩과 AWS Prototyping 팀 개발자가 들려주는 인공 지능 무인 스토어 개발 여정 - 최권열 AWS 프로토타이핑...
 
Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics Apache Flink, AWS Kinesis, Analytics
Apache Flink, AWS Kinesis, Analytics
 
Introducing Change Data Capture with Debezium
Introducing Change Data Capture with DebeziumIntroducing Change Data Capture with Debezium
Introducing Change Data Capture with Debezium
 
Introducing Confluent Cloud: Apache Kafka as a Service
Introducing Confluent Cloud: Apache Kafka as a Service Introducing Confluent Cloud: Apache Kafka as a Service
Introducing Confluent Cloud: Apache Kafka as a Service
 
Multi-Datacenter Kafka - Strata San Jose 2017
Multi-Datacenter Kafka - Strata San Jose 2017Multi-Datacenter Kafka - Strata San Jose 2017
Multi-Datacenter Kafka - Strata San Jose 2017
 
Stream processing using Kafka
Stream processing using KafkaStream processing using Kafka
Stream processing using Kafka
 
Securing Kafka
Securing Kafka Securing Kafka
Securing Kafka
 
Apache Ranger
Apache RangerApache Ranger
Apache Ranger
 
Exactly-once Stream Processing with Kafka Streams
Exactly-once Stream Processing with Kafka StreamsExactly-once Stream Processing with Kafka Streams
Exactly-once Stream Processing with Kafka Streams
 
Streaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache KafkaStreaming Data and Stream Processing with Apache Kafka
Streaming Data and Stream Processing with Apache Kafka
 
Getting started with Amazon ElastiCache
Getting started with Amazon ElastiCacheGetting started with Amazon ElastiCache
Getting started with Amazon ElastiCache
 
ksqlDB - Stream Processing simplified!
ksqlDB - Stream Processing simplified!ksqlDB - Stream Processing simplified!
ksqlDB - Stream Processing simplified!
 
Kafka Connect - debezium
Kafka Connect - debeziumKafka Connect - debezium
Kafka Connect - debezium
 
Introduction to Apache Kafka
Introduction to Apache KafkaIntroduction to Apache Kafka
Introduction to Apache Kafka
 
Getting Started with Confluent Schema Registry
Getting Started with Confluent Schema RegistryGetting Started with Confluent Schema Registry
Getting Started with Confluent Schema Registry
 
Kafka as your Data Lake - is it Feasible?
Kafka as your Data Lake - is it Feasible?Kafka as your Data Lake - is it Feasible?
Kafka as your Data Lake - is it Feasible?
 
Building a Modern Data Architecture on AWS - Webinar
Building a Modern Data Architecture on AWS - WebinarBuilding a Modern Data Architecture on AWS - Webinar
Building a Modern Data Architecture on AWS - Webinar
 
Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...
Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...
Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...
 
user Behavior Analysis with Session Windows and Apache Kafka's Streams API
user Behavior Analysis with Session Windows and Apache Kafka's Streams APIuser Behavior Analysis with Session Windows and Apache Kafka's Streams API
user Behavior Analysis with Session Windows and Apache Kafka's Streams API
 
SRV403_Serverless Authentication and Authorization
SRV403_Serverless Authentication and AuthorizationSRV403_Serverless Authentication and Authorization
SRV403_Serverless Authentication and Authorization
 

Similar a Cisco’s E-Commerce Transformation Using Kafka

How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
StampedeCon
 
L'azienda è più agile? Tutto merito del Data Center
L'azienda è più agile? Tutto merito del Data Center L'azienda è più agile? Tutto merito del Data Center
L'azienda è più agile? Tutto merito del Data Center
SMAU
 
How Cisco Provides World-Class Technology Conference Experiences Using Automa...
How Cisco Provides World-Class Technology Conference Experiences Using Automa...How Cisco Provides World-Class Technology Conference Experiences Using Automa...
How Cisco Provides World-Class Technology Conference Experiences Using Automa...
InfluxData
 
20120416 tf mms_feedback_slideshare
20120416 tf mms_feedback_slideshare20120416 tf mms_feedback_slideshare
20120416 tf mms_feedback_slideshare
Osamu Takazoe
 

Similar a Cisco’s E-Commerce Transformation Using Kafka (20)

TechWiseTV Workshop: ASR 9000
TechWiseTV Workshop: ASR 9000 TechWiseTV Workshop: ASR 9000
TechWiseTV Workshop: ASR 9000
 
Not Your Mother's Kafka - Deep Dive into Confluent Cloud Infrastructure | Gwe...
Not Your Mother's Kafka - Deep Dive into Confluent Cloud Infrastructure | Gwe...Not Your Mother's Kafka - Deep Dive into Confluent Cloud Infrastructure | Gwe...
Not Your Mother's Kafka - Deep Dive into Confluent Cloud Infrastructure | Gwe...
 
Elastic Cloud Enterprise @ Cisco
Elastic Cloud Enterprise @ CiscoElastic Cloud Enterprise @ Cisco
Elastic Cloud Enterprise @ Cisco
 
StampedeCon 2015 Keynote
StampedeCon 2015 KeynoteStampedeCon 2015 Keynote
StampedeCon 2015 Keynote
 
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
 
Cisco connect montreal 2018 compute v final
Cisco connect montreal 2018   compute v finalCisco connect montreal 2018   compute v final
Cisco connect montreal 2018 compute v final
 
L'azienda è più agile? Tutto merito del Data Center
L'azienda è più agile? Tutto merito del Data Center L'azienda è più agile? Tutto merito del Data Center
L'azienda è più agile? Tutto merito del Data Center
 
Simplifying the secure data center
Simplifying the secure data centerSimplifying the secure data center
Simplifying the secure data center
 
Как развернуть кампусную сеть Cisco за 10 минут? Новые технологии для автомат...
Как развернуть кампусную сеть Cisco за 10 минут? Новые технологии для автомат...Как развернуть кампусную сеть Cisco за 10 минут? Новые технологии для автомат...
Как развернуть кампусную сеть Cisco за 10 минут? Новые технологии для автомат...
 
Big datadc skyfall_preso_v2
Big datadc skyfall_preso_v2Big datadc skyfall_preso_v2
Big datadc skyfall_preso_v2
 
Cisco connect winnipeg 2018 putting firepower into the next generation fire...
Cisco connect winnipeg 2018   putting firepower into the next generation fire...Cisco connect winnipeg 2018   putting firepower into the next generation fire...
Cisco connect winnipeg 2018 putting firepower into the next generation fire...
 
Cisco connect montreal 2018 secure dc
Cisco connect montreal 2018    secure dcCisco connect montreal 2018    secure dc
Cisco connect montreal 2018 secure dc
 
Citi Tech Talk: Hybrid Cloud
Citi Tech Talk: Hybrid CloudCiti Tech Talk: Hybrid Cloud
Citi Tech Talk: Hybrid Cloud
 
Gain Insight and Programmability with Cisco DC Networking
Gain Insight and Programmability with Cisco DC NetworkingGain Insight and Programmability with Cisco DC Networking
Gain Insight and Programmability with Cisco DC Networking
 
Cisco DC Networking: Gain Insight and Programmability with
Cisco DC Networking: Gain Insight and Programmability with Cisco DC Networking: Gain Insight and Programmability with
Cisco DC Networking: Gain Insight and Programmability with
 
Gain Insight and Programmability with Cisco DC Networking
Gain Insight and Programmability with Cisco DC NetworkingGain Insight and Programmability with Cisco DC Networking
Gain Insight and Programmability with Cisco DC Networking
 
Optimizing Performance in Rust for Low-Latency Database Drivers
Optimizing Performance in Rust for Low-Latency Database DriversOptimizing Performance in Rust for Low-Latency Database Drivers
Optimizing Performance in Rust for Low-Latency Database Drivers
 
How Cisco Provides World-Class Technology Conference Experiences Using Automa...
How Cisco Provides World-Class Technology Conference Experiences Using Automa...How Cisco Provides World-Class Technology Conference Experiences Using Automa...
How Cisco Provides World-Class Technology Conference Experiences Using Automa...
 
20120416 tf mms_feedback_slideshare
20120416 tf mms_feedback_slideshare20120416 tf mms_feedback_slideshare
20120416 tf mms_feedback_slideshare
 
New Approaches for Fraud Detection on Apache Kafka and KSQL
New Approaches for Fraud Detection on Apache Kafka and KSQLNew Approaches for Fraud Detection on Apache Kafka and KSQL
New Approaches for Fraud Detection on Apache Kafka and KSQL
 

Más de confluent

Más de confluent (20)

Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
Catch the Wave: SAP Event-Driven and Data Streaming for the Intelligence Ente...
 
Santander Stream Processing with Apache Flink
Santander Stream Processing with Apache FlinkSantander Stream Processing with Apache Flink
Santander Stream Processing with Apache Flink
 
Unlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insightsUnlocking the Power of IoT: A comprehensive approach to real-time insights
Unlocking the Power of IoT: A comprehensive approach to real-time insights
 
Workshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con FlinkWorkshop híbrido: Stream Processing con Flink
Workshop híbrido: Stream Processing con Flink
 
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
Industry 4.0: Building the Unified Namespace with Confluent, HiveMQ and Spark...
 
AWS Immersion Day Mapfre - Confluent
AWS Immersion Day Mapfre   -   ConfluentAWS Immersion Day Mapfre   -   Confluent
AWS Immersion Day Mapfre - Confluent
 
Eventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalkEventos y Microservicios - Santander TechTalk
Eventos y Microservicios - Santander TechTalk
 
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent CloudQ&A with Confluent Experts: Navigating Networking in Confluent Cloud
Q&A with Confluent Experts: Navigating Networking in Confluent Cloud
 
Citi TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep DiveCiti TechTalk Session 2: Kafka Deep Dive
Citi TechTalk Session 2: Kafka Deep Dive
 
Build real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with ConfluentBuild real-time streaming data pipelines to AWS with Confluent
Build real-time streaming data pipelines to AWS with Confluent
 
Q&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service MeshQ&A with Confluent Professional Services: Confluent Service Mesh
Q&A with Confluent Professional Services: Confluent Service Mesh
 
Citi Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka MicroservicesCiti Tech Talk: Event Driven Kafka Microservices
Citi Tech Talk: Event Driven Kafka Microservices
 
Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3Confluent & GSI Webinars series - Session 3
Confluent & GSI Webinars series - Session 3
 
Citi Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging ModernizationCiti Tech Talk: Messaging Modernization
Citi Tech Talk: Messaging Modernization
 
Citi Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time dataCiti Tech Talk: Data Governance for streaming and real time data
Citi Tech Talk: Data Governance for streaming and real time data
 
Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2Confluent & GSI Webinars series: Session 2
Confluent & GSI Webinars series: Session 2
 
Data In Motion Paris 2023
Data In Motion Paris 2023Data In Motion Paris 2023
Data In Motion Paris 2023
 
Confluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with SynthesisConfluent Partner Tech Talk with Synthesis
Confluent Partner Tech Talk with Synthesis
 
The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023The Future of Application Development - API Days - Melbourne 2023
The Future of Application Development - API Days - Melbourne 2023
 
The Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data StreamsThe Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data Streams
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 

Cisco’s E-Commerce Transformation Using Kafka

  • 1. Cisco's eCommerce Transformation using Kafka Presented By: Dharmesh Panchmatia (Sr. Director – Cisco Systems) Gaurav Goyal (Principal Architect – Cisco Systems)
  • 2. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Agenda Kafka Architecture2 1 Kafka Use Cases Kafka Monitoring3 Lessons Learnt4
  • 3. © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Orders booked $50+B 138 Countries 63 Device types 16 Browsers 185KUsers16 Languages 6M Hits/day 6.9 M Estimates 5.3 M Quotes 1.9 M Orders 85.6% Orders Orders Autobook Portal 71% B2B 29% Cisco Commerce By The Numbers
  • 4. © 2018 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Ref Data REFERENCE DATA SOURCE DMPRD - RDBMS Logging Order Capture DC1 - Tomcat Order Capture Transaction Data Downstream Publish X-Functional Services (73) DC3DC2DC1 TRANSACTION DATA STORE P S S S S N1 N2 N3 N4 N5 DC2 - Tomcat 1 2 3 4 Addresses Items Preferences Roles Contacts Logging DC1 & DC2 Commerce – Cloud Native
  • 6. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Kafka – Use Cases Data push to downstreams 1. Avoid point to point integration. 2. Avoid direct connection to transactional DB. Elastic Search Data Push 1. Reduce load on transactional DB 2. Eliminates ES out of sync in multi-DCs Machine Learning Use Cases using Spark 1. Recommendation Engine 2. Most Popular Configurations 3. Most popular products for a given category
  • 7. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Customer who bought X also bought Y. Identify products which are mostly bought together so we can create bundles or promotions accordingly. 1 2 Algorithm: Apriori ML Use Case 1. Recommendation Engine
  • 8. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Provide visibility to most popular configurations for a given product. Provide visibility to a configuration which Customer has recently bought for the given product. 1 2 Allow selection of pre-configured products instead of starting from scratch. ML Use Case 2. Popular Product Configuration
  • 10. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Producer (Capture Order) Producer (Return Order) Broker 1 Broker 2 Broker 3 Broker 4 ZK - 1 ZK - 2 ZK - 3 ZK - 4 ZK - 5 Consumer (Smart- SW SC) Kafka Cluster Zookeeper DC1 DC2 DC1 DC2 DC3 DC1 – RCDN; DC2 – ALLEN; DC3 – RTP Coordinates cluster membership Commit Offset (v 0.10.x.x) Kafka Architecture Consumer (EDW)
  • 11. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential RDBMS ProducerCustom Code (DC1) Custom Code (DC2)FAULT TOLERENT Kafka DC1 and DC2 Consumer Group - DC1 Consumer Group – DC2 Elastic Search – DC1 Elastic Search – DC2 Kafka Architecture – Elastic Search
  • 12. Transaction Data Order, Estimate Quote RDBMS Reference Data Click Stream Data Data Visualization Dynamic Querying Data Science Primary Analytics Data Store MQL Transaction Data RDBMS Subscriptions Invoices Kafka Architecture – ML & Analytics Use Case
  • 14. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Monitoring: Kafka Manager and Kafdrop Kafka Manager Kafdrop
  • 15. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Kafka – Custom Scripts 1. Cron job to check Kafka processes every minute. Restart Kafka process (and send email) in case it’s not running. 2. Always take back up of logs systematically when Kafka processes are getting restarted. 3. Have a test topic and push test message every minute. Trigger a notification in case of failures.
  • 16. Best Practices / Lessons Learnt
  • 17. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Best Practices 1 Have a mechanism to reset Kafka offsets on demand. 4 Auto Re-push mechanism in case producer gets error while pushing data into Kafka 2 Have a mechanism to re-push data to Kafka topic, 3 Enable SSL for secure access
  • 18. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential UI – Reset Offsets
  • 19. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential UI – Re-push data
  • 20. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Kafka Producer & Consumer Setup with SSL Below properties are required to enable SSL for both Producer and Consumer If client authentication is not required in the broker then below configuration is suffice, (kafka.client.truststore.jks will be provided by kafka service host.) 1 If client authentication is required in the broker then below configuration is required. (kafka.client.keystore.jks will be provided by kafka service host. ) 2
  • 21. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Auto Re-Push Mechanism Failure Source Data Push In case of failures Offline Scheduler Failed records Re - Push
  • 22. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Lessons Learnt 1 Have a while loop while subscribing to any Kafka Topics instead of creating consumer every time. 4 Data Size - consumer's max.partition.fetch.bytes should be greater or equals to the producers producer.max.request.size Default is 1MB. 2 Always use key if you want all messages for a particular key (e.g. order id) always goes to a particular partition. 3 enable.auto.commit - Default is true. It is better to set it false to get control over when to commit the offset.
  • 23. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Lessons Learnt 5 Have a custom script deployed to monitor & restart Kafka nodes in case of any issues. 8 Reset offset: Make sure there is no active consumer on this topic for that consumer group. 6 heartbeat.interval.ms must be smaller than session.timeout.ms. session.timeout.ms : it controls the time it takes to detect a consumer crash and stop sending heartbeats. heartbeat.interval.ms :The expected time between heartbeats to the consumer 7 auto.offset.reset -default latest
  • 25. © 2017 Cisco and/or its affiliates. All rights reserved. Cisco Confidential Kafka Architecture – ML Use Case Quote Stream Order Stream