12. Azure is the only public cloud to offer Apache
Kafka as a managed service
Can be provisioned directly from the Azure Portal
Apache Kafka is one of the HDInsight cluster types
Clusters can be scaled within minutes
99.9 percent SLA
No additional charge for running Kafka clusters
Out-of-box management using Azure Monitor
Logs
Apache Kafka on HDInsight
A open-source, scalable, stream ingestion platform offered as a managed service on Azure HDInsight
14. Kafka for Azure HDInsight
• Managed Kafka clusters with 99.9% service level
SLA
• Native integration with Azure Managed Disks.
Allows for exponentially lower costs, and higher
scale.
• Scalable On Demand clusters - Kafka clusters
with 16 TB/node and Zookeeper up and running
in 15 minutes
• Rack awareness for Kafka on the Azure cloud
• Alerting and predictive cluster maintenance
through Azure Monitor Logs
• Extensibility via one click deploy of leading ISVs
such as StreamSets
• Disaster recovery support via MirrorMaker
• Deploy End to End streaming pipelines with
Storm, Spark, Storage via automated ARM
templates in the same VNET.
15. Kafka is a distributed, horizontally-scalable, fault-tolerant pub-sub store
Broker 1
Producer 1
IoT Hub
Storm
Spark
Streaming
1
2
3
ZK 1 ZK 2 ZK 3
Broker 2
Broker 3
3
1
2
Topic 1
Topic 2 Topic 1
Topic 2
Topic 2
Topic 1
Data Ingestion using Kafka on HDInsight
16. 4 5
Setup the broker
configuration
Publish the
message
The consumer
reads the messages
Kafka: Producers and Consumers
18. Azure
Gateway
Services
Open source Stream Processing on Azure HDInsight
Real-time applications
Long term storage
Real-time dashboards
IoT Hubs
Azure VNet Boundary
Connected Car Architecture Powered by HDInsight
28. Event Hubs in the real world:
Halo 5
80 million requests per minute
within 24 hours of release
All game telemetry and statistics
run through Azure Event Hubs,
processed, and sent back to console
1 Dedicated Capacity cluster (3 CUs)
Zero administration by Halo team