Windows Azure provides you with the capability to scale your applications almost without limit, but how do you achieve this effectively and efficiently? In this session we introduce the patterns and anti-patterns of scalability on the Windows Azure platform, demonstrate how to leverage connected-systems technologies like the Azure AppFabric Service Bus to achieve scale, and walk through an implementation of some of these patterns that shows how to scale your architectures cost effectively.
1. Scalable Windows Azure Patterns
Presented by Nikolai Blackie @ Auckland Connected Systems User Group
Principal Architect
Adaptiv Integration
20th of October 2011
2. Agenda
Overview the patterns of scalability
Overview the scale targets on Windows Azure and how to scale past these
Introduce the Windows Azure CAT team cloud scalability framework – CloudFx
Demonstrate Adaptiv’s scaled-out distributed architecture
Show how you can save money when scaling on Windows Azure
5. Traditional Scalability Issues
Synchronous sequential units of work
Large units of work
Tight coupling between components
Stateful applications
Clustering for HA
Expensive
Scaling limits
Inefficient use of resources
How should you approach this in the cloud?
8. Event Driven Processing
Storage Queues
► Enable event driven architectures
► Allow load levelling
► Scale target of 500 messages per second
Service Bus V1
► Low latency relay
► Non durable direct eventing
Service Bus V2
► Durable asynchronous pub-sub and queues
10. Load Balancing / Load Sharing
Stateless round-robin network load balancer distributing units of work across Web Roles (Web Role × 3), which hand work to Worker Roles (Worker Role × 3) via a Queue. (Legend: Azure Instance, Unit of Work)
14. Storage Services Scalability Targets
Maximum scale per account
Capacity – Up to 100 TB
Transactions – Up to 5,000 entities/messages/blobs per second
Bandwidth – Up to 3 gigabits per second
Per storage abstraction
► Single Queue - 500 messages per second
► Single Table Partition - 500 entities per second
► Single Blob - 60 MBytes/sec
What happens when targets are reached?
► 503 Server Busy – transient, not fatal
► Use Upsert on batch operations
15. Storage Services Scalability Targets
Front-End (FE) layer
Partition layer
Distributed and replicated File System (DFS) layer
16. Service Bus Scalability Targets
Quotas per service namespace
Concurrent connections – 100
Number of topics/queues - 10,000
Number of subscriptions per topic - 2,000
Maximum queue size – 1 GB – 5 GB
Throughput Targets
Queues – depending on message size, much faster than storage queues
Topics throughput dependent on subscription counts
Official guidance is coming soon
17. SQL Azure Scalability Targets
No official guidance on I/O performance
SQL Azure is a multi-tenant database platform
Runs on commodity hardware
Throttled when connections overload SQL Azure
20. Cloud Scalability – Scale Up
Edition    Database   Cost (NZD)
Web        1 GB       $11.48
Web        5 GB       $57.41
Business   10 GB      $114.93
Business   20 GB      $229.86
Business   30 GB      $344.79
Business   40 GB      $459.72
Business   50 GB      $574.66
21. Cloud Scalability Key Features
Small logical units of work
Parallel units of work
Event driven processing
Load distribution
► Load balancing
► Vertical partitioning
► Horizontal partitioning
Low cost scale out
Dynamic scale up
22. Scalable Frameworks
Windows Azure CAT Team CloudFx Reference Library & Implementation
23. CloudFx Framework Features
► CloudFx is a cloud solution framework and extensions library
► A Swiss army knife for building scalable systems
► Service agnostic retries
► Large message queue
► Payload compression
► And much more….
24. Cost-Efficient Queue Listener
Anti-Pattern: continuous queue polling when idle
Notify of new work
Provision parallel dequeue threads & increase poll rate
When there is no work, reduce threads & polling rate
Very efficient event-based processing
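The back-off listener described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the CloudFx implementation: `dequeue` stands in for a storage-queue get-message call that returns a message or `None`, and the function name and interval defaults are invented for the example.

```python
import time

# Sketch of the back-off polling pattern: poll fast while work is flowing,
# widen the interval while the queue is idle, snap back when work appears.
# All names and interval values here are illustrative assumptions.

def poll_with_backoff(dequeue, polls, min_interval=0.1, max_interval=8.0,
                      factor=2.0, sleep=time.sleep):
    """Run `polls` dequeue attempts, backing off while the queue is idle."""
    interval = min_interval
    handled = []
    for _ in range(polls):
        msg = dequeue()
        if msg is not None:
            handled.append(msg)
            interval = min_interval  # work found: poll aggressively again
        else:
            interval = min(interval * factor, max_interval)  # idle: back off
        sleep(interval)
    return handled, interval
```

The `sleep` parameter is injected so the loop can be tested without waiting; a real listener would also scale the number of dequeue threads up and down, as the slide describes.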
25. Reliable Retry Framework
Anti-Pattern: failing catastrophically or rerunning entire processes due to a service call failure
Call to resource fails
Retry based on configured pattern
► Fixed, Incremental, Exponential
Ensure idempotent operations
26. Reliable Retry Framework
Anti-Pattern: failing catastrophically or rerunning entire processes due to a service call failure
Call to resource fails
Retry based on configured pattern
► Fixed, Incremental, Exponential
Ensure idempotent operations
Expect failure, design for fault tolerance
► Netflix Simian Army
► CloudFx unit tests
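The three retry patterns named above can be sketched as interchangeable back-off policies feeding a generic retry loop. This is an illustration, not the CloudFx API; the function names and default intervals are assumptions, and as the slide notes, the retried operation must be idempotent.

```python
# Sketch of fixed, incremental, and exponential back-off policies.
# Each policy maps an attempt number to a delay in seconds.

def fixed(attempt, interval=1.0):
    return interval

def incremental(attempt, initial=1.0, increment=2.0):
    return initial + increment * attempt

def exponential(attempt, initial=1.0, factor=2.0, cap=30.0):
    return min(initial * factor ** attempt, cap)

def with_retries(operation, policy, max_attempts=5):
    """Run an idempotent operation, backing off between failed attempts."""
    for attempt in range(max_attempts):
        try:
            return operation()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # retries exhausted: surface the failure
            _ = policy(attempt)  # in real code: time.sleep(policy(attempt))
```

A production framework would also distinguish transient faults (e.g. 503 Server Busy) from permanent ones and only retry the former.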
27. Service Aggregation
Anti-Pattern: tightly coupled service deployments
Enable flexible service aggregation
Implemented using System.ServiceModel extensions
► IExtension<T>
► IExtensibleObject<T>
Consolidate services or partition vertically
28. Large Message Queue Support
Anti-Pattern: writing custom code to handle the queue storage limit of 64 KB per message
Abstracted support for messages of any size
Built-in compression when writing to all services
Utilises DeflateStream
Compress on write, decompress on read
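Python's `zlib` can produce the same raw DEFLATE format as .NET's `DeflateStream` (zlib with `wbits=-15`), so the compress-on-write / decompress-on-read idea can be sketched as follows. The function names and the JSON payload shape are illustrative assumptions, not the CloudFx serialisation format.

```python
import json
import zlib

# Sketch of compress-on-write / decompress-on-read for queue messages.
# wbits=-15 selects headerless raw DEFLATE, the format DeflateStream uses.

def serialize(payload: dict) -> bytes:
    raw = json.dumps(payload).encode("utf-8")
    co = zlib.compressobj(level=9, wbits=-15)
    return co.compress(raw) + co.flush()

def deserialize(blob: bytes) -> dict:
    raw = zlib.decompress(blob, -15)  # -15 matches the raw deflate format
    return json.loads(raw.decode("utf-8"))
```

In a real listener the compressed blob would go onto the queue when it fits the size limit, falling back to blob storage plus a pointer message when it does not.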
33. Cost Capacity Planning
Storage Transactions
► Queues – 1 transaction to put, 2 transactions to get and delete
Can batch up to 32 get-message operations into 1 transaction
► Table storage – 1 transaction per read/write
Can batch 100 entity operations into 1 group transaction
► Blobs – 1 transaction per read/write
Bandwidth
► Measured from outside and between data centers
► Free inbound data
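The per-operation counts above turn directly into billable transactions. The sketch below models them; the $0.01 per 10,000 storage transactions price reflects 2011-era Azure rates and, like the function names, is an assumption for illustration.

```python
# Rough transaction-count model for the storage operations listed above.
# Assumes the 2011-era price of $0.01 USD per 10,000 storage transactions.

PRICE_PER_TXN = 0.01 / 10_000

def queue_txns(messages, batch_get=1):
    """1 put per message; get and delete cost 2 transactions per message,
    but up to 32 messages can be fetched in a single batched get."""
    gets = -(-messages // min(batch_get, 32))  # ceiling division
    return messages + gets + messages          # puts + gets + deletes

def table_txns(entities, group_size=1):
    """Each entity group transaction (up to 100 entities, same partition)
    counts as one billable transaction."""
    return -(-entities // min(group_size, 100))

def cost(txns):
    return txns * PRICE_PER_TXN
```

Comparing `queue_txns(n)` against `queue_txns(n, batch_get=32)` shows why batched gets matter at scale: the get leg of the bill shrinks by up to 32×.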
34. Cost Capacity Planning
Storage
► Cumulative total through the month, charged on average usage
Compute Instances
► Charged as soon as a virtual instance is allocated, regardless of running state
► Billed to the nearest hour
Measurement
► Instrument Azure service access
► Use the billing manager with an A/B testing approach
35. Cost-Efficient Queue Listener
Idle Polling *
Polling Algorithm                            Transactions Per Day   Service Bus Cost Per Day   Storage Cost Per Day
Standard
One Dequeue Thread                           79,200                 $ -                        $0.09
10 Compute Instances - 150 Dequeue Threads   11,880,000             $ -                        $13.66
Back off
One Dequeue Thread                           -                      $4.59                      $0.15
10 Compute Instances - 150 Dequeue Threads   -                      $22.87                     $0.76
* 22 hours per day
Running Polling **
Polling Algorithm                            Transactions Per Day   Cost Per Day
One Dequeue Thread                           72,000                 $0.08
10 Compute Instances - 150 Dequeue Threads   10,800,000             $12.41
** 2 hours per day, 5 msgs per sec, 2 trans per message
Savings
Polling Algorithm                            Transactions Per Day   Cost Per Day
One Dequeue Thread                           79,200                 $(0.06)
10 Compute Instances - 150 Dequeue Threads   11,880,000             $12.89
36. Cost-Efficient Queue Listener
* Terms and conditions apply, your mileage may vary, these
calculations are based on simple models
37. Batching Units of Work
Parsing a 10,000-line file
► 10,000 messages a day
► Table storage batch transactions
► Using the scatter scale-out pattern
Batch Size   Queue Transactions   Table Transactions   Total Transactions   Cost Per Day
10 Lines     30,000,000           10,000,000           40,000,000           $45.98
200 Lines    1,500,000            500,000              2,000,000            $2.30
2000 Lines   150,000              50,000               200,000              $0.23
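The rows of the batching table above follow from a simple model: each batch message costs 3 queue transactions (put, get, delete) plus 1 entity group transaction. The sketch below assumes $0.01 USD per 10,000 transactions and the ~1.15 USD-to-NZD rate implied by the database price table earlier in the deck; both are assumptions.

```python
# Reproduces the batching cost table: 10,000-line files, 10,000 files a day,
# varying the number of lines carried per batch message.
# Pricing and exchange rate are 2011-era assumptions.

USD_PER_TXN = 0.01 / 10_000
USD_TO_NZD = 1.1493

def daily_cost(batch_size, lines_per_file=10_000, files_per_day=10_000):
    batches = (lines_per_file // batch_size) * files_per_day
    queue = batches * 3   # put + get + delete per batch message
    table = batches * 1   # one entity group transaction per batch
    total = queue + table
    return queue, table, total, round(total * USD_PER_TXN * USD_TO_NZD, 2)
```

Raising the batch size from 10 to 2000 lines cuts the daily transaction bill by two orders of magnitude, which is the slide's point.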
38. Batching Units of Work
* Terms and conditions apply, your mileage may vary, these
calculations are based on simple models
39. Parallel Processing & Mixing Roles
Per Day                                      Utilisation Cost
5 Threads Dequeuing across 5 Instances       $16.55
25 Threads Dequeuing across 1 Instance       $3.31

Per Day                                      Utilisation Cost
One Service over 3 Instances                 $9.93
Three Services on One Instance               $3.31
* Terms and conditions apply, your mileage may vary, these
calculations are based on simple models
40. Object Compression
Compression Ratios
► Compressing a text-based CSV – 5:1 ratio
► Compressing an XML file – 10:1 ratio
Based on 10,000 CSV messages a day
10,000-Line CSV File   Size      Blob Storage Space (GB)   Cost Per Day
Uncompressed           2064 KB   19.68                     $3.39
Compressed             390 KB    3.72                      $0.64
Savings                          15.96                     $2.75
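The compression table above can be reproduced from per-message size alone. The sketch below assumes the 2011-era blob storage rate of $0.15 USD per GB and the same ~1.15 USD-to-NZD conversion used elsewhere in the deck; both rates are assumptions for illustration.

```python
# Reproduces the object-compression cost table: per-message size in KB
# times 10,000 messages a day, priced at an assumed $0.15 USD per GB.

USD_PER_GB = 0.15
USD_TO_NZD = 1.1493

def daily_blob_cost(kb_per_message, messages_per_day=10_000):
    gb = kb_per_message * messages_per_day / (1024 * 1024)
    return round(gb, 2), round(gb * USD_PER_GB * USD_TO_NZD, 2)
```

With a 5:1 CSV compression ratio the daily storage bill drops proportionally, which is where the $2.75/day savings row comes from.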
41. Object Compression
* Terms and conditions apply, your mileage may vary, these
calculations are based on simple models
42. To Summarise
You can scale with any combination of up and out using horizontal and vertical partitioning
On the cloud, make the most of the ability to scale out using small units of work
Distribute load to reduce resource contention
Make the best use of the resources you have paid for
Use frameworks like CloudFx to help you scale using best practices
Use techniques like back-off polling, batching, parallel processing and compression to reduce costs
43. Links
Windows Azure Capacity Assessment
Building Highly Scalable Java Applications on Windows Azure
Cost Architecting for Windows Azure
Understanding Windows Azure Storage Billing – Bandwidth, Transactions, and Capacity
Operational costs of an Azure Message Queue
44. Links
Windows Azure Storage Abstractions and their Scalability Targets
Understanding the Scalability, Availability, Durability, and Billing of Windows Azure Storage
Windows Azure Storage Architecture Overview
Windows Azure AppFabric Service Bus Quotas
Inside SQL Azure
The Netflix Simian Army
45. Windows Azure CAT Links
How to Simplify & Scale Inter-Role Communication Using Windows Azure AppFabric Service Bus
Implementing Storage Abstraction Layer to Support Very Large Messages in Windows Azure Queues
Transient Fault Handling Framework for SQL Azure, Windows Azure Storage, AppFabric Service Bus
Best Practices for Maximizing Scalability and Cost Effectiveness of Queue-Based Messaging Solutions on Windows Azure
46. We would like to thank the sponsors. . .
Premier Partners
Associated Partners
Supporting Partners:
Speaker notes
You probably have one of these guys working in your organisation or for your clients. It is important for your career and your project that you keep this guy happy. This may not be a fair interpretation in your organisation; his axe may actually be bigger.
He doesn't want you costing the business lots of this by designing architectures with a poor cost profile. In this presentation I will explain some of the patterns of scalability available on the Azure platform and the cost benefits of specific approaches that will help you achieve scale while keeping costs down.
What are the patterns of scalability on the Azure platform? I will demonstrate from examples of scalable implementations, and outline the cost savings associated with the correct design approach.
Traditional n-tiered application: web tier, X number of application tiers, your data tier.
Animate this slide, start with three, scale up, highlight contention
Synchronous processes; sequential units of work; tight coupling between components; stateful; pessimistic concurrency; clustering for HA; vertical scaling. To scale, get bigger servers: expensive, has scaling limits, inefficient use of resources.
Large sequential units of work take a lot longer to process. Small units of work can be executed in parallel and take less to reprocess if there is a failure. Achieve efficiencies of scale by processing batches of data, usually because the overhead of an operation is amortized across multiple requests. There is a balance between reliability and cost: smaller units of work can be retried quickly if they fail but carry cost overhead; large batches cost less but will take longer to reprocess on failure. Ensure that you create configuration settings for this and tune appropriately. Entity Group Transactions – the ability to perform an atomic transaction across up to 100 entities with any combination of Insert, Update, or Delete for Azure Tables. The requirement is that all of the entities have to be in the same table and have the same PartitionKey value, and the total request size must be under 4 MB. Used appropriately this enables small logical units of work and retry-based recoverability.
Work on the same task in parallel on multiple processing units. Using scaled-out compute instances you can spread the load across these instances. Obviously single-threaded applications will end up with huge queues of work and under-utilised compute resource. Parallel processing will make better use of available resources and increase throughput, but ensure you test appropriately and watch performance counters like utilisation and contention.
As systems scale, the worst enemy is resource contention. Spread the system load across multiple processing units to reduce resource contention. This applies to queues, table/blob stores, relational databases and compute instances.
Spreading the load across many components without regard to the data inside the request, according to some load-balancing algorithm. Azure web & worker role endpoints: stateless round-robin distribution.
Spreading the load across many components by routing an individual request to a component that owns the data specific to that request. Partitioning is more intelligent and requires data to be routed to a service/resource that knows how to deal with it. It comes in two flavours. Vertical partitioning is typically the first stage of scaling systems out, simple to execute as you just split your application and services functionally across nodes to reduce contention: spreading the load across the functional boundaries of a problem space, with separate functions handled by different processing units. Split databases and processing across functional spaces – membership in one database, accounts in another – eventually moving each system module into its own database. Vertical partitioning only works for so long; eventually you run out of functional boundaries to spread your load across and you will begin to get contention in these functionally scaled components. There are scalability targets for each Azure component: storage queues are 500 messages per second, storage tables and blobs are 5,000 messages per second (per partition/account), and SQL Azure is not published but will start behaving very badly under excessive load.
Spreading a single type of data element across many instances, according to some partitioning key, e.g. hashing the player id and doing a modulus operation. Quite often referred to as sharding or data partitioning. You have to relax referential integrity constraints for this to work. This is a complex approach and should only be attempted through the use of a good framework. Examples of this are the Federations database library (see Chris Auld's presentation) and Partitioned Cloud Queue (http://partitioncloudqueue.codeplex.com/).
Here we can see that the Front-End layer takes incoming requests, and a given front-end server can talk to all of the partition servers it needs to in order to process the incoming requests. The partition layer consists of all of the partition servers, with a master system to perform the automatic load balancing (described below) and assignments of partitions. As shown in the figure, each partition server is assigned a set of object partitions (Blobs, Entities, Queues). The Partition Master constantly monitors the overall load on each partition server as well as the individual partitions, and uses this for load balancing. The lowest layer of the storage architecture is the Distributed File System layer, which stores and replicates the data, and all partition servers can access any of the DFS servers.
To keep a data center server’s resources from being overloaded and jeopardizing the health of the entire machine, the load on each machine is monitored by the Engine Throttling component. In addition, each database replica is monitored to make sure that statistics such as log size, log write duration, CPU usage, the actual physical database size limit, and the SQL Azure user database size are all below target limits. If the limits are exceeded, the result can be that a SQL Azure database rejects reads or writes for 10 seconds at a time. Occasionally, violation of resource limits may result in the SQL Azure database permanently rejecting reads and writes (depending on the resource type in question).
Show the files to be reconciled. Run one process. Show task allocation and timing. Show results. Run 1000 processes across the scale framework. Show results – ta-dah!
Scaling out to many small servers. Requests return as soon as possible from the application servers. Long-running processes.
Add database costs
Add database costs
Inter-role communication utilizing Service Bus event relay and topic-based messaging
Chaos Monkey, a tool that randomly disables our production instances to make sure we can survive this common type of failure without any customer impact. Conformity Monkey finds instances that don’t adhere to best practices and shuts them down. For example, we know that if we find instances that don’t belong to an auto-scaling group, that’s trouble waiting to happen. We shut them down to give the service owner the opportunity to re-launch them properly. Doctor Monkey taps into health checks that run on each instance as well as monitors other external signs of health (e.g. CPU load) to detect unhealthy instances. Once unhealthy instances are detected, they are removed from service and, after giving the service owners time to root-cause the problem, are eventually terminated. Janitor Monkey ensures that our cloud environment is running free of clutter and waste. It searches for unused resources and disposes of them. Security Monkey is an extension of Conformity Monkey. It finds security violations or vulnerabilities, such as improperly configured AWS security groups, and terminates the offending instances. It also ensures that all our SSL and DRM certificates are valid and are not coming up for renewal. 10-18 Monkey (short for Localization-Internationalization, or l10n-i18n) detects configuration and run-time problems in instances serving customers in multiple geographic regions, using different languages and character sets. Chaos Gorilla is similar to Chaos Monkey, but simulates an outage of an entire Amazon availability zone. We want to verify that our services automatically re-balance to the functional availability zones without user-visible impact or manual intervention.
The Service Component Compositional Binding Model is founded on a specialized design paradigm comprising the interface-based programming model and associated framework artifacts such as: a lightweight programming API that includes standard interface definitions, classes and methods; and a compositional binding model that provides the means of registering, discovering, instantiating and managing the lifetime of service components. Such a model enables the participating components to discover and consume other components at run-time. The components can be reused, extended or superseded by other components, allowing composable services to become highly agile and customizable. The model provides the means of abstracting how components resolve each other and interoperate without having to express and lock inter-component bindings at design time.
Anti-pattern: consider every message as a blob, always sending to blob storage and using a pointer on the queue. There is a potential that compressed messages go to the queue under 8 KB.
Adaptiv has built a scalable reconciliation service on top of the CAT team framework using Pipes and Filters and Scatter-Gather. More emphasis on the business context.
Uses a scatter approach to split up work and distribute it to your processing nodes. The framework was critical as it abstracted away all the complexity of dealing with queues and other resources in a simple and efficient manner. Pipes and Filters: the output of one process becomes the input of the next, allowing us to break up a complex process and chain it together. Scatter-Gather: each process splits its work into 1 or more tasks that are processed across a pool of compute instances.
Show the files to be reconciled. Run one process. Show task allocation and timing. Show results. Run 1000 processes across the scale framework. Show results – ta-dah!
Batching – cost advantages of batching requests together versus lower levels of granularity. Check entity group transactions.
Cost benefits of utilising parallel threads versus scaling up.
Mixing Functional Roles – cost advantages of having fully utilised compute instances. The use of the extensibility model & multithreading to compose and discover. Can change at runtime.
Compressing objects will reduce storage costs. Compression occurs when writing to queues, and to blob stores too.