Optimizing Your Cloud Applications in RightScale

Rafael H. Saavedra
VP of Engineering, RightScale

Agenda
• Introduction
• 3-tier application architecture
• Vertical & horizontal scaling
• RightScale monitoring and cluster graphs
• New Relic RPM
• Support for optimizing DB performance
• Load testing

Real Cloud Experience. Shared.

Cloud computing characteristics
• Multi-tenancy
• Shared resource pooling
• Geo-distribution and ubiquitous network access
• Service oriented
• Dynamic resource provisioning
• Self-organizing
• Utility based pricing


Cloud computing advantages
• No upfront investment
• Lower operating costs
• Highly scalable
• Easy access
• Reduces business risk and maintenance costs
• Enables process automation


3-tier application architecture
• Load balancers
• An array of application servers
• Master-slave database servers


Vertical & Horizontal Scaling

Cloud performance optimization
• Instance size (vertical scaling)

• Instance autoscaling (horizontal scaling)
• Server arrays

• RightScale support for performance optimization
• ServerTemplates are configured to capture performance data
• Collectd RightScripts
• Hardware & OS monitoring data
• Specialized plugins – MySQL, HAProxy, Apache, Nginx, IIS, etc.
• Monitoring graphs: individual, cluster, stacked, heat maps
• Alerts & escalations

• New Relic RPM


Scaling up – spectrum of instance sizes
• Compute units vs memory
[Chart: memory (GB, 0.5 to 128, doubling scale) against compute units (0 to 40) for instance types t1.micro, m1.small, c1.medium, m1.large, m1.xlarge, c1.xlarge, m2.xlarge, m2.2xlarge, m2.4xlarge, and cc1/cg1.4xlarge, grouped into three regions: Test & Dev, Scalable Applications, and High Performance Computing]


Server arrays provide horizontal scaling
• The array scales up or down based on performance votes
• Tags allow scaling on an arbitrary decision set
• Decision threshold controls reaction time
• Sleep time allows new resources to have an impact
• Scaling can be time dependent
• Detailed setup instructions: http://bit.ly/c1oLr2

• Fast response to changes in load conditions using alerts

• Allocation of servers to availability zones based on weights

• Deployment-based so configuration is consistent

• Arrays can be pre-scaled to support anticipated demand
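
The voting rule above can be sketched in a few lines. This is a minimal illustration, not RightScale's implementation: the function name, the vote labels `grow` and `shrink`, and the 51% default threshold are all assumptions made for the example.

```python
from collections import Counter

def scaling_decision(votes, decision_threshold=0.51):
    """Return 'grow', 'shrink', or 'hold' for one evaluation round.

    votes: one entry per running server: 'grow', 'shrink', or None (no vote).
    decision_threshold: fraction of servers that must agree before the
        array is resized (hypothetical default).
    """
    if not votes:
        return "hold"
    counts = Counter(v for v in votes if v is not None)
    total = len(votes)
    # Growing is checked first so the array reacts to rising load
    # before it considers shrinking.
    if counts["grow"] / total >= decision_threshold:
        return "grow"
    if counts["shrink"] / total >= decision_threshold:
        return "shrink"
    return "hold"

# Hypothetical round: three of four servers vote to grow.
print(scaling_decision(["grow", "grow", "grow", None]))
```

A lower threshold makes the array react sooner but also makes it more sensitive to a single noisy server. After a resize, a real array would also wait for the sleep time before counting votes again, which this sketch leaves out.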

Monitoring & Cluster Graphs with RightScale

Server monitoring graphs


Cluster monitoring
• Individual graphs
• Good for a dozen servers
• Displays all standard graphs with full detail
• Stacked graphs
• Displays the contribution of many servers to a total
• Great to see the sum and variability of activity in a cluster
• Difficult to make out individual servers
• Examples: requests/sec, cpu busy cycles, I/O bytes/sec
• Heat maps
• Displays a bar for each server
• Great to see uneven distribution across servers
• Great to quickly spot performance problems across many servers
• Difficult to read absolute values or see the total cluster activity
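
The difference between the two cluster views can be shown on raw data. The sketch below assumes per-server samples held in a plain dictionary; the server names, values, and the four heat levels are made up for illustration and do not reflect how RightScale stores or renders its graphs.

```python
def stacked_totals(samples):
    """Sum every server's value at each time step.

    This is the top edge of a stacked graph: the cluster total.
    """
    return [sum(step) for step in zip(*samples.values())]

def heat_rows(samples, max_value, levels=4):
    """Map each server's values to heat levels 0..levels-1.

    Each returned row corresponds to one horizontal strip of a heat map.
    """
    rows = {}
    for server, values in samples.items():
        rows[server] = [min(levels - 1, v * levels // max_value) for v in values]
    return rows

# Hypothetical CPU-busy percentages for three servers over three samples.
cpu_busy = {
    "app-1": [20, 25, 30],
    "app-2": [22, 24, 95],
    "app-3": [18, 26, 28],
}

print(stacked_totals(cpu_busy))
print(heat_rows(cpu_busy, max_value=100))
```

The totals show the overall rise in the last sample but not which server caused it. The heat rows single out `app-2` immediately but no longer show the cluster total.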


Cluster monitoring architecture
• Architecture
• Monitoring front-end servers pull data from storage servers
• Up to 100 servers on one graph (to be increased)
[Diagram: your servers, monitoring storage servers, monitoring front-end servers]


Cluster monitoring
• Current cluster monitoring: one graph per server


Stacked graphs
• Each color band shows contribution of one server
• Servers are stacked on top of one another


Heat maps
• Each horizontal strip shows one server
• The color shows how “hot” the server is running


Heat map with 100 servers


Stacked graph of the same 100 servers


Application Performance Analytics with New Relic

New Relic RPM
• Real-Time App Performance Analytics

• Supports Ruby, PHP, Java & .Net

• SQL & NoSQL performance

• Web transaction tracing

• Performance notifications

• Availability monitoring

• Scalability analysis


• Direct access from the RightScale dashboard

• Historical statistics over a period of time

• Distribution of the most time-consuming requests

• Response-time statistics by country

• Detailed response times by browser

New Relic RPM – 2 Examples
• An expensive query

• The N+1 query problem
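
The N+1 query problem is easiest to see side by side with its fix. The sketch below uses Python's built-in `sqlite3` module as a stand-in database; the `authors` and `posts` tables and both function names are hypothetical, and a MySQL-backed application would show the same pattern through its own driver or ORM.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE authors (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE posts (id INTEGER PRIMARY KEY, author_id INTEGER, title TEXT);
    INSERT INTO authors VALUES (1, 'ann'), (2, 'bob');
    INSERT INTO posts VALUES (1, 1, 'a1'), (2, 1, 'a2'), (3, 2, 'b1');
""")

def titles_n_plus_one(conn):
    """One query for the authors, then one more query per author."""
    queries = 1
    result = {}
    authors = conn.execute("SELECT id, name FROM authors ORDER BY id").fetchall()
    for author_id, name in authors:
        rows = conn.execute(
            "SELECT title FROM posts WHERE author_id = ? ORDER BY id",
            (author_id,),
        ).fetchall()
        queries += 1
        result[name] = [title for (title,) in rows]
    return result, queries

def titles_single_join(conn):
    """The same data fetched with a single JOIN."""
    result = {}
    rows = conn.execute(
        "SELECT a.name, p.title FROM authors a "
        "JOIN posts p ON p.author_id = a.id ORDER BY a.id, p.id"
    )
    for name, title in rows:
        result.setdefault(name, []).append(title)
    return result, 1

print(titles_n_plus_one(conn))
print(titles_single_join(conn))
```

With two authors the first version issues three queries; with N authors it issues N+1, each paying a network round trip in a real deployment. Transaction traces tend to show this as many short, near-identical queries inside one request.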


Optimizing Database Performance

Optimizing DB performance
• RightScale MySQL ServerTemplates
• Configuration files tailored to instance size
• innodb_buffer_pool_size
• key_buffer_size
• thread_size
• sort_buffer_size

• The never-ending task of identifying current bottlenecks
• Disk seeks
• Performance of disk operations
• Scale up when the working set cannot fit in memory, to avoid active swapping
• Constant monitoring of performance graphs, logs, and queries

• Schema considerations
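
Tailoring a configuration file to instance size amounts to deriving buffer sizes from available memory. The sketch below is only an illustration of that idea: the 70% and 5% fractions and both function names are assumptions, not the values or logic used by the RightScale MySQL ServerTemplates.

```python
def mysql_memory_settings(instance_memory_gb, innodb_fraction=0.7,
                          key_buffer_fraction=0.05):
    """Derive two my.cnf values from the instance's memory.

    The fractions are illustrative assumptions and should be tuned
    against the actual workload.
    """
    total_mb = int(instance_memory_gb * 1024)
    return {
        "innodb_buffer_pool_size": f"{int(total_mb * innodb_fraction)}M",
        "key_buffer_size": f"{int(total_mb * key_buffer_fraction)}M",
    }

def render_mycnf(settings):
    """Render the settings as a [mysqld] section."""
    lines = ["[mysqld]"] + [f"{key} = {value}" for key, value in settings.items()]
    return "\n".join(lines)

# Hypothetical instance with 8 GB of memory.
print(render_mycnf(mysql_memory_settings(8)))
```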


Schema considerations
• Lookups need to be indexed

• Sorting requires an index

• Joins need to be done on indices
• Become slower as tables grow

• Compound indices should be used consistently

• Do not abuse indices
• Each additional index adds disk writes whenever rows change

• Compact tables if they become fragmented
• Deleting rows does not remove the corresponding index entries
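
The effect of an index on a lookup can be checked directly from the query plan. The sketch below uses SQLite through Python's `sqlite3` module as a stand-in, with `EXPLAIN QUERY PLAN` playing the role of MySQL's `EXPLAIN`; the `users` table, the index name, and the `plan` helper are hypothetical, and the exact plan wording varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT, name TEXT)")
conn.executemany(
    "INSERT INTO users (email, name) VALUES (?, ?)",
    [(f"user{i}@example.com", f"user{i}") for i in range(1000)],
)

def plan(conn, sql, params=()):
    """Return the plan detail strings SQLite reports for a query."""
    return [row[3] for row in conn.execute("EXPLAIN QUERY PLAN " + sql, params)]

lookup = "SELECT * FROM users WHERE email = ?"

# Without an index the lookup has to scan the whole table.
before = plan(conn, lookup, ("user42@example.com",))

# With an index on the lookup column the plan switches to an index search.
conn.execute("CREATE INDEX idx_users_email ON users (email)")
after = plan(conn, lookup, ("user42@example.com",))

print(before)
print(after)
```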

Monitoring DB performance
• Standard collectd statistics
• User vs wait time (disk operations)
• Performance of disk operations
• Scale up when working set cannot fit in memory

• MySQL collectd plugin
• Monitor INSERT, SELECT, UPDATE operations
• The breakdown of read operations can indicate missing indices

• Monitor the /var/log/mysqlslow.log file
• Identify slow queries

• Use the MySQL EXPLAIN command to inspect the query plan
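
Slow queries can be pulled out of the slow query log with a short script. The sketch below parses the `# Query_time:` header lines that MySQL writes for each entry; the sample log text is made up for illustration, and the function name and one-second cutoff are assumptions.

```python
import re

# Header line MySQL writes before each slow query entry.
HEADER = re.compile(
    r"^# Query_time: ([\d.]+)\s+Lock_time: ([\d.]+)\s+"
    r"Rows_sent: (\d+)\s+Rows_examined: (\d+)"
)

def slow_queries(log_text, min_seconds=1.0):
    """Return entries at or above min_seconds, slowest first."""
    entries = []
    current = None
    for line in log_text.splitlines():
        match = HEADER.match(line)
        if match:
            current = {
                "query_time": float(match.group(1)),
                "rows_examined": int(match.group(4)),
                "sql": "",
            }
            entries.append(current)
        elif (current is not None and not line.startswith("#")
              and not line.startswith("SET timestamp")):
            # Statement text may span several lines; join them.
            current["sql"] = (current["sql"] + " " + line.strip()).strip()
    slow = [e for e in entries if e["query_time"] >= min_seconds]
    return sorted(slow, key=lambda e: e["query_time"], reverse=True)

# Made-up sample in the slow query log layout.
SAMPLE = """\
# Time: 110101 12:00:00
# User@Host: app[app] @ localhost []
# Query_time: 3.200000  Lock_time: 0.000100 Rows_sent: 10  Rows_examined: 500000
SET timestamp=1293883200;
SELECT * FROM orders WHERE customer_email = 'a@example.com';
# User@Host: app[app] @ localhost []
# Query_time: 0.400000  Lock_time: 0.000050 Rows_sent: 1  Rows_examined: 1
SET timestamp=1293883201;
SELECT * FROM orders WHERE id = 7;
"""

for entry in slow_queries(SAMPLE):
    print(entry["query_time"], entry["rows_examined"], entry["sql"])
```

A high `Rows_examined` count relative to `Rows_sent` is a reasonable hint that the query is scanning, which makes it a good candidate for EXPLAIN.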


MySQL Collectd Plugin
• Uses MySQL SHOW STATUS command to collect statistics
• A large set of counters that are divided into 10 categories
• Connections
• IO Requests
• Select Rates
• Read Rates
• Key Rates
• Commands Rates
• Query Cache
• Tables
• Memory
• Misc.
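
The SHOW STATUS counters are cumulative, so graphs of rates come from the difference between two readings. The sketch below shows that conversion on two hand-written snapshots; the function name, the snapshot values, and the 10-second interval are assumptions, and it does not reproduce how the collectd plugin itself is implemented.

```python
def counter_rates(previous, current, interval_seconds):
    """Convert two snapshots of cumulative counters into per-second rates."""
    return {
        name: (current[name] - previous[name]) / interval_seconds
        for name in current
        if name in previous
    }

# Hypothetical SHOW STATUS readings taken 10 seconds apart.
snapshot_t0 = {"Com_select": 1000, "Com_insert": 400, "Com_update": 150}
snapshot_t1 = {"Com_select": 1200, "Com_insert": 450, "Com_update": 160}

print(counter_rates(snapshot_t0, snapshot_t1, interval_seconds=10))
```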


mysqlslow.log & EXPLAIN command


MySQL performance depends on locality
• Wait time should be minimal when the working set fits in memory
• Performance degrades once wait time is significant

[Graph annotations: wait time insignificant; user time dominates]


MySQL reads graphs
• Read-random-next represents a table scan
• Read-next represents an index scan
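
The two read counters can be combined into a single number that indicates how much of the read traffic comes from table scans. The sketch below assumes the rates come from MySQL's `Handler_read_rnd_next` and `Handler_read_next` status counters; the function name and the sample rates are hypothetical.

```python
def table_scan_share(read_rnd_next_rate, read_next_rate):
    """Fraction of row reads that came from table scans rather than index scans."""
    total = read_rnd_next_rate + read_next_rate
    if total == 0:
        return 0.0
    return read_rnd_next_rate / total

# Hypothetical rates in rows per second.
share = table_scan_share(read_rnd_next_rate=9000.0, read_next_rate=1000.0)
print(f"{share:.0%} of row reads come from table scans")
```

A share that stays high under normal traffic suggests that frequent queries are missing an index.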


Load Testing

Load testing using httperf
• RightScale provides ServerTemplates in the marketplace
• https://my.rightscale.com/library/server_templates/Httperf-Load-Tester/24714

• Tutorial on httperf setup and configuration
• http://support.rightscale.com/03-Tutorials/02-AWS/E2E_Examples/E2E_Gaming_Deployment/Adding_Httperf_Load_Tester
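
A load test usually steps through several request rates rather than a single one. The sketch below only builds the httperf command lines and does not run them; the host name `lb.example.com`, the chosen rates, the 60-second step length, and the helper names are assumptions for illustration.

```python
import shlex

def httperf_command(server, uri="/", port=80, rate=50, num_conns=1000,
                    num_calls=1, timeout=5):
    """Build an httperf argument list for one fixed-rate run."""
    return [
        "httperf",
        "--server", server,
        "--port", str(port),
        "--uri", uri,
        "--rate", str(rate),            # new connections per second
        "--num-conns", str(num_conns),  # total connections for the run
        "--num-calls", str(num_calls),  # requests per connection
        "--timeout", str(timeout),      # seconds before a request counts as failed
    ]

def load_steps(server, rates, seconds_per_step=60):
    """One command per rate, each sized to last roughly seconds_per_step."""
    return [
        httperf_command(server, rate=rate, num_conns=rate * seconds_per_step)
        for rate in rates
    ]

# Hypothetical target: the load balancer in front of the application tier.
for command in load_steps("lb.example.com", [50, 100, 200]):
    print(shlex.join(command))
```

Each argument list can be passed to `subprocess.run` on a machine where httperf is installed. Watching the cluster graphs while the rate increases shows at which step the array starts to scale.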

