Scaling apps for the big time

Accompanying slides for the "Scaling apps for the big time" presentation delivered at MelbDjango 1.4, hosted by Common Code

Pro IT Consulting
Scaling apps for the big time

The Challenge?
• You have an app that works
• You have users that like it
Awesome
• Performance is suffering as you scale.
• Reliability is getting worse, not better.
• As your data sets grow, the problems become more pronounced.
• The operations team are talking about problems, not solutions.
Not so awesome

So what happens if you win big?

You are not alone – unfortunately…
• Your cool app
• May end up supported
• By lots of things
• You can’t control

What is the root cause?
• Take the time to understand what happens when your code asks the server to do some task.
select * from some_production_table_with_100_000_000_records
is really not the same workload as
select * from some_dev_table_with_100_records
• Look for evidence in logs and tools that provide real insight.
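
The gap between the two queries above can be measured rather than guessed. The following is a minimal sketch, assuming an in-memory SQLite database and a hypothetical table named `records`; the row counts are scaled down from the slide so that it runs in a moment. It shows the same `SELECT *` returning, and therefore costing, in proportion to the table size.

```python
import sqlite3
import time


def time_full_scan(row_count):
    """Build a table with row_count rows, then time SELECT * against it."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical table; the name and columns are illustrative only.
    conn.execute("CREATE TABLE records (id INTEGER PRIMARY KEY, payload TEXT)")
    conn.executemany(
        "INSERT INTO records (payload) VALUES (?)",
        (("x" * 100,) for _ in range(row_count)),
    )
    start = time.perf_counter()
    rows = conn.execute("SELECT * FROM records").fetchall()
    elapsed = time.perf_counter() - start
    conn.close()
    return len(rows), elapsed


if __name__ == "__main__":
    # Same query text, very different amount of work.
    for n in (100, 100_000):
        fetched, seconds = time_full_scan(n)
        print(f"{n:>7} rows in table -> {fetched} rows fetched in {seconds:.4f}s")
```

Timings will vary by machine; the point is the ratio between the two runs, not the absolute numbers.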

Issues of priority…
• Disk drive, single user session
• Disk drives, multiple users…

Issues of Scale…
• Fetching blocks, single user session
• Fetching blocks, enterprise workload

Storage
• Many database and operating system vendor recommendations are woefully out of date.
• Modern techniques utilising flash in the right way can deliver millions of random IOPS.
• SAN and flash vendors have made dramatic changes over the last few years that invalidate many of the old recommendations.
• Some principles still hold and are important for optimised performance:
– One process writes to each disk group
– Avoid reads and writes occurring simultaneously if possible

CPU
• CPUs are not all created equal.
• Use SPECint benchmark results to compare them if it matters for your workload.
• Split up the work and scale wide if you can. There is a reason the web-scale companies do.
• Don’t process work now that can wait until later.
• Later might be in a few seconds and on another box.
• Schedule intensive workloads like reports.
• Don’t expect your laptop and the production server to scale the same way.
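
The "don’t process work now that can wait until later" advice can be sketched with nothing more than the Python standard library. This is a minimal illustration, not a production job queue: the request is acknowledged immediately and the expensive work is handed to a background worker. The names `build_report` and `handle_requests` are hypothetical, and in a real deployment the worker would typically live on another box behind a proper queue.

```python
import queue
import threading


def start_worker(jobs, results):
    """Run queued jobs on a background thread until a None sentinel arrives."""
    def run():
        while True:
            job = jobs.get()
            if job is None:  # sentinel: no more work
                break
            func, args = job
            results.append(func(*args))

    worker = threading.Thread(target=run)
    worker.start()
    return worker


def build_report(customer_id):
    # Hypothetical stand-in for an expensive report.
    return f"report-for-{customer_id}"


def handle_requests(customer_ids):
    """Acknowledge each request at once and defer the heavy work."""
    jobs = queue.Queue()
    results = []
    worker = start_worker(jobs, results)
    acknowledgements = []
    for customer_id in customer_ids:
        jobs.put((build_report, (customer_id,)))  # defer, don't compute
        acknowledgements.append(f"accepted-{customer_id}")
    jobs.put(None)
    worker.join()  # only so the sketch can show the results
    return acknowledgements, results


if __name__ == "__main__":
    acks, reports = handle_requests([1, 2, 3])
    print(acks)
    print(reports)
```

A single worker keeps the jobs in order; scaling wide means running more workers, at which point ordering is no longer guaranteed.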

Memory
• Memory is addressable in various forms, each trading performance against capacity.
• Use the lowest-latency form you can afford.
Memory type    Typical capacity    Approximate access time
CPU cache      30 MB               < 10 ns
DDR3           64 GB               < 100 ns
SSD            ~800 GB             < 20,000 ns
FC or SAS      ~1 TB               < 20,000,000 ns
SATA           4 TB+               < 8,000,000 ns
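
To get a feel for the gaps between these tiers, the figures can be turned into ratios. The sketch below copies the upper-bound access times from the table as given and expresses each tier relative to a CPU cache hit. The function names are illustrative, and because the inputs are bounds rather than measurements the results are rough orders of magnitude.

```python
# Upper-bound access times in nanoseconds, copied from the table above.
ACCESS_TIME_NS = {
    "CPU cache": 10,
    "DDR3": 100,
    "SSD": 20_000,
    "FC or SAS": 20_000_000,
    "SATA": 8_000_000,
}


def slowdown_vs_cache(tier):
    """How many times slower a tier is than a CPU cache hit."""
    return ACCESS_TIME_NS[tier] / ACCESS_TIME_NS["CPU cache"]


def human_scale_seconds(tier, cache_seconds=1.0):
    """If a cache hit took cache_seconds, how long would this tier take?"""
    return slowdown_vs_cache(tier) * cache_seconds


if __name__ == "__main__":
    for tier in ACCESS_TIME_NS:
        print(f"{tier:<10} {slowdown_vs_cache(tier):>12,.0f}x a cache hit")
```

On these figures, if a cache hit took one second, an SSD read would take about half an hour (2,000 seconds).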

Network
• Why is it that we conceptualise networks from an individual point of view?

Network
The best transport is context dependent

Network
• Latency and bandwidth are not the same thing.
– Think of the satellite delay on a TV interview.
• In this context we use these definitions:
– Latency is the time data takes to reach the other end of the network.
– Bandwidth is the rate at which we can successfully transmit data to the other end.
• This is why you need to test your app through a latency generator.
– There are capable free, open source tools such as WANem.
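
Why latency matters even on a fast link can be shown with a simple back-of-the-envelope model. The sketch below assumes a chatty application that makes a number of request/response round trips, each costing twice the one-way latency defined above, plus the time to push the payload through the available bandwidth. The function name, parameters and example figures are illustrative assumptions, and the model ignores protocol overheads such as TCP slow start.

```python
def transfer_time_seconds(payload_bytes, bandwidth_bits_per_s,
                          one_way_latency_s, round_trips=1):
    """Estimate total time: waiting on round trips plus sending the payload."""
    waiting = round_trips * 2 * one_way_latency_s
    sending = payload_bytes * 8 / bandwidth_bits_per_s
    return waiting + sending


if __name__ == "__main__":
    # Same app, same 100 Mbit/s link, same 10 KB payload, 50 round trips.
    lan = transfer_time_seconds(10_000, 100e6, 0.00025, round_trips=50)
    wan = transfer_time_seconds(10_000, 100e6, 0.075, round_trips=50)
    print(f"LAN: {lan:.4f}s  WAN: {wan:.4f}s")
```

In this example the bandwidth term is under a millisecond in both cases; almost all of the WAN time is spent waiting on round trips, which is the behaviour a latency generator exposes before users do.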

Middleware
• WebSphere, WebLogic, JBoss, Tomcat
– Garbage collection involves tradeoffs between JVM size and system memory/CPU capacity.
• Django
– Read High Performance Django by the team from Lincoln Loop
– Sponsored by the Common Code team

SQL databases
• Microsoft SQL Server, Oracle DB, PostgreSQL and MySQL.
• Each has its strengths and weaknesses, but they have some key things in common.
• Offload reporting away from OLTP workloads
• Indexes are important
• Transaction Logs are a performance bottleneck
• Think deeply about scaling out
• Think about caching queries
• Backups are critical because you will need to restore one day
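
"Indexes are important" is easy to verify by asking the database how it plans to run a query. The sketch below uses SQLite only because it ships with Python; the table `orders` and the index `idx_orders_customer` are hypothetical. The four databases listed above each have their own equivalent of `EXPLAIN`, with different syntax and output.

```python
import sqlite3


def query_plan(conn, sql, params=()):
    """Return SQLite's plan for a query as one line of text."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The human-readable detail is the last column of each plan row.
    return " | ".join(row[-1] for row in rows)


def plans_before_and_after_index():
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL)"
    )
    conn.executemany(
        "INSERT INTO orders (customer_id, total) VALUES (?, ?)",
        ((i % 1000, float(i)) for i in range(10_000)),
    )
    sql = "SELECT * FROM orders WHERE customer_id = ?"
    before = query_plan(conn, sql, (42,))  # no index yet: full table scan
    conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
    after = query_plan(conn, sql, (42,))   # the planner can now use the index
    conn.close()
    return before, after


if __name__ == "__main__":
    before, after = plans_before_and_after_index()
    print("before:", before)
    print("after: ", after)
```

The exact wording of the plan differs between SQLite versions, but the first plan reports a scan of the table and the second names the index.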

Backup is about Restore
• Enterprise-wide backup will find all your infrastructure failings by pushing more data for longer while other work continues.
• Test your restores. Really, test them.
• Offload large backups away from your production systems.
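
"Test your restores" can be automated. The following is a minimal sketch using the backup API of Python's built-in `sqlite3` module: it copies a database, then checks the restored copy against the source with a simple fingerprint. The table `accounts` and the fingerprint query are illustrative assumptions; a real restore test would run against your actual backup tooling and a more thorough set of checks.

```python
import sqlite3


def make_source():
    """Create a small hypothetical database to back up."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance INTEGER)")
    conn.executemany(
        "INSERT INTO accounts (balance) VALUES (?)",
        ((i * 10,) for i in range(1000)),
    )
    conn.commit()
    return conn


def fingerprint(conn):
    """Cheap consistency check: row count and sum of balances."""
    return conn.execute(
        "SELECT COUNT(*), COALESCE(SUM(balance), 0) FROM accounts"
    ).fetchone()


def backup_and_verify(source):
    """Back up to a second database, then prove the copy can be read back."""
    restored = sqlite3.connect(":memory:")
    source.backup(restored)  # Connection.backup, available since Python 3.7
    matches = fingerprint(source) == fingerprint(restored)
    restored.close()
    return matches


if __name__ == "__main__":
    print("restore verified:", backup_and_verify(make_source()))
```

The design point is that the check reads from the restored copy, not from the backup job's exit status.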

Questions?
How to get in touch?
James Clifford
Email: james@proitconsulting.com.au
Phone: 0421 648 034
Brenton Carbins
Email: brenton@proitconsulting.com.au
Phone: 0409 779 230


Editor's notes

You are not the only fish in the sea…
