Performance By Design

Performance by Design Guy Harrison Director, R&D Melbourne www.guyharrison.net

Save the red-shirt Toad! The Red-shirt Toad is NOT expendable!

Core message Design limits performance Architecture maps requirements to design Make sure performance requirements are specified Make sure architecture allows for performance Make sure performance requirements are realized

Elements of Performance by Design

High performance can mean different things Speed: response time

Not usually easy to change architectures

Poorly defined requirements lead to this:

“Twitter is, fundamentally, a messaging system. Twitter was not architected as a messaging system, however. For expediency's sake, Twitter was built with technologies and practices that are more appropriate to a content management system.”

Patterns of database performance Hard to distinguish patterns at low levels

Validating performance can’t wait... User adoption and growth UI Layer (HTML, JavaScript, Ajax) Middleware layer (J2EE) SQLs Database (Tables, views, partitions, etc)

Other logical design thoughts Artificial keys Generally more efficient than long composite keys Null values Not a good idea if you intend to search for “unknown” or “incomplete” values Null should not mean something But beneficial as long as you don’t need to look for them. Data types Constraints on precision can sometimes reduce row lengths Variable length strings usually better Carefully consider CLOBs vs long VARCHARs

Logical to Physical: Subtypes “Customers are people too”

Indexing, clustering and weird table types Lots’ of options: B*-Tree index Bitmap index Hash cluster Index Cluster Nested table Index Organized Table Most often useful: B*-Tree (concatenated) indexes Bitmap indexes Hash Clusters

Concatenated index effectiveness SELECT cust_id FROM sh.customers c WHERE cust_first_name = 'Connor' AND cust_last_name = 'Bishop' AND cust_year_of_birth = 1976;

Concatenated indexing guidleines Create a concatenated index for columns from a table that appear together in the WHERE clause. If columns sometimes appear on their own in a WHERE clause, place them at the start of the index. The more selective a column is, the more useful it will be at the leading end of the index (better single key lookups) But indexes compress better when the leading columns are less selective. (better scans) Index skip scans can make use of an index even if the leading columns are not specified, but it’s a poor second choice to a “normal” index range scan.

Bitmap join performance SELECT SUM (amount_sold) FROM customers JOIN sales s USING (cust_id) WHERE cust_email='flint.jeffreys@company2.com';

Hash Cluster Cluster key determines physical location on disk Single IO lookup by cluster key Misconfiguration leads to overflow or sparse tables

Denormalization and partitioning Repeating groups – VARRAYS, nested tables Summary tables – Materialized Views, Result cache Horizontal partitioning – Oracle Partition Option In-line aggregations – Dimensions Derived columns – Virtual columns Vertical partitioning Replicated columns - triggers

Summary tables Aggregate queries on big tables often the most expensive Pre-computing them makes a lot of sense Balance accuracy with overhead Aggregate Query MV on COMMIT Manual Summary Result set cache MV stale tolerated Accuracy Efficiency

Physical storage options LOB Storage PCTFREE Compression Block size Partitioning

Application Architecture and implementation

The best SQL is no SQL Avoid asking for the same data twice.

11g client side cache CLIENT_RESULT_CACHE_SIZE: this is the amount of memory each client program will dedicate to the cache. Use RESULT_CACHE hint or (11GR2) table property Optionally set the CLIENT_RESULT_CACHE_LAG

Parse overhead It’s easy enough in most programming languages to create a unique SQL for every query:

Identifying similar SQLs See force_matching.sql at www.guyharrison.net

Transaction design Optimistic vs. Pessimistic

Using ORA_ROWSCN Setting ROWDEPENDENCIES will reduce false fails

Network overhead – Array processing

Geek quiz stuff: High probability answers (keep standing if): Know what Alice and Wally have in common You know the next number in this series 3 . 1 4 Know what “M” is in E=MC2

Know (or can work out) your age in hex Have an opinion about of ST vs SW If you know who Leonard McCoy is Think there is an important distinction between Nerd and Geek Can quote Monty Python …. Other than dead parrot? You’ve ever watched Jerry Springer

There are more networked devices in your house than people, pets and cars Know the names of two of Thomas the tank engines friends Know the names of any of Angelina and Brad’s babies Low probability answers: (sit down if you): Have a twitter account # Azure is your new favourite color

You’ve ever played Zork You have a favourite Dr Who companion Your favourite is Sarah Jane Know your age in binary (or can work it out in your head) You are proficient in some form of assembler

# You are proficient in some for or English There is a rubicks cube in your house Have your own domain Have ever been to Azeroth Who is Know who said “Dude I am not your nemesis”

Worn a star trek or star wars costume Played a game that uses a non-six sided dice Get email on my phone – before getting out of bed Calculator watch Binary time piece Was on the internet prior to the WWW

# Met my current partner on line Know the next thing in this sequence: Hydrogen, Helium, Lithuim, Berilium, …. Know what a Gigaquads in a megaquad is

Saw a sci-fi movie more than twice at the movies ========================================================= You cleaned up at home before going to work

Performance By Design

Recomendados

Recomendados

Más contenido relacionado

Destacado

Destacado (20)

Similar a Performance By Design

Similar a Performance By Design (20)

Más de Guy Harrison

Más de Guy Harrison (19)

Último

Último (20)

Performance By Design

Notas del editor