SlideShare una empresa de Scribd logo
1 de 67
Descargar para leer sin conexión
Making the most of Solid State Disk
in Oracle 11g


Guy Harrison
Director, R&D Melbourne

Email:      guy.harrison@quest.com
Twitter:    @guyharrison
Web:        http://www.guyharrison.net




                                         ©2011 Quest Software, Inc. All rights reserved..
Introductions
Star trek shirt fatality analysis



  Red




Yellow




 Blue




         0   10   20      30       40       50        60   70   80
                                   Pct
Agenda

• Brief History of Magnetic Disk
• Solid State Disk (SSD) technologies
• SSD internals
• Oracle DB flash cache architecture
• Performance comparisons
• Recommendations and Suggestions




                                                                                           10

                                        ©2011 Quest Software, Inc. All rights reserved..
A brief history of disk




                                                                             11

                          ©2011 Quest Software, Inc. All rights reserved..
5MB HDD circa 1956
28MB HDD - 1961
1800 RPM
The more that things change....
Moore’s law
• Transistor density doubles every 18 months
• Exponential growth is observed in most electronic
  components:
 •  CPU clock speeds
 •  RAM
 •  Hard Disk Drive storage density

• But not in mechanical components
 •  Service time (Seek latency) – limited by actuator arm speed and disk
    circumference
 •  Throughput (rotational latency) – limited by speed of rotation,
    circumference and data density


                                                                                                             15

                                                          ©2011 Quest Software, Inc. All rights reserved..
Disk trends 2001-2009

                 2,000




                 1,500




                 1,000
   %age change




                   500



                           260         1,635                      1,013
                     0
                                                       -630                -390



                  -500




                 -1,000
                          IO Rate   Disk Capacity   IO/Capacity   CPU     IO/CPU
Solid State Disk




                                                                      17

                   ©2011 Quest Software, Inc. All rights reserved..
SSD to the rescue?


     SSD DDR-RAM
                           15


       SSD PCI flash
                           25


     SSD SATA Flash
                           80


       Magnetic Disk
                                                                           4,000

                       0        500 1,000 1,500 2,000 2,500 3,000 3,500 4,000 4,500
                                              Seek time (us)
Power consumption




       Start up
                                                                         20




                            0.15
         Seek
                                                                                    Flash SSD
                                                                    10
                                                                                    SATA HDD




                     0.08
           Idle
                                                               8



              0.01   0.1                      1                10             100
                                   Watts (logarithmic scale)
Economics of SSD

                                                                  $/GB
                        0.00          10.00     20.00             30.00          40.00     50.00           60.00

                               0.06
     FusionIO PCI SLC SSD                                                                          53.44


                               0.06
 FusionIO PCI MLC Duo SSD                                      24.92


                            0.05
       Intel SLC SATA SSD                               21.88
                                                                                                                   $/IOP
                            0.05                                                                                   $/GB
       Intel MLC SATA SSD             6.88




         Seagate SAS HDD       1.00                                              1.53



        Seagate SATA HDD    0.09                                                                           2.38


                        0.00             0.50           1.00              1.50           2.00              2.50
                                                                  $/IOP
Tiered storage management



                           Main Memory

                             DDR SSD

                            Flash SSD
    $/GB




                                                 $/IOP
                     Fast Disk (SAS, RAID 0+1)

                     Slow Disk (SATA, RAID 5)

                      Tape, Flat Files, Hadoop
SSD technology and internals




                                                                             22

                          ©2011 Quest Software, Inc. All rights reserved..
Flavours of Flash SSD
   DDR RAM Drive
   SATA flash drive
   PCI flash drive
   SSD storage Server
PCI SSD vs SATA SSD


     PCI vs SATA
     •  SATA was designed for traditional disk drives with high latencies
     •  PCI is designed for high speed devices
     •  PCI SSD has latency ~ 1/3rd of SATA
Booth 1107   25
Flash SSD Technology

    Storage Hierarchy:

    •  Cell: One (SLC) or Two (MLC) bits
    •  Page: Typically 4K
    •  Block: Typically 128-512K

    Writes:

    •  Read and first write require single page IO
    •  Overwriting a page requires an erase & overwrite of the block

    Write endurance:

    •  100,000 erase cycles for SLC before failure
    •  5,000 – 10,000 erase cycles for MLC


                                                                                                         26

                                                      ©2011 Quest Software, Inc. All rights reserved..
Flash SSD performance

Update (256K block erase)                                                                                               2000




First insert (4k page write)                  250




     Read (4k page seek)           25




                               0        200     400   600   800       1000   1200   1400         1600             1800            2000
                                                                  Microseconds




                                                                                                                                         27

                                                                                     ©2011 Quest Software, Inc. All rights reserved..
Flash Disk write degradation

                                    Empty

                                    Partially Full




All Blocks empty:
Write time=250 us
25% part full:
•  Write time= ( ¾ * 250 us + 1/4 * 2000 us) = 687 us
75% part full
•  Write time = ( ¼ * 250 us + ¾ * 2000 us ) = 1562 us
Data Insert                    Free Block Pool




     Insert
              SSD Controller

                               Used Block Pool




                                        Empty Data Page

                                        Valid Data Page

                                        InValid Data Page
Free Block Pool

Data Update



    Update
              SSD Controller

                               Used Block Pool




                                       Empty Data Page

                                       Valid Data Page

                                        Invalid Data Page
Free Block Pool

Garbage Collection



             SSD Controller

                              Used Block Pool




                                      Empty Data Page

                                      Valid Data Page

                                       Invalid Data Page
32

©2011 Quest Software, Inc. All rights reserved..
11g DB flash Cache




                                                                        33

                     ©2011 Quest Software, Inc. All rights reserved..
Oracle DB flash cache


• Introduced in 11gR2 for
OEL and Solaris only
• Secondary cache
maintained by the DBWR,
but only when idle cycles
permit
• Architecture is tolerant of
poor flash write
performance

                                                                                   34

                                ©2011 Quest Software, Inc. All rights reserved..
Buffer cache and Free buffer waits

                                                                              Buffer
                           Read from buffer cache   Free                      cache
Oracle process                                      Buffer
                           Write to buffer cache
                                                    Waits




                       Free buffer waits often occur                          DBWR

                       when reads are much faster
                       than writes....



          Read from disk

                                      Database
                                        files        Write dirty blocks to disk
Flash Cache
                                                                                   Buffer
                             Read from buffer cache                                cache
Oracle process
                           Write to buffer cache


          Read from
          flash cache            Flash Cache
                                                                                   DBWR
                                                           Write clean
                                                           blocks (time
                                                           permitting)


                 DB Flash cache architecture is designed to
                 accelerate buffered reads


          Read from disk                              Write dirty blocks to disk
                                      Database
                                        files
Configuration
• Create filesystem from flash device
• Set DB_FLASH_CACHE_FILE and
  DB_FLASH_CACHE_SIZE.
• Consider Filesystemio_options=setall




                                                                                            37

                                         ©2011 Quest Software, Inc. All rights reserved..
Flash KEEP pool
• You can prioritise blocks for important objects using the
  FLASH_CACHE clause:




                                                                                               38

                                            ©2011 Quest Software, Inc. All rights reserved..
Oracle Db flash cache statistics




        http://guyharrison.squarespace.com/storage/flash_insert_stats.sql


                                                                                                                39

                                                             ©2011 Quest Software, Inc. All rights reserved..
Flash Cache Efficiency




        http://guyharrison.squarespace.com/storage/flash_time_savings.sql
Flash cache Contents




      http://guyharrison.squarespace.com/storage/flashContents.sql
Performance tests




                                                                       42

                    ©2011 Quest Software, Inc. All rights reserved..
Test systems
• Low end system:
 •  Dell Optiplex dual-core 4GB RAM
 •  2xSeagate 7500RPM Baracuda SATA HDD
 •  Intel X-25E SLC SATA SSD


• Higher end system:
 •  Dell R510 2xquad core, 32 GB RAM
 •  4x300GB 15K RPM,6Gbps Dell SAS HDD
 •  1xFusionIO ioDrive SLC PCI SSD




                                                                                             43

                                          ©2011 Quest Software, Inc. All rights reserved..
Performance: indexed reads(X-25)



Flash tablespace       48.17




                                                                                            CPU
    Flash cache                      143.27
                                                                                            db file IO
                                                                                            flash cache IO
                                                                                            Other




       No Flash                                                               529.7




                   0           100            200      300        400   500           600
                                                    Elapsed (s)
Performance: Read/Write (X-25)



Flash tablespace       200




                                                                                              CPU
                                                                                              db file IO
    Flash Cache                                   1,693
                                                                                              write complete
                                                                                              free buffer
                                                                                              flash cache IO
                                                                                              Other

                                                                               3,289



       No Flash




                   0         500   1000   1500            2000   2500   3000           3500
                                           Elapsed time (s)
Random reads – FusionIO


          Table on SSD         121




   SAS disk, flash cache                   583                                                  CPU
                                                                                                Other
                                                                                                DB File IO
                                                                                                Flash cache IO




SAS disk, no flash cache                                                         2,211




                           0         500         1000              1500   2000           2500
                                                    Elapsed time (s)
Updates – Fusion IO



          Table on SSD         529




                                                                                                  DB CPU
                                                                                                  db file IO
   SAS disk, flash cache                  1,934
                                                                                                  log file IO
                                                                                                  flash cache
                                                                                                  free buffer waits
                                                                                                  Other




SAS disk, no flash cache                                                           6,219




                           0     1000   2000      3000        4000   5000   6000           7000
                                                  Elapsed Time (s)
Full table scan – FusionIO



          Table on SSD              72




                                                                                                        CPU
   SAS disk, flash cache                                                              398
                                                                                                        Other
                                                                                                        DB File IO
                                                                                                        Flash Cache IO




SAS disk, no flash cache                                                                    418




                           0   50        100   150   200        250     300   350   400           450
                                                     Elasped time (s)
Sorting – what we expect




                                                                         Multi-pass
                                                                         Disk Sort
Time




                                                Single Pass
                                                Disk Sort

        Memory Sort


                                   PGA Memory available (MB)


                      Table/Index IO    CPU Time       Temp Segment IO                49
Disk Sorts – temporary tablespace


                                                                               4000



                                                                               3500



                                                                               3000
                                                              Multi-pass
                                                              Disk Sort        2500




                                                                                      Elapsed time (s)
                                                                               2000



                                                                               1500



                       Single Pass                                             1000
                       Disk Sort
                                                                               500



                                                                               0
300      250     200              150                 100         50       0
                             Sort Area Size

                         SAS based TTS        SSD based TTS                          50
Redo performance – Fusion IO




SAS based redo log                                               291.93




                                                                                CPU
                                                                                Log IO




Flash based redo log                                             292.39




                       0   50   100   150            200   250   300      350
                                       Elapsed time (s)
Concurrent redo workload (x10)




Flash based redo log             1,637               331                      1,681




                                                                                                                  CPU
                                                                                                                  Other
                                                                                                                  Log File IO




SAS based redo log               1,605               397                         1,944




                       0   500       1,000   1,500         2,000      2,500       3,000   3,500   4,000   4,500
                                                           Elapsed time (s)                                               52
Buffer Cache bottlenecks
•  Flash cache architecture
   avoids ‘free buffer waits’
   due to flash IO, but write
   complete waits can still
   occur on hot blocks.
•  Free buffer waits are still
   likely against the
   database files, due to
   high physical read rates
   created by the flash
   cache

                                                                                    53

                                 ©2011 Quest Software, Inc. All rights reserved..
Write degradation
• In theory, high sustained write IO can lead to SSD
  degradation when GC fails to cope with the block
  erase/update cycle
• In practice, this is rarely noticeable from Oracle:
 •  Oracle write IO is largely asynchronous (DBWR)
 •  Almost all write activity has at least an equal amount of read activity
 •  Garbage collection and wear levelling algorithms are sophisticated in
   decent SSD drives




                                                                                                              54

                                                           ©2011 Quest Software, Inc. All rights reserved..
55

©2011 Quest Software, Inc. All rights reserved..
56

©2011 Quest Software, Inc. All rights reserved..
Fusion IO direct cache




                     File System/ Raw       File System/ Raw
                       Devices/ ASM           Devices/ ASM
• Temp
Tablespace                                 Caching Block Device
•  Hot Segments     Regular Block Device                                                             Read-
•  Hot Partitions                              directCache                                         intensive,
•  DB Flash                                                                                       potentially
Cache                  ioMemory VSL           ioMemory VSL                                          massive
                                                                                                 tablespaces
(limited to the
size of the SSD)
                                                         LUN




                                                                                                            57

                                                         ©2011 Quest Software, Inc. All rights reserved..
                                                                                                                 57
Fusion IO direct cache – Table scans



direct cache on 2nd scan            36




direct cache on 1st scan                                                           147



                                                                                               CPU
                                                                                               IO
                                                                                               Other
    No cache 2nd scan                                                              147




     No cache 1st scan                                                             147




                           0   20   40   60         80           100   120   140         160
                                              Elapsed time (s)
Exadata




                                                             59

          ©2011 Quest Software, Inc. All rights reserved..        59
Exadata flash storage
• 4x96GB PCI Flash drives on each storage server
• Flash can be configured as:
 •  Exadata Smart Flash Cache (ESFC)
 •  Solid State Disk available to ASM disk groups
• ESFC is not the same as the DB flash cache:
 •  Maintained by cellsrv, not DBWR
 •  DOES supprort full table scans
 •  DOES NOT support smart scans
  •  Unless CELL_FLASH_CACHE= KEEP,

 •  Statistics accessed via the cellcli program

• Considerations for cache vs SSD may be similar

                                                                                                       61

                                                    ©2011 Quest Software, Inc. All rights reserved..
Summary




                                                             62

          ©2011 Quest Software, Inc. All rights reserved..
Recommendations
• Don’t wait for SSD to become as cheap as HDD
 •  Magnetic HDD will always be cheaper per GB, SSD cheaper per IO

• Consider a mixed or tiered storage strategy
 •  Using DB flash cache, selective SSD tablespaces or partitions
 •  Use SSD where your IO bottleneck is greatest and SSD advantage is
    significant


• DB flash cache offers an easy way to leverage SSD for
  OLTP workloads, but has few advantages for OLAP or
  Data Warehouse


                                                                                                           63

                                                        ©2011 Quest Software, Inc. All rights reserved..
How to use SSD
• Database flash cache
 •  If your bottleneck is single block (indexed reads) and you are on OEL or
    Solaris 11GR2

• Flash tablespace
 •  Optimize read/writes against “hot” segments or partitions

• Flash temp tablespace
 •  If multi-pass disk sorts or hash joins are your bottleneck

• FusionIO direct cache
 •  If you want to optimize both scans and index reads OR you are not on
    OEL/Solaris 11GR2



                                                                                                              64

                                                           ©2011 Quest Software, Inc. All rights reserved..        64
65

©2011 Quest Software, Inc. All rights reserved..
66

©2011 Quest Software, Inc. All rights reserved..
References
•  Latest version of this presentation:
 http://www.slideshare.net/gharriso/ssd-and-the-db-flash-cache

•  Guy Harrison blog (guyharrison.net) postings:
 •  All blog posts:
   •  http://guyharrison.squarespace.com/blog/tag/ssd

 •  SSD guiide (work in progress):
   •  http://guyharrison.squarespace.com/ssdguide/




•  Kevin Closson:
 •  http://kevinclosson.wordpress.com/2009/12/15/pardon-me-where-is-that-flash-cache-part-ii/
•  General articles on SSD:
 •  http://www.anandtech.com/storage/showdoc.aspx?i=3631
 •  http://en.wikipedia.org/wiki/Flash_memory
 •  http://www.virident.com/downloads/Virident_Sustained_Performance_Whitepaper.pdf




                                                                                                                         67

                                                                      ©2011 Quest Software, Inc. All rights reserved..

Más contenido relacionado

La actualidad más candente

Samsung Solid-State Drives Enable Ultra-Reliable Point of Sale Terminals for ...
Samsung Solid-State Drives Enable Ultra-Reliable Point of Sale Terminals for ...Samsung Solid-State Drives Enable Ultra-Reliable Point of Sale Terminals for ...
Samsung Solid-State Drives Enable Ultra-Reliable Point of Sale Terminals for ...Samsung Business USA
 
Vso tower 8.0.3x
Vso tower 8.0.3xVso tower 8.0.3x
Vso tower 8.0.3xHumaAR
 
Massively Scalable NoSQL with Apache Cassandra
Massively Scalable NoSQL with Apache CassandraMassively Scalable NoSQL with Apache Cassandra
Massively Scalable NoSQL with Apache Cassandrajbellis
 
Dell storage sc5020_Guia de Producto
Dell storage sc5020_Guia de ProductoDell storage sc5020_Guia de Producto
Dell storage sc5020_Guia de ProductoJulioMinFonseca
 
What's new in SQL Server 2012 for philly code camp 2012.1
What's new in SQL Server 2012 for philly code camp 2012.1What's new in SQL Server 2012 for philly code camp 2012.1
What's new in SQL Server 2012 for philly code camp 2012.1Mark Kromer
 
NetApp system installation workbook Spokane
NetApp system installation workbook SpokaneNetApp system installation workbook Spokane
NetApp system installation workbook SpokaneAccenture
 
2013 SSD Adoption Trends
2013 SSD Adoption Trends2013 SSD Adoption Trends
2013 SSD Adoption TrendsIT Brand Pulse
 
State of Cassandra 2012
State of Cassandra 2012State of Cassandra 2012
State of Cassandra 2012jbellis
 
Dell whitepaper busting solid state storage myths
Dell whitepaper busting solid state storage mythsDell whitepaper busting solid state storage myths
Dell whitepaper busting solid state storage mythsNatalie Cerullo
 
Data ontap 8.x 7 mode cook book v1 1
Data ontap 8.x 7 mode cook book v1 1Data ontap 8.x 7 mode cook book v1 1
Data ontap 8.x 7 mode cook book v1 1Accenture
 
Sql server 2012 roadshow masd overview 003
Sql server 2012 roadshow masd overview 003Sql server 2012 roadshow masd overview 003
Sql server 2012 roadshow masd overview 003Mark Kromer
 
AsawariKhedkar_SSD_HDD_Comparison
AsawariKhedkar_SSD_HDD_ComparisonAsawariKhedkar_SSD_HDD_Comparison
AsawariKhedkar_SSD_HDD_ComparisonAsawari Khedkar
 
MySQL Cluster 8.0 tutorial text
MySQL Cluster 8.0 tutorial textMySQL Cluster 8.0 tutorial text
MySQL Cluster 8.0 tutorial textFrazer Clement
 
【旧版】Oracle Autonomous Database:サービス概要のご紹介 [2020年8月版]
【旧版】Oracle Autonomous Database:サービス概要のご紹介 [2020年8月版]【旧版】Oracle Autonomous Database:サービス概要のご紹介 [2020年8月版]
【旧版】Oracle Autonomous Database:サービス概要のご紹介 [2020年8月版]オラクルエンジニア通信
 

La actualidad más candente (15)

Samsung Solid-State Drives Enable Ultra-Reliable Point of Sale Terminals for ...
Samsung Solid-State Drives Enable Ultra-Reliable Point of Sale Terminals for ...Samsung Solid-State Drives Enable Ultra-Reliable Point of Sale Terminals for ...
Samsung Solid-State Drives Enable Ultra-Reliable Point of Sale Terminals for ...
 
Vso tower 8.0.3x
Vso tower 8.0.3xVso tower 8.0.3x
Vso tower 8.0.3x
 
Massively Scalable NoSQL with Apache Cassandra
Massively Scalable NoSQL with Apache CassandraMassively Scalable NoSQL with Apache Cassandra
Massively Scalable NoSQL with Apache Cassandra
 
PLEXTOR SSD: The Value of PLEXTOR
PLEXTOR SSD:  The Value of PLEXTORPLEXTOR SSD:  The Value of PLEXTOR
PLEXTOR SSD: The Value of PLEXTOR
 
Dell storage sc5020_Guia de Producto
Dell storage sc5020_Guia de ProductoDell storage sc5020_Guia de Producto
Dell storage sc5020_Guia de Producto
 
What's new in SQL Server 2012 for philly code camp 2012.1
What's new in SQL Server 2012 for philly code camp 2012.1What's new in SQL Server 2012 for philly code camp 2012.1
What's new in SQL Server 2012 for philly code camp 2012.1
 
NetApp system installation workbook Spokane
NetApp system installation workbook SpokaneNetApp system installation workbook Spokane
NetApp system installation workbook Spokane
 
2013 SSD Adoption Trends
2013 SSD Adoption Trends2013 SSD Adoption Trends
2013 SSD Adoption Trends
 
State of Cassandra 2012
State of Cassandra 2012State of Cassandra 2012
State of Cassandra 2012
 
Dell whitepaper busting solid state storage myths
Dell whitepaper busting solid state storage mythsDell whitepaper busting solid state storage myths
Dell whitepaper busting solid state storage myths
 
Data ontap 8.x 7 mode cook book v1 1
Data ontap 8.x 7 mode cook book v1 1Data ontap 8.x 7 mode cook book v1 1
Data ontap 8.x 7 mode cook book v1 1
 
Sql server 2012 roadshow masd overview 003
Sql server 2012 roadshow masd overview 003Sql server 2012 roadshow masd overview 003
Sql server 2012 roadshow masd overview 003
 
AsawariKhedkar_SSD_HDD_Comparison
AsawariKhedkar_SSD_HDD_ComparisonAsawariKhedkar_SSD_HDD_Comparison
AsawariKhedkar_SSD_HDD_Comparison
 
MySQL Cluster 8.0 tutorial text
MySQL Cluster 8.0 tutorial textMySQL Cluster 8.0 tutorial text
MySQL Cluster 8.0 tutorial text
 
【旧版】Oracle Autonomous Database:サービス概要のご紹介 [2020年8月版]
【旧版】Oracle Autonomous Database:サービス概要のご紹介 [2020年8月版]【旧版】Oracle Autonomous Database:サービス概要のご紹介 [2020年8月版]
【旧版】Oracle Autonomous Database:サービス概要のご紹介 [2020年8月版]
 

Destacado

Your Disk Array Is Slower Than It Should Be
Your Disk Array Is Slower Than It Should BeYour Disk Array Is Slower Than It Should Be
Your Disk Array Is Slower Than It Should BePerconaPerformance
 
Linux Performance Analysis: New Tools and Old Secrets
Linux Performance Analysis: New Tools and Old SecretsLinux Performance Analysis: New Tools and Old Secrets
Linux Performance Analysis: New Tools and Old SecretsBrendan Gregg
 
Broken Linux Performance Tools 2016
Broken Linux Performance Tools 2016Broken Linux Performance Tools 2016
Broken Linux Performance Tools 2016Brendan Gregg
 
Linux Systems Performance 2016
Linux Systems Performance 2016Linux Systems Performance 2016
Linux Systems Performance 2016Brendan Gregg
 
Linux internals for Database administrators at Linux Piter 2016
Linux internals for Database administrators at Linux Piter 2016Linux internals for Database administrators at Linux Piter 2016
Linux internals for Database administrators at Linux Piter 2016PostgreSQL-Consulting
 
Velocity 2015 linux perf tools
Velocity 2015 linux perf toolsVelocity 2015 linux perf tools
Velocity 2015 linux perf toolsBrendan Gregg
 
Linux Profiling at Netflix
Linux Profiling at NetflixLinux Profiling at Netflix
Linux Profiling at NetflixBrendan Gregg
 
BPF: Tracing and more
BPF: Tracing and moreBPF: Tracing and more
BPF: Tracing and moreBrendan Gregg
 

Destacado (9)

Your Disk Array Is Slower Than It Should Be
Your Disk Array Is Slower Than It Should BeYour Disk Array Is Slower Than It Should Be
Your Disk Array Is Slower Than It Should Be
 
Raid_intro.ppt
Raid_intro.pptRaid_intro.ppt
Raid_intro.ppt
 
Linux Performance Analysis: New Tools and Old Secrets
Linux Performance Analysis: New Tools and Old SecretsLinux Performance Analysis: New Tools and Old Secrets
Linux Performance Analysis: New Tools and Old Secrets
 
Broken Linux Performance Tools 2016
Broken Linux Performance Tools 2016Broken Linux Performance Tools 2016
Broken Linux Performance Tools 2016
 
Linux Systems Performance 2016
Linux Systems Performance 2016Linux Systems Performance 2016
Linux Systems Performance 2016
 
Linux internals for Database administrators at Linux Piter 2016
Linux internals for Database administrators at Linux Piter 2016Linux internals for Database administrators at Linux Piter 2016
Linux internals for Database administrators at Linux Piter 2016
 
Velocity 2015 linux perf tools
Velocity 2015 linux perf toolsVelocity 2015 linux perf tools
Velocity 2015 linux perf tools
 
Linux Profiling at Netflix
Linux Profiling at NetflixLinux Profiling at Netflix
Linux Profiling at Netflix
 
BPF: Tracing and more
BPF: Tracing and moreBPF: Tracing and more
BPF: Tracing and more
 

Similar a Database & Technology 1 _ Guy Harrison _ Making the most of SSD in Oracle11g.pdf

Making the most of ssd in oracle11g
Making the most of ssd in oracle11gMaking the most of ssd in oracle11g
Making the most of ssd in oracle11gGuy Harrison
 
SSD Deployment Strategies for MySQL
SSD Deployment Strategies for MySQLSSD Deployment Strategies for MySQL
SSD Deployment Strategies for MySQLYoshinori Matsunobu
 
SUN+Oracle存储产品介绍
SUN+Oracle存储产品介绍SUN+Oracle存储产品介绍
SUN+Oracle存储产品介绍PencilData
 
Practical experiences and best practices for SSD and IBM i
Practical experiences and best practices for SSD and IBM iPractical experiences and best practices for SSD and IBM i
Practical experiences and best practices for SSD and IBM iCOMMON Europe
 
Top Technology Trends
Top Technology Trends Top Technology Trends
Top Technology Trends InnoTech
 
10 Things you need to know abut designing enterprise SSD
10 Things you need to know abut designing enterprise SSD10 Things you need to know abut designing enterprise SSD
10 Things you need to know abut designing enterprise SSDSimon Huang
 
Ssd(solid state drive )
Ssd(solid state drive )Ssd(solid state drive )
Ssd(solid state drive )Karthik m
 
San presentation nov 2012 central pa
San presentation nov 2012 central paSan presentation nov 2012 central pa
San presentation nov 2012 central paJoseph D'Antoni
 
Ssd And Enteprise Storage
Ssd And Enteprise StorageSsd And Enteprise Storage
Ssd And Enteprise StorageFrank Zhao
 
Presentation sun storage tek™ 2500 series arrays update
Presentation   sun storage tek™ 2500 series arrays updatePresentation   sun storage tek™ 2500 series arrays update
Presentation sun storage tek™ 2500 series arrays updatexKinAnx
 
SSD based storage tuning for databases
SSD based storage tuning for databasesSSD based storage tuning for databases
SSD based storage tuning for databasesAngelo Rajadurai
 
Database performance tuning for SSD based storage
Database  performance tuning for SSD based storageDatabase  performance tuning for SSD based storage
Database performance tuning for SSD based storageAngelo Rajadurai
 
Enterprise Flash Storage - Time Capsule Snapshot 2011
Enterprise Flash Storage - Time Capsule Snapshot 2011Enterprise Flash Storage - Time Capsule Snapshot 2011
Enterprise Flash Storage - Time Capsule Snapshot 2011Ryan Snell
 
Sigmod08ssd slides
Sigmod08ssd slidesSigmod08ssd slides
Sigmod08ssd slidesheybbs2009
 
Challenges and Trends of SSD Design
Challenges and Trends of SSD DesignChallenges and Trends of SSD Design
Challenges and Trends of SSD DesignHenry Chao
 
了解IO设备
了解IO设备了解IO设备
了解IO设备Feng Yu
 

Similar a Database & Technology 1 _ Guy Harrison _ Making the most of SSD in Oracle11g.pdf (20)

Making the most of ssd in oracle11g
Making the most of ssd in oracle11gMaking the most of ssd in oracle11g
Making the most of ssd in oracle11g
 
Momentus xt PP Briefing
Momentus xt PP BriefingMomentus xt PP Briefing
Momentus xt PP Briefing
 
SSD Deployment Strategies for MySQL
SSD Deployment Strategies for MySQLSSD Deployment Strategies for MySQL
SSD Deployment Strategies for MySQL
 
SUN+Oracle存储产品介绍
SUN+Oracle存储产品介绍SUN+Oracle存储产品介绍
SUN+Oracle存储产品介绍
 
SSD-Bondi.pptx
SSD-Bondi.pptxSSD-Bondi.pptx
SSD-Bondi.pptx
 
Practical experiences and best practices for SSD and IBM i
Practical experiences and best practices for SSD and IBM iPractical experiences and best practices for SSD and IBM i
Practical experiences and best practices for SSD and IBM i
 
Top Technology Trends
Top Technology Trends Top Technology Trends
Top Technology Trends
 
10 Things you need to know abut designing enterprise SSD
10 Things you need to know abut designing enterprise SSD10 Things you need to know abut designing enterprise SSD
10 Things you need to know abut designing enterprise SSD
 
Ssd(solid state drive )
Ssd(solid state drive )Ssd(solid state drive )
Ssd(solid state drive )
 
IO Dubi Lebel
IO Dubi LebelIO Dubi Lebel
IO Dubi Lebel
 
San presentation nov 2012 central pa
San presentation nov 2012 central paSan presentation nov 2012 central pa
San presentation nov 2012 central pa
 
Ssd And Enteprise Storage
Ssd And Enteprise StorageSsd And Enteprise Storage
Ssd And Enteprise Storage
 
Presentation sun storage tek™ 2500 series arrays update
Presentation   sun storage tek™ 2500 series arrays updatePresentation   sun storage tek™ 2500 series arrays update
Presentation sun storage tek™ 2500 series arrays update
 
SSD based storage tuning for databases
SSD based storage tuning for databasesSSD based storage tuning for databases
SSD based storage tuning for databases
 
Database performance tuning for SSD based storage
Database  performance tuning for SSD based storageDatabase  performance tuning for SSD based storage
Database performance tuning for SSD based storage
 
Enterprise Flash Storage - Time Capsule Snapshot 2011
Enterprise Flash Storage - Time Capsule Snapshot 2011Enterprise Flash Storage - Time Capsule Snapshot 2011
Enterprise Flash Storage - Time Capsule Snapshot 2011
 
Sigmod08ssd slides
Sigmod08ssd slidesSigmod08ssd slides
Sigmod08ssd slides
 
SSD PPT BY SAURABH
SSD PPT BY SAURABHSSD PPT BY SAURABH
SSD PPT BY SAURABH
 
Challenges and Trends of SSD Design
Challenges and Trends of SSD DesignChallenges and Trends of SSD Design
Challenges and Trends of SSD Design
 
了解IO设备
了解IO设备了解IO设备
了解IO设备
 

Más de InSync2011

Developer & Fusion Middleware 2 _ Scott Robertson _ SOA, Portals and Enterpri...
Developer & Fusion Middleware 2 _ Scott Robertson _ SOA, Portals and Enterpri...Developer & Fusion Middleware 2 _ Scott Robertson _ SOA, Portals and Enterpri...
Developer & Fusion Middleware 2 _ Scott Robertson _ SOA, Portals and Enterpri...InSync2011
 
New & Emerging _ KrisDowney _ Simplifying the Change Process.pdf
New & Emerging _ KrisDowney _ Simplifying the Change Process.pdfNew & Emerging _ KrisDowney _ Simplifying the Change Process.pdf
New & Emerging _ KrisDowney _ Simplifying the Change Process.pdfInSync2011
 
Oracle Systems _ Kevin McIsaac _The IT landscape has changed.pdf
Oracle Systems _ Kevin McIsaac _The IT landscape has changed.pdfOracle Systems _ Kevin McIsaac _The IT landscape has changed.pdf
Oracle Systems _ Kevin McIsaac _The IT landscape has changed.pdfInSync2011
 
Reporting _ Scott Tunbridge _ Op Mgmt to Perf Excel.pdf
Reporting _ Scott Tunbridge _ Op Mgmt to Perf Excel.pdfReporting _ Scott Tunbridge _ Op Mgmt to Perf Excel.pdf
Reporting _ Scott Tunbridge _ Op Mgmt to Perf Excel.pdfInSync2011
 
Developer and Fusion Middleware 2 _ Scott Robertson _ SOA, portals and entepr...
Developer and Fusion Middleware 2 _ Scott Robertson _ SOA, portals and entepr...Developer and Fusion Middleware 2 _ Scott Robertson _ SOA, portals and entepr...
Developer and Fusion Middleware 2 _ Scott Robertson _ SOA, portals and entepr...InSync2011
 
Primavera _ Loretta Bayliss _ Implementing EPPM in rapidly changing and compe...
Primavera _ Loretta Bayliss _ Implementing EPPM in rapidly changing and compe...Primavera _ Loretta Bayliss _ Implementing EPPM in rapidly changing and compe...
Primavera _ Loretta Bayliss _ Implementing EPPM in rapidly changing and compe...InSync2011
 
Database & Technology 1 _ Martin Power _ Delivering Oracles hight availabilit...
Database & Technology 1 _ Martin Power _ Delivering Oracles hight availabilit...Database & Technology 1 _ Martin Power _ Delivering Oracles hight availabilit...
Database & Technology 1 _ Martin Power _ Delivering Oracles hight availabilit...InSync2011
 
Database & Technology 1 _ Craig Shallahamer _ Unit of work time based perform...
Database & Technology 1 _ Craig Shallahamer _ Unit of work time based perform...Database & Technology 1 _ Craig Shallahamer _ Unit of work time based perform...
Database & Technology 1 _ Craig Shallahamer _ Unit of work time based perform...InSync2011
 
Database & Technology 1 _ Marcelle Kratchvil _ Why you should be storing unst...
Database & Technology 1 _ Marcelle Kratchvil _ Why you should be storing unst...Database & Technology 1 _ Marcelle Kratchvil _ Why you should be storing unst...
Database & Technology 1 _ Marcelle Kratchvil _ Why you should be storing unst...InSync2011
 
Database & Technology 1 _ Milina Ristic _ Why use oracle data guard.pdf
Database & Technology 1 _ Milina Ristic _ Why use oracle data guard.pdfDatabase & Technology 1 _ Milina Ristic _ Why use oracle data guard.pdf
Database & Technology 1 _ Milina Ristic _ Why use oracle data guard.pdfInSync2011
 
Database & Technology 1 _ Tom Kyte _ SQL Techniques.pdf
Database & Technology 1 _ Tom Kyte _ SQL Techniques.pdfDatabase & Technology 1 _ Tom Kyte _ SQL Techniques.pdf
Database & Technology 1 _ Tom Kyte _ SQL Techniques.pdfInSync2011
 
Database & Technology 1 _ Clancy Bufton _ Flashback Query - oracle total reca...
Database & Technology 1 _ Clancy Bufton _ Flashback Query - oracle total reca...Database & Technology 1 _ Clancy Bufton _ Flashback Query - oracle total reca...
Database & Technology 1 _ Clancy Bufton _ Flashback Query - oracle total reca...InSync2011
 
Databse & Technology 2 _ Francisco Munoz Alvarez _ Oracle Security Tips - Som...
Databse & Technology 2 _ Francisco Munoz Alvarez _ Oracle Security Tips - Som...Databse & Technology 2 _ Francisco Munoz Alvarez _ Oracle Security Tips - Som...
Databse & Technology 2 _ Francisco Munoz Alvarez _ Oracle Security Tips - Som...InSync2011
 
Databse & Technology 2 _ Francisco Munoz alvarez _ 11g new functionalities fo...
Databse & Technology 2 _ Francisco Munoz alvarez _ 11g new functionalities fo...Databse & Technology 2 _ Francisco Munoz alvarez _ 11g new functionalities fo...
Databse & Technology 2 _ Francisco Munoz alvarez _ 11g new functionalities fo...InSync2011
 
Databse & Technology 2 | Connor McDonald | Managing Optimiser Statistics - A ...
Databse & Technology 2 | Connor McDonald | Managing Optimiser Statistics - A ...Databse & Technology 2 | Connor McDonald | Managing Optimiser Statistics - A ...
Databse & Technology 2 | Connor McDonald | Managing Optimiser Statistics - A ...InSync2011
 
Databse & Technology 2 _ Shan Nawaz _ Oracle 11g Top 10 features - not your u...
Databse & Technology 2 _ Shan Nawaz _ Oracle 11g Top 10 features - not your u...Databse & Technology 2 _ Shan Nawaz _ Oracle 11g Top 10 features - not your u...
Databse & Technology 2 _ Shan Nawaz _ Oracle 11g Top 10 features - not your u...InSync2011
 
Databse & Technology 2 _ Paul Guerin _ The biggest looser database - a boot c...
Databse & Technology 2 _ Paul Guerin _ The biggest looser database - a boot c...Databse & Technology 2 _ Paul Guerin _ The biggest looser database - a boot c...
Databse & Technology 2 _ Paul Guerin _ The biggest looser database - a boot c...InSync2011
 
Developer and Fusion Middleware 1 _ Kevin Powe _ Log files - a wealth of fore...
Developer and Fusion Middleware 1 _ Kevin Powe _ Log files - a wealth of fore...Developer and Fusion Middleware 1 _ Kevin Powe _ Log files - a wealth of fore...
Developer and Fusion Middleware 1 _ Kevin Powe _ Log files - a wealth of fore...InSync2011
 
Developer and Fusion Middleware 2 _ Aaron Blishen _ Event driven SOA Integrat...
Developer and Fusion Middleware 2 _ Aaron Blishen _ Event driven SOA Integrat...Developer and Fusion Middleware 2 _ Aaron Blishen _ Event driven SOA Integrat...
Developer and Fusion Middleware 2 _ Aaron Blishen _ Event driven SOA Integrat...InSync2011
 
Developer and Fusion Middleware 2 _Greg Kirkendall _ How Australia Post teach...
Developer and Fusion Middleware 2 _Greg Kirkendall _ How Australia Post teach...Developer and Fusion Middleware 2 _Greg Kirkendall _ How Australia Post teach...
Developer and Fusion Middleware 2 _Greg Kirkendall _ How Australia Post teach...InSync2011
 

Más de InSync2011 (20)

Developer & Fusion Middleware 2 _ Scott Robertson _ SOA, Portals and Enterpri...
Developer & Fusion Middleware 2 _ Scott Robertson _ SOA, Portals and Enterpri...Developer & Fusion Middleware 2 _ Scott Robertson _ SOA, Portals and Enterpri...
Developer & Fusion Middleware 2 _ Scott Robertson _ SOA, Portals and Enterpri...
 
New & Emerging _ KrisDowney _ Simplifying the Change Process.pdf
New & Emerging _ KrisDowney _ Simplifying the Change Process.pdfNew & Emerging _ KrisDowney _ Simplifying the Change Process.pdf
New & Emerging _ KrisDowney _ Simplifying the Change Process.pdf
 
Oracle Systems _ Kevin McIsaac _The IT landscape has changed.pdf
Oracle Systems _ Kevin McIsaac _The IT landscape has changed.pdfOracle Systems _ Kevin McIsaac _The IT landscape has changed.pdf
Oracle Systems _ Kevin McIsaac _The IT landscape has changed.pdf
 
Reporting _ Scott Tunbridge _ Op Mgmt to Perf Excel.pdf
Reporting _ Scott Tunbridge _ Op Mgmt to Perf Excel.pdfReporting _ Scott Tunbridge _ Op Mgmt to Perf Excel.pdf
Reporting _ Scott Tunbridge _ Op Mgmt to Perf Excel.pdf
 
Developer and Fusion Middleware 2 _ Scott Robertson _ SOA, portals and entepr...
Developer and Fusion Middleware 2 _ Scott Robertson _ SOA, portals and entepr...Developer and Fusion Middleware 2 _ Scott Robertson _ SOA, portals and entepr...
Developer and Fusion Middleware 2 _ Scott Robertson _ SOA, portals and entepr...
 
Primavera _ Loretta Bayliss _ Implementing EPPM in rapidly changing and compe...
Primavera _ Loretta Bayliss _ Implementing EPPM in rapidly changing and compe...Primavera _ Loretta Bayliss _ Implementing EPPM in rapidly changing and compe...
Primavera _ Loretta Bayliss _ Implementing EPPM in rapidly changing and compe...
 
Database & Technology 1 _ Martin Power _ Delivering Oracles hight availabilit...
Database & Technology 1 _ Martin Power _ Delivering Oracles hight availabilit...Database & Technology 1 _ Martin Power _ Delivering Oracles hight availabilit...
Database & Technology 1 _ Martin Power _ Delivering Oracles hight availabilit...
 
Database & Technology 1 _ Craig Shallahamer _ Unit of work time based perform...
Database & Technology 1 _ Craig Shallahamer _ Unit of work time based perform...Database & Technology 1 _ Craig Shallahamer _ Unit of work time based perform...
Database & Technology 1 _ Craig Shallahamer _ Unit of work time based perform...
 
Database & Technology 1 _ Marcelle Kratchvil _ Why you should be storing unst...
Database & Technology 1 _ Marcelle Kratchvil _ Why you should be storing unst...Database & Technology 1 _ Marcelle Kratchvil _ Why you should be storing unst...
Database & Technology 1 _ Marcelle Kratchvil _ Why you should be storing unst...
 
Database & Technology 1 _ Milina Ristic _ Why use oracle data guard.pdf
Database & Technology 1 _ Milina Ristic _ Why use oracle data guard.pdfDatabase & Technology 1 _ Milina Ristic _ Why use oracle data guard.pdf
Database & Technology 1 _ Milina Ristic _ Why use oracle data guard.pdf
 
Database & Technology 1 _ Tom Kyte _ SQL Techniques.pdf
Database & Technology 1 _ Tom Kyte _ SQL Techniques.pdfDatabase & Technology 1 _ Tom Kyte _ SQL Techniques.pdf
Database & Technology 1 _ Tom Kyte _ SQL Techniques.pdf
 
Database & Technology 1 _ Clancy Bufton _ Flashback Query - oracle total reca...
Database & Technology 1 _ Clancy Bufton _ Flashback Query - oracle total reca...Database & Technology 1 _ Clancy Bufton _ Flashback Query - oracle total reca...
Database & Technology 1 _ Clancy Bufton _ Flashback Query - oracle total reca...
 
Databse & Technology 2 _ Francisco Munoz Alvarez _ Oracle Security Tips - Som...
Databse & Technology 2 _ Francisco Munoz Alvarez _ Oracle Security Tips - Som...Databse & Technology 2 _ Francisco Munoz Alvarez _ Oracle Security Tips - Som...
Databse & Technology 2 _ Francisco Munoz Alvarez _ Oracle Security Tips - Som...
 
Databse & Technology 2 _ Francisco Munoz alvarez _ 11g new functionalities fo...
Databse & Technology 2 _ Francisco Munoz alvarez _ 11g new functionalities fo...Databse & Technology 2 _ Francisco Munoz alvarez _ 11g new functionalities fo...
Databse & Technology 2 _ Francisco Munoz alvarez _ 11g new functionalities fo...
 
Databse & Technology 2 | Connor McDonald | Managing Optimiser Statistics - A ...
Databse & Technology 2 | Connor McDonald | Managing Optimiser Statistics - A ...Databse & Technology 2 | Connor McDonald | Managing Optimiser Statistics - A ...
Databse & Technology 2 | Connor McDonald | Managing Optimiser Statistics - A ...
 
Databse & Technology 2 _ Shan Nawaz _ Oracle 11g Top 10 features - not your u...
Databse & Technology 2 _ Shan Nawaz _ Oracle 11g Top 10 features - not your u...Databse & Technology 2 _ Shan Nawaz _ Oracle 11g Top 10 features - not your u...
Databse & Technology 2 _ Shan Nawaz _ Oracle 11g Top 10 features - not your u...
 
Databse & Technology 2 _ Paul Guerin _ The biggest looser database - a boot c...
Databse & Technology 2 _ Paul Guerin _ The biggest looser database - a boot c...Databse & Technology 2 _ Paul Guerin _ The biggest looser database - a boot c...
Databse & Technology 2 _ Paul Guerin _ The biggest looser database - a boot c...
 
Developer and Fusion Middleware 1 _ Kevin Powe _ Log files - a wealth of fore...
Developer and Fusion Middleware 1 _ Kevin Powe _ Log files - a wealth of fore...Developer and Fusion Middleware 1 _ Kevin Powe _ Log files - a wealth of fore...
Developer and Fusion Middleware 1 _ Kevin Powe _ Log files - a wealth of fore...
 
Developer and Fusion Middleware 2 _ Aaron Blishen _ Event driven SOA Integrat...
Developer and Fusion Middleware 2 _ Aaron Blishen _ Event driven SOA Integrat...Developer and Fusion Middleware 2 _ Aaron Blishen _ Event driven SOA Integrat...
Developer and Fusion Middleware 2 _ Aaron Blishen _ Event driven SOA Integrat...
 
Developer and Fusion Middleware 2 _Greg Kirkendall _ How Australia Post teach...
Developer and Fusion Middleware 2 _Greg Kirkendall _ How Australia Post teach...Developer and Fusion Middleware 2 _Greg Kirkendall _ How Australia Post teach...
Developer and Fusion Middleware 2 _Greg Kirkendall _ How Australia Post teach...
 

Database & Technology 1 _ Guy Harrison _ Making the most of SSD in Oracle11g.pdf

  • 1. Making the most of Solid State Disk in Oracle 11g Guy Harrison Director, R&D Melbourne Email: guy.harrison@quest.com Twitter: @guyharrison Web: http://www.guyharrison.net ©2011 Quest Software, Inc. All rights reserved..
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9. Star trek shirt fatality analysis Red Yellow Blue 0 10 20 30 40 50 60 70 80 Pct
  • 10. Agenda • Brief History of Magnetic Disk • Solid State Disk (SSD) technologies • SSD internals • Oracle DB flash cache architecture • Performance comparisons • Recommendations and Suggestions 10 ©2011 Quest Software, Inc. All rights reserved..
  • 11. A brief history of disk 11 ©2011 Quest Software, Inc. All rights reserved..
  • 13. 28MB HDD - 1961 1800 RPM
  • 14. The more that things change....
  • 15. Moore’s law • Transistor density doubles every 18 months • Exponential growth is observed in most electronic components: •  CPU clock speeds •  RAM •  Hard Disk Drive storage density • But not in mechanical components •  Service time (Seek latency) – limited by actuator arm speed and disk circumference •  Throughput (rotational latency) – limited by speed of rotation, circumference and data density 15 ©2011 Quest Software, Inc. All rights reserved..
  • 16. Disk trends 2001-2009 2,000 1,500 1,000 %age change 500 260 1,635 1,013 0 -630 -390 -500 -1,000 IO Rate Disk Capacity IO/Capacity CPU IO/CPU
  • 17. Solid State Disk 17 ©2011 Quest Software, Inc. All rights reserved..
  • 18. SSD to the rescue? SSD DDR-RAM 15 SSD PCI flash 25 SSD SATA Flash 80 Magnetic Disk 4,000 0 500 1,000 1,500 2,000 2,500 3,000 3,500 4,000 4,500 Seek time (us)
  • 19. Power consumption Start up 20 0.15 Seek Flash SSD 10 SATA HDD 0.08 Idle 8 0.01 0.1 1 10 100 Watts (logarithmic scale)
  • 20. Economics of SSD $/GB 0.00 10.00 20.00 30.00 40.00 50.00 60.00 0.06 FusionIO PCI SLC SSD 53.44 0.06 FusionIO PCI MLC Duo SSD 24.92 0.05 Intel SLC SATA SSD 21.88 $/IOP 0.05 $/GB Intel MLC SATA SSD 6.88 Seagate SAS HDD 1.00 1.53 Seagate SATA HDD 0.09 2.38 0.00 0.50 1.00 1.50 2.00 2.50 $/IOP
  • 21. Tiered storage management Main Memory DDR SSD Flash SSD $/GB $/IOP Fast Disk (SAS, RAID 0+1) Slow Disk (SATA, RAID 5) Tape, Flat Files, Hadoop
  • 22. SSD technology and internals 22 ©2011 Quest Software, Inc. All rights reserved..
  • 23. Flavours of Flash SSD DDR RAM Drive SATA flash drive PCI flash drive SSD storage Server
  • 24. PCI SSD vs SATA SSD PCI vs SATA •  SATA was designed for traditional disk drives with high latencies •  PCI is designed for high speed devices •  PCI SSD has latency ~ 1/3rd of SATA
  • 26. Flash SSD Technology Storage Hierarchy: •  Cell: One (SLC) or Two (MLC) bits •  Page: Typically 4K •  Block: Typically 128-512K Writes: •  Read and first write require single page IO •  Overwriting a page requires an erase & overwrite of the block Write endurance: •  100,000 erase cycles for SLC before failure •  5,000 – 10,000 erase cycles for MLC 26 ©2011 Quest Software, Inc. All rights reserved..
  • 27. Flash SSD performance Update (256K block erase) 2000 First insert (4k page write) 250 Read (4k page seek) 25 0 200 400 600 800 1000 1200 1400 1600 1800 2000 Microseconds 27 ©2011 Quest Software, Inc. All rights reserved..
  • 28. Flash Disk write degradation Empty Partially Full All Blocks empty: Write time=250 us 25% part full: •  Write time= ( ¾ * 250 us + 1/4 * 2000 us) = 687 us 75% part full •  Write time = ( ¼ * 250 us + ¾ * 2000 us ) = 1562 us
  • 29. Data Insert Free Block Pool Insert SSD Controller Used Block Pool Empty Data Page Valid Data Page InValid Data Page
  • 30. Free Block Pool Data Update Update SSD Controller Used Block Pool Empty Data Page Valid Data Page Invalid Data Page
  • 31. Free Block Pool Garbage Collection SSD Controller Used Block Pool Empty Data Page Valid Data Page Invalid Data Page
  • 32. 32 ©2011 Quest Software, Inc. All rights reserved..
  • 33. 11g DB flash Cache 33 ©2011 Quest Software, Inc. All rights reserved..
  • 34. Oracle DB flash cache • Introduced in 11gR2 for OEL and Solaris only • Secondary cache maintained by the DBWR, but only when idle cycles permit • Architecture is tolerant of poor flash write performance 34 ©2011 Quest Software, Inc. All rights reserved..
  • 35. Buffer cache and Free buffer waits Buffer Read from buffer cache Free cache Oracle process Buffer Write to buffer cache Waits Free buffer waits often occur DBWR when reads are much faster than writes.... Read from disk Database files Write dirty blocks to disk
  • 36. Flash Cache Buffer Read from buffer cache cache Oracle process Write to buffer cache Read from flash cache Flash Cache DBWR Write clean blocks (time permitting) DB Flash cache architecture is designed to accelerate buffered reads Read from disk Write dirty blocks to disk Database files
  • 37. Configuration • Create filesystem from flash device • Set DB_FLASH_CACHE_FILE and DB_FLASH_CACHE_SIZE. • Consider Filesystemio_options=setall 37 ©2011 Quest Software, Inc. All rights reserved..
  • 38. Flash KEEP pool • You can prioritise blocks for important objects using the FLASH_CACHE clause: 38 ©2011 Quest Software, Inc. All rights reserved..
  • 39. Oracle Db flash cache statistics http://guyharrison.squarespace.com/storage/flash_insert_stats.sql 39 ©2011 Quest Software, Inc. All rights reserved..
  • 40. Flash Cache Efficiency http://guyharrison.squarespace.com/storage/flash_time_savings.sql
  • 41. Flash cache Contents http://guyharrison.squarespace.com/storage/flashContents.sql
  • 42. Performance tests 42 ©2011 Quest Software, Inc. All rights reserved..
  • 43. Test systems • Low end system: •  Dell Optiplex dual-core 4GB RAM •  2xSeagate 7500RPM Baracuda SATA HDD •  Intel X-25E SLC SATA SSD • Higher end system: •  Dell R510 2xquad core, 32 GB RAM •  4x300GB 15K RPM,6Gbps Dell SAS HDD •  1xFusionIO ioDrive SLC PCI SSD 43 ©2011 Quest Software, Inc. All rights reserved..
  • 44. Performance: indexed reads(X-25) Flash tablespace 48.17 CPU Flash cache 143.27 db file IO flash cache IO Other No Flash 529.7 0 100 200 300 400 500 600 Elapsed (s)
  • 45. Performance: Read/Write (X-25) Flash tablespace 200 CPU db file IO Flash Cache 1,693 write complete free buffer flash cache IO Other 3,289 No Flash 0 500 1000 1500 2000 2500 3000 3500 Elapsed time (s)
  • 46. Random reads – FusionIO Table on SSD 121 SAS disk, flash cache 583 CPU Other DB File IO Flash cache IO SAS disk, no flash cache 2,211 0 500 1000 1500 2000 2500 Elapsed time (s)
  • 47. Updates – Fusion IO Table on SSD 529 DB CPU db file IO SAS disk, flash cache 1,934 log file IO flash cache free buffer waits Other SAS disk, no flash cache 6,219 0 1000 2000 3000 4000 5000 6000 7000 Elapsed Time (s)
  • 48. Full table scan – FusionIO Table on SSD 72 CPU SAS disk, flash cache 398 Other DB File IO Flash Cache IO SAS disk, no flash cache 418 0 50 100 150 200 250 300 350 400 450 Elasped time (s)
  • 49. Sorting – what we expect Multi-pass Disk Sort Time Single Pass Disk Sort Memory Sort PGA Memory available (MB) Table/Index IO CPU Time Temp Segment IO 49
  • 50. Disk Sorts – temporary tablespace 4000 3500 3000 Multi-pass Disk Sort 2500 Elapsed time (s) 2000 1500 Single Pass 1000 Disk Sort 500 0 300 250 200 150 100 50 0 Sort Area Size SAS based TTS SSD based TTS 50
  • 51. Redo performance – Fusion IO SAS based redo log 291.93 CPU Log IO Flash based redo log 292.39 0 50 100 150 200 250 300 350 Elapsed time (s)
  • 52. Concurrent redo workload (x10) Flash based redo log 1,637 331 1,681 CPU Other Log File IO SAS based redo log 1,605 397 1,944 0 500 1,000 1,500 2,000 2,500 3,000 3,500 4,000 4,500 Elapsed time (s) 52
  • 53. Buffer Cache bottlenecks •  Flash cache architecture avoids ‘free buffer waits’ due to flash IO, but write complete waits can still occur on hot blocks. •  Free buffer waits are still likely against the database files, due to high physical read rates created by the flash cache 53 ©2011 Quest Software, Inc. All rights reserved..
  • 54. Write degradation • In theory, high sustained write IO can lead to SSD degradation when GC fails to cope with the block erase/update cycle • In practice, this is rarely noticeable from Oracle: •  Oracle write IO is largely asynchronous (DBWR) •  Almost all write activity has at least an equal amount of read activity •  Garbage collection and wear levelling algorithms are sophisticated in decent SSD drives 54 ©2011 Quest Software, Inc. All rights reserved..
  • 55. 55 ©2011 Quest Software, Inc. All rights reserved..
  • 56. 56 ©2011 Quest Software, Inc. All rights reserved..
  • 57. Fusion IO direct cache File System/ Raw File System/ Raw Devices/ ASM Devices/ ASM • Temp Tablespace Caching Block Device •  Hot Segments Regular Block Device Read- •  Hot Partitions directCache intensive, •  DB Flash potentially Cache ioMemory VSL ioMemory VSL massive tablespaces (limited to the size of the SSD) LUN 57 ©2011 Quest Software, Inc. All rights reserved.. 57
  • 58. Fusion IO direct cache – Table scans direct cache on 2nd scan 36 direct cache on 1st scan 147 CPU IO Other No cache 2nd scan 147 No cache 1st scan 147 0 20 40 60 80 100 120 140 160 Elapsed time (s)
  • 59. Exadata 59 ©2011 Quest Software, Inc. All rights reserved.. 59
  • 60.
  • 61. Exadata flash storage • 4x96GB PCI Flash drives on each storage server • Flash can be configured as: •  Exadata Smart Flash Cache (ESFC) •  Solid State Disk available to ASM disk groups • ESFC is not the same as the DB flash cache: •  Maintained by cellsrv, not DBWR •  DOES supprort full table scans •  DOES NOT support smart scans •  Unless CELL_FLASH_CACHE= KEEP, •  Statistics accessed via the cellcli program • Considerations for cache vs SSD may be similar 61 ©2011 Quest Software, Inc. All rights reserved..
  • 62. Summary 62 ©2011 Quest Software, Inc. All rights reserved..
  • 63. Recommendations • Don’t wait for SSD to become as cheap as HDD •  Magnetic HDD will always be cheaper per GB, SSD cheaper per IO • Consider a mixed or tiered storage strategy •  Using DB flash cache, selective SSD tablespaces or partitions •  Use SSD where your IO bottleneck is greatest and SSD advantage is significant • DB flash cache offers an easy way to leverage SSD for OLTP workloads, but has few advantages for OLAP or Data Warehouse 63 ©2011 Quest Software, Inc. All rights reserved..
  • 64. How to use SSD • Database flash cache •  If your bottleneck is single block (indexed reads) and you are on OEL or Solaris 11GR2 • Flash tablespace •  Optimize read/writes against “hot” segments or partitions • Flash temp tablespace •  If multi-pass disk sorts or hash joins are your bottleneck • FusionIO direct cache •  If you want to optimize both scans and index reads OR you are not on OEL/Solaris 11GR2 64 ©2011 Quest Software, Inc. All rights reserved.. 64
  • 65. 65 ©2011 Quest Software, Inc. All rights reserved..
  • 66. 66 ©2011 Quest Software, Inc. All rights reserved..
  • 67. References •  Latest version of this presentation: http://www.slideshare.net/gharriso/ssd-and-the-db-flash-cache •  Guy Harrison blog (guyharrison.net) postings: •  All blog posts: •  http://guyharrison.squarespace.com/blog/tag/ssd •  SSD guiide (work in progress): •  http://guyharrison.squarespace.com/ssdguide/ •  Kevin Closson: •  http://kevinclosson.wordpress.com/2009/12/15/pardon-me-where-is-that-flash-cache-part-ii/ •  General articles on SSD: •  http://www.anandtech.com/storage/showdoc.aspx?i=3631 •  http://en.wikipedia.org/wiki/Flash_memory •  http://www.virident.com/downloads/Virident_Sustained_Performance_Whitepaper.pdf 67 ©2011 Quest Software, Inc. All rights reserved..