SlideShare una empresa de Scribd logo
1 de 55
Descargar para leer sin conexión
Phua Chiu Kiang
Microsoft MVP (SQL Server)
•
•
•
•
•
•
Microsoft Data Warehousing Vision
Make SQL Server the gold standard for data warehousing offering customers




Massive Scalability at Low       Hardware Choice         Improved Business Agility
          Cost                                                and Alignment
Approximate data volume
    •                                                              managed by data warehouse
    •
                                                                           Today       In 3 Years
    •
                                                                                           21%
                                                                   Less than 500 GB
                                                                                      5%
    •
    •                                                                 500 GB – 1 TB
                                                                                       12%
                                                                                           20%



                                                                                           21%
                                                                           1 – 3 TB
                                                                                           18%
    •
                                                                                           19%
    •                                                                     3 – 10 TB
                                                                                            25%


                                                                                           17%
                                                                   More than 10 TB
                                                                                                 34%


                                                                                      2%
                                                                        Don’t Know
                                                                                      6%

                                                                    Source: TDWI Report – Next Generation DW


Microsoft Confidential—Preliminary Information Subject to Change                                               4
Data Warehouse Industry Trends
                           100%
       Broad Commitment




                                                                                                          Advanced
                                                                 Centralized                              Analytics
                                                                   EDW
                                                                                                 Data
                                                                                                Quality 
                           75%




                                                                  Analytics
                                                                 within EDW             HA for DW           64-bit      MDM   
                                                         Analytics
                                                                                           Web Services
                                                        Outside EDW DBMS Built
                                                                     for DW                                                              
Plan to Use




                                                                                                                          Real-time DW
                                                            Blades in
                                                                               Security
                                                                                                          MPP
                                                             Racks
                           50%




                                     DBMS Built
                                         for
                                                                                   DW Appliance  Streaming
                                                                                                     Data 
                                    Transactions                                   Mixed Workloads  SOA
                                                                      Server
                                                                  Virtualization    Data Federation Low-Power
                                         SMP
                                                                                                     Hardware
                                                                                   Columnar DBMS
                                                                    DW                                   In-Memory DBMS
                                                                  Bundles
                           25%




                                                                                             SaaS
       Narrow Commitment




                                                                             Open Source       Open Source
                                                                                 OS             Reporting
                                                                                   Software    Open Source
                                                                                  Appliance Data Integration
                                                                                                     Open Source DBMS
                                                                                     Public Cloud
                           0%




                                  -50%               -25%               0%                25%                   50%         75%          100%
                                  Decreasing Usage
                                                            Anticipated Growth in the next 3 Years                            Increasing Usage

      Areas of strategic investment for Microsoft                                                                                               Source: TDWI
6
•   Building a traditional DW
       •   Time consuming
       •   Expensive
       •   Performance varies
       •   Scalability issues

                                        Potential bottlenecks in standard DW architecture



   •   The DW appliance model
       •   Tuned h/w + s/w
                                                                           Faster
       •   Views entire stack holistically             Lower TCO
                                                                         deployment

       •   Known performance & scalability                       Benefits

       •   Encapsulates best practices                    Better            Minimised
                                                       performance          DBA time
       •   Leverages Sequential I/O

©2009 Microsoft Corporation
Software:
  • SQL Server 2008
     Enterprise
  • Windows Server 2008

Configuration guidelines:
     • Physical table structures
     • Indexes
     • Compression
     • SQL Server settings
     • Windows Server settings
     • Loading

Hardware:
  • Tight specifications for servers,
    storage and networking
  • ‘Per core’ building block
Reduces DBA effort; fewer indexes,
  much higher level of sequential I/O




        Dell, HP, Bull, EMC and IBM – more in
                         future




               Commodity Hardware and value pricing;
                      Lower storage costs.




        New reference architectures scale up to
         48TB (assuming 2.5x compression)



Validated by Microsoft; better choice of
hardware; application of Best Practice
SQL Server
                          Teradata                                 Comparison
                                             Fast Track DW
        Loading
                        5:10:21 total time    0:51:31 total time       R
     Subject Area 1
                                                                     6x faster

        Loading
                        4:36:08 total time    1:50.01 total time       R
     Subject Area 2
                                                                    2.5x faster

      Query times      3:03 avg query time
                       (using 9 benchmark
                                             0:15 avg query time
                                             (using 9 benchmark        R
     Subject Area 1          queries)              queries)         12x faster

      Query times     56:44 avg query time
                       (using 4 benchmark
                                             8:09 avg query time
                                             (using 4 benchmark
                                                                       R
     Subject Area 2          queries)              queries)         7x faster


©2009 Microsoft Corporation
•
    −

    −
    −

•
    −

    −
    −

•
    −

    −
    −
•
    −
    −
    −

•
    −
    −
    −

•
    −

    −
    −
•
    −

    −
    −
    −
    −

•
    −

    −
    −
    −
    −
•
                         −

                         −
                         −

                   •
                         −

                         −
                         −

                   •
                         −
                         −
                         −
                         −
                         −
Microsoft Confidential
Fast Track vNext
                                                            Fast Track Data Warehouse 2.0                   Future Partners to create new
   Enterprise ETL Services                                                                                  Validated Reference
   Star Join Query Optimizations                           New Reference Architectures from
                                                           IBM                                              Architectures with Test Harness
                                                           Updated Configurations from HP,
                                                           Dell and Bull
                                                           EMC as a Service Partner for Fast
                                                           Track




    2008                                2009                                    2010                              Beyond




                               Fast Track Data Warehouse                  New Test Harness for Partners
                              DW Reference Architectures                   Microsoft to create new Test
                              Predictable performance at low               Harness for validation of new
                              cost                                         Fast Track configurations
                              Faster time to solution                      NEC to validate new Reference
                                                                           Architectures




Microsoft Confidential—Preliminary                                                                           16
•
•

•
•
•
Parallel Data Warehouse compute node




      Database Server      Storage Node
Parallel Data Warehouse Appliance - Hardware
Architecture
                                                                  Database Servers                              Storage Nodes


                      Control Nodes
                                                                                     SQL
                      Active / Passive
                                                                                     SQL


                                         SQL
                                                                                     SQL



                                                                                     SQL


                    Management Servers




                                                                                           Dual Fiber Channel
                                                                                     SQL




                                               Dual Infiniband
                                                                                     SQL



                      Landing Zone
                                                                                     SQL


                                                                                     SQL



                                                                                     SQL
                       Backup Node
                                                                                     SQL


                                                                 Spare Database Server


Corporate Network    Private Network
Parallel Data Warehouse demo at BI conference 2008
                                       • Query
                                        ‐ Cache flushed
                                        ‐ Inner joins

                                         • Report
                                           ‐ Retailer: day-part analysis
                                           ‐ Sales, Time, Date, Prod type




• Sample Results
 ‐ 625K rows returned in 11 seconds
   from 1 trillion row table
 ‐ Final product will be even faster
Existing                  Current            Madison
    Environment                Challenges         Highlights

Hardware                   Data Load Speeds    Improved by 300%
16 CPU HP 8620 Itanium
Hitachi Storage 27TB Raw
SATA 21 LUNS
                           Analytic Capacity   30TB/160 Cores

Software                   Analytic Speed      Query Speeds 70X
Windows 2003 SP2                               Improvement
SQLServer 2008
SSIS/SSRS
                           Mixed Workload      Concurrency
Data Warehouse                                 Mixed Workload
18 Terabytes
Star Schema                Total Cost of       TCO Lowered by
80 Fact Tables
500 + Dimensions
                              Ownership        50%
Parallel Data Warehouse

•
•
•
•
    −
    −
•
•
    −
    −
PDW vNext
                                                                                                       Focus on continually lowering the
Microsoft Announce Intention to                    MTP Program Launched                                costs of high end DW, while
Acquire DATAllegro (July)                          Circa 10 Customers Provided with early              increasing performance
Acquisition Closes (Sept)                          Madison Benchmark                                   Additional Hardware Partners
150TB demo of DATAllegro on SQL                    Madison Named as SQL Server 2008 R2                 Closer functional alignment with SQL
Server run at BI Conference (Oct)                  Parallel Data Warehouse                             Server
                                                   List Price at $57.5K per proc                       Better integration with SQL and tools
                                                                                                       and technologies




 2008                               2009                                 2010                                     Beyond




                        Project “Madison”
                                                                      MTP 2 Program to Launch (fully
                        Compatibility with DATAllegro v3              functional, fully performant)
                        MS BI integration                             TAP Program (on client site)
                                                                      RTM in H1 2010



                                                                                                                     ?
Hub and Spoke – Flexible Business Alignment




  EDW provides “single version of truth” but makes it difficult to support mixed
          workloads and multiple user groups, each requiring SLAs
Hub and Spoke – Flexible Business Alignment




   Departmental data marts enable mixed workloads, but make it difficult to
               consolidate information across the enterprise
Hub and Spoke – Flexible Business Alignment

    Parallel database copy                                  Support user groups with
    technology enables rapid                                very different SLAs:
    data movement and                                                 Performance
    consistency between hub                                           Capacity
    and spokes                                                        Loading
                                                                      Concurrency




      Create SQL Server 2008, Fast Track Data Warehouse, and SQL Server Analysis
                                    Services spokes

A Hub and Spoke solution gives you the flexibility to add/change diverse workloads/user
          groups, while maintaining data consistency across the enterprise
GEO AREAS   METRICS
Analytic MDM
Faster time to solution
                          High scale: up to 48TB
   Fast Track             Low TCO with better price performance; industry standard hardware
Data Warehouse            Better performance out of the box and predictable performance
offers customers          Reduced risk through balanced hardware & Best practices
                          Integration with Madison Hub & Spoke Architecture




                                          Twelve reference architectures from HP, Dell, Bull, EMC
 SQL Server Fast Track Data               and IBM
Warehouse has 2 components
                                          System Integrators with industry solution templates –
                                          Avanade, HP, Hitachi, Cognizant and EMC
• Fast Track Data Warehouse offers
    −
    −
    −
    −
    −
• Parallel Data Warehouse offers
    −
    −
    −
    −
•
    −
    −

    −
© 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries.
The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions,
                 it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation.
                                       MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Más contenido relacionado

La actualidad más candente

The Collaboration Imperative - Extending Back-office Systems to Automate All...
The Collaboration Imperative -  Extending Back-office Systems to Automate All...The Collaboration Imperative -  Extending Back-office Systems to Automate All...
The Collaboration Imperative - Extending Back-office Systems to Automate All...
SAP Ariba
 
ADMA Macquarie Campaign Measurement
ADMA Macquarie Campaign MeasurementADMA Macquarie Campaign Measurement
ADMA Macquarie Campaign Measurement
Datalicious
 

La actualidad más candente (12)

Big Data Challenges
Big Data ChallengesBig Data Challenges
Big Data Challenges
 
The Business Performance Index Development And Evolution & Performance
The Business Performance Index   Development And Evolution & PerformanceThe Business Performance Index   Development And Evolution & Performance
The Business Performance Index Development And Evolution & Performance
 
Confluence Adoption: Techniques for Growing Your Wiki
Confluence Adoption: Techniques for Growing Your WikiConfluence Adoption: Techniques for Growing Your Wiki
Confluence Adoption: Techniques for Growing Your Wiki
 
SunCorp Campaign Measurement
SunCorp Campaign MeasurementSunCorp Campaign Measurement
SunCorp Campaign Measurement
 
2011 Game Changer Presentation Agenda
2011 Game Changer Presentation Agenda2011 Game Changer Presentation Agenda
2011 Game Changer Presentation Agenda
 
Lte asia 2011 s niri
Lte asia 2011 s niriLte asia 2011 s niri
Lte asia 2011 s niri
 
Dcom be-en-data-assessment-approach
Dcom be-en-data-assessment-approachDcom be-en-data-assessment-approach
Dcom be-en-data-assessment-approach
 
The Collaboration Imperative - Extending Back-office Systems to Automate All...
The Collaboration Imperative -  Extending Back-office Systems to Automate All...The Collaboration Imperative -  Extending Back-office Systems to Automate All...
The Collaboration Imperative - Extending Back-office Systems to Automate All...
 
Making Confluence an Enterprise Standard for Knowledge Management - Atlassian...
Making Confluence an Enterprise Standard for Knowledge Management - Atlassian...Making Confluence an Enterprise Standard for Knowledge Management - Atlassian...
Making Confluence an Enterprise Standard for Knowledge Management - Atlassian...
 
Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX Datamatics
Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX DatamaticsWebinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX Datamatics
Webinar: Scaling MongoDB through Sharding - A Case Study with CIGNEX Datamatics
 
Macquarie ADMA Campaign Measurement
Macquarie ADMA Campaign Measurement Macquarie ADMA Campaign Measurement
Macquarie ADMA Campaign Measurement
 
ADMA Macquarie Campaign Measurement
ADMA Macquarie Campaign MeasurementADMA Macquarie Campaign Measurement
ADMA Macquarie Campaign Measurement
 

Similar a Sql2008 R2 Dw (Phua Chiu Kiang)

Mass tlc presentation menninger
Mass tlc presentation    menningerMass tlc presentation    menninger
Mass tlc presentation menninger
MassTLC
 
Mass tlc presentation menninger
Mass tlc presentation    menningerMass tlc presentation    menninger
Mass tlc presentation menninger
MassTLC
 
Airtel Represented at The Mobile VAS SUMMIT 2009
Airtel Represented at The Mobile VAS SUMMIT 2009Airtel Represented at The Mobile VAS SUMMIT 2009
Airtel Represented at The Mobile VAS SUMMIT 2009
Paritosh Sharma
 
Windows azure platform_business_overview
Windows azure platform_business_overviewWindows azure platform_business_overview
Windows azure platform_business_overview
Allan Naim
 
DYN MassTLC go-to-market strategy
DYN MassTLC go-to-market strategyDYN MassTLC go-to-market strategy
DYN MassTLC go-to-market strategy
MassTLC
 
Bottlenecks exposed web app db servers
Bottlenecks exposed web app db serversBottlenecks exposed web app db servers
Bottlenecks exposed web app db servers
Upender Dravidum
 

Similar a Sql2008 R2 Dw (Phua Chiu Kiang) (20)

How to Increase Performance and Virtualization Efficiency with Emulex 16Gb FC...
How to Increase Performance and Virtualization Efficiency with Emulex 16Gb FC...How to Increase Performance and Virtualization Efficiency with Emulex 16Gb FC...
How to Increase Performance and Virtualization Efficiency with Emulex 16Gb FC...
 
Mass tlc presentation menninger
Mass tlc presentation    menningerMass tlc presentation    menninger
Mass tlc presentation menninger
 
Mass tlc presentation menninger
Mass tlc presentation    menningerMass tlc presentation    menninger
Mass tlc presentation menninger
 
Managing highly virtualized environments - Presented by Softchoice and VMware
Managing highly virtualized environments - Presented by Softchoice and VMwareManaging highly virtualized environments - Presented by Softchoice and VMware
Managing highly virtualized environments - Presented by Softchoice and VMware
 
Airtel Represented at The Mobile VAS SUMMIT 2009
Airtel Represented at The Mobile VAS SUMMIT 2009Airtel Represented at The Mobile VAS SUMMIT 2009
Airtel Represented at The Mobile VAS SUMMIT 2009
 
iStrategy 2012, Melbourne: Customer relevance - the next frontier for competi...
iStrategy 2012, Melbourne: Customer relevance - the next frontier for competi...iStrategy 2012, Melbourne: Customer relevance - the next frontier for competi...
iStrategy 2012, Melbourne: Customer relevance - the next frontier for competi...
 
01 roland top storage trends_praha_02
01 roland top storage trends_praha_0201 roland top storage trends_praha_02
01 roland top storage trends_praha_02
 
Top 5 Ways the Cloud is Impacting Your IT
Top 5 Ways the Cloud is Impacting Your ITTop 5 Ways the Cloud is Impacting Your IT
Top 5 Ways the Cloud is Impacting Your IT
 
Windows azure platform_business_overview
Windows azure platform_business_overviewWindows azure platform_business_overview
Windows azure platform_business_overview
 
Choosing the Right Cloud Applications in a Challenging Economy
Choosing the Right Cloud Applications in a Challenging EconomyChoosing the Right Cloud Applications in a Challenging Economy
Choosing the Right Cloud Applications in a Challenging Economy
 
DYN MassTLC go-to-market strategy
DYN MassTLC go-to-market strategyDYN MassTLC go-to-market strategy
DYN MassTLC go-to-market strategy
 
Dc design
Dc designDc design
Dc design
 
Roi by it-roi
Roi by it-roiRoi by it-roi
Roi by it-roi
 
Riverbed Cascade and VXLAN Monitoring
Riverbed Cascade and VXLAN MonitoringRiverbed Cascade and VXLAN Monitoring
Riverbed Cascade and VXLAN Monitoring
 
Introduction to Enterprise Cloud Economics
Introduction to Enterprise Cloud EconomicsIntroduction to Enterprise Cloud Economics
Introduction to Enterprise Cloud Economics
 
Bottlenecks exposed web app db servers
Bottlenecks exposed web app db serversBottlenecks exposed web app db servers
Bottlenecks exposed web app db servers
 
Cloud is Transforming the Enterprise
Cloud is Transforming the EnterpriseCloud is Transforming the Enterprise
Cloud is Transforming the Enterprise
 
MSP Best Practice | Staffing for Growth and Core KPIs to Use
MSP Best Practice | Staffing for Growth and Core KPIs to UseMSP Best Practice | Staffing for Growth and Core KPIs to Use
MSP Best Practice | Staffing for Growth and Core KPIs to Use
 
Proformative:The Three Stages of Cloud Economics
Proformative:The Three Stages of Cloud EconomicsProformative:The Three Stages of Cloud Economics
Proformative:The Three Stages of Cloud Economics
 
Data Migration and MDM - DMM5
Data Migration and MDM - DMM5Data Migration and MDM - DMM5
Data Migration and MDM - DMM5
 

Último

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

Sql2008 R2 Dw (Phua Chiu Kiang)

  • 1. Phua Chiu Kiang Microsoft MVP (SQL Server)
  • 3. Microsoft Data Warehousing Vision Make SQL Server the gold standard for data warehousing offering customers Massive Scalability at Low Hardware Choice Improved Business Agility Cost and Alignment
  • 4. Approximate data volume • managed by data warehouse • Today In 3 Years • 21% Less than 500 GB 5% • • 500 GB – 1 TB 12% 20% 21% 1 – 3 TB 18% • 19% • 3 – 10 TB 25% 17% More than 10 TB 34% 2% Don’t Know 6% Source: TDWI Report – Next Generation DW Microsoft Confidential—Preliminary Information Subject to Change 4
  • 5. Data Warehouse Industry Trends 100% Broad Commitment Advanced Centralized Analytics EDW Data Quality  75% Analytics within EDW HA for DW  64-bit  MDM  Analytics Web Services Outside EDW DBMS Built for DW  Plan to Use Real-time DW Blades in Security MPP Racks 50% DBMS Built for DW Appliance  Streaming Data  Transactions Mixed Workloads  SOA Server Virtualization Data Federation Low-Power SMP Hardware Columnar DBMS DW In-Memory DBMS Bundles 25% SaaS Narrow Commitment Open Source Open Source OS Reporting Software Open Source Appliance Data Integration Open Source DBMS Public Cloud 0% -50% -25% 0% 25% 50% 75% 100% Decreasing Usage Anticipated Growth in the next 3 Years Increasing Usage  Areas of strategic investment for Microsoft Source: TDWI
  • 6. 6
  • 7. Building a traditional DW • Time consuming • Expensive • Performance varies • Scalability issues Potential bottlenecks in standard DW architecture • The DW appliance model • Tuned h/w + s/w Faster • Views entire stack holistically Lower TCO deployment • Known performance & scalability Benefits • Encapsulates best practices Better Minimised performance DBA time • Leverages Sequential I/O ©2009 Microsoft Corporation
  • 8. Software: • SQL Server 2008 Enterprise • Windows Server 2008 Configuration guidelines: • Physical table structures • Indexes • Compression • SQL Server settings • Windows Server settings • Loading Hardware: • Tight specifications for servers, storage and networking • ‘Per core’ building block
  • 9. Reduces DBA effort; fewer indexes, much higher level of sequential I/O Dell, HP, Bull, EMC and IBM – more in future Commodity Hardware and value pricing; Lower storage costs. New reference architectures scale up to 48TB (assuming 2.5x compression) Validated by Microsoft; better choice of hardware; application of Best Practice
  • 10. SQL Server Teradata Comparison Fast Track DW Loading 5:10:21 total time 0:51:31 total time R Subject Area 1 6x faster Loading 4:36:08 total time 1:50.01 total time R Subject Area 2 2.5x faster Query times 3:03 avg query time (using 9 benchmark 0:15 avg query time (using 9 benchmark R Subject Area 1 queries) queries) 12x faster Query times 56:44 avg query time (using 4 benchmark 8:09 avg query time (using 4 benchmark R Subject Area 2 queries) queries) 7x faster ©2009 Microsoft Corporation
  • 11. − − − • − − − • − − −
  • 12. − − − • − − − • − − −
  • 13. − − − − − • − − − − −
  • 14. − − − • − − − • − − − − − Microsoft Confidential
  • 15. Fast Track vNext Fast Track Data Warehouse 2.0 Future Partners to create new Enterprise ETL Services Validated Reference Star Join Query Optimizations New Reference Architectures from IBM Architectures with Test Harness Updated Configurations from HP, Dell and Bull EMC as a Service Partner for Fast Track 2008 2009 2010 Beyond Fast Track Data Warehouse New Test Harness for Partners DW Reference Architectures Microsoft to create new Test Predictable performance at low Harness for validation of new cost Fast Track configurations Faster time to solution NEC to validate new Reference Architectures Microsoft Confidential—Preliminary 16
  • 17. Parallel Data Warehouse compute node Database Server Storage Node
  • 18. Parallel Data Warehouse Appliance - Hardware Architecture Database Servers Storage Nodes Control Nodes SQL Active / Passive SQL SQL SQL SQL Management Servers Dual Fiber Channel SQL Dual Infiniband SQL Landing Zone SQL SQL SQL Backup Node SQL Spare Database Server Corporate Network Private Network
  • 19. Parallel Data Warehouse demo at BI conference 2008 • Query ‐ Cache flushed ‐ Inner joins • Report ‐ Retailer: day-part analysis ‐ Sales, Time, Date, Prod type • Sample Results ‐ 625K rows returned in 11 seconds from 1 trillion row table ‐ Final product will be even faster
  • 20. Existing Current Madison Environment Challenges Highlights Hardware Data Load Speeds Improved by 300% 16 CPU HP 8620 Itanium Hitachi Storage 27TB Raw SATA 21 LUNS Analytic Capacity 30TB/160 Cores Software Analytic Speed Query Speeds 70X Windows 2003 SP2 Improvement SQLServer 2008 SSIS/SSRS Mixed Workload Concurrency Data Warehouse Mixed Workload 18 Terabytes Star Schema Total Cost of TCO Lowered by 80 Fact Tables 500 + Dimensions Ownership 50%
  • 21. Parallel Data Warehouse • • • • − − • • − −
  • 22. PDW vNext Focus on continually lowering the Microsoft Announce Intention to MTP Program Launched costs of high end DW, while Acquire DATAllegro (July) Circa 10 Customers Provided with early increasing performance Acquisition Closes (Sept) Madison Benchmark Additional Hardware Partners 150TB demo of DATAllegro on SQL Madison Named as SQL Server 2008 R2 Closer functional alignment with SQL Server run at BI Conference (Oct) Parallel Data Warehouse Server List Price at $57.5K per proc Better integration with SQL and tools and technologies 2008 2009 2010 Beyond Project “Madison” MTP 2 Program to Launch (fully Compatibility with DATAllegro v3 functional, fully performant) MS BI integration TAP Program (on client site) RTM in H1 2010 ?
  • 23.
  • 24. Hub and Spoke – Flexible Business Alignment EDW provides “single version of truth” but makes it difficult to support mixed workloads and multiple user groups, each requiring SLAs
  • 25. Hub and Spoke – Flexible Business Alignment Departmental data marts enable mixed workloads, but make it difficult to consolidate information across the enterprise
  • 26. Hub and Spoke – Flexible Business Alignment Parallel database copy Support user groups with technology enables rapid very different SLAs: data movement and Performance consistency between hub Capacity and spokes Loading Concurrency Create SQL Server 2008, Fast Track Data Warehouse, and SQL Server Analysis Services spokes A Hub and Spoke solution gives you the flexibility to add/change diverse workloads/user groups, while maintaining data consistency across the enterprise
  • 27.
  • 28. GEO AREAS METRICS
  • 29.
  • 30.
  • 31.
  • 33.
  • 34.
  • 35.
  • 36.
  • 37.
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44.
  • 45.
  • 46.
  • 47.
  • 48.
  • 49.
  • 50.
  • 51. Faster time to solution High scale: up to 48TB Fast Track Low TCO with better price performance; industry standard hardware Data Warehouse Better performance out of the box and predictable performance offers customers Reduced risk through balanced hardware & Best practices Integration with Madison Hub & Spoke Architecture Twelve reference architectures from HP, Dell, Bull, EMC SQL Server Fast Track Data and IBM Warehouse has 2 components System Integrators with industry solution templates – Avanade, HP, Hitachi, Cognizant and EMC
  • 52. • Fast Track Data Warehouse offers − − − − − • Parallel Data Warehouse offers − − − − • − − −
  • 53.
  • 54.
  • 55. © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.