SlideShare una empresa de Scribd logo
1 de 34
Sizing Your Content Databases:
 Understanding the New Limits

        Randy Williams
          AvePoint
Randy Williams
         • Enterprise Trainer & Evangelist – AvePoint
         • 20+ years in IT
            ●   developer, consultant, trainer, author
         • Three-time SharePoint MVP
         • Speaker at many global conferences

         randy.williams@avepoint.com
         http://linkd.in/plEEb1
         @tweetraw
Agenda

 Understanding                Remote BLOB
   new limits                 storage (RBS)




    1 2          Achieving
                                 3 4          Summary
                   larger                       Q&A
                 capacities
Agenda

 Understanding
   new limits




    1
The SharePoint storage dilemma

• Documents, databases, and BLOBs
• Storage growth
                                                 SharePoint

                                                  SQL Server
                                                   2008/R2


                                                   Content
                                                  Database
                                                    Content
                                         Content Database
                                                   Database

       Active Content   Actual Content
Previously supported limits

                                                                      Large, single
                                                                      -site
                                                                      repositories
                                                               1 TB
                                                                      and archives
                           General use                                (records
          200 GB
                           scenarios                                  center)




                                         100 GB site collection *


* A larger site collection is supported if it
is the only site collection in the database
Revised limits (July ‘11)
                                                                  Document
                                                         No       archive
                                                       explicit   scenario:
                                      All scenarios:    limit     caveats
                          4 TB
                                      caveats                     apply
         General use                  apply
200 GB
         scenarios




                           Site collection
                       No explicit size – limit by
                       scenario, database size,
                             item count
Understanding scenarios

• SharePoint is multi-purpose
• Scenario primarily refers to needs and
  usage patterns
  ●   Read/write centric
  ●   Concurrent users
  ●   Average/peak loads
  ●   Recovery objectives
• Isolate different usage patterns to
  separate databases
Common scenarios

Record Center                 Team Site
• Long term retention         • Day to day collaboration
• Low volatility – very few     w/ shorter retention
  write operations            • Higher volatility
• Limited reads               • Higher reads

Larger databases              Smaller databases
What are the 4TB-level caveats?

   • A larger db requires faster storage
        ●   Between 0.25 – 2.0 IOPS/GB
        ●   4TB DB : 1000 IOPS minimum
   • Plans developed for DR/HA
   • Capacity planning/perf testing
   • Recognize added complexity
        ●   Skilled architects and proactive admins
   • 60M total item limit per db
http://technet.microsoft.com/en-us/library/cc262787.aspx
What are the >4TB caveats?

   • All 4TB caveats, plus
   • Document Center or Record Center only
   • In any given month
        ●   <5% of content accessed
        ●   <1% of content modified
   • No alerts, user workflow, item-level
     security, et al


http://technet.microsoft.com/en-us/library/cc262787.aspx
Why is 200GB still a good number?

• Support operations are much easier
• Better performance
  ●   The larger the db, the slower it gets
• Easier to meet backup and recovery
  objectives
  ●   Most recoveries begin with a db restore
  ●   Can you meet your recovery objectives?
• Patching / upgrading is faster
                                                200 GB
Why are larger DBs slower?

• Select queries take longer
  ●   More rows to filter, group and sort
• Write queries take longer
• Locking escalation
  ●   More blocking
• More data, but data cache same size
• DB maintenance takes longer
  ●   reindex
  ●   dbcc checkdb
What happens as size increases?




http://technet.microsoft.com/en-us/library/hh395916.aspx
Demo

SIZE AFFECTS PERFORMANCE
Agenda




          2
         Achieving
           larger
         capacities
Achieving storage performance

• Storage array (RAID 1+0)
  ●   10 300GB SAS drives, 15k RPM
  ●   1.5 TB effective space
  ●   ~1500 IOPS = 1.0 IOPS/GB
• Set of drives (RAID 1+0)
  ●   4 750GB SATA drives, 10k RPM
  ●   1.5 TB effective space
  ●   ~300 IOPS = 0.2 IOPS/GB
• Go with higher quality storage
  ●   SAS > SATA ; SAN > DAS
Scaling storage

• Multiple storage arrays (RAID 1+0)
• Break out into multiple LUNs
• Add additional data files to DB, one per
  array
                                F:SP_DocCenter_1.mdf
• Advice                        G: SP_DocCenter_2.ndf
                                                                 Data
   ●   Many smaller drives >           H: SP_DocCenter_3.ndf
                                        I: SP_DocCenter_4.ndf
       fewer larger ones
                                        J: SP_DocCenter.ldf     Log
   ●   RAID 1+0 > RAID 5
Additional performance guidance
    • How many data files?
        ●   Advice varies – between 0.25 to 1 per physical CPU
        ●   Each on a different spindle/LUN
    • Adjust database growth settings
        ●   Use 50-100MB for each data file
        ●   Use 20-40MB for log
    • Enable instant file initialization
    • Optimize tempdb
        ●   Use multiple data files
        ●   Pre-size to 25% of largest db
        ●   RAID 1+0
http://slidesha.re/pwVlJM
Demo (if time permits)

DB SETTINGS AFFECT
PERFORMANCE
Achieving Disaster Recovery

   • Built-in SharePoint backup is incapable of
     working with large capacities
        ●   Site collection backup limit : 15GB
        ●   Practical database backup limit : 200GB
   • Look at your backup/recovery objectives
        ●   Most recoveries involve a database restore
   • Look for third-party solutions
   • Deploy SP1 – site recycle bin

http://slidesha.re/rlv3u1
Agenda

         Remote BLOB
         storage (RBS)




            3
Remote BLOB Storage (RBS)

• Storing document (BLOB) outside
  database
  ●   Reduce database size
• Cannot be used to scale beyond database
  limits
  ●   Effective size = DB size + BLOB store
• Can externalize based on document size
• Built in RBS support with SQL Server
  2008 (FILESTREAM provider)
Overview of BLOB externalization




                                             Pointer
                                              (stub)
                                    RBS
           Upload                         SQL Server



                    Web Front-end




Externalized BLOB is
transparent to both                       File System

SharePoint and its users
Advantages of externalizing BLOBs

• Reduce storage costs
• Increase performance
  ●   Read & write
  ●   All other activity by users of the DB and SQL server
• Access to features of BLOB storage
  platform
• Efficient content restructure
  ●   Shallow copy in SP1
Advantages of keeping BLOBs in
SQL
• One storage container to
   ●   Maintain
   ●   Monitor
   ●   Recover
• Tier I storage
   ●   Performance relative to lower tiers of storage
       benefits all content access
• SQL caching
   ●   Performance of reads/writes of small documents
   ●   SQL caching benefits reads
RBS Guidance

• Consider using in document-heavy databases
• Trade off
  ●   Storage cost & performance benefits versus
  ●   More complex architecture (support, DR, HA)
• Consider third party providers
  ●   More full-featured solutions
• In general
  ●   Do not externalize <1MB documents
  ●   Ideal number varies widely
Agenda




          4
         Summary
           Q&A
In review

• 4TB is the new supported limit for all
  scenarios
• No limit for record/document centers
• Keys to achieving larger sizes
  ●   Storage performance planning/testing
  ●   DR/HA planning/testing
• RBS offers benefits but does not extend
  these limits
Your Feedback is Important

 Please fill out a session evaluation form
  drop it off at the conference registration
                      desk.

                Thank you!
Questions?
  randy.williams@avepoint.com
  http://linkd.in/plEEb1
  @tweetraw
Sizing your Content Databases: Understanding the Limits
Sizing your Content Databases: Understanding the Limits
Sizing your Content Databases: Understanding the Limits

Más contenido relacionado

La actualidad más candente

Storing and managing your content in share point tspbug
Storing and managing your content in share point tspbugStoring and managing your content in share point tspbug
Storing and managing your content in share point tspbug
Baris Bruce Tuncertan
 
Facebook keynote-nicolas-qcon
Facebook keynote-nicolas-qconFacebook keynote-nicolas-qcon
Facebook keynote-nicolas-qcon
Yiwei Ma
 
Storage Infrastructure Behind Facebook Messages
Storage Infrastructure Behind Facebook MessagesStorage Infrastructure Behind Facebook Messages
Storage Infrastructure Behind Facebook Messages
yarapavan
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBase
enissoz
 

La actualidad más candente (19)

Inexpensive storage
Inexpensive storageInexpensive storage
Inexpensive storage
 
Find a needle in Haystack: Facebook's storage system
Find a needle in Haystack: Facebook's storage systemFind a needle in Haystack: Facebook's storage system
Find a needle in Haystack: Facebook's storage system
 
To blob or not to blob
To blob or not to blobTo blob or not to blob
To blob or not to blob
 
HBase Advanced - Lars George
HBase Advanced - Lars GeorgeHBase Advanced - Lars George
HBase Advanced - Lars George
 
HBase @ Twitter
HBase @ TwitterHBase @ Twitter
HBase @ Twitter
 
SharePoint Saturday San Antonio: SharePoint 2010 Performance
SharePoint Saturday San Antonio: SharePoint 2010 PerformanceSharePoint Saturday San Antonio: SharePoint 2010 Performance
SharePoint Saturday San Antonio: SharePoint 2010 Performance
 
HBaseCon 2013: Compaction Improvements in Apache HBase
HBaseCon 2013: Compaction Improvements in Apache HBaseHBaseCon 2013: Compaction Improvements in Apache HBase
HBaseCon 2013: Compaction Improvements in Apache HBase
 
Storing and managing your content in share point tspbug
Storing and managing your content in share point tspbugStoring and managing your content in share point tspbug
Storing and managing your content in share point tspbug
 
Facebook keynote-nicolas-qcon
Facebook keynote-nicolas-qconFacebook keynote-nicolas-qcon
Facebook keynote-nicolas-qcon
 
Presentation day1oracle 12c
Presentation day1oracle 12cPresentation day1oracle 12c
Presentation day1oracle 12c
 
Alfresco Large Scale Enterprise Deployments
Alfresco Large Scale Enterprise DeploymentsAlfresco Large Scale Enterprise Deployments
Alfresco Large Scale Enterprise Deployments
 
Storage Infrastructure Behind Facebook Messages
Storage Infrastructure Behind Facebook MessagesStorage Infrastructure Behind Facebook Messages
Storage Infrastructure Behind Facebook Messages
 
HBase Sizing Guide
HBase Sizing GuideHBase Sizing Guide
HBase Sizing Guide
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBase
 
Future of cloud storage
Future of cloud storageFuture of cloud storage
Future of cloud storage
 
Hbase: an introduction
Hbase: an introductionHbase: an introduction
Hbase: an introduction
 
Apache HBase Performance Tuning
Apache HBase Performance TuningApache HBase Performance Tuning
Apache HBase Performance Tuning
 
Hadoop Successes and Failures to Drive Deployment Evolution
Hadoop Successes and Failures to Drive Deployment EvolutionHadoop Successes and Failures to Drive Deployment Evolution
Hadoop Successes and Failures to Drive Deployment Evolution
 
Digital Library Collection Management using HBase
Digital Library Collection Management using HBaseDigital Library Collection Management using HBase
Digital Library Collection Management using HBase
 

Similar a Sizing your Content Databases: Understanding the Limits

Sizing Your Content Databases- Understanding The Limits
Sizing Your Content Databases- Understanding The LimitsSizing Your Content Databases- Understanding The Limits
Sizing Your Content Databases- Understanding The Limits
wahidsaleemi
 
Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...
Alluxio, Inc.
 
Share point 2010 performance and capacity planning best practices
Share point 2010 performance and capacity planning best practicesShare point 2010 performance and capacity planning best practices
Share point 2010 performance and capacity planning best practices
Eric Shupps
 
4. hadoop גיא לבנברג
4. hadoop  גיא לבנברג4. hadoop  גיא לבנברג
4. hadoop גיא לבנברג
Taldor Group
 

Similar a Sizing your Content Databases: Understanding the Limits (20)

Sizing Your Content Databases- Understanding The Limits
Sizing Your Content Databases- Understanding The LimitsSizing Your Content Databases- Understanding The Limits
Sizing Your Content Databases- Understanding The Limits
 
SharePoint Storage Best Practices
SharePoint Storage Best PracticesSharePoint Storage Best Practices
SharePoint Storage Best Practices
 
Scalable Filesystem Metadata Services with RocksDB
Scalable Filesystem Metadata Services with RocksDBScalable Filesystem Metadata Services with RocksDB
Scalable Filesystem Metadata Services with RocksDB
 
Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-sensitive queries for Presto at Facebook: A Collaboration ...
 
SPS Kansas City: What SharePoint Admin need to know about SQL
SPS Kansas City: What SharePoint Admin need to know about SQLSPS Kansas City: What SharePoint Admin need to know about SQL
SPS Kansas City: What SharePoint Admin need to know about SQL
 
What SharePoint Admins need to know about SQL-Cinncinati
What SharePoint Admins need to know about SQL-CinncinatiWhat SharePoint Admins need to know about SQL-Cinncinati
What SharePoint Admins need to know about SQL-Cinncinati
 
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
Optimizing Latency-Sensitive Queries for Presto at Facebook: A Collaboration ...
 
What SQL DBA's need to know about SharePoint-St. Louis 2013
What SQL DBA's need to know about SharePoint-St. Louis 2013What SQL DBA's need to know about SharePoint-St. Louis 2013
What SQL DBA's need to know about SharePoint-St. Louis 2013
 
SharePoint 2010 database maintenance
SharePoint 2010 database maintenanceSharePoint 2010 database maintenance
SharePoint 2010 database maintenance
 
Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...
Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...
Optimize MySQL Workloads with Amazon Elastic Block Store - February 2017 AWS ...
 
Apache Tajo - An open source big data warehouse
Apache Tajo - An open source big data warehouseApache Tajo - An open source big data warehouse
Apache Tajo - An open source big data warehouse
 
Red Hat Storage Server Administration Deep Dive
Red Hat Storage Server Administration Deep DiveRed Hat Storage Server Administration Deep Dive
Red Hat Storage Server Administration Deep Dive
 
Evolving HDFS to Generalized Storage Subsystem
Evolving HDFS to Generalized Storage SubsystemEvolving HDFS to Generalized Storage Subsystem
Evolving HDFS to Generalized Storage Subsystem
 
Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...Scalable and High available Distributed File System Metadata Service Using gR...
Scalable and High available Distributed File System Metadata Service Using gR...
 
What SQL DBA's need to know about SharePoint
What SQL DBA's need to know about SharePointWhat SQL DBA's need to know about SharePoint
What SQL DBA's need to know about SharePoint
 
Share point 2010 performance and capacity planning best practices
Share point 2010 performance and capacity planning best practicesShare point 2010 performance and capacity planning best practices
Share point 2010 performance and capacity planning best practices
 
4. hadoop גיא לבנברג
4. hadoop  גיא לבנברג4. hadoop  גיא לבנברג
4. hadoop גיא לבנברג
 
Sps Ottawa - Storing Your Content in SharePoint
Sps Ottawa - Storing Your Content in SharePointSps Ottawa - Storing Your Content in SharePoint
Sps Ottawa - Storing Your Content in SharePoint
 
Still All on One Server: Perforce at Scale
Still All on One Server: Perforce at Scale Still All on One Server: Perforce at Scale
Still All on One Server: Perforce at Scale
 
HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...
HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...
HBaseConAsia2018 Track1-5: Improving HBase reliability at PInterest with geo ...
 

Más de Randy Williams

Integrating SharePoint with Exchange-2013
Integrating SharePoint with Exchange-2013Integrating SharePoint with Exchange-2013
Integrating SharePoint with Exchange-2013
Randy Williams
 
Governance - how does information & security drive your architecture
Governance - how does information & security drive your architectureGovernance - how does information & security drive your architecture
Governance - how does information & security drive your architecture
Randy Williams
 

Más de Randy Williams (20)

Architecting Solutions and Systems – Randy’s Secrets to Success
Architecting Solutions and Systems – Randy’s Secrets to SuccessArchitecting Solutions and Systems – Randy’s Secrets to Success
Architecting Solutions and Systems – Randy’s Secrets to Success
 
An IT Pro Guide to Deploying and Managing SharePoint 2013 Apps
An IT Pro Guide to Deploying and Managing SharePoint 2013 AppsAn IT Pro Guide to Deploying and Managing SharePoint 2013 Apps
An IT Pro Guide to Deploying and Managing SharePoint 2013 Apps
 
Integrating SharePoint with Exchange-2013
Integrating SharePoint with Exchange-2013Integrating SharePoint with Exchange-2013
Integrating SharePoint with Exchange-2013
 
Governing and managing hybrid SharePoint environments
Governing and managing hybrid SharePoint environmentsGoverning and managing hybrid SharePoint environments
Governing and managing hybrid SharePoint environments
 
When governance lacks compliance
When governance lacks complianceWhen governance lacks compliance
When governance lacks compliance
 
The Future of Social Collaboration
The Future of Social CollaborationThe Future of Social Collaboration
The Future of Social Collaboration
 
Getting Started with SharePoint 2013 Apps
Getting Started with SharePoint 2013 AppsGetting Started with SharePoint 2013 Apps
Getting Started with SharePoint 2013 Apps
 
Building the next generation of browser apps today
Building the next generation of browser apps todayBuilding the next generation of browser apps today
Building the next generation of browser apps today
 
SQL Server 2012 and SharePoint 2010: Reporting Nirvana
SQL Server 2012 and SharePoint 2010: Reporting NirvanaSQL Server 2012 and SharePoint 2010: Reporting Nirvana
SQL Server 2012 and SharePoint 2010: Reporting Nirvana
 
How governance drives your information and security architecture
How governance drives your information and security architectureHow governance drives your information and security architecture
How governance drives your information and security architecture
 
SharePoint Online and the Cloud
SharePoint Online and the CloudSharePoint Online and the Cloud
SharePoint Online and the Cloud
 
Enforcing SharePoint Governance
Enforcing SharePoint GovernanceEnforcing SharePoint Governance
Enforcing SharePoint Governance
 
Migrating to Office 365
Migrating to Office 365Migrating to Office 365
Migrating to Office 365
 
Governance - how does information & security drive your architecture
Governance - how does information & security drive your architectureGovernance - how does information & security drive your architecture
Governance - how does information & security drive your architecture
 
Understanding SharePoint Governance
Understanding SharePoint GovernanceUnderstanding SharePoint Governance
Understanding SharePoint Governance
 
Why I Use SharePoint
Why I Use SharePointWhy I Use SharePoint
Why I Use SharePoint
 
Guiding a Successful SharePoint Implementation
Guiding a Successful SharePoint ImplementationGuiding a Successful SharePoint Implementation
Guiding a Successful SharePoint Implementation
 
Plugging holes in your SharePoint 2010 disaster recovery strategy
Plugging holes in your SharePoint 2010 disaster recovery strategyPlugging holes in your SharePoint 2010 disaster recovery strategy
Plugging holes in your SharePoint 2010 disaster recovery strategy
 
Spicing up SharePoint web parts
Spicing up SharePoint web partsSpicing up SharePoint web parts
Spicing up SharePoint web parts
 
Using Visual Studio to Build SharePoint 2010 Solutions
Using Visual Studio to Build SharePoint 2010 SolutionsUsing Visual Studio to Build SharePoint 2010 Solutions
Using Visual Studio to Build SharePoint 2010 Solutions
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 

Sizing your Content Databases: Understanding the Limits

  • 1. Sizing Your Content Databases: Understanding the New Limits Randy Williams AvePoint
  • 2. Randy Williams • Enterprise Trainer & Evangelist – AvePoint • 20+ years in IT ● developer, consultant, trainer, author • Three-time SharePoint MVP • Speaker at many global conferences randy.williams@avepoint.com http://linkd.in/plEEb1 @tweetraw
  • 3. Agenda Understanding Remote BLOB new limits storage (RBS) 1 2 Achieving 3 4 Summary larger Q&A capacities
  • 4. Agenda Understanding new limits 1
  • 5. The SharePoint storage dilemma • Documents, databases, and BLOBs • Storage growth SharePoint SQL Server 2008/R2 Content Database Content Content Database Database Active Content Actual Content
  • 6. Previously supported limits Large, single -site repositories 1 TB and archives General use (records 200 GB scenarios center) 100 GB site collection * * A larger site collection is supported if it is the only site collection in the database
  • 7. Revised limits (July ‘11) Document No archive explicit scenario: All scenarios: limit caveats 4 TB caveats apply General use apply 200 GB scenarios Site collection No explicit size – limit by scenario, database size, item count
  • 8. Understanding scenarios • SharePoint is multi-purpose • Scenario primarily refers to needs and usage patterns ● Read/write centric ● Concurrent users ● Average/peak loads ● Recovery objectives • Isolate different usage patterns to separate databases
  • 9. Common scenarios Record Center Team Site • Long term retention • Day to day collaboration • Low volatility – very few w/ shorter retention write operations • Higher volatility • Limited reads • Higher reads Larger databases Smaller databases
  • 10. What are the 4TB-level caveats? • A larger db requires faster storage ● Between 0.25 – 2.0 IOPS/GB ● 4TB DB : 1000 IOPS minimum • Plans developed for DR/HA • Capacity planning/perf testing • Recognize added complexity ● Skilled architects and proactive admins • 60M total item limit per db http://technet.microsoft.com/en-us/library/cc262787.aspx
  • 11. What are the >4TB caveats? • All 4TB caveats, plus • Document Center or Record Center only • In any given month ● <5% of content accessed ● <1% of content modified • No alerts, user workflow, item-level security, et al http://technet.microsoft.com/en-us/library/cc262787.aspx
  • 12. Why is 200GB still a good number? • Support operations are much easier • Better performance ● The larger the db, the slower it gets • Easier to meet backup and recovery objectives ● Most recoveries begin with a db restore ● Can you meet your recovery objectives? • Patching / upgrading is faster 200 GB
  • 13. Why are larger DBs slower? • Select queries take longer ● More rows to filter, group and sort • Write queries take longer • Locking escalation ● More blocking • More data, but data cache same size • DB maintenance takes longer ● reindex ● dbcc checkdb
  • 14. What happens as size increases? http://technet.microsoft.com/en-us/library/hh395916.aspx
  • 16. Agenda 2 Achieving larger capacities
  • 17. Achieving storage performance • Storage array (RAID 1+0) ● 10 300GB SAS drives, 15k RPM ● 1.5 TB effective space ● ~1500 IOPS = 1.0 IOPS/GB • Set of drives (RAID 1+0) ● 4 750GB SATA drives, 10k RPM ● 1.5 TB effective space ● ~300 IOPS = 0.2 IOPS/GB • Go with higher quality storage ● SAS > SATA ; SAN > DAS
  • 18. Scaling storage • Multiple storage arrays (RAID 1+0) • Break out into multiple LUNs • Add additional data files to DB, one per array F:SP_DocCenter_1.mdf • Advice G: SP_DocCenter_2.ndf Data ● Many smaller drives > H: SP_DocCenter_3.ndf I: SP_DocCenter_4.ndf fewer larger ones J: SP_DocCenter.ldf Log ● RAID 1+0 > RAID 5
  • 19. Additional performance guidance • How many data files? ● Advice varies – between 0.25 to 1 per physical CPU ● Each on a different spindle/LUN • Adjust database growth settings ● Use 50-100MB for each data file ● Use 20-40MB for log • Enable instant file initialization • Optimize tempdb ● Use multiple data files ● Pre-size to 25% of largest db ● RAID 1+0 http://slidesha.re/pwVlJM
  • 20. Demo (if time permits) DB SETTINGS AFFECT PERFORMANCE
  • 21. Achieving Disaster Recovery • Built-in SharePoint backup is incapable of working with large capacities ● Site collection backup limit : 15GB ● Practical database backup limit : 200GB • Look at your backup/recovery objectives ● Most recoveries involve a database restore • Look for third-party solutions • Deploy SP1 – site recycle bin http://slidesha.re/rlv3u1
  • 22. Agenda Remote BLOB storage (RBS) 3
  • 23. Remote BLOB Storage (RBS) • Storing document (BLOB) outside database ● Reduce database size • Cannot be used to scale beyond database limits ● Effective size = DB size + BLOB store • Can externalize based on document size • Built in RBS support with SQL Server 2008 (FILESTREAM provider)
  • 24. Overview of BLOB externalization Pointer (stub) RBS Upload SQL Server Web Front-end Externalized BLOB is transparent to both File System SharePoint and its users
  • 25. Advantages of externalizing BLOBs • Reduce storage costs • Increase performance ● Read & write ● All other activity by users of the DB and SQL server • Access to features of BLOB storage platform • Efficient content restructure ● Shallow copy in SP1
  • 26. Advantages of keeping BLOBs in SQL • One storage container to ● Maintain ● Monitor ● Recover • Tier I storage ● Performance relative to lower tiers of storage benefits all content access • SQL caching ● Performance of reads/writes of small documents ● SQL caching benefits reads
  • 27. RBS Guidance • Consider using in document-heavy databases • Trade off ● Storage cost & performance benefits versus ● More complex architecture (support, DR, HA) • Consider third party providers ● More full-featured solutions • In general ● Do not externalize <1MB documents ● Ideal number varies widely
  • 28. Agenda 4 Summary Q&A
  • 29. In review • 4TB is the new supported limit for all scenarios • No limit for record/document centers • Keys to achieving larger sizes ● Storage performance planning/testing ● DR/HA planning/testing • RBS offers benefits but does not extend these limits
  • 30. Your Feedback is Important Please fill out a session evaluation form drop it off at the conference registration desk. Thank you!
  • 31. Questions? randy.williams@avepoint.com http://linkd.in/plEEb1 @tweetraw

Notas del editor

  1. Introduce concept of documents being stored as BLOBs in CDBBUILD: Diagram of architectureDiscuss storage growthBUILD: Bloat of data, mostly inactiveBUILD: Burden on CDBsDiscuss need to thin about storage holistically: lifecycle, compliance, SLAs, cost
  2. v4iMMm