Planning SP2013 Search for IT PROs
Nottingham 2012
The boring stuff (disclaimer)

Most of the content of this presentation was put
together using material and tests based on SharePoint
2013 Release Preview. Although a lot of it is still relevant
to the RTM version, it is provided “as-is” and there is no
guarantee as to its accuracy.

Additionally, any opinions stated are those of the author
and do not represent the views of Content and Code or
Microsoft.
About me…
Are all Search engines created equal?

= ?

Enterprise Search is a different animal, right?
The Enterprise Search market (Gartner)

[Figure: Gartner Enterprise Search market, 2006 and 2009; 2013?]
Introducing SharePoint Server 2013 Search

• FAST architecture integrated along with improvements from Bing; continuous crawl
• UI improvements: AJAX, preview panes, results blocks
• Content Search Web part reduces need for custom code
What’s new under the hood for IT PROs?
       Search capability   2007                    2010                 2013 Preview

       Architecture        SSP                     Service App          Service App


       Configurable        Query                   Query                Query Processor
       Components          Index                   Crawl                Crawl
                                                   Admin                Admin
                                                   Index Partition      Index Partition
                                                                        Content Processor
                                                                        Analytics Processor
                                                                        (Replaces SP2010 Web
                                                                        Analytics SA)
       Databases           Search                  Crawl                Crawl
                                                   Admin                Admin
                                                   Property             Property
                                                                        Analytics
                                                                        Link


       Resiliency          No index HA (although   No admin component   Admin component
                           we had query HA)        HA                   redundancy
       Management          Central Admin           Central Admin        Central Admin – except
                           STSADM                  STSADM               topology changes
                                                   PowerShell           STSADM
                                                                        PowerShell
       Scheduling          Full/Incremental        Full/Incremental     Full/Incremental
                                                                        Continuous
Quick Demo

Let's find stuff in SharePoint 2013!
Architecture
SharePoint Server 2013 Search Architecture
1. Crawl Component


                 • Invokes connectors to retrieve items and metadata
                   from Content Sources

                 • Crawl DB stores crawled item history

                 • Discovers content and metadata (e.g. Author, Title,
                   and Creation Date) collectively known as crawled
                   properties

                 • Delivers crawled properties to the Content
                   Processing Component
2. Content Processing Component




• Parses crawled items using format handlers and 3rd party iFilters

• Reports crawled properties to the Search Admin Database

• Writes URL information to the Link DB for use by the Analytics Processing Component

• Nugget from Neil Hodgkinson (Microsoft): for now we are stuck with the default PDF format handler.
3. Analytics Processing Component
                                    • Replaces SP2010 Web Analytics

                                    • Analyses crawled items and user
                                      interactions with Search results
                                      (e.g. clicks, recommendations)

                                    • Results fed back to the Content
                                      Processing Component to
                                      improve relevance

                                    • Scales well – additional APCs or
                                      databases can be added for
                                      additional throughput/capacity
4. Index Component




                • Central part of Search capability – used in both
                  feeding and Query processes:
                    • Feeding – writes items received from Content
                      Processor to index file
                    • Query – provides the result set to the Query Processor
                      (similar to the “query” component in 2010)

                • Physically moves index files in response to Search
                  topology changes.

                • Stores ACLs in disk index
Scaled out SP2013 Search Index
[Screenshots: Central Admin view (1 partition, 2 replicas) next to the "marketecture" diagram]

Use Get-SPEnterpriseSearchStatus to find the Primary Replica.
5. Query Processing Component




                • New component in SP2013. Complements the index
                  component.

                • Presents results to users!

                • Performs linguistic processing at query time, e.g.
                  spellchecking, thesaurus

                • Analyses and processes query to determine which index
                  partition to send query to and which rule(s) to apply
6. Admin Component




               • Responsible for search provisioning and topology changes

               • Search Admin DB is basically a “Config DB” for search – it contains
                 the topology, crawl/query rules, crawled/managed properties.

               • Does NOT store ACLs in 2013 – these are stored within the disk index
                 alongside content (used for security trimming results)
Demo

Create a new Search Service Application using PowerShell
What did we create?



• One of each Search component
• Up to 10m items (on paper)
• No component redundancy
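
The "up to 10m items" figure can be turned into a back-of-envelope sizing check. The sketch below is written in Python purely for illustration. It assumes (the slide does not say this explicitly) that each index partition holds roughly 10 million items and that every replica of a partition is hosted by its own index component. `partitions_needed` and `index_components_needed` are hypothetical helper names, not SharePoint APIs.

```python
import math

# Assumption: roughly 10 million items per index partition, taken from the
# "up to 10m items (on paper)" figure for a single-partition topology.
ITEMS_PER_PARTITION = 10_000_000


def partitions_needed(item_count: int) -> int:
    """Return a rough number of index partitions for a given corpus size."""
    if item_count < 0:
        raise ValueError("item_count must not be negative")
    # Even an empty index still needs one partition.
    return max(1, math.ceil(item_count / ITEMS_PER_PARTITION))


def index_components_needed(item_count: int, replicas_per_partition: int = 1) -> int:
    """Return the index component count: one component per replica per partition."""
    if replicas_per_partition < 1:
        raise ValueError("replicas_per_partition must be at least 1")
    return partitions_needed(item_count) * replicas_per_partition


if __name__ == "__main__":
    # Print the estimate for a few hypothetical corpus sizes.
    for items in (2_000_000, 10_000_000, 25_000_000):
        print(items, partitions_needed(items), index_components_needed(items, 2))
```

With one replica per partition there is still no redundancy, which matches the topology created in the demo; passing `replicas_per_partition=2` models adding a second replica of each partition.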
Topology
Minimum “Enterprise” Search hardware requirements




             These requirements are cumulative (56GB in total!)



                        http://technet.microsoft.com/en-us/library/jj219628(v=office.15).aspx
Example “medium” topology
• “Medium” topology taken from Microsoft’s
  “Topologies for SP2013” document. “Finger in
  the air” capacity:
    • Up to 10 million items
    • 10,000-20,000 users
    • 1-2 TB content

• 8 VMs on 4 physical hosts + SQL!
    • OWA (Office Web Apps) for the Search Preview Pane

• No Search components on WFE servers

• Query processing and index components hosted
  together

• Traditional “app” servers for everything else.

• No mention of a distributed cache (AppFabric)
  cluster – this could be a mistake.
Nuts and bolts
Default Search topology footprint - 2013

• 2 Service Applications and 1 Proxy in SPCA
• 2 Service App Endpoints in IIS
• 3 Services on Server
• 2 Windows Services
• 1 mssearch executable
• 5 noderunner processes (visible in Task Manager and Process Explorer)
• 4 Databases
So is it really a continuous crawl?
• Short answer: “it depends on how much content you have”.

• Overlapped/parallel crawls every 15 minutes by default. Items shown in index “within seconds”.

• Fresher content, but NOT a “silver bullet” – continuous crawl is generally run alongside a periodic full crawl.

    • E.g. a full crawl is needed for new managed properties and to clean up inaccessible/deleted items.
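
One way to read "it depends on how much content you have": if a single crawl pass takes longer than the 15-minute interval, passes start to overlap. The Python sketch below shows only that arithmetic. It assumes a new pass starts at every interval whether or not earlier passes have finished, which is a simplification of the overlapped/parallel behaviour described above. `peak_concurrent_passes` is a hypothetical name, not a SharePoint API.

```python
import math

# Default continuous crawl interval quoted on the slide.
DEFAULT_INTERVAL_MINUTES = 15


def peak_concurrent_passes(pass_duration_minutes: float,
                           interval_minutes: float = DEFAULT_INTERVAL_MINUTES) -> int:
    """Return the peak number of crawl passes running at the same time.

    Simplified model: passes start at 0, I, 2I, ... and each runs for the
    same duration D, so at most ceil(D / I) passes overlap.
    """
    if pass_duration_minutes <= 0 or interval_minutes <= 0:
        raise ValueError("durations must be positive")
    return math.ceil(pass_duration_minutes / interval_minutes)


if __name__ == "__main__":
    # Print the overlap for a few hypothetical pass durations (minutes).
    for duration in (10, 15, 40, 90):
        print(duration, peak_concurrent_passes(duration))
```

In this model the overlap grows with the time a pass takes to work through changed content, so crawl load rises with content volume even though the interval stays fixed.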
PowerShell and Search: what’s new?

• New-SPEnterpriseSearchAnalyticsProcessingComponent
   • BUT there is no matching “Get” cmdlet, which is a pain
     when trying to work with the component.

• New Get and Set cmdlets for
  SPEnterpriseSearchQueryProcessingComponent

• You must use PowerShell if you want to scale a search
  topology and to avoid GUIDs
   • No interface within SPCA to modify the topology.
Demo

Modifying the Search topology using PowerShell
What did we change?
Upgrade Considerations
Migrating SP2010 Search to 2013
• Remember that in-place upgrades are not supported

• Only the SP2010 Admin DB can be migrated to 2013.

    • SP2010 Search Admin DB contains:
       • content sources
       • crawl rules
       • start addresses
       • server name mapping
       • federated locations.

• Properties are gathered during the first crawl

• SP2010 Web Analytics does not migrate to SP2013.

• Logical topology settings, such as the servers and components in the farm, need to be manually recreated using PowerShell.

• SP2013 can crawl SharePoint 2003/2007/2010 farms to facilitate a “Search first” upgrade
SP2013 Search Boundary key changes
Limit                            2010                          2013

Crawl Databases                  10 per Search SA              5 per Search SA
Crawl Components                 16 per Search SA              2 per Search SA
Index Partitions                 20 per Search SA, 128 total   20 per Search SA
Link DB                          N/A                           2 per Search SA
Query Processing Component       N/A                           1 per server
Content Processing Component     N/A                           1 per server
Analytics Processing Component   N/A                           6 per Search SA
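
A planned topology can be checked against the 2013 column of the table above before any PowerShell is run. The following is a minimal Python sketch of such a check. The limit values are copied from the table, while the dictionary keys, the `check_topology` function and the server names are hypothetical and exist only for this example.

```python
# 2013 limits per Search service application, as listed in the table above.
# The key names are invented for this sketch.
LIMITS_2013 = {
    "crawl_databases": 5,
    "crawl_components": 2,
    "index_partitions": 20,
    "link_databases": 2,
    "analytics_processing_components": 6,
}

# Components the table limits to one per server.
ONE_PER_SERVER = ("query_processing", "content_processing")


def check_topology(planned: dict, per_server: dict) -> list:
    """Return a list of readable violations; an empty list means the plan fits.

    planned:    component or database name -> planned count for the Search SA
    per_server: server name -> list of component names placed on that server
    """
    problems = []
    for name, limit in LIMITS_2013.items():
        count = planned.get(name, 0)
        if count > limit:
            problems.append(f"{name}: {count} exceeds limit of {limit}")
    for server, components in per_server.items():
        for component in ONE_PER_SERVER:
            count = components.count(component)
            if count > 1:
                problems.append(f"{server}: {count} {component} components (max 1)")
    return problems


if __name__ == "__main__":
    # Hypothetical plan: three crawl components is one more than the table allows.
    plan = {"crawl_databases": 2, "crawl_components": 3, "index_partitions": 4}
    servers = {"APP1": ["query_processing", "content_processing"]}
    for problem in check_topology(plan, servers):
        print(problem)
```

Returning a list rather than raising on the first problem lets one pass report every limit a plan would break.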
Gotchas / considerations

• It is suggested that the Search and distributed cache services are split for large implementations
   • Impacts the “starting” topology for larger customers

• High resource requirements as discussed

• Some Search features deprecated / removed
  (see http://technet.microsoft.com/en-us/library/ff607742(v=office.15).aspx#section1):
    • No migration path for SP2010 Foundation Search settings
    • No means of modifying Search topology via UI
    • No Search SOAP Web service: http://server/site/_vti_bin/search.asmx is no more. Use CSOM/REST!
    • No Search RSS due to lack of claims support
    • No Search SQL Syntax
    • No support for docpush.exe to “push” items into the index (possible in FAST)
What about Foundation?

  • SharePoint Foundation 2013 Search capabilities are now based on the same search implementation as
    SharePoint Server 2013.

  • If using the Farm Configuration Wizard (AKA “white wizard”) in SP2013 RP, a Search Service app is created.
       • However, the PowerShell cmdlets required to scale out require a Server license.
       • RTM may be different. Any input welcome!

  • My thoughts: appropriate only for small implementations due to the single-server limitation in the Release Preview.




https://www.nothingbutsharepoint.com/sites/itpro/Pages/Search-in-SharePoint-2013-Foundation-Versus-Full-Blown-Server.aspx
The FCW solving all of our problems!??*




                          *This is a joke. The Farm Config Wizard rarely solves problems.
Summary

• SP2013 brings a bunch of cool new native Search functionality that is an
  evolution of 2010 functionality.

• Most FAST features are now integrated

• 2013 Search is resource hungry – we must plan for this!

• Continuous crawl can replace incremental but still requires full crawls

• PowerShell required for topology changes – brush up those skills!
Questions?
Thanks for listening!
Planning SharePoint 2013 Search for IT PROs

  • 1. Planning SP2013 Search for IT PROs Nottingham 2012
  • 2. The boring stuff (disclaimer) Most of the content of this presentation was put together using material and tests based on SharePoint 2013 Release Preview. Although a lot of it is still relevant to the RTM version, it is provided “as-is” and there is no guarantee as to its accuracy. Additionally, any opinions stated are that of the author and do not represent the views of Content and Code or Microsoft.
  • 4. Are all Search engines created equal? = ?
  • 5. Are all Search engines created equal? Enterprise Search is a different animal, right?
  • 6. The Enterprise Search market (Gartner) 2013? 2006 2009
  • 7. Introducing SharePoint Server 2013 Search FAST architecture integrated along with UI improvements: AJAX, preview panes, results blocks improvements from Bing; continuous crawl Content Search Web part reduces need for custom code
  • 8. What’s new under the hood for IT PROs? Search capability 2007 2010 2013 Preview Architecture SSP Service App Service App Configurable Query Query Query Processor Components Index Crawl Crawl Admin Admin Index Partition Index Partition Content Processor Analytics Processor (Replaces SP2010 Web Analytics SA) Databases Search Crawl Crawl Admin Admin Property Property Analytics Link Resiliency No index HA (although No admin component Admin component we had query HA) HA redundancy Management Central Admin Central Admin Central Admin – except STSADM STSADM topology changes PowerShell STSADM PowerShell Scheduling Full/Incremental Full/Incremental Full/Incremental Continuous
  • 9. Quick Demo Lets find stuff in SharePoint 2013!
  • 11. SharePoint Server 2013 Search Architecture
  • 12. 1. Crawl Component • Invokes connectors to retrieve items and metadata from Content Sources • Crawl DB stores crawled item history • Discovers content and metadata (e.g. Author, Title, and Creation Date) collectively known as crawled properties • Delivers crawled properties to the Content Processing Component
  • 13. 2. Content Processing Component • Parses crawled items using format handlers and 3rd party iFilters • Reports crawled properties to the Search Admin Database • Writes URL information to Link DB for usage by Analytics Component • Nugget from Neil Hodgkinson (Microsoft): for now we are stuck with the default PDF format handler.
  • 14. 3. Analytics Processing Component • Replaces SP2010 Web Analytics • Analyses crawled items and user interactions with Search results (e.g. clicks, recommendations) • Results fed back to the Content Processing Component to improve relevance • Scales well – additional APCs or databases can be added for additional throughput/capacity
  • 15. 4. Index Component • Central part of Search capability – used in both feeding and Query processes: • Feeding – writes items received from Content Processor to index file • Query – provides results set to the Query Processor (similar to the “query” component in 2010) • Physically moves index files in response to Search topology changes. • Stores ACLs in disk index
  • 16. Scaled out SP2013 Search Index • Central Admin – 1 partition, 2 replicas • Marketecture • Use Get-SPEnterpriseSearchStatus to find the Primary Replica
  • 17. 5. Query Processing Component • New component in SP2013. Complements the index component. • Presents results to users! • Performs linguistics processing at query time, e.g. spellchecking, thesaurus • Analyses and processes query to determine which index partition to send query to and which rule(s) to apply
  • 18. 6. Admin Component • Responsible for search provisioning and topology changes • Search Admin DB is basically a “Config DB” for search – it contains the topology, crawl/query rules, crawled/managed properties. • Does NOT store ACLs in 2013 – these are stored within the disk index alongside content (used for security trimming results)
  • 19. Demo Create a new Search Service Application using PowerShell
  • 20. What did we create? • One of each Search component • Up to 10m items (on paper) • No component redundancy
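The "up to 10m items" figure above lends itself to some back-of-the-envelope sizing. The following is a minimal sketch only, assuming the 10 million item figure can be treated as the capacity of a single index partition and that one index component is needed per partition replica (as the editor's notes state). The function names and the constant are made up for illustration and are not part of any SharePoint API.

```python
# Assumption: the slide's "up to 10m items (on paper)" is treated here as the
# capacity of one index partition. Adjust to your own tested figure.
ITEMS_PER_PARTITION = 10_000_000


def index_partitions_needed(item_count):
    """Smallest number of index partitions that can hold item_count items."""
    if item_count <= 0:
        return 1  # a search topology always has at least one partition
    # Integer ceiling division, avoids floating point rounding.
    return (item_count + ITEMS_PER_PARTITION - 1) // ITEMS_PER_PARTITION


def index_components_needed(item_count, replicas_per_partition=1):
    """One index component is needed per partition replica."""
    return index_partitions_needed(item_count) * replicas_per_partition


if __name__ == "__main__":
    for items in (4_000_000, 10_000_000, 25_000_000):
        print(items,
              index_partitions_needed(items),
              index_components_needed(items, replicas_per_partition=2))
```

With one replica per partition this reproduces the default topology described above (one index component, no redundancy); raising `replicas_per_partition` shows how quickly the component count grows once redundancy is added.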
  • 22. Minimum “Enterprise” Search hardware requirements These requirements are cumulative (56GB in total!) http://technet.microsoft.com/en-us/library/jj219628(v=office.15).aspx
  • 23. Example “medium” topology • “Medium” topology taken from Microsoft’s “Topologies for SP2013” document. “Finger in the air” capacity: • Up to 10 million items • 10-20,000 users • 1-2 TB content • 8 VMs on 4 physical hosts + SQL! • OWA for Search Preview Pane • No Search components on WFE servers • Query processing and index components hosted together • Traditional “app” servers for everything else. • No mention of a distributed cache (AppFabric) cluster – this could be a mistake.
  • 25. Default Search topology footprint - 2013 • 2 Service Applications and 1 Proxy in SPCA • 2 Service App Endpoints in IIS • 5 noderunner processes in Task Manager • 3 Services on Server • 2 Windows Services • 1 mssearch executable • 4 Databases • 5 noderunner processes in Process Explorer
  • 26. So is it really a continuous crawl? • Short answer: “it depends on how much content you have”. • Overlapped/parallel crawls every 15 minutes by default. Items shown in index “within seconds”. • Fresher content, but NOT a “silver bullet”: continuous crawl is generally run alongside a periodic full crawl. • E.g. a full crawl is needed for new managed properties and for clean-up of inaccessible/deleted items.
  • 27. PowerShell and Search: what’s new? • New-SPEnterpriseSearchAnalyticsProcessingComponent • BUT there is no matching “Get” cmdlet, which is a pain when trying to work with the component. • New Get and Set cmdlets for SPEnterpriseSearchQueryProcessingComponent • You must use PowerShell if you want to scale a search topology and to avoid GUIDs • No interface within SPCA to modify the topology.
  • 28. Demo Modifying the Search topology using PowerShell
  • 29. What did we change?
  • 31. Migrating SP2010 Search to 2013 • Remember that in-place upgrades are not supported • Only the SP2010 Admin DB can be migrated to 2013. • SP2010 Search Admin DB contains: • content sources • crawl rules • start addresses • server name mapping • federated locations. • Properties are gathered during the first crawl • SP2010 Web Analytics does not migrate to SP2013. • Logical topology settings such as servers and components in the farm need to be manually recreated using PowerShell. • SP2013 can crawl SharePoint 2003/2007/2010 farms to facilitate a “Search first” upgrade
  • 32. SP2013 Search Boundary key changes
      Limit | 2010 | 2013
      Crawl Databases | 10 per Search SA | 5 per Search SA
      Crawl Components | 16 per Search SA | 2 per Search SA
      Index Partitions | 20 per Search SA, 128 total | 20 per Search SA
      Link DB | N/A | 2 per Search SA
      Query Processing Component | N/A | 1 per server
      Content Processing Component | N/A | 1 per server
      Analytics Processing Component | N/A | 6 per Search SA
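When planning a scaled-out topology it can help to sanity-check the planned component counts against the 2013 limits on slide 32 before touching PowerShell. The sketch below is illustrative only: the limit values are copied from the 2013 column of that slide (Release Preview era, so they may differ at RTM), while the dictionary keys, component labels and function names are hypothetical and do not correspond to any SharePoint API.

```python
# Per-Search-service-application limits, copied from the 2013 column of the
# boundary slide. Key names are made up for this sketch.
LIMITS_PER_SEARCH_SA = {
    "crawl_databases": 5,
    "crawl_components": 2,
    "index_partitions": 20,
    "link_databases": 2,
    "analytics_processing_components": 6,
}

# Component types the slide limits to one instance per server.
ONE_PER_SERVER = ("query_processing", "content_processing")


def check_service_app(planned_counts):
    """Return a list of limits that the planned counts would exceed."""
    problems = []
    for name, limit in LIMITS_PER_SEARCH_SA.items():
        count = planned_counts.get(name, 0)
        if count > limit:
            problems.append(f"{name}: planned {count}, limit {limit}")
    return problems


def check_servers(components_by_server):
    """Return servers that host more than one of a one-per-server component."""
    problems = []
    for server, components in components_by_server.items():
        for kind in ONE_PER_SERVER:
            count = components.count(kind)
            if count > 1:
                problems.append(
                    f"{server}: {count} {kind} components, limit 1 per server"
                )
    return problems


if __name__ == "__main__":
    print(check_service_app({"crawl_components": 3, "index_partitions": 20}))
    print(check_servers({
        "app1": ["query_processing", "query_processing", "index"],
        "app2": ["content_processing"],
    }))
```

An empty list from both functions means the plan sits within the limits listed on the slide; it says nothing about whether the hardware can actually carry it.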
  • 33. Gotchas / considerations • It is suggested that the Search and distributed cache services are split for large implementations • Impacts the “starting” topology for larger customers • High resource requirements as discussed • Some Search features deprecated / removed (see http://technet.microsoft.com/en-us/library/ff607742(v=office.15).aspx#section1): • No migration path for SP2010 Foundation Search settings • No means of modifying Search topology via UI • No Search SOAP Web service: http://server/site/_vti_bin/search.asmx is no more. Use CSOM/REST! • No Search RSS due to lack of claims support • No Search SQL Syntax • No support for docpush.exe to “push” items into the index (possible in FAST)
  • 34. What about Foundation? • SharePoint Foundation 2013 Search capabilities are now based on the same search implementation as SharePoint Server 2013. • If using the Farm Configuration Wizard (AKA “white wizard”) in SP2013 RP, a Search Service app is created. • However, the PowerShell cmdlets required to scale out require a Server license. • RTM may be different. Any input welcome. • My thoughts: Appropriate only for small implementations due to single server limitation in release preview. https://www.nothingbutsharepoint.com/sites/itpro/Pages/Search-in-SharePoint-2013-Foundation-Versus-Full-Blown-Server.aspx
  • 35. The FCW solving all of our problems!??* *This is a joke. The Farm Config Wizard rarely solves problems.
  • 36. Summary • SP2013 brings a bunch of cool new native Search functionality that is an evolution of 2010 functionality. • Most FAST features are now integrated • 2013 Search is resource hungry – we must plan for this! • Continuous crawl can replace incremental but still requires full crawls • PowerShell required for topology changes – brush up those skills!

Editor's Notes

  1. In SharePoint Server 2010, host distribution rules are used to associate a host with a specific crawl database. Because of changes in the search system architecture, SharePoint Server 2013 does not use host distribution rules. Instead, Search service application administrators can determine whether the crawl database should be rebalanced by monitoring the Databases view in the crawl log. In a result source, you can also restrict queries to a subset of content by using a query transform. For example, the pre-defined "Local Video Results" result source uses a query transform to return only video results from the local SharePoint index. In SharePoint Server 2010, you configured this kind of query restriction by using search scopes. http://technet.microsoft.com/en-us/library/ff607742(v=office.15).aspx#section3
  2. The event store provides information on front-end events to the analytics processing component. E.g. the number of times an item is viewed to improve relevancy.
  3. We do support 3rd party iFilters, but a format handler is not the same as an iFilter. Nugget: You cannot replace the new PDF format handler in SP2013 with a 3rd party iFilter (Adobe, Foxit etc.), so if SP2013 PDF search isn’t your cup of tea you are stuck with it (for now; a design change request has been submitted by me to get this modified so we can override it).
  4. Technically in sp2013 the index partition (replica) and query component are still together on the same machine. We have a new component, the query processing component that submits queries to the index component but the index component still does the same job as the old 2010 query component by sending back the query response to the query processing component. You still need one index component per partition replica.
  5. Add Microsoft topologies from clients
  6. Why so much space for index component? What changes need 4 * index space?
  7. http://technet.microsoft.com/en-us/library/45dc078f-de39-425c-a94f-1de78b68052e#reference
  8. http://technet.microsoft.com/en-us/library/cc262787.aspx#Search
  9. Continuous crawl is lightweight: it has no retry logic, so it skips inaccessible items, hence the need for a full crawl to clean up and remove items that are no longer available. For example, an incremental crawl will skip a web app if it isn’t online but won’t retry. A full crawl will retry and count the failures; if it fails x times in x days it will remove the failed entry. In short, SharePoint Search cannot survive by continuous crawls alone.
  10. Add Microsoft topologies from clients