This document summarizes a presentation about cloud hybrid search in SharePoint. It discusses:
1. The benefits of cloud hybrid search such as access from anywhere, a consistent user experience, and scalable index storage.
2. Some current limitations of cloud hybrid search including slower performance, lack of features in SharePoint Online, and complex administration.
3. Techniques for implementing cloud hybrid search including connecting an on-premises search service application to Office 365 and setting up content sources and search centers.
4. Topics covered include indexing and queries, a case study of a large university implementation, and tuning the cloud hybrid search experience.
4. Focused on Search and
SharePoint since 2004
Longtime
Search Nerd
• CTO, BA Insight
• Senior PM, Microsoft
• VP, FAST
• SVP, LingoMotors
About Jeff Fried
Passionate About
• Search
• SharePoint
• Search-driven
applications
• Information Strategy
Blog:
BAinsight.com/blog
Technet Column
“A View from the
Crawlspace”
jeff.fried@bainsight.com
16. Access anywhere
Consistent user experience
Unified search results
No upgrades
No infrastructure mgt
Index storage scalable
Benefits of
Cloud Hybrid Search
17. Reduce Your Footprint
Servers
Volume of Content
(indexable items) Pattern
On-prem Search
Farm
Cloud Hybrid
Search
0-10 million items Small 4 App + 2 DB 1 or 2
10-40 million items Medium 12 App + 2 DB 2
40-100 million items Large 28 App + 4 DB 2
400 million items XL example (SP2016) 86 App + 4DB 2 or 3
18. SharePoint Server
(On-premises or Hosted)
Office 365
SharePoint Online Content
Onedrive for Business Content
SharePoint Content
Cloud Hybrid Search
23. Setting up Cloud Hybrid Search
•Create
• Cloud Search Service Application in
SharePoint Server 2016
•Set up
• search architecture in SharePoint
Server 2016 for cloud hybrid search
•Connect
• your Cloud Search Service Application
to your Office 365 tenant
•Create
• a content source to crawl for cloud
hybrid search
•Setup
• Search Center to validate
hybrid search results in O365
•Start
• full crawl of on-premises
content for cloud hybrid search
•Verify
• that cloud hybrid search works
Tune
• cloud hybrid search
experiences
25. Setup for Support Search
The Support Search vertical only searches sites that
are relevant to the Support team.
It uses Local SharePoint results plus a filter on
which sites to include in the search results
Result source query:
{searchTerms} (
Path:»http://sp2010» OR
Path:»file://fileshare» OR
Path:»http://demohybrid...
/../supportforum»)
SharePoint Online Support Search
27. Search
Unified search across
SharePoint on-premises
and Office 365 content
and people
SharePoint 2013/2016
Deliverunified search results
from Office 365 and on-
premises in a single search
36. Issues with Cloud Hybrid Search (1)
Cloud Hybrid Search "annoyances"
Performance Characteristics
slower query latency for on-prem queries against Cloud SSA
SharePoint Online Limitations
no synonyms
no site-level schema
no full trust code access
Hybrid Administration Weaknesses
clunky metadata mapping
can't remove on-premises search results from Cloud SSA
trickier to test & debug crawls
can't reset index from Cloud SSA
Be aware of these
& compensate for them
(Fixed in August PU)
(Semi-addressed in June PU)
And it’s getting better:
41. Item Limits and Pricing
Licensing: 1M items of external content in index for every 1TB storage in O365
1TB included by default
+ 0.5 GB per licensed O365 user
No limit on number of items from O365 in the index
Default throttling at 20M external items; current threshold at 25M
2000 users x 0.5 GB = 1TB
+ 1TB default = 2 TB total
-> 2M external items indexed
+ Can also buy the “Office 365 Extra File Storage” Add-on
$0.20/GB/Month = $200/TB/Month = $200/M items/Month
50,000 users x 0.5 GB = 25TB
+ 1TB default = 26 TB total
-> 26M external items indexed
42. Should I run index reset?
NO!
DeleteAllCloudHybridSearchContent()
https://blogs.technet.microsoft.com/beyondsharepoint/2016/07/07/cloud-hybrid-search-service-application-removing-items-from-the-office-365-search-index/
43. Issues with Cloud Hybrid Search (2)
43
Content Enrichment
no CEWS
no Entity Extraction
Security
no Custom Security Trimming
Can't crawl across Multiple Domains
Can't Crawl SP in Classic Auth Mode
Data Sovereignty
export-restricted content
can't be put in O365 index
Limitations of Cloud SSA
44. External Content
(on-premises and/or
in the cloud)
SharePoint Server
(On-premises or Hosted)
SPO Content
OneDrive Content
Connectors
SharePoint Content
Connector
Framework
Office 365
AutoClassifier
(app version)
CEWS
Custom
Processing
48. Mapping of Access Control Lists
Allow: S-1-5-21-1212121212-1212121212-1212 Allow: PUID-XXXX-XXXXXXXXXX
• User SIDs are mapped to PUIDs
• Group SIDs are mapped to Object IDs
• «Everyone» and «Authenticated users» are mapped to
«Everyone except external users»
Only AD Users and Groups,
Only from one domain
52. Issues with Cloud Hybrid Search OOB
Content Enrichment
no CEWS
no Entity Extraction
Security
no Custom Security Trimming
Can't crawl across Multiple Domains
Can't Crawl SP in Classic Auth Mode
Data Sovereignty
export-restricted content
can't be put in O365 index
Limitations of Cloud SSA BA Insight Solution
Connector Framework
AutoClassifier
Connector Framework
can 'map down' to AD groups
can 'map across' cross-domain
can crawl and map security
Federator
REMEMBER – An intranet project is not just a significant change project, it has the potential to be transformative to the way a company operates.
Adoption is key to achieving this, so a clear plan for engagement and communication is crucial, based around your three areas of focus…
REMEMBER – An intranet project is not just a significant change project, it has the potential to be transformative to the way a company operates.
Adoption is key to achieving this, so a clear plan for engagement and communication is crucial, based around your three areas of focus…
Remote
Result
Sources
REMEMBER – An intranet project is not just a significant change project, it has the potential to be transformative to the way a company operates.
Adoption is key to achieving this, so a clear plan for engagement and communication is crucial, based around your three areas of focus…
SharePoint Server 2013 and SharePoint Server 2016 provide two individual hybrid search scenarios, Cloud Hybrid Search introduced in August 2015 to SharePoint Server 2016 IT Preview and SharePoint Server 2013, in addition to the classic federated hybrid search scenario, introduced in SharePoint Server 2013. Cloud Hybrid Search
The Cloud Hybrid Search scenario represents the next generation in hybrid search and discovery. With the cloud hybrid search solution, both your on-premises and Office 365 crawled content is unified in a search index hosted in Office 365. When users query your search index in Office 365, they get search results from both on-premises and Office 365 content. The content metadata is encrypted when it’s transferred to the search index in Office 365, so the on-premises content remains secure.Federated Hybrid Search
Federated hybrid search is a hybrid search scenario in which a query issued by a user is federated or distributed across on-premises and Office 365 returning a set of results from each location as discrete entities. In a federated hybrid search scenario on-premises crawled content is stored on-premises in the search index and Office 365 content in the search index in Office 365 with no affinity between the two data sets. Federated hybrid search can be configured in inbound, outbound, or bi-directional hybrid topologies. Outbound
User searches from the SharePoint Server 2013 Search Center display hybrid results. This is called outbound hybrid search.Inbound
User searches from the SharePoint Online Search Center display hybrid results. This is called inbound hybrid search.
A SharePoint hybrid experience provides three layers of opportunity for customers of SharePoint Server 2016 that allow customers to take advantage of Office 365 innovation at their own pace, whether you are considering migrating to Office 365 or plan to maintain a hybrid model.
App Launcher
The App Launcher is a familiar feature in Office and it’s now been extended to SharePoint Server 2016. The App Launcher provides a common location to discover new apps and navigate SharePoint on-premises and Office 365.
Apps
The Apps represent the experiences a customer can implement and choose from with hybrid through the App Launcher. These are the core scenarios a customer can implement as part of their hybrid experience.
Data Discovery
To complete the hybrid scenarios, customers can choose to implement hybrid search that enables unified discovery of both content and people across SharePoint and Office 365 and enables the use of powerful capabilities such as the Office Graph.
REMEMBER – An intranet project is not just a significant change project, it has the potential to be transformative to the way a company operates.
Adoption is key to achieving this, so a clear plan for engagement and communication is crucial, based around your three areas of focus…
Data Sovereignty Laws
Safe Harbor agreement struck down this fall (US/EC)
New Russian localization law (went into effect in September
Currently 20+ countries also considering similar privacy laws
REMEMBER – An intranet project is not just a significant change project, it has the potential to be transformative to the way a company operates.
Adoption is key to achieving this, so a clear plan for engagement and communication is crucial, based around your three areas of focus…