SlideShare a Scribd company logo
1 of 19
The Lifecycle of a FAST
Search Implementation
    Rem Purushothaman
        Search Practice Lead
Rem Purushothaman
                Search Practice Lead

                Contact:
                rem.purushothaman@perficient.com
                312.589.3371
                @RemSearchPro


www.perficient.com
Prepare
Prepare

You’ve decided to implement FAST. Now what?
      Build your team. What kind of skill sets do you need?

            Solution Architect

            Project Manager

            Business Analyst

            SharePoint Developers

            QA and Testers

            Operations Personnel

      Get your team trained on FAST Search

      Understand at a high level what needs to be accomplished in each of the project phases

            Envision, Plan, Build, Stabilize, Deploy

            Operations
Envision
Envision

Determine what needs to be accomplished.
      High level search requirements

            Custom UI for Search?

            Identify content that needs to be crawled and indexed. Do you need custom connectors?

            How do you measure relevancy?

            Security Considerations

            Linguistics (Synonyms, Spell Check Exceptions)

            Integration to other systems

      Initial Architecture and Environment Sizing

            Number of servers (Dev, QA, Staging/Performance, Production)

            Failover, Index Redundancy, High Availability

            Initial specifications for crawling and indexing performance

            Initial specifications for query and search performance
Plan
Plan

This is the most important phase of any search project.
      Gather detailed search requirements for the UI

            Identify metadata that needs to be displayed in the results. Are multiple search result
             pages needed?

            Identify Refiners for each search result page

            Is there a need for a custom UI and web parts?

            If necessary, design the services layer and integration to external systems

      Content Sources

            Identify content sources (crawl rules) and access restrictions (security considerations)

            Identify content metadata (properties, fields, elements) to be crawled

            Estimate work required to crawl content using custom connectors

            Identify special security requirement. If necessary, map custom security model to Active
             Directory for security trimming

            Determine if any of the content sources have to be cleaned up before crawling
Plan

   Crawling and Indexing

         Map crawl properties (content metadata) to managed properties (indexed items)

         Identify custom relevancy models

         Identify managed properties to be indexed

         Identify full text index priority

         Identify Linguistic Components - keywords, synonyms, best bets, type ahead

   Testing

         Create content set for testing

         Create plan for performance testing

   Operations

         Identify plans for incremental updates and deletes

         Create plans for identifying and managing the incremental growth of the index

         Identify how to monitor and manage search issues
Build
Build

Put it all together. Good planning will have big payoffs.
       Build the UI and Services Layer

             Search and Search Results Pages including Advanced Search

             If necessary, build the Services Layer and integrate to external systems

             Custom Web Parts (refiners, federation)

       Crawling and Indexing

             Set up Content Sources (Custom Connectors) and Crawl Rules

                 Optional: Custom pipeline development

                 Optional: External process to pre-process & scrub the content

                 Optional: Map custom security to AD

             Set up Crawled and Managed Properties

             Set up Type Ahead, Keywords, Best Bests, Synonyms, Refiners

             Crawl and Index the Content (rinse and repeat)
Stabilize
Stabilize

Getting results back from search doesn’t mean the results are valid.

        Application and Search Related Testing

              Its critical that the QA understand how FAST works to test it properly

              Use external tools to test queries outside of the SharePoint Search Center

              Validate search result relevancy, document counts, and refiner counts

              Validate Security Trimming

              Validate Linguistics (spell check, synonyms, stemming, stop words, etc..)

        Performance Testing

              Ideally, the performance test environment should be approximately half the size of the
               production environment

              Performance test early in the project life cycle and then on a regular basis

              Compile a list of expensive queries that will really stress the system

              Determine the max QPS system can handle

              Determine max number of documents that can be supported in a column
Deploy
Deploy

For the most, deploying a FAST search solution is just like deploying a SP solution

   Create PowerShell scripts for all the FAST related deployment items

         Content Source, Crawl Rules

         Keywords (with Best Bets, Synonyms, etc…), Type Ahead, Spell Check Exceptions

         Make the scripts generic (try not to hard code environment specific values)

   Test the FAST deployment scripts through each of the environments (dev, test, staging)

   Validate crawler access to all content source systems

   In production, pre-populate (crawl and index) the content ahead of the application deployment
Best Practices
Best Practices

   Get key people trained in FAST Search

   There is no substitute for good planning and design. This pays huge dividends in terms of saving
    time from having to re-crawl and re-index your content

   Do you best to prepare the content to have consistent and clean metadata.

   Introduce search organically within your organization. Start with a pilot group and grow from
    there

   Use the search engine for search

         Don’t use it as your content repository

         Don’t use it as a way to populate lists in the UI

   Keep the number of queries from a UI page to a minimum. Bigger search results are sometimes
    better than multiple search queries from a single page.

   Don’t index the world. Start with a manageable content set and add content in a phases

   Index only a small portion in your dev and test environments. This will make you agile and save
    you a lot of time during the build and stabilize phases.
Best Practices

   Be prepared to handle growth. Have a plan in place. Indexes can get large quickly

   Do a through job with performance testing and sizing. This will pay off in terms of performance
    and scalability in production

   Understand your performance requirements (number of users, QPS) and build your environment
    with excess capacity. It gives you breathing room for spikes and unexpected scenarios

   A fail over/redundant farm is highly recommended for the production environment. Besides the
    obvious benefits, a redundant farm helps with deployments

   Initial environment architecture and sizing will always be guess until performance testing
    validates environment architecture and size. Be prepared to add more servers

   Have measurable metrics that operations personnel can use for monitoring the health of FAST
Q&A

More Related Content

Similar to Lifecycle of a FAST Search Implementation

ESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchAgnes Molnar
 
Large scale agile development practices
Large scale agile development practicesLarge scale agile development practices
Large scale agile development practicesSkills Matter
 
Best practice adoption (and lack there of)
Best practice adoption (and lack there of)Best practice adoption (and lack there of)
Best practice adoption (and lack there of)John Pape
 
Advanced Application Lifecycle Managment
Advanced Application Lifecycle ManagmentAdvanced Application Lifecycle Managment
Advanced Application Lifecycle ManagmentSalesforce Developers
 
Enterprise Search in SharePoint 2013
Enterprise Search in SharePoint 2013Enterprise Search in SharePoint 2013
Enterprise Search in SharePoint 2013Findwise
 
Keynote: Harnessing the power of Elasticsearch for simplified search
Keynote: Harnessing the power of Elasticsearch for simplified searchKeynote: Harnessing the power of Elasticsearch for simplified search
Keynote: Harnessing the power of Elasticsearch for simplified searchElasticsearch
 
Implementing Site Search in CQ5 / AEM
Implementing Site Search in CQ5 / AEMImplementing Site Search in CQ5 / AEM
Implementing Site Search in CQ5 / AEMrtpaem
 
Developer 2: Building A DevOps Toolchain
Developer 2: Building A DevOps ToolchainDeveloper 2: Building A DevOps Toolchain
Developer 2: Building A DevOps ToolchainInflectra
 
Naukri Search Team achievements, 2009-2010
Naukri Search Team achievements, 2009-2010Naukri Search Team achievements, 2009-2010
Naukri Search Team achievements, 2009-2010Aditya Varun Chadha
 
AvePoint - SharePoint App Lifecycle Mgmnt
AvePoint - SharePoint App Lifecycle MgmntAvePoint - SharePoint App Lifecycle Mgmnt
AvePoint - SharePoint App Lifecycle MgmntMary Leigh Mackie
 
MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchMetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchAgnes Molnar
 
Planning SharePoint 2013 Search for IT PROs
Planning SharePoint 2013 Search for IT PROsPlanning SharePoint 2013 Search for IT PROs
Planning SharePoint 2013 Search for IT PROsBenjamin Athawes
 
Roadmap to Enterprise Quality
Roadmap to Enterprise QualityRoadmap to Enterprise Quality
Roadmap to Enterprise QualityJeff Bramwell
 
Martin Woodward - DevOps Habits
Martin Woodward  - DevOps HabitsMartin Woodward  - DevOps Habits
Martin Woodward - DevOps HabitsPeter Laker
 
1 Ads
1 Ads1 Ads
1 Adslcbj
 
SPSHOU - Mastering SharePoint Migration Planning
SPSHOU - Mastering SharePoint Migration PlanningSPSHOU - Mastering SharePoint Migration Planning
SPSHOU - Mastering SharePoint Migration PlanningChristian Buckley
 
Build your open source data science platform
Build your open source data science platformBuild your open source data science platform
Build your open source data science platformDavid Talby
 
Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineSalford Systems
 

Similar to Lifecycle of a FAST Search Implementation (20)

ESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 SearchESPC13 - 10 Things I Like in SharePoint 2013 Search
ESPC13 - 10 Things I Like in SharePoint 2013 Search
 
Large scale agile development practices
Large scale agile development practicesLarge scale agile development practices
Large scale agile development practices
 
Best practice adoption (and lack there of)
Best practice adoption (and lack there of)Best practice adoption (and lack there of)
Best practice adoption (and lack there of)
 
Advanced Application Lifecycle Managment
Advanced Application Lifecycle ManagmentAdvanced Application Lifecycle Managment
Advanced Application Lifecycle Managment
 
Enterprise Search in SharePoint 2013
Enterprise Search in SharePoint 2013Enterprise Search in SharePoint 2013
Enterprise Search in SharePoint 2013
 
Vsts
VstsVsts
Vsts
 
Project
ProjectProject
Project
 
Keynote: Harnessing the power of Elasticsearch for simplified search
Keynote: Harnessing the power of Elasticsearch for simplified searchKeynote: Harnessing the power of Elasticsearch for simplified search
Keynote: Harnessing the power of Elasticsearch for simplified search
 
Implementing Site Search in CQ5 / AEM
Implementing Site Search in CQ5 / AEMImplementing Site Search in CQ5 / AEM
Implementing Site Search in CQ5 / AEM
 
Developer 2: Building A DevOps Toolchain
Developer 2: Building A DevOps ToolchainDeveloper 2: Building A DevOps Toolchain
Developer 2: Building A DevOps Toolchain
 
Naukri Search Team achievements, 2009-2010
Naukri Search Team achievements, 2009-2010Naukri Search Team achievements, 2009-2010
Naukri Search Team achievements, 2009-2010
 
AvePoint - SharePoint App Lifecycle Mgmnt
AvePoint - SharePoint App Lifecycle MgmntAvePoint - SharePoint App Lifecycle Mgmnt
AvePoint - SharePoint App Lifecycle Mgmnt
 
MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 SearchMetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
MetaVis Webinar - 10 Things I Like in SharePoint 2013 Search
 
Planning SharePoint 2013 Search for IT PROs
Planning SharePoint 2013 Search for IT PROsPlanning SharePoint 2013 Search for IT PROs
Planning SharePoint 2013 Search for IT PROs
 
Roadmap to Enterprise Quality
Roadmap to Enterprise QualityRoadmap to Enterprise Quality
Roadmap to Enterprise Quality
 
Martin Woodward - DevOps Habits
Martin Woodward  - DevOps HabitsMartin Woodward  - DevOps Habits
Martin Woodward - DevOps Habits
 
1 Ads
1 Ads1 Ads
1 Ads
 
SPSHOU - Mastering SharePoint Migration Planning
SPSHOU - Mastering SharePoint Migration PlanningSPSHOU - Mastering SharePoint Migration Planning
SPSHOU - Mastering SharePoint Migration Planning
 
Build your open source data science platform
Build your open source data science platformBuild your open source data science platform
Build your open source data science platform
 
Machine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search EngineMachine Learned Relevance at A Large Scale Search Engine
Machine Learned Relevance at A Large Scale Search Engine
 

More from Perficient, Inc.

Driving Strong 2020 Holiday Season Results
Driving Strong 2020 Holiday Season ResultsDriving Strong 2020 Holiday Season Results
Driving Strong 2020 Holiday Season ResultsPerficient, Inc.
 
Transforming Pharmacovigilance Workflows with AI & Automation
Transforming Pharmacovigilance Workflows with AI & Automation Transforming Pharmacovigilance Workflows with AI & Automation
Transforming Pharmacovigilance Workflows with AI & Automation Perficient, Inc.
 
The Secret to Acquiring and Retaining Customers in Financial Services
The Secret to Acquiring and Retaining Customers in Financial ServicesThe Secret to Acquiring and Retaining Customers in Financial Services
The Secret to Acquiring and Retaining Customers in Financial ServicesPerficient, Inc.
 
Oracle Strategic Modeling Live: Defined. Discussed. Demonstrated.
Oracle Strategic Modeling Live: Defined. Discussed. Demonstrated.Oracle Strategic Modeling Live: Defined. Discussed. Demonstrated.
Oracle Strategic Modeling Live: Defined. Discussed. Demonstrated.Perficient, Inc.
 
Content, Commerce, and... COVID
Content, Commerce, and... COVIDContent, Commerce, and... COVID
Content, Commerce, and... COVIDPerficient, Inc.
 
Centene's Financial Transformation Journey: A OneStream Success Story
Centene's Financial Transformation Journey: A OneStream Success StoryCentene's Financial Transformation Journey: A OneStream Success Story
Centene's Financial Transformation Journey: A OneStream Success StoryPerficient, Inc.
 
Automate Medical Coding With WHODrug Koda
Automate Medical Coding With WHODrug KodaAutomate Medical Coding With WHODrug Koda
Automate Medical Coding With WHODrug KodaPerficient, Inc.
 
Preparing for Your Oracle, Medidata, and Veeva CTMS Migration Project
Preparing for Your Oracle, Medidata, and Veeva CTMS Migration ProjectPreparing for Your Oracle, Medidata, and Veeva CTMS Migration Project
Preparing for Your Oracle, Medidata, and Veeva CTMS Migration ProjectPerficient, Inc.
 
Accelerating Partner Management: How Manufacturers Can Navigate Covid-19
Accelerating Partner Management: How Manufacturers Can Navigate Covid-19Accelerating Partner Management: How Manufacturers Can Navigate Covid-19
Accelerating Partner Management: How Manufacturers Can Navigate Covid-19Perficient, Inc.
 
The Critical Role of Audience Intelligence with Eric Enge and Rand Fishkin
The Critical Role of Audience Intelligence with Eric Enge and Rand FishkinThe Critical Role of Audience Intelligence with Eric Enge and Rand Fishkin
The Critical Role of Audience Intelligence with Eric Enge and Rand FishkinPerficient, Inc.
 
Cardtronics Future Ready with Oracle EPM Cloud
Cardtronics Future Ready with Oracle EPM CloudCardtronics Future Ready with Oracle EPM Cloud
Cardtronics Future Ready with Oracle EPM CloudPerficient, Inc.
 
Teams Summit - What is New and Coming
Teams Summit -  What is New and ComingTeams Summit -  What is New and Coming
Teams Summit - What is New and ComingPerficient, Inc.
 
Empower Your Organization with Teams & Remote Work Crisis Management
Empower Your Organization with Teams & Remote Work Crisis ManagementEmpower Your Organization with Teams & Remote Work Crisis Management
Empower Your Organization with Teams & Remote Work Crisis ManagementPerficient, Inc.
 
Adoption & Change Management Overview
Adoption & Change Management OverviewAdoption & Change Management Overview
Adoption & Change Management OverviewPerficient, Inc.
 
Microsoft Teams: Measuring Activity of Employees Working from Home
Microsoft Teams: Measuring Activity of Employees Working from HomeMicrosoft Teams: Measuring Activity of Employees Working from Home
Microsoft Teams: Measuring Activity of Employees Working from HomePerficient, Inc.
 
Securing Teams with Microsoft 365 Security for Remote Work
Securing Teams with Microsoft 365 Security for Remote WorkSecuring Teams with Microsoft 365 Security for Remote Work
Securing Teams with Microsoft 365 Security for Remote WorkPerficient, Inc.
 
Infrastructure Best Practices for Teams Remote Workers
Infrastructure Best Practices for Teams Remote WorkersInfrastructure Best Practices for Teams Remote Workers
Infrastructure Best Practices for Teams Remote WorkersPerficient, Inc.
 
Accelerate Adoption for Microsoft Teams
Accelerate Adoption for Microsoft TeamsAccelerate Adoption for Microsoft Teams
Accelerate Adoption for Microsoft TeamsPerficient, Inc.
 
Preparing for Project Cortex and the Future of Knowledge Management
Preparing for Project Cortex and the Future of Knowledge ManagementPreparing for Project Cortex and the Future of Knowledge Management
Preparing for Project Cortex and the Future of Knowledge ManagementPerficient, Inc.
 
Utilizing Microsoft 365 Security for Remote Work
Utilizing Microsoft 365 Security for Remote Work Utilizing Microsoft 365 Security for Remote Work
Utilizing Microsoft 365 Security for Remote Work Perficient, Inc.
 

More from Perficient, Inc. (20)

Driving Strong 2020 Holiday Season Results
Driving Strong 2020 Holiday Season ResultsDriving Strong 2020 Holiday Season Results
Driving Strong 2020 Holiday Season Results
 
Transforming Pharmacovigilance Workflows with AI & Automation
Transforming Pharmacovigilance Workflows with AI & Automation Transforming Pharmacovigilance Workflows with AI & Automation
Transforming Pharmacovigilance Workflows with AI & Automation
 
The Secret to Acquiring and Retaining Customers in Financial Services
The Secret to Acquiring and Retaining Customers in Financial ServicesThe Secret to Acquiring and Retaining Customers in Financial Services
The Secret to Acquiring and Retaining Customers in Financial Services
 
Oracle Strategic Modeling Live: Defined. Discussed. Demonstrated.
Oracle Strategic Modeling Live: Defined. Discussed. Demonstrated.Oracle Strategic Modeling Live: Defined. Discussed. Demonstrated.
Oracle Strategic Modeling Live: Defined. Discussed. Demonstrated.
 
Content, Commerce, and... COVID
Content, Commerce, and... COVIDContent, Commerce, and... COVID
Content, Commerce, and... COVID
 
Centene's Financial Transformation Journey: A OneStream Success Story
Centene's Financial Transformation Journey: A OneStream Success StoryCentene's Financial Transformation Journey: A OneStream Success Story
Centene's Financial Transformation Journey: A OneStream Success Story
 
Automate Medical Coding With WHODrug Koda
Automate Medical Coding With WHODrug KodaAutomate Medical Coding With WHODrug Koda
Automate Medical Coding With WHODrug Koda
 
Preparing for Your Oracle, Medidata, and Veeva CTMS Migration Project
Preparing for Your Oracle, Medidata, and Veeva CTMS Migration ProjectPreparing for Your Oracle, Medidata, and Veeva CTMS Migration Project
Preparing for Your Oracle, Medidata, and Veeva CTMS Migration Project
 
Accelerating Partner Management: How Manufacturers Can Navigate Covid-19
Accelerating Partner Management: How Manufacturers Can Navigate Covid-19Accelerating Partner Management: How Manufacturers Can Navigate Covid-19
Accelerating Partner Management: How Manufacturers Can Navigate Covid-19
 
The Critical Role of Audience Intelligence with Eric Enge and Rand Fishkin
The Critical Role of Audience Intelligence with Eric Enge and Rand FishkinThe Critical Role of Audience Intelligence with Eric Enge and Rand Fishkin
The Critical Role of Audience Intelligence with Eric Enge and Rand Fishkin
 
Cardtronics Future Ready with Oracle EPM Cloud
Cardtronics Future Ready with Oracle EPM CloudCardtronics Future Ready with Oracle EPM Cloud
Cardtronics Future Ready with Oracle EPM Cloud
 
Teams Summit - What is New and Coming
Teams Summit -  What is New and ComingTeams Summit -  What is New and Coming
Teams Summit - What is New and Coming
 
Empower Your Organization with Teams & Remote Work Crisis Management
Empower Your Organization with Teams & Remote Work Crisis ManagementEmpower Your Organization with Teams & Remote Work Crisis Management
Empower Your Organization with Teams & Remote Work Crisis Management
 
Adoption & Change Management Overview
Adoption & Change Management OverviewAdoption & Change Management Overview
Adoption & Change Management Overview
 
Microsoft Teams: Measuring Activity of Employees Working from Home
Microsoft Teams: Measuring Activity of Employees Working from HomeMicrosoft Teams: Measuring Activity of Employees Working from Home
Microsoft Teams: Measuring Activity of Employees Working from Home
 
Securing Teams with Microsoft 365 Security for Remote Work
Securing Teams with Microsoft 365 Security for Remote WorkSecuring Teams with Microsoft 365 Security for Remote Work
Securing Teams with Microsoft 365 Security for Remote Work
 
Infrastructure Best Practices for Teams Remote Workers
Infrastructure Best Practices for Teams Remote WorkersInfrastructure Best Practices for Teams Remote Workers
Infrastructure Best Practices for Teams Remote Workers
 
Accelerate Adoption for Microsoft Teams
Accelerate Adoption for Microsoft TeamsAccelerate Adoption for Microsoft Teams
Accelerate Adoption for Microsoft Teams
 
Preparing for Project Cortex and the Future of Knowledge Management
Preparing for Project Cortex and the Future of Knowledge ManagementPreparing for Project Cortex and the Future of Knowledge Management
Preparing for Project Cortex and the Future of Knowledge Management
 
Utilizing Microsoft 365 Security for Remote Work
Utilizing Microsoft 365 Security for Remote Work Utilizing Microsoft 365 Security for Remote Work
Utilizing Microsoft 365 Security for Remote Work
 

Recently uploaded

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 

Recently uploaded (20)

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 

Lifecycle of a FAST Search Implementation

  • 1. The Lifecycle of a FAST Search Implementation Rem Purushothaman Search Practice Lead
  • 2. Rem Purushothaman Search Practice Lead Contact: rem.purushothaman@perficient.com 312.589.3371 @RemSearchPro www.perficient.com
  • 4. Prepare You’ve decided to implement FAST. Now what?  Build your team. What kind of skill sets do you need?  Solution Architect  Project Manager  Business Analyst  SharePoint Developers  QA and Testers  Operations Personnel  Get your team trained on FAST Search  Understand at a high level what needs to be accomplished in each of the project phases  Envision, Plan, Build, Stabilize, Deploy  Operations
  • 6. Envision Determine what needs to be accomplished.  High level search requirements  Custom UI for Search?  Identify content that needs to be crawled and indexed. Do you need custom connectors?  How do you measure relevancy?  Security Considerations  Linguistics (Synonyms, Spell Check Exceptions)  Integration to other systems  Initial Architecture and Environment Sizing  Number of servers (Dev, QA, Staging/Performance, Production)  Failover, Index Redundancy, High Availability  Initial specifications for crawling and indexing performance  Initial specifications for query and search performance
  • 8. Plan This is the most important phase of any search project.  Gather detailed search requirements for the UI  Identify metadata that needs to be displayed in the results. Are multiple search result pages needed?  Identify Refiners for each search result page  Is there a need for a custom UI and web parts?  If necessary, design the services layer and integration to external systems  Content Sources  Identify content sources (crawl rules) and access restrictions (security considerations)  Identify content metadata (properties, fields, elements) to be crawled  Estimate work required to crawl content using custom connectors  Identify special security requirement. If necessary, map custom security model to Active Directory for security trimming  Determine if any of the content sources have to be cleaned up before crawling
  • 9. Plan  Crawling and Indexing  Map crawl properties (content metadata) to managed properties (indexed items)  Identify custom relevancy models  Identify managed properties to be indexed  Identify full text index priority  Identify Linguistic Components - keywords, synonyms, best bets, type ahead  Testing  Create content set for testing  Create plan for performance testing  Operations  Identify plans for incremental updates and deletes  Create plans for identifying and managing the incremental growth of the index  Identify how to monitor and manage search issues
  • 10. Build
  • 11. Build Put it all together. Good planning will have big payoffs.  Build the UI and Services Layer  Search and Search Results Pages including Advanced Search  If necessary, build the Services Layer and integrate to external systems  Custom Web Parts (refiners, federation)  Crawling and Indexing  Set up Content Sources (Custom Connectors) and Crawl Rules  Optional: Custom pipeline development  Optional: External process to pre-process & scrub the content  Optional: Map custom security to AD  Set up Crawled and Managed Properties  Set up Type Ahead, Keywords, Best Bests, Synonyms, Refiners  Crawl and Index the Content (rinse and repeat)
  • 13. Stabilize Getting results back from search doesn’t mean the results are valid.  Application and Search Related Testing  Its critical that the QA understand how FAST works to test it properly  Use external tools to test queries outside of the SharePoint Search Center  Validate search result relevancy, document counts, and refiner counts  Validate Security Trimming  Validate Linguistics (spell check, synonyms, stemming, stop words, etc..)  Performance Testing  Ideally, the performance test environment should be approximately half the size of the production environment  Performance test early in the project life cycle and then on a regular basis  Compile a list of expensive queries that will really stress the system  Determine the max QPS system can handle  Determine max number of documents that can be supported in a column
  • 15. Deploy For the most, deploying a FAST search solution is just like deploying a SP solution  Create PowerShell scripts for all the FAST related deployment items  Content Source, Crawl Rules  Keywords (with Best Bets, Synonyms, etc…), Type Ahead, Spell Check Exceptions  Make the scripts generic (try not to hard code environment specific values)  Test the FAST deployment scripts through each of the environments (dev, test, staging)  Validate crawler access to all content source systems  In production, pre-populate (crawl and index) the content ahead of the application deployment
  • 17. Best Practices  Get key people trained in FAST Search  There is no substitute for good planning and design. This pays huge dividends in terms of saving time from having to re-crawl and re-index your content  Do you best to prepare the content to have consistent and clean metadata.  Introduce search organically within your organization. Start with a pilot group and grow from there  Use the search engine for search  Don’t use it as your content repository  Don’t use it as a way to populate lists in the UI  Keep the number of queries from a UI page to a minimum. Bigger search results are sometimes better than multiple search queries from a single page.  Don’t index the world. Start with a manageable content set and add content in a phases  Index only a small portion in your dev and test environments. This will make you agile and save you a lot of time during the build and stabilize phases.
  • 18. Best Practices  Be prepared to handle growth. Have a plan in place. Indexes can get large quickly  Do a through job with performance testing and sizing. This will pay off in terms of performance and scalability in production  Understand your performance requirements (number of users, QPS) and build your environment with excess capacity. It gives you breathing room for spikes and unexpected scenarios  A fail over/redundant farm is highly recommended for the production environment. Besides the obvious benefits, a redundant farm helps with deployments  Initial environment architecture and sizing will always be guess until performance testing validates environment architecture and size. Be prepared to add more servers  Have measurable metrics that operations personnel can use for monitoring the health of FAST
  • 19. Q&A