Fusion 3.1 comes with exciting new features that will make your search more personal and better targeted. Join us for a webinar to learn more about Fusion's features, what's new in this release, and what's around the corner for Fusion.
Axa Assurance Maroc - Insurer Innovation Award 2024
Webinar: Fusion 3.1 - What's New
1.
2. Based in San Francisco
Offices in Bangalore, Bangkok,
New York City, Raleigh, Munich
Over 300 customers across the
Fortune 1000
Fusion, a Solr-powered platform
for search-driven apps
Consulting and support for
organizations using Solr
Produces the world’s largest open
source user conference dedicated
to Lucene/Solr
Lucidworks is the primary sponsor of
the Apache Solr project
Employs over 40% of the active
committers on the Solr project
Contributes over 70% of Solr's
open source codebase
40%
70%
3. If you can’t find it, then you can’t…
R&D Online
Retail
Customer
Insights
Fraud &
Compliance
Customer
Service
5. • Over 50 connectors to
integrate all your data
• Robust parsing framework
to seamlessly ingest all your
document types
• Point and click Indexing
configuration and iterative
simulation of results for full
control over your ETL process
• Your security model
enforced end-to-end from
ingest to search across your
different datasources
6. • Relevancy tuning: Point-and-click
query pipeline configuration allow
fine-grained control of results.
• Machine-driven relevancy: Signals
aggregation learn and automatically
tune relevancy and drive
recommendations out of the box .
• Powerful pipeline stages:
Customize fields, stages, synonyms,
boosts, facets, machine learning
models, your own scripted behavior,
and dozens of other powerful
search stages.
• Turnkey search UI
(Lucidworks AppStudio): Build a
sophisticated end-to-end search
application in just hours.
10. SECURITY BUILT-IN
RESTAPI
Admin UI
Lucidworks
View
LOGS FILE WEB DATABASE CLOUD
HDFS(Optional)
Fusion abstracts the
OS so you don't have
to worry about that.
Core Services
• • •
ETL and Query Pipelines
Recommenders/Signals
NLP
Machine Learning
Alerting and Messaging
Security
Connectors
Scheduling
Fusion Abstracts Open Source Bits
11. Core Services
• • •
NLP
Recommenders / Signals
Blob Storage
Pipelines
Scheduling
Alerting / Messaging
Connectors
RESTAPI
Admin UI
Lucidworks
View
LOGS FILE WEB DATABASE CLOUD
• Seamless integration of your
entire search & analytics
platform
• All capabilities exposed
through secured API's, so you
can use our UI or build your
own.
• End-to-end security policies
can be applied out of the box
to every aspect of your search
ecosystem.
• Distributed, fault-tolerant
scaling and supervision of
your entire search application
21. New APIs
• Links API - explore the links between Fusion objects.
• Jobs API - replaces the Scheduler API
• Tasks API - Jobs schedule Tasks, tasks are the things you do (i.e. make
this rest call)
• Groups API - Tag groups of Fusion Objects with identifiers
Improved APIs
• Jobs and Schedules are now separate (generally but this leaks into the
APIs)
• Spark Jobs API
• Objects API
23. Connector Enhancements
• JIRA Connector - security trimming, parsers and performance,
settings for field, timeout and retry.
• Web Connector - crawl JavaScript powered sites, authentication and
timeout settings
• Jive Connector - lists and maps
• Box.com connector - incremental crawling improvements, more
depth and exclusion settings
• Google Drive - batch incremental crawling, security trimming,
timeouts, index trash
24. New Parsers
• XML Parser - Parse XML separately into new documents
• HTML Parser - not Tika based, uses JSoup Selectors to extract HTML
or CSS into fields/documents
25. Query Pipeline
• Recommendation -> Boost With Signals
• User Recommendation Boosting -> Recommend Items for User
• Recommend Similar -> Recommend More Like This
28. Machine Learning Functionalities with Fusion 3.1
Fusion machine learning capabilities include:
• Recommendation and Personalization.
• Query intent and document classification.
• Learning to rank.
• Automatic doc clustering, anomaly detection, cluster labeling and topic
detection.
• NLP: synonym discovery and expansion via w2v; phrase detection using log
likelihood.
• Clickstream auto-tuning relevance and analysis.
• Multi-arm bandit experimentation.
29. Recommendation and Personalization :
Out of the box, multi-modal recommenders based on collaborative
filtering and content:
Use cases:
• Recommend item for user: select items that user likely to buy based on past click/
purchase behavior.
• Recommend item for query: help boost items based on query history.
• Item to item similarity: users who were interested in items like X were also
interested in items like Y and Z.
• Query to query similarity: suggest similar queries to provide query expansion.
Built in pipeline, spark job, job schedules and visualizations all provided.
30. Query relevancy and intent:
Query intent classification using Random Forest or Logistic Regression:
Use Cases:
• Predict product category of an incoming query to help reduce ambiguity.
• Predict category of a set of new product to be incorporated into catalog.
Learning to Rank:
• Use internal and external features of content and query to influence ranking.
• Leverage Solr’s re-ranking (&rq) capability.
• Support for: linear and non-linear models (libSVM, liblinear, RankLib)
31. Automatic Doc Clustering and Anomaly Detection :
Features :
• Automatic outlier detection.
• Automatic decision about number of clusters K. (in house algorithm)
• Automatic cluster topic detection. (in house algorithm)
• Ranking of document based on relevance to the topic.
• Minimum data science knowledge needed to use the module:
Extensive research has been done to help choose the best set of models; Good default
parameter settings that generalize well; Flexible pipeline.
Use Cases :
• Financial: group emails, news, research articles.
• Ecommerce: group product reviews, product descriptions.
32. NLP using Machine Learning:
Automatic phrase detection using log likelihood:
• Spark job to generate a table of phrases (freq co-occurred terms) such as “area rug”,
“interior paint” and “ipad case”.
Spell checking using edit distance:
• Match a tail query to head query or product name based on edit distance to correct typos:
e.g., ”wireless motam”->”wireless modem”.
Synonym detection/query expansion using Word2Vec:
• e.g., “refrigerator”->”freezer”, “phone”->”cellular”.