SlideShare a Scribd company logo
1 of 23
Download to read offline
GraphAware®
RELEVANT SEARCH
LEVERAGING KNOWLEDGE GRAPHS
WITH NEO4J
Alessandro Negro

Chief Scientist @ GraphAware
graphaware.com

@graph_aware, @AlessandroNegro
‣ The rise of Knowledge Graphs
‣ Relevant Search
‣ Knowledge Graphs for e-Commerce
‣ Infrastructure
‣ Conclusions
OUTLINE
GraphAware®
“Knowledge graphs provide contextual windows
into master data domains and the links between
domains”
KNOWLEDGE GRAPH
CONNECTING THE DOTS
GraphAware®
The Forrester Wave, Master Data Management
THE RISE OF
KNOWLEDGE GRAPHS
GraphAware®
E-Commerce
‣ Many data sources
‣ Marketing strategies
‣ Business goals
‣ Category hierarchies
‣ Searches

Enterprise Networks
‣ Uncover new opportunities, hidden leads



Finance
‣ Textual corpora such as financial
documents contain a wealth of
knowledge
‣ Structured knowledge of entities and
relationships
Medicine & Health
‣ Dynamic ontologies where data is
categorized and organised around
people, places, things and events
‣ Patterns in disease progression, causal
relations involving disease and
symptoms, new relationships previously
unrecognised

Criminal Investigation & Intelligence
‣ Obfuscated information
‣ Traceability to sources of information
GraphAware®
THE RISE OF
KNOWLEDGE GRAPHS
DATA SPARSITY

PROBLEM
GraphAware®
Collaborative Filtering
‣ Cold Start
Content Based Recommendation
‣ Missing Data
‣ Wrong Data

Text Search
‣ User agnostic
‣ Relevant Search













KNOWLEDGE GRAPH:
DATA CONVERGENCE
GraphAware®
RELEVANT SEARCH
GraphAware®
“Relevance is the practice of improving search
results for users by satisfying their information
needs in the context of a particular user
experience, while balancing how ranking
impacts business’s needs.”
RELEVANT SEARCH
DIMENSIONS
GraphAware®
KNOWLEDGE GRAPHS

THE MODEL
Search architecture must be able to handle highly heterogenous data
Knowledge Graphs represent the information structure for relevant search
Graphs are the right representation for:
‣ Information Extraction
‣ Recommendation Engines
‣ Context Representation
‣ Rule Engine
Critical aspects and peculiarities:
‣ Defined and controlled set of searchable Items
‣ Multiple category hierarchies
‣ Marketing strategy
‣ User feedback and interactions
‣ Supplier information
‣ Business constraints
THE USE CASE

E-COMMERCE
GraphAware®
→ Text search and catalog navigation as Sales People
KNOWLEDGE GRAPH

FOR E-COMMERCE
GraphAware®
INFRASTRUCTURE

A 10K-FOOT VIEW
GraphAware®
A graph centric approach
THE DATA FLOW
GraphAware®
‣ Async data ingestion
‣ Data Pipeline
‣ Single Neo4j Writer
‣ Microservice approach for
isolation and scalability
‣ Event notification
‣ Multiple views exported into
Elasticsearch
THE NEO4J ROLES
GraphAware®
‣ Single source of truth
‣ Cleansing
‣ Fast access to connected data
‣ Query
‣ Knowledge Graph store
‣ Merging External Data
‣ Existing Data Augmentation
Natural Language Processing
‣ Unsupervised Topic Identification
‣ Word2Vec
‣ Clustering (Label Propagation)
EXTERNALISE INTENSE
PROCESSES
GraphAware®
Recommendation model building
‣ Content-Based
‣ Collaborative Filtering (internal and
external)
Fast, Reliable and Easy-to-tune textual searches
‣ Multiple views for multiple scopes:
‣ Catalog Navigation and Search
‣ Faceting
‣ Product details page
‣ Product variants aggregation
‣ Autocomplete
‣ Suggestion
THE ELASTICSEARCH
ROLES
GraphAware®
→ It is not used as a database
Any components of relevance-scoring calculation
corresponding to a meaningful and measurable
information

Two techniques to control relevancy:
‣ Signal Modeling
‣ Ranking Function
Note: balance precision and recall
Multiple sources
CRAFTING

SIGNALS
GraphAware®
→ Users as a new source of information
GraphAware®
Profile-based personalisation:
‣ Explicit: Users provide profile
information
‣ Implicit: Profile created from user
interactions

Behavioural-Based personalisation
‣ Focus on User-Item Interaction
‣ Make explicit the relationships
among users and items
PERSONALISING

SEARCH
Tying personalisation back to search
‣ Query-time personalisation
‣ Index-time personalisation
→ Search for things, not for strings
CONCEPT

SEARCH
GraphAware®
Basic Approaches:
‣ Concept field (Manual Tagging)
‣ Synonyms

Content Augmentation (ML based)
‣ Co-occurrence
‣ Latent Semantic Analysis
‣ Latent Dirichlet Allocation
‣ Word2Vec
COMBINED SEARCH
APPROACHES
GraphAware®
Knowledge Graphs can
‣ store easy-to-query model
‣ gather data from multiple sources
‣ be easily extended

Search Engines can
‣ provide fast, reliable and easy-to-
tune textual search
‣ provide features like faceting,
autocomplete
CONCLUSION
GraphAware®
→ By combining them, it is possible to offer an unlimited
set of services to the end users
www.graphaware.com

@graph_aware
GraphAware
GraphAware®
world’s #1 Neo4j consultancy

More Related Content

What's hot

Talend AS A Product
Talend AS A ProductTalend AS A Product
Talend AS A Product
Abdul Manaf
 
Big data-science-oanyc
Big data-science-oanycBig data-science-oanyc
Big data-science-oanyc
Open Analytics
 

What's hot (20)

Spring Data Neo4j: Graph Power Your Enterprise Apps
Spring Data Neo4j: Graph Power Your Enterprise AppsSpring Data Neo4j: Graph Power Your Enterprise Apps
Spring Data Neo4j: Graph Power Your Enterprise Apps
 
5 Simple Steps to Unleash Big Data Talend Connect
5 Simple Steps to Unleash Big Data Talend Connect5 Simple Steps to Unleash Big Data Talend Connect
5 Simple Steps to Unleash Big Data Talend Connect
 
Machine Learning with PyCaret
Machine Learning with PyCaretMachine Learning with PyCaret
Machine Learning with PyCaret
 
Data democratised
Data democratisedData democratised
Data democratised
 
Building Intelligent Solutions with Graphs, Stefan Kolmar, Neo4j
Building Intelligent Solutions with Graphs, Stefan Kolmar, Neo4jBuilding Intelligent Solutions with Graphs, Stefan Kolmar, Neo4j
Building Intelligent Solutions with Graphs, Stefan Kolmar, Neo4j
 
Case Studies on Big-Data Processing and Streaming - Iranian Java User Group
Case Studies on Big-Data Processing and Streaming - Iranian Java User GroupCase Studies on Big-Data Processing and Streaming - Iranian Java User Group
Case Studies on Big-Data Processing and Streaming - Iranian Java User Group
 
Talend AS A Product
Talend AS A ProductTalend AS A Product
Talend AS A Product
 
Mutable data @ scale
Mutable data @ scaleMutable data @ scale
Mutable data @ scale
 
The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
The Art of Intelligence – A Practical Introduction Machine Learning for Oracl...
 
GraphQL Advanced
GraphQL AdvancedGraphQL Advanced
GraphQL Advanced
 
MLSD18. Automating Machine Learning Workflows
MLSD18. Automating Machine Learning WorkflowsMLSD18. Automating Machine Learning Workflows
MLSD18. Automating Machine Learning Workflows
 
GraphTour 2020 - Customer Journey with Neo4j Services
GraphTour 2020 - Customer Journey with Neo4j ServicesGraphTour 2020 - Customer Journey with Neo4j Services
GraphTour 2020 - Customer Journey with Neo4j Services
 
Big data-science-oanyc
Big data-science-oanycBig data-science-oanyc
Big data-science-oanyc
 
GraphQL and its schema as a universal layer for database access
GraphQL and its schema as a universal layer for database accessGraphQL and its schema as a universal layer for database access
GraphQL and its schema as a universal layer for database access
 
Cloud-Native Microservices
Cloud-Native MicroservicesCloud-Native Microservices
Cloud-Native Microservices
 
GraphTour Boston - Graphs for AI and ML
GraphTour Boston - Graphs for AI and MLGraphTour Boston - Graphs for AI and ML
GraphTour Boston - Graphs for AI and ML
 
schema.org, Linked Data's Gateway Drug
schema.org, Linked Data's Gateway Drugschema.org, Linked Data's Gateway Drug
schema.org, Linked Data's Gateway Drug
 
Deploying an End-to-End TigerGraph Enterprise Architecture using Kafka, Maria...
Deploying an End-to-End TigerGraph Enterprise Architecture using Kafka, Maria...Deploying an End-to-End TigerGraph Enterprise Architecture using Kafka, Maria...
Deploying an End-to-End TigerGraph Enterprise Architecture using Kafka, Maria...
 
Plume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis LibraryPlume - A Code Property Graph Extraction and Analysis Library
Plume - A Code Property Graph Extraction and Analysis Library
 
Big Analytics: Building Lasting Value
Big Analytics: Building Lasting ValueBig Analytics: Building Lasting Value
Big Analytics: Building Lasting Value
 

Similar to Relevant Search Leveraging Knowledge Graphs with Neo4j

EAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using HadoopEAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using Hadoop
DataWorks Summit
 

Similar to Relevant Search Leveraging Knowledge Graphs with Neo4j (20)

Connect, Enrich, Evolve: Convert Unstructured Data Silos to Knowledge Graphs
Connect, Enrich, Evolve: Convert Unstructured Data Silos to Knowledge GraphsConnect, Enrich, Evolve: Convert Unstructured Data Silos to Knowledge Graphs
Connect, Enrich, Evolve: Convert Unstructured Data Silos to Knowledge Graphs
 
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
Exploratory Analysis in the Data Lab - Team-Sport or for Nerds only?
 
Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017
 
Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...
Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...
Robert Isele | eccenca CorporateMemory - Semantically integrated Enterprise D...
 
EAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using HadoopEAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using Hadoop
 
The Case for Open Source in the Public Sector
The Case for Open Source in the Public SectorThe Case for Open Source in the Public Sector
The Case for Open Source in the Public Sector
 
Big Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data DemocratizationBig Data Fabric 2.0 Drives Data Democratization
Big Data Fabric 2.0 Drives Data Democratization
 
DAS Slides: Graph Databases — Practical Use Cases
DAS Slides: Graph Databases — Practical Use CasesDAS Slides: Graph Databases — Practical Use Cases
DAS Slides: Graph Databases — Practical Use Cases
 
Accelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data FabricAccelerate Digital Transformation with an Enterprise Big Data Fabric
Accelerate Digital Transformation with an Enterprise Big Data Fabric
 
RDBMS to Graph Webinar
RDBMS to Graph WebinarRDBMS to Graph Webinar
RDBMS to Graph Webinar
 
Bigdata : Big picture
Bigdata : Big pictureBigdata : Big picture
Bigdata : Big picture
 
Leveraging Graphs for AI and ML - Alicia Frame, Neo4j
Leveraging Graphs for AI and ML - Alicia Frame, Neo4jLeveraging Graphs for AI and ML - Alicia Frame, Neo4j
Leveraging Graphs for AI and ML - Alicia Frame, Neo4j
 
Reducing Development Time for Production-Grade Hadoop Applications
Reducing Development Time for Production-Grade Hadoop ApplicationsReducing Development Time for Production-Grade Hadoop Applications
Reducing Development Time for Production-Grade Hadoop Applications
 
Knowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceKnowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data Science
 
Oracle NoSQL DB & InfiniteGraph - Trends in Big Data and Graph Technology
Oracle NoSQL DB & InfiniteGraph - Trends in Big Data and Graph TechnologyOracle NoSQL DB & InfiniteGraph - Trends in Big Data and Graph Technology
Oracle NoSQL DB & InfiniteGraph - Trends in Big Data and Graph Technology
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
 
Roadmap for Enterprise Graph Strategy
Roadmap for Enterprise Graph StrategyRoadmap for Enterprise Graph Strategy
Roadmap for Enterprise Graph Strategy
 
Boost your data analytics with open data and public news content
Boost your data analytics with open data and public news contentBoost your data analytics with open data and public news content
Boost your data analytics with open data and public news content
 
eccenca CorporateMemory - Semantically integrated Enterprise Data Lakes
eccenca CorporateMemory - Semantically integrated Enterprise Data Lakeseccenca CorporateMemory - Semantically integrated Enterprise Data Lakes
eccenca CorporateMemory - Semantically integrated Enterprise Data Lakes
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph Technology
 

More from GraphAware

More from GraphAware (20)

Unparalleled Graph Database Scalability Delivered by Neo4j 4.0
Unparalleled Graph Database Scalability Delivered by Neo4j 4.0Unparalleled Graph Database Scalability Delivered by Neo4j 4.0
Unparalleled Graph Database Scalability Delivered by Neo4j 4.0
 
Challenges in knowledge graph visualization
Challenges in knowledge graph visualizationChallenges in knowledge graph visualization
Challenges in knowledge graph visualization
 
Social media monitoring with ML-powered Knowledge Graph
Social media monitoring with ML-powered Knowledge GraphSocial media monitoring with ML-powered Knowledge Graph
Social media monitoring with ML-powered Knowledge Graph
 
To be or not to be.
To be or not to be. To be or not to be.
To be or not to be.
 
It Depends (and why it's the most frequent answer to modelling questions)
It Depends (and why it's the most frequent answer to modelling questions)It Depends (and why it's the most frequent answer to modelling questions)
It Depends (and why it's the most frequent answer to modelling questions)
 
When privacy matters! Chatbots in data-sensitive businesses
When privacy matters! Chatbots in data-sensitive businessesWhen privacy matters! Chatbots in data-sensitive businesses
When privacy matters! Chatbots in data-sensitive businesses
 
Graph-Powered Machine Learning
Graph-Powered Machine LearningGraph-Powered Machine Learning
Graph-Powered Machine Learning
 
Signals from outer space
Signals from outer spaceSignals from outer space
Signals from outer space
 
Neo4j-Databridge: Enterprise-scale ETL for Neo4j
Neo4j-Databridge: Enterprise-scale ETL for Neo4jNeo4j-Databridge: Enterprise-scale ETL for Neo4j
Neo4j-Databridge: Enterprise-scale ETL for Neo4j
 
(Big) Data Science
 (Big) Data Science (Big) Data Science
(Big) Data Science
 
Modelling Data in Neo4j (plus a few tips)
Modelling Data in Neo4j (plus a few tips)Modelling Data in Neo4j (plus a few tips)
Modelling Data in Neo4j (plus a few tips)
 
Intro to Neo4j (CZ)
Intro to Neo4j (CZ)Intro to Neo4j (CZ)
Intro to Neo4j (CZ)
 
Modelling Data as Graphs (Neo4j)
Modelling Data as Graphs (Neo4j)Modelling Data as Graphs (Neo4j)
Modelling Data as Graphs (Neo4j)
 
GraphAware Framework Intro
GraphAware Framework IntroGraphAware Framework Intro
GraphAware Framework Intro
 
Advanced Neo4j Use Cases with the GraphAware Framework
Advanced Neo4j Use Cases with the GraphAware FrameworkAdvanced Neo4j Use Cases with the GraphAware Framework
Advanced Neo4j Use Cases with the GraphAware Framework
 
Recommendations with Neo4j (FOSDEM 2015)
Recommendations with Neo4j (FOSDEM 2015)Recommendations with Neo4j (FOSDEM 2015)
Recommendations with Neo4j (FOSDEM 2015)
 
Knowledge Graphs and Chatbots with Neo4j and IBM Watson - Christophe Willemsen
Knowledge Graphs and Chatbots with Neo4j and IBM Watson - Christophe WillemsenKnowledge Graphs and Chatbots with Neo4j and IBM Watson - Christophe Willemsen
Knowledge Graphs and Chatbots with Neo4j and IBM Watson - Christophe Willemsen
 
The power of polyglot searching
The power of polyglot searchingThe power of polyglot searching
The power of polyglot searching
 
Neo4j-Databridge
Neo4j-DatabridgeNeo4j-Databridge
Neo4j-Databridge
 
Voice-driven Knowledge Graph Journey with Neo4j and Amazon Alexa
Voice-driven Knowledge Graph Journey with Neo4j and Amazon AlexaVoice-driven Knowledge Graph Journey with Neo4j and Amazon Alexa
Voice-driven Knowledge Graph Journey with Neo4j and Amazon Alexa
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Relevant Search Leveraging Knowledge Graphs with Neo4j

  • 1. GraphAware® RELEVANT SEARCH LEVERAGING KNOWLEDGE GRAPHS WITH NEO4J Alessandro Negro
 Chief Scientist @ GraphAware graphaware.com
 @graph_aware, @AlessandroNegro
  • 2. ‣ The rise of Knowledge Graphs ‣ Relevant Search ‣ Knowledge Graphs for e-Commerce ‣ Infrastructure ‣ Conclusions OUTLINE GraphAware®
  • 3. “Knowledge graphs provide contextual windows into master data domains and the links between domains” KNOWLEDGE GRAPH CONNECTING THE DOTS GraphAware® The Forrester Wave, Master Data Management
  • 4. THE RISE OF KNOWLEDGE GRAPHS GraphAware® E-Commerce ‣ Many data sources ‣ Marketing strategies ‣ Business goals ‣ Category hierarchies ‣ Searches
 Enterprise Networks ‣ Uncover new opportunities, hidden leads
 
 Finance ‣ Textual corpora such as financial documents contain a wealth of knowledge ‣ Structured knowledge of entities and relationships
  • 5. Medicine & Health ‣ Dynamic ontologies where data is categorized and organised around people, places, things and events ‣ Patterns in disease progression, causal relations involving disease and symptoms, new relationships previously unrecognised
 Criminal Investigation & Intelligence ‣ Obfuscated information ‣ Traceability to sources of information GraphAware® THE RISE OF KNOWLEDGE GRAPHS
  • 6. DATA SPARSITY
 PROBLEM GraphAware® Collaborative Filtering ‣ Cold Start Content Based Recommendation ‣ Missing Data ‣ Wrong Data
 Text Search ‣ User agnostic ‣ Relevant Search
 
 
 
 
 
 

  • 8. RELEVANT SEARCH GraphAware® “Relevance is the practice of improving search results for users by satisfying their information needs in the context of a particular user experience, while balancing how ranking impacts business’s needs.”
  • 10. KNOWLEDGE GRAPHS
 THE MODEL Search architecture must be able to handle highly heterogenous data Knowledge Graphs represent the information structure for relevant search Graphs are the right representation for: ‣ Information Extraction ‣ Recommendation Engines ‣ Context Representation ‣ Rule Engine
  • 11. Critical aspects and peculiarities: ‣ Defined and controlled set of searchable Items ‣ Multiple category hierarchies ‣ Marketing strategy ‣ User feedback and interactions ‣ Supplier information ‣ Business constraints THE USE CASE
 E-COMMERCE GraphAware® → Text search and catalog navigation as Sales People
  • 14. A graph centric approach THE DATA FLOW GraphAware® ‣ Async data ingestion ‣ Data Pipeline ‣ Single Neo4j Writer ‣ Microservice approach for isolation and scalability ‣ Event notification ‣ Multiple views exported into Elasticsearch
  • 15. THE NEO4J ROLES GraphAware® ‣ Single source of truth ‣ Cleansing ‣ Fast access to connected data ‣ Query ‣ Knowledge Graph store ‣ Merging External Data ‣ Existing Data Augmentation
  • 16. Natural Language Processing ‣ Unsupervised Topic Identification ‣ Word2Vec ‣ Clustering (Label Propagation) EXTERNALISE INTENSE PROCESSES GraphAware® Recommendation model building ‣ Content-Based ‣ Collaborative Filtering (internal and external)
  • 17. Fast, Reliable and Easy-to-tune textual searches ‣ Multiple views for multiple scopes: ‣ Catalog Navigation and Search ‣ Faceting ‣ Product details page ‣ Product variants aggregation ‣ Autocomplete ‣ Suggestion THE ELASTICSEARCH ROLES GraphAware® → It is not used as a database
  • 18. Any components of relevance-scoring calculation corresponding to a meaningful and measurable information
 Two techniques to control relevancy: ‣ Signal Modeling ‣ Ranking Function Note: balance precision and recall Multiple sources CRAFTING
 SIGNALS GraphAware®
  • 19. → Users as a new source of information GraphAware® Profile-based personalisation: ‣ Explicit: Users provide profile information ‣ Implicit: Profile created from user interactions
 Behavioural-Based personalisation ‣ Focus on User-Item Interaction ‣ Make explicit the relationships among users and items PERSONALISING
 SEARCH Tying personalisation back to search ‣ Query-time personalisation ‣ Index-time personalisation
  • 20. → Search for things, not for strings CONCEPT
 SEARCH GraphAware® Basic Approaches: ‣ Concept field (Manual Tagging) ‣ Synonyms
 Content Augmentation (ML based) ‣ Co-occurrence ‣ Latent Semantic Analysis ‣ Latent Dirichlet Allocation ‣ Word2Vec
  • 22. Knowledge Graphs can ‣ store easy-to-query model ‣ gather data from multiple sources ‣ be easily extended
 Search Engines can ‣ provide fast, reliable and easy-to- tune textual search ‣ provide features like faceting, autocomplete CONCLUSION GraphAware® → By combining them, it is possible to offer an unlimited set of services to the end users