SlideShare una empresa de Scribd logo
1 de 34
Descargar para leer sin conexión
T H E D A T A B A S E F O R A I
1960 1970 1980 1990 2000 2010 2020 2030
Relational/SQL
Databases
NoSQL & NewSQL
Databases
SCALE
COMPLEXITY
COMPLEXITY
Business Intelligence (BI)
Web Applications
Artificial Intelligence (AI)
?
AI S YST EMS P RO CES S KN OW L EDG E T HAT I S TO O CO MP L EX FO R CURREN T DATABAS ES
Punch cards
& Tapes
Navigational
Databases
Record Keeping
SCALE
Follow us @GraknLabs
1960 1970 1980 1990 2000 2010 2020 2030
Relational/SQL
Databases
NoSQL & NewSQL
Databases
Business Intelligence (BI)
Web Applications
Artificial Intelligence (AI)
SCALE
COMPLEXITY
SCALE
COMPLEXITY
WHAT RELATIONAL DID FOR BI, IS WHAT GRAKN WILL DO FOR AI
Punch cards
& Tapes
Navigational
Databases
Record Keeping
Follow us @GraknLabs
Follow us @GraknLabs
What is the problem with complex data?
Too complex to model
Current modelling
techniques only based on
binary relationships
Could not model complex
domains
Too complex to query
Current languages only allow
you to query for explicitly
stored data
Could not simplify verbose
queries
Too expensive analytics
Automated distributed
algorithms (BSP) expensive
and not reusable
Could not reuse analytics
algorithms
DB QLs are too low-level
Strong abstraction over low-
level constructs and
complex relationships
Difficult to work with
complex data
Follow us @GraknLabs
GRAKN.AI is a hyper-relational database
for knowledge-oriented systems
i.e.
GRAKN.AI is a knowledge baseKnowledge Storage System
Novel Knowledge Representation System based on
Hypergraph Theory
Knowledge Inference
OLTP Reasoning Engine
Knowledge Analytics
OLAP Distributed Analytics
Follow us @GraknLabs
What is a hyper-relational database?
Hyper-expressive schema
Flexible Entity-Relationship
concept-level schema to
build knowledge models
Model complex
domains
Real-time inference
Automated deductive
reasoning of data points
during runtime (OLTP)
Derive implicit facts &
simplification
Analytics as a Language
Automated distributed
algorithms (BSP) as a
language (OLAP)
Automated large scale
analytics
High-level query language
Strong abstraction over low-
level constructs and
complex relationships
Easier to work with
complex data
Follow us @GraknLabs
Follow us @GraknLabs
THE CENTRAL DOGMA
TRANSLATION
RNA to PROTEINS
TRANSCRIPTION
DNA to RNA
REPLICATION
DNA to DNA
Francis Crick, 1958
Nobel Prize Winner 1962
Follow us @GraknLabs
https://www.ncbi.nlm.nih.gov http://www.uniprot.org http://www.geneontology.org
http://reactome.org
http://www.mirbase.org http://mircancer.ecu.edu
http://bioinfo.life.hust.edu.cn/miRNASNP2/index.php
http://mirtarbase.mbc.nctu.edu.tw http://www.genenames.org
http://www.microrna.org/microrna/home.do
A SMALL SAMPLE…
Follow us @GraknLabs
…AND THE CHALLENGE
Follow us @GraknLabs
Schema Example: Basic Model
Employ-
ment
Person CompanyName
Employee Employer
has has
relates relates
plays plays
Follow us @GraknLabs
Schema Example: Type-Hierarchy
Employ-
ment
Person
Customer
Company
Startup
Name
Employee Employer
has has
sub sub
relates relates
plays plays
plays plays
Follow us @GraknLabs
THE BIOGRAKN SCHEMA
Follow us @GraknLabs
Follow us @GraknLabs
Follow us @GraknLabs
Follow us @GraknLabs
Follow us @GraknLabs
THE CENTRAL DOGMA: INFERRED
TRANSLATION
RNA to PROTEINS
TRANSCRIPTION
DNA to RNA
REPLICATION
DNA to DNA
Francis Crick, 1958
Nobel Prize Winner 1962
Follow us @GraknLabs
WHAT’S NEXT?
T H E D A T A B A S E F O R A I
Follow us @GraknLabs
Schema Example: Type-Hierarchy
Employ-
ment
Person
Customer
Company
Startup
Name
Employee Employer
has has
sub sub
relates relates
plays plays
Husband
Wife
Marriage
plays
plays
relates
relates
Follow us @GraknLabs
Valid Data Insertion
Alice Bob
IBM
Grakn
mar
emp
emp
employer
employer
wife husband
✓ Write commit success
customerperson
startup
Follow us @GraknLabs
Invalid Data insertions – [intelligent] Schema Constraints are Back!
Charlie Applemar
husband wife
companyperson
❌ Write commit fails
❌ Invalid relationship
Follow us @GraknLabs
Hyper-Relationship Example: Nested-Relationship
Alice Bob
Austin
mar
loc
wife husband
personperson
City
07/01/2017
has
date
Follow us @GraknLabs
Hyper-Relationship Example: N-ary Relationship
Titanic Jack
Leonardo
cast
figuremovie
person
actor
1
Billing-number
Follow us @GraknLabs
Rule Example: Transitive Relationship
Kings
Cross London
loc
countryward
UK
loc
city
loc
Follow us @GraknLabs
Rule Example: Simple Business Rule
Schedule A
Schedule B
A Start B Start A End B end
Follow us @GraknLabs
THE	INFERENCE	OLTP	LANGUAGE
A knowledge-oriented query language should not only be able to
retrieve explicitly stored data, but also implicitly derived information.
Follow us @GraknLabs
Complex Query Example
drive
drive
drive
travel
travel
travel
Alice
Full-time Emp
Bob
Part-time Emp
Charlie
Temporary Emp
AB123
Bus
BC234
Van
CD345
Truck
Kings
Cross
Ward
London
City
UK
Country
loc
loc
Who are all the
drivers that will be
arriving in the UK?
The query would be very
long and complex in SQL,
NoSQL or even Graphs
Follow us @GraknLabs
Complex Query Example: Type and Relationship Inference
drive
drive
drive
travel
travel
travel
Alice
Full-time Emp
Bob
Part-time Emp
Charlie
Temporary Emp
AB123
Bus
BC234
Van
CD345
Truck
Kings
Cross
Ward
London
City
UK
Country
loc
loc
Who are all the
drivers that will be
arriving in the UK?
Follow us @GraknLabs
THE	ANALYTICS	OLAP	LANGUAGE
Large-scale analytics is like teenage sex: everyone talks about it,
nobody really knows how to do it, everyone thinks everyone else is
doing it, so everyone claims they are doing it too.
At the end of the day, very few people know how to code it.
Follow us @GraknLabs
Example of a Distributed Analytics Algorithm
For each vertex V,
Superstep 1:
V sends its own id via both out going and incoming edges
V sets its own id as cluster label
Do superstep n:
For every received message m of V, compare it to its current cluster label L:
If m > L, set the label to m;
If the cluster label has not changed in this super step, vote to halt;
Else, send the new cluster label via all edges;
Global operation:
While not every vertex votes to halt, and n < N, do another superstep n + 1.
Connected Component: a clustering algorithm (pseudocode)
An efficient implementation
of this algorithm is about
200 lines of code in Java
Follow us @GraknLabs
Example of a Distributed Analytics Algorithm
For each vertex V,
Superstep 1:
V sends its own id via both out going and incoming edges
V sets its own id as cluster label
Do superstep n:
For every received message m of V, compare it to its current cluster label L:
If m > L, set the label to m;
If the cluster label has not changed in this super step, vote to halt;
Else, send the new cluster label via all edges;
Global operation:
While not every vertex votes to halt, and n < N, do another superstep n + 1.
Connected Component: a clustering algorithm (pseudocode)
An efficient implementation
of this algorithm is about
200 lines of code in Java
Follow us @GraknLabs
Graql Distributed Analytics Queries
And we’ll continue to add more
algorithms into the language,
such as PageRank, K-Core, Triangle
Count, Density, Cliques, Centrality,
and so on

Más contenido relacionado

La actualidad más candente

Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2Dimitris Kontokostas
 
Why is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz IncWhy is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz IncFranz Inc. - AllegroGraph
 
Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinthsDaniel Camarda
 
What_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdfWhat_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdfHeiko Paulheim
 
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...Dimitris Kontokostas
 
aRangodb, un package per l'utilizzo di ArangoDB con R
aRangodb, un package per l'utilizzo di ArangoDB con RaRangodb, un package per l'utilizzo di ArangoDB con R
aRangodb, un package per l'utilizzo di ArangoDB con RGraphRM
 
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...Ontotext
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudOntotext
 
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesReasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesOntotext
 
Data curation and data archiving at different stages of the research process
Data curation and data archiving at different stages of the research processData curation and data archiving at different stages of the research process
Data curation and data archiving at different stages of the research processAndrea Scharnhorst
 
Linked Data, Ontologies and Inference
Linked Data, Ontologies and InferenceLinked Data, Ontologies and Inference
Linked Data, Ontologies and InferenceBarry Norton
 
A Semantic Data Model for Web Applications
A Semantic Data Model for Web ApplicationsA Semantic Data Model for Web Applications
A Semantic Data Model for Web ApplicationsArmin Haller
 
C4l2008charper
C4l2008charperC4l2008charper
C4l2008charpercharper
 

La actualidad más candente (20)

Graph databases & data integration v2
Graph databases & data integration v2Graph databases & data integration v2
Graph databases & data integration v2
 
Data quality in Real Estate
Data quality in Real EstateData quality in Real Estate
Data quality in Real Estate
 
Why is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz IncWhy is JSON-LD Important to Businesses - Franz Inc
Why is JSON-LD Important to Businesses - Franz Inc
 
Find your way in Graph labyrinths
Find your way in Graph labyrinthsFind your way in Graph labyrinths
Find your way in Graph labyrinths
 
What_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdfWhat_do_Knowledge_Graph_Embeddings_Learn.pdf
What_do_Knowledge_Graph_Embeddings_Learn.pdf
 
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...PhD thesis defense:  Large-scale multilingual knowledge extraction, publishin...
PhD thesis defense: Large-scale multilingual knowledge extraction, publishin...
 
aRangodb, un package per l'utilizzo di ArangoDB con R
aRangodb, un package per l'utilizzo di ArangoDB con RaRangodb, un package per l'utilizzo di ArangoDB con R
aRangodb, un package per l'utilizzo di ArangoDB con R
 
HyperGraphQL
HyperGraphQLHyperGraphQL
HyperGraphQL
 
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
[Webinar] FactForge Debuts: Trump World Data and Instant Ranking of Industry ...
 
Efficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data StreamsEfficient RDF Interchange (ERI) Format for RDF Data Streams
Efficient RDF Interchange (ERI) Format for RDF Data Streams
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
 
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesReasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
 
GraphDB
GraphDBGraphDB
GraphDB
 
Data curation and data archiving at different stages of the research process
Data curation and data archiving at different stages of the research processData curation and data archiving at different stages of the research process
Data curation and data archiving at different stages of the research process
 
Rdf
RdfRdf
Rdf
 
Linked Data, Ontologies and Inference
Linked Data, Ontologies and InferenceLinked Data, Ontologies and Inference
Linked Data, Ontologies and Inference
 
A Semantic Data Model for Web Applications
A Semantic Data Model for Web ApplicationsA Semantic Data Model for Web Applications
A Semantic Data Model for Web Applications
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open Data
 
C4l2008charper
C4l2008charperC4l2008charper
C4l2008charper
 
Timbuctoo 2 EASY
Timbuctoo 2 EASYTimbuctoo 2 EASY
Timbuctoo 2 EASY
 

Similar a Christian Jakenfelds

GRAKN.AI: The Hyper-Relational Database for Knowledge-Oriented Systems
GRAKN.AI: The Hyper-Relational Database for Knowledge-Oriented SystemsGRAKN.AI: The Hyper-Relational Database for Knowledge-Oriented Systems
GRAKN.AI: The Hyper-Relational Database for Knowledge-Oriented SystemsVaticle
 
GRAKN.AI - The Knowledge Graph
GRAKN.AI - The Knowledge GraphGRAKN.AI - The Knowledge Graph
GRAKN.AI - The Knowledge GraphVaticle
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?Samet KILICTAS
 
Introducing GRAKN.AI
Introducing GRAKN.AIIntroducing GRAKN.AI
Introducing GRAKN.AIVaticle
 
Graphs for Ai and ML
Graphs for Ai and MLGraphs for Ai and ML
Graphs for Ai and MLNeo4j
 
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...Big Data Spain
 
Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data LakeFishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data LakeArangoDB Database
 
Multiplaform Solution for Graph Datasources
Multiplaform Solution for Graph DatasourcesMultiplaform Solution for Graph Datasources
Multiplaform Solution for Graph DatasourcesStratio
 
GraphTour Boston - Graphs for AI and ML
GraphTour Boston - Graphs for AI and MLGraphTour Boston - Graphs for AI and ML
GraphTour Boston - Graphs for AI and MLNeo4j
 
Knowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningKnowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningConnected Data World
 
Atlanta MLconf Machine Learning Conference 09-23-2016
Atlanta MLconf Machine Learning Conference 09-23-2016Atlanta MLconf Machine Learning Conference 09-23-2016
Atlanta MLconf Machine Learning Conference 09-23-2016Chris Fregly
 
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016MLconf
 
Programming with Semantic Broad Data
Programming with Semantic Broad DataProgramming with Semantic Broad Data
Programming with Semantic Broad DataSteffen Staab
 
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...Daniel Zivkovic
 
Data Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneData Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneDoug Needham
 
Scaling PyData Up and Out
Scaling PyData Up and OutScaling PyData Up and Out
Scaling PyData Up and OutTravis Oliphant
 
Dagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphsDagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphsArijit Khan
 
Apache Spark GraphX highlights.
Apache Spark GraphX highlights. Apache Spark GraphX highlights.
Apache Spark GraphX highlights. Doug Needham
 

Similar a Christian Jakenfelds (20)

GRAKN.AI: The Hyper-Relational Database for Knowledge-Oriented Systems
GRAKN.AI: The Hyper-Relational Database for Knowledge-Oriented SystemsGRAKN.AI: The Hyper-Relational Database for Knowledge-Oriented Systems
GRAKN.AI: The Hyper-Relational Database for Knowledge-Oriented Systems
 
GRAKN.AI - The Knowledge Graph
GRAKN.AI - The Knowledge GraphGRAKN.AI - The Knowledge Graph
GRAKN.AI - The Knowledge Graph
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?
 
Introducing GRAKN.AI
Introducing GRAKN.AIIntroducing GRAKN.AI
Introducing GRAKN.AI
 
Graphs for Ai and ML
Graphs for Ai and MLGraphs for Ai and ML
Graphs for Ai and ML
 
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...
Fishing Graphs in a Hadoop Data Lake by Jörg Schad and Max Neunhoeffer at Big...
 
Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data LakeFishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data Lake
 
Multiplaform Solution for Graph Datasources
Multiplaform Solution for Graph DatasourcesMultiplaform Solution for Graph Datasources
Multiplaform Solution for Graph Datasources
 
GraphTour Boston - Graphs for AI and ML
GraphTour Boston - Graphs for AI and MLGraphTour Boston - Graphs for AI and ML
GraphTour Boston - Graphs for AI and ML
 
Knowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningKnowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep Learning
 
Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data Lake Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data Lake
 
Atlanta MLconf Machine Learning Conference 09-23-2016
Atlanta MLconf Machine Learning Conference 09-23-2016Atlanta MLconf Machine Learning Conference 09-23-2016
Atlanta MLconf Machine Learning Conference 09-23-2016
 
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
Chris Fregly, Research Scientist, PipelineIO at MLconf ATL 2016
 
Programming with Semantic Broad Data
Programming with Semantic Broad DataProgramming with Semantic Broad Data
Programming with Semantic Broad Data
 
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
 
Gaps in the algorithm
Gaps in the algorithmGaps in the algorithm
Gaps in the algorithm
 
Data Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZoneData Structure Graph DMZ #DMZone
Data Structure Graph DMZ #DMZone
 
Scaling PyData Up and Out
Scaling PyData Up and OutScaling PyData Up and Out
Scaling PyData Up and Out
 
Dagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphsDagstuhl seminar talk on querying big graphs
Dagstuhl seminar talk on querying big graphs
 
Apache Spark GraphX highlights.
Apache Spark GraphX highlights. Apache Spark GraphX highlights.
Apache Spark GraphX highlights.
 

Más de Connected Data World

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenConnected Data World
 
Graph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora LassilaGraph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora LassilaConnected Data World
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Connected Data World
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine LearningConnected Data World
 
The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is hereConnected Data World
 
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2Connected Data World
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3Connected Data World
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data ModelConnected Data World
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseConnected Data World
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Connected Data World
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Connected Data World
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleConnected Data World
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Connected Data World
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the WebConnected Data World
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsConnected Data World
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsConnected Data World
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...Connected Data World
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGOConnected Data World
 

Más de Connected Data World (20)

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van Harmelen
 
Graph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora LassilaGraph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora Lassila
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine Learning
 
Graphs in sustainable finance
Graphs in sustainable financeGraphs in sustainable finance
Graphs in sustainable finance
 
The years of the graph: The future of the future is here
The years of the graph: The future of the future is hereThe years of the graph: The future of the future is here
The years of the graph: The future of the future is here
 
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
From Taxonomies and Schemas to Knowledge Graphs: Parts 1 & 2
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data Model
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
 
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
Enterprise Data Governance: Leveraging Knowledge Graph & AI in support of a d...
 
Graph Realities
Graph RealitiesGraph Realities
Graph Realities
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scale
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the Web
 
RAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needsRAPIDS cuGraph – Accelerating all your Graph needs
RAPIDS cuGraph – Accelerating all your Graph needs
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property Graphs
 
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
From Knowledge Graphs to AI-powered SEO: Using taxonomies, schemas and knowle...
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGO
 

Último

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 

Último (20)

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 

Christian Jakenfelds

  • 1. T H E D A T A B A S E F O R A I
  • 2. 1960 1970 1980 1990 2000 2010 2020 2030 Relational/SQL Databases NoSQL & NewSQL Databases SCALE COMPLEXITY COMPLEXITY Business Intelligence (BI) Web Applications Artificial Intelligence (AI) ? AI S YST EMS P RO CES S KN OW L EDG E T HAT I S TO O CO MP L EX FO R CURREN T DATABAS ES Punch cards & Tapes Navigational Databases Record Keeping SCALE Follow us @GraknLabs
  • 3. 1960 1970 1980 1990 2000 2010 2020 2030 Relational/SQL Databases NoSQL & NewSQL Databases Business Intelligence (BI) Web Applications Artificial Intelligence (AI) SCALE COMPLEXITY SCALE COMPLEXITY WHAT RELATIONAL DID FOR BI, IS WHAT GRAKN WILL DO FOR AI Punch cards & Tapes Navigational Databases Record Keeping Follow us @GraknLabs
  • 4. Follow us @GraknLabs What is the problem with complex data? Too complex to model Current modelling techniques only based on binary relationships Could not model complex domains Too complex to query Current languages only allow you to query for explicitly stored data Could not simplify verbose queries Too expensive analytics Automated distributed algorithms (BSP) expensive and not reusable Could not reuse analytics algorithms DB QLs are too low-level Strong abstraction over low- level constructs and complex relationships Difficult to work with complex data
  • 5. Follow us @GraknLabs GRAKN.AI is a hyper-relational database for knowledge-oriented systems i.e. GRAKN.AI is a knowledge baseKnowledge Storage System Novel Knowledge Representation System based on Hypergraph Theory Knowledge Inference OLTP Reasoning Engine Knowledge Analytics OLAP Distributed Analytics
  • 6. Follow us @GraknLabs What is a hyper-relational database? Hyper-expressive schema Flexible Entity-Relationship concept-level schema to build knowledge models Model complex domains Real-time inference Automated deductive reasoning of data points during runtime (OLTP) Derive implicit facts & simplification Analytics as a Language Automated distributed algorithms (BSP) as a language (OLAP) Automated large scale analytics High-level query language Strong abstraction over low- level constructs and complex relationships Easier to work with complex data
  • 8. Follow us @GraknLabs THE CENTRAL DOGMA TRANSLATION RNA to PROTEINS TRANSCRIPTION DNA to RNA REPLICATION DNA to DNA Francis Crick, 1958 Nobel Prize Winner 1962
  • 9. Follow us @GraknLabs https://www.ncbi.nlm.nih.gov http://www.uniprot.org http://www.geneontology.org http://reactome.org http://www.mirbase.org http://mircancer.ecu.edu http://bioinfo.life.hust.edu.cn/miRNASNP2/index.php http://mirtarbase.mbc.nctu.edu.tw http://www.genenames.org http://www.microrna.org/microrna/home.do A SMALL SAMPLE…
  • 11. Follow us @GraknLabs Schema Example: Basic Model Employ- ment Person CompanyName Employee Employer has has relates relates plays plays
  • 12. Follow us @GraknLabs Schema Example: Type-Hierarchy Employ- ment Person Customer Company Startup Name Employee Employer has has sub sub relates relates plays plays plays plays
  • 13. Follow us @GraknLabs THE BIOGRAKN SCHEMA
  • 18. Follow us @GraknLabs THE CENTRAL DOGMA: INFERRED TRANSLATION RNA to PROTEINS TRANSCRIPTION DNA to RNA REPLICATION DNA to DNA Francis Crick, 1958 Nobel Prize Winner 1962
  • 20. T H E D A T A B A S E F O R A I
  • 21. Follow us @GraknLabs Schema Example: Type-Hierarchy Employ- ment Person Customer Company Startup Name Employee Employer has has sub sub relates relates plays plays Husband Wife Marriage plays plays relates relates
  • 22. Follow us @GraknLabs Valid Data Insertion Alice Bob IBM Grakn mar emp emp employer employer wife husband ✓ Write commit success customerperson startup
  • 23. Follow us @GraknLabs Invalid Data insertions – [intelligent] Schema Constraints are Back! Charlie Applemar husband wife companyperson ❌ Write commit fails ❌ Invalid relationship
  • 24. Follow us @GraknLabs Hyper-Relationship Example: Nested-Relationship Alice Bob Austin mar loc wife husband personperson City 07/01/2017 has date
  • 25. Follow us @GraknLabs Hyper-Relationship Example: N-ary Relationship Titanic Jack Leonardo cast figuremovie person actor 1 Billing-number
  • 26. Follow us @GraknLabs Rule Example: Transitive Relationship Kings Cross London loc countryward UK loc city loc
  • 27. Follow us @GraknLabs Rule Example: Simple Business Rule Schedule A Schedule B A Start B Start A End B end
  • 28. Follow us @GraknLabs THE INFERENCE OLTP LANGUAGE A knowledge-oriented query language should not only be able to retrieve explicitly stored data, but also implicitly derived information.
  • 29. Follow us @GraknLabs Complex Query Example drive drive drive travel travel travel Alice Full-time Emp Bob Part-time Emp Charlie Temporary Emp AB123 Bus BC234 Van CD345 Truck Kings Cross Ward London City UK Country loc loc Who are all the drivers that will be arriving in the UK? The query would be very long and complex in SQL, NoSQL or even Graphs
  • 30. Follow us @GraknLabs Complex Query Example: Type and Relationship Inference drive drive drive travel travel travel Alice Full-time Emp Bob Part-time Emp Charlie Temporary Emp AB123 Bus BC234 Van CD345 Truck Kings Cross Ward London City UK Country loc loc Who are all the drivers that will be arriving in the UK?
  • 31. Follow us @GraknLabs THE ANALYTICS OLAP LANGUAGE Large-scale analytics is like teenage sex: everyone talks about it, nobody really knows how to do it, everyone thinks everyone else is doing it, so everyone claims they are doing it too. At the end of the day, very few people know how to code it.
  • 32. Follow us @GraknLabs Example of a Distributed Analytics Algorithm For each vertex V, Superstep 1: V sends its own id via both out going and incoming edges V sets its own id as cluster label Do superstep n: For every received message m of V, compare it to its current cluster label L: If m > L, set the label to m; If the cluster label has not changed in this super step, vote to halt; Else, send the new cluster label via all edges; Global operation: While not every vertex votes to halt, and n < N, do another superstep n + 1. Connected Component: a clustering algorithm (pseudocode) An efficient implementation of this algorithm is about 200 lines of code in Java
  • 33. Follow us @GraknLabs Example of a Distributed Analytics Algorithm For each vertex V, Superstep 1: V sends its own id via both out going and incoming edges V sets its own id as cluster label Do superstep n: For every received message m of V, compare it to its current cluster label L: If m > L, set the label to m; If the cluster label has not changed in this super step, vote to halt; Else, send the new cluster label via all edges; Global operation: While not every vertex votes to halt, and n < N, do another superstep n + 1. Connected Component: a clustering algorithm (pseudocode) An efficient implementation of this algorithm is about 200 lines of code in Java
  • 34. Follow us @GraknLabs Graql Distributed Analytics Queries And we’ll continue to add more algorithms into the language, such as PageRank, K-Core, Triangle Count, Density, Cliques, Centrality, and so on