How Graph Algorithms Answer your Business Questions in Banking and Beyond

Graph Algorithms in Banking
Joe Depeau
Sr. Presales Consultant, UK
15th April, 2020
@joedepeau
http://linkedin.com/in/joedepeau

• Introduction to Graphs and Neo4j
• Introduction to The Neo4j Graph Data Science Library
• Demo Data Overview
• Review of Graph Algorithms for Demo
• Demo
• Q&A
2
Agenda

Introduction to Graphs and
Neo4j
3

Relational vs. Graph Databases
4

Graphs in the Age of Connections
5

7
Car
DRIVES
name: “Dan”
born: May 29, 1970
twitter: “@dan”
name: “Ann”
born: Dec 5, 1975
since:
Jan 10, 2011
brand: “Volvo”
model: “V70”
Anatomy of a Property Graph Database
Nodes
• Represent the objects in the
graph
• Can be labeled
Relationships
• Relate nodes by type and
direction
Properties
• Name-value pairs that can go on
nodes and relationships.
LOVES
LOVES
LIVES WITH
OW
NS
Person Person

Neo4j Graph Data Science
Library
8

Graph Algorithms are calculations that describe the
topology and connectivity of your graph
9
What the heck are graph algorithms?
- Global traversals & computations
- Learning overall structure
- Typically heuristics and
approximations
- Extracting new data from what you
already have
What’s important? What’s similar? What are efficient traversals?

10
...and what do I do with them?
Explore, plan, measure
Find significant patterns and plan for
optimal structures
Score outcomes and set a threshold
value for a prediction
Machine learning
Use the measures as features to train an
ML model
1st
node
2nd
node
Common
neighbors
Preferential
attachment
Label
1 2 4 15 1
3 4 7 12 1
5 6 1 1 0

11
Tell me more!
Pathfinding
& Search
Centrality /
Importance
Community
Detection
Link Prediction
Finds optimal paths
or evaluates route availability and
quality.
Determines the importance of
distinct nodes in the network.
Detects group clustering or
partition options.
Evaluates how alike nodes are
by neighbors and
relationships.
Estimates the likelihood of
nodes forming a future
relationship.
Similarity

Graph and ML algorithms in Neo4j
• Minimum Weight Spanning Tree
• Shortest Path
• Single Source Shortest Path
• All Pairs Shortest Path
• A*
• Yen’s K-shortest Paths
• Random Walk
• Breadth First Search
• Depth First Search
• Degree Centrality
• Closeness Centrality
• Betweenness Centrality
• PageRank
• ArticleRank
• Eigenvector Centrality
• Triangle Count / Clustering Coefficient
• Weakly Connected Components
• Strongly Connected Components
• Label Propagation
• Louvain Modularity
• K-1 Colouring
• Modularity Optimisation
• Node Similarity
• Approximate Nearest Neighbours
• Cosine Similarity
• Euclidean Similarity
• Jaccard Similarity
• Overlap Similarity
• Pearson Similarity
Pathfinding
& Search
Centrality /
Importance
Community
Detection
Similarity
https://neo4j.com/docs/graph-data-science/1.0/
Link
Prediction
• Adamic Adar
• Common Neighbours
• Preferential Attachment
• Resource Allocations
• Same Community
• Total Neighbours
12

14
Some Examples of Typical Bank Data
Event DataProduct and
Services Data
Customer DataOrganisational
Data
3rd Party Data
Documentation
Employee
Data
Processes
Systems and
Databases
KPIs and Reports
Address
Personal Data
Documents
Relationships
Assets
Documentation
Processes
Product / Service
Details
Product / Service
Hierarchy
Pricing
Money
Movements
Web / App Activity
Customer Contact
Social Media
Credit Rating
Agencies
Market Data
Organisational
Hierarchy
Corporate Data

15
Some Examples of Typical Bank Data
Event DataProduct and
Services Data
Customer DataOrganisational
Data
3rd Party Data
Documentation
Employee
Data
Processes
Systems and
Databases
KPIs and Reports
Address
Personal Data
Documents
Relationships
Assets
Documentation
Processes
Product / Service
Details
Product / Service
Hierarchy
Pricing
Money
Movements
Web / App Activity
Customer Contact
Social Media
Credit Rating
Agencies
Market Data
Organisational
Hierarchy
Corporate Data

17
Three ways a Client node can be Flagged
Performed a transaction flagged as fraud Share a SSN with another Client
Have more than one SSN on file

Graph Algorithms for
Demonstration
18

PageRank
What: Finds important nodes based
on their relationships.
Why: Identify important or
influential Client nodes by
quantifying the flows of money
towards them.
Uses:
- Fraud detection
- Anti-money Laundering
- Inform prioritization during analysis
and investigation19

20
The PageRank Algorithm
PageRank: what nodes can be considered
‘important’ in our graph based on money flows ?

21
The PageRank Algorithm
PageRank: what nodes can be considered
‘important’ in our graph based on money flows ?
Inputs
.pagerank
Property Output

Weakly Connected Components
What: Finds disconnected
community subgraphs in our data.
Why: Identify communities based
on connections with shared pieces
of identity.
Uses:
- Householding
- Synthetic identities
- Stolen identities
22

23
The Weakly Connected Components Algorithm
Weakly Connected
Components: what
communities exist in the
data based on connections
to pieces of identity ?

24
The Weakly Connected Components Algorithm
Weakly Connected
Components: what
communities exist in the
data based on connections
to pieces of identity ?
.component_id
Property Output
Inputs

Node Similarity
What: Similarity between nodes
based on neighbours. Writes a new
relationship to the graph.
Why: Identify similar nodes who
share common pieces of identity.
Uses:
- Entity Resolution
- Synthetic identities
- Stolen identities
25

26
The Node Similarity Algorithm
Node Similarity : how
similar are two Client
nodes based on pieces of
shared identity ?

27
The Node Similarity Algorithm
Node Similarity : how
similar are two Client
nodes based on pieces of
shared identity ?
SIMILAR Relationship Output
with .score property
Inputs

Louvain Modularity
What: Finds communities in our
graph who are connected. Can
return intermediate results.
Why: Useful for identifying
communities based on transaction
behaviour rather than identity.
Uses:
- Fraud ring detection
- Anti-money Laundering
28

29
The Louvain Algorithm
Louvain: what communities of nodes transact
amongst themselves ?

30
The Louvain Algorithm
Louvain: what communities of nodes transact
amongst themselves ?
Inputs
.louvain_community
Property Output

How Graph Algorithms Answer your Business Questions in Banking and Beyond

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Similar a How Graph Algorithms Answer your Business Questions in Banking and Beyond

Similar a How Graph Algorithms Answer your Business Questions in Banking and Beyond (20)

Más de Neo4j

Más de Neo4j (20)

Último

Último (20)

How Graph Algorithms Answer your Business Questions in Banking and Beyond