A tour of the PokitDok Health Graph and
some open source graph projects
Graph Day Texas, Jan 2016
Denise Gosnell, PhD
Twitter and Github:
@pokitdok
@denisekgosnell
PokitDok APIs:
The business of health,
for developers.
https://platform.pokitdok.com/
PokitDok APIs: Marketplace
Doctor on Demand:
Powered by PokitDok
onward to graphs.
Talk Outline:
What we built.
• The HealthGraph
What we’ve open sourced.
• A Gremlin-Python Library
• Custom Titan Build
• Dynamic JSON to Graph [WIP]
• HealthGraph DSL [WIP]
The PokitDok HealthGraph
X12 Data Standard:
ETL hell from the 1970s
HealthGraph: Transactions as Trees
• We treat transactions as first-class objects in the graph
• Buried in the depths of an X12 transaction are the entities of interest
Interactive graph available at:
https://fullmetalhealth.com/dsl/
HealthGraph:
Property Graph Model
HealthGraph: Probabilistic Inferences
HealthGraph:
Data Inferences
HealthGraph: Predictive Models
• What is the probability claim X will be denied?
• A new customer just searched for “family practice”;
recommend the best provider within 10 miles.
• Given a CPT code, what is the expected
reimbursement rate from insurance company A in zip
code 37601?
HealthGraph: Top 100k Providers
Twitter and Github:
@pokitdok
@MacraeAlec
PokitDok Open Source:
Gremlin Python
Our HealthGraph Production Stack
• Titan 0.5.3
• TinkerPop’s Blueprints 2.5.0
• Cassandra and Elasticsearch
• Gremlin-Python
Gremlin-Python Motivation
• Lighter context switching between development tools and environments
• Incompatible syntax issues between Gremlin and Python
• Using Python.
Twitter and Github:
@corbinbs
@denisekgosnell
Gremlin-Python Test Drive
Option 1: Grab our Docker container
1. Install Docker:
https://www.docker.com/docker-toolbox
2. Jump in the “Docker Quickstart Terminal”
3. Fire up our example container:
docker run -i -t pokitdok/gremlin-python-test-drive
Option 2: Shell script install
1. Clone our repo:
https://github.com/pokitdok/gremlin-python
2. Run the set-up scripts:
$ ./test_drive/setup.sh && ./test_drive/run.sh
Bi-Partite Graph Recommendation System
[Diagram: Customer vertices connected to Doctor vertices by "viewed" and "scheduled_with" edges]
# ranked_docs is the map that receives the counts (assumed to be defined beforehand)
(g.E.has('edge_type', 'scheduled_with')
    .in_v()
    .group_count(ranked_docs,
                 lambda it: it.full_name,
                 lambda it: it.b + 1.0))
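The traversal above tallies "scheduled_with" edges per doctor. As a rough illustration of that counting logic only, here is a plain-Python sketch that needs no graph database. The edge triples, customer names, doctor names, and the rank_doctors helper are all hypothetical and are not part of Gremlin-Python or the HealthGraph.

```python
from collections import Counter

# Hypothetical toy data: (customer, edge_type, doctor) triples
# standing in for the edges of the bipartite graph.
edges = [
    ("alice", "viewed", "Dr. Adams"),
    ("alice", "scheduled_with", "Dr. Adams"),
    ("bob", "viewed", "Dr. Baker"),
    ("bob", "scheduled_with", "Dr. Adams"),
    ("carol", "scheduled_with", "Dr. Baker"),
    ("dave", "viewed", "Dr. Clark"),
]


def rank_doctors(edges, edge_type="scheduled_with"):
    """Count incoming edges of one type per doctor, highest count first."""
    counts = Counter(doctor for _, etype, doctor in edges if etype == edge_type)
    return counts.most_common()


ranked_docs = rank_doctors(edges)
print(ranked_docs)
```

Swapping the edge_type argument to "viewed" ranks doctors by profile views instead, which is one simple way to compare the two signals in the diagram.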
Gremlin-Python Test Drive
PokitDok Open Source:
Custom Build of Titan 0.5.3 to
Integrate with CDH5 Containers
Motivation for Release of Custom Build:
Graph Production Stack:
Titan 0.5.x ships with Hadoop 2.2
API Production Stack:
contains Cloudera’s CDH5 containers and Hadoop 2.6.0
You guessed it:
infrastructure dependency errors upon integration;
the Hadoop 2.6.0 API is not fully backwards compatible
with Hadoop 2.2.
Released:
A modification of the Titan 0.5.3 build
to upgrade to Hadoop 2.6.0 and
resolve numerous conflicts among
transitive dependencies.
… someone had to do it.
Grab it here:
https://github.com/pokitdok/titan/tree/0.5.3-hadoop2.6.0
Tested with Cassandra but not HBase.
HealthGraph Dynamic JSON Load
Open Source Version [WIP]
Dynamic JSONLoader:
Goal: Bulk load of JSON from sequenced HDFS files
straight to a Titan DB
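To make the JSON-to-graph idea concrete, here is a minimal sketch of one way a nested JSON document could be flattened into vertex and edge lists before a bulk load. It is not the PokitDok loader: the json_to_graph function, the field names, and the sample document are hypothetical, and the HDFS reading and Titan writing steps are left out.

```python
import json
from itertools import count


def json_to_graph(doc, root_label="transaction"):
    """Flatten a nested JSON object into (vertices, edges).

    Each JSON object becomes a vertex carrying its scalar fields.
    Each nested object, or object inside a list, becomes a child vertex
    joined to its parent by an edge labeled with the key it sat under.
    Lists of scalars are ignored in this sketch.
    """
    ids = count(1)
    vertices, edges = [], []

    def visit(obj, label):
        vid = next(ids)
        props = {k: v for k, v in obj.items() if not isinstance(v, (dict, list))}
        vertices.append({"id": vid, "label": label, "properties": props})
        for key, value in obj.items():
            children = value if isinstance(value, list) else [value]
            for child in children:
                if isinstance(child, dict):
                    child_id = visit(child, key)
                    edges.append((vid, key, child_id))
        return vid

    visit(doc, root_label)
    return vertices, edges


# Hypothetical record; the keys are made up for illustration.
raw = """
{"trading_partner": "hypothetical_payer",
 "subscriber": {"first_name": "Jane", "provider": {"npi": "0000000000"}},
 "claims": [{"status": "paid"}, {"status": "denied"}]}
"""
vertices, edges = json_to_graph(json.loads(raw))
for vertex in vertices:
    print(vertex)
for edge in edges:
    print(edge)
```

Keeping the whole record as the root vertex mirrors the "transactions as first-class objects" idea, while the nested entities end up as their own vertices that later queries can reach directly.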
Dynamic JSONLoader Future Work
1. Extract PokitDok HealthGraph specific features
2. Move to Titan 1.0 and TP3 compatibility
3. Release on PokitDok GitHub
HealthGraph DSL
Open Source Version [WIP]
X12 Spec Trees vs. Graph DSL:
Interactive graph available at:
https://fullmetalhealth.com/dsl/
Graph DSL with TinkerPop 2.5:
DSL Future Work
1. Move to Titan 1.0 and TP3 compatibility
2. Release on PokitDok GitHub
3. Current Open Question:
We are looking for(ward to) more documentation on implementing custom Gremlin steps (DSLs) in TP3
and there will be more…!
Reach Out
Dev Blog: FullMetalHealth.com
@PokitDok @DeniseKGosnell
Denise.Gosnell@pokitdok.com

Graph Day Texas: Open Source Graph Projects from PokitDok

Editor's notes

  1. Personal story of how I got into graph analytics; graph lineage
  2. We made all of our stuff available via API.
  3. For something the crowd can go see ---
  4. Relevant Timing: Xerox is powered by PokitDok
  5. We are tackling two wild fields: navigating the wild and quickly changing space of graph technology while also trying to modernize healthcare.
  6. transitional purposes only
  7. what kind of data do we have
  8. We are using graph paths to calculate a high density of providers with a co-occurrence across payors; we can also find this by plan. GOAL: infer provider networks across plans, or whichever slice of the data we prefer.
  9. we can also answer all sorts of questions
  10. Current healthcare infrastructure is fractured and antiquated… they can’t answer these questions.
  11. 4.3 million providers
  12. This is a slide about why
  13. Data management. Data engineering: loading of data into a database. Data science: probabilistic inferences.
  14. Updates to transitive dependencies aren’t sexy, but aren’t you glad you don’t have to do this now? Someone had to do it. There were people on the Titan users group who suggested they had built Titan 0.5 for Hadoop 2.6 themselves, but we could not find any publicly available builds. That is why we released this.
  15. slightly more interesting than dependency whack a mole --
  16. Bulk load of JSON from sequenced HDFS files
  17. Bulk load of JSON from sequenced HDFS files
  18. We have created a Groovy/Gremlin-based graph DSL for entity retrieval. The DSL is accessible from client scripts in Python or Groovy, or via TinkerPop’s Gremlin Console.
  19. Personal story of how I got into graph analytics; graph lineage