SlideShare una empresa de Scribd logo
1 de 20
Descargar para leer sin conexión
Covid–19 Graph!
Bringing Clinical Data
Together with Neo4j
Dave Iberson-Hurst
Partner, S-cubed
Kirsten Walther Langendorf
Principal Consultant, S-cubed
17th September 2020
Copenhagen
2 | ©2020 S-cubed
S-cubed
• A3 Suite platform
• MDR
• SWB
• Linked Data Services
• CDISC Training and
Support
• Regulatory Development
Strategy
• Clinical Trial Documentation
• Marketing Authorisation
Applications & Licence
Maintenance
• EU SME Status, EU
OMPD Holder
• QA & GXP Services
• Statistical Consultancy
• SAS Programming
• Data Management
• CDISC services
• Statistical Analysis and
Reporting
• Quality Assurance
• Biostatistics
• Clinical Data Management
• Pharmacovigilance
• Medical Monitoring
• Risk Based Monitoring
• Operational Reporting
• Qlik Extensions Data Analytics
(Qlik)
Biometrics
Clinical
Standards
Management
Regulatory
Affairs
3 | ©2020 S-cubed
Our “Study” World
Collect Organize Analyse ResultsPlan
4 | ©2020 S-cubed
Neo4j Use
5 | ©2020 S-cubed
Study Workbench
6 | ©2020 S-cubed
Study Workbench
7 | ©2020 S-cubed
Electronic Health Records
8 | ©2020 S-cubed
Mining for Definitions
9 | ©2020 S-cubed
And Some Cypher …
o We use a lot of rectangular
structures, but we can recreate these
with Cypher queries
10 | ©2020 S-cubed
Change in daily life due to COVID-19
11th March 2020
How can I
help?
Letter from authorities
11 | ©2020 S-cubed
Covidgraph.org
Mycontribution
12 | ©2020 S-cubed
The source data
ClinicalTrials.gov API
Limit
More studies than limit
Study counter
13 | ©2020 S-cubed
Looping with Cypher
Allowing for all studies to be included
Getting the total number of
studies and divide by 1000 to
get the number of loops needed
14 | ©2020 S-cubed
Looping with Cypher
Allowing for all studies to be included
Getting the total number of
studies returned by the looping
15 | ©2020 S-cubed
Modelling ClinicalTrials
Converting tabular info to a graph
My pharma
experience
with Clinical
Trials
16 | ©2020 S-cubed
{
"NCTId": [
"NCT04366271"
],
"LocationFacility": [
"Hospital Universitario de Getafe",
"Hospital Universitario de Cruces",
"Hospital Universitario de La Princesa",
"Hospital Infantil Universitario Niño Jesus",
"Hospital Ramón Y Cajal",
"Complejo Universitario La Paz"
],
"Rank": 2,
"LocationCity": [
"Getafe",
"Barakaldo",
"Madrid",
"Madrid",
"Madrid",
"Madrid"
],
"LocationState": [
"Madrid"
],
"LocationCountry": [
"Spain",
"Spain",
"Spain",
"Spain",
"Spain",
"Spain"
]
}
}
From Json to nodes and relationships
From lists to graph
1
1
1
UNWIND study_metadata.NCTId as Id
match(ct:ClinicalTrial{NCTId:Id})
WITH Id, ct, study_metadata, RANGE(0,size(study_metadata.LocationFacility)-1) as
nfacil
FOREACH(i in nfacil |
MERGE(fa:Facility{name:study_metadata.LocationFacility[i]})
MERGE(ci:City{name:study_metadata.LocationCity[i]})
MERGE(c:Country{name:study_metadata.LocationCountry[i]})
MERGE(ct)-[:CONDUCTED_AT]->(fa)
MERGE(fa)-[:LOCATED_IN]->(ci)
)
WITH Id, study_metadata, RANGE(0,size(study_metadata.LocationCity)-1) as ncity
FOREACH(i in ncity |
MERGE(ci:City{name:study_metadata.LocationCity[i]})
MERGE(c:Country{name:study_metadata.LocationCountry[i]})
MERGE(ci)-[:LOCATED_IN]->(c)
)
17 | ©2020 S-cubed
Missing values
In json input
?
18 | ©2020 S-cubed
Missing values
In json input
Split in two
19 | ©2020 S-cubed
o Neo4j Professional certification in 2019
• The task with covidgraph was a great opportunity to get
some cypher and graph experience
o Modelling can be improved
• Data consistency can be a challenge when creating
nodes and relationships
• Could consider some more advanced techniques to
‘clean’ the data (ML)
• Need to be aware of missing values in json when
creating queries
• Next steps: adding trial results
o Great to be part of a helpful team
• Even though I am not a healthcare professional I can
help to provide a better understanding of COVID-19
Lessons Learned
Virtual collaboration
Contact Details
Kirsten Langendorf
kl@s-cubed.dk

Más contenido relacionado

Similar a Bringing Clinical Data Together with Neo4j

Similar a Bringing Clinical Data Together with Neo4j (20)

Research and Development Solutions | Accenture
Research and Development Solutions | AccentureResearch and Development Solutions | Accenture
Research and Development Solutions | Accenture
 
Digital Twin based Product Development in Life Science Industry – Sustainable...
Digital Twin based Product Development in Life Science Industry – Sustainable...Digital Twin based Product Development in Life Science Industry – Sustainable...
Digital Twin based Product Development in Life Science Industry – Sustainable...
 
RADLAB, patient flow management in radiology
RADLAB, patient flow management in radiologyRADLAB, patient flow management in radiology
RADLAB, patient flow management in radiology
 
Supporting Chemical Pharmaceutical Industry
Supporting Chemical Pharmaceutical IndustrySupporting Chemical Pharmaceutical Industry
Supporting Chemical Pharmaceutical Industry
 
Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...
Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...
Presentation (2015.11.20) of healthdata.be project for partners of Collaborat...
 
PatientSchedulingApp - Investor Presentation
PatientSchedulingApp - Investor PresentationPatientSchedulingApp - Investor Presentation
PatientSchedulingApp - Investor Presentation
 
Organised Health Technologies Announced as Winners
Organised Health Technologies Announced as WinnersOrganised Health Technologies Announced as Winners
Organised Health Technologies Announced as Winners
 
Organised Health Technologies Announced as Winners
Organised Health Technologies Announced as WinnersOrganised Health Technologies Announced as Winners
Organised Health Technologies Announced as Winners
 
iHT² Health IT Summit Denver 2013 - Pamela Peele, PhD, Chief Analytics Offic...
 iHT² Health IT Summit Denver 2013 - Pamela Peele, PhD, Chief Analytics Offic... iHT² Health IT Summit Denver 2013 - Pamela Peele, PhD, Chief Analytics Offic...
iHT² Health IT Summit Denver 2013 - Pamela Peele, PhD, Chief Analytics Offic...
 
WHO's Digital Enterprise Migration
WHO's Digital Enterprise Migration WHO's Digital Enterprise Migration
WHO's Digital Enterprise Migration
 
iBiopsy-ppt@RSNA2022_AI_Theater_FINAL (1) (1).pdf
iBiopsy-ppt@RSNA2022_AI_Theater_FINAL (1) (1).pdfiBiopsy-ppt@RSNA2022_AI_Theater_FINAL (1) (1).pdf
iBiopsy-ppt@RSNA2022_AI_Theater_FINAL (1) (1).pdf
 
Madison Park Group - Life Sciences Software M&A Market Update - H1 2018
Madison Park Group - Life Sciences Software M&A Market Update - H1 2018Madison Park Group - Life Sciences Software M&A Market Update - H1 2018
Madison Park Group - Life Sciences Software M&A Market Update - H1 2018
 
Data Science by Chappuis Halder & Co.
Data Science by Chappuis Halder & Co.Data Science by Chappuis Halder & Co.
Data Science by Chappuis Halder & Co.
 
CDISC Related Services
CDISC Related ServicesCDISC Related Services
CDISC Related Services
 
Natalia Korina_3
Natalia Korina_3Natalia Korina_3
Natalia Korina_3
 
B'IMPRESS: References in Health & MedTech _v2312
B'IMPRESS: References in Health & MedTech _v2312B'IMPRESS: References in Health & MedTech _v2312
B'IMPRESS: References in Health & MedTech _v2312
 
Hospital build 2013 - presentation hict
Hospital build 2013 - presentation hictHospital build 2013 - presentation hict
Hospital build 2013 - presentation hict
 
NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...
NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...
NIIT and Denodo: Business Continuity Planning in the times of the Covid-19 Pa...
 
20151028 healthdata WIV-ISP presentation for College van Geneesheren - Collèg...
20151028 healthdata WIV-ISP presentation for College van Geneesheren - Collèg...20151028 healthdata WIV-ISP presentation for College van Geneesheren - Collèg...
20151028 healthdata WIV-ISP presentation for College van Geneesheren - Collèg...
 
20151028 hd College van Geneesheren-Directeurs - Collège des directeurs médicaux
20151028 hd College van Geneesheren-Directeurs - Collège des directeurs médicaux20151028 hd College van Geneesheren-Directeurs - Collège des directeurs médicaux
20151028 hd College van Geneesheren-Directeurs - Collège des directeurs médicaux
 

Más de Neo4j

Más de Neo4j (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansQIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
QIAGEN: Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
ISDEFE - GraphSummit Madrid - ARETA: Aviation Real-Time Emissions Token Accre...
 
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafosBBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
BBVA - GraphSummit Madrid - Caso de éxito en BBVA: Optimizando con grafos
 
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
Graph Everywhere - Josep Taruella - Por qué Graph Data Science en tus modelos...
 
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4jGraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
GraphSummit Madrid - Product Vision and Roadmap - Luis Salvador Neo4j
 
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdfNeo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
Neo4j_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdfRabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
Rabobank_Exploring the Impact of Graph Technology on Financial Services.pdf
 
Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!Webinar - IA generativa e grafi Neo4j: RAG time!
Webinar - IA generativa e grafi Neo4j: RAG time!
 
IA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG timeIA Generativa y Grafos de Neo4j: RAG time
IA Generativa y Grafos de Neo4j: RAG time
 
Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)Neo4j: Data Engineering for RAG (retrieval augmented generation)
Neo4j: Data Engineering for RAG (retrieval augmented generation)
 
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdfNeo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
Neo4j Graph Summit 2024 Workshop - EMEA - Breda_and_Munchen.pdf
 
Enabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge GraphsEnabling GenAI Breakthroughs with Knowledge Graphs
Enabling GenAI Breakthroughs with Knowledge Graphs
 
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdfNeo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
Neo4j_Anurag Tandon_Product Vision and Roadmap.Benelux.pptx.pdf
 
Neo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with GraphNeo4j Jesus Barrasa The Art of the Possible with Graph
Neo4j Jesus Barrasa The Art of the Possible with Graph
 

Último

TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
VishalKumarJha10
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
masabamasaba
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
VictorSzoltysek
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
masabamasaba
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
shinachiaurasa2
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
masabamasaba
 

Último (20)

TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
SHRMPro HRMS Software Solutions Presentation
SHRMPro HRMS Software Solutions PresentationSHRMPro HRMS Software Solutions Presentation
SHRMPro HRMS Software Solutions Presentation
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
10 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 

Bringing Clinical Data Together with Neo4j

  • 1. Covid–19 Graph! Bringing Clinical Data Together with Neo4j Dave Iberson-Hurst Partner, S-cubed Kirsten Walther Langendorf Principal Consultant, S-cubed 17th September 2020 Copenhagen
  • 2. 2 | ©2020 S-cubed S-cubed • A3 Suite platform • MDR • SWB • Linked Data Services • CDISC Training and Support • Regulatory Development Strategy • Clinical Trial Documentation • Marketing Authorisation Applications & Licence Maintenance • EU SME Status, EU OMPD Holder • QA & GXP Services • Statistical Consultancy • SAS Programming • Data Management • CDISC services • Statistical Analysis and Reporting • Quality Assurance • Biostatistics • Clinical Data Management • Pharmacovigilance • Medical Monitoring • Risk Based Monitoring • Operational Reporting • Qlik Extensions Data Analytics (Qlik) Biometrics Clinical Standards Management Regulatory Affairs
  • 3. 3 | ©2020 S-cubed Our “Study” World Collect Organize Analyse ResultsPlan
  • 4. 4 | ©2020 S-cubed Neo4j Use
  • 5. 5 | ©2020 S-cubed Study Workbench
  • 6. 6 | ©2020 S-cubed Study Workbench
  • 7. 7 | ©2020 S-cubed Electronic Health Records
  • 8. 8 | ©2020 S-cubed Mining for Definitions
  • 9. 9 | ©2020 S-cubed And Some Cypher … o We use a lot of rectangular structures, but we can recreate these with Cypher queries
  • 10. 10 | ©2020 S-cubed Change in daily life due to COVID-19 11th March 2020 How can I help? Letter from authorities
  • 11. 11 | ©2020 S-cubed Covidgraph.org Mycontribution
  • 12. 12 | ©2020 S-cubed The source data ClinicalTrials.gov API Limit More studies than limit Study counter
  • 13. 13 | ©2020 S-cubed Looping with Cypher Allowing for all studies to be included Getting the total number of studies and divide by 1000 to get the number of loops needed
  • 14. 14 | ©2020 S-cubed Looping with Cypher Allowing for all studies to be included Getting the total number of studies returned by the looping
  • 15. 15 | ©2020 S-cubed Modelling ClinicalTrials Converting tabular info to a graph My pharma experience with Clinical Trials
  • 16. 16 | ©2020 S-cubed { "NCTId": [ "NCT04366271" ], "LocationFacility": [ "Hospital Universitario de Getafe", "Hospital Universitario de Cruces", "Hospital Universitario de La Princesa", "Hospital Infantil Universitario Niño Jesus", "Hospital Ramón Y Cajal", "Complejo Universitario La Paz" ], "Rank": 2, "LocationCity": [ "Getafe", "Barakaldo", "Madrid", "Madrid", "Madrid", "Madrid" ], "LocationState": [ "Madrid" ], "LocationCountry": [ "Spain", "Spain", "Spain", "Spain", "Spain", "Spain" ] } } From Json to nodes and relationships From lists to graph 1 1 1 UNWIND study_metadata.NCTId as Id match(ct:ClinicalTrial{NCTId:Id}) WITH Id, ct, study_metadata, RANGE(0,size(study_metadata.LocationFacility)-1) as nfacil FOREACH(i in nfacil | MERGE(fa:Facility{name:study_metadata.LocationFacility[i]}) MERGE(ci:City{name:study_metadata.LocationCity[i]}) MERGE(c:Country{name:study_metadata.LocationCountry[i]}) MERGE(ct)-[:CONDUCTED_AT]->(fa) MERGE(fa)-[:LOCATED_IN]->(ci) ) WITH Id, study_metadata, RANGE(0,size(study_metadata.LocationCity)-1) as ncity FOREACH(i in ncity | MERGE(ci:City{name:study_metadata.LocationCity[i]}) MERGE(c:Country{name:study_metadata.LocationCountry[i]}) MERGE(ci)-[:LOCATED_IN]->(c) )
  • 17. 17 | ©2020 S-cubed Missing values In json input ?
  • 18. 18 | ©2020 S-cubed Missing values In json input Split in two
  • 19. 19 | ©2020 S-cubed o Neo4j Professional certification in 2019 • The task with covidgraph was a great opportunity to get some cypher and graph experience o Modelling can be improved • Data consistency can be a challenge when creating nodes and relationships • Could consider some more advanced techniques to ‘clean’ the data (ML) • Need to be aware of missing values in json when creating queries • Next steps: adding trial results o Great to be part of a helpful team • Even though I am not a healthcare professional I can help to provide a better understanding of COVID-19 Lessons Learned Virtual collaboration