SlideShare una empresa de Scribd logo
1 de 58
Toward F.A.I.R. Pharma
PhUSE Linked Data Initiatives
Past and Present
Semantics @ Roche
2019-04-04 9:30-10:00
Outline
I. Introduction
- What is PhUSE?
II. How FAIR is (late phase) Pharma?
III. Toward FAIR Pharma with PhUSE
2
Tim Williams
 Statistical Solutions Lead
 UCB Biosciences, Raleigh North Carolina
PhUSE
 Steering Committee, Computational Sciences
Symposium (CSS)
 Co-lead
 Analysis Results Model (RDF Data Cubes) (2016)
 Clinical Trials Data as RDF (2018)
 Going Translational with Linked Data (present)
 Instructor: Linked Data Hands-on Workshop
3
PhUSE
Pharmaceutical Users Software Exchange
• Membership: >8,700 spanning 30 countries
• Annual Conferences:
• EUConnect (November, Amsterdam)
• USConnect
• Single Day Events
• Computational Sciences Symposium (CSS)
• A working conference
4
PhUSE
Pharmaceutical Users Software Exchange
Mission
• Provide a welcoming, neutral platform for creating
and sharing ideas... exploring innovative
methodologies, techniques, and technologies.
Working Groups Mission
• ...open, transparent, and collaborative forum in a
non-competitive environment
5
PhUSE Linked Data Projects
Recent Work
• CDISC Foundational Standards in RDF
• CDISC Conformance Checks
• Reusing Medical Summaries for Enabling Clinical
Research
• Analysis Results and Metadata (RDF Data Cube)
• Regulatory Guidance in RDF
• Clinical Program Design in RDF
• CDISC Protocol Representation Model in RDF
6
II. How FAIR is (late phase) Pharma?
7
Magic mirror on the wall…
…
is
Pharma
F.A.I.R.
at all??
8
5 Star Open Data Principles
Web, open license, +/- format
Structured, machine readable
Non-proprietary format
URIs
Linked to other data
@NovasTaylor
9
F.A.I.R Data Principles
Findability
F1. globally unique, persistent id
F2. rich metadata
F3. searchable source
F4. metadata specify data id
Accessibility
A1. retrievable by id using standard protocol
A1.1 protocol open, free, universal
A1.2 protocol allows authentication
A2 metadata avail. when data is not
Interoperability
I1. formal, accessible, shared, broadly applicable language
I2. uses FAIR vocabularies
I3. qualified references to other data
Reusability
R1. plurality of accurate and relevant attributes
R1.1 clear and accessible usage license
R1.2 provenance
R1.3 meets domain-relevant standards10
How Does Pharma Fare on FAIR?
Findability
F1. globally unique, persistent id
Human Study Subject "Bob"
• PharmaCo
• Study 1, Drug A
• Study 2, Drug B
• DrugCo
• Study 3, Drug C
Merge Bob’s data from all studies.
PhUSE Project:
“Study URI”
11
How Does Pharma Fare on FAIR?
Accessibility
A2. Metadata available when data
is not
Biostatisticians and Medical Writers do not have
easy access to the metadata from data collection
and transformation processes.
Data changes form during its journey from
collection to analysis.
12
How Does Pharma Fare on FAIR?
Interoperability
I1. shared language for knowledge
Lack
• Knowledge representation
• language
Core Challenge : Change of mindset
• Currently: Data modeled to industry standards for
submission to regulatory authorities
• Future: Models of the process and the entities in
the data.
Linked Data / Knowledge Graph
adoption is helping!
13
How Does Pharma Fare on FAIR?
Reusability
R1.3 data meet domain-relevant
community standards
Positive:
• We have standards!
Negative:
• We have standards
• Historically are row x column structure
• New initiatives are not 5 Star Open or FAIR
14
New CDISC “Library” Initiative
API behind membership paywall
JSON, No RDF download
JSON. XML planned
JSON
Not possible with above restrictions…
15
III. Toward FAIR Pharma with PhUSE
16
Risk-averse Pharma has a skills deficit
17
Database Popularity https://db-engines.com/en/ranking_categories
18
Skills Deficit
Is it more practical to train:
A Knowledge Graph expert in Clinical Trials?
A Clinical Trials expert in Knowledge Graphs?
or
19
CSS Workshop
“Let’s Make a Knowledge Graph!”
• Build your own RDF Linked Data Knowledge Graph
• Sunday Evening, June 9th
https://www.phuse.eu/css19
• Introduction to Linked Data concepts
Also at PhUSE EUConnect, Amsterdam (November)
20
Knowledge Graph Workshop
21
Workshop : Merged Studies
22
Enterprise Knowledge Graphs
for Pharma
23
How?
The Roofshot / Moonshot Manifesto
Concept & Image Attribution: https://rework.withgoogle.com/blog/the-roofshot-manifesto/
1. Study URI
2. Clinical Trial Results Domains
3. Open Ontology Development
Invent and apply state-
of-the-art
Roofshot
Incremental impacts
in production
Moonshot
• Enterprise and Industry
Knowledge Graphs
• Across the Pharma Data
Life Cycle
Roofshot 1 : Study URI
Based on:
“Study URI” – K. Forsberg, D. Goude.
PhUSE EUConnect 18
• Easy entry point for Pharma
• Familiar Concept: NCT Number (clinicalTrials.gov),
EudraCT number
25
Study URI
and across repositories bind them.
One URI to rule them all,
One URI to find them,
One URI to bring them all
26
Study URI
https://github.com/phuse-org/LinkedDataEducation/blob/master/doc/StudyURI.md
• Value for Patients, Researchers, Agencies, Companies
• A unique, immutable Study ID
• Links across repositories
• Returns information
• Context dependent!
27
Wikidata as a Study URI Platform?
• Existing Infrastructure
• Community
• Process
• 27k studies already in Wikidata
http://tinyurl.com/yxqlegy3
28
Study URI as a WikiProject?
• https://www.wikidata.org/wiki/Wikidata:WikiProjects
• Setup a 3 day workshop with stakeholders?
Roofshot 2 : Clinical Trial Results
Domains
1. Clinical Trials Data (SDTM) as
RDF (CTDasRDF)
2. Going Translational with Linked
Data (GoTWLD)
29
Clinical Trials Data as RDF
Project
Co-leads
Dr. Armando Oliva
Medical Informatics Consulting
Semantica LLC
Tim Williams
Statistical Solutions Lead
UCB Biosciences
30
https://www.phuse.eu/white-papers
31
Data Challenges
 Submissions non-conformance*
o 32% of submissions at least 1 conformance flag
o 20% of uploads to JANUS fail
 Variability in standards implementation
o Multiple interpretations of implementation guides
 Version-conversion problems & costs
 Lack of intrinsic metadata
 Challenges linking data and standards
 Limitations in the row-by-column data model
* Conservative estimates
32
Machine-readable, Machine-interpretable
Data
With built-in: • Meaning (semantics)
Interpret
• Context
• Content
• Structure
• Purpose
• Rules + Traceability
• Validity Trustworthiness
33
Project Philosophy
• Model the concepts represented in SDTM
• Map instance data to the graph model
• Re-use where possible
– Ontologies, Terminologies
• Open Source, pre-competitive, cooperative
environment
– Results available to everyone
Resource Description Framework (RDF)
34
Re-use Existing Sources
SDTM Terminology
study
“mini” study Ontology
SNOMED
Systematized Nomenclature of Med.
Indication Condition
MedDRA
Med. Dictionary for Regulatory Activities
Adverse Events
UNII
Unique Ingredient Identifier
Substances
ND-FRT
National Drug File Ref. Terminology
Pharmacologic Class
WHO Drug Dictionary
World Health Org. Drug Dictionary
Medical Products
RDF
non-RDF
35
Study
HumanStudySubject
Medical ConditionStudy Activity
DrugProduct
hasStudyParticipant
afflictedBy
participatesIn
hasProcotol
SpecifiedActivity
administered
Drug
participatesInStudy
Concept
36
Adverse Event Representation
SDTM & BRIDG Our Model
Observation Medical Condition
temporally associated
with an Intervention
37
Adverse EventInterventionFinding
Assessment
Adverse Event
Hypertension
input
output
Assessor
Intervention
Drug X 10mg qd
2019-09-30
Observation
e.g. BP 160/110
2019-10-10
Medical
Condition
Hypertension
SDTM, BRIDG
Our Model
Observation
38
39
Controlled
Terminology Study Protocol
DM SUPPDM EX VS TS AE
Clinical Trials Data as RDF
SDTM Modeling and Data Conversion
40
Project on GitHub
https://github.com/phuse-org/CTDasRDF
41
Going Translational with Linked Data
42
Going Translational with Linked Data
Project
Co-leads
Drashtti Vasant
R&D IT Business Partner
Translational Sciences at Bayer Business Services
Dr. Armando Oliva
Medical Informatics Consulting
Semantica LLC
Tim Williams
Statistical Solutions Lead
UCB Biosciences
43
Project Focus
• Ontology Development
• Data Conversion to RDF
• Define XML
• Non-clinical models & data [new]
44
Study Protocol
DM SUPPDM EX VS TS AE
Clinical Trials Data as RDF
Going Translational With Linked Data
DM
+SEND
SUPPDM
+SEND
EX
+SEND
VS
+SEND
TS
+SEND
AE
+SEND
Protocol
+SEND
Study+SEND
?
+SEND
?
+SEND
Controlled
Terminology
Controlled
Terminology +SEND
Modeling and Data Conversion
45
Deliverables
Objective Timeline
Project Initiation February 2019
Supporting Ontologies PhUSE CSS 2020
Instance Data, Define
XML
PhUSE CSS 2020
Presentations PhUSE CSS 2019, 2020
PhUSE EUConnect 2019
Conclusion PhUSE CSS 2020
46
Additional Initiatives!
• SPARQL Endpoint for Project Data
o Query data online, from your desktop
Related Philosophy:
Current : Data Submission
Future : Data Sharing
• MedDRA Modeling and Conversion Process
47
Roofshot 3: Cooperative Ontology
Development
• The challenges cannot be solved by any one
person, department, company, agency, or
organization.
• We must cooperate as an industry.
48
Existing Ontologies
49
How do we ‘open source’ ontology
development?
• Github?
• Existing pre-competitive organizations?
• PhUSE
• TransCelerate
• Pistoia Alliance
50
Open Source Ontology Challenges
• Gate keeper
• Conflict resolution (approach, code)
• Company
• Participation
• Contribution
• Volunteers
51
Ontology Availability and Curation
Leverage Existing Portals?
• Open PHACTS
• OBO Foundry
• BioPortal
• ….others?
52
Ontologies must be open and
Accessible
Don’t hide my OWL !
53
Conclusion
54
Linked Data Adoption in Pharma
• Companies adopting
and adapting
CTDasRDF outputs
• Knowledge Graph
initiatives at:
– AstraZeneca
– Bayer
– Roche
– Sanofi
– ….and more
• External datasets
published as RDF
– EBI RDF platform
– openPhacts
– OpenTargets
– …and more
55
The Future
Knowledge Graphs are here to stay
“The application of graph processing and graph
DBMSs will grow at 100 percent annually
through 2022..." – Gartner “Top 10 Data and Analytics
Technology Trends for 2019”
56
Magic mirror on the wall…
…
Pharma
is getting
F.A.I.R-er
after all.
57
Thank you!
Contact
Email: tim.williams@phuse.eu
LinkedIn: https://www.linkedin.com/in/timpwilliams
Twitter : @NovasTaylor
58

Más contenido relacionado

La actualidad más candente

Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Tom Plasterer
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesPistoia Alliance
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...David Peyruc
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences Pistoia Alliance
 
BioIT 2017 - Ontoforce and Amgen Gene Knowledge Discovery
BioIT 2017 - Ontoforce and Amgen Gene Knowledge DiscoveryBioIT 2017 - Ontoforce and Amgen Gene Knowledge Discovery
BioIT 2017 - Ontoforce and Amgen Gene Knowledge DiscoveryWolfgang G. Hoeck
 
The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data OSTHUS
 
II-SDV 2012 Text Mining, Term Mining and Visualization - Improving the Impac...
II-SDV 2012 Text Mining, Term Mining and Visualization  - Improving the Impac...II-SDV 2012 Text Mining, Term Mining and Visualization  - Improving the Impac...
II-SDV 2012 Text Mining, Term Mining and Visualization - Improving the Impac...Dr. Haxel Consult
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIPistoia Alliance
 
THOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier LinkingTHOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier LinkingMaaike Duine
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance
 
Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesPistoia Alliance
 
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...Dr. Haxel Consult
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationMichel Dumontier
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020Pistoia Alliance
 
Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium Kerstin Forsberg
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance
 

La actualidad más candente (20)

Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences
 
BioIT 2017 - Ontoforce and Amgen Gene Knowledge Discovery
BioIT 2017 - Ontoforce and Amgen Gene Knowledge DiscoveryBioIT 2017 - Ontoforce and Amgen Gene Knowledge Discovery
BioIT 2017 - Ontoforce and Amgen Gene Knowledge Discovery
 
The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data The Fast Track to Fair Lab Data
The Fast Track to Fair Lab Data
 
II-SDV 2012 Text Mining, Term Mining and Visualization - Improving the Impac...
II-SDV 2012 Text Mining, Term Mining and Visualization  - Improving the Impac...II-SDV 2012 Text Mining, Term Mining and Visualization  - Improving the Impac...
II-SDV 2012 Text Mining, Term Mining and Visualization - Improving the Impac...
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBI
 
THOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier LinkingTHOR Workshop - Persistent Identifier Linking
THOR Workshop - Persistent Identifier Linking
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 
Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
 
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
AI-SDV 2020: Combining Knowledge and Machine Learning for the Analysis of Sci...
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluation
 
SciBite
SciBiteSciBite
SciBite
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020
 
Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium
 
TAIR ICAR 2010 Presentation
TAIR ICAR 2010 PresentationTAIR ICAR 2010 Presentation
TAIR ICAR 2010 Presentation
 
Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016Pistoia Alliance USA Conference 2016
Pistoia Alliance USA Conference 2016
 

Similar a Toward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and Present

Open science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The HyveOpen science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The HyveKees van Bochove
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Tom Plasterer
 
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveOpen Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveKees van Bochove
 
ODIN: Connecting research and researchers
ODIN: Connecting research and researchersODIN: Connecting research and researchers
ODIN: Connecting research and researchersSergio Ruiz
 
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Kees van Bochove
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingDenodo
 
effective data sharing for a learning healthcare system
effective data sharing for a learning healthcare systemeffective data sharing for a learning healthcare system
effective data sharing for a learning healthcare systemPaul Houston
 
CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECAProject
 
The FAIR Principles and the IMI FAIRplus project
The FAIR Principles and the IMI FAIRplus projectThe FAIR Principles and the IMI FAIRplus project
The FAIR Principles and the IMI FAIRplus projectSusanna-Assunta Sansone
 
Workshop intro090314
Workshop intro090314Workshop intro090314
Workshop intro090314Philip Bourne
 
FAIRsharing: curation and governance of an ecosystem of research standards an...
FAIRsharing: curation and governance of an ecosystem of research standards an...FAIRsharing: curation and governance of an ecosystem of research standards an...
FAIRsharing: curation and governance of an ecosystem of research standards an...Allyson Lister
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...William Gunn
 
Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019Kees van Bochove
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
 
Clinical Research Informatics Year-in-Review 2024
Clinical Research Informatics Year-in-Review 2024Clinical Research Informatics Year-in-Review 2024
Clinical Research Informatics Year-in-Review 2024Peter Embi
 
DataFAIRy bioassays pilot -- lessons learned and future outlook
DataFAIRy bioassays pilot -- lessons learned and future outlookDataFAIRy bioassays pilot -- lessons learned and future outlook
DataFAIRy bioassays pilot -- lessons learned and future outlookIsabella Feierberg
 
RDA Scholarly Infrastructure 2015
RDA Scholarly Infrastructure 2015RDA Scholarly Infrastructure 2015
RDA Scholarly Infrastructure 2015William Gunn
 
cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)Pistoia Alliance
 

Similar a Toward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and Present (20)

Open science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The HyveOpen science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The Hyve
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
 
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveOpen Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
 
ODIN: Connecting research and researchers
ODIN: Connecting research and researchersODIN: Connecting research and researchers
ODIN: Connecting research and researchers
 
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes Biobanking
 
effective data sharing for a learning healthcare system
effective data sharing for a learning healthcare systemeffective data sharing for a learning healthcare system
effective data sharing for a learning healthcare system
 
CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...
 
The FAIR Principles and the IMI FAIRplus project
The FAIR Principles and the IMI FAIRplus projectThe FAIR Principles and the IMI FAIRplus project
The FAIR Principles and the IMI FAIRplus project
 
Workshop intro090314
Workshop intro090314Workshop intro090314
Workshop intro090314
 
FAIRsharing: curation and governance of an ecosystem of research standards an...
FAIRsharing: curation and governance of an ecosystem of research standards an...FAIRsharing: curation and governance of an ecosystem of research standards an...
FAIRsharing: curation and governance of an ecosystem of research standards an...
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
Clinical Research Informatics Year-in-Review 2024
Clinical Research Informatics Year-in-Review 2024Clinical Research Informatics Year-in-Review 2024
Clinical Research Informatics Year-in-Review 2024
 
DataFAIRy bioassays pilot -- lessons learned and future outlook
DataFAIRy bioassays pilot -- lessons learned and future outlookDataFAIRy bioassays pilot -- lessons learned and future outlook
DataFAIRy bioassays pilot -- lessons learned and future outlook
 
RDA Scholarly Infrastructure 2015
RDA Scholarly Infrastructure 2015RDA Scholarly Infrastructure 2015
RDA Scholarly Infrastructure 2015
 
cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)
 

Último

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 

Último (20)

How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

Toward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and Present

  • 1. Toward F.A.I.R. Pharma PhUSE Linked Data Initiatives Past and Present Semantics @ Roche 2019-04-04 9:30-10:00
  • 2. Outline I. Introduction - What is PhUSE? II. How FAIR is (late phase) Pharma? III. Toward FAIR Pharma with PhUSE 2
  • 3. Tim Williams  Statistical Solutions Lead  UCB Biosciences, Raleigh North Carolina PhUSE  Steering Committee, Computational Sciences Symposium (CSS)  Co-lead  Analysis Results Model (RDF Data Cubes) (2016)  Clinical Trials Data as RDF (2018)  Going Translational with Linked Data (present)  Instructor: Linked Data Hands-on Workshop 3
  • 4. PhUSE Pharmaceutical Users Software Exchange • Membership: >8,700 spanning 30 countries • Annual Conferences: • EUConnect (November, Amsterdam) • USConnect • Single Day Events • Computational Sciences Symposium (CSS) • A working conference 4
  • 5. PhUSE Pharmaceutical Users Software Exchange Mission • Provide a welcoming, neutral platform for creating and sharing ideas... exploring innovative methodologies, techniques, and technologies. Working Groups Mission • ...open, transparent, and collaborative forum in a non-competitive environment 5
  • 6. PhUSE Linked Data Projects Recent Work • CDISC Foundational Standards in RDF • CDISC Conformance Checks • Reusing Medical Summaries for Enabling Clinical Research • Analysis Results and Metadata (RDF Data Cube) • Regulatory Guidance in RDF • Clinical Program Design in RDF • CDISC Protocol Representation Model in RDF 6
  • 7. II. How FAIR is (late phase) Pharma? 7
  • 8. Magic mirror on the wall… … is Pharma F.A.I.R. at all?? 8
  • 9. 5 Star Open Data Principles Web, open license, +/- format Structured, machine readable Non-proprietary format URIs Linked to other data @NovasTaylor 9
  • 10. F.A.I.R Data Principles Findability F1. globally unique, persistent id F2. rich metadata F3. searchable source F4. metadata specify data id Accessibility A1. retrievable by id using standard protocol A1.1 protocol open, free, universal A1.2 protocol allows authentication A2 metadata avail. when data is not Interoperability I1. formal, accessible, shared, broadly applicable language I2. uses FAIR vocabularies I3. qualified references to other data Reusability R1. plurality of accurate and relevant attributes R1.1 clear and accessible usage license R1.2 provenance R1.3 meets domain-relevant standards10
  • 11. How Does Pharma Fare on FAIR? Findability F1. globally unique, persistent id Human Study Subject "Bob" • PharmaCo • Study 1, Drug A • Study 2, Drug B • DrugCo • Study 3, Drug C Merge Bob’s data from all studies. PhUSE Project: “Study URI” 11
  • 12. How Does Pharma Fare on FAIR? Accessibility A2. Metadata available when data is not Biostatisticians and Medical Writers do not have easy access to the metadata from data collection and transformation processes. Data changes form during its journey from collection to analysis. 12
  • 13. How Does Pharma Fare on FAIR? Interoperability I1. shared language for knowledge Lack • Knowledge representation • language Core Challenge : Change of mindset • Currently: Data modeled to industry standards for submission to regulatory authorities • Future: Models of the process and the entities in the data. Linked Data / Knowledge Graph adoption is helping! 13
  • 14. How Does Pharma Fare on FAIR? Reusability R1.3 data meet domain-relevant community standards Positive: • We have standards! Negative: • We have standards • Historically are row x column structure • New initiatives are not 5 Star Open or FAIR 14
  • 15. New CDISC “Library” Initiative API behind membership paywall JSON, No RDF download JSON. XML planned JSON Not possible with above restrictions… 15
  • 16. III. Toward FAIR Pharma with PhUSE 16
  • 17. Risk-averse Pharma has a skills deficit 17
  • 19. Skills Deficit Is it more practical to train: A Knowledge Graph expert in Clinical Trials? A Clinical Trials expert in Knowledge Graphs? or 19
  • 20. CSS Workshop “Let’s Make a Knowledge Graph!” • Build your own RDF Linked Data Knowledge Graph • Sunday Evening, June 9th https://www.phuse.eu/css19 • Introduction to Linked Data concepts Also at PhUSE EUConnect, Amsterdam (November) 20
  • 22. Workshop : Merged Studies 22
  • 24. The Roofshot / Moonshot Manifesto Concept & Image Attribution: https://rework.withgoogle.com/blog/the-roofshot-manifesto/ 1. Study URI 2. Clinical Trial Results Domains 3. Open Ontology Development Invent and apply state- of-the-art Roofshot Incremental impacts in production Moonshot • Enterprise and Industry Knowledge Graphs • Across the Pharma Data Life Cycle
  • 25. Roofshot 1 : Study URI Based on: “Study URI” – K. Forsberg, D. Goude. PhUSE EUConnect 18 • Easy entry point for Pharma • Familiar Concept: NCT Number (clinicalTrials.gov), EudraCT number 25
  • 26. Study URI and across repositories bind them. One URI to rule them all, One URI to find them, One URI to bring them all 26
  • 27. Study URI https://github.com/phuse-org/LinkedDataEducation/blob/master/doc/StudyURI.md • Value for Patients, Researchers, Agencies, Companies • A unique, immutable Study ID • Links across repositories • Returns information • Context dependent! 27
  • 28. Wikidata as a Study URI Platform? • Existing Infrastructure • Community • Process • 27k studies already in Wikidata http://tinyurl.com/yxqlegy3 28 Study URI as a WikiProject? • https://www.wikidata.org/wiki/Wikidata:WikiProjects • Setup a 3 day workshop with stakeholders?
  • 29. Roofshot 2 : Clinical Trial Results Domains 1. Clinical Trials Data (SDTM) as RDF (CTDasRDF) 2. Going Translational with Linked Data (GoTWLD) 29
  • 30. Clinical Trials Data as RDF Project Co-leads Dr. Armando Oliva Medical Informatics Consulting Semantica LLC Tim Williams Statistical Solutions Lead UCB Biosciences 30
  • 32. Data Challenges  Submissions non-conformance* o 32% of submissions at least 1 conformance flag o 20% of uploads to JANUS fail  Variability in standards implementation o Multiple interpretations of implementation guides  Version-conversion problems & costs  Lack of intrinsic metadata  Challenges linking data and standards  Limitations in the row-by-column data model * Conservative estimates 32
  • 33. Machine-readable, Machine-interpretable Data With built-in: • Meaning (semantics) Interpret • Context • Content • Structure • Purpose • Rules + Traceability • Validity Trustworthiness 33
  • 34. Project Philosophy • Model the concepts represented in SDTM • Map instance data to the graph model • Re-use where possible – Ontologies, Terminologies • Open Source, pre-competitive, cooperative environment – Results available to everyone Resource Description Framework (RDF) 34
  • 35. Re-use Existing Sources SDTM Terminology study “mini” study Ontology SNOMED Systematized Nomenclature of Med. Indication Condition MedDRA Med. Dictionary for Regulatory Activities Adverse Events UNII Unique Ingredient Identifier Substances ND-FRT National Drug File Ref. Terminology Pharmacologic Class WHO Drug Dictionary World Health Org. Drug Dictionary Medical Products RDF non-RDF 35
  • 37. Adverse Event Representation SDTM & BRIDG Our Model Observation Medical Condition temporally associated with an Intervention 37
  • 38. Adverse EventInterventionFinding Assessment Adverse Event Hypertension input output Assessor Intervention Drug X 10mg qd 2019-09-30 Observation e.g. BP 160/110 2019-10-10 Medical Condition Hypertension SDTM, BRIDG Our Model Observation 38
  • 39. 39
  • 40. Controlled Terminology Study Protocol DM SUPPDM EX VS TS AE Clinical Trials Data as RDF SDTM Modeling and Data Conversion 40
  • 42. Going Translational with Linked Data 42
  • 43. Going Translational with Linked Data Project Co-leads Drashtti Vasant R&D IT Business Partner Translational Sciences at Bayer Business Services Dr. Armando Oliva Medical Informatics Consulting Semantica LLC Tim Williams Statistical Solutions Lead UCB Biosciences 43
  • 44. Project Focus • Ontology Development • Data Conversion to RDF • Define XML • Non-clinical models & data [new] 44
  • 45. Study Protocol DM SUPPDM EX VS TS AE Clinical Trials Data as RDF Going Translational With Linked Data DM +SEND SUPPDM +SEND EX +SEND VS +SEND TS +SEND AE +SEND Protocol +SEND Study+SEND ? +SEND ? +SEND Controlled Terminology Controlled Terminology +SEND Modeling and Data Conversion 45
  • 46. Deliverables Objective Timeline Project Initiation February 2019 Supporting Ontologies PhUSE CSS 2020 Instance Data, Define XML PhUSE CSS 2020 Presentations PhUSE CSS 2019, 2020 PhUSE EUConnect 2019 Conclusion PhUSE CSS 2020 46
  • 47. Additional Initiatives! • SPARQL Endpoint for Project Data o Query data online, from your desktop Related Philosophy: Current : Data Submission Future : Data Sharing • MedDRA Modeling and Conversion Process 47
  • 48. Roofshot 3: Cooperative Ontology Development • The challenges cannot be solved by any one person, department, company, agency, or organization. • We must cooperate as an industry. 48
  • 50. How do we ‘open source’ ontology development? • Github? • Existing pre-competitive organizations? • PhUSE • TransCelerate • Pistoia Alliance 50
  • 51. Open Source Ontology Challenges • Gate keeper • Conflict resolution (approach, code) • Company • Participation • Contribution • Volunteers 51
  • 52. Ontology Availability and Curation Leverage Existing Portals? • Open PHACTS • OBO Foundry • BioPortal • ….others? 52
  • 53. Ontologies must be open and Accessible Don’t hide my OWL ! 53
  • 55. Linked Data Adoption in Pharma • Companies adopting and adapting CTDasRDF outputs • Knowledge Graph initiatives at: – AstraZeneca – Bayer – Roche – Sanofi – ….and more • External datasets published as RDF – EBI RDF platform – openPhacts – OpenTargets – …and more 55
  • 56. The Future Knowledge Graphs are here to stay “The application of graph processing and graph DBMSs will grow at 100 percent annually through 2022..." – Gartner “Top 10 Data and Analytics Technology Trends for 2019” 56
  • 57. Magic mirror on the wall… … Pharma is getting F.A.I.R-er after all. 57
  • 58. Thank you! Contact Email: tim.williams@phuse.eu LinkedIn: https://www.linkedin.com/in/timpwilliams Twitter : @NovasTaylor 58