SlideShare una empresa de Scribd logo
Supporting Ontology-Based
Standardization of Biomedical Metadata in
the CEDAR Workbench
Marcos Martínez-Romero
Stanford University
Stanford Universitymetadatacenter.org
EDAR
OR EXPANDED DATA
ION AND RETRIEVAL
CEDAR
CENTER FOR EXPANDED DATA
ANNOTATION AND RETRIEVAL
CEDAR
DAR
DAR
CENTER FOR EXPANDED DATA
9/14/2017
2
age
Age
AGE
`Age
age (after birth)
age (in years)
age (y)
age (year)
age (years)
Age (years)
Age (Years)
age (yr)
age (yr-old)
age (yrs)
Age (yrs)
age [y]
age [year]
age [years]
age in years
age of patient
Age of patient
age of subjects
age(years)
Age(years)
Age(yrs.)
Age, year
age, years
age, yrs
age.year
age_years
Metadata are not standardized
3
age
Age
AGE
`Age
age (after birth)
age (in years)
age (y)
age (year)
age (years)
Age (years)
Age (Years)
age (yr)
age (yr-old)
age (yrs)
Age (yrs)
age [y]
age [year]
age [years]
age in years
age of patient
Age of patient
age of subjects
age(years)
Age(years)
Age(yrs.)
Age, year
age, years
age, yrs
age.year
age_years
Metadata are not standardized
Age-Years (NCIT)
(http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C37908)
“The	length	of	a	person's	life,	stated	in	years	since	birth.”
4
It’s extremely hard to:
–find experimental datasets
–understand how the experiments were
performed
–replicate study findings
Metadata are not standardized
5
Generating standard metadata is hard
• Submission formats rarely support
ontology terms
• No easy way of finding terms from
ontologies and including them into
metadata submissions
6
7
Semantic ecosystem to enable the
creation of high-quality metadata in
biomedicine
8
The CEDAR Workbench
Template Designer Metadata Editor
Template authors Metadata authors
design
templates
Metadata Repository
template
fill in templates
with metadata
metadata
Public Databases
LINCS
submit
metadata
Biomedical Ontologies 9
Template Designer Metadata Editor
Template authors Metadata authors
design
templates
Metadata Repository
template
fill in templates
with metadata
metadata
Public Databases
LINCS
submit
metadata
Biomedical Ontologies
The CEDAR Workbench
10
11
12
13
14
15
16
17
18
19
20
21
Template Designer Metadata Editor
Template authors Metadata authors
design
templates
Metadata Repository
template
fill in templates
with metadata
metadata
Public Databases
LINCS
submit
metadata
Biomedical Ontologies
The CEDAR Workbench
22
23
24
{
"@context": {
"rdfs": "http://www.w3.org/2000/01/rdf-schema#",
"xsd": "http://www.w3.org/2001/XMLSchema#",
"pav": "http://purl.org/pav/",
//...
"Title": "http://purl.obolibrary.org/obo/NGS_0000055",
"Disorder": "http://purl.org/net/OCRe/OCRe.owl#OCRE900086",
"Institution": "http://semantic-dicom.org/dcm#InstitutionName",
"Principal Investigator": "http://purl.org/net/OCRe/OCRe.owl#OCRE901006",
"Study Type": "http://purl.obolibrary.org/obo/NGS_0000056"
},
"@type": "http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C63536",
"Title": {
"@value": "A sample study"
},
"Disorder": {
"@id": "http://purl.obolibrary.org/obo/DOID_8986",
"rdfs:label": "narcolepsy"
},
"Institution": {
"@value": "Stanford University"
},
"Principal Investigator": {
"@value": "John Doe"
},
"Study Type": {
"@id": "http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C15273",
"rdfs:label": "Longitudinal Study"
},
// ...
"schema:isBasedOn": "https://repo.metadatacenter.orgx/templates/6381a0ce-3904-4885-bc44-5caacb4ad0e6",
"schema:name": "Study metadata",
"schema:description": "Study template",
"pav:createdOn": "2017-09-05T09:50:28-0700",
"pav:createdBy": "https://metadatacenter.org/users/8d787b98-33dd-4aff-a88c-440caf452c61",
"pav:lastUpdatedOn": "2017-09-05T09:50:28-0700",
"oslc:modifiedBy": "https://metadatacenter.org/users/8d787b98-33dd-4aff-a88c-440caf452c61",
"@id": "https://repo.metadatacenter.orgx/template-instances/ffe856e7-d920-480d-a666-009041f609e3"
}
25
Value Set Creation
• Lists of permissible
values for fields
• Example:
Longitudinal study types
– Prospective study
– Retrospective study
– Hybrid design
26
Class Creation
• Dynamically define a
new class and
immediately use it
• Optionally link it to
existing classes
– Ontology maintainers
may use this
information to enrich
their ontologies
• Example:
adductor dorsalis
27
Class Creation
CEDAR Provisional Classes
(CEDARPC)
UBERON
adductor
dorsalis
adductor
muscle
subclassOf
28
Evaluation
• The LINCS Consortium
– Cellular signatures
• ImmPort
– Immunology
• AIRR Community
– Datasets acquired using sequencing
– Submission to NCBI BioSample
• Stanford University Libraries
29
Summary
• Authoring metadata is hard and time-consuming
• Authoring semantic metadata is even harder
– Lack of convenient tools for linking metadata to
ontologies in a metadata authoring workflow
• The CEDAR Workbench facilitates metadata
creation in a semantically rigorous way
– Add type and property assertions
– Constrain the values of fields to ontology terms
– Create classes and value sets
http://metadatacenter.org
http://cedar.metadatacenter.net
30
CEDAR
CENTER FOR EXPANDED DATA
ANNOTATION AND RETRIEVAL
CEDAR
CENTER FOR EXPANDED DATA
ANNOTATION AND RETRIEVAL
CEDAR
CEDAR
CEDAR
I
Metadata
Thanks!
31

Más contenido relacionado

Similar a ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata in the CEDAR Workbench

Hadoop con 2016_9_10_王經篤(Jing-Doo Wang)
Hadoop con 2016_9_10_王經篤(Jing-Doo Wang)Hadoop con 2016_9_10_王經篤(Jing-Doo Wang)
Hadoop con 2016_9_10_王經篤(Jing-Doo Wang)
Jing-Doo Wang
 
CEDAR work bench for metadata management
CEDAR work bench for metadata managementCEDAR work bench for metadata management
CEDAR work bench for metadata management
Pistoia Alliance
 
Metadata as Linked Data for Research Data Repositories
Metadata as Linked Data for Research Data RepositoriesMetadata as Linked Data for Research Data Repositories
Metadata as Linked Data for Research Data Repositories
andrea huang
 
Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014
Susanna-Assunta Sansone
 
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
Peter McQuilton
 
University of Manchester Symposium 2012: Extraction and Representation of in ...
University of Manchester Symposium 2012: Extraction and Representation of in ...University of Manchester Symposium 2012: Extraction and Representation of in ...
University of Manchester Symposium 2012: Extraction and Representation of in ...
geraintduck
 
2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible research2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible research
Yannick Wurm
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
Carole Goble
 
First Do No Harm - Droidcon Boston
First Do No Harm - Droidcon BostonFirst Do No Harm - Droidcon Boston
First Do No Harm - Droidcon Boston
Annyce Davis
 
Bioschemas: Using Schema.org for describing scientific information
Bioschemas: Using Schema.org for describing scientific information Bioschemas: Using Schema.org for describing scientific information
Bioschemas: Using Schema.org for describing scientific information
Bioschemas
 
Building bioinformatics resources for the global community
Building bioinformatics resources for the global communityBuilding bioinformatics resources for the global community
Building bioinformatics resources for the global community
ExternalEvents
 
Canadian health census to lod
Canadian health census to lodCanadian health census to lod
Canadian health census to lod
Syed Ahmad Chan Bukhari, PhD
 
Data curation issues for repositories
Data curation issues for repositoriesData curation issues for repositories
Data curation issues for repositories
Chris Rusbridge
 
Ontologias
OntologiasOntologias
Ontologias
Sérgio Santos
 
The Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, FutureThe Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, Future
myGrid team
 
Fast and Accurate Metadata Authoring Using Ontology-Based Recommendations (AM...
Fast and Accurate Metadata Authoring Using Ontology-Based Recommendations (AM...Fast and Accurate Metadata Authoring Using Ontology-Based Recommendations (AM...
Fast and Accurate Metadata Authoring Using Ontology-Based Recommendations (AM...
CEDAR: Center for Expanded Data Annotation and Retrieval
 
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
 Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ... Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
Syed Ahmad Chan Bukhari, PhD
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
ICZN
 
CBS CEDAR Presentation
CBS CEDAR PresentationCBS CEDAR Presentation
CBS CEDAR Presentation
Albert Meroño-Peñuela
 
Profile Serialization IIPC GA 2015
Profile Serialization IIPC GA 2015Profile Serialization IIPC GA 2015
Profile Serialization IIPC GA 2015
Sawood Alam
 

Similar a ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata in the CEDAR Workbench (20)

Hadoop con 2016_9_10_王經篤(Jing-Doo Wang)
Hadoop con 2016_9_10_王經篤(Jing-Doo Wang)Hadoop con 2016_9_10_王經篤(Jing-Doo Wang)
Hadoop con 2016_9_10_王經篤(Jing-Doo Wang)
 
CEDAR work bench for metadata management
CEDAR work bench for metadata managementCEDAR work bench for metadata management
CEDAR work bench for metadata management
 
Metadata as Linked Data for Research Data Repositories
Metadata as Linked Data for Research Data RepositoriesMetadata as Linked Data for Research Data Repositories
Metadata as Linked Data for Research Data Repositories
 
Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014Oxford DTP - Sansone curation tools - Dec 2014
Oxford DTP - Sansone curation tools - Dec 2014
 
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
BioSharing - RDA Plenary 6 - Metadata Standards Catalog WG and BioSharing WG ...
 
University of Manchester Symposium 2012: Extraction and Representation of in ...
University of Manchester Symposium 2012: Extraction and Representation of in ...University of Manchester Symposium 2012: Extraction and Representation of in ...
University of Manchester Symposium 2012: Extraction and Representation of in ...
 
2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible research2014 11-13-sbsm032-reproducible research
2014 11-13-sbsm032-reproducible research
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
 
First Do No Harm - Droidcon Boston
First Do No Harm - Droidcon BostonFirst Do No Harm - Droidcon Boston
First Do No Harm - Droidcon Boston
 
Bioschemas: Using Schema.org for describing scientific information
Bioschemas: Using Schema.org for describing scientific information Bioschemas: Using Schema.org for describing scientific information
Bioschemas: Using Schema.org for describing scientific information
 
Building bioinformatics resources for the global community
Building bioinformatics resources for the global communityBuilding bioinformatics resources for the global community
Building bioinformatics resources for the global community
 
Canadian health census to lod
Canadian health census to lodCanadian health census to lod
Canadian health census to lod
 
Data curation issues for repositories
Data curation issues for repositoriesData curation issues for repositories
Data curation issues for repositories
 
Ontologias
OntologiasOntologias
Ontologias
 
The Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, FutureThe Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, Future
 
Fast and Accurate Metadata Authoring Using Ontology-Based Recommendations (AM...
Fast and Accurate Metadata Authoring Using Ontology-Based Recommendations (AM...Fast and Accurate Metadata Authoring Using Ontology-Based Recommendations (AM...
Fast and Accurate Metadata Authoring Using Ontology-Based Recommendations (AM...
 
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
 Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ... Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
Use of CEDAR Technology for Ontology-based Submission of Biomedical Data to ...
 
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
Yde de Jong & Dave Roberts - ZooBank and EDIT: Towards a business model for Z...
 
CBS CEDAR Presentation
CBS CEDAR PresentationCBS CEDAR Presentation
CBS CEDAR Presentation
 
Profile Serialization IIPC GA 2015
Profile Serialization IIPC GA 2015Profile Serialization IIPC GA 2015
Profile Serialization IIPC GA 2015
 

Último

Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussionArtificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
OECD Directorate for Financial and Enterprise Affairs
 
2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf
Frederic Leger
 
ASONAM2023_presection_slide_track-recommendation.pdf
ASONAM2023_presection_slide_track-recommendation.pdfASONAM2023_presection_slide_track-recommendation.pdf
ASONAM2023_presection_slide_track-recommendation.pdf
ToshihiroIto4
 
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
OECD Directorate for Financial and Enterprise Affairs
 
Competition and Regulation in Professions and Occupations – ROBSON – June 202...
Competition and Regulation in Professions and Occupations – ROBSON – June 202...Competition and Regulation in Professions and Occupations – ROBSON – June 202...
Competition and Regulation in Professions and Occupations – ROBSON – June 202...
OECD Directorate for Financial and Enterprise Affairs
 
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
OECD Directorate for Financial and Enterprise Affairs
 
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
gpww3sf4
 
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
OECD Directorate for Financial and Enterprise Affairs
 
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdfBRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
Robin Haunschild
 
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
OECD Directorate for Financial and Enterprise Affairs
 
Why Psychological Safety Matters for Software Teams - ACE 2024 - Ben Linders.pdf
Why Psychological Safety Matters for Software Teams - ACE 2024 - Ben Linders.pdfWhy Psychological Safety Matters for Software Teams - ACE 2024 - Ben Linders.pdf
Why Psychological Safety Matters for Software Teams - ACE 2024 - Ben Linders.pdf
Ben Linders
 
Disaster Management project for holidays homework and other uses
Disaster Management project for holidays homework and other usesDisaster Management project for holidays homework and other uses
Disaster Management project for holidays homework and other uses
RIDHIMAGARG21
 
Gregory Harris - Cycle 2 - Civics Presentation
Gregory Harris - Cycle 2 - Civics PresentationGregory Harris - Cycle 2 - Civics Presentation
Gregory Harris - Cycle 2 - Civics Presentation
gharris9
 
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussionPro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
OECD Directorate for Financial and Enterprise Affairs
 
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
OECD Directorate for Financial and Enterprise Affairs
 
Using-Presentation-Software-to-the-Fullf.pptx
Using-Presentation-Software-to-the-Fullf.pptxUsing-Presentation-Software-to-the-Fullf.pptx
Using-Presentation-Software-to-the-Fullf.pptx
kainatfatyma9
 
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
Suzanne Lagerweij
 
The Intersection between Competition and Data Privacy – COLANGELO – June 2024...
The Intersection between Competition and Data Privacy – COLANGELO – June 2024...The Intersection between Competition and Data Privacy – COLANGELO – June 2024...
The Intersection between Competition and Data Privacy – COLANGELO – June 2024...
OECD Directorate for Financial and Enterprise Affairs
 
Carrer goals.pptx and their importance in real life
Carrer goals.pptx  and their importance in real lifeCarrer goals.pptx  and their importance in real life
Carrer goals.pptx and their importance in real life
artemacademy2
 
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie WellsCollapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Rosie Wells
 

Último (20)

Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussionArtificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
Artificial Intelligence, Data and Competition – LIM – June 2024 OECD discussion
 
2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf2024-05-30_meetup_devops_aix-marseille.pdf
2024-05-30_meetup_devops_aix-marseille.pdf
 
ASONAM2023_presection_slide_track-recommendation.pdf
ASONAM2023_presection_slide_track-recommendation.pdfASONAM2023_presection_slide_track-recommendation.pdf
ASONAM2023_presection_slide_track-recommendation.pdf
 
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
The Intersection between Competition and Data Privacy – KEMP – June 2024 OECD...
 
Competition and Regulation in Professions and Occupations – ROBSON – June 202...
Competition and Regulation in Professions and Occupations – ROBSON – June 202...Competition and Regulation in Professions and Occupations – ROBSON – June 202...
Competition and Regulation in Professions and Occupations – ROBSON – June 202...
 
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
Artificial Intelligence, Data and Competition – ČORBA – June 2024 OECD discus...
 
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
原版制作贝德福特大学毕业证(bedfordhire毕业证)硕士文凭原版一模一样
 
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
Competition and Regulation in Professions and Occupations – OECD – June 2024 ...
 
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdfBRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
BRIC_2024_2024-06-06-11:30-haunschild_archival_version.pdf
 
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
 
Why Psychological Safety Matters for Software Teams - ACE 2024 - Ben Linders.pdf
Why Psychological Safety Matters for Software Teams - ACE 2024 - Ben Linders.pdfWhy Psychological Safety Matters for Software Teams - ACE 2024 - Ben Linders.pdf
Why Psychological Safety Matters for Software Teams - ACE 2024 - Ben Linders.pdf
 
Disaster Management project for holidays homework and other uses
Disaster Management project for holidays homework and other usesDisaster Management project for holidays homework and other uses
Disaster Management project for holidays homework and other uses
 
Gregory Harris - Cycle 2 - Civics Presentation
Gregory Harris - Cycle 2 - Civics PresentationGregory Harris - Cycle 2 - Civics Presentation
Gregory Harris - Cycle 2 - Civics Presentation
 
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussionPro-competitive Industrial Policy – OECD – June 2024 OECD discussion
Pro-competitive Industrial Policy – OECD – June 2024 OECD discussion
 
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
The Intersection between Competition and Data Privacy – CAPEL – June 2024 OEC...
 
Using-Presentation-Software-to-the-Fullf.pptx
Using-Presentation-Software-to-the-Fullf.pptxUsing-Presentation-Software-to-the-Fullf.pptx
Using-Presentation-Software-to-the-Fullf.pptx
 
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
Suzanne Lagerweij - Influence Without Power - Why Empathy is Your Best Friend...
 
The Intersection between Competition and Data Privacy – COLANGELO – June 2024...
The Intersection between Competition and Data Privacy – COLANGELO – June 2024...The Intersection between Competition and Data Privacy – COLANGELO – June 2024...
The Intersection between Competition and Data Privacy – COLANGELO – June 2024...
 
Carrer goals.pptx and their importance in real life
Carrer goals.pptx  and their importance in real lifeCarrer goals.pptx  and their importance in real life
Carrer goals.pptx and their importance in real life
 
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie WellsCollapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
Collapsing Narratives: Exploring Non-Linearity • a micro report by Rosie Wells
 

ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata in the CEDAR Workbench

  • 1. Supporting Ontology-Based Standardization of Biomedical Metadata in the CEDAR Workbench Marcos Martínez-Romero Stanford University Stanford Universitymetadatacenter.org EDAR OR EXPANDED DATA ION AND RETRIEVAL CEDAR CENTER FOR EXPANDED DATA ANNOTATION AND RETRIEVAL CEDAR DAR DAR CENTER FOR EXPANDED DATA 9/14/2017
  • 2. 2
  • 3. age Age AGE `Age age (after birth) age (in years) age (y) age (year) age (years) Age (years) Age (Years) age (yr) age (yr-old) age (yrs) Age (yrs) age [y] age [year] age [years] age in years age of patient Age of patient age of subjects age(years) Age(years) Age(yrs.) Age, year age, years age, yrs age.year age_years Metadata are not standardized 3
  • 4. age Age AGE `Age age (after birth) age (in years) age (y) age (year) age (years) Age (years) Age (Years) age (yr) age (yr-old) age (yrs) Age (yrs) age [y] age [year] age [years] age in years age of patient Age of patient age of subjects age(years) Age(years) Age(yrs.) Age, year age, years age, yrs age.year age_years Metadata are not standardized Age-Years (NCIT) (http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C37908) “The length of a person's life, stated in years since birth.” 4
  • 5. It’s extremely hard to: –find experimental datasets –understand how the experiments were performed –replicate study findings Metadata are not standardized 5
  • 6. Generating standard metadata is hard • Submission formats rarely support ontology terms • No easy way of finding terms from ontologies and including them into metadata submissions 6
  • 7. 7
  • 8. Semantic ecosystem to enable the creation of high-quality metadata in biomedicine 8
  • 9. The CEDAR Workbench Template Designer Metadata Editor Template authors Metadata authors design templates Metadata Repository template fill in templates with metadata metadata Public Databases LINCS submit metadata Biomedical Ontologies 9
  • 10. Template Designer Metadata Editor Template authors Metadata authors design templates Metadata Repository template fill in templates with metadata metadata Public Databases LINCS submit metadata Biomedical Ontologies The CEDAR Workbench 10
  • 11. 11
  • 12. 12
  • 13. 13
  • 14. 14
  • 15. 15
  • 16. 16
  • 17. 17
  • 18. 18
  • 19. 19
  • 20. 20
  • 21. 21
  • 22. Template Designer Metadata Editor Template authors Metadata authors design templates Metadata Repository template fill in templates with metadata metadata Public Databases LINCS submit metadata Biomedical Ontologies The CEDAR Workbench 22
  • 23. 23
  • 24. 24
  • 25. { "@context": { "rdfs": "http://www.w3.org/2000/01/rdf-schema#", "xsd": "http://www.w3.org/2001/XMLSchema#", "pav": "http://purl.org/pav/", //... "Title": "http://purl.obolibrary.org/obo/NGS_0000055", "Disorder": "http://purl.org/net/OCRe/OCRe.owl#OCRE900086", "Institution": "http://semantic-dicom.org/dcm#InstitutionName", "Principal Investigator": "http://purl.org/net/OCRe/OCRe.owl#OCRE901006", "Study Type": "http://purl.obolibrary.org/obo/NGS_0000056" }, "@type": "http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C63536", "Title": { "@value": "A sample study" }, "Disorder": { "@id": "http://purl.obolibrary.org/obo/DOID_8986", "rdfs:label": "narcolepsy" }, "Institution": { "@value": "Stanford University" }, "Principal Investigator": { "@value": "John Doe" }, "Study Type": { "@id": "http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C15273", "rdfs:label": "Longitudinal Study" }, // ... "schema:isBasedOn": "https://repo.metadatacenter.orgx/templates/6381a0ce-3904-4885-bc44-5caacb4ad0e6", "schema:name": "Study metadata", "schema:description": "Study template", "pav:createdOn": "2017-09-05T09:50:28-0700", "pav:createdBy": "https://metadatacenter.org/users/8d787b98-33dd-4aff-a88c-440caf452c61", "pav:lastUpdatedOn": "2017-09-05T09:50:28-0700", "oslc:modifiedBy": "https://metadatacenter.org/users/8d787b98-33dd-4aff-a88c-440caf452c61", "@id": "https://repo.metadatacenter.orgx/template-instances/ffe856e7-d920-480d-a666-009041f609e3" } 25
  • 26. Value Set Creation • Lists of permissible values for fields • Example: Longitudinal study types – Prospective study – Retrospective study – Hybrid design 26
  • 27. Class Creation • Dynamically define a new class and immediately use it • Optionally link it to existing classes – Ontology maintainers may use this information to enrich their ontologies • Example: adductor dorsalis 27
  • 28. Class Creation CEDAR Provisional Classes (CEDARPC) UBERON adductor dorsalis adductor muscle subclassOf 28
  • 29. Evaluation • The LINCS Consortium – Cellular signatures • ImmPort – Immunology • AIRR Community – Datasets acquired using sequencing – Submission to NCBI BioSample • Stanford University Libraries 29
  • 30. Summary • Authoring metadata is hard and time-consuming • Authoring semantic metadata is even harder – Lack of convenient tools for linking metadata to ontologies in a metadata authoring workflow • The CEDAR Workbench facilitates metadata creation in a semantically rigorous way – Add type and property assertions – Constrain the values of fields to ontology terms – Create classes and value sets http://metadatacenter.org http://cedar.metadatacenter.net 30
  • 31. CEDAR CENTER FOR EXPANDED DATA ANNOTATION AND RETRIEVAL CEDAR CENTER FOR EXPANDED DATA ANNOTATION AND RETRIEVAL CEDAR CEDAR CEDAR I Metadata Thanks! 31