SlideShare una empresa de Scribd logo
1 de 16
Descargar para leer sin conexión
Semantic Annotation of Scientific Articles DC-2009 "Semantic Interoperability of Linked Data" Sudeshna Das  1,2  & Tim Clark  1,2 sudeshna_das@harvard.edu,  [email_address]   1 MIND, Massachusetts General Hospital 2 Harvard Medical School
Alzforum: The Pioneer in Biomedical Web Communities
Problem Statement Shared terminology Linked open data sources Reusable software Web 3.0
What is the SCF? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
SCF Overview
PDOnline Alzforum Pain StemBook SCF Toolkit
Semantic Annotation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Search for   beta-amyloid Retrieve content with “ abeta”, “ amyloid-beta”, “ A β ”, “Ab1-40”, “Ab1-42”, . . . Semantic Search
Search for   “ BACE1” Retrieve content with “ beta secretase”, “ beta-site APP cleaving enzyme”, “ membrane-associated aspartic protease”, etc. . . . Semantic Search
Search for   “ BACE1” Associate database content
Enabling semantic annotation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],High recall High precision
 
 
 
Powerful search across communities
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Team

Más contenido relacionado

La actualidad más candente

Automatic Metadata Generation Charles Duncan
Automatic Metadata Generation Charles DuncanAutomatic Metadata Generation Charles Duncan
Automatic Metadata Generation Charles DuncanJISC CETIS
 
Zmasek TOPSAN Biohackathon 2011
Zmasek TOPSAN Biohackathon 2011Zmasek TOPSAN Biohackathon 2011
Zmasek TOPSAN Biohackathon 2011cmzmasek
 
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...marcosmartinezromero
 
BibBase Linked Data Triplification Challenge 2010 Presentation
BibBase Linked Data Triplification Challenge 2010 PresentationBibBase Linked Data Triplification Challenge 2010 Presentation
BibBase Linked Data Triplification Challenge 2010 PresentationReynold Xin
 
Biostatistics & Bioinformatics
Biostatistics & BioinformaticsBiostatistics & Bioinformatics
Biostatistics & Bioinformaticsgumccomm
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked DataMichel Dumontier
 
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...Merce Crosas
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMichel Dumontier
 
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus PosterNIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus PosterGlobus
 
BioAssay Research Database Presentation at the Chem Axon UGM 2013
BioAssay Research Database Presentation at the Chem Axon UGM 2013BioAssay Research Database Presentation at the Chem Axon UGM 2013
BioAssay Research Database Presentation at the Chem Axon UGM 2013Andrea de Souza
 
Dynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical CommunicationsDynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical CommunicationsTim Clark
 
Why should researchers care about data curation?
Why should researchers care about data curation?Why should researchers care about data curation?
Why should researchers care about data curation?Varsha Khodiyar
 
The HCLS Community Profile: Describing Datasets, Versions, and Distributions
The HCLS Community Profile: Describing Datasets, Versions, and DistributionsThe HCLS Community Profile: Describing Datasets, Versions, and Distributions
The HCLS Community Profile: Describing Datasets, Versions, and DistributionsAlasdair Gray
 

La actualidad más candente (20)

Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
Embracing Semantic Technology for Better Metadata Authoring in Biomedicine (S...
 
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
Metadata in the BioSample Online Repository are Impaired by Numerous Anomalie...
 
Chemspider Presentation at the ACS Meeting in New orleans
Chemspider Presentation at the ACS Meeting in New orleansChemspider Presentation at the ACS Meeting in New orleans
Chemspider Presentation at the ACS Meeting in New orleans
 
Automatic Metadata Generation Charles Duncan
Automatic Metadata Generation Charles DuncanAutomatic Metadata Generation Charles Duncan
Automatic Metadata Generation Charles Duncan
 
An Open Repository Model for Acquiring Knowledge About Scientific Experiments
An Open Repository Model for Acquiring Knowledge About Scientific ExperimentsAn Open Repository Model for Acquiring Knowledge About Scientific Experiments
An Open Repository Model for Acquiring Knowledge About Scientific Experiments
 
Zmasek TOPSAN Biohackathon 2011
Zmasek TOPSAN Biohackathon 2011Zmasek TOPSAN Biohackathon 2011
Zmasek TOPSAN Biohackathon 2011
 
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...
ICBO2017 - Supporting Ontology-Based Standardization of Biomedical Metadata i...
 
BibBase Linked Data Triplification Challenge 2010 Presentation
BibBase Linked Data Triplification Challenge 2010 PresentationBibBase Linked Data Triplification Challenge 2010 Presentation
BibBase Linked Data Triplification Challenge 2010 Presentation
 
Biostatistics & Bioinformatics
Biostatistics & BioinformaticsBiostatistics & Bioinformatics
Biostatistics & Bioinformatics
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
 
Saic aqua
Saic aquaSaic aqua
Saic aqua
 
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
 
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus PosterNIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
NIH NCI Childhood Cancer Data Initiative (CCDI) Symposium Globus Poster
 
Pub med
Pub medPub med
Pub med
 
BioAssay Research Database Presentation at the Chem Axon UGM 2013
BioAssay Research Database Presentation at the Chem Axon UGM 2013BioAssay Research Database Presentation at the Chem Axon UGM 2013
BioAssay Research Database Presentation at the Chem Axon UGM 2013
 
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
 
Dynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical CommunicationsDynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical Communications
 
Why should researchers care about data curation?
Why should researchers care about data curation?Why should researchers care about data curation?
Why should researchers care about data curation?
 
The HCLS Community Profile: Describing Datasets, Versions, and Distributions
The HCLS Community Profile: Describing Datasets, Versions, and DistributionsThe HCLS Community Profile: Describing Datasets, Versions, and Distributions
The HCLS Community Profile: Describing Datasets, Versions, and Distributions
 

Destacado

Semantic Annotation - Ontobras 2015
Semantic Annotation - Ontobras 2015Semantic Annotation - Ontobras 2015
Semantic Annotation - Ontobras 2015Newton Calegari
 
Semantic Annotation of the Cyttron Database
Semantic Annotation of the Cyttron DatabaseSemantic Annotation of the Cyttron Database
Semantic Annotation of the Cyttron DatabaseDavid Graus
 
Semantic annotation, clustering and visualization
Semantic annotation, clustering and visualizationSemantic annotation, clustering and visualization
Semantic annotation, clustering and visualizationDavid Graus
 
Semantic Annotation of Documents
Semantic Annotation of DocumentsSemantic Annotation of Documents
Semantic Annotation of Documentssubash chandra
 
Semantic annotation with Pundit: Enriching the Web of Science
Semantic annotation with Pundit: Enriching the Web of ScienceSemantic annotation with Pundit: Enriching the Web of Science
Semantic annotation with Pundit: Enriching the Web of ScienceFrancesca Di Donato
 
IRE Semantic Annotation of Documents
IRE Semantic Annotation of Documents IRE Semantic Annotation of Documents
IRE Semantic Annotation of Documents Sharvil Katariya
 

Destacado (6)

Semantic Annotation - Ontobras 2015
Semantic Annotation - Ontobras 2015Semantic Annotation - Ontobras 2015
Semantic Annotation - Ontobras 2015
 
Semantic Annotation of the Cyttron Database
Semantic Annotation of the Cyttron DatabaseSemantic Annotation of the Cyttron Database
Semantic Annotation of the Cyttron Database
 
Semantic annotation, clustering and visualization
Semantic annotation, clustering and visualizationSemantic annotation, clustering and visualization
Semantic annotation, clustering and visualization
 
Semantic Annotation of Documents
Semantic Annotation of DocumentsSemantic Annotation of Documents
Semantic Annotation of Documents
 
Semantic annotation with Pundit: Enriching the Web of Science
Semantic annotation with Pundit: Enriching the Web of ScienceSemantic annotation with Pundit: Enriching the Web of Science
Semantic annotation with Pundit: Enriching the Web of Science
 
IRE Semantic Annotation of Documents
IRE Semantic Annotation of Documents IRE Semantic Annotation of Documents
IRE Semantic Annotation of Documents
 

Similar a Semantic Annotation Dc 2009

Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.Laurent Alquier
 
Asis&t webinar people directories access innovations
Asis&t webinar people directories access innovationsAsis&t webinar people directories access innovations
Asis&t webinar people directories access innovationsBert Carelli
 
UniProt and the Semantic Web
UniProt and the Semantic WebUniProt and the Semantic Web
UniProt and the Semantic WebChimezie Ogbuji
 
Cornell20080516
Cornell20080516Cornell20080516
Cornell20080516charper
 
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...Artificial Intelligence Institute at UofSC
 
A knowledge capture framework for domain specific search systems
A knowledge capture framework for domain specific search systemsA knowledge capture framework for domain specific search systems
A knowledge capture framework for domain specific search systemsramakanz
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Stuart Chalk
 
database retrival.pdf
database retrival.pdfdatabase retrival.pdf
database retrival.pdfSrimathideviJ
 
Using Taxonomies to Create People Directories and Author Networks
Using Taxonomies to Create People Directories and Author Networks Using Taxonomies to Create People Directories and Author Networks
Using Taxonomies to Create People Directories and Author Networks Access Innovations, Inc.
 
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...Susanna-Assunta Sansone
 
Archives Hub - Data in :: Data out
Archives Hub - Data in :: Data outArchives Hub - Data in :: Data out
Archives Hub - Data in :: Data outJane Stevenson
 
CHEM281 2012
CHEM281 2012CHEM281 2012
CHEM281 2012jda90
 
Inteligent Catalogue Final
Inteligent Catalogue FinalInteligent Catalogue Final
Inteligent Catalogue Finalguestcaef1d
 
Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbgetSurendraKumar338
 

Similar a Semantic Annotation Dc 2009 (20)

blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.Exploration of a Data Landscape using a Collaborative Linked Data Framework.
Exploration of a Data Landscape using a Collaborative Linked Data Framework.
 
Asis&t webinar people directories access innovations
Asis&t webinar people directories access innovationsAsis&t webinar people directories access innovations
Asis&t webinar people directories access innovations
 
UniProt and the Semantic Web
UniProt and the Semantic WebUniProt and the Semantic Web
UniProt and the Semantic Web
 
Cornell20080516
Cornell20080516Cornell20080516
Cornell20080516
 
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
An Up-to-date Knowledge Base and Focused Exploration System for Human Perform...
 
A knowledge capture framework for domain specific search systems
A knowledge capture framework for domain specific search systemsA knowledge capture framework for domain specific search systems
A knowledge capture framework for domain specific search systems
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
 
Locus link
Locus linkLocus link
Locus link
 
database retrival.pdf
database retrival.pdfdatabase retrival.pdf
database retrival.pdf
 
Using Taxonomies to Create People Directories and Author Networks
Using Taxonomies to Create People Directories and Author Networks Using Taxonomies to Create People Directories and Author Networks
Using Taxonomies to Create People Directories and Author Networks
 
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
NPG Scientific Data; SSP, Boston, May 2014: http://www.sspnet.org/events/annu...
 
Archives Hub - Data in :: Data out
Archives Hub - Data in :: Data outArchives Hub - Data in :: Data out
Archives Hub - Data in :: Data out
 
Whitney Symposium Lecture June 2008
Whitney Symposium Lecture June 2008Whitney Symposium Lecture June 2008
Whitney Symposium Lecture June 2008
 
Ibn Sina
Ibn SinaIbn Sina
Ibn Sina
 
Navigating the Neuroscience Data Landscape
Navigating the Neuroscience Data LandscapeNavigating the Neuroscience Data Landscape
Navigating the Neuroscience Data Landscape
 
CHEM281 2012
CHEM281 2012CHEM281 2012
CHEM281 2012
 
Inteligent Catalogue Final
Inteligent Catalogue FinalInteligent Catalogue Final
Inteligent Catalogue Final
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbget
 

Semantic Annotation Dc 2009

  • 1. Semantic Annotation of Scientific Articles DC-2009 "Semantic Interoperability of Linked Data" Sudeshna Das 1,2 & Tim Clark 1,2 sudeshna_das@harvard.edu, [email_address] 1 MIND, Massachusetts General Hospital 2 Harvard Medical School
  • 2. Alzforum: The Pioneer in Biomedical Web Communities
  • 3. Problem Statement Shared terminology Linked open data sources Reusable software Web 3.0
  • 4.
  • 6. PDOnline Alzforum Pain StemBook SCF Toolkit
  • 7.
  • 8. Search for beta-amyloid Retrieve content with “ abeta”, “ amyloid-beta”, “ A β ”, “Ab1-40”, “Ab1-42”, . . . Semantic Search
  • 9. Search for “ BACE1” Retrieve content with “ beta secretase”, “ beta-site APP cleaving enzyme”, “ membrane-associated aspartic protease”, etc. . . . Semantic Search
  • 10. Search for “ BACE1” Associate database content
  • 11.
  • 12.  
  • 13.  
  • 14.  
  • 15. Powerful search across communities
  • 16.

Notas del editor

  1. With the advent of Web 2.0, social networking sites have become very common and esp students spend increasing amounts of time on networking sites such as Facebook, Orkut & MySpace. Even within the science & education community, researchers are discussing findings and networking within scientific social communities. This is the home page of Alzforum, one of the oldest of such communities. It has over 4000 registered Alzheimers researchers networking to find a cure for the neurological disorder Alzheimers. Alzforum became very popular and is known as CNN for AD researchers. In fact it became necessary to clone the site. Alzourm was developed 10 years ago and features were added over time making it difficult to replicate the platform.
  2. To meet these needs we developed SCF – Science Collaboration Framework. SCF can be used to replicate Alzforum like communities. It is based on Drupal – an open source CMS. Contains Integrated Collaborative tools Web 2.0. One of our key contribution was to adapt Drupal to The Semantic Web. Thus we can leverage existing linked data and ontologies/vocabularies. SCF ssc are Interoperable with other SCF or Semantic Web communities. And finally provides powerful “semantic search” capabilities
  3. Our pilot project was StemBook - an online review of Stem Cell Biology for Stem Cell researchers. Then we took advantage of features in StemBook and developed PDOnline – a site for Parkinson’s researchers. Alzforum has come a full circle and is re-developing their site on SCF. A site for neuropathic pain and other sites are in planning stages. The idea is that every site contributes features to the SCF toolkit as well as reuses existing ones. And we hope to achieve asymptotic convergence.
  4. To link and integrate these communities developed with SCF, we annotate the content of the communities with ontologies, controlled vocabularies and linked data. The articles and comments on the community site are tagged with resources that have stable URIs or terms from controlled vocabularies. The tags have meaning and other details such as provenance and status are also captured. The details of the semantic annotation ontology can be found at our website swan.mindinformatics.org
  5. Suppose a document discusses the gene amyloid beta. We annotate the document with the gene resource “AB”, not just the term “AB”. The resource information is obtained from a SPARQL endpoint provided by Science Commons that contains the gene synonyms are other information. Thus, search using any of these terms returns the document
  6. Another example search for BACE1 returns document annotated wih Beta secretase, beta-site AP cleaving enzyme and so on…
  7. In principle, search for BACE1 could also bring up the structure for BACE1. This feature has not yet been implemented
  8. Such searches are made possible by semantic annotation of site content. And semantic annotation is facilitated by semi-automatic text mining. Text-mining algorithms suggest terms for annotation and then the editor of the community sites manually review those, prior to attaching the annotation to the document. Currently we mine documents for genes names, gene ontology terms, tissue cell types etc.
  9. Screen shot of SCF annotation editor. The editor facilitates the manual review process. The terms identified by the algorithm are highlighted and any term can be accepted, changed or deleted.
  10. So to recap “algorithm finds core terms”
  11. Relationships to other entities are established automatically. The gene points to the protein, the protein to the antibody and so on
  12. Thus powerful searches across communities are established