SlideShare una empresa de Scribd logo
1 de 26
An Up-to-date Knowledge Base and Focused Exploration System for Human Performance and Cognition AmitSheth LexisNexis Ohio Eminent Scholar Director, Kno.e.sis Center Wright State University RamakanthKavuluru Postdoctoral Research Scientist Kno.e.sis Center Thanks  to Dr. Victor Chan for support and guidance HPC-KB team: Christopher Thomas, Wenbo Wang, Alan Smith,  Paul Fultz, and Delroy Cameron
Focused Knowledge Bases A knowledge base (KB) functions as a standalone reference for a particular domain of interest backend for knowledge-based search, browsing, and exploration of literature
What is a KB? “A body of knowledge describing a topic or domain of interest” categories or classes – Neurotransmitters, Disease individuals (instances of classes) Dopamine, Magnesium, Migraine roles (properties/predicates) – inhibits, is a, part of assertions (triples) Dopamine is a neurotransmitter Magnesium treats Migraine
Then, What are Ontologies? “Ontology is the basic structure or armature around which a knowledge base can be built” (Swartout and Tate, 1999) “An ontology is an explicit representation of a shared understanding of the important concepts in some domain of interest.” (Kalfoglou, 2001) So, mostly static blocks of well accepted and consensual knowledge
Ontologies in Life Sciences The National Center for Biomedical Ontology (NCBO) - Open Biomedical Ontologies (OBO) About 200 ontologies and 1.5 million terms Only part_of and is_a relations in the Gene Ontology Histolysis is_a positive regulation of cell size Requests for changes are expert reviewed before modifications
What about Emergent Knowledge, Richer Relationships? New scientific results and insights published everyday backed by experimentation PubMed: 18+ million articles; 1300 new per day Also, what about other predicates besides is_a and part_of (eg., UMLS Semantic Network of 54 predicates).  Need a way of capturing and meaningfully utilizing this emerging knowledge
Enter HPC-KB NLP Patterns SCOONER
Steps in Creating the HPC-KB Carve a focused domain hierarchy out of Wikipedia Extract mentions of entities and relationships in the relevant scientific literature (Pubmed abstracts) to support non-hierarchical guidance. Map extracted entity mentions to concepts and extracted predicates to relationships to create the knowledge-base
Workflow Overview HPC keywords Doozer: Base Hierarchy from Wikipedia Focused Pattern based extraction SenseLab Neuroscience Ontologies Initial KB Creation Meta Knowledgebase PubMed Abstracts Knoesis: Parsing based NLP Triples   Enrich Knowledge Base NLM: Rule based BKR Triples Final Knowledge Base
Hierarchy Using Wikipedia Categories and Graph Structure
Triple Extraction Open Extraction  No fixed number of predetermined entities and predicates At  Knoesis – NLP (parsing and dependency trees) Supervised Extraction Predetermined set of entities and predicates At  Knoesis – Pattern based extraction to connect entities in the base hierarchy using statistical techniques At NLM – NLP and rule based approaches
Mapping of Triples to Hierarchy Entities in both subject and object must contain at least one concept from the hierarchy to be mapped to the KB Preliminary synonyms based on anchor labels and page redirects in Wikipedia Prolactostatin redirects to Dopamine Predicates  (verbs) and entities are subjected to stemming using Wordnet
HPC-KB Stats ,[object Object]
  Number of non-trivial facts  --  3 Million    Examples: NLP based: calcium-binding protein S100B modulates long-term synaptic plasticity Pattern based: Olfactory Bulb has physical part of anatomic structure Mitral cell ,[object Object],(Note: Abstracts accessible through Scooner: Oct 2010)
Full Architecture
Scooner  Features Knowledge-based browsing: Relations window, inverse relations, creating trails Persistent projects: Work bench, browsing history, comments, filtering Collaboration: comments, dashboard, exporting (sub)projects, importing projects
Comments on Scooner “The ability to browse predications together with documents will likely reduce the cognitive load required for encountering interesting facts, both for novice users and domain experts.” – Thomas Rindflesch, Researcher in Biomedical Informatics, NLM. “Being able to keep track of multiple articles is a really nice tool to have, and it cuts down on time between jumping back and forth between articles” - Anonymous comment from an AFRL researcher
Future Work: Knexpace Automatic Updates Index new abstracts as they arrive Extract relationships as new abstracts arrive Periodically update indices for abstracts and triples Other Maintenance Admin interfaces Application software support
Improving the KB Quality & Filtering Adhere to a stricter schema Having a fixed number of predicates and a fixed range and domain for each predicate Ex: For the immunology and chemical warfare agents Predicate: activates 	Restricted Domain: Cytokines OR μRNA Restricted Range: Macrophages Also useful to directly launch queries of the form  	Question:     ?x    activates    Macrophages 	Answer:        TNF-α  and IFN-γ
Normalize Entities and Predicates  How do we find  Bovine spongiform encephalopathy and mad cow disease are same   (Use UMLS Metathesaurus! ) long term memory and long lasting memory are the same computationally  (UMLS does not work) More complex similarities: HepG(2) cell line and Human Hepatoma G2 cells which textual forms map to the fixed set of predicates: Do modulates, regulates, stimulates all map to affects?(NLM  expert collaboration & AFRL help) NLM tools: MetaMap, SemRep, and other lexical tools
Provenance and other meta data Original abstract PMIDs will be captured for each triple. Other data: authors, journal names How about other meta data for filtering: In Vitro / In Vivo If, In Vivo, which organism. If Mice, which type: Ames Dwarf   Which techniques are used. ex: Flow Cytometry
New Knowledge Example VIP Peptide  – increases – Catecholamine Biosynthesis Catecholamines – induce – β-adrenergic receptor activity β-adrenergic receptors – are involved – fear conditioning VIP Peptide  – affects – fear conditioning In  Cattle In  Rats In  Humans
Domain Specific Provenance For Immunology and Warfare Agent effects Which pathogen:  Francisellatularensis Which Strains: U112, LVS, … What is measured: Cells (dendritic, macrophages, monocytes), Proteins, cytokines, chemokines Need to find preexisting taxonomies for organisms, techniques, pathogens; or need to build some and integrate The more the specificity, the more the KB quality
Better Ranking of Abstracts and Triples Use search phrases to fine tune ranking of abstracts and triples Just because the user clicked on neurogenesis, does not mean she wants to know everything about it. Predict  which triples or abstracts the user might be more interested in the current search session. Show top-k entities in the result set to facilitate filtering based on top concepts Visualize networks of a set of browsed triples
Semantic Integration Famous OBO Ontologies (total 7 foundry ontologies) Gene Ontology PRotein Ontology Other domain specific OBO candidateontologies NCBI organismal classification Infectious Disease Human Disease (Tularemia found here, and also in SNOMED, & others too, what to choose for mapping?) Pathogen transmission
Semantic Integration, contd… Linked Open Drug Data DrugBank: drugs and drug targets (pathways, structures, sequences) Diseasome: disorders and disease-gene associations LinkedCT: Clinical trials Mendelian Inheritance Online MendelianInhertance in Animals (OMIA) Online Mendelian Inheritance in Man (OMIM) Different formats: owl, obo, rrf

Más contenido relacionado

La actualidad más candente

BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)Sobia
 
Text-mining-based retrieval of protein networks
Text-mining-based retrieval of protein networksText-mining-based retrieval of protein networks
Text-mining-based retrieval of protein networksLars Juhl Jensen
 
Arcomem training Topic Analysis Models advanced
Arcomem training Topic Analysis Models advancedArcomem training Topic Analysis Models advanced
Arcomem training Topic Analysis Models advancedarcomem
 
The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework Neuroscience Information Framework
 
Cartic Ramakrishnan's dissertation defense
Cartic Ramakrishnan's dissertation defenseCartic Ramakrishnan's dissertation defense
Cartic Ramakrishnan's dissertation defenseCartic Ramakrishnan
 
The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...Neuroscience Information Framework
 
Sequencedatabases
SequencedatabasesSequencedatabases
SequencedatabasesAbhik Seal
 
Data retreival system
Data retreival systemData retreival system
Data retreival systemShikha Thakur
 
A Survey on Bioinformatics Tools
A Survey on Bioinformatics ToolsA Survey on Bioinformatics Tools
A Survey on Bioinformatics Toolsidescitation
 
Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbgetSurendraKumar338
 
2016 Bio-IT World Cell Line Coordination 2016-04-06v1
2016 Bio-IT World Cell Line Coordination 2016-04-06v12016 Bio-IT World Cell Line Coordination 2016-04-06v1
2016 Bio-IT World Cell Line Coordination 2016-04-06v1Bruce Kozuma
 
bioinfomatics
bioinfomaticsbioinfomatics
bioinfomaticsnguyenpg
 
Drug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge GraphsDrug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge GraphsDatabricks
 
The Neuroscience Information Framework: Making Resources Discoverable for the...
The Neuroscience Information Framework: Making Resources Discoverable for the...The Neuroscience Information Framework: Making Resources Discoverable for the...
The Neuroscience Information Framework: Making Resources Discoverable for the...Neuroscience Information Framework
 
Data Mining in Rediology reports
Data Mining in Rediology reportsData Mining in Rediology reports
Data Mining in Rediology reportsSaeed Mehrabi
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003robertstevens65
 

La actualidad más candente (20)

BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)
 
Text-mining-based retrieval of protein networks
Text-mining-based retrieval of protein networksText-mining-based retrieval of protein networks
Text-mining-based retrieval of protein networks
 
Arcomem training Topic Analysis Models advanced
Arcomem training Topic Analysis Models advancedArcomem training Topic Analysis Models advanced
Arcomem training Topic Analysis Models advanced
 
The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework
 
Cartic Ramakrishnan's dissertation defense
Cartic Ramakrishnan's dissertation defenseCartic Ramakrishnan's dissertation defense
Cartic Ramakrishnan's dissertation defense
 
Data retrieval tools
Data retrieval toolsData retrieval tools
Data retrieval tools
 
The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...The Neuroscience Information Framework: Establishing a practical semantic fra...
The Neuroscience Information Framework: Establishing a practical semantic fra...
 
Sequencedatabases
SequencedatabasesSequencedatabases
Sequencedatabases
 
Data retrieval
Data retrievalData retrieval
Data retrieval
 
Data retreival system
Data retreival systemData retreival system
Data retreival system
 
A Survey on Bioinformatics Tools
A Survey on Bioinformatics ToolsA Survey on Bioinformatics Tools
A Survey on Bioinformatics Tools
 
Ontology at Manchester
Ontology at ManchesterOntology at Manchester
Ontology at Manchester
 
Data retriveal ,srg and dbget
Data retriveal ,srg and dbgetData retriveal ,srg and dbget
Data retriveal ,srg and dbget
 
2016 Bio-IT World Cell Line Coordination 2016-04-06v1
2016 Bio-IT World Cell Line Coordination 2016-04-06v12016 Bio-IT World Cell Line Coordination 2016-04-06v1
2016 Bio-IT World Cell Line Coordination 2016-04-06v1
 
bioinfomatics
bioinfomaticsbioinfomatics
bioinfomatics
 
Drug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge GraphsDrug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge Graphs
 
The Neuroscience Information Framework: Making Resources Discoverable for the...
The Neuroscience Information Framework: Making Resources Discoverable for the...The Neuroscience Information Framework: Making Resources Discoverable for the...
The Neuroscience Information Framework: Making Resources Discoverable for the...
 
Prosite
PrositeProsite
Prosite
 
Data Mining in Rediology reports
Data Mining in Rediology reportsData Mining in Rediology reports
Data Mining in Rediology reports
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003
 

Destacado (7)

Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...
Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...
Knowledge-driven Personalized Contextual mHealth Service for Asthma Managemen...
 
Trust Management: A Tutorial
Trust Management: A TutorialTrust Management: A Tutorial
Trust Management: A Tutorial
 
Web and Complex Systems Lab @ Kno.e.sis
Web and Complex Systems Lab @ Kno.e.sisWeb and Complex Systems Lab @ Kno.e.sis
Web and Complex Systems Lab @ Kno.e.sis
 
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
 
2015 Kno.e.sis Center Annual Review
2015 Kno.e.sis Center Annual Review2015 Kno.e.sis Center Annual Review
2015 Kno.e.sis Center Annual Review
 
What's up at Kno.e.sis?
What's up at Kno.e.sis? What's up at Kno.e.sis?
What's up at Kno.e.sis?
 
Smart Data - How you and I will exploit Big Data for personalized digital hea...
Smart Data - How you and I will exploit Big Data for personalized digital hea...Smart Data - How you and I will exploit Big Data for personalized digital hea...
Smart Data - How you and I will exploit Big Data for personalized digital hea...
 

Similar a A knowledge capture framework for domain specific search systems

The real world of ontologies and phenotype representation: perspectives from...
The real world of ontologies and phenotype representation:  perspectives from...The real world of ontologies and phenotype representation:  perspectives from...
The real world of ontologies and phenotype representation: perspectives from...Maryann Martone
 
Knowledge graph construction for research & medicine
Knowledge graph construction for research & medicineKnowledge graph construction for research & medicine
Knowledge graph construction for research & medicinePaul Groth
 
Clinical Epidemiology - Systematic PubMed Searching Workshop
Clinical Epidemiology - Systematic PubMed Searching WorkshopClinical Epidemiology - Systematic PubMed Searching Workshop
Clinical Epidemiology - Systematic PubMed Searching WorkshopRobin Featherstone
 
Diversity and Depth: Implementing AI across many long tail domains
Diversity and Depth: Implementing AI across many long tail domainsDiversity and Depth: Implementing AI across many long tail domains
Diversity and Depth: Implementing AI across many long tail domainsPaul Groth
 
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...Neuroscience Information Framework
 
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...Neuroscience Information Framework
 
How do we know what we don't know?  Exploring the data and knowledge space th...
How do we know what we don't know?  Exploring the data and knowledge space th...How do we know what we don't know?  Exploring the data and knowledge space th...
How do we know what we don't know?  Exploring the data and knowledge space th...Maryann Martone
 
Biomedical literature mining
Biomedical literature miningBiomedical literature mining
Biomedical literature miningLars Juhl Jensen
 
How do we know what we don’t know: Using the Neuroscience Information Framew...
How do we know what we don’t know:  Using the Neuroscience Information Framew...How do we know what we don’t know:  Using the Neuroscience Information Framew...
How do we know what we don’t know: Using the Neuroscience Information Framew...Maryann Martone
 
Biological literature mining - from information retrieval to biological disco...
Biological literature mining - from information retrieval to biological disco...Biological literature mining - from information retrieval to biological disco...
Biological literature mining - from information retrieval to biological disco...Lars Juhl Jensen
 
NAISTビッグデータシンポジウム - 情報 松本先生
NAISTビッグデータシンポジウム - 情報 松本先生NAISTビッグデータシンポジウム - 情報 松本先生
NAISTビッグデータシンポジウム - 情報 松本先生ysuzuki-naist
 
ShortStory_bioCaster.pptx
ShortStory_bioCaster.pptxShortStory_bioCaster.pptx
ShortStory_bioCaster.pptxDeviPriyaRavi2
 
Applying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domainApplying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domainAngelo Salatino
 
Dynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical CommunicationsDynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical CommunicationsTim Clark
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsAyeshaYousaf20
 
Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...Anubis Hosein
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemMaryann Martone
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?Maryann Martone
 

Similar a A knowledge capture framework for domain specific search systems (20)

The real world of ontologies and phenotype representation: perspectives from...
The real world of ontologies and phenotype representation:  perspectives from...The real world of ontologies and phenotype representation:  perspectives from...
The real world of ontologies and phenotype representation: perspectives from...
 
Knowledge graph construction for research & medicine
Knowledge graph construction for research & medicineKnowledge graph construction for research & medicine
Knowledge graph construction for research & medicine
 
Clinical Epidemiology - Systematic PubMed Searching Workshop
Clinical Epidemiology - Systematic PubMed Searching WorkshopClinical Epidemiology - Systematic PubMed Searching Workshop
Clinical Epidemiology - Systematic PubMed Searching Workshop
 
Diversity and Depth: Implementing AI across many long tail domains
Diversity and Depth: Implementing AI across many long tail domainsDiversity and Depth: Implementing AI across many long tail domains
Diversity and Depth: Implementing AI across many long tail domains
 
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...
 
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
 
How do we know what we don't know?  Exploring the data and knowledge space th...
How do we know what we don't know?  Exploring the data and knowledge space th...How do we know what we don't know?  Exploring the data and knowledge space th...
How do we know what we don't know?  Exploring the data and knowledge space th...
 
PhDc exam presentation
PhDc exam presentationPhDc exam presentation
PhDc exam presentation
 
Biomedical literature mining
Biomedical literature miningBiomedical literature mining
Biomedical literature mining
 
How do we know what we don’t know: Using the Neuroscience Information Framew...
How do we know what we don’t know:  Using the Neuroscience Information Framew...How do we know what we don’t know:  Using the Neuroscience Information Framew...
How do we know what we don’t know: Using the Neuroscience Information Framew...
 
Xerox2009
Xerox2009Xerox2009
Xerox2009
 
Biological literature mining - from information retrieval to biological disco...
Biological literature mining - from information retrieval to biological disco...Biological literature mining - from information retrieval to biological disco...
Biological literature mining - from information retrieval to biological disco...
 
NAISTビッグデータシンポジウム - 情報 松本先生
NAISTビッグデータシンポジウム - 情報 松本先生NAISTビッグデータシンポジウム - 情報 松本先生
NAISTビッグデータシンポジウム - 情報 松本先生
 
ShortStory_bioCaster.pptx
ShortStory_bioCaster.pptxShortStory_bioCaster.pptx
ShortStory_bioCaster.pptx
 
Applying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domainApplying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domain
 
Dynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical CommunicationsDynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical Communications
 
BioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomicsBioInformatics Tools -Genomics , Proteomics and metablomics
BioInformatics Tools -Genomics , Proteomics and metablomics
 
Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...Una estrategia para la integración de ontologías, servicios web y PLN en el a...
Una estrategia para la integración de ontologías, servicios web y PLN en el a...
 
Data-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystemData-knowledge transition zones within the biomedical research ecosystem
Data-knowledge transition zones within the biomedical research ecosystem
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?
 

Último

Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Pooja Bhuva
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxJisc
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxDr. Ravikiran H M Gowda
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxCeline George
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfSherif Taha
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...Nguyen Thanh Tu Collection
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentationcamerronhm
 

Último (20)

Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 

A knowledge capture framework for domain specific search systems

  • 1. An Up-to-date Knowledge Base and Focused Exploration System for Human Performance and Cognition AmitSheth LexisNexis Ohio Eminent Scholar Director, Kno.e.sis Center Wright State University RamakanthKavuluru Postdoctoral Research Scientist Kno.e.sis Center Thanks to Dr. Victor Chan for support and guidance HPC-KB team: Christopher Thomas, Wenbo Wang, Alan Smith, Paul Fultz, and Delroy Cameron
  • 2. Focused Knowledge Bases A knowledge base (KB) functions as a standalone reference for a particular domain of interest backend for knowledge-based search, browsing, and exploration of literature
  • 3. What is a KB? “A body of knowledge describing a topic or domain of interest” categories or classes – Neurotransmitters, Disease individuals (instances of classes) Dopamine, Magnesium, Migraine roles (properties/predicates) – inhibits, is a, part of assertions (triples) Dopamine is a neurotransmitter Magnesium treats Migraine
  • 4. Then, What are Ontologies? “Ontology is the basic structure or armature around which a knowledge base can be built” (Swartout and Tate, 1999) “An ontology is an explicit representation of a shared understanding of the important concepts in some domain of interest.” (Kalfoglou, 2001) So, mostly static blocks of well accepted and consensual knowledge
  • 5. Ontologies in Life Sciences The National Center for Biomedical Ontology (NCBO) - Open Biomedical Ontologies (OBO) About 200 ontologies and 1.5 million terms Only part_of and is_a relations in the Gene Ontology Histolysis is_a positive regulation of cell size Requests for changes are expert reviewed before modifications
  • 6. What about Emergent Knowledge, Richer Relationships? New scientific results and insights published everyday backed by experimentation PubMed: 18+ million articles; 1300 new per day Also, what about other predicates besides is_a and part_of (eg., UMLS Semantic Network of 54 predicates). Need a way of capturing and meaningfully utilizing this emerging knowledge
  • 7. Enter HPC-KB NLP Patterns SCOONER
  • 8. Steps in Creating the HPC-KB Carve a focused domain hierarchy out of Wikipedia Extract mentions of entities and relationships in the relevant scientific literature (Pubmed abstracts) to support non-hierarchical guidance. Map extracted entity mentions to concepts and extracted predicates to relationships to create the knowledge-base
  • 9. Workflow Overview HPC keywords Doozer: Base Hierarchy from Wikipedia Focused Pattern based extraction SenseLab Neuroscience Ontologies Initial KB Creation Meta Knowledgebase PubMed Abstracts Knoesis: Parsing based NLP Triples Enrich Knowledge Base NLM: Rule based BKR Triples Final Knowledge Base
  • 10. Hierarchy Using Wikipedia Categories and Graph Structure
  • 11. Triple Extraction Open Extraction No fixed number of predetermined entities and predicates At Knoesis – NLP (parsing and dependency trees) Supervised Extraction Predetermined set of entities and predicates At Knoesis – Pattern based extraction to connect entities in the base hierarchy using statistical techniques At NLM – NLP and rule based approaches
  • 12. Mapping of Triples to Hierarchy Entities in both subject and object must contain at least one concept from the hierarchy to be mapped to the KB Preliminary synonyms based on anchor labels and page redirects in Wikipedia Prolactostatin redirects to Dopamine Predicates (verbs) and entities are subjected to stemming using Wordnet
  • 13.
  • 14.
  • 16. Scooner Features Knowledge-based browsing: Relations window, inverse relations, creating trails Persistent projects: Work bench, browsing history, comments, filtering Collaboration: comments, dashboard, exporting (sub)projects, importing projects
  • 17. Comments on Scooner “The ability to browse predications together with documents will likely reduce the cognitive load required for encountering interesting facts, both for novice users and domain experts.” – Thomas Rindflesch, Researcher in Biomedical Informatics, NLM. “Being able to keep track of multiple articles is a really nice tool to have, and it cuts down on time between jumping back and forth between articles” - Anonymous comment from an AFRL researcher
  • 18. Future Work: Knexpace Automatic Updates Index new abstracts as they arrive Extract relationships as new abstracts arrive Periodically update indices for abstracts and triples Other Maintenance Admin interfaces Application software support
  • 19. Improving the KB Quality & Filtering Adhere to a stricter schema Having a fixed number of predicates and a fixed range and domain for each predicate Ex: For the immunology and chemical warfare agents Predicate: activates Restricted Domain: Cytokines OR μRNA Restricted Range: Macrophages Also useful to directly launch queries of the form Question: ?x activates Macrophages Answer: TNF-α and IFN-γ
  • 20. Normalize Entities and Predicates How do we find Bovine spongiform encephalopathy and mad cow disease are same (Use UMLS Metathesaurus! ) long term memory and long lasting memory are the same computationally (UMLS does not work) More complex similarities: HepG(2) cell line and Human Hepatoma G2 cells which textual forms map to the fixed set of predicates: Do modulates, regulates, stimulates all map to affects?(NLM expert collaboration & AFRL help) NLM tools: MetaMap, SemRep, and other lexical tools
  • 21. Provenance and other meta data Original abstract PMIDs will be captured for each triple. Other data: authors, journal names How about other meta data for filtering: In Vitro / In Vivo If, In Vivo, which organism. If Mice, which type: Ames Dwarf Which techniques are used. ex: Flow Cytometry
  • 22. New Knowledge Example VIP Peptide – increases – Catecholamine Biosynthesis Catecholamines – induce – β-adrenergic receptor activity β-adrenergic receptors – are involved – fear conditioning VIP Peptide – affects – fear conditioning In Cattle In Rats In Humans
  • 23. Domain Specific Provenance For Immunology and Warfare Agent effects Which pathogen: Francisellatularensis Which Strains: U112, LVS, … What is measured: Cells (dendritic, macrophages, monocytes), Proteins, cytokines, chemokines Need to find preexisting taxonomies for organisms, techniques, pathogens; or need to build some and integrate The more the specificity, the more the KB quality
  • 24. Better Ranking of Abstracts and Triples Use search phrases to fine tune ranking of abstracts and triples Just because the user clicked on neurogenesis, does not mean she wants to know everything about it. Predict which triples or abstracts the user might be more interested in the current search session. Show top-k entities in the result set to facilitate filtering based on top concepts Visualize networks of a set of browsed triples
  • 25. Semantic Integration Famous OBO Ontologies (total 7 foundry ontologies) Gene Ontology PRotein Ontology Other domain specific OBO candidateontologies NCBI organismal classification Infectious Disease Human Disease (Tularemia found here, and also in SNOMED, & others too, what to choose for mapping?) Pathogen transmission
  • 26. Semantic Integration, contd… Linked Open Drug Data DrugBank: drugs and drug targets (pathways, structures, sequences) Diseasome: disorders and disease-gene associations LinkedCT: Clinical trials Mendelian Inheritance Online MendelianInhertance in Animals (OMIA) Online Mendelian Inheritance in Man (OMIM) Different formats: owl, obo, rrf

Notas del editor

  1. Just say that knowledge bases are useful for two specific purposes.In the next slide talk about what exactly is a KB, what does it look like.
  2. Emphasize the predicates are primary indicators of new knowledge.People are aware of concepts, but what is interesting is which predicates (if any) connect those concepts
  3. The edges here show part_of and is_a relations: neuroscience is a subclass of science, while brain is part of the general area of cognitive science.So we do not model a strict ontology here.
  4. Ncbi – national center for biotechnology informationObo – open biomedical ontologiesNcbo – natinal center for biomedical ontologySNOMED - Systematized Nomenclature of Medicine
  5. Rrf – rich release format