SlideShare a Scribd company logo
1 of 27
The Role of Openness in Creating a Mind for Life
Open Source, AI, & Biology An AI breakthrough can come from an application in biology It is imperative that this be open source Some steps toward (and questions about) creating an open source AI for understanding life
The first artificial mind will think about molecular biology “You can’t think about thinking without thinking about thinking about something.” Seymour Papert, 1974 “A thorough study of Human Physiology is, in itself, an education broader and more comprehensive than much that passes under that name. There is no side of the intellect which it does not call into play, no region of human knowledge into which either its roots, or its branches, do not extend.” Thomas Huxley,1893
Why AI hasn’t succeeded (yet) People know a lot about the world implicitly  Conversing with a partnerwho doesn’t know these basic things is very frustrating 50 years of failing to capture this “common sense” information computationally suggests: Lack of explicit enumeration makes capture very expensive (encyclopedias don’t have it!)  Still no idea of the extent of this knowledge
People don’t have implicit knowledge of molecular biology ,[object Object],Textbooks Scientific publications Databases (e.g. NCBI) Experiments done in one’s own lab ,[object Object],[object Object]
X J.J. Hornberg et al. / BioSystems 83 (2006) 81–90 Homeostatic networks foil single markers and drugs outcome target
Networks change through time Mjolsness, Sharp, Reinitz,  A Connectionist Model of Development J. Theoretical Bio 1991
Understanding the data “We are close to having a $1,000 genome sequence, but this may be accompanied by a $1,000,000 interpretation.”	- Bruce Korf, president American College of Medical Genetics Not only is the cost of sequencing essentially free, but big computers and big storage are cheap, too.  What will keep us busy for the next 50 years is understanding the data”	- Russ Altman, chair of Biomedical Engineering at Stanford
The Hard Problem Given a set of genomic regions, variants, gene products, and/or concentrations empirically involved in a defined phenotype… Produce: An explanation of the reasons that those genomic regions / variants / products / concentrations are (or are not) relevant to the phenotype Evidence to support the explanation(s) Alternative explanations Reasons to prefer one explanation over another
Answering Why? questions Fundamental to human cognitive development Amazing human facility Even to confabulation Causal explanation is central to science The only question “big data”doesn’t seem to be enoughto answer (cfRamachandran & Hovy, 2002)
Abductive inference “However man may have acquired his faculty of divining the ways of Nature, it has certainly not been by a self-controlled and critical logic.  Even now he cannot give any exact reason for his best guesses…. For though it goes wrong oftener than right, yet the relative frequency with which it is right is on the whole the most wonderful thing in our constitution.” The Essential Peirce: Selected Philosophical Writings v. 2 p. 217
“Two paradoxes are better than one; they may even suggest a solution” –Edward Teller Molecular Systems Biology +  Artificial Intelligence
Explanation is hard Not just about the connection between an explanation and the thing explained, but must also be “consonant” with other explanations. Knowledge is key Have to know many other explanations. Need “judgment” to compare the qualities of alternative explanations. Racunas & Shah’s HyBrow system, but required extensive manually represented knowledge A “complete enough” knowledge-base?
Knowledge-based Computational Biology Widespread use, e.g. Simulation systems (e.g. BioCyc) Question answering systems (e.g. AskHermes or Watson Medicine) High-throughput result analysis (e.g. GOEAST, Ontologizer) Hypothesis generation / testing (e.g. HyQue) Anything that uses an ontology Annotations (e.g. GOA) Cross-species comparisons  NCBO
KB for explanation Knowledge base quality Correctness, timeliness (tracking changes) Completeness A constantly receding goal, that obviously cannot be achieved, but is important anyway Need to cover the material in Textbooks Journal articles Databases
Explanatory inference Even if all the relevant knowledge were available in computationally tractable form… We need inferential methods to Identify possible explanations of complex biological phenomena (symbolic?) Compare alternative explanations in the light of existing evidence (numeric?) History of explanatory inference in AI is suggestive, but key open problems remain
Why does openness matter? Productivity:  Attacking hard problems efficiently Rapid assimilation of effective methods Building on (not ignoring) each other’s results Equity:  Access to scientists with low budgets Distribution to the widest possible community Ethics:  Transparency for AI is a moral value
Transparency is a moral value AI matters – lots of social concerns about loss of control, etc.  2001, Robopocolypse AI is cheap to replicate, and will diverge (if you can build one mind, building millions more is easy).  Too important to be private Technological development in the face of such broad social concern requires earning the trust of the society
Getting there Build on track records of openness OBO &Community-curated Ontologies Semantic Web / OWL / SPARQL / SWRL Open Access Publishing Linked Life Data Breaking down barriers Infrastructure Incentives
Opening a Bazzar To get the productivity advantage, infrastructure matters Technical infrastructure to share, compare and integrate code  Social infrastructure to work together to solve hard problems Motivation Competition Cooperation
Confronting the temptations of being proprietary The temptations: Potential future payoff Avoid effort to conform to the infrastructure Fear of not being able to improve in the future Competition errors Wrong task / evaluation / supplied data Poor process (timing, execution, infrastructure) Doesn’t evolve toward worthy end
Goals Participation from many, previously disparate communities Bio focused: BioCreative, BioNLP, Comp Ling: ACL Shared Tasks, CONLL NIST: TREC, TAC A living, open source collection of useful, modular, repurposable, state of the art software for understanding biomedical texts Major advances in AI
Facilitating an OS community Providing Resources Software (UIMA, U-COMPARE) Compute power Training data (CRAFT, Analysis of analysts) Signal Events Series of competitions based on CRAFT Incentives Prizes for significant achievements
http://bionlp-corpora.sourceforge.net/CRAFT/http://bionlp.sourceforge.net
Remaining challenges Pubmed Central and open access Corporate ownership (Ontotext & LLD) Semantic compatibility of various sources UMLS breadth vs. BFO logic Sharing inference methods & rules Rule syntax (SWRL) is not enough.   DL inference is not enough UIMA equivalent?
How to participate Help design CRAFT competitions Confront publishers about PMC bulk downloads Help define inferential benchmarks
A01-Openness in knowledge-based systems

More Related Content

Viewers also liked (6)

Bosc2011 isobar-fbp
Bosc2011 isobar-fbpBosc2011 isobar-fbp
Bosc2011 isobar-fbp
 
Joanie Nowell Portfolio P Pointpdf
Joanie Nowell Portfolio P PointpdfJoanie Nowell Portfolio P Pointpdf
Joanie Nowell Portfolio P Pointpdf
 
C02-Visualization-Applying visual analytics
C02-Visualization-Applying visual analyticsC02-Visualization-Applying visual analytics
C02-Visualization-Applying visual analytics
 
Link
LinkLink
Link
 
A Niche Perspective on The Personal Care Market
A Niche Perspective on The Personal Care MarketA Niche Perspective on The Personal Care Market
A Niche Perspective on The Personal Care Market
 
Camboya memoria
Camboya memoriaCamboya memoria
Camboya memoria
 

Similar to A01-Openness in knowledge-based systems

Ontological realism as a strategy for integrating ontologies
Ontological realism as a strategy for integrating ontologiesOntological realism as a strategy for integrating ontologies
Ontological realism as a strategy for integrating ontologiesBarry Smith
 
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...Amit Sheth
 
Smart-GoodnessOfTheUniverse-SteppingIntoFuture2022NEW.pptx
Smart-GoodnessOfTheUniverse-SteppingIntoFuture2022NEW.pptxSmart-GoodnessOfTheUniverse-SteppingIntoFuture2022NEW.pptx
Smart-GoodnessOfTheUniverse-SteppingIntoFuture2022NEW.pptxJohn Smart
 
Politics and Pragmatism in Scientific Ontology Construction
Politics and Pragmatism in Scientific Ontology ConstructionPolitics and Pragmatism in Scientific Ontology Construction
Politics and Pragmatism in Scientific Ontology ConstructionMike Travers
 
Tales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life SciencesTales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life SciencesStefano Di Carlo
 
Ontologies for baby animals and robots From "baby stuff" to the world of adul...
Ontologies for baby animals and robots From "baby stuff" to the world of adul...Ontologies for baby animals and robots From "baby stuff" to the world of adul...
Ontologies for baby animals and robots From "baby stuff" to the world of adul...Aaron Sloman
 
The real world of ontologies and phenotype representation: perspectives from...
The real world of ontologies and phenotype representation:  perspectives from...The real world of ontologies and phenotype representation:  perspectives from...
The real world of ontologies and phenotype representation: perspectives from...Maryann Martone
 
Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014Mark Wilkinson
 
Ontology - and Reloaded and Revolutions
Ontology - and Reloaded and RevolutionsOntology - and Reloaded and Revolutions
Ontology - and Reloaded and RevolutionsJie Bao
 
Computing on the shoulders of giants
Computing on the shoulders of giantsComputing on the shoulders of giants
Computing on the shoulders of giantsBenjamin Good
 
I NTRODUCTION.doc
I NTRODUCTION.docI NTRODUCTION.doc
I NTRODUCTION.docbutest
 
Biomimetics Steaaling From Nature Uni Of Reading
Biomimetics Steaaling From Nature Uni Of ReadingBiomimetics Steaaling From Nature Uni Of Reading
Biomimetics Steaaling From Nature Uni Of ReadingJake Langford
 
The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...Neuroscience Information Framework
 
Phyloinformatics and the Semantic Web
Phyloinformatics and the Semantic WebPhyloinformatics and the Semantic Web
Phyloinformatics and the Semantic WebRutger Vos
 
Knowledge graph construction for research & medicine
Knowledge graph construction for research & medicineKnowledge graph construction for research & medicine
Knowledge graph construction for research & medicinePaul Groth
 
The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework Neuroscience Information Framework
 
BEACON 101: Sequencing tech
BEACON 101: Sequencing techBEACON 101: Sequencing tech
BEACON 101: Sequencing techc.titus.brown
 
Life, Knowledge and Natural Selection ― How life (scientifically) designs its...
Life, Knowledge and Natural Selection ― How life (scientifically) designs its...Life, Knowledge and Natural Selection ― How life (scientifically) designs its...
Life, Knowledge and Natural Selection ― How life (scientifically) designs its...William Hall
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsmikaelhuss
 

Similar to A01-Openness in knowledge-based systems (20)

Ontological realism as a strategy for integrating ontologies
Ontological realism as a strategy for integrating ontologiesOntological realism as a strategy for integrating ontologies
Ontological realism as a strategy for integrating ontologies
 
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
Semantics for Bioinformatics: What, Why and How of Search, Integration and An...
 
Smart-GoodnessOfTheUniverse-SteppingIntoFuture2022NEW.pptx
Smart-GoodnessOfTheUniverse-SteppingIntoFuture2022NEW.pptxSmart-GoodnessOfTheUniverse-SteppingIntoFuture2022NEW.pptx
Smart-GoodnessOfTheUniverse-SteppingIntoFuture2022NEW.pptx
 
Politics and Pragmatism in Scientific Ontology Construction
Politics and Pragmatism in Scientific Ontology ConstructionPolitics and Pragmatism in Scientific Ontology Construction
Politics and Pragmatism in Scientific Ontology Construction
 
Tales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life SciencesTales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life Sciences
 
Ontologies for baby animals and robots From "baby stuff" to the world of adul...
Ontologies for baby animals and robots From "baby stuff" to the world of adul...Ontologies for baby animals and robots From "baby stuff" to the world of adul...
Ontologies for baby animals and robots From "baby stuff" to the world of adul...
 
The real world of ontologies and phenotype representation: perspectives from...
The real world of ontologies and phenotype representation:  perspectives from...The real world of ontologies and phenotype representation:  perspectives from...
The real world of ontologies and phenotype representation: perspectives from...
 
Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014Presentation to the J. Craig Venter Institute, Dec. 2014
Presentation to the J. Craig Venter Institute, Dec. 2014
 
Ontology - and Reloaded and Revolutions
Ontology - and Reloaded and RevolutionsOntology - and Reloaded and Revolutions
Ontology - and Reloaded and Revolutions
 
Computing on the shoulders of giants
Computing on the shoulders of giantsComputing on the shoulders of giants
Computing on the shoulders of giants
 
I NTRODUCTION.doc
I NTRODUCTION.docI NTRODUCTION.doc
I NTRODUCTION.doc
 
Biomimetics Steaaling From Nature Uni Of Reading
Biomimetics Steaaling From Nature Uni Of ReadingBiomimetics Steaaling From Nature Uni Of Reading
Biomimetics Steaaling From Nature Uni Of Reading
 
The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...The Neuroscience Information Framework:The present and future of neuroscience...
The Neuroscience Information Framework:The present and future of neuroscience...
 
Phyloinformatics and the Semantic Web
Phyloinformatics and the Semantic WebPhyloinformatics and the Semantic Web
Phyloinformatics and the Semantic Web
 
Paul Groth
Paul GrothPaul Groth
Paul Groth
 
Knowledge graph construction for research & medicine
Knowledge graph construction for research & medicineKnowledge graph construction for research & medicine
Knowledge graph construction for research & medicine
 
The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework
 
BEACON 101: Sequencing tech
BEACON 101: Sequencing techBEACON 101: Sequencing tech
BEACON 101: Sequencing tech
 
Life, Knowledge and Natural Selection ― How life (scientifically) designs its...
Life, Knowledge and Natural Selection ― How life (scientifically) designs its...Life, Knowledge and Natural Selection ― How life (scientifically) designs its...
Life, Knowledge and Natural Selection ― How life (scientifically) designs its...
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
 

More from Bioinformatics Open Source Conference

More from Bioinformatics Open Source Conference (20)

Running workflows through galaxy bosc presentation
Running workflows through galaxy bosc presentationRunning workflows through galaxy bosc presentation
Running workflows through galaxy bosc presentation
 
Talk1 ben sadi for_gmod_bosc_2011
Talk1 ben sadi for_gmod_bosc_2011Talk1 ben sadi for_gmod_bosc_2011
Talk1 ben sadi for_gmod_bosc_2011
 
Bosc mercer
Bosc mercerBosc mercer
Bosc mercer
 
Mobyle 1 0_new_features_new_types_of_service
Mobyle 1 0_new_features_new_types_of_serviceMobyle 1 0_new_features_new_types_of_service
Mobyle 1 0_new_features_new_types_of_service
 
Bosc2011 arakawa
Bosc2011 arakawaBosc2011 arakawa
Bosc2011 arakawa
 
Talk6 biopython bosc2011
Talk6 biopython bosc2011Talk6 biopython bosc2011
Talk6 biopython bosc2011
 
Unipro ugene bosc 2011 update
Unipro ugene bosc 2011 updateUnipro ugene bosc 2011 update
Unipro ugene bosc 2011 update
 
Bosc2011 ntino-krampis-full
Bosc2011 ntino-krampis-fullBosc2011 ntino-krampis-full
Bosc2011 ntino-krampis-full
 
Bosc talk 7-15-2011x
Bosc talk 7-15-2011xBosc talk 7-15-2011x
Bosc talk 7-15-2011x
 
F02-Cloud-Cloud BioLinux
F02-Cloud-Cloud BioLinuxF02-Cloud-Cloud BioLinux
F02-Cloud-Cloud BioLinux
 
B07-GenomeContent-Biomart
B07-GenomeContent-BiomartB07-GenomeContent-Biomart
B07-GenomeContent-Biomart
 
B03-GenomeContent-Intermine
B03-GenomeContent-IntermineB03-GenomeContent-Intermine
B03-GenomeContent-Intermine
 
G03-SemanticWeb-OntoCAT
G03-SemanticWeb-OntoCATG03-SemanticWeb-OntoCAT
G03-SemanticWeb-OntoCAT
 
F06-Cloud-Enabling NGS
F06-Cloud-Enabling NGSF06-Cloud-Enabling NGS
F06-Cloud-Enabling NGS
 
D03-NextGen-Bio-NGS
D03-NextGen-Bio-NGSD03-NextGen-Bio-NGS
D03-NextGen-Bio-NGS
 
F07-Cloud-Hadoop-BAM
F07-Cloud-Hadoop-BAMF07-Cloud-Hadoop-BAM
F07-Cloud-Hadoop-BAM
 
C03-Visualization-Webapollo
C03-Visualization-WebapolloC03-Visualization-Webapollo
C03-Visualization-Webapollo
 
F01-Cloud-Mygene.info
F01-Cloud-Mygene.infoF01-Cloud-Mygene.info
F01-Cloud-Mygene.info
 
F03-Cloud-Obiwee
F03-Cloud-ObiweeF03-Cloud-Obiwee
F03-Cloud-Obiwee
 
F05-Cloud-Sequencescape
F05-Cloud-SequencescapeF05-Cloud-Sequencescape
F05-Cloud-Sequencescape
 

Recently uploaded

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 

Recently uploaded (20)

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 

A01-Openness in knowledge-based systems

  • 1. The Role of Openness in Creating a Mind for Life
  • 2. Open Source, AI, & Biology An AI breakthrough can come from an application in biology It is imperative that this be open source Some steps toward (and questions about) creating an open source AI for understanding life
  • 3. The first artificial mind will think about molecular biology “You can’t think about thinking without thinking about thinking about something.” Seymour Papert, 1974 “A thorough study of Human Physiology is, in itself, an education broader and more comprehensive than much that passes under that name. There is no side of the intellect which it does not call into play, no region of human knowledge into which either its roots, or its branches, do not extend.” Thomas Huxley,1893
  • 4. Why AI hasn’t succeeded (yet) People know a lot about the world implicitly Conversing with a partnerwho doesn’t know these basic things is very frustrating 50 years of failing to capture this “common sense” information computationally suggests: Lack of explicit enumeration makes capture very expensive (encyclopedias don’t have it!) Still no idea of the extent of this knowledge
  • 5.
  • 6. X J.J. Hornberg et al. / BioSystems 83 (2006) 81–90 Homeostatic networks foil single markers and drugs outcome target
  • 7. Networks change through time Mjolsness, Sharp, Reinitz, A Connectionist Model of Development J. Theoretical Bio 1991
  • 8. Understanding the data “We are close to having a $1,000 genome sequence, but this may be accompanied by a $1,000,000 interpretation.” - Bruce Korf, president American College of Medical Genetics Not only is the cost of sequencing essentially free, but big computers and big storage are cheap, too. What will keep us busy for the next 50 years is understanding the data” - Russ Altman, chair of Biomedical Engineering at Stanford
  • 9. The Hard Problem Given a set of genomic regions, variants, gene products, and/or concentrations empirically involved in a defined phenotype… Produce: An explanation of the reasons that those genomic regions / variants / products / concentrations are (or are not) relevant to the phenotype Evidence to support the explanation(s) Alternative explanations Reasons to prefer one explanation over another
  • 10. Answering Why? questions Fundamental to human cognitive development Amazing human facility Even to confabulation Causal explanation is central to science The only question “big data”doesn’t seem to be enoughto answer (cfRamachandran & Hovy, 2002)
  • 11. Abductive inference “However man may have acquired his faculty of divining the ways of Nature, it has certainly not been by a self-controlled and critical logic. Even now he cannot give any exact reason for his best guesses…. For though it goes wrong oftener than right, yet the relative frequency with which it is right is on the whole the most wonderful thing in our constitution.” The Essential Peirce: Selected Philosophical Writings v. 2 p. 217
  • 12. “Two paradoxes are better than one; they may even suggest a solution” –Edward Teller Molecular Systems Biology + Artificial Intelligence
  • 13. Explanation is hard Not just about the connection between an explanation and the thing explained, but must also be “consonant” with other explanations. Knowledge is key Have to know many other explanations. Need “judgment” to compare the qualities of alternative explanations. Racunas & Shah’s HyBrow system, but required extensive manually represented knowledge A “complete enough” knowledge-base?
  • 14. Knowledge-based Computational Biology Widespread use, e.g. Simulation systems (e.g. BioCyc) Question answering systems (e.g. AskHermes or Watson Medicine) High-throughput result analysis (e.g. GOEAST, Ontologizer) Hypothesis generation / testing (e.g. HyQue) Anything that uses an ontology Annotations (e.g. GOA) Cross-species comparisons NCBO
  • 15. KB for explanation Knowledge base quality Correctness, timeliness (tracking changes) Completeness A constantly receding goal, that obviously cannot be achieved, but is important anyway Need to cover the material in Textbooks Journal articles Databases
  • 16. Explanatory inference Even if all the relevant knowledge were available in computationally tractable form… We need inferential methods to Identify possible explanations of complex biological phenomena (symbolic?) Compare alternative explanations in the light of existing evidence (numeric?) History of explanatory inference in AI is suggestive, but key open problems remain
  • 17. Why does openness matter? Productivity: Attacking hard problems efficiently Rapid assimilation of effective methods Building on (not ignoring) each other’s results Equity: Access to scientists with low budgets Distribution to the widest possible community Ethics: Transparency for AI is a moral value
  • 18. Transparency is a moral value AI matters – lots of social concerns about loss of control, etc. 2001, Robopocolypse AI is cheap to replicate, and will diverge (if you can build one mind, building millions more is easy). Too important to be private Technological development in the face of such broad social concern requires earning the trust of the society
  • 19. Getting there Build on track records of openness OBO &Community-curated Ontologies Semantic Web / OWL / SPARQL / SWRL Open Access Publishing Linked Life Data Breaking down barriers Infrastructure Incentives
  • 20. Opening a Bazzar To get the productivity advantage, infrastructure matters Technical infrastructure to share, compare and integrate code Social infrastructure to work together to solve hard problems Motivation Competition Cooperation
  • 21. Confronting the temptations of being proprietary The temptations: Potential future payoff Avoid effort to conform to the infrastructure Fear of not being able to improve in the future Competition errors Wrong task / evaluation / supplied data Poor process (timing, execution, infrastructure) Doesn’t evolve toward worthy end
  • 22. Goals Participation from many, previously disparate communities Bio focused: BioCreative, BioNLP, Comp Ling: ACL Shared Tasks, CONLL NIST: TREC, TAC A living, open source collection of useful, modular, repurposable, state of the art software for understanding biomedical texts Major advances in AI
  • 23. Facilitating an OS community Providing Resources Software (UIMA, U-COMPARE) Compute power Training data (CRAFT, Analysis of analysts) Signal Events Series of competitions based on CRAFT Incentives Prizes for significant achievements
  • 25. Remaining challenges Pubmed Central and open access Corporate ownership (Ontotext & LLD) Semantic compatibility of various sources UMLS breadth vs. BFO logic Sharing inference methods & rules Rule syntax (SWRL) is not enough. DL inference is not enough UIMA equivalent?
  • 26. How to participate Help design CRAFT competitions Confront publishers about PMC bulk downloads Help define inferential benchmarks

Editor's Notes

  1. I will have more to say about the importance of broad knowledge later
  2. Humans are very facile at generating explanations. Confabulation is what happens when the process is disconnected from relevant sources of information. Split brain patients explaining why the other hemisphere did something, or Capgrassymdome
  3. I believe that both of these goals are within reach in the next generation.