SlideShare una empresa de Scribd logo
1 de 15
Descargar para leer sin conexión
Artificial Intelligence /Machine Learning
for Secondary Metabolite Prediction
Yannick Djoumbou Feunang
Corteva Agriscience, Indianapolis, IN, US
Online workshop: Computational Applications in Secondary Metabolite Discovery
March 9, 2021
2
In Silico Metabolism Prediction
Task: Given a small molecule, use computational tools to predict the outcome of its
interactions with metabolic enzymes.
Here, we will focus on the structural elucidation of potential metabolites.
(Humans; Bacteria)
(Humans; Plants) (Humans; Plants) (Humans; Plants;
Bacteria)
Limonene
(Citrus)
3
Why is Metabolism so Important?
• Bio/chemo-transformations influence
changes of our chemical exposome
• <2% of detectable peaks identifiable in
large-scale untargeted Metabolomics
• What are these unknowns?
• How to determine their structures?
• How to determine their activities?
• Cheminformatics and AI can be used to:
➢Detect common patterns
➢Predict enzyme/ligand interactions
➢Understand metabolism
➢Generate biologically feasible structures
through simulated reactions
4
Why is Metabolism so Important?
Pro/soft-drug design
Exposure science
ADMET profiling
http://admet.scbdd.com/
Nutrition science
Jin et al. (2017)
Influence of metabolism on a xenobiotic’s pharmacodynamics (PD),
including pharmacological activity (Act), and toxicological effects (Tox).
Djoumbou Feunang et al. (2019);
Strep. griseus
Artemisinin to
Artemisitone-9
Microbial biosynthesis
Liu et al. (2006)
5
Approaches for In silico Metabolism Prediction
• Requires:
• Module to predict (and rank/score) SoMs, enzyme-substrate selectivity, or reaction groups
• Library of reaction templates to apply or select (via prediction) from
• Modules are usually specific (chemical/enzyme classes), or comprehensive (whole species)
• Prediction approaches can be ligand- or structure-based
• Some tools include: ML/DL-based (MetaTrans, Glory), Rule-based (MetabolExpert), Hybrid (BioTransformer,
Meteor Nexus)
Tolclofos-methyl
(Rats; mice)
Substrate specificity Yes/No
Sites of Metabolism
(SoMs)
Yes
No
Yes
Yes
ML/DL-, Rule/Data
mining-based prediction
ML/DL
prediction;
QM/Docking;
fingerprints
Apply
rule
Apply
rule
Hydroxylation
Desulfurization
O-dealkylation
Epoxidation
Reaction
Templates
6
Ligand-based Prediction
• Use structures of ligands/non-ligands to
predict specificity, accessibility to enzymes
• Some methods include, among others:
➢QSAR/QSMR (SoM, BoM, ESS)
➢Data mining/fingerprints (SoM)
➢Substructures/Rules (ESS, Reaction)
• Advantages:
➢Speed
➢Seem to perform as well as structure-based
methods
• Disadvantages:
➢Approaches/models/rule bases tend to not
work on novel chemistries
➢Data quantity and quality is a limiting factor
Strieth-Kalthoff et al. (2020)
Hydroxylation of methyl carbon adjacent to aliphatic ring
7
Structure-based Prediction
• Explicitly model the interaction within the
enzyme’s binding pocket
• Help predicting sites of metabolism (SoMs)
• Some methods include, among others:
➢Protein-ligand docking
➢Molecular dynamics
• Advantages
➢Detailed study of the enzyme-substrate
interaction
• Disadvantages
➢High computational power to model structural
flexibility
Ménard et al. (2012)
8
Examples of In Silico Metabolism Prediction Tools
Based on their coverage, in silico metabolism prediction tools can be classified as:
1. Specific tools, which apply to simple biological systems (e.g. a small set of
enzymes), and often, cover a rather small chemical space.
2. Comprehensive tools, which apply to a more diverse and larger set of enzymes,
species, and cover a larger chemical space.
Prediction Tool Accessibility Spectrum Approach Methods Availability
BioTransformer Web server / Command line Comprehensive Ligand-based Machine Learning
Expert system
Open source
Meteor Nexus GUI-based standalone Comprehensive Ligand-based Machine Learning
Expert system
Commercial
GLORYx Web server Comprehensive Ligand-based Machine Learning Freely available
SmartCYP Web server Specific Combined Machine Learning Freely available
MetaSite GUI-based standalone Specific Combined Machine Learning Commercial
MetabolExpert GUI-based standalone Specific Ligand-based Expert system Commercial
Examples of tools for metabolite prediction
9
Enhancing Metabolite Discovery and Identification
More structures and spectra
available for database search
?
?
?
?
? ?
?
?
?
?
?
?
?
?
?
• >2x105 known metabolites
• > 3x106 expected metabolites
Measuring the exposome
MS/NMR-based Metabolomics
• Expand the databases with predicted
structures
• Add predicted spectra for these metabolites
More structures available for mass/formula
matching to annotate the peaks
?
?
?
√ √
√
√
?
√
?
?
Higher identification rates
10
BioTransformer: Open Source for Metabolite Identification
• Djoumbou-Feunang et al. J Cheminform. 2019 Jan 5;11(1):2
• Open Source: bitbucket.org/djoumbou/biotransformer
• Docker: https://hub.docker.com/r/djoy2018/biotransformer-cl
• RESTful web server: www.biotransformer.ca
Djoumbou Feunang et al. (2019); J. Cheminf.; DOI 10.1186/s13321-018-0324-5
11
BioTransformer: Open source for Metabolite Identification
Epicatechin
11
12
Challenges in Metabolism Prediction
• Challenges in understanding metabolism lead to lower prediction accuracy
➢ML tools can rank SoMs with high accuracy, but not the resulting biotransformations
➢Knowledge-driven approaches achieve higher recall, but suffer from combinatorial explosion
• Limited accessibility/availability of tools
➢Most tools developed in academia are available only via web-sever, raising confidentiality
issues for most potential users
➢Restricted shareability of data generated by commercial tools
• The limited availability of high-quality data
➢Crucial for achieving better accuracy, higher coverage, and larger applicability domains
➢Needed to derive more reaction rules
➢However, data curation (Sites of metabolism, reaction annotation) is tedious and expensive
13
Outlook
• Data sharing
➢Increase the amount of high-quality, publicly available, curated, and downloadable data
(substrate-product, Sites of Metabolism). E.g.: MetXBioDB, PubChem, XMetDB, etc.
➢Develop publicly accessible databases of predicted metabolites (e.g.: BioTransformerDB)
➢Community-wide efforts needed
• Novel, and innovative AI approaches:
➢Seq2Seq transformer architectures have proven applicable for end-to-end learning-based
method, with results comparable to existing tools (MetaTrans: Litsa et al. 2020)
▪ Could improve accuracy, while bypassing manual rule design
➢Bonds of Metabolism (BoMs) seem to provide a better description of reaction centers,
leading to higher accuracy, compared to SoMs (Upcoming - Tian, et al. (2021))
• Open source communities
➢Provide means for easier user feedback loops; valuable for improving prediction tools
➢Provide developers opportunities to improve software tools, in an agile, continuous manner
▪ Successful projects include (BioTransformer, Chemistry Development Kit, Knime)
14
Thank You
• The organizing committee:
➢Fidele Ntie-Kang
➢J. Ludwig Muller
• The Wishart Lab @UofAlberta, Canada
• Corteva Agriscience
• The BioTransformer community
• The listeners
• To learn more about BioTransformer:
➢HS03: In Silico Prediction and Identification of Metabolites with BioTransformer : Enabling
Secondary Metabolite Discovery; March 10, 2021
15

Más contenido relacionado

La actualidad más candente

Introduction to proteomics
Introduction to proteomicsIntroduction to proteomics
Introduction to proteomics
Shryli Shreekar
 

La actualidad más candente (20)

Stem cells and nanotechnology in regenerative medicine and tissue engineering
Stem cells and nanotechnology in regenerative medicine and tissue engineeringStem cells and nanotechnology in regenerative medicine and tissue engineering
Stem cells and nanotechnology in regenerative medicine and tissue engineering
 
Introduction to proteomics
Introduction to proteomicsIntroduction to proteomics
Introduction to proteomics
 
WhatCheck | BioCode Ltd
WhatCheck | BioCode LtdWhatCheck | BioCode Ltd
WhatCheck | BioCode Ltd
 
Flux balance analysis
Flux balance analysisFlux balance analysis
Flux balance analysis
 
Biotech and AI
Biotech and AIBiotech and AI
Biotech and AI
 
Protein structure prediction (1)
Protein structure prediction (1)Protein structure prediction (1)
Protein structure prediction (1)
 
Bioinformatics & It's Scope in Biotechnology
Bioinformatics & It's Scope in BiotechnologyBioinformatics & It's Scope in Biotechnology
Bioinformatics & It's Scope in Biotechnology
 
AI in Bioinformatics
AI in BioinformaticsAI in Bioinformatics
AI in Bioinformatics
 
3D BIO PRINTING USING TISSUE AND ORGANS
3D BIO PRINTING USING TISSUE AND ORGANS3D BIO PRINTING USING TISSUE AND ORGANS
3D BIO PRINTING USING TISSUE AND ORGANS
 
Artificial intelligence in drug discovery and development
Artificial intelligence in drug discovery and developmentArtificial intelligence in drug discovery and development
Artificial intelligence in drug discovery and development
 
Molecular farming of biopharmacuetical
Molecular farming of biopharmacueticalMolecular farming of biopharmacuetical
Molecular farming of biopharmacuetical
 
Presentation1
Presentation1Presentation1
Presentation1
 
OrganoidPresentation
OrganoidPresentationOrganoidPresentation
OrganoidPresentation
 
Metabolomics Data Analysis
Metabolomics Data AnalysisMetabolomics Data Analysis
Metabolomics Data Analysis
 
Cell Culture Media PPT
Cell Culture Media PPTCell Culture Media PPT
Cell Culture Media PPT
 
Plant bioreactor
Plant bioreactorPlant bioreactor
Plant bioreactor
 
proteomics and system biology
proteomics and system biologyproteomics and system biology
proteomics and system biology
 
Genomic Databases-.pptx
Genomic Databases-.pptxGenomic Databases-.pptx
Genomic Databases-.pptx
 
EST Clustering.ppt
EST Clustering.pptEST Clustering.ppt
EST Clustering.ppt
 
Pubchem
PubchemPubchem
Pubchem
 

Similar a AI and Machine Learning for Secondary Metabolite Prediction

NanoAgents: Molecular Docking Using Multi-Agent Technology
NanoAgents: Molecular Docking Using Multi-Agent TechnologyNanoAgents: Molecular Docking Using Multi-Agent Technology
NanoAgents: Molecular Docking Using Multi-Agent Technology
CSCJournals
 
Promiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCNPromiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCN
Jeremy Yang
 
Computational Biology Methods for Drug Discovery_Phase 1-5_November 2015
Computational Biology Methods for Drug Discovery_Phase 1-5_November 2015Computational Biology Methods for Drug Discovery_Phase 1-5_November 2015
Computational Biology Methods for Drug Discovery_Phase 1-5_November 2015
Mathew Varghese
 
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of ActionA Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
Gerald Lushington
 
Computer aided drug designing
Computer aided drug designing Computer aided drug designing
Computer aided drug designing
Ayesha Aftab
 

Similar a AI and Machine Learning for Secondary Metabolite Prediction (20)

Designing a community resource - Sandra Orchard
Designing a community resource - Sandra OrchardDesigning a community resource - Sandra Orchard
Designing a community resource - Sandra Orchard
 
Java tutorial: Programmatic Access to Molecular Interactions
Java tutorial: Programmatic Access to Molecular InteractionsJava tutorial: Programmatic Access to Molecular Interactions
Java tutorial: Programmatic Access to Molecular Interactions
 
Project report: Investigating the effect of cellular objectives on genome-sca...
Project report: Investigating the effect of cellular objectives on genome-sca...Project report: Investigating the effect of cellular objectives on genome-sca...
Project report: Investigating the effect of cellular objectives on genome-sca...
 
Technology R&D Theme 1: Differential Networks
Technology R&D Theme 1: Differential NetworksTechnology R&D Theme 1: Differential Networks
Technology R&D Theme 1: Differential Networks
 
Applied Bioinformatics Assignment 5docx
Applied Bioinformatics Assignment  5docxApplied Bioinformatics Assignment  5docx
Applied Bioinformatics Assignment 5docx
 
Drug design based on bioinformatic tools
Drug design based on bioinformatic toolsDrug design based on bioinformatic tools
Drug design based on bioinformatic tools
 
Varieties of Self-Awareness and Their Uses in Natural and Artificial Systems ...
Varieties of Self-Awareness and Their Uses in Natural and Artificial Systems ...Varieties of Self-Awareness and Their Uses in Natural and Artificial Systems ...
Varieties of Self-Awareness and Their Uses in Natural and Artificial Systems ...
 
Chemoinformatic File Format.pptx
Chemoinformatic File Format.pptxChemoinformatic File Format.pptx
Chemoinformatic File Format.pptx
 
NanoAgents: Molecular Docking Using Multi-Agent Technology
NanoAgents: Molecular Docking Using Multi-Agent TechnologyNanoAgents: Molecular Docking Using Multi-Agent Technology
NanoAgents: Molecular Docking Using Multi-Agent Technology
 
Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014Intro to in silico drug discovery 2014
Intro to in silico drug discovery 2014
 
Promiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCNPromiscuous patterns and perils in PubChem and the MLSCN
Promiscuous patterns and perils in PubChem and the MLSCN
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Computational Biology Methods for Drug Discovery_Phase 1-5_November 2015
Computational Biology Methods for Drug Discovery_Phase 1-5_November 2015Computational Biology Methods for Drug Discovery_Phase 1-5_November 2015
Computational Biology Methods for Drug Discovery_Phase 1-5_November 2015
 
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of ActionA Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
A Biclustering Method for Rationalizing Chemical Biology Mechanisms of Action
 
T-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesT-BioInfo Methods and Approaches
T-BioInfo Methods and Approaches
 
T-bioinfo overview
T-bioinfo overviewT-bioinfo overview
T-bioinfo overview
 
Computer aided drug designing
Computer aided drug designing Computer aided drug designing
Computer aided drug designing
 
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MININGANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
ANALYSIS OF PROTEIN MICROARRAY DATA USING DATA MINING
 
The Complete Guide for Metabolomics Methods and Application
The Complete Guide for Metabolomics Methods and ApplicationThe Complete Guide for Metabolomics Methods and Application
The Complete Guide for Metabolomics Methods and Application
 
The Complete Guide for Metabolomics Methods and Application
The Complete Guide for Metabolomics Methods and ApplicationThe Complete Guide for Metabolomics Methods and Application
The Complete Guide for Metabolomics Methods and Application
 

Último

Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Sheetaleventcompany
 
Premium Call Girls Dehradun {8854095900} ❤️VVIP ANJU Call Girls in Dehradun U...
Premium Call Girls Dehradun {8854095900} ❤️VVIP ANJU Call Girls in Dehradun U...Premium Call Girls Dehradun {8854095900} ❤️VVIP ANJU Call Girls in Dehradun U...
Premium Call Girls Dehradun {8854095900} ❤️VVIP ANJU Call Girls in Dehradun U...
Sheetaleventcompany
 
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
Sheetaleventcompany
 
Control of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronicControl of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronic
MedicoseAcademics
 
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan 087776558899
 
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
Sheetaleventcompany
 
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Sheetaleventcompany
 
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
Sheetaleventcompany
 

Último (20)

Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsAppMost Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
 
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
Call Girl In Indore 📞9235973566📞 Just📲 Call Inaaya Indore Call Girls Service ...
 
❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...
❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...
❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...
 
Kolkata Call Girls Shobhabazar 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Gir...
Kolkata Call Girls Shobhabazar  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Gir...Kolkata Call Girls Shobhabazar  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Gir...
Kolkata Call Girls Shobhabazar 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Gir...
 
Premium Call Girls Dehradun {8854095900} ❤️VVIP ANJU Call Girls in Dehradun U...
Premium Call Girls Dehradun {8854095900} ❤️VVIP ANJU Call Girls in Dehradun U...Premium Call Girls Dehradun {8854095900} ❤️VVIP ANJU Call Girls in Dehradun U...
Premium Call Girls Dehradun {8854095900} ❤️VVIP ANJU Call Girls in Dehradun U...
 
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
 
7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta
7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta
7 steps How to prevent Thalassemia : Dr Sharda Jain & Vandana Gupta
 
Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...
Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...
Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...
 
Kolkata Call Girls Naktala 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
Kolkata Call Girls Naktala  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Girl Se...Kolkata Call Girls Naktala  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Girl Se...
Kolkata Call Girls Naktala 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
 
Control of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronicControl of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronic
 
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
 
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
 
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
Pune Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Pune No💰Adva...
 
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
Gorgeous Call Girls Dehradun {8854095900} ❤️VVIP ROCKY Call Girls in Dehradun...
 
Call 8250092165 Patna Call Girls ₹4.5k Cash Payment With Room Delivery
Call 8250092165 Patna Call Girls ₹4.5k Cash Payment With Room DeliveryCall 8250092165 Patna Call Girls ₹4.5k Cash Payment With Room Delivery
Call 8250092165 Patna Call Girls ₹4.5k Cash Payment With Room Delivery
 
Circulatory Shock, types and stages, compensatory mechanisms
Circulatory Shock, types and stages, compensatory mechanismsCirculatory Shock, types and stages, compensatory mechanisms
Circulatory Shock, types and stages, compensatory mechanisms
 
Race Course Road } Book Call Girls in Bangalore | Whatsapp No 6378878445 VIP ...
Race Course Road } Book Call Girls in Bangalore | Whatsapp No 6378878445 VIP ...Race Course Road } Book Call Girls in Bangalore | Whatsapp No 6378878445 VIP ...
Race Course Road } Book Call Girls in Bangalore | Whatsapp No 6378878445 VIP ...
 
Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...
Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...
Chandigarh Call Girls Service ❤️🍑 9809698092 👄🫦Independent Escort Service Cha...
 
Cardiac Output, Venous Return, and Their Regulation
Cardiac Output, Venous Return, and Their RegulationCardiac Output, Venous Return, and Their Regulation
Cardiac Output, Venous Return, and Their Regulation
 
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
 

AI and Machine Learning for Secondary Metabolite Prediction

  • 1. Artificial Intelligence /Machine Learning for Secondary Metabolite Prediction Yannick Djoumbou Feunang Corteva Agriscience, Indianapolis, IN, US Online workshop: Computational Applications in Secondary Metabolite Discovery March 9, 2021
  • 2. 2 In Silico Metabolism Prediction Task: Given a small molecule, use computational tools to predict the outcome of its interactions with metabolic enzymes. Here, we will focus on the structural elucidation of potential metabolites. (Humans; Bacteria) (Humans; Plants) (Humans; Plants) (Humans; Plants; Bacteria) Limonene (Citrus)
  • 3. 3 Why is Metabolism so Important? • Bio/chemo-transformations influence changes of our chemical exposome • <2% of detectable peaks identifiable in large-scale untargeted Metabolomics • What are these unknowns? • How to determine their structures? • How to determine their activities? • Cheminformatics and AI can be used to: ➢Detect common patterns ➢Predict enzyme/ligand interactions ➢Understand metabolism ➢Generate biologically feasible structures through simulated reactions
  • 4. 4 Why is Metabolism so Important? Pro/soft-drug design Exposure science ADMET profiling http://admet.scbdd.com/ Nutrition science Jin et al. (2017) Influence of metabolism on a xenobiotic’s pharmacodynamics (PD), including pharmacological activity (Act), and toxicological effects (Tox). Djoumbou Feunang et al. (2019); Strep. griseus Artemisinin to Artemisitone-9 Microbial biosynthesis Liu et al. (2006)
  • 5. 5 Approaches for In silico Metabolism Prediction • Requires: • Module to predict (and rank/score) SoMs, enzyme-substrate selectivity, or reaction groups • Library of reaction templates to apply or select (via prediction) from • Modules are usually specific (chemical/enzyme classes), or comprehensive (whole species) • Prediction approaches can be ligand- or structure-based • Some tools include: ML/DL-based (MetaTrans, Glory), Rule-based (MetabolExpert), Hybrid (BioTransformer, Meteor Nexus) Tolclofos-methyl (Rats; mice) Substrate specificity Yes/No Sites of Metabolism (SoMs) Yes No Yes Yes ML/DL-, Rule/Data mining-based prediction ML/DL prediction; QM/Docking; fingerprints Apply rule Apply rule Hydroxylation Desulfurization O-dealkylation Epoxidation Reaction Templates
  • 6. 6 Ligand-based Prediction • Use structures of ligands/non-ligands to predict specificity, accessibility to enzymes • Some methods include, among others: ➢QSAR/QSMR (SoM, BoM, ESS) ➢Data mining/fingerprints (SoM) ➢Substructures/Rules (ESS, Reaction) • Advantages: ➢Speed ➢Seem to perform as well as structure-based methods • Disadvantages: ➢Approaches/models/rule bases tend to not work on novel chemistries ➢Data quantity and quality is a limiting factor Strieth-Kalthoff et al. (2020) Hydroxylation of methyl carbon adjacent to aliphatic ring
  • 7. 7 Structure-based Prediction • Explicitly model the interaction within the enzyme’s binding pocket • Help predicting sites of metabolism (SoMs) • Some methods include, among others: ➢Protein-ligand docking ➢Molecular dynamics • Advantages ➢Detailed study of the enzyme-substrate interaction • Disadvantages ➢High computational power to model structural flexibility Ménard et al. (2012)
  • 8. 8 Examples of In Silico Metabolism Prediction Tools Based on their coverage, in silico metabolism prediction tools can be classified as: 1. Specific tools, which apply to simple biological systems (e.g. a small set of enzymes), and often, cover a rather small chemical space. 2. Comprehensive tools, which apply to a more diverse and larger set of enzymes, species, and cover a larger chemical space. Prediction Tool Accessibility Spectrum Approach Methods Availability BioTransformer Web server / Command line Comprehensive Ligand-based Machine Learning Expert system Open source Meteor Nexus GUI-based standalone Comprehensive Ligand-based Machine Learning Expert system Commercial GLORYx Web server Comprehensive Ligand-based Machine Learning Freely available SmartCYP Web server Specific Combined Machine Learning Freely available MetaSite GUI-based standalone Specific Combined Machine Learning Commercial MetabolExpert GUI-based standalone Specific Ligand-based Expert system Commercial Examples of tools for metabolite prediction
  • 9. 9 Enhancing Metabolite Discovery and Identification More structures and spectra available for database search ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? • >2x105 known metabolites • > 3x106 expected metabolites Measuring the exposome MS/NMR-based Metabolomics • Expand the databases with predicted structures • Add predicted spectra for these metabolites More structures available for mass/formula matching to annotate the peaks ? ? ? √ √ √ √ ? √ ? ? Higher identification rates
  • 10. 10 BioTransformer: Open Source for Metabolite Identification • Djoumbou-Feunang et al. J Cheminform. 2019 Jan 5;11(1):2 • Open Source: bitbucket.org/djoumbou/biotransformer • Docker: https://hub.docker.com/r/djoy2018/biotransformer-cl • RESTful web server: www.biotransformer.ca Djoumbou Feunang et al. (2019); J. Cheminf.; DOI 10.1186/s13321-018-0324-5
  • 11. 11 BioTransformer: Open source for Metabolite Identification Epicatechin 11
  • 12. 12 Challenges in Metabolism Prediction • Challenges in understanding metabolism lead to lower prediction accuracy ➢ML tools can rank SoMs with high accuracy, but not the resulting biotransformations ➢Knowledge-driven approaches achieve higher recall, but suffer from combinatorial explosion • Limited accessibility/availability of tools ➢Most tools developed in academia are available only via web-sever, raising confidentiality issues for most potential users ➢Restricted shareability of data generated by commercial tools • The limited availability of high-quality data ➢Crucial for achieving better accuracy, higher coverage, and larger applicability domains ➢Needed to derive more reaction rules ➢However, data curation (Sites of metabolism, reaction annotation) is tedious and expensive
  • 13. 13 Outlook • Data sharing ➢Increase the amount of high-quality, publicly available, curated, and downloadable data (substrate-product, Sites of Metabolism). E.g.: MetXBioDB, PubChem, XMetDB, etc. ➢Develop publicly accessible databases of predicted metabolites (e.g.: BioTransformerDB) ➢Community-wide efforts needed • Novel, and innovative AI approaches: ➢Seq2Seq transformer architectures have proven applicable for end-to-end learning-based method, with results comparable to existing tools (MetaTrans: Litsa et al. 2020) ▪ Could improve accuracy, while bypassing manual rule design ➢Bonds of Metabolism (BoMs) seem to provide a better description of reaction centers, leading to higher accuracy, compared to SoMs (Upcoming - Tian, et al. (2021)) • Open source communities ➢Provide means for easier user feedback loops; valuable for improving prediction tools ➢Provide developers opportunities to improve software tools, in an agile, continuous manner ▪ Successful projects include (BioTransformer, Chemistry Development Kit, Knime)
  • 14. 14 Thank You • The organizing committee: ➢Fidele Ntie-Kang ➢J. Ludwig Muller • The Wishart Lab @UofAlberta, Canada • Corteva Agriscience • The BioTransformer community • The listeners • To learn more about BioTransformer: ➢HS03: In Silico Prediction and Identification of Metabolites with BioTransformer : Enabling Secondary Metabolite Discovery; March 10, 2021
  • 15. 15