SlideShare a Scribd company logo
1 of 1
Download to read offline
This work has been supported by the BBSRC/EPSRC grant: the Manchester Centre for Integrative Systems Biology



                                                                         Gap filling and identifying new reactions
                                                                         in metabolic networks based on
                                                                         metabolite similarity
                                                                         Matthew G.S. Norris, Neil Swainston, Paul D. Dobson, Daniel Jameson, Evangelos
                                                                         Simeonidis, Kieran Smallbone, Naglis Malys
                                                                         Manchester Centre for Integrative Systems Biology, University of Manchester, Manchester M1 7ND, UK


Introduction

The number of published metabolic network reconstructions are                                                                                     Two chemical similarity distributions were generated, resulting from
increasing, as are their applications. However, such reconstructions                                                                              pairs of metabolites that do and do not form a reaction pair in the
commonly include gaps (see Figure 1), which are due to incomplete                                                                                 network (plotted as actual and potential pairs in Figure 3). Mass
source databases or holes in biochemical knowledge reported in                                                                                    differences are calculated, such that potential pairs were only
literature. The filling of such gaps has been aided through automated                                                                             considered if they exhibit a mass difference of an actual pair, resulting
techniques which attempt to mitigate these gaps by adding reactions                                                                               from a known chemical transformation.
from external resources such as KEGG1.
                                                                                                                                                  Results
The approach introduced here is to apply cheminformatics to
determine and quantify chemical similarity across all metabolites in                                                                              It can be seen that the majority of actual metabolite pairs have a
a metabolic network of S. cerevisiae2. The hypothesis is that those                                                                               chemical similarity score greater than 0.7. However, only 8.5% (557)
metabolite pairs of high chemical similarity are likely to form                                                                                   of potential pairs exhibit such similarity. Of these 557, 99 were
reaction pairs, in which one metabolite can be converted to the                                                                                   found to form a reaction pair in KEGG, but were not present in the
other by a single chemical reaction. The similar scoring pairs that                                                                               metabolic network. From these 99 pairs, a number were selected for
do not currently form a reaction pair in the network can be analysed,                                                                             further evaluation, and three examples of this are provided in Table
by either comparison with existing data resources or by literature                                                                                1. The evaluation entailed:
searches, to determine whether they take part in a metabolic reaction.
                                                                                                                                                  •  extraction from KEGG of homologous protein sequences that
Following this approach, preliminary results have led to the discovery                                                                               catalyse these reactions;
of missing information from KEGG, and the assignment of function
and determination of kinetic constants to a gene of previously                                                                                    •  BLAST searching these sequences against a S. cerevisiae protein
unknown function.                                                                                                                                    database to identify candidate enzymes exhibiting this activity;

                                                                                          Figure 1: Gaps in metabolic                             •  literature search and / or experimental validation of the
                                                                                          networks.                                                  activity of these candidates.
                                                                                          Unreachable metabolites are
                                                                                          disconnected from the                                   KEGG       Reaction                                    Similarity   Gene id     KM / µM       Kcat / s-1
                                                                                                                                                  reaction                                                   score
                                                                                          extracellular medium. “Blocked”
                                                                                                                                                  R00585     L-serine + pyruvate <=> hydroxypyruvate          0.87    YFL030W   Gene activity confirmed in
                                                                                          reactions are incapable of                                         + L-alanine                                                             by literature search4
                                                                                          carrying flux as they lead to                           R00720     ITP + H2O <=> IMP + diphosphate                  0.78    YJR069C         2.33           0.14
                                                                                          dead-end metabolites (such as
                                                                                                                                                  R01215     L-valine + pyruvate <=> 3-methyl-2-              0.76    YER152C   No experimental validation
                                                                                          the metabolites f and j). Gap                                      oxobutanoic acid + L-alanine
                                                                                          filling is required to reconcile both
                                                                                          issues.                                                 Table 1: Reactions found for three highly similar scoring metabolite pairs that
                                                                                                                                                  were not present in the metabolic reconstruction. Metabolites that form pairs are
                                                                                                                                                  highlighted in bold. Kinetic constants were determined through protein
Method                                                                                                                                            expression, purification and absorbance assay (see Figure 4).

Metabolites were extracted from a genome-scale metabolic network,                                                                                 Further work
and SMILES strings representing their chemical structure were
acquired. The structures were compared in a pairwise manner using                                                                                 Future directions may include:
the Chemical Development Kit (CDK)3, to determine a chemical
similarity score between each pair (see Figure 2).
                                                                                                                                                  •  focussing on those metabolites that are known to be “dead-ends”
                                                                                                                                                     or are disconnected from the core network, thus more-closely
                                                                                                                                                     integrating the method with network gap filling;

                                                                                                                                                  •  automating the bioinformatics aspect of the pipeline (BLAST
                                                                                                                                                     searching, etc.) to automate the identification of putative enzymes;

                                                                                                                                                  •  apply text-mining to find potential reactions from literature where
                                                                                                                                                     reactions are not present in existing data resources such as KEGG;

                                                                                                                                                  •  application of the approach to metabolite identification in
                                                                                                                                                     metabolomics experiments.



Figure 2: Example of chemical similarity score generated from SMILES strings
using the CDK for the metabolite pair IMP and ITP.


                                        Similarity score distribution of actual and potential metabolite pairs                                                                                         Figure 4: Confirmation of ITP
             40.0                                                                                                                                                                                      pyrophosphohydrolase activity for
                                                                                                                                                                                                       YJR069C. A Malachite Green assay was
                                                                                                                                                                                                       performed to detect orthophosphate,
                                                                                                                                                                                                       indicating hydrolysis of ITP and release
             30.0                                                                                                                                                                                      of pyrophosphate by YJR069C, which is
                                                                                                                                                                                                       further hydrolysed to orthophosphate by
                                                                                                                                                                                                       inorganic phosphatase (IP).
                                                                                                                                                                                                   !
Percentage




             20.0
                                                                                                                                Actual pairs      References
                                                                                                                                Potential pairs

                                                                                                                                                  1KEGG:   kyoto encyclopedia of genes and genomes. Kanehisa M, et al. Nucleic
                                                                                                                                                  Acids Res. 2000, 28, 27-30.
             10.0                                                                                                                                 2A consensus yeast metabolic network reconstruction obtained from a community

                                                                                                                                                  approach to systems biology. Herrgård MJ, et al. Nat Biotechnol. 2008, 26,
                                                                                                                                                  1155-60.
                                                                                                                                                  3Recent developments of the chemistry development kit (CDK) - an open-source

              0.0                                                                                                                                 java library for chemo- and bioinformatics. Steinbeck C, et al. Curr Pharm Des.
                    0.0-0.1   0.1-0.2       0.2-0.3   0.3-0.4   0.4-0.5       0.5-0.6
                                                                  Similarity score
                                                                                        0.6-0.7   0.7-0.8   0.8-0.9   0.9-1.0
                                                                                                                                                  2006, 12, 2111-20.
                                                                                                                                                  4Crystal structure and confirmation of the alanine:glyoxylate aminotransferase

Figure 3: Similarity score distribution of actual and potential metabolite pairs.                                                                 activity of the YFL030w yeast protein. Meyer P, et al. Biochimie. 2005, 87, 1041-7.

More Related Content

What's hot

Gerstner et al Comm Int Biol 2011b
Gerstner et al Comm Int Biol 2011bGerstner et al Comm Int Biol 2011b
Gerstner et al Comm Int Biol 2011bjrgerstn
 
QPS Regulated Bioanalysis of Antibody Drug Conjugates
QPS Regulated Bioanalysis of Antibody Drug ConjugatesQPS Regulated Bioanalysis of Antibody Drug Conjugates
QPS Regulated Bioanalysis of Antibody Drug ConjugatesQPS Holdings, LLC
 
The Assembly, Structure and Activation of Influenza a M2 Transmembrane Domain...
The Assembly, Structure and Activation of Influenza a M2 Transmembrane Domain...The Assembly, Structure and Activation of Influenza a M2 Transmembrane Domain...
The Assembly, Structure and Activation of Influenza a M2 Transmembrane Domain...Haley D. Norman
 
23355 ftp
23355 ftp23355 ftp
23355 ftpkjalili
 
Biocuration2012 Eugeni Belda
Biocuration2012 Eugeni BeldaBiocuration2012 Eugeni Belda
Biocuration2012 Eugeni Beldaeugenibc
 

What's hot (9)

iGEM UCSD 2015 Poster
iGEM UCSD 2015 PosteriGEM UCSD 2015 Poster
iGEM UCSD 2015 Poster
 
Phasins
PhasinsPhasins
Phasins
 
Gerstner et al Comm Int Biol 2011b
Gerstner et al Comm Int Biol 2011bGerstner et al Comm Int Biol 2011b
Gerstner et al Comm Int Biol 2011b
 
QPS Regulated Bioanalysis of Antibody Drug Conjugates
QPS Regulated Bioanalysis of Antibody Drug ConjugatesQPS Regulated Bioanalysis of Antibody Drug Conjugates
QPS Regulated Bioanalysis of Antibody Drug Conjugates
 
The Assembly, Structure and Activation of Influenza a M2 Transmembrane Domain...
The Assembly, Structure and Activation of Influenza a M2 Transmembrane Domain...The Assembly, Structure and Activation of Influenza a M2 Transmembrane Domain...
The Assembly, Structure and Activation of Influenza a M2 Transmembrane Domain...
 
23355 ftp
23355 ftp23355 ftp
23355 ftp
 
Biocuration2012 Eugeni Belda
Biocuration2012 Eugeni BeldaBiocuration2012 Eugeni Belda
Biocuration2012 Eugeni Belda
 
Final poster (002)
Final poster (002)Final poster (002)
Final poster (002)
 
NSA2
NSA2NSA2
NSA2
 

Viewers also liked

Continued development of ChEBI towards better usability for the systems biolo...
Continued development of ChEBI towards better usability for the systems biolo...Continued development of ChEBI towards better usability for the systems biolo...
Continued development of ChEBI towards better usability for the systems biolo...Neil Swainston
 
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...Neil Swainston
 
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...Neil Swainston
 
Informatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyInformatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyNeil Swainston
 
Data standards for systems biology
Data standards for systems biologyData standards for systems biology
Data standards for systems biologyNeil Swainston
 
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...Neil Swainston
 
Data standards for systems biology
Data standards for systems biologyData standards for systems biology
Data standards for systems biologyNeil Swainston
 

Viewers also liked (7)

Continued development of ChEBI towards better usability for the systems biolo...
Continued development of ChEBI towards better usability for the systems biolo...Continued development of ChEBI towards better usability for the systems biolo...
Continued development of ChEBI towards better usability for the systems biolo...
 
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
 
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
 
Informatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems BiologyInformatics In The Manchester Centre For Integrative Systems Biology
Informatics In The Manchester Centre For Integrative Systems Biology
 
Data standards for systems biology
Data standards for systems biologyData standards for systems biology
Data standards for systems biology
 
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
The Subliminal Toolbox: automating steps in the reconstruction of metabolic n...
 
Data standards for systems biology
Data standards for systems biologyData standards for systems biology
Data standards for systems biology
 

Similar to Network cheminformatics: gap filling and identifying new reactions in metabolic networks based on metabolite similarity

00002 Jc Silva 2006 Mcp V5n1p144
00002 Jc Silva 2006 Mcp V5n1p14400002 Jc Silva 2006 Mcp V5n1p144
00002 Jc Silva 2006 Mcp V5n1p144jcruzsilva
 
Lafont proteins 2007
Lafont proteins 2007Lafont proteins 2007
Lafont proteins 2007scbegay
 
00003 Jc Silva 2006 Mcp V5n4p589
00003 Jc Silva 2006 Mcp V5n4p58900003 Jc Silva 2006 Mcp V5n4p589
00003 Jc Silva 2006 Mcp V5n4p589jcruzsilva
 
Friend NIEHS 2013-03-01
Friend NIEHS 2013-03-01Friend NIEHS 2013-03-01
Friend NIEHS 2013-03-01Sage Base
 
Applications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomicApplications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomicSusan Rey
 
Applications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomicApplications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomicSusan Rey
 
Project report: Investigating the effect of cellular objectives on genome-sca...
Project report: Investigating the effect of cellular objectives on genome-sca...Project report: Investigating the effect of cellular objectives on genome-sca...
Project report: Investigating the effect of cellular objectives on genome-sca...Jarle Pahr
 
Transcription Mechanism.pdf
Transcription Mechanism.pdfTranscription Mechanism.pdf
Transcription Mechanism.pdfBabita Neupane
 
Lecture 4 metabolic pathway eng
Lecture 4 metabolic pathway engLecture 4 metabolic pathway eng
Lecture 4 metabolic pathway engDr. Tan Boon Siong
 
Areejit Samal Emergence Alaska 2013
Areejit Samal Emergence Alaska 2013Areejit Samal Emergence Alaska 2013
Areejit Samal Emergence Alaska 2013Areejit Samal
 
Conrado et al. 2011 NAR DNA-guided assembly of biosynthetic pathways promotes...
Conrado et al. 2011 NAR DNA-guided assembly of biosynthetic pathways promotes...Conrado et al. 2011 NAR DNA-guided assembly of biosynthetic pathways promotes...
Conrado et al. 2011 NAR DNA-guided assembly of biosynthetic pathways promotes...SynEnthu
 
Yeast two hybrid system
Yeast two hybrid systemYeast two hybrid system
Yeast two hybrid systemiqraakbar8
 
Enzyme kinetics - Creative Enzymes
Enzyme kinetics - Creative EnzymesEnzyme kinetics - Creative Enzymes
Enzyme kinetics - Creative EnzymesCreative Enzymes
 
White Paper Aptamer Applications
White Paper Aptamer ApplicationsWhite Paper Aptamer Applications
White Paper Aptamer Applicationsiceninebio
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and toolsKAUSHAL SAHU
 
2 complemento receptores - segunda apresentação
2  complemento receptores - segunda apresentação2  complemento receptores - segunda apresentação
2 complemento receptores - segunda apresentaçãoufamimunologia
 
Year 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro onlyYear 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro onlyRachelCaico
 

Similar to Network cheminformatics: gap filling and identifying new reactions in metabolic networks based on metabolite similarity (20)

00002 Jc Silva 2006 Mcp V5n1p144
00002 Jc Silva 2006 Mcp V5n1p14400002 Jc Silva 2006 Mcp V5n1p144
00002 Jc Silva 2006 Mcp V5n1p144
 
Lafont proteins 2007
Lafont proteins 2007Lafont proteins 2007
Lafont proteins 2007
 
00003 Jc Silva 2006 Mcp V5n4p589
00003 Jc Silva 2006 Mcp V5n4p58900003 Jc Silva 2006 Mcp V5n4p589
00003 Jc Silva 2006 Mcp V5n4p589
 
Friend NIEHS 2013-03-01
Friend NIEHS 2013-03-01Friend NIEHS 2013-03-01
Friend NIEHS 2013-03-01
 
Applications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomicApplications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomic
 
Applications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomicApplications of protein array in diagnostics and genomic and proteomic
Applications of protein array in diagnostics and genomic and proteomic
 
Project report: Investigating the effect of cellular objectives on genome-sca...
Project report: Investigating the effect of cellular objectives on genome-sca...Project report: Investigating the effect of cellular objectives on genome-sca...
Project report: Investigating the effect of cellular objectives on genome-sca...
 
Transcription Mechanism.pdf
Transcription Mechanism.pdfTranscription Mechanism.pdf
Transcription Mechanism.pdf
 
Lecture 4 metabolic pathway eng
Lecture 4 metabolic pathway engLecture 4 metabolic pathway eng
Lecture 4 metabolic pathway eng
 
Areejit Samal Emergence Alaska 2013
Areejit Samal Emergence Alaska 2013Areejit Samal Emergence Alaska 2013
Areejit Samal Emergence Alaska 2013
 
Conrado et al. 2011 NAR DNA-guided assembly of biosynthetic pathways promotes...
Conrado et al. 2011 NAR DNA-guided assembly of biosynthetic pathways promotes...Conrado et al. 2011 NAR DNA-guided assembly of biosynthetic pathways promotes...
Conrado et al. 2011 NAR DNA-guided assembly of biosynthetic pathways promotes...
 
Yeast two hybrid system
Yeast two hybrid systemYeast two hybrid system
Yeast two hybrid system
 
Signals of Evolution: Conservation, Specificity Determining Positions and Coe...
Signals of Evolution: Conservation, Specificity Determining Positions and Coe...Signals of Evolution: Conservation, Specificity Determining Positions and Coe...
Signals of Evolution: Conservation, Specificity Determining Positions and Coe...
 
Enzyme kinetics
Enzyme kineticsEnzyme kinetics
Enzyme kinetics
 
Enzyme kinetics - Creative Enzymes
Enzyme kinetics - Creative EnzymesEnzyme kinetics - Creative Enzymes
Enzyme kinetics - Creative Enzymes
 
Enzyme kinetics
Enzyme kinetics Enzyme kinetics
Enzyme kinetics
 
White Paper Aptamer Applications
White Paper Aptamer ApplicationsWhite Paper Aptamer Applications
White Paper Aptamer Applications
 
Functional proteomics, and tools
Functional proteomics, and toolsFunctional proteomics, and tools
Functional proteomics, and tools
 
2 complemento receptores - segunda apresentação
2  complemento receptores - segunda apresentação2  complemento receptores - segunda apresentação
2 complemento receptores - segunda apresentação
 
Year 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro onlyYear 12 biology early comm presentation intro only
Year 12 biology early comm presentation intro only
 

More from Neil Swainston

Data Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software DevelopmentData Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software DevelopmentNeil Swainston
 
Subliminal: exploiting semantic annotations in the reconstruction of metaboli...
Subliminal: exploiting semantic annotations in the reconstruction of metaboli...Subliminal: exploiting semantic annotations in the reconstruction of metaboli...
Subliminal: exploiting semantic annotations in the reconstruction of metaboli...Neil Swainston
 
Integrative information management for systems biology
Integrative information management for systems biologyIntegrative information management for systems biology
Integrative information management for systems biologyNeil Swainston
 
ChEBI and genome scale metabolic reconstructions
ChEBI and genome scale metabolic reconstructionsChEBI and genome scale metabolic reconstructions
ChEBI and genome scale metabolic reconstructionsNeil Swainston
 
iQconCAT: quantitative proteomics from instrument to browser
iQconCAT: quantitative proteomics from instrument to browseriQconCAT: quantitative proteomics from instrument to browser
iQconCAT: quantitative proteomics from instrument to browserNeil Swainston
 
Quantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserQuantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserNeil Swainston
 
QconCat: From Instrument To Browser
QconCat: From Instrument To BrowserQconCat: From Instrument To Browser
QconCat: From Instrument To BrowserNeil Swainston
 

More from Neil Swainston (9)

Data Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software DevelopmentData Integration, Mass Spectrometry Proteomics Software Development
Data Integration, Mass Spectrometry Proteomics Software Development
 
Subliminal: exploiting semantic annotations in the reconstruction of metaboli...
Subliminal: exploiting semantic annotations in the reconstruction of metaboli...Subliminal: exploiting semantic annotations in the reconstruction of metaboli...
Subliminal: exploiting semantic annotations in the reconstruction of metaboli...
 
Integrative information management for systems biology
Integrative information management for systems biologyIntegrative information management for systems biology
Integrative information management for systems biology
 
ChEBI and genome scale metabolic reconstructions
ChEBI and genome scale metabolic reconstructionsChEBI and genome scale metabolic reconstructions
ChEBI and genome scale metabolic reconstructions
 
SBML Browse
SBML BrowseSBML Browse
SBML Browse
 
iQconCAT: quantitative proteomics from instrument to browser
iQconCAT: quantitative proteomics from instrument to browseriQconCAT: quantitative proteomics from instrument to browser
iQconCAT: quantitative proteomics from instrument to browser
 
Quantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserQuantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To Browser
 
QconCat: From Instrument To Browser
QconCat: From Instrument To BrowserQconCat: From Instrument To Browser
QconCat: From Instrument To Browser
 
libAnnotationSBML
libAnnotationSBMLlibAnnotationSBML
libAnnotationSBML
 

Recently uploaded

Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 

Network cheminformatics: gap filling and identifying new reactions in metabolic networks based on metabolite similarity

  • 1. This work has been supported by the BBSRC/EPSRC grant: the Manchester Centre for Integrative Systems Biology Gap filling and identifying new reactions in metabolic networks based on metabolite similarity Matthew G.S. Norris, Neil Swainston, Paul D. Dobson, Daniel Jameson, Evangelos Simeonidis, Kieran Smallbone, Naglis Malys Manchester Centre for Integrative Systems Biology, University of Manchester, Manchester M1 7ND, UK Introduction The number of published metabolic network reconstructions are Two chemical similarity distributions were generated, resulting from increasing, as are their applications. However, such reconstructions pairs of metabolites that do and do not form a reaction pair in the commonly include gaps (see Figure 1), which are due to incomplete network (plotted as actual and potential pairs in Figure 3). Mass source databases or holes in biochemical knowledge reported in differences are calculated, such that potential pairs were only literature. The filling of such gaps has been aided through automated considered if they exhibit a mass difference of an actual pair, resulting techniques which attempt to mitigate these gaps by adding reactions from a known chemical transformation. from external resources such as KEGG1. Results The approach introduced here is to apply cheminformatics to determine and quantify chemical similarity across all metabolites in It can be seen that the majority of actual metabolite pairs have a a metabolic network of S. cerevisiae2. The hypothesis is that those chemical similarity score greater than 0.7. However, only 8.5% (557) metabolite pairs of high chemical similarity are likely to form of potential pairs exhibit such similarity. Of these 557, 99 were reaction pairs, in which one metabolite can be converted to the found to form a reaction pair in KEGG, but were not present in the other by a single chemical reaction. The similar scoring pairs that metabolic network. From these 99 pairs, a number were selected for do not currently form a reaction pair in the network can be analysed, further evaluation, and three examples of this are provided in Table by either comparison with existing data resources or by literature 1. The evaluation entailed: searches, to determine whether they take part in a metabolic reaction. •  extraction from KEGG of homologous protein sequences that Following this approach, preliminary results have led to the discovery catalyse these reactions; of missing information from KEGG, and the assignment of function and determination of kinetic constants to a gene of previously •  BLAST searching these sequences against a S. cerevisiae protein unknown function. database to identify candidate enzymes exhibiting this activity; Figure 1: Gaps in metabolic •  literature search and / or experimental validation of the networks. activity of these candidates. Unreachable metabolites are disconnected from the KEGG Reaction Similarity Gene id KM / µM Kcat / s-1 reaction score extracellular medium. “Blocked” R00585 L-serine + pyruvate <=> hydroxypyruvate 0.87 YFL030W Gene activity confirmed in reactions are incapable of + L-alanine by literature search4 carrying flux as they lead to R00720 ITP + H2O <=> IMP + diphosphate 0.78 YJR069C 2.33 0.14 dead-end metabolites (such as R01215 L-valine + pyruvate <=> 3-methyl-2- 0.76 YER152C No experimental validation the metabolites f and j). Gap oxobutanoic acid + L-alanine filling is required to reconcile both issues. Table 1: Reactions found for three highly similar scoring metabolite pairs that were not present in the metabolic reconstruction. Metabolites that form pairs are highlighted in bold. Kinetic constants were determined through protein Method expression, purification and absorbance assay (see Figure 4). Metabolites were extracted from a genome-scale metabolic network, Further work and SMILES strings representing their chemical structure were acquired. The structures were compared in a pairwise manner using Future directions may include: the Chemical Development Kit (CDK)3, to determine a chemical similarity score between each pair (see Figure 2). •  focussing on those metabolites that are known to be “dead-ends” or are disconnected from the core network, thus more-closely integrating the method with network gap filling; •  automating the bioinformatics aspect of the pipeline (BLAST searching, etc.) to automate the identification of putative enzymes; •  apply text-mining to find potential reactions from literature where reactions are not present in existing data resources such as KEGG; •  application of the approach to metabolite identification in metabolomics experiments. Figure 2: Example of chemical similarity score generated from SMILES strings using the CDK for the metabolite pair IMP and ITP. Similarity score distribution of actual and potential metabolite pairs Figure 4: Confirmation of ITP 40.0 pyrophosphohydrolase activity for YJR069C. A Malachite Green assay was performed to detect orthophosphate, indicating hydrolysis of ITP and release 30.0 of pyrophosphate by YJR069C, which is further hydrolysed to orthophosphate by inorganic phosphatase (IP). ! Percentage 20.0 Actual pairs References Potential pairs 1KEGG: kyoto encyclopedia of genes and genomes. Kanehisa M, et al. Nucleic Acids Res. 2000, 28, 27-30. 10.0 2A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology. Herrgård MJ, et al. Nat Biotechnol. 2008, 26, 1155-60. 3Recent developments of the chemistry development kit (CDK) - an open-source 0.0 java library for chemo- and bioinformatics. Steinbeck C, et al. Curr Pharm Des. 0.0-0.1 0.1-0.2 0.2-0.3 0.3-0.4 0.4-0.5 0.5-0.6 Similarity score 0.6-0.7 0.7-0.8 0.8-0.9 0.9-1.0 2006, 12, 2111-20. 4Crystal structure and confirmation of the alanine:glyoxylate aminotransferase Figure 3: Similarity score distribution of actual and potential metabolite pairs. activity of the YFL030w yeast protein. Meyer P, et al. Biochimie. 2005, 87, 1041-7.