SlideShare a Scribd company logo
1 of 29
Download to read offline
PubChem
                                         Bioassays as a
                                            Source of
                                        Polypharmacology

                                        Bin Chen, David
                                         Wild, Rajarshi
                                             Guha
PubChem Bioassays as a Source of
                                        Introduction
      Polypharmacology                  Methodology

                                        Visualization

                                        Application
  Bin Chen, David Wild, Rajarshi Guha

           School of Informatics
            Indiana University


      236th ACS National Meeting
           17th August, 2008
PubChem
PubChem Bioassays                                         Bioassays as a
                                                             Source of
                                                         Polypharmacology

                                                         Bin Chen, David
                                                          Wild, Rajarshi
                                                              Guha

                                                         Introduction
   Currently contains 1157 assays                        Methodology

       A number are follow ups of primary screens        Visualization

                                                         Application
   Assay size ranges from 2 to 224,000 molecules
   Many compounds tested in multiple assays
   PubChem web interface support queries that focus on
   individual assays
   Cross-assay queries can be tough
PubChem
Assay Content                                                  Bioassays as a
                                                                  Source of
                                                              Polypharmacology

                                                              Bin Chen, David
                                                               Wild, Rajarshi
                                                                   Guha

                                     AID 1   AID 2   AID 50
  The data is obviously primary                               Introduction

                                                              Methodology
  But the assay description and
                                                              Visualization
  target are also useful pieces of
                                                              Application
  information
  Can we combine
       data
                                     AID 1   AID 2   AID 50
       target
       description
  across multiple assays to draw
  conclusions, gain insight?
PubChem
A Network Model of Bioassays - Goals                                              Bioassays as a
                                                                                     Source of
                                                                                 Polypharmacology

                                                                                 Bin Chen, David
                                                                                  Wild, Rajarshi
                                 PubChem                                              Guha
                                 Bioassays
                                                                                 Introduction
                                                      PubMed      PubChem
                                                                                 Methodology

                            Assay Network              KEGG          GO          Visualization
                             Construction                                        Application
                                                    Link into other data bases
   Storage & Deployment

             RDBMS
                                 Network                  Interactive
    Web      Web                 Models                  Visualization
                     SQL
   Service   Page




                              Mapping to
                           External Networks




                           PPI       Drug -Target
PubChem
Mapping Assay Networks to Real Networks                        Bioassays as a
                                                                  Source of
                                                              Polypharmacology

                                                              Bin Chen, David
    An assay network is an artiļ¬cial network - does not        Wild, Rajarshi
                                                                   Guha
    necessarily have physical meaning
    We need to map the assay network onto a real biological   Introduction

    network                                                   Methodology

                                                              Visualization
         PPI networks
                                                              Application
         metabolic networks
         drug target networks
    Using the mapping, weā€™d like to identify MLSCN
    compounds that might be active against one or more
    nodes in the real network

The stepping stones . . .
    How do we construct the assay network?
    How do we map the network?
PubChem
Mapping Assay Networks to Real Networks                        Bioassays as a
                                                                  Source of
                                                              Polypharmacology

                                                              Bin Chen, David
    An assay network is an artiļ¬cial network - does not        Wild, Rajarshi
                                                                   Guha
    necessarily have physical meaning
    We need to map the assay network onto a real biological   Introduction

    network                                                   Methodology

                                                              Visualization
         PPI networks
                                                              Application
         metabolic networks
         drug target networks
    Using the mapping, weā€™d like to identify MLSCN
    compounds that might be active against one or more
    nodes in the real network

The stepping stones . . .
    How do we construct the assay network?
    How do we map the network?
PubChem
Why Perform a Mapping?                                             Bioassays as a
                                                                      Source of
                                                                  Polypharmacology

                                                                  Bin Chen, David
                                                                   Wild, Rajarshi
                                                                       Guha


       Identify compounds that interacts with two targets in      Introduction

       diļ¬€erent pathways                                          Methodology

                                                                  Visualization
       Alternatively, identify compounds that interact with a
                                                                  Application
       target in a pathway but not in another pathway
       Identify compounds capable of disrupting protein-protein
       interactions
       Our ability to do these will depend on the quality of
       assay data and the way we map the assay network to the
       real network




Hopkins, A.L. et al, Curr. Opin. Chem. Biol, 2006, 16, 127ā€“136
PubChem
Why Perform a Mapping?                                             Bioassays as a
                                                                      Source of
                                                                  Polypharmacology

                                                                  Bin Chen, David
                                                                   Wild, Rajarshi
                                                                       Guha


       Identify compounds that interacts with two targets in      Introduction

       diļ¬€erent pathways                                          Methodology

                                                                  Visualization
       Alternatively, identify compounds that interact with a
                                                                  Application
       target in a pathway but not in another pathway
       Identify compounds capable of disrupting protein-protein
       interactions
       Our ability to do these will depend on the quality of
       assay data and the way we map the assay network to the
       real network




Hopkins, A.L. et al, Curr. Opin. Chem. Biol, 2006, 16, 127ā€“136
PubChem
Assay Network Construction                                                  Bioassays as a
                                                                               Source of
                                                                           Polypharmacology

                                                                           Bin Chen, David
                                                                            Wild, Rajarshi
                               Download              Extract protein            Guha
           PubChem           bioassay XML              target ID's
                                                                           Introduction

                                                                           Methodology
                             Extract activity           Evaluate
                                                                           Visualization
                                 scores                 pairwise
                                                       CLUSTAL             Application
                                                       similarities

                                Exclude
                              compounds
                               with score
                                  < 80




       Connect assays if   Connect assays if       Connect assays if
          they have          their semantic            their target
        compounds in       similarity is greater   similarity is greater
           common                 than X                  than X
PubChem
Assay Network Construction                                        Bioassays as a
                                                                     Source of
                                                                 Polypharmacology

                                                                 Bin Chen, David
                                                                  Wild, Rajarshi
                                                                      Guha


    We will focus on a compound-centric network                  Introduction

                                                                 Methodology
    A semantic network requires some form of annotation on
                                                                 Visualization
    the assays
                                                                 Application
    Initial attempts at annotation assays based on GO terms
    (via descriptions)
    Alternatively, could consider deriving annotations based
    on the targets
    Using protein target similarity restricts one to enzymatic
    assays which leads to a relatively small assay network
PubChem
Assay Network Construction ā€“ Caveats                             Bioassays as a
                                                                    Source of
                                                                Polypharmacology

                                                                Bin Chen, David
                                                                 Wild, Rajarshi
                                                                     Guha

                                                                Introduction

    A compound-centric network is not very rigorous             Methodology

                                                                Visualization
    The PubChem activity score is known to be noisy
                                                                Application
        Currently the only way to look at assay readouts over
        the whole collection
    Using an activity score cutoļ¬€ of 80 is arbitrary
    We havenā€™t considered promiscuity directly, though a
    ļ¬lter would be useful
PubChem
Assay Network - Common Compounds    Bioassays as a
                                       Source of
                                   Polypharmacology

                                   Bin Chen, David
                                    Wild, Rajarshi
                                        Guha

                                   Introduction

                                   Methodology

                                   Visualization

                                   Application
PubChem
Some Network Statistics                                                                  Bioassays as a
                                                                                            Source of
                                                                                        Polypharmacology

                                                                                        Bin Chen, David
                                                                                         Wild, Rajarshi
                                                                                             Guha

                                                                                        Introduction




                                                60
                                                                                        Methodology
  222 assays with a single target




                                                50
                                                                                        Visualization
  Selected the smallest assay if




                                                40
                                                                                        Application




                                    Frequency
  more than assay had the same




                                                30
  target



                                                20
  N = 125, E = 598

                                                10
  Vmax = 40, Vavg = 9.6
                                                0
  ĀÆ
  C = 0.67                                           0   10     20            30   40

                                                              Vertex Degree


                                                Histogram of vertex degree
PubChem
Clustering in the Assay Network    Bioassays as a
                                      Source of
                                  Polypharmacology

                                  Bin Chen, David
                                   Wild, Rajarshi
                                       Guha

                                  Introduction

                                  Methodology

                                  Visualization

                                  Application
PubChem
Assay Network - Common Compounds    Bioassays as a
                                       Source of
                                   Polypharmacology

                                   Bin Chen, David
                                    Wild, Rajarshi
                                        Guha

                                   Introduction

                                   Methodology

                                   Visualization

                                   Application
PubChem
Assay Network - Common Compounds                      Bioassays as a
                                                         Source of
                                                     Polypharmacology

                     388 targets NAD+ -dependent     Bin Chen, David
                                                      Wild, Rajarshi
                     15-hydroxyprostaglandin              Guha

                     dehydrogenase                   Introduction

                     Has active compounds common     Methodology

                     with                            Visualization

                         pim-2-oncogene (505)        Application

                         15-lipoxygenase (887)
                         aldo-keto reductase (381)




   Luteonin                           Genistein
PubChem
Assay Network - Common Compounds    Bioassays as a
                                       Source of
                                   Polypharmacology

                                   Bin Chen, David
                                    Wild, Rajarshi
                                        Guha

                                   Introduction

                                   Methodology

                                   Visualization

                                   Application
PubChem
Assay Network - Common Compounds                                                        Bioassays as a
                                                                                           Source of
                                                                                       Polypharmacology

                                                                                       Bin Chen, David
                                                                                        Wild, Rajarshi
                                                                                            Guha

                                                   749 and 755 target 5-HT1E and       Introduction
                                                   5-HT1A respectively                 Methodology

                                                   Both have a (diļ¬€erent)              Visualization

                                                   compound in common with             Application

                                                   1288 (selectin E)
                                                   Probably promiscuous given
                                                   that they are also active in
                                                   many other assays
                                                   But a selectin inhibitor is known
                                                   to reduce hyperalgesia by
                                                   blocking 5-HT3



Oliviera, M.C.G. et al, Neuroscience, 2007, 145, 708ā€“714
PubChem
Assay Network - Common Compounds                                 Bioassays as a
                                                                    Source of
                                                                Polypharmacology

                                                                Bin Chen, David
                                                                 Wild, Rajarshi
                                                                     Guha


                               Most of these assay pairs have   Introduction

                                                                Methodology
                               closely related targets
                                                                Visualization
                               Tissue non-speciļ¬c alkaline      Application
                               phosphatase and intestinal
                               alkaline phosphatase (1056 &
                               1017)
                               STAT1 and STAT3 (1303 &
                               1310)
                               ER-Ī± and ER-Ī² (1226 & 1228)

lethal factor (B. anthracis) and nF-ĪŗB (942 & 1309) have
one compound in common - podophyllotoxin
PubChem
Mapping an Assay Network      Bioassays as a
                                 Source of
                             Polypharmacology

                             Bin Chen, David
                              Wild, Rajarshi
                                  Guha

                             Introduction

                             Methodology

                             Visualization

                             Application

                  Mapping
                  Function
PubChem
Deļ¬ning a Mapping Function                                         Bioassays as a
                                                                      Source of
                                                                  Polypharmacology

                                                                  Bin Chen, David
                                                                   Wild, Rajarshi
                                                                       Guha

                                                                  Introduction
    Multiple mapping functions can be deļ¬ned
                                                                  Methodology
        exact matches between assay target and external targets
                                                                  Visualization
        similarity between target sequences
                                                                  Application
        similarity between target binding sites
    One could also map edges of one network onto another
        Dependent on the nature of the external network
    Depending on the nature of the deļ¬nition, the mapping
    procedure can be a trivial search or may require an
    optimization scheme if multiple mappings are possible
PubChem
Assay Network to HPRD                                        Bioassays as a
                                                                Source of
                                                            Polypharmacology

                                                            Bin Chen, David
                                                             Wild, Rajarshi
                                                                 Guha


   The HPRD database collects protein-protein interaction   Introduction

   data and pathway membership                              Methodology

                                                            Visualization
   The July 2007 release lists 31,708 PPIā€™s
                                                            Application
   96 assays can be mapped to the unique proteins in
   HPRD
   We construct a HPRD network by identifying the pairs
   from the 96 proteins that have a listed interaction
   When mapping the HPRD network to the assay network,
   we include singleton HPRD nodes
PubChem
HPRD Network    Bioassays as a
                   Source of
               Polypharmacology

               Bin Chen, David
                Wild, Rajarshi
                    Guha

               Introduction

               Methodology

               Visualization

               Application
PubChem
Assay - HPRD Network Mapping    Bioassays as a
                                   Source of
                               Polypharmacology

                               Bin Chen, David
                                Wild, Rajarshi
                                    Guha

                               Introduction

                               Methodology

                               Visualization

                               Application
PubChem
Assay - HPRD Network Mapping                                     Bioassays as a
                                                                    Source of
                                                                Polypharmacology

                                                                Bin Chen, David
                                                                 Wild, Rajarshi
                                                                     Guha

                                                                Introduction

                                                                Methodology
Is this a useful mapping?                                       Visualization

                                                                Application
    Since we map assays to HPRD entries by target ID, we
    arenā€™t getting new information on the assays individually
    But we are able to easily identify assay targets that
    interact with each other (or not)
PubChem
Comparing Two Assays                                                                              Bioassays as a
                                                                                                     Source of
                                                                                                 Polypharmacology

                                                                                                 Bin Chen, David
                                                                                                  Wild, Rajarshi
                                                                                                      Guha

                                                                                                 Introduction

                                                                                                 Methodology
                              Score = 95                Score = 100
                                                                                                 Visualization

                                                                                                 Application
                                           CID 126298

              AID 835                                                AID 1325
            Gene: EPHB4                                            Gene: ABCB1
         Signal Transduction                                        Transporter
       Axon guidance pathway                                      ABC transporters

 Over expressed in breast carcinoma                     Overexpression is related to multidrug
                                                        resistance in chemotherapy
 Preferentially expressed in veins
                                                        Involved in the BBB
 Required for angiogenesis
                                                        Not expressed very highly in vascular
                                                        tissue
PubChem
Comparing Two Assays                                                                       Bioassays as a
                                                                                              Source of
                                                                                          Polypharmacology

                                                                                          Bin Chen, David
                                                                                           Wild, Rajarshi
                                                                                               Guha

                                                                                          Introduction

                                                                                          Methodology

                               Score = 94                  Score = 93                     Visualization

                                                                                          Application
                                            CID 647501

                AID 903                                                AID 755
               Gene: TP53                                            Gene: HTR1A
      Nucleotide Metabolism                                    Signal Transduction
   Apoptosis, Cell Cycle, Cancer                            Neuroactive ligand-receptor

 Controls cell cycle and apoptosis                       Target for anti-depressants

 Inactivated in cancer cells
PubChem
Disrupting PPIā€™s                                                        Bioassays as a
                                                                           Source of
                                                                       Polypharmacology

                                                                       Bin Chen, David
                                        The pairs of interacting        Wild, Rajarshi
                                                                            Guha
                                        targets have compounds
                                        tested against both of them    Introduction

                                                                       Methodology
                                        Majority are inactive or
                                                                       Visualization
           Tyrosine
      3-monooxygenase
                           NF-kappa B   inconclusive in both of them
      activation protein                                               Application
                                        CID 1025314 is active in AID
                                        445 but inactive in AID 903

      Hypoxia                Bcl-2
     Inducible
                     P53
       Factor
PubChem
Summary                                                         Bioassays as a
                                                                   Source of
                                                               Polypharmacology

                                                               Bin Chen, David
                                                                Wild, Rajarshi
                                                                    Guha


   A network view of assays provides with a novel tool for     Introduction

                                                               Methodology
   visualization and summary of the assay collection
                                                               Visualization
   Itā€™s utility beyond visualization is dependent on the way   Application
   we construct the network
   A compound-centric network allows us to use the assay
   collection as a probe into external networks
   Future work will investigate diļ¬€erent forms of the assay
   network focusing on protein target and GO annotation
   similarity

More Related Content

More from Rajarshi Guha

Pharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark GenomePharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark GenomeRajarshi Guha
Ā 
Pharos: Putting targets in context
Pharos: Putting targets in contextPharos: Putting targets in context
Pharos: Putting targets in contextRajarshi Guha
Ā 
Pharos ā€“ A Torch to Use in Your Journey In the Dark Genome
Pharos ā€“ A Torch to Use in Your Journey In the Dark GenomePharos ā€“ A Torch to Use in Your Journey In the Dark Genome
Pharos ā€“ A Torch to Use in Your Journey In the Dark GenomeRajarshi Guha
Ā 
Pharos - Face of the KMC
Pharos - Face of the KMCPharos - Face of the KMC
Pharos - Face of the KMCRajarshi Guha
Ā 
Enhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
Enhancing Prioritization & Discovery of Novel Combinations using an HTS PlatformEnhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
Enhancing Prioritization & Discovery of Novel Combinations using an HTS PlatformRajarshi Guha
Ā 
What can your library do for you?
What can your library do for you?What can your library do for you?
What can your library do for you?Rajarshi Guha
Ā 
So I have an SD File ā€¦ What do I do next?
So I have an SD File ā€¦ What do I do next?So I have an SD File ā€¦ What do I do next?
So I have an SD File ā€¦ What do I do next?Rajarshi Guha
Ā 
Characterization of Chemical Libraries Using Scaffolds and Network Models
Characterization of Chemical Libraries Using Scaffolds and Network ModelsCharacterization of Chemical Libraries Using Scaffolds and Network Models
Characterization of Chemical Libraries Using Scaffolds and Network ModelsRajarshi Guha
Ā 
Robots, Small Molecules & R
Robots, Small Molecules & RRobots, Small Molecules & R
Robots, Small Molecules & RRajarshi Guha
Ā 
Fingerprinting Chemical Structures
Fingerprinting Chemical StructuresFingerprinting Chemical Structures
Fingerprinting Chemical StructuresRajarshi Guha
Ā 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...Rajarshi Guha
Ā 
When the whole is better than the parts
When the whole is better than the partsWhen the whole is better than the parts
When the whole is better than the partsRajarshi Guha
Ā 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Rajarshi Guha
Ā 
Pushing Chemical Biology Through the Pipes
Pushing Chemical Biology Through the PipesPushing Chemical Biology Through the Pipes
Pushing Chemical Biology Through the PipesRajarshi Guha
Ā 
Characterization and visualization of compound combination responses in a hig...
Characterization and visualization of compound combination responses in a hig...Characterization and visualization of compound combination responses in a hig...
Characterization and visualization of compound combination responses in a hig...Rajarshi Guha
Ā 
The BioAssay Research Database
The BioAssay Research DatabaseThe BioAssay Research Database
The BioAssay Research DatabaseRajarshi Guha
Ā 
Cloudy with a Touch of Cheminformatics
Cloudy with a Touch of CheminformaticsCloudy with a Touch of Cheminformatics
Cloudy with a Touch of CheminformaticsRajarshi Guha
Ā 
Chemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & ReproducibleChemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & ReproducibleRajarshi Guha
Ā 
Chemogenomics in the cloud: Is the sky the limit?
Chemogenomics in the cloud: Is the sky the limit?Chemogenomics in the cloud: Is the sky the limit?
Chemogenomics in the cloud: Is the sky the limit?Rajarshi Guha
Ā 
Quantifying Text Sentiment in R
Quantifying Text Sentiment in RQuantifying Text Sentiment in R
Quantifying Text Sentiment in RRajarshi Guha
Ā 

More from Rajarshi Guha (20)

Pharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark GenomePharos: A Torch to Use in Your Journey in the Dark Genome
Pharos: A Torch to Use in Your Journey in the Dark Genome
Ā 
Pharos: Putting targets in context
Pharos: Putting targets in contextPharos: Putting targets in context
Pharos: Putting targets in context
Ā 
Pharos ā€“ A Torch to Use in Your Journey In the Dark Genome
Pharos ā€“ A Torch to Use in Your Journey In the Dark GenomePharos ā€“ A Torch to Use in Your Journey In the Dark Genome
Pharos ā€“ A Torch to Use in Your Journey In the Dark Genome
Ā 
Pharos - Face of the KMC
Pharos - Face of the KMCPharos - Face of the KMC
Pharos - Face of the KMC
Ā 
Enhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
Enhancing Prioritization & Discovery of Novel Combinations using an HTS PlatformEnhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
Enhancing Prioritization & Discovery of Novel Combinations using an HTS Platform
Ā 
What can your library do for you?
What can your library do for you?What can your library do for you?
What can your library do for you?
Ā 
So I have an SD File ā€¦ What do I do next?
So I have an SD File ā€¦ What do I do next?So I have an SD File ā€¦ What do I do next?
So I have an SD File ā€¦ What do I do next?
Ā 
Characterization of Chemical Libraries Using Scaffolds and Network Models
Characterization of Chemical Libraries Using Scaffolds and Network ModelsCharacterization of Chemical Libraries Using Scaffolds and Network Models
Characterization of Chemical Libraries Using Scaffolds and Network Models
Ā 
Robots, Small Molecules & R
Robots, Small Molecules & RRobots, Small Molecules & R
Robots, Small Molecules & R
Ā 
Fingerprinting Chemical Structures
Fingerprinting Chemical StructuresFingerprinting Chemical Structures
Fingerprinting Chemical Structures
Ā 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D...
Ā 
When the whole is better than the parts
When the whole is better than the partsWhen the whole is better than the parts
When the whole is better than the parts
Ā 
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Exploring Compound Combinations in High Throughput Settings: Going Beyond 1D ...
Ā 
Pushing Chemical Biology Through the Pipes
Pushing Chemical Biology Through the PipesPushing Chemical Biology Through the Pipes
Pushing Chemical Biology Through the Pipes
Ā 
Characterization and visualization of compound combination responses in a hig...
Characterization and visualization of compound combination responses in a hig...Characterization and visualization of compound combination responses in a hig...
Characterization and visualization of compound combination responses in a hig...
Ā 
The BioAssay Research Database
The BioAssay Research DatabaseThe BioAssay Research Database
The BioAssay Research Database
Ā 
Cloudy with a Touch of Cheminformatics
Cloudy with a Touch of CheminformaticsCloudy with a Touch of Cheminformatics
Cloudy with a Touch of Cheminformatics
Ā 
Chemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & ReproducibleChemical Data Mining: Open Source & Reproducible
Chemical Data Mining: Open Source & Reproducible
Ā 
Chemogenomics in the cloud: Is the sky the limit?
Chemogenomics in the cloud: Is the sky the limit?Chemogenomics in the cloud: Is the sky the limit?
Chemogenomics in the cloud: Is the sky the limit?
Ā 
Quantifying Text Sentiment in R
Quantifying Text Sentiment in RQuantifying Text Sentiment in R
Quantifying Text Sentiment in R
Ā 

Recently uploaded

ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
Ā 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
Ā 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
Ā 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
Ā 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
Ā 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
Ā 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
Ā 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
Ā 
Visit to a blind student's schoolšŸ§‘ā€šŸ¦ÆšŸ§‘ā€šŸ¦Æ(community medicine)
Visit to a blind student's schoolšŸ§‘ā€šŸ¦ÆšŸ§‘ā€šŸ¦Æ(community medicine)Visit to a blind student's schoolšŸ§‘ā€šŸ¦ÆšŸ§‘ā€šŸ¦Æ(community medicine)
Visit to a blind student's schoolšŸ§‘ā€šŸ¦ÆšŸ§‘ā€šŸ¦Æ(community medicine)lakshayb543
Ā 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A BeƱa
Ā 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataBabyAnnMotar
Ā 
Millenials and Fillennials (Ethical Challenge and Responses).pptx
Millenials and Fillennials (Ethical Challenge and Responses).pptxMillenials and Fillennials (Ethical Challenge and Responses).pptx
Millenials and Fillennials (Ethical Challenge and Responses).pptxJanEmmanBrigoli
Ā 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
Ā 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSMae Pangan
Ā 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationRosabel UA
Ā 
EMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxEMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxElton John Embodo
Ā 

Recently uploaded (20)

ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
Ā 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
Ā 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
Ā 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
Ā 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Ā 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
Ā 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
Ā 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
Ā 
Visit to a blind student's schoolšŸ§‘ā€šŸ¦ÆšŸ§‘ā€šŸ¦Æ(community medicine)
Visit to a blind student's schoolšŸ§‘ā€šŸ¦ÆšŸ§‘ā€šŸ¦Æ(community medicine)Visit to a blind student's schoolšŸ§‘ā€šŸ¦ÆšŸ§‘ā€šŸ¦Æ(community medicine)
Visit to a blind student's schoolšŸ§‘ā€šŸ¦ÆšŸ§‘ā€šŸ¦Æ(community medicine)
Ā 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
Ā 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
Ā 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped data
Ā 
Millenials and Fillennials (Ethical Challenge and Responses).pptx
Millenials and Fillennials (Ethical Challenge and Responses).pptxMillenials and Fillennials (Ethical Challenge and Responses).pptx
Millenials and Fillennials (Ethical Challenge and Responses).pptx
Ā 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Ā 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
Ā 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
Ā 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHS
Ā 
Activity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translationActivity 2-unit 2-update 2024. English translation
Activity 2-unit 2-update 2024. English translation
Ā 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
Ā 
EMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docxEMBODO Lesson Plan Grade 9 Law of Sines.docx
EMBODO Lesson Plan Grade 9 Law of Sines.docx
Ā 

PubChem Bioassays as a Source of Polypharmacology

  • 1. PubChem Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha PubChem Bioassays as a Source of Introduction Polypharmacology Methodology Visualization Application Bin Chen, David Wild, Rajarshi Guha School of Informatics Indiana University 236th ACS National Meeting 17th August, 2008
  • 2. PubChem PubChem Bioassays Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Currently contains 1157 assays Methodology A number are follow ups of primary screens Visualization Application Assay size ranges from 2 to 224,000 molecules Many compounds tested in multiple assays PubChem web interface support queries that focus on individual assays Cross-assay queries can be tough
  • 3. PubChem Assay Content Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha AID 1 AID 2 AID 50 The data is obviously primary Introduction Methodology But the assay description and Visualization target are also useful pieces of Application information Can we combine data AID 1 AID 2 AID 50 target description across multiple assays to draw conclusions, gain insight?
  • 4. PubChem A Network Model of Bioassays - Goals Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi PubChem Guha Bioassays Introduction PubMed PubChem Methodology Assay Network KEGG GO Visualization Construction Application Link into other data bases Storage & Deployment RDBMS Network Interactive Web Web Models Visualization SQL Service Page Mapping to External Networks PPI Drug -Target
  • 5. PubChem Mapping Assay Networks to Real Networks Bioassays as a Source of Polypharmacology Bin Chen, David An assay network is an artiļ¬cial network - does not Wild, Rajarshi Guha necessarily have physical meaning We need to map the assay network onto a real biological Introduction network Methodology Visualization PPI networks Application metabolic networks drug target networks Using the mapping, weā€™d like to identify MLSCN compounds that might be active against one or more nodes in the real network The stepping stones . . . How do we construct the assay network? How do we map the network?
  • 6. PubChem Mapping Assay Networks to Real Networks Bioassays as a Source of Polypharmacology Bin Chen, David An assay network is an artiļ¬cial network - does not Wild, Rajarshi Guha necessarily have physical meaning We need to map the assay network onto a real biological Introduction network Methodology Visualization PPI networks Application metabolic networks drug target networks Using the mapping, weā€™d like to identify MLSCN compounds that might be active against one or more nodes in the real network The stepping stones . . . How do we construct the assay network? How do we map the network?
  • 7. PubChem Why Perform a Mapping? Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Identify compounds that interacts with two targets in Introduction diļ¬€erent pathways Methodology Visualization Alternatively, identify compounds that interact with a Application target in a pathway but not in another pathway Identify compounds capable of disrupting protein-protein interactions Our ability to do these will depend on the quality of assay data and the way we map the assay network to the real network Hopkins, A.L. et al, Curr. Opin. Chem. Biol, 2006, 16, 127ā€“136
  • 8. PubChem Why Perform a Mapping? Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Identify compounds that interacts with two targets in Introduction diļ¬€erent pathways Methodology Visualization Alternatively, identify compounds that interact with a Application target in a pathway but not in another pathway Identify compounds capable of disrupting protein-protein interactions Our ability to do these will depend on the quality of assay data and the way we map the assay network to the real network Hopkins, A.L. et al, Curr. Opin. Chem. Biol, 2006, 16, 127ā€“136
  • 9. PubChem Assay Network Construction Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Download Extract protein Guha PubChem bioassay XML target ID's Introduction Methodology Extract activity Evaluate Visualization scores pairwise CLUSTAL Application similarities Exclude compounds with score < 80 Connect assays if Connect assays if Connect assays if they have their semantic their target compounds in similarity is greater similarity is greater common than X than X
  • 10. PubChem Assay Network Construction Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha We will focus on a compound-centric network Introduction Methodology A semantic network requires some form of annotation on Visualization the assays Application Initial attempts at annotation assays based on GO terms (via descriptions) Alternatively, could consider deriving annotations based on the targets Using protein target similarity restricts one to enzymatic assays which leads to a relatively small assay network
  • 11. PubChem Assay Network Construction ā€“ Caveats Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction A compound-centric network is not very rigorous Methodology Visualization The PubChem activity score is known to be noisy Application Currently the only way to look at assay readouts over the whole collection Using an activity score cutoļ¬€ of 80 is arbitrary We havenā€™t considered promiscuity directly, though a ļ¬lter would be useful
  • 12. PubChem Assay Network - Common Compounds Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Methodology Visualization Application
  • 13. PubChem Some Network Statistics Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction 60 Methodology 222 assays with a single target 50 Visualization Selected the smallest assay if 40 Application Frequency more than assay had the same 30 target 20 N = 125, E = 598 10 Vmax = 40, Vavg = 9.6 0 ĀÆ C = 0.67 0 10 20 30 40 Vertex Degree Histogram of vertex degree
  • 14. PubChem Clustering in the Assay Network Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Methodology Visualization Application
  • 15. PubChem Assay Network - Common Compounds Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Methodology Visualization Application
  • 16. PubChem Assay Network - Common Compounds Bioassays as a Source of Polypharmacology 388 targets NAD+ -dependent Bin Chen, David Wild, Rajarshi 15-hydroxyprostaglandin Guha dehydrogenase Introduction Has active compounds common Methodology with Visualization pim-2-oncogene (505) Application 15-lipoxygenase (887) aldo-keto reductase (381) Luteonin Genistein
  • 17. PubChem Assay Network - Common Compounds Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Methodology Visualization Application
  • 18. PubChem Assay Network - Common Compounds Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha 749 and 755 target 5-HT1E and Introduction 5-HT1A respectively Methodology Both have a (diļ¬€erent) Visualization compound in common with Application 1288 (selectin E) Probably promiscuous given that they are also active in many other assays But a selectin inhibitor is known to reduce hyperalgesia by blocking 5-HT3 Oliviera, M.C.G. et al, Neuroscience, 2007, 145, 708ā€“714
  • 19. PubChem Assay Network - Common Compounds Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Most of these assay pairs have Introduction Methodology closely related targets Visualization Tissue non-speciļ¬c alkaline Application phosphatase and intestinal alkaline phosphatase (1056 & 1017) STAT1 and STAT3 (1303 & 1310) ER-Ī± and ER-Ī² (1226 & 1228) lethal factor (B. anthracis) and nF-ĪŗB (942 & 1309) have one compound in common - podophyllotoxin
  • 20. PubChem Mapping an Assay Network Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Methodology Visualization Application Mapping Function
  • 21. PubChem Deļ¬ning a Mapping Function Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Multiple mapping functions can be deļ¬ned Methodology exact matches between assay target and external targets Visualization similarity between target sequences Application similarity between target binding sites One could also map edges of one network onto another Dependent on the nature of the external network Depending on the nature of the deļ¬nition, the mapping procedure can be a trivial search or may require an optimization scheme if multiple mappings are possible
  • 22. PubChem Assay Network to HPRD Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha The HPRD database collects protein-protein interaction Introduction data and pathway membership Methodology Visualization The July 2007 release lists 31,708 PPIā€™s Application 96 assays can be mapped to the unique proteins in HPRD We construct a HPRD network by identifying the pairs from the 96 proteins that have a listed interaction When mapping the HPRD network to the assay network, we include singleton HPRD nodes
  • 23. PubChem HPRD Network Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Methodology Visualization Application
  • 24. PubChem Assay - HPRD Network Mapping Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Methodology Visualization Application
  • 25. PubChem Assay - HPRD Network Mapping Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Methodology Is this a useful mapping? Visualization Application Since we map assays to HPRD entries by target ID, we arenā€™t getting new information on the assays individually But we are able to easily identify assay targets that interact with each other (or not)
  • 26. PubChem Comparing Two Assays Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Methodology Score = 95 Score = 100 Visualization Application CID 126298 AID 835 AID 1325 Gene: EPHB4 Gene: ABCB1 Signal Transduction Transporter Axon guidance pathway ABC transporters Over expressed in breast carcinoma Overexpression is related to multidrug resistance in chemotherapy Preferentially expressed in veins Involved in the BBB Required for angiogenesis Not expressed very highly in vascular tissue
  • 27. PubChem Comparing Two Assays Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha Introduction Methodology Score = 94 Score = 93 Visualization Application CID 647501 AID 903 AID 755 Gene: TP53 Gene: HTR1A Nucleotide Metabolism Signal Transduction Apoptosis, Cell Cycle, Cancer Neuroactive ligand-receptor Controls cell cycle and apoptosis Target for anti-depressants Inactivated in cancer cells
  • 28. PubChem Disrupting PPIā€™s Bioassays as a Source of Polypharmacology Bin Chen, David The pairs of interacting Wild, Rajarshi Guha targets have compounds tested against both of them Introduction Methodology Majority are inactive or Visualization Tyrosine 3-monooxygenase NF-kappa B inconclusive in both of them activation protein Application CID 1025314 is active in AID 445 but inactive in AID 903 Hypoxia Bcl-2 Inducible P53 Factor
  • 29. PubChem Summary Bioassays as a Source of Polypharmacology Bin Chen, David Wild, Rajarshi Guha A network view of assays provides with a novel tool for Introduction Methodology visualization and summary of the assay collection Visualization Itā€™s utility beyond visualization is dependent on the way Application we construct the network A compound-centric network allows us to use the assay collection as a probe into external networks Future work will investigate diļ¬€erent forms of the assay network focusing on protein target and GO annotation similarity