This document summarizes Philip Bourne's research using bioinformatics and systems biology approaches for early stage drug discovery. His approach involves characterizing known protein-ligand binding sites, searching for similar off-target binding sites across the proteome, and analyzing networks of potential drug-target interactions to understand drug polypharmacology and repurpose existing drugs. Examples are given of findings that explained drug side effects, identified possible drug repurposing opportunities, and informed multi-target drug strategies. The goal is to develop a high-throughput approach to rationally explore large networks of protein-ligand interactions.
1. Bioinformatics Meets Systems Biology for Early Stage Drug Discovery Philip E. Bourne University of California San Diego [email_address] http://www.sdsc.edu/pb NBIC – April 20, 2011
2.
3.
4.
5.
6.
7.
8.
9.
10.
11. Need to Start with a 3D Drug-Receptor Complex - The PDB Contains Many Examples Computational Methodology Generic Name Other Name Treatment PDBid Lipitor Atorvastatin High cholesterol 1HWK, 1HW8… Testosterone Testosterone Osteoporosis 1AFS, 1I9J .. Taxol Paclitaxel Cancer 1JFF, 2HXF, 2HXH Viagra Sildenafil citrate ED, pulmonary arterial hypertension 1TBF, 1UDT, 1XOS.. Digoxin Lanoxin Congestive heart failure 1IGJ
14. A Reverse Engineering Approach to Drug Discovery Across Gene Families Characterize ligand binding site of primary target (Geometric Potential) Identify off-targets by ligand binding site similarity (Sequence order independent profile-profile alignment) Extract known drugs or inhibitors of the primary and/or off-targets Search for similar small molecules Dock molecules to both primary and off-targets Statistics analysis of docking score correlations … Computational Methodology Xie and Bourne 2009 Bioinformatics 25(12) 305-312
15.
16.
17.
18.
19.
20.
21. Adverse Effects of SERMs cardiac abnormalities thromboembolic disorders ocular toxicities loss of calcium homeostatis ????? Side Effects - The Tamoxifen Story PLoS Comp. Biol. , 2007 3(11) e217
30. binding site comparison protein ligand docking MD simulation & MM/GBSA Binding free energy calculation structural proteome off-target? network construction & mapping drug target Clinical Outcomes 1OHR Possible Nelfinavir Repositioning
31.
32.
33. Distribution of Top Hits on the Human Kinome p-value < 1.0e-3 p-value < 1.0e-4 Manning et al., Science , 2002, V298, 1912 Possible Nelfinavir Repositioning
34. Interactions between Inhibitors and Epidermal Growth Factor Receptor (EGFR) – 74% of binding site resides are comparable 1. Hydrogen bond with main chain amide of Met793 (without it 3700 fold loss of inhibition) 2. Hydrophobic interactions of aniline/phenyl with gatekeeper Thr790 and other residues H-bond: Met793 with quinazoline N1 H-bond: Met793 with benzamide hydroxy O38 EGFR-DJK Co-crys ligand EGFR-Nelfinavir DJK = N-[4-(3-BROMO-PHENYLAMINO)-QUINAZOLIN-6-YL]-ACRYLAMIDE
35. Off-target Interaction Network Identified off-target Intermediate protein Pathway Cellular effect Activation Inhibition Possible Nelfinavir Repositioning PLoS Comp. Biol. , 2011 To Appear
36. Other Experimental Evidence to Show Nelfinavir inhibition on EGFR, IGF1R, CDK2 and Abl is Supportive The inhibitions of Nelfinavir on IGF1R, EGFR, Akt activity were detected by immunoblotting. The inhibition of Nelfinavir on Akt activity is less than a known PI3K inhibitor Joell J. Gills et al. Clinic Cancer Research September 2007 13; 5183 Nelfinavir inhibits growth of human melanoma cells by induction of cell cycle arrest Nelfinavir induces G1 arrest through inhibition of CDK2 activity. Such inhibition is not caused by inhibition of Akt signaling. Jiang W el al. Cancer Res. 2007 67(3) BCR-ABL is a constitutively activated tyrosine kinase that causes chronic myeloid leukemia (CML) Druker, B.J., et al New England Journal of Medicine, 2001. 344 (14): p. 1031-1037 Nelfinavir can induce apoptosis in leukemia cells as a single agent Bruning, A., et al. , Molecular Cancer, 2010. 9 :19 Nelfinavir may inhibit BCR-ABL Possible Nelfinavir Repositioning
44. Map 2 onto 1 – The TB-Drugome http://funsite.sdsc.edu/drugome/TB/ Similarities between the binding sites of M.tb proteins (blue), and binding sites containing approved drugs (red).
51. Drug Failure - The Torcetrapib Story PLoS Comp Biol 2009 5(5) e1000387
52.
53. Computational Evaluation of Drug Off-Target Effects 336 genes 1587 reactions Plos Comp. Biol. 2010 6(9): e1000938 Proteome Drug binding site alignments SMAP Predicted drug targets Drug and endogenous substrate binding site analysis Competitively inhibitable targets Inhibition simulations in context-specific model COBRA Toolbox Predicted causal targets and genetic risk factors Metabolic network Scientific literature Tissue and biofluid localization data Gene expression data Physiological objectives System exchange constraints Flux states optimizing objective Physiological context-specific model Influx Efflux Drug response phenotypes Drug targets Physiological objectives Causal drug targets All targets
Absorption, distribution, metabolism and excretion
Updated for 2009
P distance to environmental boundary; Pi Di and alphai D distance to central atom alpha direction to central atom
This is great data!
Tuberculosis, which is caused by the bacterial pathogen Mycobacterium tuberculosis , is a leading cause of mortality among the infectious diseases. It has been estimated by the World Health Organization (WHO) that almost one-third of the world's population , around 2 billion people, is infected with the disease. Every year, more than 8 million people develop an active form of the disease, which claims the lives of nearly 2 million. This translates to over 4,900 deaths per day , and more than 95% of these are in developing countries. Despite the current global situation, antitubercular drugs have remained largely unchanged over the last four decades. The widespread use of these agents has provided a strong selective pressure for M.tuberculosis, thus encouraging the emergence of resistant strains. Multidrug resistant (MDR) tuberculosis is defined as resistance to the first-line drugs isoniazid and rifampin . The effective treatment of MDR tuberculosis necessitates long-term use of second-line drug combinations , an unfortunate consequence of which is the emergence of further drug resistance. Enter extensively drug resistant (XDR) tuberculosis - M.tuberculosis strains that are resistant to both isoniazid plus rifampin, as well as key second-line drugs . Since the only remaining drug classes exhibit such low potency and high toxicity , XDR tuberculosis is extremely difficult to treat. The rise of XDR tuberculosis around the world imposes a great threat on human health , therefore reinforcing the development of new antitubercular agents as an urgent priority. Very few Mtb proteins explored as drug targets
3,996 proteins in TB proteome 749 solved structures in the PDB, representing a total of 284 proteins (7.2% coverage) ModBase contains homology models for entire TB proteome 1,446 ‘high quality’ homology models were added to the data set Structural coverage increased to 43.8% Retained only those models with a model score of > 0.7 and a Modpipe quality score of > 1.1 (2818 models). There were multiple models per protein. For each TB protein, chose the model with the best model score, and if they were equal, chose the model with the best Modpipe quality score (1703 models). However, 251 (+6) models were removed since they correspond to TB proteins that already have solved structures. 1446 models remained) Score for the reliability of a Model, derived from statistical potentials (F. Melo, R. Sanchez, A. Sali,2001 PDF ). A model is predicted to be good when the model score is higher than a pre-specified cutoff (0.7). A reliable model has a probability of the correct fold that is larger than 95%. A fold is correct when at least 30% of its Calpha atoms superpose within 3.5A of their correct positions. The ModPipe Protein Quality Score is a composite score comprising sequence identity to the template, coverage , and the three individual scores evalue , z-Dope and GA341 . We consider a MPQS of >1.1 as reliable
(nutraceuticals excluded)
Multi-target therapy may be more effective than single-target therapy to treat infectious diseases Most of the proteins listed are potential novel drug targets for the development of efficient anti-tuberculosis chemotherapeutics. GSMN-TB : Genome Scale Metabolic Reaction Network of M.tb (http://sysbio/sbs.surrey.ac.uk/tb) 849 reactions, 739 metabolites, 726 genes Can optimize the model for in vivo growth Carry out multiple gene inhibition and compute the maximal theoretical growth rate (if close to zero, that combination of genes is essential for growth)