SlideShare una empresa de Scribd logo
1 de 23
Descargar para leer sin conexión
Dispensing Processes Impact Computational and
              Statistical Analyses


     Sean Ekins1, Joe Olechno2 Antony J. Williams3


            1 Collaborationsin Chemistry, Fuquay Varina, NC.
                      2 Labcyte Inc, Sunnyvale, CA.
             3 Royal Society of Chemistry, Wake Forest, NC.




  Disclaimer: SE and AJW have no affiliation with Labcyte and have
                  not been engaged as consultants
Where do scientists get
                           chemistry/ biology
                                  data?

                            Databases
                            Patents
                            Papers
                            Your own lab
                            Collaborators


“If I have seen further     Some or all of the
  than others, it is by      above?
   standing upon the        What is common to
 shoulders of giants.”       all? – quality issues

    Isaac Newton
From data hoarding           Pharma company data
   to open data             hoarding - to open data


              (IMDB)




              Me




              Linked Open
              data cloud
              2011
              (Wikipedia)
Simple Rules for licensing      Could data ‘open accessibility’
        “open” data                     equal ‘Disruption’


As we see a future of increased   1: NIH and other international
database integration the          scientific funding bodies should
licensing of the data may be a    mandate …open accessibility for
hurdle that hampers progress      all data generated by publicly
and usability.                    funded research immediately




Williams, Wilbanks and Ekins.
                                  Ekins, Waller, Bradley, Clark and
PLoS Comput Biol 8(9):
                                  Williams. DDT In press, 2013
e1002706, 2012
..drug structure quality is
Data can be found – but …
                                       important


                             More groups doing in silico
                              repositioning
                             Target-based or ligand-based
                             Network and systems biology
                             integrating or using sets of
                              FDA drugs..if the structures
                              are incorrect predictions will
                              be too..
                             Need a definitive set of FDA
                              approved drugs with correct
                              structures
                             Also linkage between in vitro
                              data & clinical data
Finding structures of Pharma molecules is hard




NCATS and MRC
made molecule
identifiers from
pharmas available
with no structures    Southan et al., DDT, 18: 58-70 (2013)
Structure Quality Issues
Database released and within days 100’s of errors found in structures
                                          NPC Browser http://tripod.nih.gov/npc/

               Science Translational Medicine 2011




                                                 Williams and Ekins, DDT, 16: 747-750 (2011)
Data Errors in the NPC Browser:
                                 Analysis of Steroids




   Substructure   # of    # of           No            Incomplete       Complete but

                  Hits   Correct   stereochemistry   Stereochemistry      incorrect

                          Hits                                         stereochemistry



  Gonane          34       5             8                 21                0


  Gon-4-ene       55       12            3                 33                7


  Gon-1,4-diene   60       17            10                23                10



Williams, Ekins and Tkachenko, DDT 17: 685-701 (2012)
Its not just structure quality we
   DDT editorial Dec 2011                 need to worry about




This editorial led to the current
work http://goo.gl/dIqhU
Plastic leaching
        How do you move a
             liquid?




                                 McDonald et al., Science 2008,
                                 322, 917.
                                 Belaiche et al., Clin Chem 2009,
Images courtesy of Bing, Tecan   55, 1883-1884
Moving liquids with sound




Tipless transfer

No cross-contamination, carryover or
leachates

Accurate, precise - assay miniaturization

Images courtesy of Labcyte Inc.
 http://goo.gl/K0Fjz
Tip-based Serial Aqueous dilution vs. Acoustic dispensing comparison



                                       Data appears randomly
                                       scattered.

                                       24% had IC50 values > 3
                                       fold weaker using tip-
                                       based dispensing.

                                       8% produced no value
                                       using tip-based dispensing.

                                       No analysis of molecule
                                       properties.

                                     Spicer et al., In Drug Discovery
                                     Technology: Boston, 2005.
Tip-based Serial Aqueous dilution vs. Acoustic dispensing comparison




  ~ 40 12 point IC50 values.

  Compounds more active when using acoustic dispensing.
  Correlation in data is poor with many compounds showing
  >10 fold shift in potency depending on dispensing method.

  No analysis of molecule properties.
 Wingfield et al., American Drug Discovery 2008, 3, 24-30.
Tip-based Serial Aqueous dilution vs. Acoustic dispensing comparison




• Inhibition of tyrosine kinases at 10 µM for ~10,000 compounds.
• False +ve from acoustic transfer (as measured by subsequent
  IC50 analyses) = 19% of hits.
• False +ve from tip-based transfers = 55% of all hits.
• 60 more compounds were identified as active with acoustic
  transfer.

• No analysis of molecule properties.

    Wingfield, J. Drug Discovery 2012: Manchester, UK, 2012
Using literature data from different dispensing methods to generate
                          computational models

Few molecule structures and corresponding datasets are public

Using data from 2 AstraZeneca patents –

Tyrosine kinase EphB4 pharmacophores (Accelrys Discovery
Studio) were developed using data for 14 compounds

IC50 determined using different dispensing methods

Analyzed correlation with simple descriptors (SAS JMP)

Calculated LogP correlation with log IC50 data for acoustic
dispensing (r2 = 0.34, p < 0.05, N = 14)



  Barlaam, B. C.; Ducray, R., WO 2009/010794 A1, 2009
  Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010
Examples of IC50 values produced via acoustic transfer with
direct dilution vs those generated with Tip-based dispensing




Acoustically-derived IC50 values were 1.5 to 276.5-fold lower than
for Tip-based dispensing
Barlaam, B. C.; Ducray, R., WO 2009/010794 A1, 2009
Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010
A graph of the log IC50 values for tip-based serial dilution
and dispensing versus acoustic dispensing with direct dilution
 shows a poor correlation between techniques (R2 = 0.246).

                                                     1.5

                                                        1

                                                     0.5

                                                        0
           -3    -2.5   -2   -1.5      -1     -0.5          0   0.5   1   1.5
 log IC50-tips




                                                     -0.5

                                                      -1

                                                     -1.5       acoustic
                                                                technique
                                                      -2        always gave
                                                                a more
                                                     -2.5
                                                                potent IC50
                                                      -3        value
                                    log IC50-acoustic
Tyrosine kinase EphB4 Pharmacophores


                                                                  Cyan = hydrophobic

                                                                  Green = hydrogen bond
                                                                  acceptor

                                                                  Purple = hydrogen bond donor

                                                                  Each model shows most
                                                                  potent molecule mapping


           Acoustic                       Tip based

                                 Hydrophobic       Hydrogen      Hydrogen     Observed vs.

                                features (HPF)   bond acceptor   bond donor   predicted IC50

                                                    (HBA)          (HBD)            r



Acoustic mediated process
                                      2               1              1            0.92

Tip-based process
                                      0               2              1            0.80



  Ekins, Olechno and Williams, Submitted 2012
Test set evaluation of pharmacophores

• An additional 12 compounds from AstraZeneca
       Barlaam, B. C.; Ducray, R., WO 2008/132505 A1, 2008

• 10 of these compounds had data for tip based dispensing
  and 2 for acoustic dispensing

• Calculated LogP and logD showed low but statistically
  significant correlations with tip based dispensing (r2=
  0.39 p < 0.05 and 0.24 p < 0.05, N = 36)

• Used as a test set for pharmacophores

• The two compounds analyzed with acoustic liquid
  handling were predicted in the top 3 using the ‘acoustic’
  pharmacophore

• The ‘Tip-based’ pharmacophore failed to rank the
  retrieved compounds correctly
Automated receptor-ligand pharmacophore generation
                        method

    Pharmacophores for the tyrosine kinase EphB4 generated from crystal
structures in the protein data bank PDB using Discovery Studio version 3.5.5
                                                                Cyan =
                                                                hydrophobic

                                                                Green = hydrogen
                                                                bond acceptor

                                                                Purple = hydrogen
                                                                bond donor

                                                                Grey = excluded
                                                                volumes

                                                                Each model shows
                                                                most potent
                                                                molecule mapping



                                                              Bioorg   Med Chem Lett
                                                              2010,    20, 6242-6245.
                                                              Bioorg   Med Chem Lett
                                                              2008,    18, 5717-5721.
                                                              Bioorg   Med Chem Lett
                                                              2008,    18, 2776-2780.
                                                              Bioorg   Med Chem Lett
                                                              2011,    21, 2207-2211.
Summary

• In the absence of structural data, pharmacophores and other
  computational and statistical models are used to guide medicinal
  chemistry in early drug discovery.

• Our findings suggest acoustic dispensing methods could improve HTS
  results and avoid the development of misleading computational models
  and statistical relationships.


• Automated pharmacophores are closer to pharmacophore generated
  with acoustic data – all have hydrophobic features – missing from Tip-
  based pharmacophore model

• Importance of hydrophobicity seen with logP correlation and
  crystal structure interactions

• Public databases should annotate this meta-data alongside biological
  data points, to create larger datasets for comparing different
  computational methods.
Strengths and Weaknesses
• Small dataset size – focused on one compound series

• No previous publication describing how data quality can be
  impacted by dispensing and how this in turn affects
  computational models and downstream decision making.

• No comparison of pharmacophores generated from acoustic
  dispensing and tip-based dispensing.

• No previous comparison of pharmacophores generated from in
  vitro data with pharmacophores automatically generated from
  X-ray crystal conformations of inhibitors.


• Severely limited by number of structures in public domain
  with data in both systems

• Reluctance of many to accept that this could be an issue
The stuff of nightmares?
 How much of the data in databases is generated by tip based serial
  dilution methods
 How much is erroneous
 Do we have to start again?
 How does it affect all subsequent science – data mining etc
 Does it impact Pharmas productivity?

Más contenido relacionado

Similar a Dispensing Processes Impact Computational Analyses

Data Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug DiscoveryData Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug DiscoverySean Ekins
 
Acs towards a gold standard database
Acs   towards a gold standard  database Acs   towards a gold standard  database
Acs towards a gold standard database Sean Ekins
 
Stephen Friend PRISME Forum 2011-05-04
Stephen Friend PRISME Forum 2011-05-04Stephen Friend PRISME Forum 2011-05-04
Stephen Friend PRISME Forum 2011-05-04Sage Base
 
Stephen Friend HHMI-Penn 2011-05-27
Stephen Friend HHMI-Penn 2011-05-27Stephen Friend HHMI-Penn 2011-05-27
Stephen Friend HHMI-Penn 2011-05-27Sage Base
 
Scio12 sem web_final
Scio12 sem web_finalScio12 sem web_final
Scio12 sem web_finalKristi Holmes
 
The Semantic Web - This time... its Personal
The Semantic Web - This time... its PersonalThe Semantic Web - This time... its Personal
The Semantic Web - This time... its PersonalMark Wilkinson
 
Stephen Friend Cytoscape Retreat 2011-05-20
Stephen Friend Cytoscape Retreat 2011-05-20Stephen Friend Cytoscape Retreat 2011-05-20
Stephen Friend Cytoscape Retreat 2011-05-20Sage Base
 
Tales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life SciencesTales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life SciencesStefano Di Carlo
 
What Synthetic Biology Can Do For You
What Synthetic Biology Can Do For YouWhat Synthetic Biology Can Do For You
What Synthetic Biology Can Do For YouEric Ma
 
Genetic_Research_Lesson1_Slides_NWABR.ppt
Genetic_Research_Lesson1_Slides_NWABR.pptGenetic_Research_Lesson1_Slides_NWABR.ppt
Genetic_Research_Lesson1_Slides_NWABR.pptDESMONDEZIEKE1
 
Structural Systems Pharmacology
Structural Systems PharmacologyStructural Systems Pharmacology
Structural Systems PharmacologyPhilip Bourne
 
Slas talk 2016
Slas talk 2016Slas talk 2016
Slas talk 2016Sean Ekins
 
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24Sage Base
 
Stephen Friend Food & Drug Administration 2011-07-18
Stephen Friend Food & Drug Administration 2011-07-18Stephen Friend Food & Drug Administration 2011-07-18
Stephen Friend Food & Drug Administration 2011-07-18Sage Base
 
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05Sage Base
 
Research - this time it's personal
Research - this time it's personalResearch - this time it's personal
Research - this time it's personalMark Wilkinson
 
C:\fakepath\bioit world2010
C:\fakepath\bioit world2010C:\fakepath\bioit world2010
C:\fakepath\bioit world2010guestdde063f8
 

Similar a Dispensing Processes Impact Computational Analyses (20)

Mining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposingMining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposing
 
Data Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug DiscoveryData Quality Issues That Can Impact Drug Discovery
Data Quality Issues That Can Impact Drug Discovery
 
Acs towards a gold standard database
Acs   towards a gold standard  database Acs   towards a gold standard  database
Acs towards a gold standard database
 
Complete Human Genome Sequencing
Complete Human Genome SequencingComplete Human Genome Sequencing
Complete Human Genome Sequencing
 
Stephen Friend PRISME Forum 2011-05-04
Stephen Friend PRISME Forum 2011-05-04Stephen Friend PRISME Forum 2011-05-04
Stephen Friend PRISME Forum 2011-05-04
 
Stephen Friend HHMI-Penn 2011-05-27
Stephen Friend HHMI-Penn 2011-05-27Stephen Friend HHMI-Penn 2011-05-27
Stephen Friend HHMI-Penn 2011-05-27
 
Scio12 sem web_final
Scio12 sem web_finalScio12 sem web_final
Scio12 sem web_final
 
The Semantic Web - This time... its Personal
The Semantic Web - This time... its PersonalThe Semantic Web - This time... its Personal
The Semantic Web - This time... its Personal
 
Stephen Friend Cytoscape Retreat 2011-05-20
Stephen Friend Cytoscape Retreat 2011-05-20Stephen Friend Cytoscape Retreat 2011-05-20
Stephen Friend Cytoscape Retreat 2011-05-20
 
Tales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life SciencesTales from BioLand - Engineering Challenges in the World of Life Sciences
Tales from BioLand - Engineering Challenges in the World of Life Sciences
 
What Synthetic Biology Can Do For You
What Synthetic Biology Can Do For YouWhat Synthetic Biology Can Do For You
What Synthetic Biology Can Do For You
 
Genetic_Research_Lesson1_Slides_NWABR.ppt
Genetic_Research_Lesson1_Slides_NWABR.pptGenetic_Research_Lesson1_Slides_NWABR.ppt
Genetic_Research_Lesson1_Slides_NWABR.ppt
 
NRNB EAC Report 2011
NRNB EAC Report 2011NRNB EAC Report 2011
NRNB EAC Report 2011
 
Structural Systems Pharmacology
Structural Systems PharmacologyStructural Systems Pharmacology
Structural Systems Pharmacology
 
Slas talk 2016
Slas talk 2016Slas talk 2016
Slas talk 2016
 
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24
Stephen Friend Genetic Alliance 25th Anniversary 2011-06-24
 
Stephen Friend Food & Drug Administration 2011-07-18
Stephen Friend Food & Drug Administration 2011-07-18Stephen Friend Food & Drug Administration 2011-07-18
Stephen Friend Food & Drug Administration 2011-07-18
 
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05
Stephen Friend IMPAKT Breast Cancer Conference 2011-05-05
 
Research - this time it's personal
Research - this time it's personalResearch - this time it's personal
Research - this time it's personal
 
C:\fakepath\bioit world2010
C:\fakepath\bioit world2010C:\fakepath\bioit world2010
C:\fakepath\bioit world2010
 

Más de Sean Ekins

How to Win a small business grant.pptx
How to Win a small business grant.pptxHow to Win a small business grant.pptx
How to Win a small business grant.pptxSean Ekins
 
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Sean Ekins
 
A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...Sean Ekins
 
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Sean Ekins
 
Bayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseBayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseSean Ekins
 
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Sean Ekins
 
Drug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueDrug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueSean Ekins
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesSean Ekins
 
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchFive Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchSean Ekins
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation Sean Ekins
 
academic / small company collaborations for rare and neglected diseasesv2
 academic / small company collaborations for rare and neglected diseasesv2 academic / small company collaborations for rare and neglected diseasesv2
academic / small company collaborations for rare and neglected diseasesv2Sean Ekins
 
CDD models case study #3
CDD models case study #3 CDD models case study #3
CDD models case study #3 Sean Ekins
 
CDD models case study #2
CDD models case study #2 CDD models case study #2
CDD models case study #2 Sean Ekins
 
CDD Models case study #1
CDD Models case study #1 CDD Models case study #1
CDD Models case study #1 Sean Ekins
 
Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...
Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...
Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...Sean Ekins
 
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...Sean Ekins
 
The future of computational chemistry b ig
The future of computational chemistry b igThe future of computational chemistry b ig
The future of computational chemistry b igSean Ekins
 
#ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - #ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - Sean Ekins
 
Pros and cons of social networking for scientists
Pros and cons of social networking for scientistsPros and cons of social networking for scientists
Pros and cons of social networking for scientistsSean Ekins
 
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery CollaborationsCDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery CollaborationsSean Ekins
 

Más de Sean Ekins (20)

How to Win a small business grant.pptx
How to Win a small business grant.pptxHow to Win a small business grant.pptx
How to Win a small business grant.pptx
 
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
Evaluating Multiple Machine Learning Models for Biodegradation and Aquatic To...
 
A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...A presentation at the Global Genes rare drug development symposium on governm...
A presentation at the Global Genes rare drug development symposium on governm...
 
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...Leveraging Science Communication and Social Media to Build Your Brand and Ele...
Leveraging Science Communication and Social Media to Build Your Brand and Ele...
 
Bayesian Models for Chagas Disease
Bayesian Models for Chagas DiseaseBayesian Models for Chagas Disease
Bayesian Models for Chagas Disease
 
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
Assay Central: A New Approach to Compiling Big Data and Preparing Machine Lea...
 
Drug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issueDrug Discovery Today March 2017 special issue
Drug Discovery Today March 2017 special issue
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
 
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or ResearchFive Ways to Use Social Media to Raise Awareness for Your Paper or Research
Five Ways to Use Social Media to Raise Awareness for Your Paper or Research
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation
 
academic / small company collaborations for rare and neglected diseasesv2
 academic / small company collaborations for rare and neglected diseasesv2 academic / small company collaborations for rare and neglected diseasesv2
academic / small company collaborations for rare and neglected diseasesv2
 
CDD models case study #3
CDD models case study #3 CDD models case study #3
CDD models case study #3
 
CDD models case study #2
CDD models case study #2 CDD models case study #2
CDD models case study #2
 
CDD Models case study #1
CDD Models case study #1 CDD Models case study #1
CDD Models case study #1
 
Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...
Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...
Using Machine Learning Models Based on Phenotypic Data to Discover New Molecu...
 
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
CDD: Vault, CDD: Vision and CDD: Models software for biologists and chemists ...
 
The future of computational chemistry b ig
The future of computational chemistry b igThe future of computational chemistry b ig
The future of computational chemistry b ig
 
#ZikaOpen: Homology Models -
#ZikaOpen: Homology Models - #ZikaOpen: Homology Models -
#ZikaOpen: Homology Models -
 
Pros and cons of social networking for scientists
Pros and cons of social networking for scientistsPros and cons of social networking for scientists
Pros and cons of social networking for scientists
 
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery CollaborationsCDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
CDD: Vault, CDD: Vision and CDD: Models for Drug Discovery Collaborations
 

Dispensing Processes Impact Computational Analyses

  • 1. Dispensing Processes Impact Computational and Statistical Analyses Sean Ekins1, Joe Olechno2 Antony J. Williams3 1 Collaborationsin Chemistry, Fuquay Varina, NC. 2 Labcyte Inc, Sunnyvale, CA. 3 Royal Society of Chemistry, Wake Forest, NC. Disclaimer: SE and AJW have no affiliation with Labcyte and have not been engaged as consultants
  • 2. Where do scientists get chemistry/ biology data?  Databases  Patents  Papers  Your own lab  Collaborators “If I have seen further  Some or all of the than others, it is by above? standing upon the  What is common to shoulders of giants.” all? – quality issues Isaac Newton
  • 3. From data hoarding Pharma company data to open data hoarding - to open data (IMDB) Me Linked Open data cloud 2011 (Wikipedia)
  • 4. Simple Rules for licensing Could data ‘open accessibility’ “open” data equal ‘Disruption’ As we see a future of increased 1: NIH and other international database integration the scientific funding bodies should licensing of the data may be a mandate …open accessibility for hurdle that hampers progress all data generated by publicly and usability. funded research immediately Williams, Wilbanks and Ekins. Ekins, Waller, Bradley, Clark and PLoS Comput Biol 8(9): Williams. DDT In press, 2013 e1002706, 2012
  • 5. ..drug structure quality is Data can be found – but … important  More groups doing in silico repositioning  Target-based or ligand-based  Network and systems biology  integrating or using sets of FDA drugs..if the structures are incorrect predictions will be too..  Need a definitive set of FDA approved drugs with correct structures  Also linkage between in vitro data & clinical data
  • 6. Finding structures of Pharma molecules is hard NCATS and MRC made molecule identifiers from pharmas available with no structures Southan et al., DDT, 18: 58-70 (2013)
  • 7. Structure Quality Issues Database released and within days 100’s of errors found in structures NPC Browser http://tripod.nih.gov/npc/ Science Translational Medicine 2011 Williams and Ekins, DDT, 16: 747-750 (2011)
  • 8. Data Errors in the NPC Browser: Analysis of Steroids Substructure # of # of No Incomplete Complete but Hits Correct stereochemistry Stereochemistry incorrect Hits stereochemistry Gonane 34 5 8 21 0 Gon-4-ene 55 12 3 33 7 Gon-1,4-diene 60 17 10 23 10 Williams, Ekins and Tkachenko, DDT 17: 685-701 (2012)
  • 9. Its not just structure quality we DDT editorial Dec 2011 need to worry about This editorial led to the current work http://goo.gl/dIqhU
  • 10. Plastic leaching How do you move a liquid? McDonald et al., Science 2008, 322, 917. Belaiche et al., Clin Chem 2009, Images courtesy of Bing, Tecan 55, 1883-1884
  • 11. Moving liquids with sound Tipless transfer No cross-contamination, carryover or leachates Accurate, precise - assay miniaturization Images courtesy of Labcyte Inc. http://goo.gl/K0Fjz
  • 12. Tip-based Serial Aqueous dilution vs. Acoustic dispensing comparison Data appears randomly scattered. 24% had IC50 values > 3 fold weaker using tip- based dispensing. 8% produced no value using tip-based dispensing. No analysis of molecule properties. Spicer et al., In Drug Discovery Technology: Boston, 2005.
  • 13. Tip-based Serial Aqueous dilution vs. Acoustic dispensing comparison ~ 40 12 point IC50 values. Compounds more active when using acoustic dispensing. Correlation in data is poor with many compounds showing >10 fold shift in potency depending on dispensing method. No analysis of molecule properties. Wingfield et al., American Drug Discovery 2008, 3, 24-30.
  • 14. Tip-based Serial Aqueous dilution vs. Acoustic dispensing comparison • Inhibition of tyrosine kinases at 10 µM for ~10,000 compounds. • False +ve from acoustic transfer (as measured by subsequent IC50 analyses) = 19% of hits. • False +ve from tip-based transfers = 55% of all hits. • 60 more compounds were identified as active with acoustic transfer. • No analysis of molecule properties. Wingfield, J. Drug Discovery 2012: Manchester, UK, 2012
  • 15. Using literature data from different dispensing methods to generate computational models Few molecule structures and corresponding datasets are public Using data from 2 AstraZeneca patents – Tyrosine kinase EphB4 pharmacophores (Accelrys Discovery Studio) were developed using data for 14 compounds IC50 determined using different dispensing methods Analyzed correlation with simple descriptors (SAS JMP) Calculated LogP correlation with log IC50 data for acoustic dispensing (r2 = 0.34, p < 0.05, N = 14) Barlaam, B. C.; Ducray, R., WO 2009/010794 A1, 2009 Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010
  • 16. Examples of IC50 values produced via acoustic transfer with direct dilution vs those generated with Tip-based dispensing Acoustically-derived IC50 values were 1.5 to 276.5-fold lower than for Tip-based dispensing Barlaam, B. C.; Ducray, R., WO 2009/010794 A1, 2009 Barlaam, B. C.; Ducray, R.; Kettle, J. G., US 7,718,653 B2, 2010
  • 17. A graph of the log IC50 values for tip-based serial dilution and dispensing versus acoustic dispensing with direct dilution shows a poor correlation between techniques (R2 = 0.246). 1.5 1 0.5 0 -3 -2.5 -2 -1.5 -1 -0.5 0 0.5 1 1.5 log IC50-tips -0.5 -1 -1.5 acoustic technique -2 always gave a more -2.5 potent IC50 -3 value log IC50-acoustic
  • 18. Tyrosine kinase EphB4 Pharmacophores Cyan = hydrophobic Green = hydrogen bond acceptor Purple = hydrogen bond donor Each model shows most potent molecule mapping Acoustic Tip based Hydrophobic Hydrogen Hydrogen Observed vs. features (HPF) bond acceptor bond donor predicted IC50 (HBA) (HBD) r Acoustic mediated process 2 1 1 0.92 Tip-based process 0 2 1 0.80 Ekins, Olechno and Williams, Submitted 2012
  • 19. Test set evaluation of pharmacophores • An additional 12 compounds from AstraZeneca Barlaam, B. C.; Ducray, R., WO 2008/132505 A1, 2008 • 10 of these compounds had data for tip based dispensing and 2 for acoustic dispensing • Calculated LogP and logD showed low but statistically significant correlations with tip based dispensing (r2= 0.39 p < 0.05 and 0.24 p < 0.05, N = 36) • Used as a test set for pharmacophores • The two compounds analyzed with acoustic liquid handling were predicted in the top 3 using the ‘acoustic’ pharmacophore • The ‘Tip-based’ pharmacophore failed to rank the retrieved compounds correctly
  • 20. Automated receptor-ligand pharmacophore generation method Pharmacophores for the tyrosine kinase EphB4 generated from crystal structures in the protein data bank PDB using Discovery Studio version 3.5.5 Cyan = hydrophobic Green = hydrogen bond acceptor Purple = hydrogen bond donor Grey = excluded volumes Each model shows most potent molecule mapping Bioorg Med Chem Lett 2010, 20, 6242-6245. Bioorg Med Chem Lett 2008, 18, 5717-5721. Bioorg Med Chem Lett 2008, 18, 2776-2780. Bioorg Med Chem Lett 2011, 21, 2207-2211.
  • 21. Summary • In the absence of structural data, pharmacophores and other computational and statistical models are used to guide medicinal chemistry in early drug discovery. • Our findings suggest acoustic dispensing methods could improve HTS results and avoid the development of misleading computational models and statistical relationships. • Automated pharmacophores are closer to pharmacophore generated with acoustic data – all have hydrophobic features – missing from Tip- based pharmacophore model • Importance of hydrophobicity seen with logP correlation and crystal structure interactions • Public databases should annotate this meta-data alongside biological data points, to create larger datasets for comparing different computational methods.
  • 22. Strengths and Weaknesses • Small dataset size – focused on one compound series • No previous publication describing how data quality can be impacted by dispensing and how this in turn affects computational models and downstream decision making. • No comparison of pharmacophores generated from acoustic dispensing and tip-based dispensing. • No previous comparison of pharmacophores generated from in vitro data with pharmacophores automatically generated from X-ray crystal conformations of inhibitors. • Severely limited by number of structures in public domain with data in both systems • Reluctance of many to accept that this could be an issue
  • 23. The stuff of nightmares?  How much of the data in databases is generated by tip based serial dilution methods  How much is erroneous  Do we have to start again?  How does it affect all subsequent science – data mining etc  Does it impact Pharmas productivity?