Tim Cheeseright, Assessing the Similarities of Compound collections using molecular fields: Does it add value?

Tim Cheeseright, Mark Mackey, Rob Scoffin, Martin Slater

Assessing the similarity of compound collections
using molecular fields: Does it add value?

1

Conclusions

> It works brilliantly
> All synthetic steps gave yields of 100%
> All enrichments were perfect
> All new molecules were sub nM
> All QSARs were totally predictive, q2 = 1.0

> We expect the call from Sweden any day now

2

Conclusions

> Work in progress
> 3D similarity can add value to compound
selection
> Full matrix of similarities possibly unnecessary
> Using probes looks like a possible solution
> Not a panacea

3

Agenda & Background

> Fields & similarity
> Generating screening compounds using Fields
> Selecting a 10K “diverse” library for screening
from commercial compounds
> Initial thoughts
> Problems
> More Initial thoughts
> A solution but not a complete one
> Conclusions
4

Field Points

Condensed representation of electrostatic, hydrophobic
and shape properties (“protein‟s view”)
> Molecular Field Extrema (“Field Points”)

2D 3D Molecular Field Points
Electrostatic = Positive
Potential (MEP) = Negative
= Shape
= Hydrophobic
5

Improved MM Electrostatics

> Field patterns from XED force field reproduce
experimental results
Experimental Using XEDs Not using XEDs

Interaction of Acetone and
Any-OH from small molecule
XED adds ‘p-orbitals’ to
crystal structures
get better representation
of atoms
6

Non-Classical Comparisons

7

Molecular Alignment

0.82

0.66 0.98

Cheeseright et al, J. Chem Inf. Mod., 2006, 665
8

Using Fields

> Bioisosteric groups
> Virtual Screening
> Pharmacophore hypothesis
> Qualitative SAR interpretation
> 3D QSAR
> Library Design

9

Field based library design success

10

Libraries from Fields

> Small, custom synthesised libraries (~100s -
1000s compds)
> Low scaffold diversity
> Highly targeted
> Lots of manual design

11

An Opportunity & a Challenge

> Provide a small diverse screening library 10K for
a small biotech company

> Diversity in potential biological targets to be hit

> Minimum redundancy in the set

> Maximum chance of success in finding a lead within
available budget and screening resources

12

Initial thoughts

> Customised design not an option - commercial
compounds only
> Using Fields to successfully select compounds for
screening performed many times
> Virtual screening
> Always in a specific biological context
> What about using Fields to choose a „diverse‟ set
> Possible problem with numbers
> 10,000 cmpd library small
> 9,000,000 commercially available molecules v. large for 3D
diversity

13

Initial thoughts

> Compare 3D and 2D similarities for compound
collections - are we wasting our time?
> Take a small compound collection
> Full NxN calculation
> 3D method = Fields & Shape
> 2D method = atom pairs

> Compare and Contrast

14

Conformations

> 3D Method requires conformations - which
one(s) to use?
> What is the similarity of 2 compounds in 3D ?
> Context is important!
> Highest across all conformations?
> Average ?
> Lowest ?
> For 3D, similarity calculation is Nconfs x Nconfs

15

Compound Collection

> BIONET 'Rule of Three' ('Ro3') Fragment
Library: “7,907 'Ro3'-compliant fragments”
> Conformation hunt on every fragment 
Maximum of 5 conformations (!)
> Full N x N similarity matrix, 3D & 2D (60 Million
data points)

> ~30 compounds failed conformation hunting

17

Problems

> 400Mb of data
> Tedious to use and examine
Pilot study just using the first 500 compounds
> Some chemical families in this area
> Still a large dataset to deal with (250,000 data points)
> 2D similarities and fragments
> Small changes cause disproportionately high changes
> Atom pairs particularly bad
> Switch to KNIME fingerprints
 All 2D values lower than „normal‟

18

Comparing 2D and 3D metrics

Agreement

19

Example - Similar Scores

2D sim = 0.9
101 104

3D field sim = 0.87

22

Example - Higher 3D Sim

2D sim = 0.1
(other methods=0.3)

3D field sim = 0.82

23


2D sim = 0.2

141 454

3D sim = 0.7

24


2D sim = 0.3

(other methods 0.55)
437 440

3D field sim = 0.8

25

So…

> Pilot study suggests some added value
> Full matrix painful even if we could calculate it

> What about a reduced matrix?
> Use „Probe‟ compounds to tease out molecules that are
different in Field space
How many probes?
Across how many molecules

> We were running out of time…

26

Compound selection by Field Diversity

> Proposed workflow for generation of a field diverse library:

9M Pick 200
commercial Calc. 200 X 200
sub-set
compounds 2D similarity Pick 100
Calc. Shape matrix Diverse
Diversity by Field
Property PMI probes
Filters
1.2M Pick 20K
sub-set
Calc. 20K X 100
Field similarity
matrix

Pick 12K
3D PCA on
Field
Field matrix
Diverse set
27

Field Diverse library: Outcome

12K „Field Diverse‟ library mapped by 3D PCA on the
100 x 20,000 „Field Similarity Fingerprint‟
Ammoniums
Piperidines Distinct separation of
charged species within
this space

….so what!!

Benzoic and
aliphatic acids

30

Field Diverse library: Outcome

12K „Field Diverse‟ library mapped by 3D PCA

Distinct separation of by
molecules by size within
this space

….so what!!

Decreasing
Size

31

Deeper - Moderate „Field Similarity‟

Alignment to „template1‟

32


Random selection of mols Alignment to „template1‟

33


Alignment to „template‟

35

Is the chemical space sensible?

Small sulphonamides

Large esters

Two example clusters 36

Conclusions

> Work in progress
> Full similarity matrix shows potential of 3D sim to
add value
> Full matrix difficult to handle and possibly
unnecessary
> Using probes looks like a possible solution
> Not a panacea - still need to play the numbers
game

37

Acknowledgements

> Cresset
> Martin Slater
> Rob Scoffin
> Mark Mackey
> James Melville
> Mission Therapeutics
> Keith Menear

38

Tim Cheeseright, Assessing the Similarities of Compound collections using molecular fields: Does it add value?

Recomendados

Recomendados

Más contenido relacionado

Similar a Tim Cheeseright, Assessing the Similarities of Compound collections using molecular fields: Does it add value?

Similar a Tim Cheeseright, Assessing the Similarities of Compound collections using molecular fields: Does it add value? (20)

Más de Cresset

Más de Cresset (18)

Último

Último (20)

Tim Cheeseright, Assessing the Similarities of Compound collections using molecular fields: Does it add value?

Notas del editor