SlideShare una empresa de Scribd logo
1 de 17
An Updated Comparison of Selected Public and Commercial Bioactive Chemistry Databases ,[object Object],[object Object],[object Object],http://www.cdsouthan.info/Consult/CDS_cons.htm
Entity Relationships: in vitro  activity-to-compound-to-protein mapping   MAQALPWLLLWMGAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEEPEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYVVFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQTDESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLLK Document  Assay  Result  Compound  Sequence Unstructured data  Structured data  Expert extraction and curation
Databases of Bioactive Compounds Public Commercial
Comparing Compound Sets   ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Filtration of Sources and Subsets   Dataset Filtered cpds Filtration reduction GVKBio 2,054,151 -8% GVKBio Journals 658,198 -8% GVKBio patents 1,484,218 -7% GVKBIO DD 3,675 -4% GVKBIO CCD 8,864 -1% GVKBIO BACE1 5,228 -11% GVKBIO BACE1 journals 389 -6% GVKBIO BACE1 patents 4,901 -11% WOMBAT 180,856 -18% PubChem 14,965,539 -23% PubChem Prous 4,652 -2% PubChem PDB 5,706 -8% PubChem actives 7,472 -3% PubChem pharmacol 5,311 -63% PubChem MLSMR 233,284 -1% PunChem BindingDB 24,203 -4% PubChem ChEBI 7,428 -31% DrugBank all  4,545 -7% DrugBank approved 1,341 -3% DrugBank experimental 2,999 -6% DNP 144,383 -26% MDDR 176,600 -4% MDDR launched 1,435 -5%
Document Counts
Protein Counts
Compounds-per-protein
Pair-wise Comparison Matrix: 23 X 23   GVKBIO GVKBIO Journals GVKBIO Patents GVKBIO DD GVKBIO CCD WOMBAT PubChem GVKBIO 2,054,151 658,198 1,484,218 2,847 6,178 171,178 925,845 GVKBIO Journals   658,198 88,265 2,779 5,492 169,734 361,192 GVKBIO Patents     1,484,218 1,404 3,149 45,564 633,115 GVKBIO DD       3,675 33 1,060 3,513 GVKBIO CCD         8,864 2,652 7,925 WOMBAT           180,856 133,124 PubChem             14,965,539
Coverage of Commercial Databases by PubChem
Molecular Libraries-Small Molecule Repository MLSMR 233,284,  PubChem actives  7,472
Comparison of Journal Extractions Document ratios  GVK:WOM:BDb 50:9:1
GVKBIO vs WOMBAT vs PubChem
Comparison of Approved Drug Collections
Public vs Commercial Total Merges
Conclusions ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
References and Acknowledgments ,[object Object],www.jcheminf.com/content/1/1/10 PMID: 17897036

Más contenido relacionado

Destacado (7)

SciFinder Scholar CAS Chemistry Database
SciFinder Scholar CAS Chemistry DatabaseSciFinder Scholar CAS Chemistry Database
SciFinder Scholar CAS Chemistry Database
 
Digging out Structures for Repurposing: Non-competitive Intelligence ...
Digging out Structures for Repurposing: Non-competitive Intelligence        ...Digging out Structures for Repurposing: Non-competitive Intelligence        ...
Digging out Structures for Repurposing: Non-competitive Intelligence ...
 
BioAssay Research Database Presentation at the Chem Axon UGM 2013
BioAssay Research Database Presentation at the Chem Axon UGM 2013BioAssay Research Database Presentation at the Chem Axon UGM 2013
BioAssay Research Database Presentation at the Chem Axon UGM 2013
 
PubChem Database
PubChem DatabasePubChem Database
PubChem Database
 
20130410 carbohydrates
20130410 carbohydrates20130410 carbohydrates
20130410 carbohydrates
 
Integrating R with the CDK: Enhanced Chemical Data Mining
Integrating R with the CDK: Enhanced Chemical Data MiningIntegrating R with the CDK: Enhanced Chemical Data Mining
Integrating R with the CDK: Enhanced Chemical Data Mining
 
PubChem Bioassays as a Source of Polypharmacology
PubChem Bioassays as a Source of PolypharmacologyPubChem Bioassays as a Source of Polypharmacology
PubChem Bioassays as a Source of Polypharmacology
 

Similar a Public and Commercial Bioactive Chemistry Databases (2009)

Pharma Research Automation by Connecting Researchers with Robots and Systems ...
Pharma Research Automation by Connecting Researchers with Robots and Systems ...Pharma Research Automation by Connecting Researchers with Robots and Systems ...
Pharma Research Automation by Connecting Researchers with Robots and Systems ...
camunda services GmbH
 
Titer capacity analysis
Titer capacity analysisTiter capacity analysis
Titer capacity analysis
GBX Summits
 
Modelling
ModellingModelling
Modelling
skarri
 
Analyzing the relationship between titer and processing time
Analyzing the relationship between titer and processing timeAnalyzing the relationship between titer and processing time
Analyzing the relationship between titer and processing time
GBX Summits
 
Resume_John Archer_02-22-2016
Resume_John Archer_02-22-2016Resume_John Archer_02-22-2016
Resume_John Archer_02-22-2016
John Archer
 
Final Draft Biology Research Skills Essay
Final Draft Biology Research Skills EssayFinal Draft Biology Research Skills Essay
Final Draft Biology Research Skills Essay
Owen Walton
 
Guide for conducting comparability exercise for biological product registration
Guide for conducting comparability exercise for biological product registrationGuide for conducting comparability exercise for biological product registration
Guide for conducting comparability exercise for biological product registration
Clapbio
 
Exploiting PubChem for drug discovery based on natural products
Exploiting PubChem for drug discovery based on natural productsExploiting PubChem for drug discovery based on natural products
Exploiting PubChem for drug discovery based on natural products
Sunghwan Kim
 

Similar a Public and Commercial Bioactive Chemistry Databases (2009) (20)

Pharma Research Automation by Connecting Researchers with Robots and Systems ...
Pharma Research Automation by Connecting Researchers with Robots and Systems ...Pharma Research Automation by Connecting Researchers with Robots and Systems ...
Pharma Research Automation by Connecting Researchers with Robots and Systems ...
 
FYP report
FYP reportFYP report
FYP report
 
FAIR connectivity for DARCP
FAIR  connectivity for DARCPFAIR  connectivity for DARCP
FAIR connectivity for DARCP
 
Titer capacity analysis
Titer capacity analysisTiter capacity analysis
Titer capacity analysis
 
Enabling higher process titers at a manufacturing facility
Enabling higher process titers at a manufacturing facilityEnabling higher process titers at a manufacturing facility
Enabling higher process titers at a manufacturing facility
 
Modelling
ModellingModelling
Modelling
 
Analyzing the relationship between titer and processing time
Analyzing the relationship between titer and processing timeAnalyzing the relationship between titer and processing time
Analyzing the relationship between titer and processing time
 
PAT Innovation, Christoph Herwig Vienna GBX LIVE
PAT Innovation, Christoph Herwig Vienna GBX LIVEPAT Innovation, Christoph Herwig Vienna GBX LIVE
PAT Innovation, Christoph Herwig Vienna GBX LIVE
 
Resume_John Archer_02-22-2016
Resume_John Archer_02-22-2016Resume_John Archer_02-22-2016
Resume_John Archer_02-22-2016
 
IRJET - IoT based Steroid Measurement in Milk Products
IRJET - IoT based Steroid Measurement in Milk ProductsIRJET - IoT based Steroid Measurement in Milk Products
IRJET - IoT based Steroid Measurement in Milk Products
 
Final Draft Biology Research Skills Essay
Final Draft Biology Research Skills EssayFinal Draft Biology Research Skills Essay
Final Draft Biology Research Skills Essay
 
Clinical sas course syllabus
Clinical sas course syllabusClinical sas course syllabus
Clinical sas course syllabus
 
14-6-Monge
14-6-Monge14-6-Monge
14-6-Monge
 
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
 
Review of information retrieval – first experiences in Germany with manufactu...
Review of information retrieval – first experiences in Germany with manufactu...Review of information retrieval – first experiences in Germany with manufactu...
Review of information retrieval – first experiences in Germany with manufactu...
 
GiTools
GiToolsGiTools
GiTools
 
Biomarker Strategies
Biomarker StrategiesBiomarker Strategies
Biomarker Strategies
 
Guide for conducting comparability exercise for biological product registration
Guide for conducting comparability exercise for biological product registrationGuide for conducting comparability exercise for biological product registration
Guide for conducting comparability exercise for biological product registration
 
Exploiting PubChem for drug discovery based on natural products
Exploiting PubChem for drug discovery based on natural productsExploiting PubChem for drug discovery based on natural products
Exploiting PubChem for drug discovery based on natural products
 
Webinar: New RMC - Your lead_optimization Solution June082017
Webinar: New RMC - Your lead_optimization Solution June082017Webinar: New RMC - Your lead_optimization Solution June082017
Webinar: New RMC - Your lead_optimization Solution June082017
 

Más de Chris Southan

Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2
Chris Southan
 
In silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentIn silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug Development
Chris Southan
 

Más de Chris Southan (20)

Connectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivityConnectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivity
 
Peptide tribulations
Peptide tribulationsPeptide tribulations
Peptide tribulations
 
Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2
 
Guide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updaeGuide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updae
 
In silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentIn silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug Development
 
Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?
 
Desperately seeking DARCP
Desperately seeking DARCPDesperately seeking DARCP
Desperately seeking DARCP
 
Seeking glimmers of light in Pharos “Tdark” proteins
Seeking glimmers of light in  Pharos “Tdark” proteinsSeeking glimmers of light in  Pharos “Tdark” proteins
Seeking glimmers of light in Pharos “Tdark” proteins
 
5HT2A modulators update for SAFER
5HT2A modulators update for SAFER5HT2A modulators update for SAFER
5HT2A modulators update for SAFER
 
Quality and noise in big chemistry databases
Quality and noise in big chemistry databasesQuality and noise in big chemistry databases
Quality and noise in big chemistry databases
 
Connecting chemistry-to-biology
Connecting chemistry-to-biology Connecting chemistry-to-biology
Connecting chemistry-to-biology
 
GtoPdb June 2019 poster
GtoPdb June 2019 posterGtoPdb June 2019 poster
GtoPdb June 2019 poster
 
PubChem as a source of systems biology perturbagens
PubChem as a source of  systems biology perturbagensPubChem as a source of  systems biology perturbagens
PubChem as a source of systems biology perturbagens
 
PubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biologyPubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biology
 
Will the real proteins please stand up
Will the real proteins please stand upWill the real proteins please stand up
Will the real proteins please stand up
 
Peptide Tribulations
Peptide TribulationsPeptide Tribulations
Peptide Tribulations
 
Looking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIRLooking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIR
 
Guide to Immunopharmacology update
Guide to Immunopharmacology updateGuide to Immunopharmacology update
Guide to Immunopharmacology update
 
Druggable Proteome sources in UniProt
Druggable Proteome sources in UniProtDruggable Proteome sources in UniProt
Druggable Proteome sources in UniProt
 
Peptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdbPeptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdb
 

Último

Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al MizharAl Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
allensay1
 
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan CytotecJual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cytotec
ZurliaSoop
 
Mckinsey foundation level Handbook for Viewing
Mckinsey foundation level Handbook for ViewingMckinsey foundation level Handbook for Viewing
Mckinsey foundation level Handbook for Viewing
Nauman Safdar
 

Último (20)

Dr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfDr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
 
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All TimeCall 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
Call 7737669865 Vadodara Call Girls Service at your Door Step Available All Time
 
CROSS CULTURAL NEGOTIATION BY PANMISEM NS
CROSS CULTURAL NEGOTIATION BY PANMISEM NSCROSS CULTURAL NEGOTIATION BY PANMISEM NS
CROSS CULTURAL NEGOTIATION BY PANMISEM NS
 
Organizational Transformation Lead with Culture
Organizational Transformation Lead with CultureOrganizational Transformation Lead with Culture
Organizational Transformation Lead with Culture
 
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al MizharAl Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
 
Uneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration PresentationUneak White's Personal Brand Exploration Presentation
Uneak White's Personal Brand Exploration Presentation
 
HomeRoots Pitch Deck | Investor Insights | April 2024
HomeRoots Pitch Deck | Investor Insights | April 2024HomeRoots Pitch Deck | Investor Insights | April 2024
HomeRoots Pitch Deck | Investor Insights | April 2024
 
GUWAHATI 💋 Call Girl 9827461493 Call Girls in Escort service book now
GUWAHATI 💋 Call Girl 9827461493 Call Girls in  Escort service book nowGUWAHATI 💋 Call Girl 9827461493 Call Girls in  Escort service book now
GUWAHATI 💋 Call Girl 9827461493 Call Girls in Escort service book now
 
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan CytotecJual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cytotec
Jual Obat Aborsi ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan Cytotec
 
QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptx
QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptxQSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptx
QSM Chap 10 Service Culture in Tourism and Hospitality Industry.pptx
 
Mckinsey foundation level Handbook for Viewing
Mckinsey foundation level Handbook for ViewingMckinsey foundation level Handbook for Viewing
Mckinsey foundation level Handbook for Viewing
 
Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...
Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...
Horngren’s Cost Accounting A Managerial Emphasis, Canadian 9th edition soluti...
 
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAIGetting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
Getting Real with AI - Columbus DAW - May 2024 - Nick Woo from AlignAI
 
UAE Bur Dubai Call Girls ☏ 0564401582 Call Girl in Bur Dubai
UAE Bur Dubai Call Girls ☏ 0564401582 Call Girl in Bur DubaiUAE Bur Dubai Call Girls ☏ 0564401582 Call Girl in Bur Dubai
UAE Bur Dubai Call Girls ☏ 0564401582 Call Girl in Bur Dubai
 
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDINGBerhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
Berhampur 70918*19311 CALL GIRLS IN ESCORT SERVICE WE ARE PROVIDING
 
Nashik Call Girl Just Call 7091819311 Top Class Call Girl Service Available
Nashik Call Girl Just Call 7091819311 Top Class Call Girl Service AvailableNashik Call Girl Just Call 7091819311 Top Class Call Girl Service Available
Nashik Call Girl Just Call 7091819311 Top Class Call Girl Service Available
 
Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1Katrina Personal Brand Project and portfolio 1
Katrina Personal Brand Project and portfolio 1
 
Cannabis Legalization World Map: 2024 Updated
Cannabis Legalization World Map: 2024 UpdatedCannabis Legalization World Map: 2024 Updated
Cannabis Legalization World Map: 2024 Updated
 
Marel Q1 2024 Investor Presentation from May 8, 2024
Marel Q1 2024 Investor Presentation from May 8, 2024Marel Q1 2024 Investor Presentation from May 8, 2024
Marel Q1 2024 Investor Presentation from May 8, 2024
 
New 2024 Cannabis Edibles Investor Pitch Deck Template
New 2024 Cannabis Edibles Investor Pitch Deck TemplateNew 2024 Cannabis Edibles Investor Pitch Deck Template
New 2024 Cannabis Edibles Investor Pitch Deck Template
 

Public and Commercial Bioactive Chemistry Databases (2009)

  • 1.
  • 2. Entity Relationships: in vitro activity-to-compound-to-protein mapping MAQALPWLLLWMGAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEEPEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYVVFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQTDESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLLK Document Assay Result Compound Sequence Unstructured data Structured data Expert extraction and curation
  • 3. Databases of Bioactive Compounds Public Commercial
  • 4.
  • 5. Filtration of Sources and Subsets Dataset Filtered cpds Filtration reduction GVKBio 2,054,151 -8% GVKBio Journals 658,198 -8% GVKBio patents 1,484,218 -7% GVKBIO DD 3,675 -4% GVKBIO CCD 8,864 -1% GVKBIO BACE1 5,228 -11% GVKBIO BACE1 journals 389 -6% GVKBIO BACE1 patents 4,901 -11% WOMBAT 180,856 -18% PubChem 14,965,539 -23% PubChem Prous 4,652 -2% PubChem PDB 5,706 -8% PubChem actives 7,472 -3% PubChem pharmacol 5,311 -63% PubChem MLSMR 233,284 -1% PunChem BindingDB 24,203 -4% PubChem ChEBI 7,428 -31% DrugBank all 4,545 -7% DrugBank approved 1,341 -3% DrugBank experimental 2,999 -6% DNP 144,383 -26% MDDR 176,600 -4% MDDR launched 1,435 -5%
  • 9. Pair-wise Comparison Matrix: 23 X 23   GVKBIO GVKBIO Journals GVKBIO Patents GVKBIO DD GVKBIO CCD WOMBAT PubChem GVKBIO 2,054,151 658,198 1,484,218 2,847 6,178 171,178 925,845 GVKBIO Journals   658,198 88,265 2,779 5,492 169,734 361,192 GVKBIO Patents     1,484,218 1,404 3,149 45,564 633,115 GVKBIO DD       3,675 33 1,060 3,513 GVKBIO CCD         8,864 2,652 7,925 WOMBAT           180,856 133,124 PubChem             14,965,539
  • 10. Coverage of Commercial Databases by PubChem
  • 11. Molecular Libraries-Small Molecule Repository MLSMR 233,284, PubChem actives 7,472
  • 12. Comparison of Journal Extractions Document ratios GVK:WOM:BDb 50:9:1
  • 13. GVKBIO vs WOMBAT vs PubChem
  • 14. Comparison of Approved Drug Collections
  • 15. Public vs Commercial Total Merges
  • 16.
  • 17.

Notas del editor

  1. PubChem output like small pharma so far