SlideShare una empresa de Scribd logo
1 de 62
Center for Computational Toxicology and Exposure, US-EPA, RTP, NC
http://www.orcid.org/0000-0002-2668-4821
US-EPA Chemicals Dashboard –
an integrated data hub for
environmental science
The views expressed in this presentation are those of the authors and do not necessarily reflect the views or policies of the U.S. EPA
Antony John Williams
williams.antony@epa.gov
Human Toxicology and EHSRC Seminar Series: September 24th 2021
Solution
• Develop a “first-stop-shop” for environmental chemical data
to support EPA and partner decision making:
– Centralized location for relevant chemical data
– Chemistry, exposure, hazard and dosimetry
– Combination of existing data and predictive models
– Publicly accessible, periodically updated, curated
• Easy access to data improves efficiency and ultimately
accelerates chemical risk assessment
CompTox Chemicals Dashboard
883k Chemical Substances
BASIC Search
• Type ahead search using Names,
synonyms and CASRNs
• Millions of identifiers
• Substring search
Search for classes of chemicals
• Examples: “perfluoro”
Detailed Chemical Pages
One more identifier – the DTXSID
• Chemical page: Wikipedia snippet when available, intrinsic
properties, structural identifiers, linked substances
Detailed Chemical Pages
Easy Navigation
• Chemical page: Wikipedia snippet when available, intrinsic
properties, structural identifiers, linked substances
From the Chemical Details Page…
all chemicals with same FORMULA
How many chemicals are associated
through LINKED SUBSTANCES?
• Atrazine, is a herbicide – in MANY commercial products
• The dashboard has salt forms, isotopically labelled forms,
multicomponent forms
• How do we identify what they are???
Linked Substances
• We map chemicals together
using cheminformatics
approaches
• Use desalting, destereo,
split multicomponents etc to
map chemicals together
Atrazine Linked Substances
Experimental and
Predicted Data
Experimental and Predicted Data
• Physchem and Fate & Transport
experimental and predicted data
• Data can be downloaded as Excel, TSV
and CSV files
• Predictions: multiple algorithms
• TEST: Toxicity Estimation Software Tool
• OPERA: OPEn structure–activity/
property Relationship App
• Other commercial algorithms
Property and Fate and Transport Data
~25 MILLION pre-predicted values
• We have built prediction models based on tens of thousands of
property data points curated over the past decade
• We push our “QSAR-Ready” chemical structures through
predictions to produce property predictions
Access to Predictions
OPERA Reports
Real-Time Predictions
17
Toxicity and Properties
Real-Time Predictions
19
Similar reports for TEST predictions
Toxicity Data
(in vitro and in vivo)
Chemical Hazard Data
ToxVal Database
• >50k chemicals
• >770k tox. values
• >30 sources of data
• ~5k journals cited
• ~70k citations
ToxCast and Tox21 bioactivity data for hazard
screening and prediction.
23
• ToxCast: more assays, fewer chemicals, EPA-driven
• Tox21: fewer assays, mostly 1536, driven by consortium
• All Tox21 data are analyzed by multiple partners
• Tox21 data is available analyzed in the ToxCast Data
Pipeline and other pipelines as well
EPA’s ToxCast program at a glance
Tox21 robot
Lets look at the data
Rich data tables – full transparency
Bioactivity Data (ToxCast/Tox21)
Data below for Bisphenol A
26
#Actives for a chemical
Safety Data
Generalized
Read-Across
GenRA (Generalised Read-Across)
Review Available Data Fingerprint indicating available data
Select and Review Analogs
GenRA (Generalised Read-Across)
31
Run GenRA
Target
Source analogues
Red : Toxicity effects.
Blue: No Toxicity effects
Grey : Absence of data
Identifiers and
Nomenclature
Identifiers Support Searches in other
systems
Challenges with Nomenclature
• Be CAREFUL with names! There is a LOT of confusion in the
public domain. CHOOSE sources wisely!
• There are MANY public databases but not many are curated
• All public databases have value but not many curate data
• Example: METHANE on PubChem
https://pubchem.ncbi.nlm.nih.gov/compound/297
CAS Registry Numbers on PubChem
CASRN lookup on the dashboard
Methane is Diamond and Nanotubes?
• These are all Depositor Names for Methane 
Underneath the Dashboard
38
DSSTox_v2
Public
Curated
5. Low
2. Low
6. Untrusted
1. High
3. High
4. Med
7. Incomplete
validated
Public_Untrusted
Public_Low
Public_Medium
Public_High
DSSTox_Low
DSSTox_High 7269
16K
33K
101K
590K
~ 310K pending
~ 150K pending
~ 150K substances in
top 4 QC quality bins
Distribution of curated data
Now at >1.2 MILLION substances
Data quality
Data quality
• Five full time curators register and curate data to elevate quality
Products Searching
What chemicals are in hair care products?
Sources of Exposure to Chemicals
Toxicity Reports and
Literature Searching
Identifiers are used in the app
• Identifiers are used to feed and link into “Literature”
Literature Searching
• Real-time retrieval of data from PubMed ~30
million abstracts and growing)
• Choose from set of pre-defined queries
• Adjust and fine tune queries based on interests
Literature Searching
• “Sifting” of results using
multiple terms
• Frequency counting terms
• Color highlighting of terms
• Download list to Excel
• Send list to PubMed for
downloading ref. file
• Direct link via PubMed ID
External Links
External Links – Also use Identifiers
Names, CASRN, PubChem IDs, InChIs…
49
External Links
• Links to ~90 websites providing access to
additional data on the chemical of interest
Chemical Lists and
Categories
>300 Chemical Lists (and growing)
52
Curated List of Pesticides
•Find list of interest
•Select list and
send to batch
Batch Searching
Batch Searching
• Singleton searches are great but…
• …we generally want data on LOTS of chemicals!
• Typical questions
• What are the structures for a set of chemical names? Set of CASRNs?
• Can I get chemical lists in Excel files? As a list of SMILES strings?
Can I get an SDF file?
• Can I include predicted properties in the download file? OPERA?
TEST?
• Are “these chemicals” screened in Toxcast?
• I’m a mass spectrometrist and need masses and formulae for a list of
chemicals
Batch Search CASRNs
56
Summary and Conclusion
• CompTox Chemicals Dashboard - a
central hub for environmental data
• ~900k chemical substances
• Integrating property data, hazard data,
exposure data, in vitro bioactivity data
• Interrogation of bioactivity data -
• Multiple types of searches
• Batch search for thousands of chemicals
• Real-time property and toxicity predictions
• Downloadable files – CSV, TSV and Excel
Recommended Reading
Recommended Reading
References
• The CompTox Chemistry Dashboard: a community data resource for environmental
chemistry, J. Cheminformatics, 9, 61 (2017)
• EPA’s DSSTox database: History of development of a curated chemistry resource
supporting computational toxicology research, Comp. Tox. 12, 100096 (2019)
• OPERA models for predicting physicochemical properties and environmental fate
endpoints, J. Cheminformatics, 10, 10 (2018)
• Screening Chemicals for Estrogen Receptor Bioactivity Using a Computational Model,
Environ. Sci. Technol. 49, 8804-8814 (2015)
• ToxCast Chemical Landscape: Paving the Road to 21st Century Toxicology, Chem. Res.
Toxicol. 29, 1225-51 (2016)
• Development and Validation of a Computational Model for Androgen Receptor Activity,
Chem. Res. Toxicol. 30, 946-964 (2017)
• CERAPP: Collaborative Estrogen Receptor Activity Prediction Project, Environ. Health
Perspect. 124, 1023 (2016)
• Abstract Sifter: a comprehensive front-end system to PubMed, F1000, 6, 2164 (2017)
You want to know more…
• Lots of resources available
• Presentations: https://tinyurl.com/w5hqs55
• Communities of Practice Videos: https://rb.gy/qsbno1
• Manual: https://rb.gy/4fgydc
• Latest News: https://comptox.epa.gov/dashboard/news_info
61
Acknowledgments
• Contact: Williams.Antony@epa.gov
• Feedback and follow-up is
welcomed! Your questions help
• The dashboard is based on the
efforts of many more team
members than us. Many
collaborators provide data also.
62
EPA’s Center for Computational Toxicology and Exposure

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

PFAS Chemistry: Range, Complexity, Groupings, and the CompTox Chemicals Dash...
PFAS Chemistry: Range, Complexity, Groupings, and the CompTox  Chemicals Dash...PFAS Chemistry: Range, Complexity, Groupings, and the CompTox  Chemicals Dash...
PFAS Chemistry: Range, Complexity, Groupings, and the CompTox Chemicals Dash...
 
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
US EPA CompTox Chemistry Dashboard as a source of data to fill data gaps for ...
 
Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...
 
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
Using the US EPA’s CompTox Chemistry Dashboard for structure identification a...
 
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
 
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
Structure identification by Mass Spectrometry Non-Targeted Analysis using the...
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
 
New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...New developments in delivering public access to data from the National Center...
New developments in delivering public access to data from the National Center...
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
 
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted AnalysisThe US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
The US-EPA CompTox Chemicals Dashboard to support Non-Targeted Analysis
 
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
TRIANGLE AREA MASS SPECTOMETRY MEETING: Structure Identification Approaches U...
 
Does bigger mean better in the world of chemistry databases?
Does bigger mean better in the world of chemistry databases? Does bigger mean better in the world of chemistry databases?
Does bigger mean better in the world of chemistry databases?
 
Incorporating new technologies and High Throughput Screening in the design an...
Incorporating new technologies and High Throughput Screening in the design an...Incorporating new technologies and High Throughput Screening in the design an...
Incorporating new technologies and High Throughput Screening in the design an...
 
US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...
US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...
US-EPA CompTox Chemicals Dashboard – integrating chemistry and biology data t...
 
US-EPA CompTox Chemicals Dashboard providing access to experimental and predi...
US-EPA CompTox Chemicals Dashboard providing access to experimental and predi...US-EPA CompTox Chemicals Dashboard providing access to experimental and predi...
US-EPA CompTox Chemicals Dashboard providing access to experimental and predi...
 
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
Environmental Chemistry Compound Identification Using High Resolution Mass Sp...
 
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
The EPA Comptox Chemicals Dashboard as a Data Integration Hub for Environment...
 
Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...Delivering The Benefits of Chemical-Biological Integration in Computational T...
Delivering The Benefits of Chemical-Biological Integration in Computational T...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...Structure Identification Using High Resolution Mass Spectrometry Data and the...
Structure Identification Using High Resolution Mass Spectrometry Data and the...
 

Similar a US-EPA Chemicals Dashboard – an integrated data hub for environmental science

The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...Andrew McEachran
 

Similar a US-EPA Chemicals Dashboard – an integrated data hub for environmental science (20)

Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
Delivering chemical-associated data via EPA web applications
Delivering chemical-associated data via EPA web applicationsDelivering chemical-associated data via EPA web applications
Delivering chemical-associated data via EPA web applications
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data DashboardsAccessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
 
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
 
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
 
US EPA CompTox Chemicals Dashboard Data Integration Hub to Support Environmen...
US EPA CompTox Chemicals Dashboard Data Integration Hub to Support Environmen...US EPA CompTox Chemicals Dashboard Data Integration Hub to Support Environmen...
US EPA CompTox Chemicals Dashboard Data Integration Hub to Support Environmen...
 
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
The EPA CompTox Dashboard as a Data Integration Hub for Environmental Chemist...
 
The EPA CompTox Chemistry Dashboard and Underpinning Software Architecture
The EPA CompTox Chemistry Dashboard and Underpinning Software Architecture The EPA CompTox Chemistry Dashboard and Underpinning Software Architecture
The EPA CompTox Chemistry Dashboard and Underpinning Software Architecture
 
Success in decision making data relevance curation
Success in decision making data relevance curationSuccess in decision making data relevance curation
Success in decision making data relevance curation
 
Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...Non-targeted analysis supported by data and cheminformatics delivered via the...
Non-targeted analysis supported by data and cheminformatics delivered via the...
 
EPA CompTox Chemicals Dashboard as a Data Integration Hub for Environmental C...
EPA CompTox Chemicals Dashboard as a Data Integration Hub for Environmental C...EPA CompTox Chemicals Dashboard as a Data Integration Hub for Environmental C...
EPA CompTox Chemicals Dashboard as a Data Integration Hub for Environmental C...
 
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental scienceUS-EPA Chemicals Dashboard – an integrated data hub for environmental science
US-EPA Chemicals Dashboard – an integrated data hub for environmental science
 
Cheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting ExposomicsCheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting Exposomics
 
EPA CompTox chemicals dashboard: An online resource for environmental chemists
EPA CompTox chemicals dashboard: An online resource for environmental chemists EPA CompTox chemicals dashboard: An online resource for environmental chemists
EPA CompTox chemicals dashboard: An online resource for environmental chemists
 
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
 
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
CompTox Chemicals Dashboard: Data and tools to support chemical and environme...
 
Progress in Using Big Data in Chemical Toxicity Research at the National Cent...
Progress in Using Big Data in Chemical Toxicity Research at the National Cent...Progress in Using Big Data in Chemical Toxicity Research at the National Cent...
Progress in Using Big Data in Chemical Toxicity Research at the National Cent...
 
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Env...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Env...The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Env...
The EPA Comptox Chemistry Dashboard: A Web-Based Data Integration Hub for Env...
 
Delivering access to chemistry and bioassay data from the National Center for...
Delivering access to chemistry and bioassay data from the National Center for...Delivering access to chemistry and bioassay data from the National Center for...
Delivering access to chemistry and bioassay data from the National Center for...
 

Último

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINsankalpkumarsahoo174
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 

Último (20)

PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATINChromatin Structure | EUCHROMATIN | HETEROCHROMATIN
Chromatin Structure | EUCHROMATIN | HETEROCHROMATIN
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 

US-EPA Chemicals Dashboard – an integrated data hub for environmental science

  • 1. Center for Computational Toxicology and Exposure, US-EPA, RTP, NC http://www.orcid.org/0000-0002-2668-4821 US-EPA Chemicals Dashboard – an integrated data hub for environmental science The views expressed in this presentation are those of the authors and do not necessarily reflect the views or policies of the U.S. EPA Antony John Williams williams.antony@epa.gov Human Toxicology and EHSRC Seminar Series: September 24th 2021
  • 2. Solution • Develop a “first-stop-shop” for environmental chemical data to support EPA and partner decision making: – Centralized location for relevant chemical data – Chemistry, exposure, hazard and dosimetry – Combination of existing data and predictive models – Publicly accessible, periodically updated, curated • Easy access to data improves efficiency and ultimately accelerates chemical risk assessment
  • 3. CompTox Chemicals Dashboard 883k Chemical Substances
  • 4. BASIC Search • Type ahead search using Names, synonyms and CASRNs • Millions of identifiers • Substring search
  • 5. Search for classes of chemicals • Examples: “perfluoro”
  • 6. Detailed Chemical Pages One more identifier – the DTXSID • Chemical page: Wikipedia snippet when available, intrinsic properties, structural identifiers, linked substances
  • 7. Detailed Chemical Pages Easy Navigation • Chemical page: Wikipedia snippet when available, intrinsic properties, structural identifiers, linked substances
  • 8. From the Chemical Details Page… all chemicals with same FORMULA
  • 9. How many chemicals are associated through LINKED SUBSTANCES? • Atrazine, is a herbicide – in MANY commercial products • The dashboard has salt forms, isotopically labelled forms, multicomponent forms • How do we identify what they are???
  • 10. Linked Substances • We map chemicals together using cheminformatics approaches • Use desalting, destereo, split multicomponents etc to map chemicals together
  • 13. Experimental and Predicted Data • Physchem and Fate & Transport experimental and predicted data • Data can be downloaded as Excel, TSV and CSV files • Predictions: multiple algorithms • TEST: Toxicity Estimation Software Tool • OPERA: OPEn structure–activity/ property Relationship App • Other commercial algorithms
  • 14. Property and Fate and Transport Data ~25 MILLION pre-predicted values • We have built prediction models based on tens of thousands of property data points curated over the past decade • We push our “QSAR-Ready” chemical structures through predictions to produce property predictions
  • 20. Similar reports for TEST predictions
  • 21. Toxicity Data (in vitro and in vivo)
  • 22. Chemical Hazard Data ToxVal Database • >50k chemicals • >770k tox. values • >30 sources of data • ~5k journals cited • ~70k citations
  • 23. ToxCast and Tox21 bioactivity data for hazard screening and prediction. 23 • ToxCast: more assays, fewer chemicals, EPA-driven • Tox21: fewer assays, mostly 1536, driven by consortium • All Tox21 data are analyzed by multiple partners • Tox21 data is available analyzed in the ToxCast Data Pipeline and other pipelines as well EPA’s ToxCast program at a glance Tox21 robot
  • 24. Lets look at the data
  • 25. Rich data tables – full transparency
  • 26. Bioactivity Data (ToxCast/Tox21) Data below for Bisphenol A 26
  • 27. #Actives for a chemical
  • 30. GenRA (Generalised Read-Across) Review Available Data Fingerprint indicating available data Select and Review Analogs
  • 31. GenRA (Generalised Read-Across) 31 Run GenRA Target Source analogues Red : Toxicity effects. Blue: No Toxicity effects Grey : Absence of data
  • 33. Identifiers Support Searches in other systems
  • 34. Challenges with Nomenclature • Be CAREFUL with names! There is a LOT of confusion in the public domain. CHOOSE sources wisely! • There are MANY public databases but not many are curated • All public databases have value but not many curate data • Example: METHANE on PubChem https://pubchem.ncbi.nlm.nih.gov/compound/297
  • 35. CAS Registry Numbers on PubChem
  • 36. CASRN lookup on the dashboard
  • 37. Methane is Diamond and Nanotubes? • These are all Depositor Names for Methane 
  • 39. DSSTox_v2 Public Curated 5. Low 2. Low 6. Untrusted 1. High 3. High 4. Med 7. Incomplete validated Public_Untrusted Public_Low Public_Medium Public_High DSSTox_Low DSSTox_High 7269 16K 33K 101K 590K ~ 310K pending ~ 150K pending ~ 150K substances in top 4 QC quality bins Distribution of curated data Now at >1.2 MILLION substances
  • 41. Data quality • Five full time curators register and curate data to elevate quality
  • 42. Products Searching What chemicals are in hair care products?
  • 43. Sources of Exposure to Chemicals
  • 45. Identifiers are used in the app • Identifiers are used to feed and link into “Literature”
  • 46. Literature Searching • Real-time retrieval of data from PubMed ~30 million abstracts and growing) • Choose from set of pre-defined queries • Adjust and fine tune queries based on interests
  • 47. Literature Searching • “Sifting” of results using multiple terms • Frequency counting terms • Color highlighting of terms • Download list to Excel • Send list to PubMed for downloading ref. file • Direct link via PubMed ID
  • 49. External Links – Also use Identifiers Names, CASRN, PubChem IDs, InChIs… 49
  • 50. External Links • Links to ~90 websites providing access to additional data on the chemical of interest
  • 52. >300 Chemical Lists (and growing) 52
  • 53. Curated List of Pesticides •Find list of interest •Select list and send to batch
  • 55. Batch Searching • Singleton searches are great but… • …we generally want data on LOTS of chemicals! • Typical questions • What are the structures for a set of chemical names? Set of CASRNs? • Can I get chemical lists in Excel files? As a list of SMILES strings? Can I get an SDF file? • Can I include predicted properties in the download file? OPERA? TEST? • Are “these chemicals” screened in Toxcast? • I’m a mass spectrometrist and need masses and formulae for a list of chemicals
  • 57. Summary and Conclusion • CompTox Chemicals Dashboard - a central hub for environmental data • ~900k chemical substances • Integrating property data, hazard data, exposure data, in vitro bioactivity data • Interrogation of bioactivity data - • Multiple types of searches • Batch search for thousands of chemicals • Real-time property and toxicity predictions • Downloadable files – CSV, TSV and Excel
  • 60. References • The CompTox Chemistry Dashboard: a community data resource for environmental chemistry, J. Cheminformatics, 9, 61 (2017) • EPA’s DSSTox database: History of development of a curated chemistry resource supporting computational toxicology research, Comp. Tox. 12, 100096 (2019) • OPERA models for predicting physicochemical properties and environmental fate endpoints, J. Cheminformatics, 10, 10 (2018) • Screening Chemicals for Estrogen Receptor Bioactivity Using a Computational Model, Environ. Sci. Technol. 49, 8804-8814 (2015) • ToxCast Chemical Landscape: Paving the Road to 21st Century Toxicology, Chem. Res. Toxicol. 29, 1225-51 (2016) • Development and Validation of a Computational Model for Androgen Receptor Activity, Chem. Res. Toxicol. 30, 946-964 (2017) • CERAPP: Collaborative Estrogen Receptor Activity Prediction Project, Environ. Health Perspect. 124, 1023 (2016) • Abstract Sifter: a comprehensive front-end system to PubMed, F1000, 6, 2164 (2017)
  • 61. You want to know more… • Lots of resources available • Presentations: https://tinyurl.com/w5hqs55 • Communities of Practice Videos: https://rb.gy/qsbno1 • Manual: https://rb.gy/4fgydc • Latest News: https://comptox.epa.gov/dashboard/news_info 61
  • 62. Acknowledgments • Contact: Williams.Antony@epa.gov • Feedback and follow-up is welcomed! Your questions help • The dashboard is based on the efforts of many more team members than us. Many collaborators provide data also. 62 EPA’s Center for Computational Toxicology and Exposure