SlideShare a Scribd company logo
1 of 26
AMOS: the EPA database of analytical methods
and open mass spectral database supporting
non-targeted analysis
Gregory Janesch1, Erik Carr1, Vicente Samano2, James McCord3,
Jacqueline Bangma3, Jon Sobus4 and Antony Williams4
1. ORAU Student Services Contractor 2. Senior Environmental Employment Program
3. Center for Environmental Measurement and Modeling and 4. Center for Computational Toxicology & Exposure,
ALL at the U.S. Environmental Protection Agency
October 2023: FDA Cheminformatics Workshop
The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
Background
• A huge number of openly available sources exist for spectra,
documentation of analytical procedures, etc.
• Search engines can easily find high-traffic sources, but
maybe not niche-but-high-quality ones
– Most are not “structurally-enabled”
• Useful to have complementary types of data alongside each
other, especially with consistent substance identifiers
• Non-targeted analysis can benefit from a broad, high-quality
experimental database as a reference
2
About AMOS - General
• AMOS is a cheminformatics application integrating spectra
and analytical methods with consistent substance identifiers
• Provides mappings between substances and records (method
documents, experimental spectra, etc.)
• Under development for ~18 months as a “proof-of-concept”;
not yet available publicly
3
About AMOS - Data
• Three categories of records:
– Spectra (~210,000)
– Methods (~4100)
– Fact Sheets (>3000)
• Most data are open access, some are just external links
– All data links back to the original source, if possible
• Data is being continually updated (new datasets & updates)
• Many chemicals of interest to EPA – PFAS, pesticides, etc.
4
About AMOS - Curation
• Identifiers vary between different sources so we must curate
5
• A single chemical can have dozens
of names
– FTOH 10:1
– 10:1 FTOH
– 10:1 Fluorotelomer alcohol
– 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,10,10,11,11,11
-Henicosafluoroundecan-1-ol
– 1-Undecanol, 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,
10,10,11,11,11-heneicosafluoro-
About AMOS – Curation Issue Example
6
About AMOS – Curating Methods
• Often have a table of
substances
– Can be extracted with scripts
• Sample matrix, limits of
detection, etc. still need to be
manually collected
• Some are old, scanned
documents that require fully
manual work
7
Spectra
• About 210,000 experimental spectra covering about 21,500
substances (not including externally-linked ones)
• Most are from external sources
– About 90% between MassBank EU, MoNA, & HMDB
• EPA labs now providing spectra (especially PFAS)
• Includes metadata like instrument settings (when possible)
8
Methods
• Almost 4100 in AMOS so far from an assortment of
vendors, publications, and government agencies
– Agencies including US-EPA, DEA, CDC, FDA, OSHA, USGS, USDA
– Vendors including Agilent, Shimadzu, LECO, Sciex
• Searchable on analytes, matrix, analytical methodology,
source
• Methods can be linked to sets of spectra
9
• Search by DTXSID,
InChIKey, CAS number,
name
• Can filter on record types or
other information
• InChIKeys and some names
will prompt disambiguation
10
General Search
General Search – Disambiguation
11
InChIKey example search:
General Search – Spectra
12
General Search – Spectra
13
General Search – Methods & Fact Sheets
14
General Search – Methods & Fact Sheets
15
Method with Spectra
16
Batch Search
17
• Search a set of DTXSIDs,
download info on spectra and
methods and links to original data
Methods List
18
Methods List – Filtering
19
Similar Method Search
20
Similar Method Search
21
Spectrum Search
22
Connections to Other Applications
• Other apps often deal with
focused subsets of
chemicals; AMOS’s data
can augment that
• API endpoints have been
built for an NTA application
– Originally just in silico spectra
23
Future Work
• Add more data assembled from EPA labs (standards)
• Improvements to spectral searching – in testing
– Structure, substructure, and similarity searching
• Expand spectral and chromatographic metadata
• Additional integration with other EPA applications
– Mostly just simple links to AMOS pages at the moment
• Hoping to release to the public in 2024
24
Summary
• AMOS combines multiple kinds of analytical chemistry
data
– Primarily mass spectrometry data
– Growing steadily for the foreseeable future
• Data can be queried via a cheminformatically-oriented
application
• Intended to be useful as both an independent application
and a way of augmenting other EPA applications 25
Acknowledgements
• Greg Janesch – Database and App Development
• Sakuntala Sivasupramaniam – curation
• Tyler Carr – curation, visualizations
• Joshua Powell, Asif Rashid, Freddie Valone – assorted
technical support
If you want to help, send information regarding analytical
methods and method articles to williams.antony@epa.gov
26

More Related Content

Similar to AMOS: the EPA database of analytical methods and open mass spectral database supporting non-targeted analysis

Drug Discovery and Development Using AI
Drug Discovery and Development Using AIDrug Discovery and Development Using AI
Drug Discovery and Development Using AIDatabricks
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Kamel Mansouri
 

Similar to AMOS: the EPA database of analytical methods and open mass spectral database supporting non-targeted analysis (20)

Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...
Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...
Applying Cheminformatics to Develop a Structure Searchable Database of Analyt...
 
Using Cheminformatics Approaches to Develop a Structure Searchable Database o...
Using Cheminformatics Approaches to Develop a Structure Searchable Database o...Using Cheminformatics Approaches to Develop a Structure Searchable Database o...
Using Cheminformatics Approaches to Develop a Structure Searchable Database o...
 
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
Cheminformatics tools and chemistry data underpinning mass spectrometry analy...
 
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
US-EPA Cheminformatics Support for Delivering Data Related to Chemicals of E...
 
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
Introduction to Cheminformatics: Accessing data through the CompTox Chemicals...
 
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
Accessing information for Per- & Polyfluoroalkyl Substances using the US EPA ...
 
Delivering chemical-associated data via EPA web applications
Delivering chemical-associated data via EPA web applicationsDelivering chemical-associated data via EPA web applications
Delivering chemical-associated data via EPA web applications
 
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
Applications of the US EPA’s CompTox Chemistry Dashboard to support structure...
 
Cheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting ExposomicsCheminformatics Support for MS Supporting Exposomics
Cheminformatics Support for MS Supporting Exposomics
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
Data delivery from the US-EPA Center for Computational Toxicology and Exposur...
 
Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...
Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...
Comparison of lists of per- and polyfluoroalkyl substances (PFAS) based on di...
 
Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...
Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...
Integrating an Analytical Methods and Mass Spectral Database with Cheminforma...
 
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
Accessing Environmental Chemistry Data via Data Dashboards and Applications t...
 
Drug Discovery and Development Using AI
Drug Discovery and Development Using AIDrug Discovery and Development Using AI
Drug Discovery and Development Using AI
 
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
Consensus Models to Predict Endocrine Disruption for All Human-Exposure Chemi...
 
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...Structure identification approaches using the EPA CompTox Chemicals Dashboard...
Structure identification approaches using the EPA CompTox Chemicals Dashboard...
 
Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...
Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...
Cheminformatics Tools to Access Data for PFAS and Constituents of Fluorine-Fr...
 
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
The US-EPA CompTox Chemicals Dashboard – a key player in the domain of Open S...
 
Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...Development of a Tool for Systematic Integration of Traditional and New Appro...
Development of a Tool for Systematic Integration of Traditional and New Appro...
 

Recently uploaded

SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx
SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptxSaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx
SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptxPat (JS) Heslop-Harrison
 
Towards a revolution in the social sciences FINAL FINAL FINAL FINAL FINAL.pdf
Towards a revolution in the social sciences FINAL FINAL FINAL FINAL FINAL.pdfTowards a revolution in the social sciences FINAL FINAL FINAL FINAL FINAL.pdf
Towards a revolution in the social sciences FINAL FINAL FINAL FINAL FINAL.pdfSujay Rao Mandavilli
 
Heads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdfHeads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdfbyp19971001
 
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdfFORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdfSuchita Rawat
 
A Scientific PowerPoint on Albert Einstein
A Scientific PowerPoint on Albert EinsteinA Scientific PowerPoint on Albert Einstein
A Scientific PowerPoint on Albert Einsteinxgamestudios8
 
Vital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed RahimoonVital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed Rahimoonintarciacompanies
 
GBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolationGBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolationAreesha Ahmad
 
Warming the earth and the atmosphere.pptx
Warming the earth and the atmosphere.pptxWarming the earth and the atmosphere.pptx
Warming the earth and the atmosphere.pptxGlendelCaroz
 
Taphonomy and Quality of the Fossil Record
Taphonomy and Quality of the  Fossil RecordTaphonomy and Quality of the  Fossil Record
Taphonomy and Quality of the Fossil RecordSangram Sahoo
 
Electricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 studentsElectricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 studentslevieagacer
 
Costs to heap leach gold ore tailings in Karamoja region of Uganda
Costs to heap leach gold ore tailings in Karamoja region of UgandaCosts to heap leach gold ore tailings in Karamoja region of Uganda
Costs to heap leach gold ore tailings in Karamoja region of UgandaTimothyOkuna
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsSérgio Sacani
 
GBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisGBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisAreesha Ahmad
 
GBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) MetabolismGBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) MetabolismAreesha Ahmad
 
Precision Farming in Fruit Crops presentation
Precision Farming in Fruit Crops presentationPrecision Farming in Fruit Crops presentation
Precision Farming in Fruit Crops presentationscvns2828
 
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptx
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptxNanoparticles for the Treatment of Alzheimer’s Disease_102718.pptx
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptxssusera4ec7b
 
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...yogeshlabana357357
 
Information science research with large language models: between science and ...
Information science research with large language models: between science and ...Information science research with large language models: between science and ...
Information science research with large language models: between science and ...Fabiano Dalpiaz
 
Fun for mover student's book- English book for teaching.pdf
Fun for mover student's book- English book for teaching.pdfFun for mover student's book- English book for teaching.pdf
Fun for mover student's book- English book for teaching.pdfhoangquan21999
 
NuGOweek 2024 programme final FLYER short.pdf
NuGOweek 2024 programme final FLYER short.pdfNuGOweek 2024 programme final FLYER short.pdf
NuGOweek 2024 programme final FLYER short.pdfpablovgd
 

Recently uploaded (20)

SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx
SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptxSaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx
SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx
 
Towards a revolution in the social sciences FINAL FINAL FINAL FINAL FINAL.pdf
Towards a revolution in the social sciences FINAL FINAL FINAL FINAL FINAL.pdfTowards a revolution in the social sciences FINAL FINAL FINAL FINAL FINAL.pdf
Towards a revolution in the social sciences FINAL FINAL FINAL FINAL FINAL.pdf
 
Heads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdfHeads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdf
 
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdfFORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
 
A Scientific PowerPoint on Albert Einstein
A Scientific PowerPoint on Albert EinsteinA Scientific PowerPoint on Albert Einstein
A Scientific PowerPoint on Albert Einstein
 
Vital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed RahimoonVital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed Rahimoon
 
GBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolationGBSN - Microbiology (Unit 5) Concept of isolation
GBSN - Microbiology (Unit 5) Concept of isolation
 
Warming the earth and the atmosphere.pptx
Warming the earth and the atmosphere.pptxWarming the earth and the atmosphere.pptx
Warming the earth and the atmosphere.pptx
 
Taphonomy and Quality of the Fossil Record
Taphonomy and Quality of the  Fossil RecordTaphonomy and Quality of the  Fossil Record
Taphonomy and Quality of the Fossil Record
 
Electricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 studentsElectricity and Circuits for Grade 9 students
Electricity and Circuits for Grade 9 students
 
Costs to heap leach gold ore tailings in Karamoja region of Uganda
Costs to heap leach gold ore tailings in Karamoja region of UgandaCosts to heap leach gold ore tailings in Karamoja region of Uganda
Costs to heap leach gold ore tailings in Karamoja region of Uganda
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
 
GBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisGBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of Asepsis
 
GBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) MetabolismGBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) Metabolism
 
Precision Farming in Fruit Crops presentation
Precision Farming in Fruit Crops presentationPrecision Farming in Fruit Crops presentation
Precision Farming in Fruit Crops presentation
 
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptx
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptxNanoparticles for the Treatment of Alzheimer’s Disease_102718.pptx
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptx
 
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
 
Information science research with large language models: between science and ...
Information science research with large language models: between science and ...Information science research with large language models: between science and ...
Information science research with large language models: between science and ...
 
Fun for mover student's book- English book for teaching.pdf
Fun for mover student's book- English book for teaching.pdfFun for mover student's book- English book for teaching.pdf
Fun for mover student's book- English book for teaching.pdf
 
NuGOweek 2024 programme final FLYER short.pdf
NuGOweek 2024 programme final FLYER short.pdfNuGOweek 2024 programme final FLYER short.pdf
NuGOweek 2024 programme final FLYER short.pdf
 

AMOS: the EPA database of analytical methods and open mass spectral database supporting non-targeted analysis

  • 1. AMOS: the EPA database of analytical methods and open mass spectral database supporting non-targeted analysis Gregory Janesch1, Erik Carr1, Vicente Samano2, James McCord3, Jacqueline Bangma3, Jon Sobus4 and Antony Williams4 1. ORAU Student Services Contractor 2. Senior Environmental Employment Program 3. Center for Environmental Measurement and Modeling and 4. Center for Computational Toxicology & Exposure, ALL at the U.S. Environmental Protection Agency October 2023: FDA Cheminformatics Workshop The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
  • 2. Background • A huge number of openly available sources exist for spectra, documentation of analytical procedures, etc. • Search engines can easily find high-traffic sources, but maybe not niche-but-high-quality ones – Most are not “structurally-enabled” • Useful to have complementary types of data alongside each other, especially with consistent substance identifiers • Non-targeted analysis can benefit from a broad, high-quality experimental database as a reference 2
  • 3. About AMOS - General • AMOS is a cheminformatics application integrating spectra and analytical methods with consistent substance identifiers • Provides mappings between substances and records (method documents, experimental spectra, etc.) • Under development for ~18 months as a “proof-of-concept”; not yet available publicly 3
  • 4. About AMOS - Data • Three categories of records: – Spectra (~210,000) – Methods (~4100) – Fact Sheets (>3000) • Most data are open access, some are just external links – All data links back to the original source, if possible • Data is being continually updated (new datasets & updates) • Many chemicals of interest to EPA – PFAS, pesticides, etc. 4
  • 5. About AMOS - Curation • Identifiers vary between different sources so we must curate 5 • A single chemical can have dozens of names – FTOH 10:1 – 10:1 FTOH – 10:1 Fluorotelomer alcohol – 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,10,10,11,11,11 -Henicosafluoroundecan-1-ol – 1-Undecanol, 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9, 10,10,11,11,11-heneicosafluoro-
  • 6. About AMOS – Curation Issue Example 6
  • 7. About AMOS – Curating Methods • Often have a table of substances – Can be extracted with scripts • Sample matrix, limits of detection, etc. still need to be manually collected • Some are old, scanned documents that require fully manual work 7
  • 8. Spectra • About 210,000 experimental spectra covering about 21,500 substances (not including externally-linked ones) • Most are from external sources – About 90% between MassBank EU, MoNA, & HMDB • EPA labs now providing spectra (especially PFAS) • Includes metadata like instrument settings (when possible) 8
  • 9. Methods • Almost 4100 in AMOS so far from an assortment of vendors, publications, and government agencies – Agencies including US-EPA, DEA, CDC, FDA, OSHA, USGS, USDA – Vendors including Agilent, Shimadzu, LECO, Sciex • Searchable on analytes, matrix, analytical methodology, source • Methods can be linked to sets of spectra 9
  • 10. • Search by DTXSID, InChIKey, CAS number, name • Can filter on record types or other information • InChIKeys and some names will prompt disambiguation 10 General Search
  • 11. General Search – Disambiguation 11 InChIKey example search:
  • 12. General Search – Spectra 12
  • 13. General Search – Spectra 13
  • 14. General Search – Methods & Fact Sheets 14
  • 15. General Search – Methods & Fact Sheets 15
  • 17. Batch Search 17 • Search a set of DTXSIDs, download info on spectra and methods and links to original data
  • 19. Methods List – Filtering 19
  • 23. Connections to Other Applications • Other apps often deal with focused subsets of chemicals; AMOS’s data can augment that • API endpoints have been built for an NTA application – Originally just in silico spectra 23
  • 24. Future Work • Add more data assembled from EPA labs (standards) • Improvements to spectral searching – in testing – Structure, substructure, and similarity searching • Expand spectral and chromatographic metadata • Additional integration with other EPA applications – Mostly just simple links to AMOS pages at the moment • Hoping to release to the public in 2024 24
  • 25. Summary • AMOS combines multiple kinds of analytical chemistry data – Primarily mass spectrometry data – Growing steadily for the foreseeable future • Data can be queried via a cheminformatically-oriented application • Intended to be useful as both an independent application and a way of augmenting other EPA applications 25
  • 26. Acknowledgements • Greg Janesch – Database and App Development • Sakuntala Sivasupramaniam – curation • Tyler Carr – curation, visualizations • Joshua Powell, Asif Rashid, Freddie Valone – assorted technical support If you want to help, send information regarding analytical methods and method articles to williams.antony@epa.gov 26