The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass...
-
Upload
andrew-mceachran -
Category
Science
-
view
175 -
download
0
Transcript of The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High
Resolution Mass Spectrometry Data
Antony J. Williams†, Andrew McEachran, Jon Sobus, Chris Grulke, Jennifer Smith, Michelle Krzyzanowski,
Jordan Foster and Jeff Edwards
National Center for Computational ToxicologyU.S. Environmental Protection Agency, RTP, NC
August 21-25, 2016ACS Fall Meeting, Philadelphia, PA
http://orcid.org/0000-0003-1423-330X
The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA
Who is NCCT?
• National Center for Computational Toxicology – part of EPA’s Office of Research and Development
• Research driven by EPA’s Chemical Safety for Sustainability Research Program– Develop new approaches to evaluate the safety of chemicals– Integrate advances in biology, biotechnology, chemistry, exposure
science and computer science
• Goal - To identify chemical exposures that may disrupt biological processes and cause adverse outcomes.
2
Our Dashboard Applications
• Some of our Web-based Applications
3
Introducing Our Latest Dashboard https://comptox.epa.gov
4
• >720,000 chemicals• >10 years assembling data
Bisphenol A
5
Physicochemical Properties
6
Bioassay Screening Data
7
Functional Use and Composition
8
Advanced MS Searches
9
Monoisotopic Mass Search
10
Monoisotopic Mass Search
11
Found 344 results for '215.096 ± 0.005 amu'
Formula Search
12
Formula Search
13
Found 8 results for 'C8H14ClN5'
Formula SearchingFormulae matching Bisphenol A
14
Formula Search Results
15
Download to Excel
16
Download as SDF file
17
SDF file downloaded to desktop
18
Rank-Ordering of “Known-Unknowns” using ChemSpider
19
Comparing Performance
20
721k structures
Does the Dashboard Add Value?
• Remember:– Focus on high quality data and curation– Data sources include EPA data sources and a focus on
environmental chemistry
• No “dilution” by chemical vendors
21
Dilution Example…Morphine Skeleton
22
Bisphenol A as an exampleChemSpider: 1564 Structures
23
Bisphenol A as an exampleDashboard: 215 Structures
24
Chemical Identification Dashboard vs ChemSpider
Sorted by number of references (ChemSpider) or data sources (Dashboard)
Monoisotopic Mass (+/- 0.005 amu) Search
Position of compound sorted
Source of List # of Compounds
Search Tool Mean Position
Median Position #1 #2 #3 #4 #5+
McEachran et al Wastewater
34 ChemSpider 1.8 1 28 5 0 0 1
Dashboard 1.3 1 31 2 0 0 1
Misc. NTA Compounds 13 ChemSpider 2 1 7 5 0 0 1
Dashboard 1.7 1 10 2 0 0 1
Bade et al (2016) 19 ChemSpider 2.1 1 11 2 5 0 1Dashboard 1.6 1 12 3 3 1 0
Rager et al (2016) 24 ChemSpider 2.25 1 15 2 1 2 4 Dashboard 1.08 1 22 2 0 0 0
Dashboard vs ChemSpiderRanking Summary
Mass-based Searching Formula Based SearchingDashboard ChemSpider Dashboard ChemSpider
Cumulative Average Position 1.3 2.2 1.2 1.4% in #1 Position 85% 70% 88% 80%
• Selected peer-reviewed publications• 162 total individual chemicals in search
ChemSpider 6926 Results!!!
27
Tacedinaline
Methyl Red
C.I Disperse Yellow 3
Using Functional Use to Sort Candidates
28
Anti-cancer Drug
Microbiological Indicator Dye
Textile/Product Dye
Same top hits – different ranking90 hits only versus 6926 hits
29
18
17
4 Tacedinaline
Methyl Red
C.I Disperse Yellow 3
Dashboard: External Links to Analytical Methods
30
National Environmental Methods Index
31
RSC Analytical Abstracts
32
Integrated Google Chemical Searches
33
Google Chemical Searches Enhanced with Query Terms
34
Non-Targeted Analysis Research
- 1 Dust Sample- Negative Ionization Mode- 300 Extracted “Molecular Features”
1) Prioritize “Molecular Features”
2) Correctly assign formulas
3) Correctly assign structures
4) Determine chemical sources
5) Predict chemical concentrations
C17H19NO3 12 µg/g
(1)
(2) (3) (4) (5)
What is contained in house dust, waste streams etc???
Previous Work with Suspect-Screening
The dashboard is being enhanced to support Non-targeted Analysis
Future Work
• Presently researching rank-ordering based on other criteria – Pubmed
• Additional links to methods – CDC NIOSH• Links to Mass Spec databases – Thermo’s
mzCloud, Massbank. Metlin etc. • Consider predicting metabolites and
degradants• Searching based on “MS-ready” structures
37
“MS Ready” structures
• Many compounds are salts – searches should be on the “neutral form”
• Need to search for adducts (+Na, +K, +NH4), decarboxylation, loss of water etc.
38
Conclusions
• Dashboard support for MS is focused on NTA research – related to chemical exposure
• Dashboard outperforms ChemSpider for ranking chemicals of environmental concern
• New searches developed with Non-targeted Analysis in mind - new rank-ordering approaches in development
39
Acknowledgements
EPA NCCTChris GrulkeJeff EdwardsAnn RichardJordan FosterJennifer SmithAndrew McEachran*Michelle Krzyzanowski
EPA NERLJon Sobus
* = ORISE Participant