The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass...

40
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data Antony J. Williams , Andrew McEachran, Jon Sobus, Chris Grulke, Jennifer Smith, Michelle Krzyzanowski, Jordan Foster and Jeff Edwards National Center for Computational Toxicology U.S. Environmental Protection Agency, RTP, NC August 21-25, 2016 ACS Fall Meeting, Philadelphia, PA http://orcid.org/0000-0003-1423- 330X The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA

Transcript of The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass...

Page 1: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High

Resolution Mass Spectrometry Data

Antony J. Williams†, Andrew McEachran, Jon Sobus, Chris Grulke, Jennifer Smith, Michelle Krzyzanowski,

Jordan Foster and Jeff Edwards

National Center for Computational ToxicologyU.S. Environmental Protection Agency, RTP, NC

August 21-25, 2016ACS Fall Meeting, Philadelphia, PA

http://orcid.org/0000-0003-1423-330X

The views expressed in this presentation are those of the author and do not necessarily reflect the views or policies of the U.S. EPA

Page 2: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Who is NCCT?

• National Center for Computational Toxicology – part of EPA’s Office of Research and Development

• Research driven by EPA’s Chemical Safety for Sustainability Research Program– Develop new approaches to evaluate the safety of chemicals– Integrate advances in biology, biotechnology, chemistry, exposure

science and computer science

• Goal - To identify chemical exposures that may disrupt biological processes and cause adverse outcomes.

2

Page 3: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Our Dashboard Applications

• Some of our Web-based Applications

3

Page 4: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Introducing Our Latest Dashboard https://comptox.epa.gov

4

• >720,000 chemicals• >10 years assembling data

Page 5: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Bisphenol A

5

Page 6: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Physicochemical Properties

6

Page 7: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Bioassay Screening Data

7

Page 8: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Functional Use and Composition

8

Page 9: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Advanced MS Searches

9

Page 10: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Monoisotopic Mass Search

10

Page 11: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Monoisotopic Mass Search

11

Found 344 results for '215.096 ± 0.005 amu'

Page 12: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Formula Search

12

Page 13: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Formula Search

13

Found 8 results for 'C8H14ClN5'

Page 14: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Formula SearchingFormulae matching Bisphenol A

14

Page 15: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Formula Search Results

15

Page 16: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Download to Excel

16

Page 17: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Download as SDF file

17

Page 18: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

SDF file downloaded to desktop

18

Page 19: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Rank-Ordering of “Known-Unknowns” using ChemSpider

19

Page 20: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Comparing Performance

20

721k structures

Page 21: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Does the Dashboard Add Value?

• Remember:– Focus on high quality data and curation– Data sources include EPA data sources and a focus on

environmental chemistry

• No “dilution” by chemical vendors

21

Page 22: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Dilution Example…Morphine Skeleton

22

Page 23: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Bisphenol A as an exampleChemSpider: 1564 Structures

23

Page 24: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Bisphenol A as an exampleDashboard: 215 Structures

24

Page 25: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Chemical Identification Dashboard vs ChemSpider

Sorted by number of references (ChemSpider) or data sources (Dashboard)

Monoisotopic Mass (+/- 0.005 amu) Search

Position of compound sorted

Source of List # of Compounds

Search Tool Mean Position

Median Position #1 #2 #3 #4 #5+

McEachran et al Wastewater

34 ChemSpider 1.8 1 28 5 0 0 1

Dashboard 1.3 1 31 2 0 0 1

Misc. NTA Compounds 13 ChemSpider 2 1 7 5 0 0 1

Dashboard 1.7 1 10 2 0 0 1

Bade et al (2016) 19 ChemSpider 2.1 1 11 2 5 0 1Dashboard 1.6 1 12 3 3 1 0

Rager et al (2016) 24 ChemSpider 2.25 1 15 2 1 2 4 Dashboard 1.08 1 22 2 0 0 0

Page 26: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Dashboard vs ChemSpiderRanking Summary

Mass-based Searching Formula Based SearchingDashboard ChemSpider Dashboard ChemSpider

Cumulative Average Position 1.3 2.2 1.2 1.4% in #1 Position 85% 70% 88% 80%

• Selected peer-reviewed publications• 162 total individual chemicals in search

Page 27: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

ChemSpider 6926 Results!!!

27

Tacedinaline

Methyl Red

C.I Disperse Yellow 3

Page 28: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Using Functional Use to Sort Candidates

28

Anti-cancer Drug

Microbiological Indicator Dye

Textile/Product Dye

Page 29: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Same top hits – different ranking90 hits only versus 6926 hits

29

18

17

4 Tacedinaline

Methyl Red

C.I Disperse Yellow 3

Page 30: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Dashboard: External Links to Analytical Methods

30

Page 31: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

National Environmental Methods Index

31

Page 32: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

RSC Analytical Abstracts

32

Page 33: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Integrated Google Chemical Searches

33

Page 34: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Google Chemical Searches Enhanced with Query Terms

34

Page 35: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Non-Targeted Analysis Research

- 1 Dust Sample- Negative Ionization Mode- 300 Extracted “Molecular Features”

1) Prioritize “Molecular Features”

2) Correctly assign formulas

3) Correctly assign structures

4) Determine chemical sources

5) Predict chemical concentrations

C17H19NO3 12 µg/g

(1)

(2) (3) (4) (5)

What is contained in house dust, waste streams etc???

Page 36: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Previous Work with Suspect-Screening

The dashboard is being enhanced to support Non-targeted Analysis

Page 37: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Future Work

• Presently researching rank-ordering based on other criteria – Pubmed

• Additional links to methods – CDC NIOSH• Links to Mass Spec databases – Thermo’s

mzCloud, Massbank. Metlin etc. • Consider predicting metabolites and

degradants• Searching based on “MS-ready” structures

37

Page 38: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

“MS Ready” structures

• Many compounds are salts – searches should be on the “neutral form”

• Need to search for adducts (+Na, +K, +NH4), decarboxylation, loss of water etc.

38

Page 39: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Conclusions

• Dashboard support for MS is focused on NTA research – related to chemical exposure

• Dashboard outperforms ChemSpider for ranking chemicals of environmental concern

• New searches developed with Non-targeted Analysis in mind - new rank-ordering approaches in development

39

Page 40: The EPA iCSS Chemistry Dashboard to Support Compound Identification Using High Resolution Mass Spectrometry Data

Acknowledgements

EPA NCCTChris GrulkeJeff EdwardsAnn RichardJordan FosterJennifer SmithAndrew McEachran*Michelle Krzyzanowski

EPA NERLJon Sobus

* = ORISE Participant