Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of...

27
Automated Read-Across for REACH Dr. Anne Bonhoff April 25, 2017

Transcript of Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of...

Page 1: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

Automated Read-Across for REACH

Dr. Anne BonhoffApril 25, 2017

Page 2: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

UL Collaboration to Develop Predictive Toxicity

Tool for Read-across

UL Collaborates with Prof. Dr. Thomas Hartung,

Chair for Evidence-based Toxicology at Johns

Hopkins Bloomberg School of Public Health,

Baltimore, and University of Konstanz, Germany;

also Director of their Centers for Alternatives

to Animal Testing (CAAT)

and Thomas Luechtefeld,

a PhD student,

and UL’s team of scientists and toxicologists.

2

Page 3: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

3

Read-across

Data gap filling

concluding from

(structurally) similar

chemicals

Category approach

Test only

representatives of a

group of similar

chemicals or

complex mixtures

REACH Data-rich

substances registered

2010 and 2013:

75% of dossiers use read-

across

Other alternatives hardly

used

Expertise in industry low

Low acceptance by EChA

Page 4: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

The opportunity of Big Data & Bioinformatics

Page 5: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

Deep Learning

Page 6: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

6

Big biological data in toxicology

Unified and curated data-base

Page 7: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

7

ALTEX

April

2016

issue

Page 8: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

A marriage of technologiesRead-across• Support weight of

evidence• Circumstantial• Manual• Unclear acceptability

(Q)SAR• Data-mining by

computer• Broader applicability• Can be validated with

enormous consequences for acceptability

Automated Read-Across = “REACH-across”• Mines local “similarity space”• Comprehensive use of available data• Expresses certainty• Validation on the way

Page 9: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

REACHACROSS™

UL and the UL logo are trademarks of UL LLC © 2017. Proprietary & Confidential.

Page 10: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

REACHACROSS™ SOFTWARE PIPELINE

UL and the UL logo are trademarks of UL LLC © 2017. Proprietary & Confidential.

Data Source

+

Similarity

=

Graph Algorithms

0 1 0 0 1 1 0 𝑀(0 1 0 0 . . . , 0 0 1 0 . . .) = −∞ ≤ 𝑥 ≤ ∞

MetricFingerprinter

Page 11: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

SIMPLE MODELS

11UL and the UL logo are trademarks of UL LLC © 2017. Proprietary & Confidential.

A B90%

Page 12: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

NETWORK EFFECT

12UL and the UL logo are trademarks of UL LLC © 2017. Proprietary & Confidential.

Page 13: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

INCREASING RELEVANCE

13UL and the UL logo are trademarks of UL LLC © 2017. Proprietary & Confidential.

Unlabeled EINECS ----- Labeled ANNEX• 1387 ANNEX SMILES• 33383 EINECS SMILES

300 ANNEX Chemicals600 ANNEX Chemicals1387 ANNEX Chemicals

Page 14: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

NETWORK FEATURES

14UL and the UL logo are trademarks of UL LLC © 2017. Proprietary & Confidential.

Page 15: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

?+

+

+

+

+

+

+

++

+

-

-

-

---

-

-

-

-

-

-

-

--

-

-

-

-

--

-

-

--

-

-

-

-

-

- --

++

+

+

+

+

+

+

+

+

Local similarity

= prediction

Regional similarity

= uncertainty

-

+

+

+

+

-

-

--

Positive =proximity to positives

Negative =proximity to negatives

Confidence =the more data the closer

Page 16: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

CHEMICAL PLANET

16UL and the UL logo are trademarks of UL LLC © 2017. Proprietary & Confidential.

Page 17: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

Toxic

Non-toxic

# of

substances

By choosing the cut-off, the balance

of sensitivity and specificity is

changed

Test performance characteristics are a choice:

Test Sensitivity vs. Specificity

For REACH-across this is even more complex:Two thresholds (proximity to positives and to negatives)

These choices influence coverage, i.e. for how many

unknown chemicals a prediction can be made

Page 18: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

Skin sensitization (Luechtefeld et al., 2016):

Simple classification by nearest neighbor

Accuracies better than different animal TG

against each other, but coverage drops

Page 19: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

Status of the tool March 2017

All endpoints for REACH

2018 <10 tons/a

Coming soon:

acute fish toxicity Internal validation

for unprecedented

number of

chemicals

Accuracy similar

to animal test

Works for most

chemicals, to

increase with more

databases

incorporated

Page 20: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining
Page 21: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

ULREACHACROSS.COM

Page 22: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

ULREACHACROSS.COM

Smiles code for Methanol

Select Endpoint

Page 23: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

ULREACHACROSS.COM

Request Name

Smiles code for Methanol

Select Endpoint

Page 24: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

ULREACHACROSS.COM

Select EndpointReport Ready

Page 25: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

REACHACROSS™ REPORT

Page 26: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

QUESTIONS?

ULREACHACROSS.COM

Page 27: Automated Read-Across for REACH · A marriage of technologies Read-across • Support weight of evidence • Circumstantial • Manual • Unclear acceptability (Q)SAR • Data-mining

THANK YOU