Reaction Representation and Structure Transformation with Ambit-SMIRKS.

1
C H 3 N N N N H OH (1) Effective representation of SMARTS Queries based on CDK (full Daylight syntax) (2) Fast structure isomorphism /mapping/ (3) Support of recursive SMARTS (4) Syntax extensions C H 3 N + N N N H Cl O - Reaction Representation and Structure Transformation with Ambit-SMIRKS. Application in Metabolite Prediction [1] N. Jeliazkova, N. Kochev, AMBIT-SMARTS: Efficient Searching of Chemical Structures and Fragments, Mol. Inf., 30: 707–720, 2011 [2] Jeliazkova N., Jeliazkov V. AMBIT RESTful web services: an implementation of the OpenTox application programming interface, Journal of Cheminformatics 2011, 3:18, doi:10.1186/1758-2946-3-18. [3] C. Steinbeck, Y. Han, S. Kuhn, O. Horlacher, E. Luttmann, E. Willighagen, The Chemistry Development Kit (CDK):  An Open-Source Java Library for Chemo- and Bioinformatics, J. Chem. Inf. Comput. Sci., 43: 493–500, 2003 [4] http://toxtree.sourceforge.net [5] P. Rydberg, D. Gloriam, J. Zaretzki, C. Breneman, L. Olsen, SMARTCyp: A 2D Method for Prediction of Cytochrome P450-Mediated Drug Metabolism, ACS Med. Chem. Lett., 2010, 1 (3), pp 96-100 (1) Parsing of SMIRKS linear notations into internal reaction (transformation) representations based on CDK objects (2) Application of the stored reactions against target molecules for actual transformation of the target chemical objects Nina Jeliazkova a* , Nikolay T. Kochev b , Patrik Rydberg c , Svetlana Avramova b a) Ideaconsult Ltd, 4 A. Kanchev str., Sofia 1000, Bulgaria b) University of Plovdiv, Department of Analytical Chemistry and Computer Chemistry, 24 Tsar Assen St., Plovdiv, Bulgaria c) Department of Drug Design and Pharmacology, Faculty of Health and Medical Sciences, University of Copenhagen *To whom correspondence should be addressed. e-mail: [email protected], twitter: @10705013 Ambit-SMIRKS is a new extension of the Ambit-SMARTS Java library [1], both part of the Ambit2 [2] project. Implemented on top of Chemistry Development Kit [3], the SMIRKS module is used to enable metabolite predictions in Toxtree [4], once that site of metabolisms are predicted by SMARTCyp [5]. References: Ambit-SMIRKS main tasks: Ambit-SMARTS basic features: (1) Toxtree is an open-source application that predicts various kinds of toxic effects, mostly by applying structural alerts, arranged in a decision tree fashion http://toxtree.sourceforge.net/predict/ (2) SMARTCyp (Cytochrome P450-Mediated Drug Metabolism) model is developed by Patrik Rydberg et al [5] and is included as Toxtree module since Toxtree 2.1.0. SMIRKS Application in metabolite generation Predictions via AMBIT web services (OpenTox API ) http://ambit.sf.net Application of Ambit-SMIRKS module for prediction of Omeprazole metabolism Sites of metabolism as predicted by SMARTCyp Ambit-SMIRKS generated metabolites Input SMIRKS: [N:1]1[C:2][C:3]=[C:4][C:5]1>>[N:1]1[C:2]=[C:3][C:4]=[C:5]1 Parse SMIRKS components [N:1]1[C:2][C:3]=[C:4][C:5]1 [N:1]1[C:2]=[C:3][C:4]=[C:5]1 Parse Mapping CDK object CDK object Parse Reactant SMARTS Parse Product SMARTS SMIRKS Parsing http://tinyurl.com/testreaction Basic metabolic reactions in SMARTCyp implemented by Toxtree Aliphatic hydroxylation Aromatic hydroxylation O-dealkylation Epoxidation Amine hydroxylation N-oxidation Thioester bond breaking Alcohol oxidation N-dealkylation Dioxolane demethylenation S-oxidation S-oxidation Aromatization of dihydropyridines Dihydropyrrole aromatization Aldehyde oxidation Desulphurization of phosphor NH N H C H 3 CH 3 CH 3 O H C H 3 CH 3 N C H 3 CH 3 CH 3 NH C H 3 O C H 3 CH 3 H O C H 3 CH 3 N C H 3 H CH 3 N C H 3 OH N H N O S C H 3 CH 3 O S C H 3 CH 3 O C H 3 O O C H 3 OH O R S R 1 O R OH R 1 SH + OH C H 3 P C H 3 CH 3 O C H 3 P C H 3 CH 3 S O O O H OH CH 2 O + O C H 3 H C H 2 O N H N H C H 3 CH 3 C H 3 CH 3 O Importance of metabolism prediction The transformations can be applied on various sites of the target molecule in several modes: (1) single (2) non-overlapping, (3) non-identical, (4) non-homomorphic or (5) externally specified list of sites. Transformation mapping modes Toxic metabolites are formed in some cases leading to unwanted adverse effects. Safety testing of metabolites has become a requirement according to the guidance for industry issued by regulatory institutions. O H N N N H NH O H N N NH N H N NH NH OH N N NH C - NH C - OH N N OH NH N N H SMARTS searching Transformation application 1 2 3 NH N NH NH OH N N NH NH OH N Input Target Molecule (SMILES, InChI, CML, MOL file) Reactant fragment found at 3 positions Potential incorrect structure due to mappings collision Non-overlapping mode Single mode Non-identical mode Work with mapping combinations (equivalent to the non-overlapping mode for this structure) Virtual safety testing (Ames models) Reaction mapping object CH 3 N N N N H Cl NH N N N H Cl Model 1 Model 2 Model 3 1 Clozapine 0 1 0 2 Norclozapine 0 1 0 3 Clozapine N-oxide 1 1 0 4 Hydroxyclozapine 0 1 0 (1) (4) (2) (3) CH 3 N C H 3 CH 3 CH 3 N C H 3 O - S C H 3 CH 3 O S C H 3 CH 3 1 4 5 1 2 2 3 3 4 5 metabolite metabolite metabolite

description

Best poster award at http://www.opentox.org/meet/opentoxeu2013/opentoxeuro13awards http://www.opentox.org/meet/opentoxeu2013/opentoxeu2013posters/ 7. Reaction Representation and Structure Transformation with Ambit-SMIRKS. Application in Metabolite prediction Ambit-SMIRKS is a new extension of the Ambit-SMARTS Java library [1], both part of the Ambit2 [2] project. The modules are implemented on top of the Chemistry Development Kit (CDK) [3]. Ambit-SMIRKS performs two main tasks: (1) parsing of SMIRKS linear notations into internal reaction (transformation) representations based on CDK objects and (2) application of the stored reactions against target molecules for actual transformation of the target chemical objects. The transformations can be applied on various sites of the target molecule in several modes: single, non-overlapping, non-identical, non-homomorphic or externally specified list of sites. Ambit-SMARTS implements the entire SMARTS language specification as defined by Daylight, plus additional syntax extensions to make software compliant with SMARTS modifications made by third party software packages such as OpenEye, MOE and OpenBabel. The SMIRKS library utilizes the SMARTS parser and the efficient substructure searching algorithm implemented within Ambit-SMARTS package [1]. Typically most SMIRKS implementations support SMILES plus simple SMARTS syntax features. However, Ambit-SMIRKS module supports full SMARTS syntax for reactions specification. The SMIRKS module is used to enable metabolite predictions in Toxtree (since version 2.5.0) [4], once the site of metabolism is predicted by SMARTCyp [5]. Toxtree is a flexible and user-friendly open-source application that predicts various kinds of toxic effects, mostly by applying structural alerts, arranged in a decision tree fashion. SMARTCyp (Cytochrome P450-Mediated Drug Metabolism) model is originally developed by Patrik Rydberg et al [5] and was included as Toxtree module since Toxtree 2.1.0. Ambit-SMIRKS functionality is available as a Java library as well as OpenTox Algorithm API compatible Web service. We also welcome testing the SMIRKS at the web page http://apps.ideaconsult.net:8080/ambit2/depict/reaction. References [1] N. Jeliazkova, N. Kochev, AMBIT-SMARTS: Efficient Searching of Chemical Structures and Fragments, Mol. Inf., 30: 707–720, 2011 [2] Jeliazkova N., Jeliazkov V. AMBIT RESTful web services: an implementation of the OpenTox application programming interface, Journal of Cheminformatics 2011, 3:18, doi:10.1186/1758-2946-3-18. [3] C. Steinbeck, Y. Han, S. Kuhn, O. Horlacher, E. Luttmann, E. Willighagen, The Chemistry Development Kit (CDK):  An Open-Source Java Library for Chemo- and Bioinformatics, J. Chem. Inf. Comput. Sci., 43: 493–500, 2003 [4] http://toxtree.sourceforge.net [5] P. Rydberg, D. Gloriam, J. Zaretzki, C. Breneman, L. Olsen, SMARTCyp: A 2D Method for Prediction of Cytochrome P450-Mediated Drug Metabolism

Transcript of Reaction Representation and Structure Transformation with Ambit-SMIRKS.

Page 1: Reaction Representation and Structure Transformation with Ambit-SMIRKS.

CH3

N

NN

NH

OH

(1) Effective representation of SMARTS Queries based on CDK (full Daylight syntax) (2) Fast structure isomorphism /mapping/ (3) Support of recursive SMARTS (4) Syntax extensions

CH3

N+

NN

NH

Cl

O-

Reaction Representation and Structure Transformation with Ambit-SMIRKS.

Application in Metabolite Prediction

[1] N. Jeliazkova, N. Kochev, AMBIT-SMARTS: Efficient Searching of Chemical Structures and Fragments, Mol. Inf., 30: 707–720, 2011 [2] Jeliazkova N., Jeliazkov V. AMBIT RESTful web services: an implementation of the OpenTox application programming interface, Journal of Cheminformatics 2011, 3:18, doi:10.1186/1758-2946-3-18. [3] C. Steinbeck, Y. Han, S. Kuhn, O. Horlacher, E. Luttmann, E. Willighagen, The Chemistry Development Kit (CDK):  An Open-Source Java Library for Chemo- and Bioinformatics, J. Chem. Inf. Comput. Sci., 43: 493–500, 2003 [4] http://toxtree.sourceforge.net [5] P. Rydberg, D. Gloriam, J. Zaretzki, C. Breneman, L. Olsen, SMARTCyp: A 2D Method for Prediction of Cytochrome P450-Mediated Drug Metabolism, ACS Med. Chem. Lett., 2010, 1 (3), pp 96-100

(1) Parsing of SMIRKS linear notations into internal reaction (transformation) representations based on CDK objects (2) Application of the stored reactions against target molecules for actual transformation of the target chemical objects

Nina Jeliazkova a*, Nikolay T. Kochev b, Patrik Rydbergc, Svetlana Avramova b a) Ideaconsult Ltd, 4 A. Kanchev str., Sofia 1000, Bulgaria

b) University of Plovdiv, Department of Analytical Chemistry and Computer Chemistry, 24 Tsar Assen St., Plovdiv, Bulgaria c) Department of Drug Design and Pharmacology, Faculty of Health and Medical Sciences, University of Copenhagen

*To whom correspondence should be addressed. e-mail: [email protected], twitter: @10705013

Ambit-SMIRKS is a new extension of the Ambit-SMARTS Java library [1], both part of the Ambit2 [2] project. Implemented on top of Chemistry Development Kit [3], the SMIRKS module is used to enable metabolite predictions in Toxtree [4], once that site of metabolisms are predicted by SMARTCyp [5].

References:

Ambit-SMIRKS main tasks: Ambit-SMARTS basic features:

(1) Toxtree is an open-source application that predicts various kinds of toxic effects, mostly by applying structural alerts, arranged in a decision tree fashion http://toxtree.sourceforge.net/predict/

(2) SMARTCyp (Cytochrome P450-Mediated Drug Metabolism) model is developed by Patrik Rydberg et al [5] and is included as Toxtree module since Toxtree 2.1.0.

SMIRKS Application in metabolite generation

Predictions via AMBIT web services (OpenTox API )

http://ambit.sf.net

Application of Ambit-SMIRKS module for prediction of Omeprazole metabolism

Sites of metabolism as predicted by SMARTCyp Ambit-SMIRKS generated metabolites

Input SMIRKS: [N:1]1[C:2][C:3]=[C:4][C:5]1>>[N:1]1[C:2]=[C:3][C:4]=[C:5]1

Parse SMIRKS components

[N:1]1[C:2][C:3]=[C:4][C:5]1 [N:1]1[C:2]=[C:3][C:4]=[C:5]1

Parse Mapping

CDK object CDK object

Parse Reactant SMARTS

Parse Product SMARTS

SMIRKS Parsing http://tinyurl.com/testreaction

Basic metabolic reactions in SMARTCyp implemented by Toxtree

Aliphatic hydroxylation Aromatic hydroxylation

O-dealkylation Epoxidation

Amine hydroxylation N-oxidation

Thioester bond breaking Alcohol oxidation

N-dealkylation Dioxolane demethylenation

S-oxidation S-oxidation

Aromatization of dihydropyridines

Dihydropyrrole aromatization

Aldehyde oxidation Desulphurization of phosphor

NH NH

CH3

CH3CH3

OH

CH3

CH3

NCH3 CH3

CH3

NHCH3

OCH3 CH3

H

OCH3

CH3

NCH3 H

CH3

NCH3 OH

NH

N

O

SCH3 CH3

O

SCH3 CH3O

CH3

O O

CH3 OH

O

R S

R1

O

R OHR

1SH+

OH

CH3P

CH3 CH3

OCH3P

CH3 CH3

S

O O OH

OH

CH2

O+

OCH3 H

CH2 O

NH NH

CH3 CH3

CH3 CH3

O

Importance of metabolism prediction

The transformations can be applied on various sites of the target molecule in several modes: (1) single (2) non-overlapping, (3) non-identical, (4) non-homomorphic or (5) externally specified list of sites.

Transformation mapping modes

Toxic metabolites are formed in some cases leading to unwanted adverse effects. Safety testing of metabolites has become a requirement according to the guidance for industry issued by regulatory institutions.

OH N

NNH

NH

OH N

N NHNH

NNH

NH

OHN

NNH

C-

NHC

-

OHN

NOH

NHNNH

SMARTS searching Transformation application

1

2

3

NH

NNH

NH

OHN

NNH

NH

OHN

Input Target Molecule (SMILES, InChI, CML, MOL file)

Reactant fragment found at 3 positions

Potential incorrect structure due to mappings collision

Non-overlapping mode

Single mode

Non-identical mode

Work with mapping combinations (equivalent to the non-overlapping mode for this structure)

Virtual safety testing (Ames models)

Reaction mapping object

CH3NN

N

NH

Cl

NH

NN

NH

Cl

Model 1 Model 2 Model 3

1 Clozapine 0 1 0

2 Norclozapine 0 1 0

3 Clozapine N-oxide 1 1 0

4 Hydroxyclozapine 0 1 0

(1)

(4) (2)

(3)

CH3

NCH3 CH3

CH3

NCH3 O

-

SCH3 CH3

O

SCH3 CH3

1 4

5 1

2 2 3 3

4

5

metabolite

metabolite metabolite