Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v...

34
1 Agricultural Products Group ChemAxon’s Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon, Henry Liu

Transcript of Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v...

Page 1: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

1

Agricultural Products Group

ChemAxon’s Marvin & JChem (v 3.1.3)vs.

MDL® ISIS/Draw ISIS/Host (v 4.0)

Seong Jae Yu, David Roush, Usha GaneshYoung Moon, Henry Liu

Page 2: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

2

Agricultural Products Group

• Business Discussion

• Scientific Evaluation

• Technical Evaluation

Outline

Comparison of MDL® ISIS/Host with JChem Base

Page 3: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

3

Agricultural Products Group

Business Evaluation

• Customer Base

• Customer Support

• Cost/Benefit

Page 4: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

4

Agricultural Products Group

Business Evaluation

• Customer Base

• Customer Support

• Cost/Benefit

Page 5: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

5

Agricultural Products Group

ChemAxon Clients

Industrial Clients: >200

Academic Clients: >900

For specific information: http://www.chemaxon.com/aboutus.html

Page 6: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

6

Agricultural Products Group

Business Evaluation

• Customer Base

•Customer Support

• Cost/Benefit

Page 7: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

7

Agricultural Products Group

Business Evaluation

• Customer Base

• Customer Support

• Cost/Benefit

Page 8: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

8

Agricultural Products Group

Scientific Evaluation

• DatabaseUsed 1.8 million compounds to create a testing database

• Searches 51 simple sub-structure searches 51 similarity searches

64 complex searchestautomers, double bond, stereochemistry, multiple substituents, valence, ring size, chain length, etc.

Total 115 structures were evaluated

Page 9: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

9

Agricultural Products Group

Substructure Search Comparison (1)

NN

NO

O

N

NO

O

NONN

Query MDL® ChemAxon

5545 hits

79 hits

6087 hits

3 hits

5545 hits

79 hits

6087 hits

3 hits

25 out of 51 searches showed identical results

Page 10: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

10

Agricultural Products Group

Substructure Search Comparison (2)

N

N

O

Single/aromatic

double

Query ChemAxon

43 hits 10 hits

Absent

N

N

O

double/aromatic

43 HitsNew Query

Change Bond Definition

S

N

NN

O

O

S

N

NN

O

O

N

N

N

NO

O

N

N

N

NO

O

MDL®

Page 11: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

11

Agricultural Products Group

Substructure Search Comparison (3)

N

NO

O

single

Query ChemAxon

153 hits 146 hits

AbsentN

O

NO

R

SN N

NR

F FF

O

O

MDL®

Page 12: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

12

Agricultural Products Group

Differences in MDL® Aromatic Definition

N

NO

O

single/aromatic

MDL® uses a precise definition for aromaticity of 6-membered rings

N

N

O

O

N

N

O

O

N

N

O

O

N

N

O

O

Ph

CN

N

NO

O

single

N

O

NO

R

SN

N

N

R

F FF

O

O

Query 2Query 1

MDL® uses a dual definition for aromaticity of 5-membered rings

Page 13: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

13

Agricultural Products Group

Substructure Search Comparison (4)

Query ChemAxon

4959 hits 5020 hitsN

N N

N

N

N

N N

Absent

MDL®

Page 14: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

14

Agricultural Products Group

NO

NO

O

NO

NO

O

NOT Lists

ChemAxon

1 hit 4 hits

Query

NO

NNOT [F,Cl,Br,I]

N

N

O

OO

H

N

NO

N

H

N

NO

OO

H

MDL®

Page 15: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

15

Agricultural Products Group

Bidentate R groups

R

OH

ChemAxon

N/A hits 656 hits

MDL®

Page 16: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

16

Agricultural Products Group

Atom Types

MDL® can not specify aromatic or aliphatic atoms

ChemAxon can specify aromatic or aliphatic atoms

O

ChemAxon Hits

Oa

OA

14

216

230

Query

Aromatic

Aliphatic

Aromatic/Aliphatic

Page 17: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

17

Agricultural Products Group

Similarity Search Comparison (1)

N

NN

O

S

O

Query ChemAxon

0 hits(70%)

4 hits(75%)

ChemAxon Hits

N

NN

O N

S

OO N

N

O N

S

O N

N

N

S

O

O

N

O

N

NN

S

O

O

N

O

MDL®

Page 18: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

18

Agricultural Products Group

NNO

Similarity Search Comparison (2)

Query ChemAxon

308 hits 302 hits

N

N

O

N

N

O

N

N

O

N

N

OO

NNO 1-3

MDL®

Page 19: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

19

Agricultural Products Group

Overall Performance

BRIM_ID Substructure Similarity Substructure SimilarityBID-10007608 2.5 36.3 1 5BID-10055418 7.4 9.4 6 3BID-10622149 8.9 33.6 8 3BID-10699051 6.7 13.9 19 3BID-10978789 2.6 18.3 0 2BID-11192424 2.0 10.2 7 2BID-11252857 8.3 17.5 20 5BID-11706292 4.1 31.9 2 2BID-11800796 3.2 10.3 21 3BID-11885540 2.0 15.9 0 3

Average 4.8 19.7 8.4 3.1

MDL® (sec) Chemaxon (sec)

• Measure the performance (sec) based on 3 simultaneous users

Page 20: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

20

Agricultural Products Group

ChemAxon – Product Overview

Page 21: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

21

Agricultural Products Group

ChemAxon Standardizer

• Can remove counterions

• Aromaticity definition

• Structure standardization

Page 22: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

22

Agricultural Products Group

ChemAxon Standardizer

• Can remove counterions• Aromaticity definition• Structure standardization

N NHCl

Page 23: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

23

Agricultural Products Group

ChemAxon Standardizer

• MDL® treats 5-membered heterocycles as non-aromatic

(Kekulé structure)

• ChemAxon aromatizes any 4n+2 ring system

ISISKekulé structure

ChemAxonResonant structure

S S

• Can remove counterions• Aromaticity definition• Structure standardization

Page 24: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

24

Agricultural Products Group

• Can remove counterions• Aromaticity definition• Structure standardization

5 hits in database 203 hits in database

NS

N

NS

N

NS

N

ChemAxon Standardizer

Page 25: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

25

Agricultural Products Group

ChemAxon Standardizer

• Can remove counterions• Aromaticity definition• Structure standardization

N+

O

O

NO

O

N

O

N+

O

25753 hitsin database

9 hitsin database

1105 hitsin database

92 hitsin database

Page 26: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

26

Agricultural Products Group

Scientific Conclusion

• ChemAxon

- Is more logical and chemically meaningful in searches

- Can perform bidentate and atom-type searches

- Showed better search performance

- Can integrate other modules

- Flexible

• MDL®

- Can give unexpected results in similarity searches

- Databases (MDDR, ACD, REACCS)

Page 27: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

27

Agricultural Products Group

Technical Comparison

Transparency

Proprietary (opaque) data/table structure

Clear understanding of: Flow of Data Structure of Data Execution Process

SDFile Processing

Lack of necessary tools to process large SDFiles

Requires several custom programs

Simple SDFile processing and manipulation

ISIS ChemAxon

ISIS ChemAxon

Page 28: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

28

Agricultural Products Group

1 Extract Ref #s per vendor 1 hr

2 Pipeline Pilot: split vendor files to {existing, discontinued, and new} per vendor

3 hr

3 Combine all the new files, filter, and generate unique/duplicate files 1 hr

4 Run the unique structures in a temp database in ISIS for uniqueness 1 hr

5 Remove duplicates from (4)

6 Export current ISIS 4 hr

7 Compare the unique file with export from (6) – generate unique and duplicate sdf

2 hr

8 Check if any of unique structures from (7) exists in ISIS 4 hr

9 If any from (8) remove from unique and duplicate files 1 hr

10

Assign compound ids to the new compounds 1 hr

11

Import into ISIS 2 hr

12

Create index 4 hr

13

Populate the Oracle master/detail tables 1 hr

14

Verify and Delete discontinued compounds 3 hrs

Testing at every stage 3 hrs

Total 31 hrs

Technical ComparisonExample of processing SD files in MDL® ISIS/Host

Page 29: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

29

Agricultural Products Group

Technical ComparisonExample of processing SD files in ChemAxon

1 Populate filters in a table (Non-recurring)

7 hr2 Import vendor file into temporary table

3 Compare the vendor table to filter table and delete filtered

4 Run a procedure that insert new compounds into master/detail tables and flag discontinued.

5 (1-4) for all vendors

6 Create index on master table 0 hr

7 Verify dependencies and delete the compounds from detail table 4 hr

Total 11 hr

Page 30: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

30

Agricultural Products Group

Supported Platforms • ISIS® : Sun Solaris®, Windows® Servers• JChem: Sun Solaris®, Windows® Servers, Linux®, Irix®, MAC®

Technical Comparison

Technologic Transparency• ISIS®:Unclear Data/Table Structures• JChem: Clear Understanding of:

Flow of Data Structure of Data Execution Process

Native Oracle Tables and Procedures

Processing SD Files• ISIS®:31 hours, Pipeline® Pilot & ISIS® • JChem: 11 hours, JChem

Supported Databases• ISIS®:Oracle®• JChem: Oracle®, MySQL®, SQL Server, PostgreSQL, Access™, DB2

Performance• ISIS®:Slow similarity search• JChem: Fast similarity search

Page 31: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

31

Agricultural Products Group

Technical Conclusion

• Clear and straightforward understanding of– Data Representation

– System Architecture

• Integrated system – Quicker and less error-prone

– Less hassle for software development

• From technical point of view, ChemAxon is favorable

Page 32: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

32

Agricultural Products Group

Summary: Business

ChemAxon was the better choice

Page 33: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

33

Agricultural Products Group

Major Differences: result from aromatic bond definitions

• Used 1.8 million vendor compounds to create a testing database

• Prepared 115 different structures for comparison

51 simple sub-structure search 51 similarity search 64 complex search

Summary: Scientific

• Not List includes only your choices with ChemAxon

• Bidentate searches and atom types possible in ChemAxon

• MDL® has MDDR, ACD and REACCS databases

Page 34: Agricultural Products Group 1 ChemAxons Marvin & JChem (v 3.1.3) vs. MDL® ISIS/Draw ISIS/Host (v 4.0) Seong Jae Yu, David Roush, Usha Ganesh Young Moon,

34

Agricultural Products Group

Summary: Technical

• Clear and straightforward understanding of: Data Representation System Architecture

• Integrated system