Granatum_LiSIs_BIBE_2012_presentation_v4.0

22
A Workflow System for Virtual Screening in Cancer Chemoprevention Kannas C. C. , Achilleos K. G., Antoniou Z., Kalvari I., Kirmitzoglou I., Nicolaou C. A., Promponas V. J., Pattichis C. S. 13 th November 2012 IEEE 12 th International Conference on BioInformatics & BioEngineering

Transcript of Granatum_LiSIs_BIBE_2012_presentation_v4.0

Page 1: Granatum_LiSIs_BIBE_2012_presentation_v4.0

A Workflow System for Virtual Screening in CancerChemoprevention

Kannas C. C., Achilleos K. G., Antoniou Z., Kalvari I., Kirmitzoglou I., Nicolaou C. A., Promponas V. J., Pattichis C. S.

13th November 2012IEEE 12th International Conference on BioInformatics &

BioEngineering

Page 2: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

2

About LiSIs platform Objectives What is LiSIs? Virtual Screening & Scientific Workflow Management Systems

LiSIs platform Virtual Screening Process Template Docking Models Predictive Models 3rd Party Tools The GRANATUM-LiSIs Platform

Concluding Remarks Acknowledgement Questions?

13th November 2012

Outline

Page 3: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

3

About LiSIs platform

13th November 2012

Page 4: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

4

Provide a set of tools to create virtual screening process for the discovery of novel agents with desired properties.

Provide a set of tools to create data-driven models designed to predict biochemical properties of interest.

Provide an environment to create, update, store and share Scientific Workflows, that is accessible via a web interface.

13th November 2012

Objectives

Page 5: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

5

Life Science Informatics Scientific Workflow Management System

Computational Environment for Virtual Screening. Cancer Chemoprevention Research

Computational Tools borrowed from Drug Discovery Process.

GRANATUM project (http://www.granatum.org) Partially funded by the European Commission under the

Seventh Framework Programme in the area of Virtual Physiological Human (ICT-2009.5.3).

13th November 2012

What is LiSIs?

Page 6: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

6

Virtual Screening Computational counterpart of biological screening. Goal: decrease the number of compounds physically

screened. Scientific Workflow Management Systems

(SWMS) Computational environments which facilitate the

design and execution of computational experiments (workflows).

Known SWMS: Taverna (http://www.taverna.org.uk/) KNIME (http://www.knime.org/ ) Galaxy (http://galaxy.psu.edu/)

13th November 2012

Virtual Screening & Scientific Workflow Management Systems

Page 7: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

7

LiSIs platformAvailable @ http://lisis.cs.ucy.ac.cy

13th November 2012

Page 8: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

8 13th November 2012

Virtual Screening Process Template

Input• GRANA

TUM platform File Loader

• File Readers

Preprocessing• File format

transformations

• Property Normalizer

• Descriptors Calculation

• Fragmentation

• Coordinates Calculation

• Protein Cleaner

Processing• Descriptor

Filters• Similarity

Search• Substructure

Search• Docking

Models• Predictive

Models

Postprocessing• Cleaning• Formatting• Merging

Output• Storage• Visualization

Page 9: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

9

Protein Preparation: Input

Protein.pdb file Process

Remove water molecules Add hydrogen atoms Calculate binding domain coordinates Clean protein from co-crystalized molecule

Output Modified protein.pdb file

13th November 2012

Docking Model PreparationDocking Model

Protein (pdb file)

Add Hydrogens

Remove H2

O

Calculate Pocket Coordinates

Separate reference ligand

Page 10: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

10 13th November 2012

Predictive Model Preparation

Predictive Model

Biological dataChemical dataAlgorithm

• Algorithm parameters

Page 11: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

11 13th November 2012

Predictive Model Preparation

• Chembl• Pubchem• Literature

Data gathering

Page 12: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

12

3rd Party Tools

13th November 2012

Page 13: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

13

Galaxy is an open, web-based platform for data intensive biomedical research.

Free public server. Deploy locally. Deploy in a Cluster/Grid environment Deploy in the cloud (CloudMan Amazon EC2).

http://galaxy.psu.edu/

13th November 2012

Galaxy

Page 14: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

14

Galaxy configuration: Web Server(s) (Handle Users Requests) Job Manager (Job Management) Job Handler(s) (Job Execution)

Galaxy Tools are wrappers around command line applications.

Galaxy runs command line jobs at the background.

13th November 2012

Galaxy Integration Methodology

Page 15: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

15

A collection of cheminformatics and machine-learning software written in C++ and Python.

The core algorithms and data structures are written in C++. Wrappers are provided to use the toolkit from either Python or Java.

Additionally, the RDKit distribution includes a PostgreSQL-based cartridge that allows molecules to be stored in relational database and retrieved via substructure and similarity searches.

Features Overview RDKit Home Page

13th November 2012

RDKit

Page 16: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

16

R is a language and environment for statistical computing and graphics.

R provides a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, ...) and graphical techniques, and is highly extensible.

One of R's strengths is the ease with which well-designed publication-quality plots can be produced, including mathematical symbols and formulae where needed. Great care has been taken over the defaults for the minor design choices in graphics, but the user retains full control.

The R Project13th November 2012

The R Project

Page 17: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

17

AutoDock Vina is a new open-source program for drug discovery, molecular docking and virtual screening, offering multi-core capability, high performance and enhanced accuracy and ease of use.

Features: Accuracy Compatibility with AutoDock Tools Ease of Use Flexible Side Chains

Some receptor side chains can be chosen to be treated as flexible during docking.

Speed AutoDock Vina

13th November 2012

AutoDock Vina

Page 18: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

18

LiSIs platform DemoAvailable @ http://lisis.cs.ucy.ac.cy

13th November 2012

Page 19: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

19

http://lisis.cs.ucy.ac.cy

13th November 2012

LiSIs platform Live Demo!!!

Page 20: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

20

Fill a current void in chemoprevention, and in general life sciences, research.

Enabling researchers to utilize state of the art computational techniques to search for the discovery of novel agents with desired properties.

13th November 2012

Concluding Remarks

Page 21: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

21

Acknowledgement

13th November 2012

Computer Science, UCYProf. C. S. PattichisC. A. Nicolaou (PhD)Z. AntoniouK. G. AchilleosBiological Sciences, UCYDr. V. J. PromponasI. KirmitzoglouI. Kalvari

Cancer Biology and Chemoprevention laboratory at Department of Biological Sciences, UCY.

Cancer Chemoprevention and Epigenomics Workgroup at German Cancer Research Center.

Members of the GRANATUM consortium.

Page 22: Granatum_LiSIs_BIBE_2012_presentation_v4.0

IEEE 12th International Conference on BioInformatics & BioEngineering

22 13th November 2012

Questions?