Granatum_LiSIs_BIBE_2012_presentation_v4.0
-
Upload
christos-kannas -
Category
Technology
-
view
200 -
download
0
Transcript of Granatum_LiSIs_BIBE_2012_presentation_v4.0
A Workflow System for Virtual Screening in CancerChemoprevention
Kannas C. C., Achilleos K. G., Antoniou Z., Kalvari I., Kirmitzoglou I., Nicolaou C. A., Promponas V. J., Pattichis C. S.
13th November 2012IEEE 12th International Conference on BioInformatics &
BioEngineering
IEEE 12th International Conference on BioInformatics & BioEngineering
2
About LiSIs platform Objectives What is LiSIs? Virtual Screening & Scientific Workflow Management Systems
LiSIs platform Virtual Screening Process Template Docking Models Predictive Models 3rd Party Tools The GRANATUM-LiSIs Platform
Concluding Remarks Acknowledgement Questions?
13th November 2012
Outline
IEEE 12th International Conference on BioInformatics & BioEngineering
3
About LiSIs platform
13th November 2012
IEEE 12th International Conference on BioInformatics & BioEngineering
4
Provide a set of tools to create virtual screening process for the discovery of novel agents with desired properties.
Provide a set of tools to create data-driven models designed to predict biochemical properties of interest.
Provide an environment to create, update, store and share Scientific Workflows, that is accessible via a web interface.
13th November 2012
Objectives
IEEE 12th International Conference on BioInformatics & BioEngineering
5
Life Science Informatics Scientific Workflow Management System
Computational Environment for Virtual Screening. Cancer Chemoprevention Research
Computational Tools borrowed from Drug Discovery Process.
GRANATUM project (http://www.granatum.org) Partially funded by the European Commission under the
Seventh Framework Programme in the area of Virtual Physiological Human (ICT-2009.5.3).
13th November 2012
What is LiSIs?
IEEE 12th International Conference on BioInformatics & BioEngineering
6
Virtual Screening Computational counterpart of biological screening. Goal: decrease the number of compounds physically
screened. Scientific Workflow Management Systems
(SWMS) Computational environments which facilitate the
design and execution of computational experiments (workflows).
Known SWMS: Taverna (http://www.taverna.org.uk/) KNIME (http://www.knime.org/ ) Galaxy (http://galaxy.psu.edu/)
13th November 2012
Virtual Screening & Scientific Workflow Management Systems
IEEE 12th International Conference on BioInformatics & BioEngineering
7
LiSIs platformAvailable @ http://lisis.cs.ucy.ac.cy
13th November 2012
IEEE 12th International Conference on BioInformatics & BioEngineering
8 13th November 2012
Virtual Screening Process Template
Input• GRANA
TUM platform File Loader
• File Readers
Preprocessing• File format
transformations
• Property Normalizer
• Descriptors Calculation
• Fragmentation
• Coordinates Calculation
• Protein Cleaner
Processing• Descriptor
Filters• Similarity
Search• Substructure
Search• Docking
Models• Predictive
Models
Postprocessing• Cleaning• Formatting• Merging
Output• Storage• Visualization
IEEE 12th International Conference on BioInformatics & BioEngineering
9
Protein Preparation: Input
Protein.pdb file Process
Remove water molecules Add hydrogen atoms Calculate binding domain coordinates Clean protein from co-crystalized molecule
Output Modified protein.pdb file
13th November 2012
Docking Model PreparationDocking Model
Protein (pdb file)
Add Hydrogens
Remove H2
O
Calculate Pocket Coordinates
Separate reference ligand
IEEE 12th International Conference on BioInformatics & BioEngineering
10 13th November 2012
Predictive Model Preparation
Predictive Model
Biological dataChemical dataAlgorithm
• Algorithm parameters
IEEE 12th International Conference on BioInformatics & BioEngineering
11 13th November 2012
Predictive Model Preparation
• Chembl• Pubchem• Literature
Data gathering
IEEE 12th International Conference on BioInformatics & BioEngineering
12
3rd Party Tools
13th November 2012
IEEE 12th International Conference on BioInformatics & BioEngineering
13
Galaxy is an open, web-based platform for data intensive biomedical research.
Free public server. Deploy locally. Deploy in a Cluster/Grid environment Deploy in the cloud (CloudMan Amazon EC2).
http://galaxy.psu.edu/
13th November 2012
Galaxy
IEEE 12th International Conference on BioInformatics & BioEngineering
14
Galaxy configuration: Web Server(s) (Handle Users Requests) Job Manager (Job Management) Job Handler(s) (Job Execution)
Galaxy Tools are wrappers around command line applications.
Galaxy runs command line jobs at the background.
13th November 2012
Galaxy Integration Methodology
IEEE 12th International Conference on BioInformatics & BioEngineering
15
A collection of cheminformatics and machine-learning software written in C++ and Python.
The core algorithms and data structures are written in C++. Wrappers are provided to use the toolkit from either Python or Java.
Additionally, the RDKit distribution includes a PostgreSQL-based cartridge that allows molecules to be stored in relational database and retrieved via substructure and similarity searches.
Features Overview RDKit Home Page
13th November 2012
RDKit
IEEE 12th International Conference on BioInformatics & BioEngineering
16
R is a language and environment for statistical computing and graphics.
R provides a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, ...) and graphical techniques, and is highly extensible.
One of R's strengths is the ease with which well-designed publication-quality plots can be produced, including mathematical symbols and formulae where needed. Great care has been taken over the defaults for the minor design choices in graphics, but the user retains full control.
The R Project13th November 2012
The R Project
IEEE 12th International Conference on BioInformatics & BioEngineering
17
AutoDock Vina is a new open-source program for drug discovery, molecular docking and virtual screening, offering multi-core capability, high performance and enhanced accuracy and ease of use.
Features: Accuracy Compatibility with AutoDock Tools Ease of Use Flexible Side Chains
Some receptor side chains can be chosen to be treated as flexible during docking.
Speed AutoDock Vina
13th November 2012
AutoDock Vina
IEEE 12th International Conference on BioInformatics & BioEngineering
18
LiSIs platform DemoAvailable @ http://lisis.cs.ucy.ac.cy
13th November 2012
IEEE 12th International Conference on BioInformatics & BioEngineering
19
http://lisis.cs.ucy.ac.cy
13th November 2012
LiSIs platform Live Demo!!!
IEEE 12th International Conference on BioInformatics & BioEngineering
20
Fill a current void in chemoprevention, and in general life sciences, research.
Enabling researchers to utilize state of the art computational techniques to search for the discovery of novel agents with desired properties.
13th November 2012
Concluding Remarks
IEEE 12th International Conference on BioInformatics & BioEngineering
21
Acknowledgement
13th November 2012
Computer Science, UCYProf. C. S. PattichisC. A. Nicolaou (PhD)Z. AntoniouK. G. AchilleosBiological Sciences, UCYDr. V. J. PromponasI. KirmitzoglouI. Kalvari
Cancer Biology and Chemoprevention laboratory at Department of Biological Sciences, UCY.
Cancer Chemoprevention and Epigenomics Workgroup at German Cancer Research Center.
Members of the GRANATUM consortium.
IEEE 12th International Conference on BioInformatics & BioEngineering
22 13th November 2012
Questions?