Practical Pgn2 (1)

download Practical Pgn2 (1)

of 8

Transcript of Practical Pgn2 (1)

  • 7/30/2019 Practical Pgn2 (1)

    1/8

    Structural Bioinformatics Practical Session

    Today, in the basic part you will learn now to use the automated docking software Autodock3 andAutoDockTools for docking the ligands in the active site of a protein. In the advanced part, you willperform a similarity search with specified properties and molecular constitution based on the naturalligand, dock the hits and rank based on their docking energies.

    The protein used in the current exercise is trypsin. Trypsin is a proteolytic enzyme found in thesmall intestine which hydrolyses proteins into smaller peptides or aminoacids. The aspartate residue(ASP189) in the active site is responsible for this activity. Trypsin cleaves the proteins at the C-terminal side of the amino acids lysine and arginine except, when either is followed by proline.

    Keywords: AutoDock, AutoDockTools,In-silico Docking, Similarity Search and Virtual Screening

    Basic Part -In-Silico Docking

    The aim of this part is to get used to the docking software by re-docking the ligand present in thecrystal structure of a protein using an automated docking suite called 'AutoDock'. The GUI forAutoDock is AutoDockTools (ADT), which will be used to perform the entire docking task. Moreinformation is available on the AutoDock suite homepage http://autodock.scripps.edu/

    The latest versions of ADT and autodock are available from the home page.

    Getting Started

    1.Open a terminal window. Create a directory called 'docking' by typing 'mkdir docking'.Type 'cddocking' to enter the docking directory.

    ImportantYou need to set the stack size unlimited for AutoDock and AutoGrid programs. Depending on theshell you are using, enter one of the commands below. You can see your shell type with thecommand echo $SHELL

    For bash users Type 'ulimit -s unlimited' and press . For csh / kcsh users Type 'limit stacksize unlimited' and press .

    2.Fetch the protein-ligand completed 3PTB.pdb from Protein Data Bank (www.pdb.org), andsave in the directory 'docking'.Now you need to extract the protein and ligand coordinates into separate files.Extract Protein and Ligand from the 3PTB.pdb2.1.Using grep

    [user@host1 docking]$grep ATOM 3PTB.pdb > protein.pdb[user@host1 docking]$grep HETATM 3PTB.pdb | grep HOH -v | grep CA -v > ligand.pdb

    3.Start the ADT

    3.1.Type 'adt&' in the terminal and press . After ADT has loaded uncheck the button'float Camera' to join the viewer and the menu of ADT

    ADT is a viewer used to run docking jobs for AutoDock. Every structure in the ADT can berotated using the middle mouse button, translated by pressing right mouse button and it can bezoomed by pressing 'SHIFT' button on keyboard and middle mouse button.

  • 7/30/2019 Practical Pgn2 (1)

    2/8

    After ADT is loaded, begin the docking task

    The Docking Task

    4. Preparing the Protein file protein.pdbqs

    4.1.Open the protein.pdb file File > Read Molecule

    4.2.Color the protein Color > By Atom Type

    A dialog box appears, check 'lines' and press 'OK'

    4.3.Adding polar hydrogens to the protein Edit > Hydrogens > Add

    Choose 'Only Polar' from the dialog box and press 'OK'

    4.4.Adding charges Edit > Charges > Add Kollman Charges

    Total charge added is desplayed in a dialog box. Press 'OK'

    4.5.Check total charges on the protein Edit > Charges > Check Totals on Residues

    A dialog box appears displaying no residues with integral chrarge found. Press 'OK'

    4.6.Adding solvent parameters and save as .pdbqs files Grid > Macromolecule > Add Solvent Parameters.

    A dialog box Is Molecule already in viewer? appears. Press 'Yes' and select theprotein from the list. Press 'Select Molecule'. It will now prompt to save the file. Save itas protein.pdbqs in the directory 'docking'.

    4.7.Remove the protein file from the viewer Edit > Delete > Delete AtomSet

    5.Preparing Ligand file - ligand.pdbq5.1.Opent the ligand.pdb files

    Ligand > Input Molecule > ReadMolecule

    Choos filetype as '.pdb' and read the ligand.pdb file.A dialog box displaying the information about ligand appears, press OK

    5.2.Adding charges Edit > Charges > Compute Gasteiger

    Total charge added is displayed in a dialog. Press OK

    5.3.Check total charges on the ligand Edit > Charges > Check Totals on residues

    A dialog box appears displaying no residues with integral charge found. Press 'OK'

    5.4.Define Rigid root for ligand Ligand > Define rigid root automatically

    You can see a green sphere in the molecule which is defined as root. Root is the centerabout which the ligand is rotated during the docking procedure.

  • 7/30/2019 Practical Pgn2 (1)

    3/8

    5.5.Define rotatable bonds in the ligand

    Ligand > Rotatable Bonds > Define Rotatable BondsYou can see the rigid bonds colored red and flexible or rotatable bonds colored green.Keep the default options and press 'Done'.

    5.6.Setting the active torsions Ligand > Rotatable Bonds > Set number of active Torsions

    Keep the default values in the dialog box. Click on the text box and press . Thenpress 'Dismiss'

    5.7.Save the ligand.pdbq file in directory 'docking' Ligand > Write PDBQ

    Save this file as ligand.pdbq in directory 'docking'

    5.8.Remove the ligand from the viewer. Edit > Delete > Delete AtomSet

    6.AutoGrid6.1.Select the protein files

    Grid > Macromolecule > Read MacromoleculeSelect the protein.pdbqs file from the directory 'docking'. A dialog box displaying theinformation about protein appears. Press 'OK'

    6.2.Select the ligand file Grid >Set Map Types > By Reading Formatted Files

    Select the ligand.pdbq file from the directory 'docking'

    A dialog box displaying ligand atom types appears, keep the default values and press'Accept'Now you can see the ligand in the active site of the protein.

    6.3.Setting the grid box on the protein Grid > Set Grid

    You will see a dialog box showing the default dimensions of the grid box. For thisdocking you will set the grid box centered on the ligand in the complex.In the grid dialog box,

    Set X Y Z dimensions to 60 each Center > Center on ligand

    Visualize the modified grid box View > Show Box

    Toggle the show box button Save the grid settings

    File > Close saving current

    6.4.Write the Grid Parameter file. Grid > Write GPF

    Save the file as protein.gpf in the direcotry docking

    6.5.Start AutoGrid Run > Start AutoGrid

    In the dialog box press 'Launch'Autogrid will run in background and when it completes a message autogrid3:Successfulcompletion will appear in the terminal from where ADT was launched.

  • 7/30/2019 Practical Pgn2 (1)

    4/8

    Wait till Autogrid completes.

    7.AutoDock7.1.Select the protein

    Docking > Set Macromolecule > Choose MacromoleculeSelect the protein file from list and press 'Select Molecule'

    7.2.Select the ligand Docking > Set Ligand Parameters > Choose Ligand

    Select the ligand file from list and press 'Select Molecule'. A dialog box displaying'AutoDpf Ligand' parameters appears, keep the default values and press 'Accept'.

    7.3.Set search parameters for docking' Docking > Set search parameters > Genetic Algorithm Parameters

    A dialog displaying the 'GA' parameters appears. Keep the default values and press

    'Accept'

    7.4.Set run parameters fro docking Docking > Set Docking Run Parameters

    A dialog displaying these parameters appears. Keep the default values and press'Accept'

    7.5.Write the docking parameter files Docking > Write DPF > GALS.dpf

    Save the file as ligand.protein.dpf

    7.6.Start AutoDock Run > Start AutoDock

    In the dialog box press 'Launch'AutoDock will run in background and when it completes a message autodock3:Successful completion at localhost will appear in the terminal from where ADT waslaunched. Wait till AutoDock completes.

    7.7.Delete the ligand file from the viewer Edit > Delete > Delete Molecule

    Select the ligand file from the list and press Delete Molecule.

    8.Analyzing the docking results

    After Autodock compltes the docking results are saved in a file named ligand.protein.dlg in thedirectory 'docking'8.1.Load the ligand.protein.dlg into the viewer

    Analyze > Docking Logs > Read Docking LogSelect the ligand.protein.dlg file from the dorectory 'docking'. A dialog box displayingthe total docked conformationa appears, press 'OK'.

    8.2.Color the protein and ligand appropriated for clear visualization. Color protein in CPK and zoom the active site.

    Select > Select From StringIn the dialog box, select 'protein' from the 'Molecule List' drop down and press'Select'. You can see the protein selected in the viewer. Press 'Dismiss' to remove the

  • 7/30/2019 Practical Pgn2 (1)

    5/8

    'Select From String' dialog. Un/Display > CPK

    In the dialog box decrease the 'Scale Factor' to 0.6, press 'OK' Color > By Residue Type > RasmolAmino

    In the dialog select CPK and press 'OK'

    Zoom the active site on the protein.

    Display the ligand in Ball Stick Select > Select From String

    In the dialog box, deselect 'protein' from the 'molecule list' dropdown and select'ligand'.Press 'Select'. You can see the ligand selected in the viewer. Press 'Dismiss' toremove the 'Select From String' dialog.

    Un/Display > Sticks and BallsIn the dialog box leave the default values, press 'OK'

    Color > By Atom Type

    In the dialog select 'Balls and Sticks' and press 'OK'Press 'Clear Selection' in the main menu.

    8.3.View the docked conformations Analyze > Conformations > Show Conformations

    You will see a dialog box with information about each docked conformation. Press thefront arrow to scroll the conformations. You can simultaneously see the conformationsin the viewer. The conformation with the lowest docking energy is ranked best byAutoDock.

    Make a note of the binding energy.

    8.4.View Clustering Analyze > Clusterings > Show Clusterings

    You can see the clusters of docked conformations based on Binding energies.

    8.5.Observing the docking energies of the ligand in .dlg file You can see the energies of the docked conformations in the .dlg file. Open and scroll

    down the protein.ligand.dlg file until you find 'CLUSTERING HISTOGRAM' Make a note of the cluster rank, lowest docked energy, number of conf. in the cluster.

    Molecular databases for Drug design

    In this exercise you will learn about the molecular databases used for drug design. Here is a briefintroduction of two freely available databases, The ZINC database and the NCI database.

    1. The ZINC Database ( http://zinc.docking.org )

    The ZINC database is a collection of over 3.3 million compounds from various commercialand academic resources. ZINC is provided by the Shoichet Laboratory in the Department ofPharmaceutical Chemistry at the University of California, San Francisco (UCSF).

    At the ZINC database you canBrowse / download subsets by vendor (provider),

    Browse / download pre-classified subsets specified by molecular properties,Search the database based on molecular constitution & properties andUpload your own molecules and define them as a set.

  • 7/30/2019 Practical Pgn2 (1)

    6/8

    Right now molecules from 22 vendors can be downloaded from zinc. Each molecule in zinc hasrepresentation of its multiple protonation states and multiple tautomeric forms for each substance.

    2. The NCI database

    The NCI database is a collection of 250,251 molecules publicly and freely available fromNCI's Developmental Therapeutics Program (DTP). The entire collection with various biologicalactivity information and calculated properties can be freely downloaded athttp://cactus.nci.nih.gov/ncidb2/download.html

    The NCI database has an advanced browser to search the molecules. The advanced browsercan be found at http://129.43.27.140/ncidb2/ .Using the advanced NCI browser you can search thedatabase using various criteria like Substructure similarity, Exact similarity, Formula, Molecularweight etc.,

    More documentation on how to perform queries and can be found at

    http://129.43.27.140/ncidb2/help.html

    Advanced part Molecular Databases &In-Silico Virtual Screening

    The aim of this part of exercise is to search for ligands with similar molecular constitution as thenatural ligand with specified molecular properties as constraints using the interface at zinc database.Then you need to randomly select few hits (at least 5), dock them to Trypsin individually throughthe learnt procedure in the previous session (basic part) and rank the molecules according to thedocking energies. This procedure if done at large scale in an automated fashion is called In-silicovirtual screening. You need to use the macromolecule file protein.pdbqs from the previouspractical session.

    1.Performing search and Screening the hitsCreate a directory 'screening' by typing 'mkdir screening' in the home directory.

    NOTE: If you have problems in this section, you can copy the ligands.zip file at the coursehomepage and skip until 1.11

    1.1.Open the web browser and visit the Zinc Database at http://zinc.docking.org Zinc is a free database of over 3.3 million commercially-available compounds for

    virtual screening.

    1.2.Click on Search and Browse link to open the interface to search the database. This will open the search interface for the zinc database on right you can set the

    property constraints for the search and on right you can see a Java molecular editor inwhich you can draw the molecular constitution that you want to search.

    1.3.Draw the natural ligand in the molecular editor. While drawing, observe that hydrogens areadded automatically to the nitrogens. You can see the 2d structure of the natural ligand in thefigure below. Now click on 'save SMILES' button just below the editor. NH2 will appear inthe zinc molecular area on the nitrogen with the single bond in the figure below.

  • 7/30/2019 Practical Pgn2 (1)

    7/8

    1.4.Specify the property constraints. Here we specify the constraints from Lipinsky's rule of fivewhich is derived empirically from the analysis of the World Drug Index on properties thatmaximizes an oral drug candidate's probability of surviving clinical development. For thisexercise we skip the partition coefficient constraint logp whose value should be less than 5.

    1.5.

    Specify the following constraints Molecular Weight

  • 7/30/2019 Practical Pgn2 (1)

    8/8

    1.12. We have protein.pdbqs from the previous practical session (In-Silico Docking) inthe docking directory. Copy the protein.pdbqs into all the directories in the screening folder.

    1.13. For all the ligands perform the dockings as done in the previous practical session

    (In-Silico Docking) in the respective directories.a)Alterations during dockings

    Skip 4. Preparing the protein file protein.pdbqs Skip 5. Preparing the Ligand file-ligand.pdbq since we already have the

    zinc_id.pdbq files from the PRODRG server. In 6.Autogrid at 6.3 setting the grid box on the protein, Set X Y Z dimensions to 60

    each. DO NOT Center on ligand. Instead, specify the x center: -1.856, y center: 14.366and z center: 16.748. This is because the ligand might be away from the protein initiallyand centering the grid on the ligand will exclude the protein from the docking.

    1.14. Note down the binding energies for each ligand and tabulate the results

    zinc id Molecular wt. Docking Energy # of conf, in top cluster

    Make a note of the ligand with lowest binding energy.

    -- Chaitanya, Francesco & P G Nyholm,Biognos AB