Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture...
-
Upload
duongtuyen -
Category
Documents
-
view
229 -
download
0
Transcript of Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture...
![Page 1: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/1.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 1
Lecture 1 - Introduction to Structural Bioinformatics
Motivation and Basics of Protein Structure
![Page 2: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/2.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 2
Objectives of the course� Understanding protein function.� Applications to Computer Aided Drug
Design.� Development of efficient algorithms
to evaluate the above “in silico”.� Emphasis on the “structure” related
problems – Geometric Computing in Molecular Biology.
� Show relevance to other spatial “pattern discovery” tasks.
![Page 3: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/3.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 3
Most of the Protein Structure slides – courtesy of Hadar Benyaminy.
![Page 4: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/4.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 4
Textbook
There is no single, double or triple textbook for this course.
Most of the material is based on journal articles and research done by the Wolfson-Nussinov Structural Bioinformatics group at TAU.
Nevertheless :
![Page 5: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/5.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 5
Recommended Literature (1):
� Setubal and Meidanis, Introduction to Computational Biology, (1997).
� A. Lesk, Introduction to Protein Architecture, 2’nd edition (2001).
� S.L. Salzberg, D.B.Searls, S. Kasif(editors), Computational Methods in Molecular Biology, (1998).
![Page 6: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/6.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 6
Recommended Literature (2):
� Branden and Tooze, Introduction to Protein Structure (2’nd edition).
� D. Gusfield, Algorithms on Strings, Trees and Sequences, (1997).
� Voet and Voet, Biochemistry (or, any other Biochemistry book in the Library).
� M. Waterman, Introduction to Computational Biology.
![Page 7: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/7.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 7
Strongly Recommended Literature (currently not in the library):
� Protein Bioinformatics.� Structural Bioinformatics.
![Page 8: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/8.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 8
Recommended Web Sites:
� Enormous number of sites.� Search using “google”.� PDB site http://www.rcsb.org/pdb/� Birbeck course on protein structure.
![Page 9: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/9.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 9
Journals :
� Proteins : Structure, Function, bioinformatics.� Journal of Computational Biology.� Bioinformatics (former CABIOS).� Journal of Molecular Biology.� Journal of Computer Aided Molecular Design.� Journal of Molecular Graphics and Modelling.� Protein Engineering.
![Page 10: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/10.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 10
Computational Biology Conferences:
� ISMB - International Conference on Intelligent Systems in Molecular Biology.
� RECOMB - Int. Conference of Computational Molecular Biology.
� ECCB - European Conference on Computational Bio.
� WABI - Workshop of Algorithms in Bioinformatics .
![Page 11: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/11.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 11
Cell- the basic life unit
![Page 12: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/12.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 12
Different cell types
![Page 13: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/13.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 13
Size of protein molecules (diameter)
� cell (1x10-6 m) µµµµ microns
� ribosome (1x10-9 m) nanometers
� protein (1x10-10 m) angstroms
![Page 14: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/14.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 14
The central dogma
� DNA ---> RNA ---> Protein
� {A,C,G,T} {A,C,G,U} {A,D,..Y}
� 4 letter alphabets 20 letter alphabet
� Sequence of nucleic acids seq of amino acids
![Page 15: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/15.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 15
When genes are expressed, the genetic information (base sequence) on DNA is first transcribed (copied) to a molecule of messenger RNA in a process similar to DNA replicationThe mRNA molecules then leave the cell nucleus and enter the cytoplasm, where triplets of
bases)(codons) forming the genetic code specify the particular amino acids that make up an individual protein.This process, called translation, is accomplished by ribosomes (cellular components composedof proteins and another class of RNA) that read the genetic code from the mRNA, and
transfer RNAs (tRNAs) that transport amino acids to the ribosomes for attachment to the)www.ornl.gov/hgmis/publicat/primer/(From growing protein.
![Page 16: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/16.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 16
Proteins – our molecular machines(samples of protein tasks)
� Catalysis (enzymes).� Signal propagation.� Transport.� Storage.� Receptors (e.g. antibodies – immune system).� Structural proteins (hair, skin, nails).
![Page 17: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/17.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 17
Amino acids and the peptide bond
Cβ – first side chain carbon (except for glycine).
![Page 18: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/18.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 18
Primary through Quaternary structure
� Primary structure: The order of the amino acids composing the protein.
� AASGDXSLVEVHXXVFIVPPXIL…..
![Page 19: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/19.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 19
Folding of the Protein Backbone
![Page 20: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/20.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 20
The Holy Grail - Protein Folding
� How does a protein “know” its 3-D structure ?
� How does it compute it so fast ?� Relatively primitive computational
folding models have proved to be NP complete even in the 2-D case.
![Page 21: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/21.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 21
Secondary structure
3.6 residues/turn (5.4 A dist.)
![Page 22: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/22.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 22
Bond. Hydrogen bond.
β strands and sheets
![Page 23: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/23.jpg)
![Page 24: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/24.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 24
Wire-frame or ribbons display
![Page 25: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/25.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 25
Space-fill display
![Page 26: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/26.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 26
Tertiary structure: full 3D folded structure of the polypeptide chainRibonuclease - PDB code 1rpg
![Page 27: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/27.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 27
Quaternary structure
The interconnections and organization of more than one polypeptide chain.
Example :Transthyretindimer (1tta)
![Page 28: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/28.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 28
Determination of protein structures
� X-ray Crystallography
� NMR (Nuclear Magnetic Resonance)
� EM (Electron microscopy)
� Nano – sensors (?)
![Page 29: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/29.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 29
X-ray Crystallography
� Crystallization
� Each protein has a unique X-ray pattern diffraction.
� The electron density map is used to build a model of the protein.
![Page 30: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/30.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 30
Nuclear Magnetic Resonance
� Performed in an aqueous solution.� NMR analysis gives a set of estimates
of distances between specific pairs of protons (H – atoms).
� Solved by Distance Geometry methods.� The result is an ensemble of models
rather than a single structure.
![Page 31: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/31.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 31
An NMR result is an ensemble of modelsCystatin (1a67)
![Page 32: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/32.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 32
The Protein Data Bank (PDB)
� International repository of 3D molecular data.
� Contains x-y-z coordinates of all atoms of the molecule and additional data.
![Page 33: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/33.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 33
Feb. 2003 – about 20,000 structures.
![Page 34: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/34.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 34
![Page 35: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/35.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 35
Classification of 3D structures
![Page 36: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/36.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 36
SCOP
� Provides a description of the structural and evolutionary relationships between all proteins whose structure is known.
� Created largely by manual inspection.
� J. Mol. Biol. 247, 536-540, 1995
![Page 37: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/37.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 37
SCOP
![Page 38: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/38.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 38
CATH - Protein Structure Classification http://www.biochem.ucl.ac.uk/bsm/cath/
![Page 39: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/39.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 39
CATH
� Class: derived from secondary structure content.
� Architecture: gross orientation of secondary structures, independent of connectivities.
� Topology: clusters according to topological connections and numbers of secondary structures.
![Page 40: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/40.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 40
� Homology: clusters according to structure and function.
![Page 41: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/41.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 41
![Page 42: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/42.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 42
� PDB http://pdb.tau.ac.il� PDB http://www.rcsb.org/pdb/� CATH
http://www.biochem.ucl.ac.uk/bsm/cath/� SCOP http://scop.mrc-
lmb.cam.ac.uk/scop/
![Page 43: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/43.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 43
Restriction enzymes
![Page 44: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/44.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 44
![Page 45: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/45.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 45
The Structural Genomics Pipeline(X-ray Crystallography)
Basic Steps
Target Selection
Crystallomics• Isolation,• Expression,• Purification,• Crystallization
DataCollection
StructureSolution
StructureRefinement
Functional Annotation Publish
Bioinformatics• Distant
homologs • Domain recognition
AutomationBioinformatics• Empirical
rules
AutomationBetter sources
Software integrationDecision Support
MAD Phasing Automatedfitting
Bioinformatics• Alignments• Protein-protein
interactions• Protein-ligand
interactions• Motif recognition
No?
Borrowed from Bourne’s (UCSD) lecture on CADD
![Page 46: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/46.jpg)
Human GenomeHuman GenomeProjectProject
DNA&Protein Sequences
PROTEINPROTEINSTRUCTURESTRUCTURE
Computer Computer AssistedAssisted
Drug DesignDrug Design
Biological Biological FunctionFunction
X-ray cryst.NMR, EM
TAU Structural Bioinformatics LabMB)–CS, Nussinov -(Wolfson
![Page 47: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/47.jpg)
Structural Bioinformatics Lab GoalsDevelopment of state of the artalgorithmic methods to tackle major computational tasks in protein structure analysis, biomolecular recognition, and Computer Assisted Drug Design.
Establish truly interdisciplinary collaboration between Life and Computer Sciences.
![Page 48: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/48.jpg)
Bioinformatics and Genomics -Economic Impact
•Medicine and public health.
•Pharmaceutics.
•Agriculture.
•Food industry.
•Biological Computers (?).
![Page 49: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/49.jpg)
Bioinformatics and Genomics -the Computational Viewpoint
•Molecular Biology is becoming a Computational Science.
•The emergence of large databases of DNA, proteins, small molecules and drugs requires computational techniques to analyze the data.
•Efficient CPU and memory intensive algorithms are being developed.
•Many of the computational tasks have analogs in other well established fields of Computer Science allowing cross-fertilization of ideas.
![Page 50: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/50.jpg)
Bioinformatics - Computational Genomics
� DNA mapping.� Protein or DNA sequence comparisons ,
primary structure.� Exploration of huge textual databases.� In essence one- dimensional methods
and intuition.� Graph - theoretic methods.
![Page 51: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/51.jpg)
Structural Bioinformatics -Structural Genomics
� Elucidation of the 3D structures of biomolecules.
� Analysis and comparison of biomolecular structures.
� Prediction of biomolecular recognition.� Handles three-dimensional (3-D)
structures.� Geometric Computing.
![Page 52: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/52.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 52
Why bother with structureswhen we have sequences ?
� In evolutionary related proteins structure is much better preserved than sequence.
� Structural motifs may predict similar biological function .
� Getting insight into protein folding. Recovering the limited (?) number of protein folds.
![Page 53: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/53.jpg)
Case in Point :Protein Structural
Comparison
ApoAmicyanin - 1aaj Pseudoazurin - 1pmy
![Page 54: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/54.jpg)
Geometric Task :
Given two configurations of points in the three dimensional space,
find those rotations and translations of one of the point sets which produce “large” superimpositions of corresponding 3-D points.
![Page 55: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/55.jpg)
Remarks :
The superimposition pattern is not knowna-priori – pattern discovery .
The matching recovered can be inexact.
We are looking not necessarily for thelargest superimposition, since other matchings may have biological meaning.
![Page 56: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/56.jpg)
Algorithmic Solution
About 1 sec. Fischer, Nussinov, Wolfson ~ 1990.
![Page 57: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/57.jpg)
Applications
� Classification of protein databases by structure.
� Search of partial and disconnectedstructural patterns in large databases.
� Detection of structural pharmacophoresin an ensemble of drugs.
� Comparison and detection of drug receptor active sites.
![Page 58: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/58.jpg)
Geometric Matching task = Geometric Pattern Discovery
Cα constellations - before Superimposed constellations
![Page 59: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/59.jpg)
Analogy with Object Recognition in Computer
Vision
Wolfson, “Curve Matching”,1987.
![Page 60: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/60.jpg)
Multiple Structural Alignment (Globin example)
Leibowitz, Fligelman, Nussinov, Wolfson, - ISMB’99 – Heidelberg.
![Page 61: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/61.jpg)
Biomolecular Recognition -docking
� Predict association of protein molecules.
� Predict binding of a protein molecule with a potential drug.
� Scan libraries of drugs to detect a suitable inhibitor for a target molecule.
![Page 62: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/62.jpg)
Docking Algorithms
� Rigid receptor-ligand and protein-protein docking.
� Flexible receptor-ligand docking allowing a small number of hinges either in the ligand or the receptor.
![Page 63: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/63.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 63
Docking - Problem Definition� Given a pair of molecules find
their correct association:
+ =
![Page 64: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/64.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 64
Docking - Trypsin and BPTI
![Page 65: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/65.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 65
Docking - Relevance� Computer aided drug design – a new drug
should fit the active site of a specific receptor.
� Understanding of the biochemical pathways - many reactions in the cell occur through interactions between the molecules.
� Crystallizing large complexes and finding their structure is difficult.
![Page 66: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/66.jpg)
Flexible DockingCalmodulin with M13 ligand
Sandak, Nussinov, Wolfson - JCB 1998.
![Page 67: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/67.jpg)
Flexible Docking HIV Protease Inhibitor
Sandak, Nussinov, Wolfson - CABIOS 1995.
![Page 68: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/68.jpg)
Software Infrastructure
� Development of a software infrastructure for Geometric Computing in Molecular Biology.
� Object oriented, C++ library.� Speed up development of new and
re-usability of old software.� Development of building blocks for
fast testing of new ideas.
![Page 69: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/69.jpg)
Cross - fertilization 1� Analogous tasks appear in
Computer Vision, Medical Imaging, Structural Bioinformatics, Target Recognition.
� Similar software and hardware can handle all of these Geometric Computing tasks - method based cross fertilization.
![Page 70: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/70.jpg)
Cross - fertilization 2� Bioinformatics brings together
Computer Scientists, Molecular Biologists, Chemists etc. to tackle major problems in Computational Biology and Computer Assisted Drug Design - task based cross-fertilization.
![Page 71: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/71.jpg)
Conclusions 1
� Molecular Biology and Biotechnology have entered a stage in which advanced algorithmic methods make the difference between theory and practice.
� Only true interdisciplinary collaboration among Computer and Life scientists can deliver biologically relevantcomputational techniques.
![Page 72: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/72.jpg)
Conclusions 2
� The b.c. (before Computer Science) algorithms in Computational Biology/Biotechnology, which have been mostly developed by chemists and physicists, are analogous to the first generation CS algorithms. The current state-of-the-art of CS (~fifth generation) provides a quantum leap.
![Page 73: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/73.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 73
Sample of Topics to be covered� Protein and DNA sequence alignment.� Protein structural alignment and classification.� Biomolecular recognition prediction – docking.� Folding (homology modelling, threading, ab-
initio).� Distance Geometry for structure calculation
from NMR data (?)� Computer Assisted Structural Drug Design.
![Page 74: Structural Bioinformatics Lecture 1 - Introduction tobioinfo3d.cs.tau.ac.il/Education/CS0304/Lecture 1 - Introduction to... · Structural Bioinformatics 2004 Prof. Haim J. Wolfson](https://reader033.fdocuments.us/reader033/viewer/2022050918/5b5bff3f7f8b9ad21d8b4c9d/html5/thumbnails/74.jpg)
Structural Bioinformatics 2004 Prof. Haim J. Wolfson 74
GRADING
� Exercises - 50%.� Final (individual) Project, which involves
heavy programming, based on the exercises – 50%.
� Most likely, all the students will get the same project assignment.
� The exact grading details will be supplied by the TA, Maxim Shatsky.