Protein 3D-structure analysis
Transcript of Protein 3D-structure analysis
![Page 1: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/1.jpg)
Protein 3D-structure analysis
why and how
![Page 2: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/2.jpg)
3D-structures are precious sources of
information
• Shape and domain structure
• Protein classification
• Prediction of function for uncharacterized proteins
• Interaction with other macromolecules
• Interactions with small ligands: metal ions, nucleotides, substrates, cofactors and inhibitors
• Evidence for enzyme mechanism
• Structure-based drug development
• Posttranslational modifications: disulfide bonds, N-glycosylation,…
• Experimental evidence for transmembrane domains
![Page 3: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/3.jpg)
Structure of the polypeptide chain
Convenient, but
real molecules fill
up space
Colors:
Carbon=light grey
Nitrogen=blue
Oxygen=red
Sulfur=yellow
1B6Q
Visualization with DeepView
![Page 4: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/4.jpg)
Structure of the polypeptide chain
Another
view of
the same
1B6Q
JMOL cartoon
![Page 5: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/5.jpg)
Structure of the polypeptide chain
Space-filling model of
the same
Colors:
Carbon=light grey
Nitrogen=blue
Oxygen=red
Sulfur=yellow
1B6Q
JMOL
![Page 6: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/6.jpg)
Basics of protein structure
Primary structure
Secondary structure
Tertiary structure
Quaternary structure
Nota bene: some proteins
are inherently disordered
![Page 7: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/7.jpg)
Primary structure: the protein sequence
20 amino acids with different characteristics:
small, large, polar, lipophilic, charged,…
• http://schoolworkhelper.net/amino-acids-categories-function/
![Page 8: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/8.jpg)
Structure of the polypeptide chain
Alpha carbon atom
http://www.ncbi.nlm.nih.gov/bookshelf/br.fcgi?book=bioinfo&part=A135#A146
![Page 9: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/9.jpg)
The folding pattern of a polypeptide chain
can be described in terms of the angles of
rotation around the main chain bonds
Phi and psi describe the main chain conformation. Omega corresponds to the trans (omega=180) or cis (omega=0)
conformation. Except for Pro, trans is the more stable conformation
http://swissmodel.expasy.org/course/
N
![Page 10: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/10.jpg)
Key facts about a polypeptide chain:
• Chemical bonds have characteristic lengths.
• The peptide bond has partial double-bond character,
meaning it is shorter, and rigid
• Other bonds are single bonds (but: restriction of rotation due
to steric hindrance)
![Page 11: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/11.jpg)
Ramachandran plot (1) : Each type of secondary structure has a characteristic
combination of phi and psi angles
http://swissmodel.expasy.org/course/text/chapter1.htm
![Page 12: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/12.jpg)
Ramachandran plot (2) : For each possible conformation, the structure is examined for close contacts
between atoms. Atoms are treated as hard spheres with dimensions corresponding
to their van der Waals radii. Angles, which cause spheres to collide correspond to
sterically disallowed conformations of the polypeptide backbone (white zone).
Red: no steric hindrance
Yellow: some steric
constraints
White: “forbidden zone”
Exception: Gly has no side
chain and can be found in the
white region
When determining a protein structure, nearly all residues should be in
the permitted zone (excepting a few Gly)
![Page 13: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/13.jpg)
Secondary structure: helices, strands, turns and loops
![Page 14: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/14.jpg)
Secondary structure: Alpha-helix
Characteristics:
Helical residues have negative phi and psi
angles, typical values being -60 degrees and
-50 degrees
Every main chain C=O and N-H group is
hydrogen-bonded to a peptide bond 4 residues
away (i.e. Oi to Ni+4). This gives a very regular,
stable arrangement.
3.6 residues per turn
5.4 Å repeat along the helix axis
Each residue corresponds to a rise of ca. 1.5 Å
![Page 15: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/15.jpg)
Secondary structure: beta strands
http://swissmodel.expasy.org/course/text/chapter1.htm
Characteristics:
Positive psi angles, typically
ca. 130 degrees, and negative
phi values, typically ca. -140
degrees
No hydrogen bonds amongst
backbone atoms from the
same strand!
![Page 16: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/16.jpg)
Beta strands can form
parallel or antiparallel beta-sheets
Characteristics:
Stabilized by hydrogen bonds
between backbone atoms from
adjacent chains
The axial distance between
adjacent residues is 3.5 Å.
There are two residues per
repeat unit which gives the
beta-strand a 7 Å pitch
![Page 17: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/17.jpg)
Turns and loops
Loop: general name for a mobile part of the polypeptide chain
with no fixed secondary structure
Turn: several types, defined structure, requirement for
specific aa at key positions, meaning they can be predicted.
The polypeptide chain “makes a U-turn” over 2-5 residues.
Loop between
beta strands
![Page 18: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/18.jpg)
Supersecondary structures: Composed of 2-3 secondary structure elements
Examples:
Helix-turn-helix motifs, frequent in
DNA-binding proteins
Coiled coils, e.g from myosin
![Page 19: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/19.jpg)
Tertiary structure
Domains, repeats, zinc fingers…
Domain: independently folded part of a protein. Average size,
about 150 aa residues, lower limit ca 50 residues
Repeats: several types: LRR, ANK, HEAT…. Composed of
few secondary structure elements. Stabilized by interactions
between repeats; can form large structures.
Zinc fingers: several types; structure is stabilized by bound
zinc ion
EF Hands: structure is stabilized by bound calcium
LRR domain
![Page 20: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/20.jpg)
Quaternary structure: subunit structure
• STRING database
• IntAct, DIP, MINT
Examples:
- Homodimer
- Complex between ligand and receptor,
enzyme and substrate
- Multisubunit complex
![Page 21: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/21.jpg)
STRING http://string-db.org/
![Page 22: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/22.jpg)
![Page 23: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/23.jpg)
Protein folding
Many proteins can fold rapidly and spontaneously (msec
range)
The physicochemical properties of the polypeptide chain
(=the protein sequence) determines protein structure
One sequence -> one stable fold
NB: some proteins or parts of proteins are intrinsically
disordered=unstructured in the absence of a specific
ligand (e.g. Mineralocorticoid receptor ligand-binding domain)
See also: http://en.wikipedia.org/wiki/Protein_folding
![Page 24: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/24.jpg)
Protein structures:
the need for classification
How similar/dissimilar are these proteins?
![Page 25: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/25.jpg)
Protein structure classification:
quantitative criteria
Purely alpha-helical structure
Purely beta-strand structure,
Mixed
Topology (= orientation & connectivity of
structural elements)
Single domain vs multidomain proteins
![Page 26: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/26.jpg)
Implications of structural similarity:
Evolution
Function prediction
![Page 27: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/27.jpg)
27
Reasons for structural similarity
• Similarity arises due to divergent evolution (homologues) from a common ancestor - structure much more highly conserved than sequence
• Similarity due to convergent evolution (analogues)
– Similarity due to there being a limited number of ways of packing helices and strands in 3D space
– no significant sequence similarity, but proteins may use similar structural locations as active sites
– NB: at low sequence identity, it is difficult to know whether 2 sequences share a common ancestor, or not
![Page 28: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/28.jpg)
28
Current dogma
• Proteins with similar sequences have similar 3D-structures
• Proteins with similar 3D-structure are likely to have similar function (generally true, but exceptions exist)
• Proteins with similar function can have entirely different sequences (subtilisin vs chymotrypsin: same active site geometry, no detectable sequence similarity)
![Page 29: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/29.jpg)
Protein 3D-structure analysis
methodology
![Page 30: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/30.jpg)
Protein structure initiatives: technical progress and
automatisation
Primary structure
Secondary structure
Tertiary structure
Quaternary structure
Nota bene: some proteins
are inherently disordered
![Page 31: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/31.jpg)
31
Most protein structures are determined
using X-ray crystallography
![Page 32: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/32.jpg)
32
Parameters affecting crystallisation:
Physico-chemical: find the right conditions
Precipitants: type and concentration
pH
Solvant, buffer composition
Temperature, Pressure
Time
Biological
Protein purity, presence of ligands
Biological source (native vs heterologous expression, PTMs)
Sequence (micro) heterogeneities
Conformational (micro) heterogeneities
Some proteins are intrinsically unstructured
Tetragonal lysozyme crystals
![Page 33: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/33.jpg)
33
Data acquisition
Monochrome X-ray beam focused on a crystal
Atoms within the crystal diffract the beam; each type of crystal gives a characteristic
diffraction pattern that can be recorded and used to calculate the structure
The more complex the sample, the more complex the diffraction pattern (practical
Problems due to weak and/or diffuse spots)
![Page 34: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/34.jpg)
34
Resolution and structural knowledge
The resolution affects the amount of information that can be obtained
The resolution depends on the quality of the crystals, how similar protein molecules in the crystal are to each other, and how well ordered they are throughout the entire crystal.
Res (A) Structural information X-ray
4.0 Global fold, some indication of secondary Useful structure
3.5 Secondary structure
3.0 Most side chains are positioned
2.5 All side-chains, phi-psi angles constrained, Typical waters located
1.5 phi-psi angles well defined, hydrogen Very good atoms begin to appear
1.0 Hydrogens are visible Possible
![Page 35: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/35.jpg)
35
Nuclear magnetic resonance (NMR)
Measures the energy levels of magnetic atoms, i.e. atoms with odd electron numbers: 1H, 13C, 15N, 19F, 31P
Energy levels of an atom are influenced by the local environment (chemical shifts) Via covalent bonds
Through space, max. 5A apart: Nuclear Overhauser Effect (NOE)
NMR can identify atoms that are close together, also those that are close in space but not linked by direct covalent bonds
Chemical shifts can define secondary structures
NMR spectra yield a set of peaks that correspond to the interactions between pairs of atoms
From these, one can calculate the protein structure
![Page 36: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/36.jpg)
36
Nuclear magnetic resonance (NMR)
Advantage: done with proteins in solution
But: still requires high protein concentrations
Advantage/Disadvantage: conformational heterogeneity – proteins move
NMR results usually yield ca 20 closely similar but non-identical structures
Disadvantage: cannot determine the structures of large proteins (ok up to 30 kDa, feasible up to 60 kDa)
NMR spectra are used to study small proteins or isolated domains
![Page 37: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/37.jpg)
37
An ensemble of 15-20 closely similar structures
Dynamic aspects of the structure
Less “precise” than rigid X-ray structures
NMR output
![Page 38: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/38.jpg)
38
X-ray vs NMR: principles
X-ray
X-rays Diffraction
Pattern
Direct detection of atom positions Crystals
NMR
RF
RF Resonance
H0
Indirect detection of H-H distances In solution
![Page 39: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/39.jpg)
39
Electron Microscopy
Developed in 1930 to overcome
limitations of Light Microscopes
Based on Light Transmission
Microscope principle
Potentially resolutions of 1Å are possible
Allows to reconstruct 3D structures
from 2D projections
![Page 40: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/40.jpg)
40
Transmission Electron Microscope
Cryo-EM image of GroEL chaperonin complexes, showing end views (rings) and side views (stripes).
![Page 41: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/41.jpg)
41
Methods for determining 3D structures
Advantages Disadvantages
No crystals needed Conformation of protein in solution Dynamic aspects (conformation ensemble view)
Highly concentrated solution (1mM at least)
Isotope substitution (13C, 15N)
Limited maximum weight (about 60 kD)
NMR
No 3D-crystals needed Direct image
Large radiation damage Need 2D crystals or large complexes Artefacts
Electron Microscopy
High resolution (up to 0.5Å)
No protein mass limit
X-ray Crystallo- graphy
Crystals needed
Artefacts due to crystallization (Enzyme in open vs closed Conformation) Structure is a static average
![Page 42: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/42.jpg)
Working with protein structures
Databases and tools
![Page 43: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/43.jpg)
One central archive for 3D-structure
data: wwPDB (www.wwpdb.org)
• What can you find there?
• How to find a structure for protein x?
• How to find a structure with bound z?
![Page 44: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/44.jpg)
What if ?
• If the structure has not been determined, is
there a structure for a similar protein?
• Can we predict the structure of a protein?
How?
![Page 45: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/45.jpg)
World-wide PDB (wwPDB)
Four member sites:
same data, but different presentation and tools • RCSB PDB (www.rcsb.org/pdb/home/home.do)
• PDBe (http://www.ebi.ac.uk/pdbe/)
• PDBj (www.pdbj.org/)
• BMRB (Biological Magnetic Resonance Data Bank,
www.bmrb.wisc.edu)
![Page 46: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/46.jpg)
Information and tools you can find at
RCSB PDB
PDB file and Header document
Structure viewer
Links
Good documentation and help:
http://www.rcsb.org/robohelp_f/#search_database/how_to_search.htm
http://www.ebi.ac.uk/pdbe/#m=2&h=0&e=1&r=0&l=0&a=0&w=1-3-2
http://www.ebi.ac.uk/pdbe/#m=2&h=0&e=0&r=0&l=0&a=0&w=0
![Page 47: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/47.jpg)
Finding protein structures at RCSB PDB
Via main query window: PDB ID, or text
![Page 48: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/48.jpg)
Advanced search: query with
UniProt AC
![Page 49: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/49.jpg)
Advanced search: query via BLAST
![Page 50: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/50.jpg)
Finding protein structures at RCSB PDB
For E.coli alkB DNA repair dioxygenase
![Page 51: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/51.jpg)
Information from protein 3D-structures about E.coli alkB
DNA repair dioxygenase
![Page 52: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/52.jpg)
PDB file
![Page 53: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/53.jpg)
Header: Atom coordinates http://www.rcsb.org/pdb/static.do?p=education_discussion/Looking-at-Structures/coordinates.html
X, Y, Z, occupancy and temperature factor
![Page 54: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/54.jpg)
Header: protein sequence and ligand
information
HET: Heteroatoms=non-protein
atoms: small molecules and ions, each
with a unique abbreviation
![Page 55: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/55.jpg)
Information from protein 3D-structures about E.coli alkB
DNA repair dioxygenase
![Page 56: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/56.jpg)
Viewing a structure (JMOL): Right-click in window to change parameters
![Page 57: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/57.jpg)
Access to Ligand explorer
Further down on same page..
![Page 58: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/58.jpg)
Ligand interactions
![Page 59: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/59.jpg)
Very easy to use, excellent tools and links:
PDBsum
www.ebi.ac.uk/thornton-srv/databases/pdbsum/
![Page 60: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/60.jpg)
Ligand interactions
![Page 61: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/61.jpg)
Ligand interactions
![Page 62: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/62.jpg)
Ligand interactions
![Page 63: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/63.jpg)
Finding structures with ligands:
substrates, inhibitors, drugs…
Often, people use non-hydrolyzable substrate
analogs in their structure, e.g. ATP analogs. There
are many different ATP analogs!
In PDB, every chemical compound has its own
abbreviation
If you want to study proteins with bound ATP or ATP
analogs, you have to use the adequate tools
![Page 64: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/64.jpg)
Finding structures with ligands
http://www.rcsb.org/pdb/static.do?p=help/advancedSearch/index.html
![Page 65: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/65.jpg)
Naming & finding chemical entities
http://www.ebi.ac.uk/chebi/advancedSearchForward.do
![Page 66: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/66.jpg)
E.coli alkB DNA repair dioxygenase:
finding similar proteins/structures
![Page 67: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/67.jpg)
??? Why so few
similar proteins??
![Page 68: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/68.jpg)
Structural Similarities for PDB 3I3Q The following structural similarities have been found using the jFATCAT-rigid
algorithm. In order to reduce the number of hits, a 40% sequence identity
clustering has been applied and a representative chain is taken from each cluster.
Root mean square deviation
![Page 69: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/69.jpg)
Root Mean Square Deviation (rmsd): describes how well the alpha-carbon atoms of 2 proteins
superimpose .
www.lce.hut.fi/teaching/S-114.500/Protein_Structure1.pdf
![Page 70: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/70.jpg)
3D-structure classification and alignment
Structure description1 Structure description 2
Comparison algorithm
Scores
Similarity
![Page 71: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/71.jpg)
Protein structure classification:
quantitative criteria
Purely alpha-helical structure
Purely beta-strand structure,
Mixed
Topology
Single domain vs multidomain proteins
![Page 72: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/72.jpg)
73
Structure Classification Databases
• SCOP (MRC Cambridge) – Structural Classification of Proteins
– Murzin et al. 1995
– Largely manual (visual inspection)
, last update June 2009
• CATH (University College, London) – Class, Architecture, Topology and Homologous superfamily
– Orengo et al. 1993, 1997
– Manual and automatic method, last update Sept 2011
• DALI/FSSP (EBI, Cambridge)
– Fold classification based on Structure-Structure alignment of
Proteins
– Holm and Sander, 1993
– Completely automatic, updated every 6 months (last in 2011)
![Page 73: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/73.jpg)
74
SCOP database
Classes
• All alpha proteins
• All beta proteins
• Alpha and beta proteins (a/b) - Mainly parallel beta sheets
• Alpha and beta proteins (a+b) - Mainly antiparallel beta sheets (segregated alpha and
beta regions)
• Multi-domain proteins (alpha and beta) - Folds consisting of two or more domains
belonging to different classes
• Membrane and cell surface proteins
• Small proteins
http://scop.berkeley.edu/
![Page 74: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/74.jpg)
75
CATH – Protein Structure Classification
• Hierarchical classification of protein domain structures in PDB.
• Mostly automated classification
• Domains are clustered at four major levels:
Class
Architecture
Topology
Homologous superfamily
Sequence family
http://nar.oxfordjournals.org/content/27/1/275.full
www.cathdb.info/
![Page 75: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/75.jpg)
Finding similar protein structures via the
DALI database
![Page 76: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/76.jpg)
Finding similar protein structures via the
DALI server Select neighbours (check boxes) for viewing as multiple structural alignment or
3D superimposition. The list of neighbours is sorted by Z-score (A measure of the
statistical significance of the result relative to an alignment of random structures).
Similarities with a Z-score lower than 2 are spurious.
![Page 77: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/77.jpg)
Alignment and 3D-superposition of 3i49A and 2iuwA (another family member: 18% seq identity, z-score 17.7 and 2.8 A rmsd)
![Page 78: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/78.jpg)
Alignment and 3D-superposition of 3i49A and 2iuwA (another family member: 18% seq identity, z-score 17.7 and 2.8 A rmsd)
Conservation of secondary structure
Sequence alignment: residues essential for catalysis are conserved
![Page 79: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/79.jpg)
Protein structure prediction
• Ab initio: only meaningful for small proteins (up
to ca 120 residues)
• Homology modeling: can give highly valuable
results
• The higher the sequence similarity, the higher the
chance that proteins have similar structures
• Proteins with similar structures often (but not
always!!) have similar functions
• For function prediction, you need a basis:
characterized proteins
![Page 80: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/80.jpg)
The Protein Model Portal: access to all publicly accessible
protein models and 3D-structures Out of 538’000 UniProtKB/Swiss-Prot entries, 429’000 have a link to Protein Model Portal
http://www.proteinmodelportal.org/?
![Page 81: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/81.jpg)
Example: alkB fromC.crescentus
![Page 82: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/82.jpg)
![Page 83: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/83.jpg)
http://www.proteinmodelportal.org/?pid=documentation#modelquality
![Page 84: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/84.jpg)
Manual homology modeling
![Page 85: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/85.jpg)
Ab initio and comparative protein
structure prediction http://robetta.bakerlab.org/
![Page 86: Protein 3D-structure analysis](https://reader031.fdocuments.us/reader031/viewer/2022020702/61fb19d02e268c58cd5a287c/html5/thumbnails/86.jpg)
Protein 3D-structure analysis
and now it is up to you:
Practicals