Protein Conformation Prediction (Part II)
description
Transcript of Protein Conformation Prediction (Part II)
Doug RaifordLesson 18
Framework model Secondary structure
first Assemble secondary
structure segments Hydrophobic
collapse Molten: compact but
denatured Formation of
secondary structure after: settles in
van der Waals forces and hydrogen bonds require close proximity
04/21/23 2Protein Conformation Prediction (Part II)
De novo (or ab initio) From the beginning
or from first principles
Comparative/Homology Based Sequence similarity
04/21/23 3Protein Conformation Prediction (Part II)
Find a similar protein of known structure
Structure should be similar
04/21/23 Protein Conformation Prediction (Part II) 4
Know the phi and psi angles of the similar protein
Can apply those same angles
Known as threading
04/21/23 Protein Conformation Prediction (Part II) 5
What are chances that lengths will the same
Where put longer portionsWhere put gaps
04/21/23 Protein Conformation Prediction (Part II) 6
Once again, MSAs
3D PSSM (Threading Server)Remember?
Position specific similarity matrix Profiles
3D PSSM performs MSA’s but augments with additional 3D alignments Aligning “known 3D conformations” in three
dimensions
04/21/23 Protein Conformation Prediction (Part II) 7
Molecular dynamics Summation of all forces
exerted at all locations simultaneously
Computationally intensive
Do not fully understand such forces as hydro-phobic avoidance of solvent
04/21/23 Protein Conformation Prediction (Part II) 8
Secondary structure predictionAccuracy mid to upper 70’sWork the loops to fold secondary
structures into energetically optimal conformation
04/21/23 Protein Conformation Prediction (Part II) 9
See how often aa’s show up at specific positions in secondary structure
Chou-Fasman Empirical parameters
for , , and -turns▪ P(,aa)=f(aa)/ave() ▪ P(,aa)=f(aa)/ave()▪ P(-turn,aa)=f(aa)/ave(-turn)
04/21/23 Protein Conformation Prediction (Part II) 10
Name P(a) P(b) P(turn)Alanine 142 83 66Arginine 98 93 95Aspartic Acid 101 54 146• • •
Valine 106 170 50
ID regions where 4 out of 6 (3 of 5 for ) contiguous residues have P(a-helix) > 100
Extend the helix in both directions until a set of four contiguous residues that have an average P(a-helix) < 100 is reached.
04/21/23 Protein Conformation Prediction (Part II) 11
1 MAKYNEKKEK KRIAKERIDI LFSLAERVFP YSPELAKRYV ELALLVQQKA HHHHH HHHHHHHHHH H HHHHHHHH HHHHHHHHHH
51 KVKIPRKWKR RYCKKCHAFL VPGINARVRL RQKRMPHIVV KCLECGHIMRT SSTTTT SB TTT B BTTTEEEEE E SSS EEEE EETTTTEEEE
101 YPYIKEIKKR RKEKMEYGGL VPREE
Chou and Fasman also determined turn frequencies
Most hairpins are three in length When p(-turn) = f(j)f(j+1)f(j+2)f(j+3) is
greater than P() or P()
04/21/23 Protein Conformation Prediction (Part II) 12
Name P(a) P(b) P(turn) f(i) f(i+1) f(i+2) f(i+3)
Alanine 142 83 66 0.06 0.076 0.035 0.058Arginine 98 93 95 0.070 0.106 0.099 0.085Aspartic Acid 101 54 146 0.147 0.110 0.179 0.081• • •
Valine 106 170 50 0.062 0.048 0.028 0.053
Patterns can be used to augment these statistical approaches
In some cases, one side of helices like water Every 4th aa hydrophilic Helps ID helix Helps ID that solvent exposed
Other patterns: coiled coils
04/21/23 Protein Conformation Prediction (Part II) 13
Does this sound familiar?Probability of a sequence of
occurrences?
04/21/23 Protein Conformation Prediction (Part II) 14
Hairpin positio
n 1
Hairpin positio
n 1
Hairpin positio
n 2
Hairpin positio
n 2
Hairpin positio
n 3
Hairpin positio
n 3
Hidden Markov Model
Hidden states helix, beta sheet, turn
04/21/23 Protein Conformation Prediction (Part II) 15
Proteins organized into Domains
Domains composed of motifs
PFAM Database of
protein families Hidden Markov
Models
04/21/23 Protein Conformation Prediction (Part II) 16
04/21/23 Protein Conformation Prediction (Part II) 17
Critical Assessment of techniques for protein Structure Prediction
Biannual conference: contestSecret newly experimentally
determined structures
04/21/23 Protein Conformation Prediction (Part II) 18
CASP1 (1994) | CASP2 (1996) | CASP3 (1998) | CASP4 (2000) | CASP5 (2002) | CASP6 (2004) | CASP7 (2006) | CASP8 (2008) | CASP9 (2010)| CASP10(2012)
CASP1 (1994) | CASP2 (1996) | CASP3 (1998) | CASP4 (2000) | CASP5 (2002) | CASP6 (2004) | CASP7 (2006) | CASP8 (2008) | CASP9 (2010)| CASP10(2012)
Root mean square (RMS) for angles
No intermol contactsSecondary structureSurfaceBuried
04/21/23 Protein Conformation Prediction (Part II) 19
Have seen comparative homology based
HMM based rely on multiple sequence alignments: homology
Now turn to De novo Split into two: Ab initio
and knowledge based
04/21/23 Protein Conformation Prediction (Part II) 20
Build a list of possible conformations (25) for each segment (length 9) Predicted secondary
structure Database of structures
Randomly draw from this list, apply ψ and φ, and score conformation Monte Carlo simulated
annealing procedure04/21/23 Protein Conformation Prediction (Part II) 21
Scoring global conformation hydrophobic burial Electrostatics Disulfide bonding Main chain hydrogen
bonding Strand pairing Sheet formation Helix-strand interactions Excluded volume04/21/23 Protein Conformation Prediction (Part II) 22
04/21/23 23Protein Conformation Prediction (Part II)
04/21/23 Protein Conformation Prediction (Part II) 24
Letter
Name Definition
H Alpha helix (4-12)
Two or more consecutive bridge partners at i and i+4.
B Isolated beta-bridge residue
Must not have a neighbor that qualifies it for H, E, G, or I status. Bridge partner is identified in BP1 or BP2 column.
E Strand ("extended")
Has at least one bridge partner and at least one neighbor bridged in parallel or antiparallel.
G 3-10 helix Two or more consecutive bridge partners at i and i+3.
I pi helix Two or more consecutive bridge partners at i and i+5.
T Turn Bridge partner at i+3, i+4, or i+5, but no bridged neighbor that would qualify them for H, G, or I status.
S Bend Local curvature greater than 70 degrees, measured as the angle between alpha carbons at i-2, i, and i+2.
blank
None Meets none of the criteria above.