KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl
description
Transcript of KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl
![Page 1: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/1.jpg)
KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS
Firas Khatib, Carol Rohl Department of Biomolecular Engineering
Can you tell which of these two
protein chains are knotted?
![Page 2: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/2.jpg)
How can we tell if a protein chain is knotted?
This was the highest scoring model using Rosetta’s automated server Robetta for Target 202 in the
CASP6 blind prediction experiment.
Unfortunately this model is knotted. Knotted
Region
Can you tell which of these two
protein chains are knotted?
![Page 3: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/3.jpg)
How can we tell if a protein chain is knotted?
KnotFree
This was the model submitted
by our group using the same alignment as Robetta. It is
devoid of knots.
![Page 4: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/4.jpg)
• Knots in polypeptide chains have been found in very few proteins. [Nureki et al., 2002]
• Only 8 knots have been found in over 2000 known protein structures. [Taylor, 2000, 2002]
• We set out to write a program that checked modeled structures for knots.
Why Knots?
![Page 5: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/5.jpg)
Detecting Knots without Knot Theory
• Joining the ends of a protein chain can create additional knots and cause other problems.
• In daily life we often encounter knots without closed loops: telephone cords, ethernet cables, extension cords, headphones…etc.
• We define such knots as: “pulling both ends results in a tangle that cannot be undone.”
![Page 6: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/6.jpg)
Potential Knot
This is 1Ni5A. How can we find out if it is knotted?
Pulling Both Ends of the Chain
![Page 7: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/7.jpg)
Simplifying
Pulling Both Ends of the Chain
![Page 8: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/8.jpg)
Simplifying
Pulling Both Ends of the Chain
![Page 9: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/9.jpg)
Simplifying
Pulling Both Ends of the Chain
![Page 10: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/10.jpg)
Simplifying
Rotating and moving
Pulling Both Ends of the Chain
![Page 11: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/11.jpg)
Simplifying
Pulling Both Ends of the Chain
![Page 12: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/12.jpg)
Simplifying
Pulling Both Ends of the Chain
![Page 13: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/13.jpg)
Simplifying
Pulling Both Ends of the Chain
![Page 14: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/14.jpg)
Simplifying
Pulling Both Ends of the Chain
![Page 15: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/15.jpg)
Simplifying
Pulling Both Ends of the Chain
![Page 16: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/16.jpg)
Simplifying
Simplifying
Pulling Both Ends of the Chain
![Page 17: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/17.jpg)
N and C termini connected. Therefore 1Ni5A has no knot.
Pulling Both Ends of the Chain
![Page 18: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/18.jpg)
• This concept of “pulling the ends of the chain”
was the basis for the Knotfind Algorithm.
• Knotfind goes through the chain three residues at
a time and simulates the pulling of the ends of the
chain in an attempt to simplify it.
The Knotfind Algorithm
![Page 19: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/19.jpg)
Example of our Algorithm
First iteration through the chain
![Page 20: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/20.jpg)
Example of our Algorithm
Therefore this chain has no knots
![Page 21: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/21.jpg)
This chain cannot be simplified further, so
this is a knot.
In this final stage, all triangles of
three consecutive residues have line segments going through
them.
Example of our Algorithm with a knot
![Page 22: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/22.jpg)
Localizing the Knot
But once we know a knot exists, how
do we find it?
![Page 23: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/23.jpg)
Localizing the Knot
Knotfind specifies the region that
could not be simplified.
![Page 24: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/24.jpg)
Localizing the Knot
Using Knotfind we are able to locate the knotted region of the chain.
![Page 25: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/25.jpg)
Knotfind Results on CASP5 Experiment
After running Knotfind on all of Rosetta’s comparative modeling CASP5 decoy sets from University of Washington’s group:
Total number of chains : 45,366 Total number of knots : 2,163
4.8% of the decoys had knots.
Why do comparative modeling techniques lead to knots in some modeled structures?
![Page 26: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/26.jpg)
We noticed three types of knots
• Two different loop regions linking with one another.
• A loop region threading itself through the template.
• A loop region knotting with itself:
This occurs extremely rarely and is likely the result of a high gap penalty.
![Page 27: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/27.jpg)
• For CASP6 we filtered our loops:
– Libraries of loops were filtered before being placed onto the template.
– They were screened for knots with themselves as well as knots with the template.
How can we prevent knots?
Target 262
The green region is a loop region that conflicted with the template. Because of this, we selected a different alignment for this target.
![Page 28: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/28.jpg)
How did we prevent knots in CASP6?
Robetta’s best model knots itself
with the template
One of our knotted decoys that we
discarded due to two conflicting loop
regions
Both models used the same template,
shown in blue
![Page 29: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/29.jpg)
Knotfind Results after CASP6
After running Knotfind on all of Rosetta’s comparative modeling CASP6 decoy sets from UCSC:
Total number of chains : 117,200 Total number of knots : 1,300
1.1% of the decoys had knots.
For CASP5, 4.8% of the decoys had knots.
![Page 30: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/30.jpg)
Conclusions
• The Knotfind algorithm is simple and fast.
• Knot formation correlates with breaks in the chain.
• Early filtering of loop libraries helps prevent the formation of knots.
• Future work: detecting regions that may not define a knot but are unproteinlike (such as tied shoelaces) .
![Page 31: KNOTFIND: A SIMPLE ALGORITHM FOR DETECTING KNOTS IN PROTEINS Firas Khatib, Carol Rohl](https://reader030.fdocuments.us/reader030/viewer/2022020220/56813c00550346895da55cf8/html5/thumbnails/31.jpg)
Acknowledgements
Carol RohlJosue SamayoaDavid Bernick
Craig LoweMatt WeirauchPaul BourkeBrian Raney
Art LyubimovKevin Karplus