Geometric Search and Crystal Structure Determination


Page 1: Geometric Search and Crystal Structure Determination

Geometric Search and Crystal Structure Determination

Andy Wilson

Page 2: Geometric Search and Crystal Structure Determination

Overview

Driving Problem
Crystallographic Methods
Relation to computational geometry
Solving for phases
Complications and Future Work
Conclusions

Page 3: Geometric Search and Crystal Structure Determination

Driving Problem

What is a protein’s shape?
  – Shape determines function
  – DNA sequencing gives amino acid sequence
  – AA sequence is the primary structure
  – Unknown mapping between AA sequence and secondary structure

Page 4: Geometric Search and Crystal Structure Determination

Two Molecules

Page 5: Geometric Search and Crystal Structure Determination

More Molecules

Page 6: Geometric Search and Crystal Structure Determination

Driving Problem 2

Mechanical optimization alone won’t work
  – Problem space has several thousand dimensions
  – Local minima are everywhere
  – Can refine a “close” guess

Humans are good at fitting models
  – Need something to fit to
  – Can hand off to automated methods

Page 7: Geometric Search and Crystal Structure Determination

Objective

Generate an approximate electron density map.
Have a scientist fit a model to this map.
Refine using other methods.

Page 8: Geometric Search and Crystal Structure Determination

Observing Proteins

X-ray crystallography to the rescue!
  – Crystallize protein, exploit repetitive structure
  – Observe electron density of molecule

XRC alone isn’t good enough
  – Output is (roughly) the Fourier transform of the electron density map
  – BUT… the phases are lost irretrievably
  – Magnitudes aren’t enough to reconstruct input
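A minimal sketch of why magnitudes alone fall short, assuming a toy 1-D "density" and a hand-written discrete Fourier transform (the names `dft`, `idft`, `density` and `shifted` are illustrative, not from the slides). It shows that shifting the density leaves every magnitude unchanged, and that inverting the magnitudes with all phases set to zero does not give the original density back.

```python
import cmath
import math

def dft(signal):
    """Discrete Fourier transform of a sequence (toy stand-in for diffraction)."""
    n = len(signal)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(signal))
        for k in range(n)
    ]

def idft(coeffs):
    """Inverse discrete Fourier transform."""
    n = len(coeffs)
    return [
        sum(c * cmath.exp(2j * math.pi * k * t / n) for k, c in enumerate(coeffs)) / n
        for t in range(n)
    ]

# Toy 1-D "electron density": two peaks of different height (assumed data).
density = [0.0, 3.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0]
shifted = density[3:] + density[:3]  # same peaks, different origin

coeffs = dft(density)
magnitudes = [abs(c) for c in coeffs]
shifted_magnitudes = [abs(c) for c in dft(shifted)]

# With the true phases, the density is recovered.
recovered = [v.real for v in idft(coeffs)]

# With the phases thrown away (set to zero), it is not.
zero_phase = [v.real for v in idft(magnitudes)]

print("magnitudes:        ", [round(m, 3) for m in magnitudes])
print("shifted magnitudes:", [round(m, 3) for m in shifted_magnitudes])
print("true density:      ", density)
print("zero-phase guess:  ", [round(v, 3) for v in zero_phase])
```

Two different densities sharing one set of magnitudes is the toy version of the problem on this slide: the measurement cannot tell them apart without phase information.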

Page 9: Geometric Search and Crystal Structure Determination

Recovering Phases

Random phases
  – doesn’t work - not enough structure in magnitudes

Similar structure
  – Guess that trial molecule is like a known one

Direct methods
  – Exploit relationships between reflections

Page 10: Geometric Search and Crystal Structure Determination

Input

Cloud of reflections
  – each corresponds to a beam of X-rays
  – has position, magnitude, (unknown) phase
  – arranged on regular lattice

Symmetry group
  – If molecule has symmetry, only need solve part of it
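One possible way to hold this input in code, as a sketch only: the `Reflection` class and `friedel_mate` helper are hypothetical names, not part of any program mentioned in the slides. The helper illustrates the simplest symmetry saving, Friedel's law (for a real-valued density without anomalous scattering, the reflection at -h has the same magnitude and the negated phase), which is an assumption added here and not something the slide spells out.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

@dataclass
class Reflection:
    """One measured reflection (hypothetical layout for illustration)."""
    index: Tuple[int, int, int]    # position on the regular lattice, (h, k, l)
    magnitude: float               # measured
    phase: Optional[float] = None  # radians; unknown (None) until solved for

def friedel_mate(r: Reflection) -> Reflection:
    """Return the reflection at -index implied by Friedel's law (assumed to hold)."""
    h, k, l = r.index
    phase = None if r.phase is None else -r.phase
    return Reflection((-h, -k, -l), r.magnitude, phase)

measured = Reflection(index=(1, 2, 3), magnitude=5.0)
print(friedel_mate(measured))
```

Under that assumption only half of the lattice needs to be solved; a crystal's own symmetry group can reduce the unique set further.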

Page 11: Geometric Search and Crystal Structure Determination

Output

System of linear constraints on phases
Phase and magnitude for reflections
Approximate electron density map

Page 12: Geometric Search and Crystal Structure Determination

Direct Methods

Solve for sums of phases of a group of reflections
Probabilistic, symbolic method
Relates phases of 3 or 4 reflections
Objective: solve for enough phases to synthesize a rough electron map
Method: search for phase invariants

Page 13: Geometric Search and Crystal Structure Determination

Phase Invariants

Groups of 3 or 4 reflections with a certain geometric relationship
Sum of phases remains constant
  – probably! Probability increases with strength of reflections
Invariant to rotation of molecule

Page 14: Geometric Search and Crystal Structure Determination

Finding Invariants

Compute strength of reflections
Choose triplets
Test strength
Incorporate into constraints

Page 15: Geometric Search and Crystal Structure Determination

Reflection Strength

For each reflection:
  – Find average magnitude of nearby reflections
  – “Strength” of a reflection is its magnitude divided by average over neighborhood
  – An especially strong reflection has magnitude greater than 2x average
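The strength definition above can be sketched as follows. The neighborhood radius, the choice to leave a reflection out of its own average, and the dictionary layout are all assumptions made for illustration; the slides do not specify them. The neighbor scan is brute force, so this only suits small inputs.

```python
import math

def reflection_strengths(magnitudes, radius=2.0):
    """Map each lattice position to magnitude / average magnitude of its neighbors.

    magnitudes: dict mapping (h, k, l) -> measured magnitude.
    radius: neighborhood radius in lattice units (assumed value).
    """
    strengths = {}
    for pos, mag in magnitudes.items():
        # Brute-force neighbor scan; the reflection itself is excluded (assumption).
        neighbors = [
            m for other, m in magnitudes.items()
            if other != pos and math.dist(pos, other) <= radius
        ]
        if not neighbors:
            continue  # no neighborhood, strength undefined
        average = sum(neighbors) / len(neighbors)
        if average > 0:
            strengths[pos] = mag / average
    return strengths

# Made-up magnitudes along one lattice row.
example = {(0, 0, 0): 1.0, (1, 0, 0): 1.0, (2, 0, 0): 6.0, (3, 0, 0): 1.0, (4, 0, 0): 1.0}
strengths = reflection_strengths(example, radius=1.0)

# "Especially strong" per the slide: more than 2x the neighborhood average.
strong = [pos for pos, s in strengths.items() if s > 2.0]
print(strong)
```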

Page 16: Geometric Search and Crystal Structure Determination

Searching for Invariants

Choose three reflections h, k, -(h+k)
If vector sum of positions is zero, sum of phases is (probably) zero
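A small numerical check of what makes such a triplet special, under assumed toy data: three unit point atoms at made-up fractional coordinates, with structure factors computed directly from them. For three reflections whose positions sum to zero, written here as h, k and l = -(h + k), the sum of the three phases does not change when the origin is moved, even though each individual phase does. The function names are illustrative.

```python
import cmath
import math

def structure_factor(h, atoms):
    """Structure factor of unit point atoms at fractional coordinates."""
    return sum(
        cmath.exp(2j * math.pi * sum(hi * xi for hi, xi in zip(h, x)))
        for x in atoms
    )

def triplet_invariant(h, k, atoms):
    """Return exp(i * (phi_h + phi_k + phi_l)) for l = -(h + k)."""
    l = tuple(-(a + b) for a, b in zip(h, k))
    product = 1
    for v in (h, k, l):
        f = structure_factor(v, atoms)
        product *= f / abs(f)  # keep only the phase factor
    return product

def shift_origin(atoms, t):
    """Translate every atom by t (equivalent to moving the origin)."""
    return [tuple(xi + ti for xi, ti in zip(x, t)) for x in atoms]

atoms = [(0.10, 0.20, 0.30), (0.45, 0.15, 0.80), (0.70, 0.60, 0.25)]  # made up
moved = shift_origin(atoms, (0.13, 0.37, 0.21))
h, k = (1, 2, 0), (2, -1, 1)

before = triplet_invariant(h, k, atoms)
after = triplet_invariant(h, k, moved)
phase_h_before = cmath.phase(structure_factor(h, atoms))
phase_h_after = cmath.phase(structure_factor(h, moved))

print("single phase:", round(phase_h_before, 4), "->", round(phase_h_after, 4))
print("triplet sum: ", round(cmath.phase(before), 4), "->", round(cmath.phase(after), 4))
```

The check only shows that the triplet sum is well defined regardless of origin; the claim that it is probably near zero for strong reflections is the statistical part and is not demonstrated here.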

Page 17: Geometric Search and Crystal Structure Determination

Searching for invariants

Naïve search is at least N choose 2 (or 3)
  – O(n^2) or O(n^3), which is expensive with >20000 reflections

Accelerate search with a spatial data structure
  – k-D tree is well suited to this task
  – Has to support nearest-neighbor queries
  – Could probably fake it with range queries
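For scale: with 20000 reflections, N choose 2 is about 2 x 10^8 pairs and N choose 3 is about 1.3 x 10^12 triples. The block below is a minimal k-d tree with a nearest-neighbor query, written as a sketch of the kind of structure the slide has in mind; it is not taken from any existing crystallography code, and the dictionary-based node layout and function names are assumptions.

```python
import math

def build_kdtree(points, depth=0):
    """Build a k-d tree as nested dicts, splitting on axes in rotation."""
    if not points:
        return None
    axis = depth % len(points[0])
    points = sorted(points, key=lambda p: p[axis])
    mid = len(points) // 2
    return {
        "point": points[mid],
        "axis": axis,
        "left": build_kdtree(points[:mid], depth + 1),
        "right": build_kdtree(points[mid + 1:], depth + 1),
    }

def nearest(node, target, best=None):
    """Return the stored point closest to target."""
    if node is None:
        return best
    point, axis = node["point"], node["axis"]
    if best is None or math.dist(point, target) < math.dist(best, target):
        best = point
    diff = target[axis] - point[axis]
    near, far = (node["left"], node["right"]) if diff < 0 else (node["right"], node["left"])
    best = nearest(near, target, best)
    # Visit the far side only if the splitting plane is closer than the best hit.
    if abs(diff) < math.dist(best, target):
        best = nearest(far, target, best)
    return best

# Lattice positions with some arbitrary gaps (made-up data).
reflections = [
    (h, k, l)
    for h in range(-3, 4) for k in range(-3, 4) for l in range(-3, 4)
    if (h + 2 * k + 3 * l) % 4 != 0
]
tree = build_kdtree(reflections)
print(nearest(tree, (0.2, -1.6, 2.4)))
```

Building the tree costs a sort per level; each query then typically touches far fewer points than a linear scan, which is the reason for using it inside the pair loop.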

Page 18: Geometric Search and Crystal Structure Determination

Let the computer search

Idea 1: search for third reflection
  – Pick the first two with for-loops
  – Search nearest neighbors to look for the third
  – Accept or reject based on distance, strength

Idea 2: search small chunks
  – Subdivide space with a regular grid
  – Choose 3 chunks in “invariant pattern”
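Idea 1 might look roughly like this. Note the substitution: because the reflections here sit exactly on an integer lattice, a hash-set lookup of the required third position stands in for the nearest-neighbor query, so the "distance" test collapses to an exact match. With inexact positions, a spatial query with a distance tolerance would take its place. The function name, the strength threshold and the sample data are assumptions.

```python
from itertools import combinations

def find_triplets(strengths, min_strength=2.0):
    """Find sets of three strong reflections whose positions sum to zero.

    strengths: dict mapping (h, k, l) -> strength.
    """
    strong = {pos for pos, s in strengths.items() if s >= min_strength}
    triplets = set()
    # Pick the first two reflections with a loop over pairs...
    for h, k in combinations(sorted(strong), 2):
        # ...then look up the position the third one must occupy.
        third = tuple(-(a + b) for a, b in zip(h, k))
        # Accept only if it exists, is strong, and is distinct from the first two.
        if third in strong and third != h and third != k:
            triplets.add(frozenset((h, k, third)))
    return triplets

# Made-up strengths for a handful of reflections.
strengths = {
    (1, 0, 0): 3.0, (0, 1, 0): 2.5, (-1, -1, 0): 2.2,
    (2, 0, 0): 0.5, (1, 1, 0): 4.0, (-2, -1, 0): 2.1,
}
triplets = find_triplets(strengths)
for t in triplets:
    print(sorted(t))
```

Filtering on strength before the pair loop shrinks n itself, which matters more than the lookup cost when the loop is quadratic.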

Page 19: Geometric Search and Crystal Structure Determination

Using the results

Invariants specify constraints on phases
Fix one phase, then solve for others
Use phases and magnitudes to construct electron map
Let scientist try to fit model to map
Generate more constraints if necessary
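A toy illustration of "fix one phase, then solve for others", with several simplifying assumptions: reflections are indexed by a single integer (a 1-D lattice), each triplet relation is treated as exact instead of merely probable, and Friedel's law (the phase at -h is the negated phase at h) is assumed. Practical direct methods weight each relation by its probability. The function names are hypothetical.

```python
import math

def wrap(angle):
    """Wrap an angle in radians into the range [-pi, pi]."""
    return math.atan2(math.sin(angle), math.cos(angle))

def propagate_phases(triplets, known):
    """Fill in phases from triplets (h, k, l) with h + k + l == 0.

    Each triplet is treated as the exact constraint phi_h + phi_k + phi_l = 0.
    known: dict mapping index -> phase for the starting reflection(s).
    """
    phases = dict(known)

    def lookup(h):
        if h in phases:
            return phases[h]
        if -h in phases:  # Friedel's law (assumed)
            return -phases[-h]
        return None

    changed = True
    while changed:
        changed = False
        for triplet in triplets:
            unknown = [h for h in triplet if lookup(h) is None]
            if len(unknown) == 1:
                total = sum(lookup(h) for h in triplet if lookup(h) is not None)
                phases[unknown[0]] = wrap(-total)
                changed = True
    return phases

# Fix the phase of reflection 1, then let two triplets determine the rest.
triplets = [(1, 1, -2), (1, 2, -3)]
phases = propagate_phases(triplets, {1: 0.4})
print(phases)
```

Once enough phases are filled in, they can be paired with the measured magnitudes and passed through an inverse Fourier transform to give the rough map a scientist fits a model to.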

Page 20: Geometric Search and Crystal Structure Determination

Complications

Strength of reflections
  – As problem size increases, strength goes down

Do certain structures make certain patterns?
  – Disulfide bonds
  – Alpha helices
  – Beta sheets/barrels

Memory locality
  – Nested loops in search are harmful

Page 21: Geometric Search and Crystal Structure Determination

Future Work

Implement invariant search.
  – CORWIN already has groundwork

Consider substructure invariants.
  – Finding them is hard
  – Searching for them is even harder

Page 22: Geometric Search and Crystal Structure Determination

For More Information

GRIP library or team members
Dickerson and Geis, Protein Structure and Action.
Glusker and Trueblood, Crystal Structure Analysis: A Primer.
Schenk, Introduction to Structure Invariants and Seminvariants.

See Andy or Darlene Freedman to get these.