DNA Computing: Mathematics with Molecules

58
DNA Computing: Mathematics with Molecules Russell Deaton Professor Comp. Sci. & Engr. The University of Arkansas Fayetteville, AR 72701 [email protected]

description

DNA Computing: Mathematics with Molecules. Russell Deaton Professor Comp. Sci. & Engr. The University of Arkansas Fayetteville, AR 72701 [email protected]. What is DNA Computing (DNAC) ?. The use of biological molecules, primarily DNA, DNA analogs, and RNA, for computational purposes. - PowerPoint PPT Presentation

Transcript of DNA Computing: Mathematics with Molecules

Page 1: DNA Computing: Mathematics with Molecules

DNA Computing: Mathematics with Molecules

Russell DeatonProfessorComp. Sci. & Engr.The University of ArkansasFayetteville, AR [email protected]

Page 2: DNA Computing: Mathematics with Molecules

What is DNA Computing (DNAC) ?

The use of biological molecules, primarily DNA, DNA analogs, and RNA, for computational purposes.

Page 3: DNA Computing: Mathematics with Molecules

Why Nucleic Acids?• Density (Adleman, Baum):

– DNA: 1 bit per nm3, 1020 molecules– Video: 1 bit per 1012 nm3

• Efficiency (Adleman)– DNA: 1019 ops / J– Supercomputer: 109 ops / J

• Speed (Adleman):– DNA: 1014 ops per s– Supercomputer: 1012 ops per s

Page 4: DNA Computing: Mathematics with Molecules

What makes DNAC possible?• Great advances in molecular biology

– PCR (Polymerase Chain Reaction)– DNA Microarrays– New enzymes and proteins– Better understanding of biological molecules

• Ability to produce massive numbers of DNA molecules with specified sequence and size

• DNA molecules interact through template matching reactions

Page 5: DNA Computing: Mathematics with Molecules

What is a the typical methodology?

• Encoding: Map problem instance onto set of biological molecules and molecular biology protocols

• Molecular Operations: Let molecules react to form potential solutions

• Extraction/Detection: Use protocols to extract result in molecular form

Page 6: DNA Computing: Mathematics with Molecules

PHYSICAL STRUCTURE OF DNA

Nitrogenous Base

34 Å

MajorGroove

Minor Groove

Central Axis

Sugar-PhosphateBackbone

20 Å5’ C

3’ OH

3’ 0HC 5’

5’

3’

3’

5’

Page 7: DNA Computing: Mathematics with Molecules

What is an example?

• “Molecular Computation of Solutions to Combinatorial Problems”

• Adleman, Science, v. 266, p. 1021.

Page 8: DNA Computing: Mathematics with Molecules
Page 9: DNA Computing: Mathematics with Molecules

Algorithm

• Generate Random Paths through the graph.

• Keep only those paths that begin with vin and end with vout.

• If graph has n vertices, then keep only those paths that enter exactly n vertices.

• Keep only those paths that enter all the vertices at least once.

• In any paths remain, say “Yes”; otherwise, say “No”

Page 10: DNA Computing: Mathematics with Molecules

INTER-STRAND HYDROGEN BONDING

Adenine Thymine

to Sugar-PhosphateBackbone

to Sugar-PhosphateBackbone

(+) (-)

(+)(-)

Hydrogen Bond

Guanine Cytosine

to Sugar-PhosphateBackbone

to Sugar-PhosphateBackbone

(-) (+)

(+)(-)

(+)(-)

Page 11: DNA Computing: Mathematics with Molecules

STRAND HYBRIDIZATION

A B

a b

A B

ab

b

B

a

A

HEAT

COOL

ba

A B

OR

100° C

Page 12: DNA Computing: Mathematics with Molecules

DNA LIGATION

’ ’

’ ’

Ligase Joins 5' phosphateto 3' hydroxyl

’ ’

Page 13: DNA Computing: Mathematics with Molecules

Encoding0

1

2

‘GCATGGCC

‘AGCTTAGG

‘ATGGCATG

CCGGTCGA’

CCGGTACC’

‘GCATGGCCAGCTTAGG CCGGTCGA’

‘GCATGGCCATGGCATG CCGGTACC’

00 21

Page 14: DNA Computing: Mathematics with Molecules
Page 15: DNA Computing: Mathematics with Molecules

Massively Parallel SearchV1

E0->1

V0 V2 V3 V4 V5 V6

E1->2 E2->3 E3->4 E4->5 E5->6

V6

E0->6

V0

V3

E0->3

V0 V2 V3 V4 V5 V6

E3->2 E2->3 E3->4 E4->5 E5->6

V5

E4->5

V4 V1 V2

E5->1 E1->2

Page 16: DNA Computing: Mathematics with Molecules

Algorithm

• Generate Random Paths through the graph.

• Keep only those paths that begin with vin and end with vout.

• If graph has n vertices, then keep only those paths that enter exactly n vertices.

• Keep only those paths that enter all the vertices at least once.

• In any paths remain, say “Yes”; otherwise, say “No”

Page 17: DNA Computing: Mathematics with Molecules

DNA Polymerase

Page 18: DNA Computing: Mathematics with Molecules

POLYMERASE CHAIN

REACTION

Page 19: DNA Computing: Mathematics with Molecules

Start = V0, Stop = V6V1

E0->1

V0 V2 V3 V4 V5 V6

E1->2 E2->3 E3->4 E4->5 E5->6

V6

E0->6

V0

V3

E0->3

V0 V2 V3 V4 V5 V6

E3->2 E2->3 E3->4 E4->5 E5->6

V5

E4->5

V4 V1 V2

E5->1 E1->2

Page 20: DNA Computing: Mathematics with Molecules

Algorithm

• Generate Random Paths through the graph.

• Keep only those paths that begin with vin and end with vout.

• If graph has n vertices, then keep only those paths that enter exactly n vertices.

• Keep only those paths that enter all the vertices at least once.

• In any paths remain, say “Yes”; otherwise, say “No”

Page 21: DNA Computing: Mathematics with Molecules

GEL ELECTROPHORESIS - SIZE SORTING

BufferGel

Electrode

Electrode

Samples

Faster

Slower

Page 22: DNA Computing: Mathematics with Molecules

Right LengthV1

E0->1

V0 V2 V3 V4 V5 V6

E1->2 E2->3 E3->4 E4->5 E5->6

V6

E0->6

V0

V3

E0->3

V0 V2 V3 V4 V5 V6

E3->2 E2->3 E3->4 E4->5 E5->6

Page 23: DNA Computing: Mathematics with Molecules

Algorithm

• Generate Random Paths through the graph.

• Keep only those paths that begin with vin and end with vout.

• If graph has n vertices, then keep only those paths that enter exactly n vertices.

• Keep only those paths that enter all the vertices at least once.

• In any paths remain, say “Yes”; otherwise, say “No”

Page 24: DNA Computing: Mathematics with Molecules

ANTIBODY AFFINITY

CACCATGTGAC

GTGGTACACTG B

PMP

+

Anneal

CACCATGTGAC

GTGGTACACTG B+

CACCATGTGAC

GTGGTACACTG B PMP

Bind

Add oligo withBiotin label

Heat and cool

Add Paramagnetic-Streptavidin

Particles

Isolate with MagnetN

S

Page 25: DNA Computing: Mathematics with Molecules

Every VertexV1

E0->1

V0 V2 V3 V4 V5 V6

E1->2 E2->3 E3->4 E4->5 E5->6

V3

E0->3

V0 V2 V3 V4 V5 V6

E3->2 E2->3 E3->4 E4->5 E5->6

Page 26: DNA Computing: Mathematics with Molecules

Algorithm

• Generate Random Paths through the graph.

• Keep only those paths that begin with vin and end with vout.

• If graph has n vertices, then keep only those paths that enter exactly n vertices.

• Keep only those paths that enter all the vertices at least once.

• In any paths remain, say “Yes”; otherwise, say “No”

Page 27: DNA Computing: Mathematics with Molecules

Hamiltonian PathV1

E0->1

V0 V2 V3 V4 V5 V6

E1->2 E2->3 E3->4 E4->5 E5->6

Page 28: DNA Computing: Mathematics with Molecules

Mismatches

Page 29: DNA Computing: Mathematics with Molecules

DNA Word Design

• Importance of Template-Matching Hybridization Reactions in DNA Computing (DNAC)

• Sequence design should implement DNAC architecture.– Planned Hybridizations– Problem Size– Subsequent Processing Reactions

• Designed sequences should minimize unplanned “cross-hybridizations.”

• Consequences of Bad Designs: Errors and Poor Efficiency

Page 30: DNA Computing: Mathematics with Molecules

DNA Word Design

• Design problem is hard.

• As number of sequences required to represent the problem increases, this constraints increasingly conflicts with the requirement of non-crosshybridization.

• How much of DNA sequence space is available for computation?

Page 31: DNA Computing: Mathematics with Molecules

Why In Vitro?

• In Vitro Selection and Evolution• PCR as tool for selection• Ability to synthesis huge, random starting

populations• Mutagenesis• Oligos manufactured under conditions for use• Use massive parallelism of DNAC to solve

word design problem

Page 32: DNA Computing: Mathematics with Molecules

Protocol Outline

• Start with huge population of random sequences with attached primers.

• Anneal rapidly to quench oligos in mismatched configurations.

• Using temperature as a control, melt most mismatched pairs.

• Amplify and purify• Repeat

Page 33: DNA Computing: Mathematics with Molecules
Page 34: DNA Computing: Mathematics with Molecules
Page 35: DNA Computing: Mathematics with Molecules

Experimental Results

Page 36: DNA Computing: Mathematics with Molecules

Experimental Results

Page 37: DNA Computing: Mathematics with Molecules

Latest Results

Page 38: DNA Computing: Mathematics with Molecules

DNA Memories

Page 39: DNA Computing: Mathematics with Molecules

OverviewInput DNAs

(Unknown Seq.)Sequences Comple-

mentary to Input DNAs

Memory DNA Strands(With the 3’ end Comple-

mentary to the Input DNAs)

New UnknownInput DNAs

Learning Recall Output

Tag1Random

Probe

Separates Memory DNA Strands that Match orPartially Match the

New Inputs from ThoseThat Don’t Match

Labeled Tag Sequence Complements

Page 40: DNA Computing: Mathematics with Molecules

Learning

• Learning: Information acquired from examples rather than programmed

• Protocol to store input DNAs (possibly of unknown sequence)

• Higher level representation of the input sequences• Not individual sequence memories but whole populations• Clustering of input sequences in vitro• Massively random and parallel copying or sampling

depending on number of inputs and probes

Page 41: DNA Computing: Mathematics with Molecules
Page 42: DNA Computing: Mathematics with Molecules

Base-by-Base Amplification

Input DNA

Tag Probe Extension

Page 43: DNA Computing: Mathematics with Molecules

Sampling

Input DNA

Tag Probe Extension

Page 44: DNA Computing: Mathematics with Molecules

Energy Surface Manipulation through Learning

Energy

Input Sequence

Energy

Input Sequence

Before Learning After Learning

Page 45: DNA Computing: Mathematics with Molecules

Tags

• Non-Crosshybridizing Sequences• Convenient for Input/Output in absence of input

sequence information• Manipulate memory without input sequences• Implement DNA2DNA Computations (Landweber

and Lipton, DNA 3)

Page 46: DNA Computing: Mathematics with Molecules

Recall

• Hybridization to retrieve memories

• Similar sequences patterns matched

• Pattern matching done against whole memory

• Single memory associated with single tags

• Memory composite of output on multiple tags

Page 47: DNA Computing: Mathematics with Molecules
Page 48: DNA Computing: Mathematics with Molecules
Page 49: DNA Computing: Mathematics with Molecules

Experiments

• Test learning and recall with plasmid

• Test of sensitivity in concentration

• Test coverage of input sequence space with:– Plasmids (5k bp)– E. Coli (5M bp)

• Test sequence resolution of protocols

Page 50: DNA Computing: Mathematics with Molecules
Page 51: DNA Computing: Mathematics with Molecules

LearningInput 1 is a 3 kb linear DNA

(pBluescript)

Input 2 is a 5 kb linear DNA(x 174)

I1 I2 I1 I2

Input DNAs40

60

100

M1 M2

Memory DNAs

Page 52: DNA Computing: Mathematics with Molecules

Input 1 Input 2

Stained Gel Blot with Memory 1(pBluescript)

Blot with Memory 2(x 174)

Plasmid inputs learned, similar sequences recalled, and dissimilar not matched.

Recall

Page 53: DNA Computing: Mathematics with Molecules

Concentration Sensitivity

1. Plasmids digested with Hpa II

2. 1 g pBluescript3. 10ng - 800ng x 1744. Blotted with x 174

memory5. 1% x 174 detected in

background of pBluescript

Page 54: DNA Computing: Mathematics with Molecules

Input Space Coverage

• Randomly digested input

• Learning on both inputs

• Blots nearly identical

Page 55: DNA Computing: Mathematics with Molecules

E. coli

1. E. coli digested

2. 219bp fragment of x 174 added

3. Learning with and without fragment

4. Fragment distinguished when learned

Page 56: DNA Computing: Mathematics with Molecules

Application

Page 57: DNA Computing: Mathematics with Molecules

Team

• Russell Deaton, University of Arkansas, Computer Science and Engineering

• Junghuei Chen, University of Delaware, Chemistry and Biochemistry

• Hong Bi, University of Delaware, Chemistry and Biochemistry

• Max Garzon, University of Memphis, Computer Science• Harvey Rubin, University of Pennsyvania, School of

Medicine• David Wood, University of Delaware, Computer and

Information Science

Page 58: DNA Computing: Mathematics with Molecules

Acknowledgement

• This work was supported by the NSF QuBIC program, award number EIA-0130385