Structure Determinaon by NMR - Jay Ponder Lab Home Page

Post on 03-Feb-2022

2 views 0 download

Transcript of Structure Determinaon by NMR - Jay Ponder Lab Home Page

Structure and Properties of Nucleic Acid Components

Human DNA: 3 Billion Base Pairs

Bdl001.pdb

Double Helical B DNA

N

N N

N

NH2

N

N

O

O

H CH3

N

N N

N

O

H2N

H

N

N

NH2

O

A

T

G

C

ACGT bases.cwg

COMPLEMENTARY WATSON CRICK BASE PAIRINGBETWEEN TWO ANTIPARALLEL STRANDS

A base recognizes T, and G recognizes C

N

N

O

N

H

H

N

N

O

OP

O

O

-O

H

H

O-

OP

O

O

O

H

O

N

NN

G C

A T

N

N

NN

N

O

OP

O

O

-O

HH

H

CH3

O-

OP

O

O

OO

O

NN

THE FOUR LETTER CODE: A, C, G, T

B DNA Structure

ACGTTGCA

ACGT

CA

GTTGCA

5'3'

3'5'

3'5'

3'5'

5'3'

ACGTTGCA

5'3'

3'5'

ACGTTGCA

5'3'

3'5'

DNA Replication: 5' to 3', template directedvia Watson Crick Base Pairing

DNA Polymerase

GCA

T

CAT G

5'

5'

3' 5'

3'

5'

O

O P O

O O-

OPO-O

O

O

OH

O

OPO-O O

O

OPO- O

O O

OO-P

O

OO

O P O

OTEMPLATE

PRIMER

B:

O

OO-P

O

O

OPO- O

O

O

OH

O

O- OP

P O

P O-

O

OO-

O-

OOP

O-O

O

O

OH

Replication of DNA

DNA Polymerase

A T G T T A

A - UU - A

G - CU - A

A - U

5'

5'

S

H

NO

O

O

H

ON

O

HH

3'

5'

5'

3'

5'pp

p

3'

TRANSCRIPTION

mRNA

aa-tRNA

DNA

DNA to RNA to PROTEIN

..

codo

n

anti-codon

TRANSLATION

tRNA

Ribosome

O

OH

OH

O

O- OP

P O

P O-

O

OO-

O-

O

O

OH

OH

O

OPO- O

O O

OO-P

O

B:

POLYMERIZATION

TEMPLATE

OO

O P O

O

O

OO-P

O

O

OPO- O

O

O

OPO-O O

O

OH

OHO

O P O

O O-

OH

3'

5'3'

5'

ACG U

CG U

A

PO

P-O

O O-O O-

O

O O-

P O

PO

P-O

O O-O O-

O

O O-

P O

DNA to RNA: Transcription by RNA polymerase

RNA Polymerase

fMetformylmethionine

mRNA to Protein: Translation

N

O

H

SCH3

H

O

NR2

O

B

OHO

O

O

HNH2

R3

O

B

OHO

O

OB

OHHO

O

O

N

O

H

SCH3

H

O

NR

O

B

OHO

O

O

H

shiftposition

O

O

O OH

B

O

R

NH2

O

H

SCH3

H

O

N

B

OHO

O

O

N'N'N'

O

O

O OH

B

O

RNH2

5'

U A C

O

O

O OH

B

ON

O

H

SCH3

H

5'

tailcoding region stopcodon

startcodon

3'U A G

leader

' A U G N N N

Translation

OHO OH

OHHO

CHO

OH

OHOH

OH O

OH

HO

OHHO

O

H

HO OH

HO

CHO

OH

OHOH

H O

OH

HO

HHO

OH

OHCHO

CH2OHOHH

OH

THE SUGAR SUBUNIT OF DNA AND RNA

+

+

D-(-)-ribose D-ribofuranoside

alpha-anomer beta-anomer

beta-anomer alpha-anomer

D-(+)-glyceraldehyde

FISHER PROJECTION

2-deoxyribose

D-deoxyribofuranoside

POP-OO- O-

O

O OO Base

HO OH

O Base

OH

HO

HO

OH

O

O

O-Base-O OP

HO

OH

P POPOO- O-

O

O OO Base

HO

O

O-

-O

α

α

αβ

β

γ

NTP or dNTP

guanylic acid

cytidylic acidadenylic acid

guanosinecytidineadenosine

NDP or dNDP

NMP or dNMPNucleotide monophosphate:

Nucleoside:

uradylic acid (RNA)

thymidylic acid (DNA)

uridine (RNA)thymidine (DNA)

NOTE: In DNA the OHin the brackets is absent

OH

OBaseO

OPO

-O

5'

3'

2',3'-cyclic AMP3',5'-cyclic AMP(cAMP)

Cyclic Nucleotide

Nucleosides and Nucleotides

O

OBaseHO

O

P-O O

3' 2'

O

O P

O-

O

O O O

O

H

O

O

N

N

H

O

O

N

N H

CH3

H

CH3

O

T

OP

O

T

OO

O-

dNpdNpdN = d(NpNpN) = d(NNN) for oligodeoxyribonucleotides

NpNpN = NNN for oligoribonucleotides

d(NNN)Nd(NNN) for mixed oligo ribo and deoxyribonucleotides (embedded substrates, such as embedded RNA)

When strands associate the two sequences are given, both in the 5'-3'

d(ACGTATG)·d(CATACGT)

Polymers.

poly (dA) : homopolymer d(AAA)n = d(AAAAAAA...)poly (dA-dT): heteropolymer d(AT)n = d(ATATATA...)poly (dA,dT): random copolymer d(ATTAATTTTATATTA...)

When polymer strands associate the stoichiometry is given: poly (A) · 2 poly(U)

Structure and Nomenclaturefor dinucleotides and polymers

Table. Rate of hydrolysis of the bases at pH 1. (Kochetkov and Budovskii 1972)

nucleoside rate (´ 104 sec) dA 4.3 dG 8.3 dC .0011 dU .001 A .0036 G .0093 C .00001 U .00001

O

O OP

O

OO- O-

O

O

P

OH B

HO H

O

O OP

O

OO- O-

O

O

P

O

N

O OP

O

OO- O-

O

O

P

NN

NH2N

H

O

N

O OP

O

OO- O-

O

O

P

NN

NH2N

File: acidhyd.cwg

Mechanism for the acid hydrolysis of theglycosidic bond in DNA.

O

Base

O OP

O

OO- O-

O

O

P

O

O OP

O

OO- O-

O

O

P

OH

O

O OP

O

OO- O-

OOP

H

OH H

H

hemiacetalabasic site

O

O OP

O

OO- O-

OOP

H

OH

B

O

OP

O

OO-

H

OH

H -O

O-O

O

P

B

OH

OH

-O

O-O

O

PO-P

O

OO-

3'-phosphate 5'-phosphate

enolization

β-elimination(retro-Michael Rxn)

β-elimination(retro-Michael Rxn)

hydrolysis

Mechanism of strand cleavagefollowing glycosidic bond hydrolysis

BB H

In the presence of piperidine, the b-elimination reactions may take place through the enamine.

Path B

Path A

3'-phosphate

2'-phosphateH

OH B

HB

O

O

OP

O

B

OO

O

P

H

H

O

O

OP

O

PO O

B

O

O

OP

O

PO O

B

HO

H

2',3'-cyclicphosphate

H

OHB

H B

O

O

OP

O

PO O

B

transesterificationO

O

O

P

P

O

OO O

R

H

B

B

B H

O

O

OP

O

PO O

B

H B

BO

H

2',3'-cyclicphosphate

RNA strand cleavage under basic condition

Table. Basic Hydrolysis of RNA. Substrate Conditions Time Temperature Result ApA 0.5 N NaOH 8 h 28ºC 47% hydrolysis RNA 0.1 N NaOH 20 min 100ºC mononucleotides RNA 10% piperidine 90 min 100ºC mononucleotides RNA 1.0 N NaOH 0.7 min 27ºC 1% cleavage RNA pH 9 30 min 70ºC 1% cleavage Table. Acidic Hydrolysis of RNA. Substrate Conditions Time Temperature Result ApA 0.3 N HCl 8 h 45ºC 69% hydrolysis RNA pH 1 100 min 20ºC 10% cleavage RNA pH 2 4 min 100 ºC 10% cleavage RNA pH 3 40 min 100 ºC 10% cleavage

Rates of Hydrolysis of RNA under basic and acidic conditions

NN

N

NH

H

H

H

NN

H

H

H

H

NN

NN

N

N

O

N

H

HH

H N

O H

H

HH

H

H

NN

N

N

N HH

H

H

H NN

O

O

CH3

HH

H

9

8

Bases are shown in their Watson Crick base-pairing arrangement

Gua Cyt

ThyAdothymineadenine

cytosineguanine

6

54

3

21

7

65

4

3 2

1

PyrimidinesPurines

Basic Constituents of Nucleic AcidsTHE BASES

The Four Letter Code: A, C, G, Twhere A recongizes T and C recognizes G

Minor RNA nucleosides, adenine derivatives

m1A m6A

m2A

N

NN

N

NH2

CH3

N

NN

N

NH2

CH3 N

NN

N

NCH3H

N

NN

N

N

O

N

CO2H

OH

CH3

H

H

N

NN

N

NH

N

NN

N

NH

S CH3

N

NN

N

N

O

N

OH

CH3

H

H

O N

OH OH

OH

H

N

NN

N

NHH

OHO

HO OCH3

N

NN

N

O

H

N

NN

N

O

CH3

I m1I

AmN+

t6Ams2t6Ai6A

(dm6A in DNA)

Minor RNA nucleosides, cytosine derivatives

N

N

O

CH3

NH

m3C

N

N

S

NH2

H3C

s2m5C

N

N

O

NH2

OHO

HO OCH3

Cm

s2C

N

N

S

NH2

N

N

O

N

O

CH3

H

m5C

N

N

S

NH2

H3C

ac4C

(dm5C in DNA)

N

N

O

NH2

OR

(DNA only, R=H or a sugar, foundin T even bacteriophages)

Minor RNA nucleosides, guanine derivatives

m1G

m2G

N

NN

N

O

CH3

NH2 N

NN

N

O

NCH3

H

H

N

NN

N

O

OHO

HO OCH3

NH2

H

Gm

N

NN

N

O

NCH3

CH3

H

m2G2

N

NN

N

O

NH2

HH3C

m7G

N

NN

N

O

CH3

N

N O

OCH3

OCH3O

H

CH3

Y

(dm2G in DNA)2

(dm7G in DNA)

(dm1G in DNA)

N

N

O

H

O

Minor RNA nucleosides, Uracil derivatives

D or hU

N

N

O

CH3

O

m3U rT or T

N

N

O

H

O

O

O

HO

V

N

N

O

H

O

RO

O

cm5U R=H, cmm5U, R=CH3

N

N

O

H

S

N

N

S

H

O

H3C

N

N

O

H

O

H3C

N

N

O

H

O

NCH3

HN

N

O

H

O

RO

O

N

N

S

H

O

RO

O

s2cm5U R=H, s2am5U, R=CH3

s2mam5Us2m5U s2U

N

N

O

O CO2H

NH2 N

N

O

H

O

OHO

HO OCH3

UmX

N N

O

H

O

OHO

HO OCH3

H

Y or YU

N

N

O

H

O

HO

(found in DNA of SP 8bacteriophage)

N

NN

N

O

NH

H

H

N

NN

N

HN

H

N

N

O

NHH

N

N

O

O

HCH3

Localization of lone pairs in bases.

N

NN

N

HN

H

N

N

O

NHH

N

NN

N

O

NH

H

H

N

N

O

O

HCH3

In plane lone pairs, σ type

out of plane lone pairs, p typeconjugated to π system

POH

O

O

O Base

HO OH

HO

OH

O

O

O

BaseHO OP

O

HO

OH

O

O

OBaseOP

N

NN

N

HN

H

H

N

N

O

NHH

H

N

NN

N

O

N

H

H

H

N

N

O

O

HCH3

N

N

O

O

H

OH

O

O

OBaseO OP

O

HO

OH

O

O

O

BaseOP

P

OO

O

O Base

HO OH

HO P

OO

O

O Base

HO OH

O

pKa=9.7

pKa=7pKa=1

pKa=9.4 pKa=9.3pKa=4.2pKa=3.5

+ +

pKa = 1

H

uracilthymineguaninecytosineadenine

ACID-BASE PROPERTIES

H H

N

N

O

O

CH3

H

N

NN

N

O

NH

H

H

N

NN

N

NH

H

N

NN

N

HN

H

N

NN

N

NH

H

etc

N

N

O

NHH

N

N

O

NH

H

N

N

O

NH

H

N

NN

N

O

NH

H

H

N

NN

N

O

NH

H H

N

N

O

O

HCH3

N

N

O

O

CH3

H

TAUTOMERS OF THE BASES

amino imino imino

keto enol keto

amino/keto imino/keto imino/enol

keto enol enol

No.

Figure

homo purine 7 homops1.cw2 pyrimdine 5 homopy1.cw2

hetero purine 4 hetpu1.cw2 pyrmidine 1 homopy1.cw2

purine-pyrimidine

AT 4 pupybp1.cw2

GC 2 Total 23

Base Pairing Combinations

ΝΝ

Ν

Ν

Ν ΗΗ

ΗΗ Ν

Ν

Ν

ΝΝ

homo purine - symmetric

Ν

Ν

Ν

Ν

Ν

Η

Η

Η

Η

Ν

ΝΝ

Ν

Ν

Ν Ν

Ν

Ν Ο

Η

ΝΗΗ

ΗΗΝ

Η

Ο Ν

Ν

Ν ΝΝ

Ν

ΝΝ

Ο

Η

Ν

Η

Η

Η

Η

ΝΗ

Ο

ΝΝ

Ν

Ν

homo purine - asymmetric

ΝΝ

Ν

Ν

Ν ΗΗ

Η

Η

Ν

Ν

Ν

Ν

Ν

Ν

Ν

Ν

Ν

Ο

Η

Ν Η

Η

Η

Η

ΝΗ

Ο

ΝΝ

Ν

Ν

ΝΝ

Ν

Ν

Ο

Η

Ν ΗΗ Η

Η

ΝΗ

Ο

ΝΝ

Ν

Ν

1 2

3 4

56

7

Base pairing schemes for identical purines

anti

anti

Hoogsteen

Reverse Hoogsteen

homo pyrimidine, symmetric

Ν Ν

Ο

Ο

Η

Η

Ο

Ο

Ν Ν

Ν

Ν

Ο

Ο

ΗΝ

Ν

Ο

Ο

Η

ΝΝ

Ν

Ο

Η

Η

Η

Η

Ο

Ν

ΝΝ

Ν

Ν

Ν

Ο

ΗΗ

ΗΗ

Ο

Ν

Ν

Ν

Ν

Ν

Ο

Ο

ΗΝ

Ν

Ν

Ο

ΗΗ

homo pyrimidine, asymmetric

12 13

1415

16

ΝΝ

Ο

Ο

ΗΝ

Ν

Ν

Ο

ΗΗ

Ν

Ν

Ο

Ο

Η

Η

Ο

Ο

Ν Ν

17

18

hetero pyrimidine, symmetric

Base pairing schemes for pyrimidines

anti

anti

ΝΝ

Ν

Ν

Ν ΗΗ

ΗΗ Ν

Η

Ο

Ν

Ν

ΝΝ

hetero purine symmetric heteropurine - asymmetric

89

10 11

ΗΗ Ν

Ν

Ν

ΝΝ

ΗΗΝΗ

Ο

ΝΝ

ΝΝ

Η

ΗΝ

Ν

Ν

Ν

ΝΝΝ

Ν

ΝΟ

Η

ΝΗ Η

ΗΗ

Ν Η

Ο

Ν Ν

ΝΝΗ

Η

Ν

Ν

Ν

Ν

Ν

Base Pairing Schemes for mixed purines

anti

anti

ΝΝ

Ν

Ν

Ν ΗΗ

Η

Ο

Ο

ΝΝΝ

Ν

Ν

Ν

Ο

Η

Ν ΗΗ

ΗΗ

Ο

Ν

ΝΝ

19 20

Pseudosymmetric, Watson-Crick Base Pair

Ν

ΝΝ

Ν

Ο

Η

Ν Η

Η

Ν

Ν

Ν

Ο

Η ΗΝ

Ν

Ν

Ν

Ν ΗΗ

ΝΝ

Ο

Ο

Η

21 22

Η

Ο

Ο

ΝΝ

Η

Η

Ν

Ν

Ν

Ν

Ν

23

Η

Ο

Ο

Ν

Ν

Ν

Ν

Ν

Ν

Ν

Η

Η

24

Hoogsteen Reversed Hoogsteen

Reversed Watson-Crick, assymetric

Ν Ν

Ν

ΝΟ

Η

Ν ΗΗ

ΝΝ

Ο

Ο

Η

ΝΝ

Ν

Ο

Η

Η

Η

Η

Ν

ΝΝ

Ν

Ν

Ν

Ν

Ο

Ο

Η

ΗΗΝ

Η

ΟΝ

Ν

ΝΝ

ΝΝ

Ν

Ο

Η

Η

ΝΝ

Ν

ΝΝΗ

Η

25 26

2728

Purine-Pyrimidine Base Pairs

anti anti

anti

anti

anti

Conformational Analysis of Nucleosides and Nucleotides

H5"H5'

OC4'

C5'

O

O3'

H3'

C3'

OH4'

PO2-

OH1'O

H1'

α β

γ δ

ε ζ

χ χ

Definition of Angles in Nucleotides

pyrimidines

C2

C6

C4

N8

purines

C4'O

H3'

H5'

H5" O5'H4'

C5'

O3'O

O

PO

PO2-

O5'O

O

O

C3'

OBase

OH

HO

τ1

τ2

τ3

τ4 τ0 The torsion bonds aredefined by the sugar ringatoms. Example, τ1 is defined by O4'-C1'-C2'-C3'

Definition of the bond angles in deoxyribosering system and the pseudorotation equation

Pseudorotation equation:

τi = τmax.cos(P+144.(i+3))

where P is the phase angle

OBaseOH

OH

OBase

OH

OH

OOH

HO

BaseO

BaseOH

OH

Twist FormEnvelope form

C2'-endo C2'-exo C2'-endo, C3'-exo C2'-exo, C3'-endoT2E 2E

4

5

32

1

4 atoms in plane1 atom out of plane (#5)

4

5

32 1

3 atoms in plane1 atom below plane (#4)1 atom above plane (#5)

O0 0 O O O

P=162

23 T2

3

P=0P=180P=342

Definition of envelope and twistforms of deoxyribose ring system

Pseudorotation Cycle for Ribose Ring

Potential Energy Surface for the Deoxyribose Ring System as a function of τ and P.

τ P

E = kcal, perpendicular to plane of paper

OBase

HO

HO O

BaseHO

HO O

BaseHO

HO

Bad VDWinteractionC2'

H

H

C1'

H

HOC4'

Bad VDWinteractiondue to eclipsedbond

Origin of the high energy of the O4'-exoconformation of deoxyribonucleosides

2' 1'

Figure: synant1.cw2

ON

RO

RO

N

N

N

NH2

ON

RO

RO

N

N

N

NH2

anti syn

Glycosyl Bond Angle Terms

O

PO

O

O-

σ*LUMO

nHOMO

anomericeffect:

Anomeric effect in phosphodiesters explainspreference for gauche conformations

A & B duplex DNA

N3

N N1

N

O N

O6N2

H

H

HN

N9

O

O

O

P

OO

OP

5'O O

O--O

H

5'

3'

H

3'

GO

OC

O

O

P

5'

P

5'3'

3'

O

O-O

-O

5'

3'

G

5'

3'

C

O

O

OP

N9 N3 HN2

3'

5'

OO-

O

O

N1O6

O

P

3'

5'

O O-

DIFFERENT PERSPECTIVES ON THE WATSON CRICK BASE PAIR (BP)

Looking into minor groove

minor groove side

MAJOR GROOVE SIDE

C2 pseudo axis of symmetry

TOP VIEW

SIDE VIEW

SCHEMATIC VIEW

5'-G-3'3'-C-5'

5' 3'G

5'3'C