From Gene to Protein · 3¢ 3¢ 3 ¢ 5¢ G G G GC C T C A A A A A A A ... G PPP Stop codon UTR...
Transcript of From Gene to Protein · 3¢ 3¢ 3 ¢ 5¢ G G G GC C T C A A A A A A A ... G PPP Stop codon UTR...
LECTURE PRESENTATIONSFor CAMPBELL BIOLOGY, NINTH EDITION
Jane B. Reece, Lisa A. Urry, Michael L. Cain, Steven A. Wasserman, Peter V. Minorsky, Robert B. Jackson
© 2011 Pearson Education, Inc.
Lectures byErin Barley
Kathleen Fitzpatrick
From Gene to Protein
Chapter 17
Lectures modified by Garrett Dancik
Overview of gene expression
© 2011 Pearson Education, Inc.
TAGC
4-character alphabet
20-character alphabet
• Questions:– A gene (DNA) is physically located on what molecule?– Where in a cell is DNA stored?
transcription translationCentral Dogma of Molecular Biology:
Basic Principles of Transcription and Translation• RNA is the bridge between genes and the proteins
for which they code• Transcription is the synthesis of RNA under the
direction of DNA• Transcription produces messenger RNA (mRNA)• Translation is the synthesis of a polypeptide,
using information in the mRNA– What are the building blocks of polypeptides?
• Where does translation occurs?
© 2011 Pearson Education, Inc.
• A primary transcript is the initial RNA transcript from any gene prior to processing
• The central dogma is the concept that cells are governed by a cellular chain of command: DNA ® RNA ® protein
© 2011 Pearson Education, Inc.
Figure 17.3
DNA
mRNARibosome
Polypeptide
TRANSCRIPTION
TRANSLATION
TRANSCRIPTION
TRANSLATION
Polypeptide
Ribosome
DNA
mRNA
Pre-mRNARNA PROCESSING
(a) Bacterial cell (b) Eukaryotic cell
Nuclearenvelope
Figure 17.3a-1
Transcription and Translation in Prokaryotes
TRANSCRIPTION DNA
mRNAmRNA
TRANSLATIONRibosome
Polypeptide
• In prokaryotes, translation of mRNA can begin before transcription has finished
RNA PROCESSING
mRNA
Nuclearenvelope
DNA
Pre-mRNA
TRANSCRIPTION
• In a eukaryotic cell, the nuclear envelope separates transcription from translation
• Eukaryotic RNA transcripts are modified through RNA processing to yield finished mRNA
Transcription and Translation in Eukaryotes
TRANSLATION Ribosome
Polypeptide
RNA PROCESSING
mRNA
TRANSCRIPTION DNA
Nuclearenvelope
The Genetic Code
• The genetic code refers to the code (rules) that governs how a DNA sequence is converted to a protein (amino acid) sequence
• There are 20 amino acids, but only four nucleotide bases in DNA (and RNA)– What is the minimum number of nucleotides
needed to code for a single amino acid?
© 2011 Pearson Education, Inc.
DNAtemplatestrand
TRANSCRIPTION
mRNA
TRANSLATION
Protein
Amino acid
Codon
Trp Phe Gly
5¢
5¢
Ser
U U U U U3¢
3¢
5¢3¢
G
G
G G C C
T
C
A
A
AAAAA
T T T T
T
G
G G G
C C C G GDNAmolecule
Gene 1
Gene 2
Gene 3
C C
• The genetic code is a triplet code where a 3-nucleotide DNA word codes for a 3-nucleotide mRNA word (a codon) which codes for an amino acid
anti-sense (template) strand
sense (non-template) strand
• During transcription, one of the two DNA strands, called the template strand, provides a template for ordering the sequence of complementary nucleotides in an RNA transcript
• The template strand is always the same strand for a given gene
• During translation, the mRNA base triplets, called codons, are read in the 5¢ to 3¢ direction
• Each codon specifies the amino acid (one of 20) to be placed at the corresponding position along a polypeptide
© 2011 Pearson Education, Inc.
Cracking the Code
• All 64 codons were deciphered by the mid-1960s• Of the 64 triplets, 61 code for amino acids; 3
triplets are “stop” signals to end translation• The genetic code is redundant (more than one
codon may specify a particular amino acid) but not ambiguous; no codon specifies more than one amino acid
• Codons must be read in the correct reading frame (correct groupings) in order for the specified polypeptide to be produced
© 2011 Pearson Education, Inc.
Second mRNA base
Firs
t mR
NA
base
(5¢ e
nd o
f cod
on)
Third
mR
NA
base
(3¢ e
nd o
f cod
on)
UUUUUCUUA
CUUCUCCUACUG
Phe
Leu
Leu
Ile
UCUUCCUCAUCG
Ser
CCUCCCCCACCG
UAUUAC
Tyr
Pro
Thr
UAA StopUAG Stop
UGA Stop
UGUUGC
Cys
UGG Trp
GCU
U
C
A
U
U
C
C
CA
U
A
A
A
G
G
His
Gln
Asn
Lys
Asp
CAU CGUCACCAACAG
CGCCGACGG
G
AUUAUCAUA
ACUACCACA
AAUAACAAA
AGUAGCAGA
Arg
Ser
Arg
Gly
ACGAUG AAG AGG
GUUGUCGUAGUG
GCUGCCGCAGCG
GAUGACGAAGAG
Val Ala
GGUGGCGGAGGG
GluGly
G
UCA
Met orstart
UUG
G
• 64 triplets– 61 code for
amino acids– 3 triplets are “stop” codons
• The genetic code is redundant but not ambiguous
Properties of the Genetic Code
The Universality of the Genetic Code
• The genetic code is nearly universal, shared by the simplest bacteria to the most complex animals
• Genes can be transcribed and translated after being transplanted from one species to another
• Examples: https://www.bbc.com/news/science-environment-14882008
• https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4006694/
© 2011 Pearson Education, Inc.
Pig expressing a jellyfish gene
Transcription is the DNA-directed synthesis of RNA: a closer look• Transcription consists of three steps
– Initiation• The DNA sequence where RNA polymerase
attaches is called the promoter– Elongation
• RNA polymerase pries the DNA strands apart and hooks together the RNA nucleotides
• The RNA is complementary to the DNA template strand, but uracil replaces thymine
– Termination• in bacteria, the sequence signaling the end of
transcription is called the terminator© 2011 Pearson Education, Inc.
Promoter
RNA polymeraseStart point
DNA5¢3¢
Transcription unit
3¢5¢
Promoter
RNA polymeraseStart point
DNA5¢3¢
Transcription unit
3¢5¢
Initiation
5¢3¢
3¢5¢
Nontemplate strand of DNA
Template strand of DNARNAtranscriptUnwound
DNA
1
Promoter
RNA polymeraseStart point
DNA5¢3¢
Transcription unit
3¢5¢
Elongation
5¢3¢
3¢5¢
Nontemplate strand of DNA
Template strand of DNARNAtranscriptUnwound
DNA2
3¢5¢3¢5¢
3¢
RewoundDNA
RNAtranscript
5¢
Initiation1
Promoter
RNA polymeraseStart point
DNA5¢3¢
Transcription unit
3¢5¢
Elongation
5¢3¢
3¢5¢
Nontemplate strand of DNA
Template strand of DNARNAtranscriptUnwound
DNA2
3¢5¢3¢5¢
3¢
RewoundDNA
RNAtranscript
5¢
Termination3
3¢5¢
5¢ Completed RNA transcript
Direction of transcription (“downstream”)
5¢3¢
3¢
Initiation1
RNA Polymerase Binding and Initiation of Transcription
• Promoters signal the transcriptional start point and usually extend several dozen nucleotide pairs upstream of the start point
• Transcription factors mediate the binding of RNA polymerase and the initiation of transcription
• The completed assembly of transcription factors and RNA polymerase II bound to a promoter is called a transcription initiation complex
• A promoter called a TATA box is crucial in forming the initiation complex in eukaryotes
© 2011 Pearson Education, Inc.
Figure 17.8
Transcription initiationcomplex forms
3
DNAPromoter Nontemplate strand
5¢3¢
5¢3¢
5¢3¢
Transcriptionfactors
RNA polymerase IITranscription factors
5¢3¢
5¢3¢
5¢3¢
RNA transcript
Transcription initiation complex
5¢ 3¢
TATA box
TT T T T TA A A AA
A AT
Several transcriptionfactors bind to DNA
2
A eukaryotic promoter1
Start point Template strand
• Transcription in eukaryotes
Elongation of the RNA Strand
• As RNA polymerase moves along the DNA, it untwists the double helix, 10 to 20 bases at a time
• Transcription progresses at a rate of 40 nucleotides per second in eukaryotes
• A gene can be transcribed simultaneously by several RNA polymerases
• Nucleotides are added to the 3¢ end of the growing RNA molecule
© 2011 Pearson Education, Inc.
Nontemplatestrand of DNA
RNA nucleotides
RNApolymerase
Templatestrand of DNA
3¢
3¢5¢
5¢
5¢3¢
Newly madeRNA
Direction of transcription
A
A A A
AA
A
T
T TT
TTT G
GG
C
C C
CC
G
C CC A AA
U
U
U
end
Figure 17.9
Termination of Transcription
• The mechanisms of termination are different in bacteria and eukaryotes
• In bacteria, the polymerase stops transcription after reaching a terminator sequence and the mRNA can be translated without further modification
• In eukaryotes, RNA polymerase II transcribes the polyadenylation signal sequence (AAUAAA); the RNA transcript is released 10–35 nucleotides past this polyadenylation sequence
© 2011 Pearson Education, Inc.
Eukaryotic cells modify RNA after transcription
• Enzymes in the eukaryotic nucleus modify pre-mRNA (RNA processing) before the genetic messages are dispatched to the cytoplasm
• During RNA processing, both ends of the primary transcript are usually altered
• Also, usually some interior parts of the molecule are cut out, and the other parts spliced together
© 2011 Pearson Education, Inc.
Alteration of mRNA Ends
• Each end of a pre-mRNA molecule is modified in a particular way
– The 5¢ end receives a modified nucleotide 5¢cap
– The 3¢ end gets a poly-A tail• These modifications share several functions
– They seem to facilitate the export of mRNA– They protect mRNA from hydrolytic enzymes– They help ribosomes attach to the 5¢ end of the
mRNA
© 2011 Pearson Education, Inc.
Protein-codingsegment
Polyadenylationsignal
5¢ 3¢
3¢5¢ 5¢Cap UTR Startcodon
G P P P
Stopcodon UTR
AAUAAA
Poly-A tail
AAA AAA…
mRNA in Eukaryotes
• Note that pre-mRNA (shown on subsequent slides)– does not contain the 5' Cap or Poly-A tail– does contain introns which are spliced out or
removed
• mRNA has the following form:
Split Genes and RNA Splicing
• Most eukaryotic genes and their RNA transcripts have long noncoding stretches of nucleotides that lie between coding regions
• These noncoding regions are called intervening sequences, or introns
• The other regions are called exons because they are eventually expressed, usually translated into amino acid sequences
• RNA splicing removes introns and joins exons, creating an mRNA molecule with a continuous coding sequence
© 2011 Pearson Education, Inc.
Figure 17.11
5¢ Exon Intron Exon5¢ Cap
Codon numbers 1-30 31-104
mRNA 5¢Cap
5¢
Intron Exon
3¢ UTR
Introns cut out andexons spliced together
3¢
105-146
Poly-A tail
Codingsegment
Poly-A tail
UTR1-146
Pre-mRNA following addition of 5’ Cap and Poly-A tail:
The Functional and Evolutionary Importance of Introns• Some introns contain sequences that may
regulate gene expression• Some genes can encode more than one kind of
polypeptide, depending on which segments are treated as exons during splicing
• This is called alternative RNA splicing– approximately 95% of genes with multiple exons
are alternatively spliced!– the number of different proteins an organism can
produce is much greater than its number of genes
© 2011 Pearson Education, Inc.
• Proteins often have a modular architecture consisting of discrete regions called domains
• In many cases, different exons code for the different domains in a protein
• Exon shuffling may result in the evolution of new proteins
© 2011 Pearson Education, Inc.
GeneDNA
Exon 1 Exon 2 Exon 3Intron Intron
Transcription
RNA processing
Translation
Domain 3
Domain 2
Domain 1
Polypeptide
Figure 17.13
Concept 17.5: Mutations of one or a few nucleotides can affect protein structure andfunction
• Mutations are changes in the genetic material of a cell or virus
• Point mutations are chemical changes in just one base pair of a gene
• The change of a single nucleotide in a DNA template strand can lead to the production of an abnormal protein
© 2011 Pearson Education, Inc.
Figure 17.23
Wild-type hemoglobin
Wild-type hemoglobin DNA3¢
3¢
3¢5¢
5¢ 3¢
3¢5¢
5¢5¢5¢3¢
mRNA
A AGC T T
A AGmRNA
Normal hemoglobinGlu
Sickle-cell hemoglobinVal
AA
AUG
GT
T
Sickle-cell hemoglobin
Mutant hemoglobin DNAC
Types of Small-Scale Mutations
• Point mutations within a gene can be divided into two general categories
– Nucleotide-pair substitutions– Nucleotide-pair insertions or deletions
© 2011 Pearson Education, Inc.
Nucleotide-Pair Substitutions
• A nucleotide-pair substitution replaces one nucleotide and its partner with another pair of nucleotides
• Silent mutations have no effect on the amino acid produced by a codon because of redundancy in the genetic code
• Missense mutations still code for an amino acid, but not the correct amino acid
• Nonsense mutations change an amino acid codon into a stop codon, nearly always leading to a nonfunctional protein
© 2011 Pearson Education, Inc.
Figure 17.24a
Wild typeDNA template strand
mRNA5¢
5¢
Protein
Amino endStopCarboxyl end
3¢3¢
3¢
5¢
Met Lys Phe Gly
A instead of G
(a) Nucleotide-pair substitution: silent
StopMet Lys Phe Gly
A
A
A AA A A A
A ATT T T T T
T T TTC C C C
C
C
G G G GG
G
A
A A A AG GGU U U U U
5¢3¢
3¢5¢A
A AA A A A
A ATT T T T T
T T TTC C C CG G G G
AA
A G A A A AG GGU U U U U
T
U 3¢5¢U instead of C
Figure 17.24b
Wild typeDNA template strand
mRNA5¢
5¢
Protein
Amino endStopCarboxyl end
3¢3¢
3¢
5¢
Met Lys Phe Gly
T instead of C
(a) Nucleotide-pair substitution: missense
StopMet Lys Phe Ser
A
A
A AA A A A
A ATT T T T T
T T TTC C C C
C
C
G G G GG
G
A
A A A AG GGU U U U U
5¢3¢
3¢5¢A
A AA A A A
A ATT T T T T
T T TTC C T CG
GGA
A G A A A AA GGU U U U U 3¢5¢
A C
C
GA instead of G
Figure 17.24c
Wild typeDNA template strand
mRNA5¢
5¢
ProteinAmino end
StopCarboxyl end
3¢3¢
3¢
5¢
Met Lys Phe Gly
A instead of T
(a) Nucleotide-pair substitution: nonsense
Met
A
A
A AA A A A
A ATT T T T T
T T TTC C C C
C
C
G G G GG
G
A
A A A AG GGU U U U U
5¢3¢
3¢5¢A
AA A A A
A ATT A T T T
T T TTC C CG
GGA
A G U A A AGGU U U U U 3¢5¢
C
C
G
T instead of CCGT
G
Stop
U instead of A
Insertions and Deletions
• Insertions and deletions are additions or losses of one or more nucleotide pairs in a gene
• These mutations often have a disastrous effect on the resulting protein more often than substitutions do
• Insertion or deletion of nucleotides may alter the reading frame, producing a frameshift mutation
© 2011 Pearson Education, Inc.
Figure 17.24d
Wild typeDNA template strand
mRNA5¢
5¢
ProteinAmino end
StopCarboxyl end
3¢3¢
3¢
5¢
Met Lys Phe Gly
A
A
A AA A A A
A ATT T T T T
T T TTC C C C
C
C
G G G GG
G
A
A A A AG GGU U U U U
(b) Nucleotide-pair insertion or deletion: frameshift causingimmediate nonsense
Extra A
5¢3¢
5¢
3¢
3¢
5¢
Met
1 nucleotide-pair insertionStop
A C A A GT T A TC T A C GT A T AT G T CT GG A T GA
A G U A U AU GAU G U U C
A TA
AGExtra U
Figure 17.24e
DNA template strand
mRNA5¢
5¢
ProteinAmino end
StopCarboxyl end
3¢3¢
3¢
5¢
Met Lys Phe Gly
A
A
A AA A A A
A ATT T T T T
T T TTC C C C
C
C
G G G GG
G
A
A A A AG GGU U U U U
(b) Nucleotide-pair insertion or deletion: frameshift causingextensive missense
Wild type
missing
missing
A
U
A A AT T TC C A T TC C GA AT T TG GA A ATCG G
A G A A GU U U C A AG G U 3¢
5¢3¢
3¢5¢
Met Lys Leu Ala
1 nucleotide-pair deletion
5¢
Figure 17.24f
DNA template strand
mRNA5¢
5¢
ProteinAmino end
StopCarboxyl end
3¢3¢
3¢
5¢
Met Lys Phe Gly
A
A
A AA A A A
A ATT T T T T
T T TTC C C C
C
C
G G G GG
G
A
A A A AG GGU U U U U
(b) Nucleotide-pair insertion or deletion: no frameshift, but oneamino acid missing
Wild type
AT C A A A A T TC C GT T C missing
missing
Stop
5¢3¢
3¢5¢
3¢5¢
Met Phe Gly
3 nucleotide-pair deletion
A GU C A AG GU U U U
T GA A AT T TT CG G
A A G
Polypeptide
RibosomeTrp
Phe Gly
tRNA withamino acidattached
Aminoacids
tRNA
Anticodon
Codons
U U U UG G G G C
A C CC
CG
A A A
CGC
G
5¢ 3¢mRNA
Translation• A cell translates a mRNA
message into protein with the help of transfer RNA (tRNA)
• tRNA transfer amino acids to the growing polypeptide in a ribosome
• Translation is a complex process in terms of its biochemistry and mechanics
Figure 17.15
Amino acidattachmentsite
3¢
5¢
Hydrogenbonds
Anticodon
(a) Two-dimensional structure (b) Three-dimensional structure(c) Symbol used
in this book
Anticodon Anticodon3¢ 5¢
Hydrogenbonds
Amino acidattachmentsite5¢
3¢
A A G
• Each carries a specific amino acid on one end• Each has an anticodon on the other end; the
anticodon base-pairs with a complementary codon on mRNA
tRNA structure
Ribosomes
• Ribosomes facilitate specific coupling of tRNA anticodons with mRNA codons in protein synthesis
• The two ribosomal subunits (large and small) are made of proteins and ribosomal RNA (rRNA)
• Bacterial and eukaryotic ribosomes are somewhat similar but have significant differences: some antibiotic drugs specifically target bacterial ribosomes without harming eukaryotic ribosomes
© 2011 Pearson Education, Inc.
tRNAmolecules
Growingpolypeptide Exit tunnel
E P A
Largesubunit
Smallsubunit
mRNA5¢ 3¢
(a) Computer model of functioning ribosome
Exit tunnel Amino end
A site (Aminoacyl-tRNA binding site)
Smallsubunit
Largesubunit
E P A mRNAE
P site (Peptidyl-tRNAbinding site)
mRNAbinding site
(b) Schematic model showing binding sites
E site (Exit site)
(c) Schematic model with mRNA and tRNA
5¢ Codons
3¢tRNA
Growing polypeptide
Next aminoacid to beadded topolypeptidechain
Figure 17.17
• A ribosome has three binding sites for tRNA– The P site holds the tRNA that carries the
growing polypeptide chain– The A site holds the tRNA that carries the next
amino acid to be added to the chain– The E site is the exit site, where discharged
tRNAs leave the ribosome• As with transcription, there are three steps to
translation– Initiation– Elongation– Termination
© 2011 Pearson Education, Inc.
InitiatortRNA
mRNA
5¢
5¢3¢Start codon
Smallribosomalsubunit
mRNA binding site
3¢
Translation initiation complex
5¢ 3¢3¢U
UA
A GC
P
P site
i+
GTP GDP
Met Met
Largeribosomalsubunit
E A
5¢
Initiation of Translation
Elongation of the Polypeptide Chain
• During the elongation stage, amino acids are added one by one to the preceding amino acid at the C-terminus of the growing chain
• Each addition involves proteins called elongation factors and occurs in three steps: codon recognition, peptide bond formation, and translocation
• Translation proceeds along the mRNA in a 5′ to 3′ direction
© 2011 Pearson Education, Inc.
Amino end ofpolypeptide
mRNA
5¢
E
Psite
Asite
3¢
Amino end ofpolypeptide
mRNA
5¢
E
Psite
Asite
3¢
E
GTP
GDP + P i
P A
Amino end ofpolypeptide
mRNA
5¢
E
Psite
Asite
3¢
E
GTP
GDP + P i
P A
E
P A
Amino end ofpolypeptide
mRNA
5¢
E
Asite
3¢
E
GTP
GDP + P i
P A
E
P A
GTP
GDP + P i
P A
E
Ribosome ready fornext aminoacyl tRNA
Psite
Termination of Translation
• Termination occurs when a stop codon in the mRNA reaches the A site of the ribosome
• The A site accepts a protein called a release factor
• The release factor causes the addition of a water molecule instead of an amino acid
• This reaction releases the polypeptide, and the translation assembly then comes apart
© 2011 Pearson Education, Inc.
Releasefactor
Stop codon(UAG, UAA, or UGA)
3¢
5¢
Termination of Translation
Releasefactor
Stop codon(UAG, UAA, or UGA)
3¢
5¢
3¢
5¢
Freepolypeptide
2 GTP
2 GDP + 2 iP
Releasefactor
Stop codon(UAG, UAA, or UGA)
3¢
5¢
3¢
5¢
Freepolypeptide
2 GTP
5¢
3¢
2 GDP + 2 iP
Completedpolypeptide
Incomingribosomalsubunits
Start ofmRNA(5¢ end)
End ofmRNA(3¢ end)(a)
Polyribosome
Ribosomes
mRNA
(b) 0.1 µm
GrowingpolypeptidesPolyribosomes
• A number of ribosomes can translate a single mRNA simultaneously, forming a polyribosome (or polysome)
• Polyribosomes enable a cell to make many copies of a polypeptide very quickly
Completing and Targeting the Functional Protein
• Often translation is not sufficient to make a functional protein
• Polypeptide chains are modified after translation or targeted to specific sites in the cell– Proteins targeted to specific sites in the cell contain
a signal sequence– What organelle functions as a "highway" within a
cell?
© 2011 Pearson Education, Inc.
TRANSCRIPTION DNA
RNApolymerase
ExonRNAtranscript
RNAPROCESSING
NUCLEUS
Intron
RNA transcript(pre-mRNA)
Poly-A
Poly-A
Aminoacyl-tRNA synthetase
AMINO ACIDACTIVATION
AminoacidtRNA
5¢Cap
Poly-A3¢
GrowingpolypeptidemRNA
Aminoacyl(charged)tRNA
Anticodon
Ribosomalsubunits
A
AETRANSLATION
5¢ Cap
CYTOPLASM
PE
Codon
Ribosome
5¢
3¢
What is a gene?• a region of DNA
that can be expressed to produce a final functional product, either • a polypeptide or • an RNA
molecule