Using the genome Studying expression of all genes simultaneously 1.Microarrays: “reverse...

Post on 02-Jan-2016

220 views 1 download

Tags:

Transcript of Using the genome Studying expression of all genes simultaneously 1.Microarrays: “reverse...

Using the genomeStudying expression of all genes simultaneously1.Microarrays: “reverse Northerns”2.High-throughput sequencing3. Bisulfite sequencing to detect C methylation

Using the genomeBisulfite sequencing to detect C methylationChIP-chip or ChIP-seq to detect chromatin modifications: 17 mods are associated with active genes in CD-4 T cells

Generating the histone codeHistone acetyltransferases add acetic acidDeacetylases “reset” by removing the acetate

Generating the histone codeCDK8 kinases histones to repress transcriptionAppears to interact with mediator to block transcriptionPhosphorylation of Histone H3 correlates with activation of heat shock genes!Phosphatases reset the genes

Generating the histone codeRad6 proteins ubiquitinate histone H2B to repress transcriptionPolycomb proteins ubiquitinate histone H2A to silence genes

Generating the histone codeRad6 proteins ubiquitinate histone H2B to repress transcriptionPolycomb proteins ubiquitinate histone H2A to silence genesA TFTC/STAGA module mediates histone H2A and H2B deubiquitination, coactivates nuclear receptors, and counteracts heterochromatin silencing

Generating the histone codeMany proteins methylate histones: highly regulated!

Generating the histone codeMany proteins methylate histones: highly regulated!Methylation status determines gene activity

Generating the histone codeMany proteins methylate histones: highly regulated!Methylation status determines gene activityMutants (eg Curly leaf) are unhappy!

Generating the histone codeMany proteins methylate histones: highly regulated!Methylation status determines gene activityMutants (eg Curly leaf) are unhappy!Chromodomain protein HP1 can tell the difference between H3K9me2 (yellow)& H3K9me3 (red)

Generating the histone codeChromodomain protein HP1 can tell the difference between H3K9me2 (yellow) & H3K9me3 (red)Histone demethylases have been recently discovered

Generating methylated DNASi RNA are key: RNA Pol IV generates antisense or foldback RNA, often from TE

Generating methylated DNASi RNA are key: RNA Pol IV generates antisense or foldback RNA, often from TERDR2 makes it DS, 24 nt siRNA are generated by DCL3

Generating methylated DNARDR2 makes it DS, 24 nt siRNA are generated by DCL3AGO4 binds siRNA, complex binds target & Pol V

Generating methylated DNARDR2 makes it DS, 24 nt siRNA are generated by DCL3AGO4 binds siRNA, complex binds target & Pol VPol V makes intergenic RNA, associates with AGO4-siRNA to recruit “silencing Complex” to target site

Generating methylated DNARDR2 makes it DS, 24 nt siRNA are generated by DCL3AGO4 binds siRNA, complex binds target & Pol VPol V makes intergenic RNA, associates with AGO4-siRNA to recruit “silencing Complex” to target siteAmplifies signal!extends meth-ylated region

Using the genomeMany sites provide gene expression data online• NIH Gene expression omnibus

http://www.ncbi.nlm.nih.gov/geo/ provides access to many different types of gene expression data

Using the genomeMany sites provide gene expression data online• NIH Gene expression omnibus

http://www.ncbi.nlm.nih.gov/geo/ provides access to many different types of gene expression data

•Many different sites provide “digital Northerns” or other comparative analyses of gene expression• http://cgap.nci.nih.gov/SAGE• http://www.weigelworld.org/research/projects/

geneexpressionatlas

Using the genomeMany sites provide gene expression data online• NIH Gene expression omnibus

http://www.ncbi.nlm.nih.gov/geo/ provides access to many different types of gene expression data

•Many different sites provide “digital Northerns” or other comparative analyses of gene expression• http://cgap.nci.nih.gov/SAGE• http://www.weigelworld.org/research/projects/

geneexpressionatlas• MPSS (massively-parallel signature sequencing)

http://mpss.udel.edu/

Using the genomeMany sites provide gene expression data online• NIH Gene expression omnibus

http://www.ncbi.nlm.nih.gov/geo/ provides access to many different types of gene expression data

•Many different sites provide “digital Northerns” or other comparative analyses of gene expression• http://cgap.nci.nih.gov/SAGE• http://www.weigelworld.org/research/projects/

geneexpressionatlas• MPSS (massively-parallel signature sequencing)

http://mpss.udel.edu/• Use it to decide which tissues to extract our RNA from

Using the genomeMany sites provide gene expression data onlineMany sites provide other kinds of genomic data online• http://encodeproject.org/ENCODE/

Post-transcriptional regulationNearly ½ of human genome is transcribed, only 1% is coding• 98% of RNA made is non-coding

Post-transcriptional regulationNearly ½ of human genome is transcribed, only 1% is coding• 98% of RNA made is non-coding•Fraction increases with organism’s complexity

Known NcRNAs classes and functions

Implication in diseases

Implication in diseases

Transcription in Eukaryotes3 RNA polymerasesall are multi-subunit complexes 5 in common 3 very similar variable # unique onesPlants also have Pols IV & V •make siRNA

Transcription in EukaryotesRNA polymerase I: 13 subunits (5 + 3 + 5 unique) acts exclusively in nucleolus to make 45S-rRNA precursor

Transcription in EukaryotesPol I: acts exclusively in nucleolus to make 45S-rRNA precursor•accounts for 50% of total RNA synthesis

Transcription in EukaryotesPol I: acts exclusively in nucleolus to make 45S-rRNA precursor• accounts for 50% of total RNA synthesis• insensitive to -aminitin

Transcription in EukaryotesPol I: only makes 45S-rRNA precursor• 50 % of total RNA synthesis• insensitive to -aminitin•Mg2+ cofactor•Regulated @ initiation frequency

Processing rRNA1) ~ 100 bases are methylated• C/D box snoRNA pick sites• One for each!

Processing rRNA1) ~ 100 bases are methylated• C/D box snoRNA pick sites• One for each!2) ~ 100 Us are changed to PseudoU• H/ACA box snoRNA pick sites• One for each!

Processing rRNA1) ~ 100 bases are methylated• C/D box snoRNA pick sites2) ~ 100 Us are changed to PseudoU• H/ACA box snoRNA pick sites3) Some snoRNA direct modification

of tRNA and snRNA

Processing rRNA1) ~ 200 bases are modified2) 45S pre-rRNA is cut into 28S, 18S and 5.8S products by

ribozymes• RNase MRP cuts between 18S & 5.8S• U3, U8, U14, U22, snR10 and snR30 also guide cleavage

Processing rRNA1) ~ 200 bases are methylated2) 45S pre-rRNA is cut into 28S, 18S and 5.8S products3) Ribosomes are assembled w/in

nucleolus

RNA Polymerase III makes ribosomal 5S and tRNA (+ some snRNA, scRNA, etc)>100 different kinds of ncRNA ~10% of all RNA synthesisCofactor = Mn2+ cf Mg2+

sensitive to high [-aminitin]

Processing tRNA

1) tRNA is trimmed

• 5’ end by RNAse P

(1 RNA, 10 proteins)

Processing tRNA

1) tRNA is trimmed

2) Transcript is spliced

• Some tRNAs are

assembled from 2 transcripts

Processing tRNA

1) tRNA is trimmed

2) Transcript is spliced

3) CCA is added to 3’ end

• By tRNA nucleotidyl

transferase (no template)tRNA +CTP -> tRNA-C + PPi

tRNA-C +CTP--> tRNA-C-C + PPitRNA-C-C +ATP -> tRNA-C-C-A + PPi

Processing tRNA

1) tRNA is trimmed

2) Transcript is spliced

3) CCA is added to 3’ end

4) Many bases are modified

• Protects tRNA

• Tweaks protein synthesis

Processing tRNA

1) tRNA is trimmed

2) Transcript is spliced

3) CCA is added to 3’ end

4) Many bases are modified

5) No cap! -> 5’ P

(due to 5’ RNAse P cut)

Splicing: the spliceosome cycle

1) U1 snRNP (RNA/protein complex) binds 5’ splice site

Splicing:The spliceosome cycle1) U1 snRNP binds 5’ splice site2) U2 snRNP binds “branchpoint”

-> displaces A at branchpoint

Splicing:The spliceosome cycle1) U1 snRNP binds 5’ splice site2) U2 snRNP binds “branchpoint”

-> displaces A at branchpoint3) U4/U5/U6 complex binds intron

displace U1spliceosome has now assembled

Splicing:RNA is cut at 5’ splice sitecut end is trans-esterified to branchpoint A

Splicing:5) RNA is cut at 3’ splice site6) 5’ end of exon 2 is ligated to 3’ end of exon 17) everything disassembles -> “lariat intron” is degraded

Splicing:The spliceosome cycle

Splicing:

Some RNAs can self-splice!

role of snRNPs is to increase rate!

Why splice?

Splicing:

Why splice?

1) Generate diversity

exons often encode protein domains

Splicing:Why splice?

1) Generate diversityexons often encode protein domainsIntrons = larger target for insertions, recombination

Why splice?

1) Generate diversity

>94% of human genes show alternate splicing

Why splice?

1) Generate diversity

>94% of human genes show alternate splicing

same gene encodes

different protein

in different tissues

Why splice?

1) Generate diversity

>94% of human genes show alternate splicing

same gene encodes

different protein

in different tissues

Stressed plants use

AS to make variant

stress-response proteins

Why splice?

1) Generate diversity

>94% of human genes show alternate splicing

same gene encodes

different protein

in different tissues

Stressed plants use

AS to make variant

Stress-response

proteins

Splice-regulator

proteins control AS:

regulated by cell-specific

expression and phosphorylation

Why splice?

1)Generate diversity

Trabzuni D, et al (2013)Nat Commun. 22:2771.

•Found 448 genes that were expressed differently by gender in human brains (2.6% of all genes expressed in the CNS).

•All major brain regions showed some gender variation, and 85% of these variations were due to RNA splicing differences

Why splice?

1)Generate diversity

Wilson LOW, Spriggs A, Taylor JM, Fahrer AM. (2014). A novel splicing outcome reveals more than 2000 new mammalian protein isoforms. Bioinformatics 30: 151-156

Splicing created a frameshift, so was annotated as “nonsense-mediated decay”

an alternate start codon rescued the protein, which was expressed

Why splice?

Splicing created a frameshift, so was annotated as “nonsense-mediated decay”

an alternate start codon rescued the protein, which was expressed

Found 1849 human & 733 mouse mRNA that could encode alternate protein isoforms the same way

So far 64 have been validated by mass spec

Regulatory ncRNA1. SiRNA direct DNA-methylation via RNA-dependent

DNA-methyltansferase2. In other cases direct RNA degradation

mRNA degradation• lifespan varies 100x• Sometimes due to AU-rich 3' UTR sequences • Defective mRNA may be targetedby NMD, NSD, NGD

Other RNA are targeted by small interfering RNA

Other mRNA are targeted by small interfering RNA• defense against RNA viruses• DICERs cut dsRNA into 21-28 bp

Other mRNA are targeted by small interfering RNA• defense against RNA viruses• DICERs cut dsRNA into 21-28 bp• helicase melts dsRNA

Other mRNA are targeted by small interfering RNA• defense against RNA viruses• DICERs cut dsRNA into 21-28 bp• helicase melts dsRNA• - RNA binds RISC

Other mRNA are targeted by small interfering RNA• defense against RNA viruses• DICERs cut dsRNA into 21-28 bp• helicase melts dsRNA• - RNA binds RISC• complex binds target

Other mRNA are targeted by small interfering RNA• defense against RNA viruses• DICERs cut dsRNA into 21-28 bp• helicase melts dsRNA• - RNA binds RISC• complex binds target• target is cut

Small RNA regulation

• siRNA: target RNA viruses (& transgenes)

•miRNA: arrest translation of targets

• created by digestion of foldback

Pol II RNA with mismatch loop

Small RNA regulation

• siRNA: target RNA viruses (& transgenes)

•miRNA: arrest translation of targets

• created by digestion of foldback

Pol II RNA with mismatch loop

•Mismatch is key difference:

generated by different Dicer

Small RNA regulation

• siRNA: target RNA viruses (& transgenes)

•miRNA: arrest translation of targets

• created by digestion of foldback

Pol II RNA with mismatch loop

•Mismatch is key difference:

generated by different Dicer

•Arrest translation in animals,

target degradation in plants

small interfering RNA mark specifictargets•once cut they are removed by endonuclease-mediated decay

Most RNA degradation occurs in P bodies• recently identified cytoplasmic sites where exosomes & XRN1 accumulate when cells are stressed

Most RNA degradation occurs in P bodies• recently identified cytoplasmic sites where exosomes & XRN1 accumulate when cells are stressed •Also where AGO & miRNAs accumulate

Most RNA degradation occurs in P bodies• recently identified cytoplasmic sites where exosomes & XRN1 accumulate when cells are stressed •Also where AGO & miRNAs accumulate•w/o miRNA P bodies dissolve!

Thousands of antisense transcripts in plants1. Overlapping genes

Thousands of antisense transcripts in plants1. Overlapping genes2. Non-coding RNAs

Thousands of antisense transcripts in plants1. Overlapping genes2. Non-coding RNAs3. cDNA pairs

Thousands of antisense transcripts in plants1. Overlapping genes2. Non-coding RNAs3. cDNA pairs4. MPSS

Thousands of antisense transcripts in plants1. Overlapping genes2. Non-coding RNAs3. cDNA pairs4. MPSS5. TARs

Thousands of antisense transcripts in plants

Hypotheses

1. Accident: transcription unveils “cryptic promoters” on opposite strand (Zilberman et al)

Hypotheses

1. Accident: transcription unveils “cryptic promoters” on opposite strand (Zilberman et al)

2. Functional

a. siRNA

b. miRNA

c. Silencing

Hypotheses

1. Accident: transcription unveils “cryptic promoters” on opposite strand (Zilberman et al)

2. Functional

a. siRNA

b. miRNA

c. Silencing

d. Priming: chromatin remodeling requires transcription!