Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP...

Page 1: Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.

Chapter 14 Genomes and Genomics

Page 2:

Sequencing DNA: dideoxy (Sanger) method

ddGTP ddATP ddTTP ddCTP

5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA T

Fred Sanger, Nobel Prize 1980
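
The fragment ladder on this slide can be reproduced with a short sketch. It assumes an idealized reaction in which every position of the newly synthesized strand is terminated once; the function names are illustrative, not part of any sequencing software.

```python
def sanger_ladder(strand):
    """Return chain-terminated fragments of `strand`, grouped by the
    dideoxynucleotide (ddNTP) that ended each fragment."""
    lanes = {"ddGTP": [], "ddATP": [], "ddTTP": [], "ddCTP": []}
    for end in range(1, len(strand) + 1):
        fragment = strand[:end]
        # The last base of a fragment is the ddNTP that stopped synthesis.
        lanes["dd" + fragment[-1] + "TP"].append(fragment)
    return lanes


def read_sequence(lanes):
    """Read the sequence from the shortest to the longest fragment,
    the way bands are read from the bottom of a gel upward."""
    by_length = sorted(
        (len(frag), name[2]) for name, frags in lanes.items() for frag in frags
    )
    return "".join(base for _, base in by_length)


lanes = sanger_ladder("TAATGTACG")
recovered = read_sequence(lanes)
print(lanes["ddGTP"])  # ['TAATG', 'TAATGTACG']
print(recovered)       # TAATGTACG
```

Sorting by fragment length is what electrophoresis does physically; the lane (or dye color) of each band gives the base at that position.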

Page 3:

Sequencing DNA: dideoxy (Sanger) method

Leroy Hood, Caltech – Fluorescence-based sequencing

Norm Dovichi – Capillary electrophoresis

Page 4:

Sequencing DNA: dideoxy (Sanger) method

Page 5:

Genomics era: High-throughput DNA sequencing

The first high-throughput genomics technology was automated DNA sequencing in the early 1990s.

In September 1999, Celera Genomics completed the sequencing of the Drosophila genome.

Baker’s yeast, Saccharomyces cerevisiae (about 12 million bp), was the first eukaryotic genome to be sequenced (1996).

TIGR (The Institute for Genomic Research) 1995 – first whole genome sequence of a free-living organism, Haemophilus influenzae

Page 6:

Genomics: Completed genomes as of 2002

Currently, the genomes of over 600 organisms have been sequenced.

This generates large amounts of information to be handled by individual computers.

http://www.genomesonline.org/

Page 7:

Cloning/libraries: BAC, YAC and ESTs

• BAC = bacterial artificial chromosome

– 150 kb, replicates in E. coli

• YAC = yeast artificial chromosome

– 150 kb to 1.5 Mb, replicates in yeast

Page 8:

Assembling contigs
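
As a rough illustration of contig assembly, the sketch below greedily merges the two reads with the longest suffix-to-prefix overlap until nothing overlaps any more. This is a simplified stand-in: production assemblers handle sequencing errors, reverse complements, and repeats, none of which are modeled here, and the reads and the `min_len` cutoff are made-up values.

```python
def overlap(a, b, min_len=3):
    """Length of the longest suffix of `a` that equals a prefix of `b`."""
    for size in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:size]):
            return size
    return 0


def assemble(reads, min_len=3):
    """Greedily merge the pair of reads with the largest overlap until
    no overlap of at least `min_len` bases remains."""
    contigs = list(reads)
    while len(contigs) > 1:
        best = (0, None, None)
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    size = overlap(a, b, min_len)
                    if size > best[0]:
                        best = (size, i, j)
        size, i, j = best
        if size == 0:
            break  # remaining contigs do not overlap
        merged = contigs[i] + contigs[j][size:]
        contigs = [c for k, c in enumerate(contigs) if k not in (i, j)]
        contigs.append(merged)
    return contigs


reads = ["TAATGTA", "GTACGGA", "CGGATTC"]  # hypothetical reads
print(assemble(reads))  # ['TAATGTACGGATTC']
```

Reads that share no overlap are returned as separate contigs, which mirrors the gaps left between contigs in a real assembly.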

Page 9:

Ordered-clone Sequencing

Clones ordered by restriction enzyme sites
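
One way to picture ordering clones by restriction sites is to digest each clone in silico and look for fragment sizes that two clones have in common. The sketch below assumes EcoRI (which cuts G^AATTC) and two invented clone sequences; it illustrates the idea and is not a full fingerprinting pipeline.

```python
from collections import Counter


def digest(clone, site="GAATTC", cut_offset=1):
    """Fragment lengths after cutting `clone` at every occurrence of `site`.

    The defaults model EcoRI, which cuts one base into its GAATTC site.
    """
    fragments = []
    start = 0
    pos = clone.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(cut - start)
        start = cut
        pos = clone.find(site, pos + 1)
    fragments.append(len(clone) - start)
    return fragments


def shared_fragments(clone_a, clone_b, site="GAATTC"):
    """Fragment lengths (with multiplicity) present in both digests."""
    common = Counter(digest(clone_a, site)) & Counter(digest(clone_b, site))
    return sorted(common.elements())


# Hypothetical clones taken from overlapping stretches of the same region.
clone_a = "AAGAATTCCCCGAATTCTTTTTGAATTC"
clone_b = "CCCGAATTCTTTTTGAATTCAA"
print(digest(clone_a))                     # [3, 9, 11, 5]
print(digest(clone_b))                     # [4, 11, 7]
print(shared_fragments(clone_a, clone_b))  # [11]
```

The shared 11-base fragment lies between two sites that both clones contain, which is the evidence that they overlap. End fragments depend on where each clone happens to begin and end, so a match between them can be coincidental.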

Page 10:

Annotation

• ORF – open reading frame

• EST – expressed sequence tag, based on mRNA

• Comparative genomics
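
A minimal sketch of ORF detection, the first annotation step listed above: scan the three forward reading frames for an ATG followed in frame by a stop codon. The minimum length and the example sequence are arbitrary assumptions, and real gene finders also scan the reverse strand and account for introns.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna, min_codons=3):
    """Return (start, end, sequence) for each ATG...stop stretch in the
    three forward reading frames. `end` is exclusive, and both the
    sequence and `min_codons` count the stop codon."""
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for pos in range(frame, len(dna) - 2, 3):
            codon = dna[pos:pos + 3]
            if start is None and codon == "ATG":
                start = pos
            elif start is not None and codon in STOP_CODONS:
                if (pos + 3 - start) // 3 >= min_codons:
                    orfs.append((start, pos + 3, dna[start:pos + 3]))
                start = None
    return orfs


print(find_orfs("CCATGGCTTGGTAAGC"))  # [(2, 14, 'ATGGCTTGGTAA')]
```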

Page 11:

The trend of data growth

[Figure: nucleotides (billions, axis 0 to 8) plotted against years, 1980 to 2000]

The 21st century is a century of biotechnology:

Microarray: Global expression analysis: RNA levels of every gene in the genome analyzed in parallel.

Proteomics: Global protein analysis generates large mass spectra libraries.

Metabolomics: Global metabolite analysis: 25,000 secondary metabolites characterized.

Genomics: New sequence information is being produced at increasing rates. (The contents of GenBank double every year.)

Glycomics: Global sugar metabolism analysis.
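
To get a feel for what "doubling every year" means for GenBank, here is a small arithmetic sketch. The starting size of 1.0 is an arbitrary unit, not a measured figure, and the one-year doubling time is simply the slide's claim taken at face value.

```python
def projected_size(current_size, years, doubling_time_years=1.0):
    """Size after `years` if the total doubles every `doubling_time_years`."""
    return current_size * 2 ** (years / doubling_time_years)


for year in range(6):
    print(year, projected_size(1.0, year))
```

Ten such doublings multiply the starting size by 1024, which is why storage and search tools have to scale along with the sequencing.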

Page 12:

How to handle the large amount of information?

(Cartoon: Drew Sheneman, New Jersey, The Newark Star Ledger)

Answer: bioinformatics and the Internet

Page 13:

Bioinformatics history

IBM 7090 computer

In the 1960s: the birth of bioinformatics

Margaret Oakley Dayhoff created:

• The first protein database

• The first program for sequence assembly

There is a need for computers and algorithms that allow: Access, processing, storing, sharing, retrieving, visualizing, annotating…

Page 14:

DNA (nucleotide sequences) databases

The major nucleotide databases are large, and searching any one of them should produce similar results because they exchange information routinely.

- GenBank (NCBI): www.ncbi.nlm.nih.gov

- Arabidopsis (TAIR): www.arabidopsis.org

Specialized databases: tissues, species…

- ESTs (Expressed Sequence Tags)

~ at NCBI

~ at TIGR

- ...many more!

Page 15:

Comparative genomics

BLAST – Basic Local Alignment Search Tool (http://www.ncbi.nlm.nih.gov/)

Homologs, orthologs, paralogs
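
The "local alignment" in BLAST's name can be illustrated with the Smith-Waterman algorithm, which finds the best-scoring matching region between two sequences. Note that this is not how BLAST itself works internally: BLAST uses a faster heuristic that approximates this kind of score. The scoring values below are arbitrary assumptions.

```python
def local_alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between `a` and `b` (Smith-Waterman)."""
    previous = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        current = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            step = match if a[i - 1] == b[j - 1] else mismatch
            current[j] = max(
                0,                      # start a new local alignment
                previous[j - 1] + step,  # align a[i-1] with b[j-1]
                previous[j] + gap,       # gap in b
                current[j - 1] + gap,    # gap in a
            )
            best = max(best, current[j])
        previous = current
    return best


# The two sequences share the stretch TGTAC: 5 matches x 2 = 10.
print(local_alignment_score("TAATGTACG", "GGTGTACTT"))  # 10
```

A high local score between genes from two species is the kind of evidence used to propose that they are homologs; deciding whether they are orthologs or paralogs takes further analysis.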

Page 16:

Question

You are a researcher who has tentatively identified a human homolog of a yeast gene. You determine the DNA sequence of cDNAs of both your yeast gene and the human gene and decide to compare the gene sequences, as well as the predicted protein sequence of each, using alignment software. You would expect the greatest sequence identity from comparisons of the:

a. cDNA sequences

b. Protein sequences

c. Genomic DNA sequences

d. Both (a) and (b) will give you equivalent sequence similarity

e. All will give equivalent sequence similarity
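
One consideration behind this question is that the genetic code is degenerate: several codons can encode the same amino acid. The sketch below compares two invented coding sequences that differ only at synonymous third positions. The sequences are toy examples, not real yeast or human genes, and real homologs also differ at positions that change the protein.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cdna):
    """Translate a coding sequence codon by codon."""
    return "".join(CODON_TABLE[cdna[i:i + 3]] for i in range(0, len(cdna) - 2, 3))


def percent_identity(a, b):
    """Percent of matching positions between two equal-length sequences."""
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# Hypothetical toy sequences, differing only at synonymous positions.
yeast_cdna = "ATGGCTCTTTCAAAA"
human_cdna = "ATGGCCCTGTCCAAG"

print(translate(yeast_cdna), translate(human_cdna))  # MALSK MALSK
print(percent_identity(yeast_cdna, human_cdna))
print(percent_identity(translate(yeast_cdna), translate(human_cdna)))  # 100.0
```

In this toy case 11 of 15 nucleotides match while the two predicted proteins are identical.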

Page 17:

What is a microarray?

Page 18:

Types of Arrays

• Expression Arrays

– cDNA

– Genome

• Affymetrix (GeneChip®)

• Agilent

• Tiling arrays

Page 19:

Overview of Microarrays

Page 20:

Transcription Profiling of a mutant

WT

mutant

Page 21:

A “good” microarray plate

Red = only in treatment

Green = only in normal

Yellow = found in both

Black = found in neither
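
The color key above amounts to a simple decision rule on the two fluorescence channels. The sketch below assumes red labels the treatment sample and green the normal sample, as on this slide; the intensity threshold that separates signal from background is an invented value.

```python
def spot_color(red, green, threshold=100.0):
    """Classify a two-color microarray spot from its channel intensities.

    `red` is the treatment channel and `green` the normal channel. The
    threshold separating signal from background is an assumed value.
    """
    red_on = red >= threshold
    green_on = green >= threshold
    if red_on and green_on:
        return "yellow"  # found in both
    if red_on:
        return "red"     # only in treatment
    if green_on:
        return "green"   # only in normal
    return "black"       # found in neither


for red, green in [(500, 20), (20, 500), (500, 500), (20, 20)]:
    print(red, green, spot_color(red, green))
```

Real spots are rarely all-or-nothing, so analysis software usually works with the ratio of the two intensities rather than a hard on/off call.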

Page 22:

Results

Hundreds of genes identified:

those turned on, those turned off

Page 23:

Expression map

red = up regulated

green = down regulated
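
Expression maps of this kind are commonly drawn from log2 ratios of treatment to control signal. The sketch below shows that calculation; the twofold cutoff, the intensity floor, and the gene names are assumptions chosen for illustration.

```python
import math


def log2_ratio(treatment, control, floor=1.0):
    """log2(treatment / control), with a floor to avoid dividing by zero."""
    return math.log2(max(treatment, floor) / max(control, floor))


def call_regulation(ratio, cutoff=1.0):
    """Label a log2 ratio; a cutoff of 1.0 corresponds to a twofold change."""
    if ratio >= cutoff:
        return "up"         # drawn red on the expression map
    if ratio <= -cutoff:
        return "down"       # drawn green on the expression map
    return "unchanged"


# Hypothetical (treatment, control) intensities for three genes.
genes = {"geneA": (800, 100), "geneB": (100, 400), "geneC": (210, 200)}
for name, (treatment, control) in genes.items():
    ratio = log2_ratio(treatment, control)
    print(name, round(ratio, 2), call_regulation(ratio))
```

Using a logarithm makes a twofold increase (+1) and a twofold decrease (-1) symmetric around zero, which is why the color scale is centered there.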

Page 24:

Question

Microarray technology directly involves:

a. PCR

b. DNA sequencing

c. Hybridization

d. RFLP detection

e. None of the above

Page 25:

Protein-protein interactions

• ChIP (chromatin immunoprecipitation)

• Yeast two hybrid

• Bimolecular Fluorescence Complementation (BiFC)

Page 26:

ChIP and ChIP-chip

Page 27:

Yeast two hybrid

Page 28:

Citovsky et al., 2006

Bimolecular Fluorescence Complementation (BiFC)

Page 29:

Reverse genetics

• Gene knockouts

• RNAi

• Overexpression

• Altered expression

Page 30:

Summary

• DNA Sequencing and the rise of genomics

• Annotation of genome sequence

– Comparative genomics

– Functional genomics

• Protein-protein interactions

• ESTs

• Reverse genetics