Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by:...

26
Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong

Transcript of Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by:...

Page 1: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Environmental Genome Shotgun Sequencing of the

Sargasso Sea

J. Craig Venter et al.

Presented by: Hormuzdiyar Dasenbrock and Ryan Wong

Page 2: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Environmental Genome Shotgun Sequencing of the

Sargasso Sea J. Craig Venter et al.

Presented by: Hormuzdiyar Dasenbrock and Ryan Wong

What is bacteriorhodopsin (aka proteorhodopsin)!?!?

Page 3: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

J. Craig Venter

• 1992 TIGR

• 1995 H. Influenza

• 1998 Celera

• 2001 Human Genome

http://i.timeinc.net/time/magazine/archive/covers/2000/1101000703_400.jpg

Page 4: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Continuing the Quest…

http://www.bbsr.edu/About_BBSR/Virtual_Tour/vt_wbii/wbsea.jpg

Page 5: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Working Hard!

Page 6: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Fig. 1. MODIS-Aqua satellite image of ocean chlorophyll in the Sargasso Sea grid about the BATS site from 22 February 2003. The station locations are overlain with their respective identifications.

Venter et al. Science. 2004 Apr 2;304(5667):66-74. Epub 2004 Mar 04

Sargasso Sea Sites

Page 7: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Sampling, Sequencing, and Assembly

• Focus on microbial organisms

• Shotgun Sequencing

• Assembly through computation

Page 8: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Whole Genome Shotgun Sequencing

http://occawlonline.pearsoned.com/bookbind/pubbooks/bc_mcampbell_genomics_1/medialib/method/shotgun.html

Page 9: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Whole

Figure 3. Anatomy of whole-genome assembly. Overlapping shredded bactig fragments (red lines) and internally derived reads from five different individuals (black lines) are combined to produce a contig and a consensus sequence (green line). Contigs are connected into scaffolds (red) by using mate pair information. Scaffolds are then mapped to the genome (gray line) with STS (blue star) physical map information.

Venter et al. Science. 2001 Feb 16;291(5507):1304-51.

Whole Genome Shotgun Sequencing

Page 10: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Sargasso Sea Data

Table S2. Summary statistics on the sequence and assembly data.

Samples 1-4 Samples 5-7• Sequence reads 1,662,328 325,608

sequence (Mbp) 1,361 265

• Contigs 121,477 N/Asequence (Mbp) 256.0 N/A

• Scaffolds 64,398 N/Aspan (Mbp) 400.0 N/A

• Mini-scaffolds 217,015 153,458span (Mbp) 820.7 518.4sequence (Mbp) 353.3 250.5

• Singletons 215,038 18,692sequence (Mbp) 169.9 15.0

• Total nonredundantsequence (Mbp) 779.2 265.5

• Scaffolds>3x 333 N/A• Span>3x (Mbp) 30.9 N/A

• Annotated genes 1,001,987 212,220• 16S rRNA genes 1,164 248• rhodopsin homologs 650 132

Venter et al. Science. 2004 Apr 2;304(5667):66-74. Epub 2004 Mar 04

Page 11: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Interpretation of DNA sequences

• Compare to published sequences

• Identify possible genes

Page 12: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Shewanella

Figure S3. Gene conservation among closely related Shewanella species. The outermost concentric circle of the above diagram depicts the competed genomic sequence of Shewanella oneidensis MR1 []. Fragments from environmental sequencing were compared to this completed Shewanella genome and are shown in the inner concentric circles and were given boxed outlines. Genes for the outermost circle have been assigned pseudo-spectrum colors based on the position of those genes along the chromosome, where genes nearer to the start of the genome are colored in red, genes neared to the end of the genome are colored in blue. Fragments from environmental sequencing were subjected to an analysis that identifies conserved gene order between those fragments and the completed Shewanella genome. Genes on the environmental genome segments that exhibited conserved gene order are colored with the same color assignments as the Shewanella MR1 chromosome. Colored regions on the environmental segments xhibiting color differences from the adjacent outermost concentric ircle are the result of conserved gene order with other MR1 regions and probably represent chromosomal rearrangments. Genes that did not exhibit conserved gene order are colored in black.

Venter et al. Science. 2004 Apr 2;304(5667):66-74. Epub 2004 Mar 04

Page 13: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Prochlorococcus

• Fig. 2. Gene conservation among closely related Prochlorococcus. The outermost concentric circle of the diagram depicts the competed genomic sequence of Prochlorococcus marinus MED4 (11). Fragments from environmental sequencing were compared to this completed Prochlorococcus genome and are shown in the inner concentric circles and were given boxed outlines. Genes for the outermost circle have been assigned psuedospectrum colors based on the position of those genes along the chromosome, where genes nearer to the start of the genome are colored in red, and genes nearer to the end of the genome are colored in blue. Fragments from environmental sequencing were subjected to an analysis that identifies conserved gene order between those fragments and the completed Prochlorococcus MED4 genome. Genes on the environmental genome segments that exhibited conserved gene order are colored with the same color assignments as the Prochlorococcus MED4 chromosome. Colored regions on the environmental segments exhibiting color differences from the adjacent outermost concentric circle are the result of conserved gene order with other MED4 regions and probably represent chromosomal rearrangements. Genes that did not exhibit conserved gene order are colored in black.

Venter et al. Science. 2004 Apr 2;304(5667):66-74. Epub 2004 Mar 04

Page 14: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Crenarchaeal 4B7

Fig. 3. Comparison of Sargasso Sea scaffolds to Crenarchaeal clone 4B7. Predicted proteins from 4B7 and the scaffolds showing significant homology to 4B7 by tBLASTx are arrayed in positional order along the x and y axes. Colored boxes represent BLASTp matches scoring at least 25% similarity and with an e value of better than 1e-5. Black vertical and horizontal lines delineate scaffold borders.

Venter et al. Science. 2004 Apr 2;304(5667):66-74. Epub 2004 Mar 04

Page 15: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Genes FoundTIGR role category   Total

genes  

Amino acid biosynthesis   37,118  

Biosynthesis of cofactors, prosthetic groups, and carriers   25,905  

Cell envelope 27,883  

Cellular processes   17,260  

Central intermediary metabolism   13,639  

DNA metabolism 25,346  

Energy metabolism   69,718  

Fatty acid and phospholipid metabolism 18,558  

Mobile and extrachromosomal element functions 1,061  

Protein fate  8,768  

Protein synthesis 48,012  

Purines, pyrimidines, nucleosides, and nucleotides 19,912  

Regulatory functions  8,392  

Signal transduction 4,817  

Transcription  12,756  

Transport and binding proteins 49,185  

Unknown function   38,067  

Miscellaneous  1,864  

Conserved hypothetical  794,061  

Total number of roles assigned  1,242,230  

Total number of genes   1,214,207  

Table 1. Gene count breakdown by TIGR role category. Gene set includes those found on assemblies from samples 1 to 4 and fragment reads from samples 5 to 7. A more detailed table, separating Weatherbird II samples from the Sorcerer II samples is presented in the SOM (table S4). Note that there are 28,023 genes which were classified in more than one role category.

Venter et al. Science. 2004 Apr 2;304(5667):66-74. Epub 2004 Mar 04

Page 16: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Applications: Community Ecology

• Species Diversity and Richness

• PCR Approaches or Genomic Sequencing?

• 1412 rRNA Genes and 148 New Phylotypes

Page 17: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Phylogenetic Comparisons

Rec A

Venter et al. Science. 2004 Apr 2;304(5667):66-74. Epub 2004 Mar 04

Page 18: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Major Phylogenetic Groups and Abundance

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Venter et al. Science. 2004 Apr 2;304(5667):66-74. Epub 2004 Mar 04

Page 19: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Variability in Species Abundance

• Stoichiometry

• Shewanella and Burkholderia

• Archaea clone 4B7

Page 20: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Plasmids

• 10 Plasmids with Transfer Genes

• Metal Resistance

• UmuCD DNA Pol

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Venter et al. Science. 2004 Apr 2;304(5667):66-74. Epub 2004 Mar 04

Page 21: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Other Filtered Organisms

• Eukaryotic– 69 18S rRNA Genes– Low Coverage

• Phages– 71 Scaffolds with Phage Genes

Page 22: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Photobiology of the Sargasso Sea• Photosynthesis: Rubis

CO genes• Light-Driven Proton

Pump Proteorhodopsin

Beja O. et al. Science. 2000 Sep 15;289(5486):1902-6.

Page 23: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Distribution of Rhodopsins

• 13 Subfamilies • CFB Scaffold• Energy

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Venter et al. Science. 2004 Apr 2;304(5667):66-74. Epub 2004 Mar 04

Page 24: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Conclusions

• New way to characterize ecosystems and evaluate species diversity

• Identification of novel genes

• Phylogenetic Trees

Page 25: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Future Directions

• Sorcerer II Expedition• Return to Sargasso Sea• Eukaryotic and Viral Analysis• Evolutionary Analysis• Energy Sources?• Synthetic Genomes

Page 26: Environmental Genome Shotgun Sequencing of the Sargasso Sea J. Craig Venter et al. Presented by: Hormuzdiyar Dasenbrock and Ryan Wong.

Criticisms

• Limitations of Shotgun Sequencing realized?

• Only 2 Genomes Sequenced

• Complimentary Studies

• Cost and Availability