The UCSC Genome Browser

The UCSC Genome Browser: From Men to Mice. WJ Kent, C Sugnet, T Furey, T Pringle, M Schwartz, R Baertsch, R Weber, K Roskin, D Thomas, S Rogic, M Diekhans, F Hsu, D Karolchik, D Haussler

Transcript of The UCSC Genome Browser

Page 1: The UCSC Genome Browser

The UCSC Genome Browser: From Men to Mice

WJ Kent, C Sugnet, T Furey, T Pringle, M Schwartz, R Baertsch, R Weber, K Roskin, D Thomas, S Rogic, M Diekhans, F Hsu, D Karolchik, D Haussler

Page 2: The UCSC Genome Browser
Page 3: The UCSC Genome Browser

Cardiac Troponin T2

Page 4: The UCSC Genome Browser

Comparative Genomics at BMP10

Page 5: The UCSC Genome Browser

Normalized eScores

Page 6: The UCSC Genome Browser

Mouse/Human Synteny

Page 7: The UCSC Genome Browser

Track Options & Filters

Mini-buttons bring up track options, such as those for the spliced EST track below.

Page 8: The UCSC Genome Browser

Which EST to Sequence?

Page 9: The UCSC Genome Browser

MGC ESTs Drawn in Red

Page 10: The UCSC Genome Browser

DNA Coloring

Page 11: The UCSC Genome Browser

gctcgttcaggggtaaaggtgtattctagatCCACAACAAGCCCCGTGGTCTAGCACAGC AAAGAGAAAAAAAGAGAACACGAAAATGCCCTTGCTCCCCTCCGGGGGCCCCTTTTGTGC GGTTCTTGCCAACGCAGCAGCCCTCCTGCTATATAGCCCGCCGCGCCgCAGCCCCACCCG CTCAGCGCCGCCGCCCCACCAGCTCAGCACCGCCGTGCGCCCAGCCAGCCATGGGGAAGG TGAGCCCAGCCTGCGCCCCGGGACCCCGGAGCTTCCTCCATCGCGGGGGCCAGAGACTGG GGCAGGAGCAGGCCTGTGAGACCTCGCCTTGTCCCGCCTTGCCTTGCAGATCACCCTCTA CGAGGACCGGGGCTTCCAGGGCCGCCACTATGAATGCAGCAGCGACCACCCCAACCTGCA GCCCTACTTGAGCCGCTGCAACTCGGCGCGCGTGGACAGCGGCTGCTGGATGCTCTATGA GCAGCCCAACTACTCGGGCCTCCAGTACTTCCTGCGCCGCGGCGACTATGCCGACCACCA GCAGTGGATGGGCCTCAGCGACTCGGTCCGCTCCTGCCGCCTCATCCCCCACGTGAGTAC ATCCTCAAGTCAGGACCCAGGCCCTCAGGACACTCACTGGAtgGTTTCAAGCAAAAGTTA AACATTAGAAGTAGTGATCAGTcacaataaCTGAGAGTGGACAAAAGATGAACTATAGTG GATTAAGTCAATAGagttTGCTCCCCACATAAGCAAAGTATTACCCAGACAcCAGTTAAT caCAATTAATCCACAAATATGTATTGAGTAGGAATGTGTCTCCTGCCctAGGGGTTGTAT

Coloring CRYGD Start

Page 12: The UCSC Genome Browser

Gene Expression Tracks

Page 13: The UCSC Genome Browser

Alt Splicing Tracks

Page 14: The UCSC Genome Browser

Complex Transcription

Page 15: The UCSC Genome Browser

Add Your Own Tracks

• Users can extend the browser with their own tracks.

• User tracks can be private or public.

• No programming required.

• GFF, GTF, PSL or BED formats supported

#chrom start end [name strand score]

chr1 1302347 1302357 SP1 + 800

chr1 1504778 1504787 SP2 - 980
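
As a minimal sketch of what an add-your-own track file could look like, the Python below writes the two features from this slide as BED text with a `track` header line. The track name and description are made up for illustration. Note one assumption: the code uses the standard BED column order (name, score, strand), whereas the slide lists strand before score.

```python
from typing import Iterable, NamedTuple


class BedFeature(NamedTuple):
    chrom: str
    start: int   # 0-based start, per the BED convention
    end: int     # end position, not included in the feature
    name: str
    score: int   # BED scores range from 0 to 1000
    strand: str  # "+" or "-"


def to_bed_line(f: BedFeature) -> str:
    """Format one feature as a tab-separated BED line."""
    if f.strand not in ("+", "-"):
        raise ValueError(f"bad strand: {f.strand!r}")
    if not 0 <= f.start < f.end:
        raise ValueError(f"bad coordinates: {f.start}-{f.end}")
    # Assumption: standard BED order, score before strand.
    return "\t".join(
        [f.chrom, str(f.start), str(f.end), f.name, str(f.score), f.strand]
    )


def build_custom_track(name: str, description: str,
                       features: Iterable[BedFeature]) -> str:
    """Return the text of a custom track: one header line, then BED lines."""
    header = f'track name="{name}" description="{description}"'
    return "\n".join([header] + [to_bed_line(f) for f in features]) + "\n"


# The two example features from the slide; the track name is hypothetical.
features = [
    BedFeature("chr1", 1302347, 1302357, "SP1", 800, "+"),
    BedFeature("chr1", 1504778, 1504787, "SP2", 980, "-"),
]
track_text = build_custom_track("exampleSites", "Example binding sites", features)
print(track_text, end="")
```

The resulting text is what a user would paste or upload as a custom track; no browser-side programming is involved.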

Page 16: The UCSC Genome Browser

The Underlying Database

• Power users and bioinformaticians sometimes want the underlying database.

• There is a table for each track.

• Larger tracks have a table for each chromosome.

• The format of a track table is generally similar to the add-your-own track formats.

• Pieces of the database are available from the ‘tables’ browser.

• The whole database is available as tab-separated files.
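
A rough sketch of working with one of the tab-separated table dumps is shown below. The column names (`chrom`, `chromStart`, `chromEnd`, `name`) and the sample rows are assumptions chosen to mirror the BED-like custom track format; real track tables differ in their columns, so the column list would need to match the table being read.

```python
import csv
import io
from typing import Dict, Iterable, List


def load_track_table(handle: Iterable[str], columns: List[str]) -> List[Dict[str, str]]:
    """Read a tab-separated table dump into a list of row dictionaries."""
    rows = []
    for fields in csv.reader(handle, delimiter="\t"):
        # Skip blank lines and comment/header lines starting with '#'.
        if not fields or fields[0].startswith("#"):
            continue
        rows.append(dict(zip(columns, fields)))
    return rows


def overlapping(rows: List[Dict[str, str]], chrom: str,
                start: int, end: int) -> List[Dict[str, str]]:
    """Return rows on `chrom` that overlap the half-open range [start, end)."""
    return [
        r for r in rows
        if r["chrom"] == chrom
        and int(r["chromStart"]) < end
        and int(r["chromEnd"]) > start
    ]


# Hypothetical column layout and sample data standing in for a downloaded file.
COLUMNS = ["chrom", "chromStart", "chromEnd", "name"]
sample = io.StringIO(
    "chr1\t1302347\t1302357\tSP1\n"
    "chr1\t1504778\t1504787\tSP2\n"
    "chr2\t100\t200\tother\n"
)
rows = load_track_table(sample, COLUMNS)
hits = overlapping(rows, "chr1", 1300000, 1400000)
print([r["name"] for r in hits])
```

For a real dump, `sample` would be replaced by an open file handle; for tracks split into one table per chromosome, the same loader could be called once per file.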

Page 17: The UCSC Genome Browser

Parasol and Kilo Cluster

• UCSC cluster has 1000 CPUs running Linux

• 1,000,000 BLASTZ jobs in 25 hours for mouse/human alignment

• We wrote the Parasol job scheduler to keep up.

– Very fast and free.

– Jobs are organized into batches.

– Error checking at job and at batch level.
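
The idea of batches with error checking at both the job and the batch level can be illustrated with a small sketch. This is not Parasol's interface: the `Job`, `run_job`, and `run_batch` names are hypothetical, and a local thread pool stands in for the cluster. The last function works through the throughput figures on this slide, assuming all 1000 CPUs stay busy for the whole 25 hours.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Job:
    name: str
    run: Callable[[], str]         # does the work and returns its output
    check: Callable[[str], bool]   # job-level check on that output


@dataclass
class JobResult:
    name: str
    ok: bool
    detail: str


def run_job(job: Job) -> JobResult:
    """Job-level error checking: catch crashes and validate the output."""
    try:
        output = job.run()
    except Exception as exc:
        return JobResult(job.name, False, f"crashed: {exc}")
    if not job.check(output):
        return JobResult(job.name, False, "output failed check")
    return JobResult(job.name, True, output)


def run_batch(jobs: List[Job], workers: int = 4) -> Tuple[List[JobResult], List[str]]:
    """Batch-level error checking: run every job, then list the failures."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(run_job, jobs))  # results keep job order
    failed = [r.name for r in results if not r.ok]
    return results, failed


def average_job_seconds(jobs: int, hours: float, cpus: int) -> float:
    """Average CPU-seconds per job if every CPU is busy the whole time."""
    return hours * 3600 * cpus / jobs


def _boom() -> str:
    raise RuntimeError("simulated failure")


# Three toy jobs: one succeeds, one returns empty output, one crashes.
jobs = [
    Job("align_1", lambda: "42 matches", lambda out: out.endswith("matches")),
    Job("align_2", lambda: "", lambda out: bool(out)),
    Job("align_3", _boom, lambda out: True),
]
results, failed = run_batch(jobs)
print("failed jobs:", failed)
print("seconds per job:", average_job_seconds(1_000_000, 25, 1000))
```

Under the stated assumption, 1,000,000 jobs in 25 hours on 1000 CPUs is 40,000 jobs per hour, or about 90 seconds per job on average. Jobs that short are one reason a low-overhead scheduler matters.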

Page 18: The UCSC Genome Browser

Acknowledgements

NHGRI, The Wellcome Trust, HHMI, NCI, and Taxpayers in the US and worldwide.

Whitehead, Sanger, Wash U, Baylor, Stanford, DOE, and the international sequencing centers.

NCBI, Penn State, Ensembl, Genoscope, The SNP Consortium, UC Berkeley, LBL, LLL, Riken, The Mammalian Gene Collection, Softberry, IMIM, Affymetrix, Perlegen, Rosetta, the Mouse Homology Group

The thousands of people who worked on the sequence and annotations