BITS: UCSC genome browser - Part 1

88
Paco Hulpiau UCSC genome browsing http://www.bits.vib.be

description

These are the first lecture slides of the BITS bioinformatics training session on the UCSC Genome Browser. See http://www.bits.vib.be/index.php?option=com_content&view=article&id=17203990:orange-genome-browsers-ucsc-training&catid=81:training-pages&Itemid=190

Transcript of BITS: UCSC genome browser - Part 1

Page 1: BITS: UCSC genome browser - Part 1

Paco Hulpiau

UCSCgenome browsing

http://www.bits.vib.be

Page 2: BITS: UCSC genome browser - Part 1

Introduction

§ Browse genes in their genomic context

§ See features in and around a specific gene

§ Investigate genome organization and explore larger

chromosome regions

§ Search and retrieve information on a gene- and

genome-scale

§ Compare genomes

Page 3: BITS: UCSC genome browser - Part 1

Introduction

§ Collaboration between main genome browsers

Ensembl, UCSC and NCBI

» use same genome assemblies

» interlinking between sites

§ Ensembl Genome Browser: http://www.ensembl.org/

§ NCBI Map Viewer: http://www.ncbi.nlm.nih.gov/mapview/

§ UCSC Genome Browser: http://genome.ucsc.edu/

Page 4: BITS: UCSC genome browser - Part 1

Introduction

Page 5: BITS: UCSC genome browser - Part 1

Introduction

Page 6: BITS: UCSC genome browser - Part 1

Introduction

Page 7: BITS: UCSC genome browser - Part 1

Introduction

Page 8: BITS: UCSC genome browser - Part 1

Introduction

Page 9: BITS: UCSC genome browser - Part 1

Introduction

§ Collaboration between main genome browsers

Ensembl, UCSC and NCBI

» use same genome assemblies

» interlinking between sites

§ Ensembl Genome Browser: http://www.ensembl.org/

§ NCBI Map Viewer: http://www.ncbi.nlm.nih.gov/mapview/

§ UCSC Genome Browser: http://genome.ucsc.edu/

Page 10: BITS: UCSC genome browser - Part 1

Introduction

Page 11: BITS: UCSC genome browser - Part 1

Introduction

Page 12: BITS: UCSC genome browser - Part 1

Introduction

§ Collaboration between main genome browsers

Ensembl, UCSC and NCBI

» use same genome assemblies

» interlinking between sites

§ Ensembl Genome Browser: http://www.ensembl.org/

§ NCBI Map Viewer: http://www.ncbi.nlm.nih.gov/mapview/

§ UCSC Genome Browser: http://genome.ucsc.edu/

Page 13: BITS: UCSC genome browser - Part 1

Introduction

Page 14: BITS: UCSC genome browser - Part 1

Introduction

Page 15: BITS: UCSC genome browser - Part 1

Introduction

Page 16: BITS: UCSC genome browser - Part 1

Introduction

§ Other genome browsers and genome databases:

http://genome.jgi-psf.org Eukaryotic (143) and prokaryotic (505) genomes

http://www.xenbase.org Xenopus tropicalis

http://flybase.org Drosophila genes & genomes

http://www.wormbase.org C. elegans and some related nematodes

http://www.tigr.org => http://www.jcvi.org/ Comprehensive Microbial Resource (CMR) => http://cmr.jcvi.org/tigr-scripts/CMR/CmrHomePage.cgi

http://genolist.pasteur.fr Microbial genomes

Page 17: BITS: UCSC genome browser - Part 1

Introduction

Page 18: BITS: UCSC genome browser - Part 1

Introduction

Page 19: BITS: UCSC genome browser - Part 1

§ The UCSC Genome browser was created by the

Genome Bioinformatics Group

at the University of California Santa Cruz (UCSC).

http://genome.ucsc.edu/

Page 20: BITS: UCSC genome browser - Part 1

§ The Genome Browser zooms and scrolls

over chromosomes, showing the work of

annotators worldwide.

Page 21: BITS: UCSC genome browser - Part 1

§ Blat quickly maps your sequence to the genome.

BLAT is not BLAST !

BLAT works by keeping an index of the entire genome in memory.

The index consists of all non-overlapping DNA 11-mers or protein 4-mers.

The index is used to find areas of probable homology, which are then

loaded into memory for a detailed alignment.

BLAT on DNA can quickly find sequences of 95% and greater similarity

of length 40 bases or more.

BLAT on proteins finds sequences of 80% and greater similarity of length

20 amino acids or more.

Page 22: BITS: UCSC genome browser - Part 1

§ The Table Browser provides convenient

access to the underlying database.

Page 23: BITS: UCSC genome browser - Part 1

§ The Gene Sorter displays a sorted table of genes

that are related to one another.

The relationship can be one of several types, including protein-

level homology,

similarity of gene expression profiles,

or genomic proximity.

Page 24: BITS: UCSC genome browser - Part 1

§ In-Silico PCR searches a sequence database with a pair of PCR

primers, using an indexing strategy for fast performance.

§ When successful, the search returns a file (fasta) containing all

sequences in the database that lie between and include the

primer pair.

Page 25: BITS: UCSC genome browser - Part 1

§ Genome Graphs is a tool for displaying

genome-wide data sets such as the results

of genome-wide SNP association studies,

linkage studies and homozygosity mapping.

Page 26: BITS: UCSC genome browser - Part 1

§ Galaxy allows you to do analyses you cannot do

anywhere else without the need to install or

download anything.

§ You can analyze multiple alignments, compare

genomic annotations and much more...

Page 27: BITS: UCSC genome browser - Part 1

§ VisiGene lets you browse through a large

collection of in situ mouse and frog images.

Page 28: BITS: UCSC genome browser - Part 1

§ The Proteome Browser provides a wealth of

protein information presented in the form of

graphical images of tracks and histograms

and links to other sites.

Page 29: BITS: UCSC genome browser - Part 1

§ The Utilities page contains links to some tools

created by the UCSC Genome Bioinformatics Group.

§ DNA Duster & Protein Duster remove non-sequence

related characters from an input sequence.

Page 30: BITS: UCSC genome browser - Part 1

§ The Utilities page contains links to some tools

created by the UCSC Genome Bioinformatics Group.

§ DNA Duster & Protein Duster remove non-sequence

related characters from an input sequence.

Page 31: BITS: UCSC genome browser - Part 1
Page 32: BITS: UCSC genome browser - Part 1

Clade – Genome - Assembly

Page 33: BITS: UCSC genome browser - Part 1
Page 34: BITS: UCSC genome browser - Part 1
Page 35: BITS: UCSC genome browser - Part 1
Page 36: BITS: UCSC genome browser - Part 1
Page 37: BITS: UCSC genome browser - Part 1
Page 38: BITS: UCSC genome browser - Part 1
Page 39: BITS: UCSC genome browser - Part 1
Page 40: BITS: UCSC genome browser - Part 1

GENOMEBROWSERDISPLAY

Page 41: BITS: UCSC genome browser - Part 1

POSITIONCONTROL

Page 42: BITS: UCSC genome browser - Part 1

TRACK CONTROL

Page 43: BITS: UCSC genome browser - Part 1

Navigation: position control

Page 44: BITS: UCSC genome browser - Part 1
Page 45: BITS: UCSC genome browser - Part 1
Page 46: BITS: UCSC genome browser - Part 1

Navigation: position control

§ Click the zoom in and zoom out buttons on top

to zoom in or out 1.5, 3 or 10-fold

on the center of the window

Page 47: BITS: UCSC genome browser - Part 1

Navigation: position control

§ Zoom in 3-fold by clicking anywhere

on the base position track

§ Zoom to a specific region using “drag and zoom”

Page 48: BITS: UCSC genome browser - Part 1

Navigation: position control

§ To scroll the view of the display horizontally

by set increments of 10%, 50% or 95%

of the displayed size (as given in base pairs)

click the corresponding move arrow

Page 49: BITS: UCSC genome browser - Part 1

Navigation: position control

§ To scroll the left of right side by a specified number of

vertical gridlines while keeping the opposite side fixed

click the appropriate move start or move end

arrow

Page 50: BITS: UCSC genome browser - Part 1

Navigation: position control

§ To display a (completely) different position

enter the new location in the position/search text

box

§ You can also jump to an other gene location

Page 51: BITS: UCSC genome browser - Part 1

Annotation Tracks

TRACK CONTROL

Page 52: BITS: UCSC genome browser - Part 1

HIDE = removes a track from view

FULL = each item on a separate line

Page 53: BITS: UCSC genome browser - Part 1

DENSE = all items collapsed into single line

SQUISH = all items on several lines PACKED and at 50% height

PACK = each item separate and efficiently stacked (full height)

Page 54: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 55: BITS: UCSC genome browser - Part 1

Annotation Tracks

§ Different genome/assembly => different tracks!

Page 56: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 57: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 58: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 59: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 60: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 61: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 62: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 63: BITS: UCSC genome browser - Part 1

Annotation Tracks

§ Now try to change the tracks as follows

Page 64: BITS: UCSC genome browser - Part 1

Annotation Tracks

§ and...

Page 65: BITS: UCSC genome browser - Part 1

SQUISH

PACK

FULL

DENSE

SQUISH

Page 66: BITS: UCSC genome browser - Part 1

UTR EXON

INTRON

direction of transcription

EXON

Page 67: BITS: UCSC genome browser - Part 1
Page 68: BITS: UCSC genome browser - Part 1
Page 69: BITS: UCSC genome browser - Part 1
Page 70: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 71: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 72: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 73: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 74: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 75: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 76: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 77: BITS: UCSC genome browser - Part 1

Annotation Tracks

Page 78: BITS: UCSC genome browser - Part 1

Browser graphics in PDF

TABLE BROWSER

GET DNA

CLICK LINE

CURRENT BROWSER GRAPHIC IN PDF

TO GET OTHER DATA

Page 79: BITS: UCSC genome browser - Part 1

CURRENT BROWSER GRAPHIC IN PDF

TO GET OTHER DATA

1

Page 80: BITS: UCSC genome browser - Part 1
Page 81: BITS: UCSC genome browser - Part 1
Page 82: BITS: UCSC genome browser - Part 1

Exercises (I)

1) Search for your gene of interest

on Human Feb. 2009 (GRCh37/hg19) Assembly

» Include 1000 base pairs up- and downstream

» Only show the tracks:

RefSeq Genes (pack)

Conservation (full, primates only)

» Save graphical view as PDF (exercises1_1)

Page 83: BITS: UCSC genome browser - Part 1

Exercises (I)

2) How many transcripts are there?

» Compare UCSC Genes with RefSeq and Ensembl genes!

» Save graphical view as PDF (exercises1_2)

Page 84: BITS: UCSC genome browser - Part 1

Exercises (I)

3) What are the flanking genes?

Are these conserved outside mammals?

» Zoom out until you can see at least

two or three flanking genes

(may need to hide some tracks, leave RefSeq on)

» Now have a look in the chicken genome

» Save graphical view as PDF

(exercises1_3a en exercises1_3b)

Page 85: BITS: UCSC genome browser - Part 1

Exercises (I)

4) Is there any regulatory information available?

» Change the view to see the genomic region upstream

(exon 1 and ~2000 upstream) and open some regulatory tracks

e.g. ORegAnno, TFBS Conserved, TS miRNA sites

» Save graphical view as PDF (exercises1_4)

Page 86: BITS: UCSC genome browser - Part 1
Page 87: BITS: UCSC genome browser - Part 1
Page 88: BITS: UCSC genome browser - Part 1