Tutorial 3 BLAST 1. BLAST tutorial How to use BLAST Score vs. E-value Exercise Cool story of the...
Uploaded by molly-stokes
Tutorial 3
BLAST
BLAST tutorial
• How to use BLAST
• Score vs. E-value
• Exercise
• Cool story of the day: How Alzheimer's disease is studied in yeast
What is BLAST?
• Basic Local Alignment Search Tool
• A set of similarity search programs for exploring sequence databases.
[Diagram: Query, BLAST program, Database]
Why perform a similarity search?
• Find genes/proteins with possibly similar function
• Find the origin of a sequence (what organism it is taken from)
• Different degrees of similarity can be found in a database search
BLAST Databases

Program   Query type            Database type
blastn    Genomic               Genomic
blastp    Proteomic             Proteomic
blastx    Translated genomic    Proteomic
tblastn   Proteomic             Translated genomic
tblastx   Translated genomic    Translated genomic

• Genomic: A T G C
• Proteomic: G A S T C V L I M P F Y W D E N Q H K R
• Translated genomic: a genomic sequence (query or database) translated to protein in the 6 possible reading frames
  ATGCCGTTC -> MPF, CR, AV (the three forward frames)
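The six-frame translation can be sketched in a few lines of Python. This is only an illustration, assuming the standard genetic code and an unambiguous A/C/G/T sequence; the function names are made up for this example and are not part of BLAST.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def reverse_complement(seq):
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons from the start of seq; '*' marks a stop."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


def six_frames(seq):
    """Three forward frames followed by three reverse-complement frames."""
    seq = seq.upper()
    rc = reverse_complement(seq)
    return [translate(strand[offset:]) for strand in (seq, rc) for offset in range(3)]


frames = six_frames("ATGCCGTTC")
print(frames[:3])  # forward frames: ['MPF', 'CR', 'AV']
print(frames[3:])  # frames read from the reverse complement
```

The first three entries reproduce the slide's example; the last three come from the opposite strand, which is why a translated search considers six frames rather than three.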
Query and DB parameters
• Place the query
• Choose the database
• Job title: helpful when running multiple searches
• Organism restriction: in case you want to restrict the search to a specific organism
• Exclusion: in case you want to eliminate specific sequences
How to choose the database?
Depends on what you’re looking for…
• nr/nt (non-redundant nucleotide): a good place to start if you don’t know what you’re looking for
Alignment parameters
• Program selection: optimizes the parameters for the desired similarity level of the search
• Threshold for result significance
• Primary word match (16-64 nt)
• Scores of matching and mismatching bases
• Cost to create and extend a gap
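The same alignment parameters can also be set when running the standalone BLAST+ `blastn` program. The sketch below only builds the command line and does not run it. The file name, database name, and numeric values are illustrative assumptions, and `blastn` accepts only certain combinations of match/mismatch scores and gap costs.

```python
import shlex


def build_blastn_command(query, db, evalue=1e-6, word_size=11,
                         reward=2, penalty=-3, gapopen=5, gapextend=2):
    """Build a blastn argument list; the default values are illustrative only."""
    return [
        "blastn",
        "-query", query,               # FASTA file with the query sequence
        "-db", db,                     # name of a formatted BLAST database
        "-evalue", str(evalue),        # threshold for result significance
        "-word_size", str(word_size),  # length of the primary word match
        "-reward", str(reward),        # score for a matching base
        "-penalty", str(penalty),      # score for a mismatching base
        "-gapopen", str(gapopen),      # cost to create a gap
        "-gapextend", str(gapextend),  # cost to extend a gap
    ]


# "query.fasta" and "nt" are placeholder names for this example.
command = build_blastn_command("query.fasta", "nt")
print(shlex.join(command))
```

Passing the list to `subprocess.run` would execute the search on a machine where BLAST+ and the database are installed.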
How to interpret BLAST results?
Search for homologs of the chick “olfactory receptor 6” gene
Search results
Graphic Summary
• Query sequence
• Matched sequences from DBs
Descriptions
• Sequence identifier + link
• Sequence description
• Score (bits)
• % Coverage
• % Identity
• E value

Descriptions
• Query covered = 55%: only 55% of the query is covered => ~230 bp
• Identity = 71%: out of the 230 bp of alignment, only 71% were matches
• E-value: nevertheless, this alignment is very significant
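The arithmetic behind these numbers can be checked with a short sketch. It assumes the aligned region is about 230 bp and ignores gaps, so the figures are approximate; the function name is made up for this example.

```python
def alignment_summary(aligned_len, coverage, identity):
    """Estimate query length and matching positions from coverage and identity."""
    query_len = round(aligned_len / coverage)  # coverage = aligned_len / query_len
    matches = round(aligned_len * identity)    # identity = matches / aligned_len
    return query_len, matches


query_len, matches = alignment_summary(230, 0.55, 0.71)
print(query_len, matches)  # roughly a 418 bp query with about 163 matching bases
```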
Alignments
• Query info
• Alignment info
• Alignment
It is possible to get multiple hits per sequence
E-values and scores
Score vs. E-value
• The score is a measure of the similarity of the query to a sequence from the database.
• The E-value is a measure of the reliability of the score.
The definition of the E-value is: the number of alignments with the observed score or higher that are expected due to chance.
Score vs. E-value
• Score (S) = (sum of the scores for identities and mismatches) - (gap penalties)
• Bit score (S'): a normalized score that depends on the score (S) and on the scoring system
• The E-value depends on the bit score and on the search space: the query length (bp) and the effective length of the database (total number of bases, bp)
• E-values cannot be compared across different DBs, even if the score is the same.
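A minimal sketch of how these quantities relate, assuming the standard Karlin-Altschul relations described in the NCBI BLAST tutorial: S' = (λS - ln K) / ln 2 and E = m · n · 2^(-S'), where m is the query length and n the effective database length. The parameters λ and K depend on the scoring system; the function names and example numbers are illustrative.

```python
import math


def bit_score(raw_score, lam, k):
    """Normalize a raw score S using scoring-system parameters lambda and K."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def e_value(bits, query_len, db_len):
    """Expected number of chance alignments scoring at least `bits`."""
    return query_len * db_len * 2.0 ** (-bits)


# Same bit score and query length, two hypothetical database sizes.
small_db = e_value(50, query_len=400, db_len=10**9)
large_db = e_value(50, query_len=400, db_len=10**10)
print(small_db, large_db)  # the larger database gives a 10x larger E-value
```

This is why an E-value from one database cannot be compared directly with an E-value from another: the search space enters the formula as a multiplier.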
Intuition for “significance”
• Think of the query as a ball; each color represents a part of the sequence.
• The DB is a pool of colored balls.
• If the ball has many colors (longer query), there is a higher probability of seeing the same color in the pool by chance.
• If the pool of balls is very big, there is a higher probability of seeing one of the ball's colors in the pool.
E-value Threshold
The typical threshold for a good E-value from a BLAST search is E = 10^-6 (written 1e-6) or lower. This does not mean that hits with higher E-values have no biological significance.
http://www.youtube.com/watch?v=Z7ek7UoP7Bg&src_vid=nO0wJgZRZJs&feature=iv&annotation_id=annotation_234259
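Applying such a threshold to saved results can be sketched as follows. The sketch assumes BLAST+ tabular output (`-outfmt 6`) with its default column order, where the E-value is the 11th column; the two sample rows are invented for illustration and are not real search results.

```python
def significant_hits(tabular_lines, threshold=1e-6):
    """Return (subject id, E-value) pairs with E-value at or below the threshold."""
    hits = []
    for line in tabular_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.rstrip("\n").split("\t")
        evalue = float(fields[10])  # default columns: ... send, evalue, bitscore
        if evalue <= threshold:
            hits.append((fields[1], evalue))
    return hits


# Hypothetical rows: qseqid sseqid pident length mismatch gapopen
# qstart qend sstart send evalue bitscore
rows = [
    "\t".join(["query1", "subjectA", "71.0", "230", "60", "3",
               "1", "230", "15", "244", "3e-40", "160"]),
    "\t".join(["query1", "subjectB", "65.0", "40", "14", "0",
               "10", "49", "5", "44", "0.2", "30.1"]),
]
print(significant_hits(rows))  # only subjectA passes the 1e-6 threshold
```

Hits that fail the threshold are not necessarily meaningless, so it can be worth inspecting them rather than discarding them automatically.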
E-value vs. P-value
• The P-value is the probability that an event will happen by chance.
• The E-value is a correction of the P-value that takes the DB size into account.
So if the probability of finding a sequence by chance is 0.001 and the DB has 1,000,000 entries, the expected number of alignments we will find is 1,000!
http://homepages.ulb.ac.be/~dgonze/TEACHING/stat_scores.pdf
http://www.ncbi.nlm.nih.gov/BLAST/tutorial/
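The example above is a single multiplication, sketched here together with the usual conversion back from an E-value to a P-value. The conversion P = 1 - e^(-E) assumes the number of chance hits follows a Poisson distribution, as in the NCBI tutorial; the function names are made up for this example.

```python
import math


def expected_hits(p_per_entry, db_entries):
    """Expected number of chance hits: per-entry probability times DB size."""
    return p_per_entry * db_entries


def p_value_from_e(e):
    """Probability of at least one chance hit, assuming Poisson-distributed hits."""
    return -math.expm1(-e)  # equals 1 - exp(-e), accurate for very small e


print(expected_hits(0.001, 1_000_000))  # about 1,000 expected alignments
print(p_value_from_e(1e-6))             # for small E, P is almost equal to E
```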
Exercise
Find homologs of the CFTR gene in human
• You can enter the gene ID rather than the sequence
• Human DB only
• We’ll start with high similarity
Now change to more dissimilar sequences
We get more results
Find homologs of the CFTR gene in other organisms
Not only human sequences
Where to run a nucleotide sequence: blastn or blastx?
• blastn (genomic vs. genomic): e.g. ncRNA
• blastx (translated genomic vs. proteomic)
If you know your sequence codes for a protein, blastx is better, since you will get more reliable results.
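The choice of program can be written as a small lookup that follows the query/database table from the BLAST Databases slide. This is a sketch only; the function name and the "nucleotide"/"protein" labels are assumptions made for the example.

```python
def choose_program(query, database, translate_both=False):
    """Pick a BLAST program from the query and database sequence types."""
    if query == "nucleotide" and database == "nucleotide":
        # tblastx translates both sides; blastn compares the bases directly
        return "tblastx" if translate_both else "blastn"
    if query == "nucleotide" and database == "protein":
        return "blastx"   # the query is translated in six frames
    if query == "protein" and database == "nucleotide":
        return "tblastn"  # the database is translated in six frames
    if query == "protein" and database == "protein":
        return "blastp"
    raise ValueError("sequence types must be 'nucleotide' or 'protein'")


print(choose_program("nucleotide", "protein"))     # coding sequence vs. proteins
print(choose_program("nucleotide", "nucleotide"))  # e.g. an ncRNA query
```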
Cool story of the day
How Alzheimer's disease is studied in yeast
Alzheimer's disease (AD)
• Alzheimer's disease leads to nerve cell death and tissue loss throughout the brain.
• Symptoms can include confusion, aggression, trouble with language, and long-term memory loss. Gradually, bodily functions are lost, ultimately leading to death.
• There are no available treatments that stop or reverse the progression of the disease.
• The disease is associated with plaques and tangles in the brain.
http://www.alz.org/braintour/alzheimers_changes.asp
http://en.wikipedia.org/wiki/Alzheimer's_disease
How can AD be studied in yeast?
Yeast cells lack the specialized processes of neuronal cells and the cell-cell communications that modulate neuropathology. However, the most fundamental features of eukaryotic cell biology evolved before the split between yeast and metazoans.
Treusch et al., Science (2011)
http://lindquistlab.wi.mit.edu/
Beta-amyloid (Aβ) peptide is one of the hypothesized causes of AD. The most toxic form of Aβ, Aβ 1-42, is generated by proteolytic cleavage of APP, the transmembrane amyloid precursor protein.
Thinakaran et al., Journal of Biological Chemistry (2008)
Susan Lindquist’s lab showed it was toxic when expressed in yeast. Later they tested the effect of this protein on rat neuron cells and in C. elegans neurons.
To recapitulate this multicompartment trafficking in yeast, we fused an endoplasmic reticulum (ER) targeting signal to the N terminus of Aβ 1-42.
Treusch et al., Science (2011)
http://lindquistlab.wi.mit.edu/
• The researchers looked for suppressor genes that had homologs in human and C. elegans
Treusch et al., Science (2011)
Wild-type worms invariably have five glutamatergic neurons in their tails.