Post on 10-Apr-2018
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 1/20
Bioinformatics Workshop 2
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 2/20
The background to today¶s task: HER oncogenes
Nomenclature: EGFR, HER, erb-b
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 3/20
Tasks
Download the answersheet (2) documentfrom blackboard
Go to Genbank homepage.http://www.ncbi.nlm.nih.gov/
Search for all the Human EGFR protein
sequences
Remember to select proteins in the drop menu
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 4/20
EGFR (Homo sapiens):
epidermal growth factor
receptor
Click on
Have a look around the site!
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 5/20
Click on HPRB
FU 228 - 270
REC 361 - 481
FU 496 - 547
FU 552 - 601
REC 57 - 168
TM 646 - 668
Tyr_Kinase 712 - 968
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 6/20
Go ³Back´
Click on Reference Sequence Details
Click on Swiss Prot P00533
Then click on Fasta
Open the file and copy the sequence
on to your clipboard
Click back and then the NCBI logo
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 7/20
Click on ³Blast´
³Human´
Database ³RefSeq´
Programme: BlastP
Then Click Away
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 8/20
Click On ³Select all´ and then
³Distance Tree of Results´
Copy
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 9/20
Go back to Blast: This time search whole
database (not just human)
(get the sequence from your saved
EGFR FAST A file)
Putative conserved domains have been detected,
click on the image below for detailed results.
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 10/20
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 11/20
Tasks
Next you will need to go to
http://www.megasoftware.net/mega41.html
When you get the MEGA link, download thewindows version on to your D Drive.
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 12/20
Launch Mega
Download ³Sequences for class´
from the blackboard site
File > Open Data
Selected ³Sequences for class´
Ignore the Error Message
Utilities > Convert to MEGA format
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 13/20
Back to MEGA main page
Alignment >
AlignmentExplorer Clustal>
Create a new alignment>
Click No for protein>
Then«.
Data>Open>
Retrieve Sequences from file
Click on ³Sequences for class´
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 14/20
Edit>Select All
Alignment>Align by CLUST AL
OK to all ³DEFAULT´
File > Exit
Save Data to MEGA file
Open data in MEGA file
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 15/20
Click Data> Data Explorer
Display Colour Cells
Highlight ³Conserved Sites´
Download to XL (Excel)
Where are the variable and conservedregions?
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 16/20
Go back to main MEGA site and click on
Phylogeny
..and play
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 17/20
Tasks
Looking at the tree diagram, write a
short paragraph explaining what you
think the tree means. You may wish toinclude things such as the basal taxon,
description of who is related to who,
whether there are any major groupingsor clades etc. This section should be no
more than 150 words.
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 18/20
What is bootstrapping?
Which do you think are the
oncogenic forms?
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 19/20
Redo the ³alignments´ (on main MEGA
page) but highlighting only the Human
normal and oncogenic forms
What are the conserved and
variable sequences?
How do these relate toprotein motif functions?
8/8/2019 Bioinf Workshop2a (1)
http://slidepdf.com/reader/full/bioinf-workshop2a-1 20/20
Submission
Send your answer sheet to me by next
week.
Include your Exel spread sheets if you
wish!!