Bioinf Workshop2a (1)

Post on 10-Apr-2018


8/8/2019 Bioinf Workshop2a (1)

http://slidepdf.com/reader/full/bioinf-workshop2a-1 1/20

Bioinformatics Workshop 2


The background to today's task: HER oncogenes

 Nomenclature: EGFR, HER, erb-b


Tasks

Download the answer sheet (2) document from Blackboard

Go to the GenBank homepage: http://www.ncbi.nlm.nih.gov/

Search for all the human EGFR protein sequences

Remember to select proteins in the drop-down menu


Click on “EGFR (Homo sapiens): epidermal growth factor receptor”

Have a look around the site!


Click on HPRD

FU 228 - 270

REC 361 - 481

FU 496 - 547

FU 552 - 601

REC 57 - 168

TM 646 - 668

Tyr_Kinase 712 - 968
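The domain coordinates above can be kept in a small lookup table, which makes it easy to ask which domain a given residue falls in. The following is a minimal Python sketch, assuming the ranges are 1-based and inclusive; the names `Domain`, `EGFR_DOMAINS` and `domain_at` are hypothetical and used only for illustration.

```python
from typing import NamedTuple, Optional


class Domain(NamedTuple):
    name: str
    start: int  # first residue (assumed 1-based, inclusive)
    end: int    # last residue (assumed 1-based, inclusive)


# Coordinates copied from the slide, then sorted by start position.
EGFR_DOMAINS = sorted(
    [
        Domain("FU", 228, 270),
        Domain("REC", 361, 481),
        Domain("FU", 496, 547),
        Domain("FU", 552, 601),
        Domain("REC", 57, 168),
        Domain("TM", 646, 668),
        Domain("Tyr_Kinase", 712, 968),
    ],
    key=lambda d: d.start,
)


def domain_at(position: int) -> Optional[Domain]:
    """Return the domain containing the residue position, or None."""
    for domain in EGFR_DOMAINS:
        if domain.start <= position <= domain.end:
            return domain
    return None


if __name__ == "__main__":
    for pos in (100, 650, 700, 800):
        hit = domain_at(pos)
        print(pos, hit.name if hit else "no annotated domain")
```

Sorting by start position also gives the domain order along the protein, which is handy later when comparing conserved and variable regions with the domain layout.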


Go “Back”

Click on Reference Sequence Details

Click on Swiss-Prot P00533

Then click on FASTA

Open the file and copy the sequence to your clipboard
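A FASTA file is plain text: a header line starting with ">" followed by one or more lines of sequence. The sketch below shows one way to read such a file in Python. The function name `parse_fasta` and the toy record are hypothetical; the toy sequence is a placeholder and is not the EGFR sequence.

```python
from typing import Dict, List


def parse_fasta(text: str) -> Dict[str, str]:
    """Map each FASTA header (without '>') to its joined sequence."""
    chunks: Dict[str, List[str]] = {}
    header = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]
            chunks[header] = []
        elif header is not None:
            chunks[header].append(line)
    return {name: "".join(parts) for name, parts in chunks.items()}


# Hypothetical toy record, used only to show the format.
example = """>toy_protein example record (not real EGFR)
MKTAYIAKQR
QISFVKSHFS
"""

sequences = parse_fasta(example)

if __name__ == "__main__":
    for name, seq in sequences.items():
        print(name, len(seq), seq)
```

To use it on the file saved in this step, read the file contents into a string and pass that string to `parse_fasta`.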

Click back and then the NCBI logo


Click on “BLAST”

“Human”

Database “RefSeq”

Programme: BLASTP

Then click away


Click on “Select all” and then “Distance Tree of Results”

Copy


Go back to BLAST. This time, search the whole database (not just human).

(Get the sequence from your saved EGFR FASTA file.)

Putative conserved domains have been detected,

click on the image below for detailed results.


Tasks

Next you will need to go to

http://www.megasoftware.net/mega41.html

When you get the MEGA link, download the Windows version onto your D drive.


Launch MEGA

Download “Sequences for class” from the Blackboard site

File > Open Data

Select “Sequences for class”

Ignore the error message

Utilities > Convert to MEGA format


Back to MEGA main page

Alignment >

Alignment Explorer/CLUSTAL >

Create a new alignment >

Click No (for protein) >

Then…

Data > Open >

Retrieve sequences from file

Click on “Sequences for class”


Edit > Select All

Alignment > Align by CLUSTAL

Click OK to accept all “DEFAULT” settings

File > Exit

Save Data to MEGA file

Open data in MEGA file


Click Data > Data Explorer

Display Colour Cells

Highlight “Conserved Sites”

Download to XL (Excel)

Where are the variable and conserved regions?
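To see what highlighting conserved sites amounts to, the sketch below scans an alignment column by column and reports the columns where every sequence carries the same residue. It is an illustration only, not MEGA's implementation: the toy alignment is made up, and treating any column containing a gap ("-") as not conserved is an assumption.

```python
from typing import Dict, List


def conserved_sites(alignment: Dict[str, str]) -> List[int]:
    """Return 1-based column numbers where all sequences share one residue.

    Assumption: a column containing a gap ('-') is never counted as conserved.
    """
    seqs = list(alignment.values())
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("all aligned sequences must have the same length")
    sites = []
    for i in range(length):
        column = {s[i] for s in seqs}
        if len(column) == 1 and "-" not in column:
            sites.append(i + 1)
    return sites


# Hypothetical toy alignment (not the class sequences).
toy_alignment = {
    "seq_a": "MKT-LLA",
    "seq_b": "MKTALLA",
    "seq_c": "MRTALIA",
}

if __name__ == "__main__":
    conserved = conserved_sites(toy_alignment)
    variable = [i for i in range(1, 8) if i not in conserved]
    print("conserved columns:", conserved)
    print("variable columns:", variable)
```

Runs of consecutive conserved columns can then be compared with the domain coordinates listed earlier to see which domains are most conserved.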


Go back to the main MEGA page and click on Phylogeny

... and play
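Distance-based tree methods such as neighbour-joining start from a table of pairwise distances between the aligned sequences. The sketch below computes the simplest such measure, the p-distance (the proportion of compared sites that differ). It is not MEGA's implementation: the toy alignment is made up, and skipping any site where either sequence has a gap is an assumption.

```python
from itertools import combinations


def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites, ignoring sites with a gap in either sequence."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a, seq_b):
        if a == "-" or b == "-":
            continue  # assumption: gapped sites are skipped for this pair
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


# Hypothetical toy alignment (not the class sequences).
toy_alignment = {
    "seq_a": "MKT-LLA",
    "seq_b": "MKTALLA",
    "seq_c": "MRTALIA",
}

distances = {
    (x, y): p_distance(toy_alignment[x], toy_alignment[y])
    for x, y in combinations(toy_alignment, 2)
}

if __name__ == "__main__":
    for pair, d in distances.items():
        print(pair, round(d, 3))
```

Sequences separated by small distances end up close together in the tree, which is worth keeping in mind when describing the groupings.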


Tasks

Looking at the tree diagram, write a short paragraph explaining what you think the tree means. You may wish to include things such as the basal taxon, a description of who is related to whom, whether there are any major groupings or clades, etc. This section should be no more than 150 words.


What is bootstrapping?

Which do you think are the oncogenic forms?
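As a starting point for the bootstrapping question: in phylogenetics, a bootstrap replicate is made by resampling the alignment columns with replacement, a tree is rebuilt from each replicate, and the support for a clade is the percentage of replicate trees that contain it. The sketch below shows only the resampling step, on a made-up toy alignment; the function name `bootstrap_alignment` is hypothetical and the tree-building step is left out.

```python
import random
from typing import Dict


def bootstrap_alignment(alignment: Dict[str, str], rng: random.Random) -> Dict[str, str]:
    """Build one bootstrap replicate by sampling columns with replacement."""
    names = list(alignment)
    length = len(alignment[names[0]])
    if any(len(alignment[n]) != length for n in names):
        raise ValueError("all aligned sequences must have the same length")
    # The same sampled column indices are used for every sequence,
    # so each replicate column is an intact column of the original.
    columns = [rng.randrange(length) for _ in range(length)]
    return {n: "".join(alignment[n][c] for c in columns) for n in names}


# Hypothetical toy alignment (not the class sequences).
toy_alignment = {
    "seq_a": "MKT-LLA",
    "seq_b": "MKTALLA",
    "seq_c": "MRTALIA",
}

if __name__ == "__main__":
    rng = random.Random(2018)  # fixed seed so the run is repeatable
    for i in range(3):
        print("replicate", i + 1, bootstrap_alignment(toy_alignment, rng))
```

Each replicate has the same number of columns as the original alignment, but some columns appear more than once and others not at all.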


Redo the “alignments” (on the main MEGA page), this time highlighting only the human normal and oncogenic forms.

What are the conserved and variable sequences?

How do these relate to protein motif functions?


Submission

Send your answer sheet to me by next week.

Include your Excel spreadsheets if you wish!