DRASTIC Database Resource for Analysis of Signal Transduction in Cells Gary Lyon Interrogating the...

45
DRASTIC D atabase R esource for A nalysis of S ignal T ransduction in C ells www.drastic.org.uk Gary Lyon Interrogating the DRASTIC Gene Expression Database 30 April 2004

Transcript of DRASTIC Database Resource for Analysis of Signal Transduction in Cells Gary Lyon Interrogating the...

Page 1: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

DRASTIC Database Resource for Analysis of Signal Transduction in Cells

www.drastic.org.uk

Gary Lyon

Interrogating the DRASTIC Gene Expression Database

30 April 2004

Page 2: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Aim of DRASTICAim of DRASTIC

To understand signal transduction in response to plant pathogens and other environmental stresses.

To assist with putting into context the results of our own gene discovery work within the PPI Programme

and

Publicity !

Page 3: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Why do we need ‘DRASTIC’?Why do we need ‘DRASTIC’?

• Published gene expression data is not searchable.

• Too much data to remember e.g. microarray data.

• Cannot match ‘unknown’ genes with prior expression data (14.2% of entries in the database are ‘unknown’).

• Gene names associated with certain accession numbers change with time.

• Cell biology is complex. [Simple answers to complex problems are always wrong]

Page 4: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

For exampleFor example

• One gene can have a variety of names : HBZip homeobox domain HD-zip homeobox protein homeobox domain zipper protein transcription factor, homeobox protein

• Names can be wrong: ‘HB AtHB-14 like’ should be ‘AtHB-9’ ‘Htf9C’ should be ‘RNA methyltransferase-related’ ‘endo 1,4-beta-mannosidase like’ should be ‘protein kinase family’

• Names can be confusing: ‘HSR201 like’ ‘RSH2 :Rel-SpoT homology’

Page 5: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

www.drastic.org.uk

Page 6: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Access database

• Incorporates published data from microarrays and Northerns of ESTs regulated by various treatments

(i) Environmental stress e.g. drought, NaCl, high and low temperatures

(ii) Pathogens and elicitors (salicylic acid, ethylene, jasmonates)

• 424 references

• 266 treatments

• 67 plant species

• 10,193 gene accessions

Page 7: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 8: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 9: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Selection by Gene nameSelection by Gene name

Page 10: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 11: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 12: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 13: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 14: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

treatment 1 treatment 2 treatment 3

1

2

3

4

5

6

7

Potential signalling networks

Page 15: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 16: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 17: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Funded by a 1 year PGRA grant from Carnegie Trust awarded to:University of Abertay

– Dr Les Ball, Dr Louis Natanson (Computing)– Prof Kevan Gartland, Dr Jill Gartland (Biotech.)– Davina Button (RA)

University of Edinburgh– Prof Peter Ghazal (GTI; Scottish Centre for Genomic Technology and

Informatics)

University of St Andrews– Dr Ishbel Duncan (Computer Science)

Aim:

–To build an intelligent and generic system for new hypothesis formulation from complex biochemical pathway databases.

Davina Button

Page 18: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

‘Road Map’

Page 19: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Options with the new database

Page 20: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Genes induced by BTH

Page 21: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

pathogen induced – incompatible (Arabidopsis)

Page 22: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 23: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 24: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 25: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Pathways e.g glycolysis enzymes

Conversion of glucose to pyruvate

• Wrong pathway

• Insufficient data

• Some errors (different time points? low homology!)

• Evidence of another pathway

Possible interpretations:-

Page 26: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 27: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 28: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 29: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

1. Les Ball (Abertay),

2. Prof Bonnie Webber (School of Informatics, Edinburgh University),

3. CABI.

• Data input and

• Data analysis

Could be used to provide a putative relationship between genes/proteins based on existing knowledge in the literature. This model could be combined with information in the gene expression database to provide a draft version of a regulatory gene network.

Text mining

Page 30: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Web stats - Location of users

Impact factors ?!

Page 31: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 32: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

DRASTIC

Database Resource for Analysis of Signal Transduction in Cells

SCRI

Gary Lyon

Adrian Newton

Bruce Marshall

University of Abertay

Les Ball

Louis Natason

Alasdair Houston

www.drastic.org.uk

Page 33: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Can we group treatments?

Page 34: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 35: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 36: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Genes up-regulated by Sulphur depletionGenes up-regulated by Sulphur depletion

Page 37: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Another exampleAnother example

The same gene can have different accession numbers – a big problem with genes of unknown function.

However, by converting accession numbers into AGI numbers we have shown that for the following ESTs

down-regulated by :-chitin (viz H37231, R90140, T41806), drought (viz AV823744), ethylene (viz R90140), low oxygen (At2g10940) or sodium chloride (AV823744),

or up-regulated by salicylic acid (R90140, H37231)

are all the same gene viz At2g10940

Page 38: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

up-regulated down-regulatedArabidopsis 5052 1246potato 168 8tomato 393 213Nicotiana tabacum 258 87pepper 113 0rice 234 43 ethylene 105 20salicylic acid 330 146jasmonates (methyl) 344 135jasmonic acid 78 2 Ecc 35 0Eca 3 0P. infestans (incompatible) 15 1P. infestans (compatible) 51 3 cold 436 187drought 690 263sodium chloride 546 248wounding 510 63 Abscisic acid 359 46 Total in database 7127 1828

Plants

Treatments

Pathogens

Environmental stresses

Number of entries in the Gene expression Database - examples

Page 39: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 40: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 41: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 42: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

What else could we do with the data?What else could we do with the data?

• Identify potato and barley orthologs of stress induced genes

• Map the position of the stress inducible genes

• Statistical analysis of signal transduction genes

• What are the differences between different plant tissues e.g. roots v. leaves.

Page 43: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.

Information from Maleck et al., Nature Genetics (Dec 2000) 26, 403-410

Out of 50 accession numbers checked (March 2004):-

• 26 (52%) were correctly identified

• 3 (6%) were wrongly identified (though 2 of these could be classed as ‘additional information being made available’ with only 1 really wrong.

• 13 (26%) are newly identified with a gene name (these were originally described (‘no homology’)

• 8 (16%) remain unknown but have an AGI number (these were originally described as ‘no homology’)

 

Page 44: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.
Page 45: DRASTIC Database Resource for Analysis of Signal Transduction in Cells  Gary Lyon Interrogating the DRASTIC Gene Expression Database.