Statistical Analysis of Microarray Data By H. Bjørn Nielsen.

Statistical Analysis of

Microarray Data

ByH. Bjørn Nielsen

Sample PreparationHybridization

Array designProbe design

QuestionExperimental Design

Buy Chip/Array

Statistical AnalysisFit to Model (time series)

Expression IndexCalculation

Advanced Data AnalysisClustering PCA Classification Promoter AnalysisMeta analysis Survival analysis Regulatory Network

ComparableGene Expression Data

Normalization

Image analysis

The DNA Array Analysis Pipeline

What's the question?

Typically we want to identify differentially expressed genes

without alcohol with alcohol

alcohol dehydrogenase

Example: alcohol dehydrogenase is expressed at a higher level when alcohol is added to the media

He’s going to say it

However, the measurements contain stochastic noise

There is no way around it

Statistics

Noisymeasurements p-value

statistics

You can choose to think of statistics as a black box

But, you still need to understand how to interpret the results

P-valueThe chance of rejecting the null hypothesis by coincidence----------------------------For gene expression analysis we can say: the chance that a gene is categorized as differentially expressed by coincidence

The output of the statistics

The statistics gives us a p-value for each gene

We can rank the genes according to the p-value

But, we can’t really trust the p-value in a strict statistical way!

Why not!

For two reasons:

1. We are rarely fulfilling all the assumptions of the statistical test

2. We have to take multi-testing into account

The t-test Assumptions

1. The observations in the two categories must be independent

2. The observations should be normally distributed

3. The sample size must be ‘large’(>30 replicates)

Multi-testing?

In a typical microarray analysis we test thousands of genes

If we use a significance level of 0.05and we test 1000 genes. We expect 50 genes to be significant by chance

1000 x 0.05 = 50

log2 fold change (M)

Volcano Plot

What's inside the black box ‘statistics’

t-test or ANOVA

The t-test

Calculate T

Lookup T in a table

The t-test II

The t-test tests for difference in means ()

Intensityof gene x

Density wt wt mut mutant

The t statistic is based on the sample mean and variance

The t-test III

Conclusion

• Array data contains stochastic noise– Therefore statistics is needed to conclude on

differential expression

• We can’t really trust the p-value• But the statistics can rank genes• The capacity/needs of downstream

processes can be used to set cutoff• FDR can be estimated• t-test is used for two category tests• ANOVA is used for multiple categories

Statistical Analysis of Microarray Data By H. Bjørn Nielsen.

Documents

Transcript of Statistical Analysis of Microarray Data By H. Bjørn Nielsen.

Microarray Analysis - The Basicstgirke/HTML_Presentations/Manuals/Microarray/... · Microarray Analysis The Basics Thomas Girke December 9, 2011 Microarray Analysis Slide 1/42

Hafslund ASA 24 October 2013 Finn Bjørn Ruyter, CEO · 24 October 2013 Finn Bjørn Ruyter, CEO. Agenda 1) Third quarter 2013 – Finn Bjørn Ruyter, CEO 2) Hafslund Heat – Finn

Tutorial Microarray

Lecture 9 Microarray experiments MA plots Normalization of microarray data

DNA Arrays - CBS · Print Microarray Hybridize to microarray cDNA Library or Oligo Probes Prepare Sample. Page 4 The benchtop microarray facility... Synthesis Microarray synthesis

Microarray Data Pre-processing - Bioinfo · Microarray Data Pre-processing Ana H. Barragan Lid. Hybridized Microarray •Imaged in a microarray scanner •Scanner produces fluorescence

Analysis of GO annotation at cluster level by H. Bjørn Nielsen Slides from Agnieszka S. Juncker.

Professor Bjørn Asheim, Deputy Director ,

BioVLAB-Microarray: Microarray Data Analysis in …web.ornl.gov/~choij/papers/BioVLAB.pdf · BioVLAB-Microarray: Microarray Data Analysis in Virtual Environment Youngik Yang, Jong

Microarray analysis

Claus Bjørn Billehøj - Copenhagen Smart City

Microarray Technology

The religious art of Bjørn Bjørneboe

Microarray Basics, and Planning a Microarray Experiment

Sjølogistikkonferansen 201308 stavanger samskip multimodal - bjørn waglen

Bjørn forslund ironroad vms - februar 2012

Peter Bjørn Larsen - Öresund Smart City Hub

Plant Microarray

Analysing microarray expression data through effective ...web.cs.ucla.edu/~zaniolo/papers/Analysing microarray expression... · Analysing microarray expression data through ... Analysing

Good Management Effective Organisations Bjørn Bauer PlanMiljø.