Acknowledgements

1
Comparison of Microarray Data Generated from Degraded RNA using Five Different Target Synthesis Methods and Commercial Microarrays Scott Tighe and Tim Hunter Microarray Core Facility, Vermont Genetics Network at the University of Vermont, Burlington, Vermont, USA Acknowledgements Methods Result s Conclusions As the need to collect gene expression data on degraded RNA continues, an evolution of new reagents and arrays now on the market may improve these data sets. The focus of this study is to evaluate the effectiveness of five different target amplification strategies with three types of Affymetrix GeneChips employing RNA at four different levels of degradation. For this study, reference RNA was non-chemically degraded to final RIN values of 10, 6, 4, and 2 as determined by the Agilent Bioanalyzer. Each RNA was amplified using standard Affymetrix protocols, ExpressArt by AmpTec, and NuGEN’s Ovation V2, WT Pico, and WT FFPE kits. Preliminary results indicate that data generated from RNA with a RIN of 9, 6, and 4 performed well on Exon 1.0 ST arrays, Gene 1.0 ST arrays, and 3’ U133a 2.0 arrays using the ExpressArt, NuGEN WT Pico, and FFPE reagents. For RNA with a RIN value of 2, preliminary data suggests that the NuGEN WT Pico and FFPE provide the highest correlation to data generated by higher quality RNA followed by data generated by the ExpressArt reagents. RT-qPCR was also performed and revealed a similar trend to the microarray data. This research was funded through the Vermont Genetics Network at the University of Vermont (UVM) through Grant Number P20 RR16462 from the INBRE Program of the NIH National Center for Research Resources. We Gratefully acknowlegde John Burke at X-Ray Biotique Systems for outstanding software support. RNA Source Ambion First Choice™ Human Brain reference RNA was degraded to four levels determined by the Agilent Bioanalyzer 2100 using open tube method. Abstrac t RIN 10 RIN 6 RIN 4 RIN 2 Target Preparation Method 3’ Affymetrix U133a 2.0 Arrays Standard Affymetrix protocol with ENZO IVT NuGEN Ovation V2 NuGEN Pico NuGEN FFPE AmpTec Micro with ENZO IVT Affymetrix Exon 1.0 ST and Gene 1.0 ST Arrays Standard Affymetrix Protocol NuGEN Pico NuGen FFPE AmpTec Micro with Affy labeling Data Analysis Quality control data analysis was performed using GCOS, Expression Console 1.1, and X-RAY version 3.99385 Method Array Input(ng) rRNA reduction Std 3’ Affy- ENZO IVT 3’ U133a2.0 1000 NO AmpTec-RT - ENZO IVT 3’ U133a2.0 475 NO NuGEN Ovation 3’ U133a2.0 80 NO NuGEN Pico 3’ U133a2.0 50 NO NuGEN FFPE 3’ U133a2.0 50 NO Method Array Input(ng) rRNA reduction NuGEN Pico Gene/Exon 50 NO NuGEN FFPE Gene/Exon 50 NO Affymetrix WT Gene/Exon 100 NO AmpTec-RT-Affy-Label Gene/Exon 500 NO Std 1ug Affy Protocol Not tested- rReduction CV to high 3’ Array data: U133a 2.0 Images Std Affy (cRNA) NuGEN Ovation cDNA) AmpTec TR Micro (cRNA) NuGEN FFPE (cDNA) NuGEN Pico (cDNA) Affymetrix 100ng Amptec TR NuGEN FFPE NuGEN Pico RIN 10 RIN 6 RIN 4 RIN 2 RIN 10 RIN 6 RIN 4 RIN 2 Human Gene Array 1.0 ST Images 100ng Affy Prep AmpTec TR NuGEN FFPE NuGEN Pico RIN 10 RIN 6 RIN 4 RIN 2 Human Exon Array 1.0 ST Images #120 % present and Background Percent present indicates the number of genes detected above background. Low background is desirable. These data indicate good results for all samples except AmpTec RIN 2. 3’-5’ ratios and Scaling Factor High 3-5’ ratios indicate degradation of input RNA. High Scaling Factor indicates a dim chip or reduced signal. These results show that Oligo[dT] priming is negatively impacted by degradation indicated by high 3-5’ ratios for RIN 6, 4, 2 RNA. Moderate scaling factors increases are noted for NuGEN RIN 6-2 RNAs. Data for AmpTec TR reagents were very good. 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 A ffy-1-EX A ffy-2-EX A ffy-3-EX A ffy-4-EX A m ptec-1-EX A m ptec-2-EX A m ptec-3-EX A m ptec-4-EX N uG EN -FFPE-1-EX N uG EN -FFPE-2-EX N uGEN -FFPE-3-EX N uG EN -FFPE-4-EX N uG EN -Pico-1-EX N uG EN -Pico-2-EX N uG EN -Pico-3-EX N uG EN -Pico-4-EX pos_vs_neg_auc m ad_residual_m ean rle_m ean 0 50 100 150 200 250 300 350 400 450 500 A ffy-1-E X A ffy-2-E X A ffy-3-E X A ffy-4-E X A m ptec-1-E X A m ptec-2-E X A m ptec-3-E X A m ptec-4-E X N uG E N -FFPE -1-E X N uG E N -FFPE -2-E X N uG E N -FFPE -3-E X N uG E N -FFPE -4-E X N uGE N -Pico-1-E X N uGE N -Pico-2-EX N uGEN -Pico-3-EX N uG EN -Pico-4-EX pm _m ean bgrd_m ean Area Under the Curve (AUC) Values, MAD residuals, and RLE Mean AUC is the statistical measure of the percent true positive signal above the negative signal. AUC values must be above 0.8 to be considered good. MAD residual means are values that indicate how a dataset deviates from the expected mean. High values are an indication of problematic data. RLE means values also indicates variability of data. High values indicate problematic data. Perfect Match and Background Means The values of perfect match probes verses background probes results in a direct correlation with signal to noise ratios. The greater the differential, the “cleaner” the signal. High background can negatively affect a true signal. Good results are indicated by AmpTec RIN 10-4, NuGEN FFPE, and NuGEN Pico. Area Under the Curve (AUC), MAD residuals, RLE Mean These data suggest that AmpTec RIN 10-4 and all NuGEN Pico and FFPE have consistently good values. Differential Gene Detection Number of differentially detected genes as a function of Target preparation method, degradation level, and Genechip type determined by X-ray software. Note that differential detection can be skewed by a low call rate. Percent Venn Diagram When analyzing degraded RNA using X- Ray software at the gene-level, the Exon 1.0 ST array had more genes detected than the Gene 1.0 ST Array. Correlation Plots Correlation Plots were generated using Affymetrix Expression Console software. These plots indicate similarity between samples. Red indicates higher similarity. 3’ Array –U133a 2.0 Gene 1.0 ST Array Exon 1.0 St Array Perfect Match and Background Means These data indicate very good signal to noise (PM mean-BKGD Mean) for AmpTec RIN 10 and all NuGEN preps. 0.00 10.00 20.00 30.00 40.00 50.00 60.00 70.00 80.00 1-A ffy 2-A ffy 3-A ffy 4-A ffy 1-NuG EN-O vatio 2-NuG EN-O vatio 3-NuG EN-O vatio 4-NuGEN-O vatio 1-A m ptec-E nzo 2-A m ptec-E nzo 3-A m ptec-E nzo 4-A m ptec-E nzo 1-N uG EN -FFPE 2-N uG E N -F FP E 3-N uG EN -FFPE 4-N uG EN -FFPE 1-NuG EN -Pico 2-N uG EN-Pico 3-NuG EN-Pico 4-NuG EN-Pico %P BG Avg Oligo[dT] Random Priming 0 10 20 30 40 50 60 70 80 1-A ffy 1-NuG EN-Ovation 1-A m ptec-Enzo 1-N uG EN -FFPE 1-N uG EN -Pico 2-A ffy 2-NuG EN-Ovation 2-Am ptec-Enzo 2-N uG EN -FFPE 2-N uG EN-Pico 3-A ffy 3-N uGEN -O vation 3-Am ptec-Enzo 3-N uG EN -FFPE 3-N uG EN -Pico 4-A ffy 4-NuGEN -O vation 4-Am ptec-Enzo 4-N uG EN -FFPE 4-N uG EN -Pico G A P 3-5-ratio SF B -actin 3-5-ratio good arrays have high pm _m ean and low bgrd_m ean 1:R IN =10 2:R IN =6 3:R IN =3 4:R IN =1 0 100 200 300 400 500 600 700 1-A m ptec-G A .rm a 2-A m ptec-G A .rm a 3-A m ptec-G A .rm a 4-A m ptec-G A .rm a 1-A ffy-G A .rm a 2-A ffy-G A .rm a 3-A ffy-G A .rm a 4-A ffy-G A .rm a 1-N uG E N -FFP E -G A .rm a 2-N uG E N -FFP E -G A .rm a 3-N uG E N -FFP E -G A .rm a 4-N uG E N -FFP E -G A .rm a 1-NuGEN-Pico-GA.rma 2-NuGEN-Pico-GA.rma 3-NuGEN-Pico-GA.rma 4-NuGEN-Pico-GA.rma Preliminary QC results are no substitute for transcript-level bioinformatics. Complete analysis of these data are in progress but not yet available. We have demonstrated that microarray analysis of degraded RNA is possible and usable and Exon Arrays appear to be most forgiving, but more costly. - No one statistical value demonstrates data quality for Exon, Gene, or 3’ Arrays - Standard Oligo[dT] target prep methods readily reveal elevated 3’- 5’ ratios for B-actin and GAPDH for degraded RNA. B-actin is more sensitive to degradation than GAPDH. - AmpTec 3’ to 5’ ratios remain low regardless of RNA degradation and may not be an appropiate indicator of data quality and must be considered with Background Ave and % present. - All methods including Affy, AmpTec, NuGEN Pico and FFPE generated acceptable Exon Data for RIN values 10, 6 ,4 based on AUC data, Residual Mean, and RLE values. However, when the exact same hybridization mixes were applied to Gene arrays, background means were elevated for the Affy-100ng method. - The Affy 100ng and AmpTec preps demonstrated much higher Background Means compared to other methods - Good results were obtained for with RIN 2 RNA using NuGEN Pico and FFPE methods when analyzed by RMA Gene levels for Exon Arrays .

description

Comparison of Microarray Data Generated from Degraded RNA using Five Different Target Synthesis Methods and Commercial Microarrays Scott Tighe and Tim Hunter Microarray Core Facility, Vermont Genetics Network at the University of Vermont, Burlington, Vermont, USA. - PowerPoint PPT Presentation

Transcript of Acknowledgements

Page 1: Acknowledgements

Comparison of Microarray Data Generated from Degraded RNA using Five Different Target Synthesis Methods and Commercial Microarrays

Scott Tighe and Tim Hunter Microarray Core Facility, Vermont Genetics Network at the University of Vermont, Burlington, Vermont, USA

Acknowledgements

Methods

Results

Conclusions

As the need to collect gene expression data on degraded RNA continues, an evolution of new reagents and arrays now on the market may improve these data sets. The focus of this study is to evaluate the effectiveness of five different target amplification strategies with three types of Affymetrix GeneChips employing RNA at four different levels of degradation. For this study, reference RNA was non-chemically degraded to final RIN values of 10, 6, 4, and 2 as determined by the Agilent Bioanalyzer. Each RNA was amplified using standard Affymetrix protocols, ExpressArt by AmpTec, and NuGEN’s Ovation V2, WT Pico, and WT FFPE kits. Preliminary results indicate that data generated from RNA with a RIN of 9, 6, and 4 performed well on Exon 1.0 ST arrays, Gene 1.0 ST arrays, and 3’ U133a 2.0 arrays using the ExpressArt, NuGEN WT Pico, and FFPE reagents. For RNA with a RIN value of 2, preliminary data suggests that the NuGEN WT Pico and FFPE provide the highest correlation to data generated by higher quality RNA followed by data generated by the ExpressArt reagents. RT-qPCR was also performed and revealed a similar trend to the microarray data.

This research was funded through the Vermont Genetics Network at the University of Vermont (UVM) through Grant Number P20 RR16462 from the INBRE Program of the NIH National Center for Research Resources. We Gratefully acknowlegde John Burke at X-Ray Biotique Systems for outstanding software support.

RNA Source

Ambion First Choice™ Human Brain reference RNA was degraded to four levels determined by the Agilent Bioanalyzer 2100 using open tube method.

Abstract

RIN 10 RIN 6 RIN 4 RIN 2

Target Preparation Method

3’ Affymetrix U133a 2.0 Arrays

Standard Affymetrix protocol with ENZO IVTNuGEN Ovation V2NuGEN PicoNuGEN FFPEAmpTec Micro with ENZO IVT

Affymetrix Exon 1.0 ST and Gene 1.0 ST Arrays

Standard Affymetrix ProtocolNuGEN PicoNuGen FFPEAmpTec Micro with Affy labeling

Data Analysis

Quality control data analysis was performed using GCOS, Expression Console 1.1, and X-RAY version 3.99385

Method Array Input(ng) rRNA reductionStd 3’ Affy- ENZO IVT 3’ U133a2.0 1000 NOAmpTec-RT - ENZO IVT 3’ U133a2.0 475 NONuGEN Ovation 3’ U133a2.0 80 NONuGEN Pico 3’ U133a2.0 50 NONuGEN FFPE 3’ U133a2.0 50 NO

Method Array Input(ng) rRNA reductionNuGEN Pico Gene/Exon 50 NONuGEN FFPE Gene/Exon 50 NOAffymetrix WT Gene/Exon 100 NOAmpTec-RT-Affy-Label Gene/Exon 500 NO

Std 1ug Affy Protocol Not tested- rReduction CV to high

3’ Array data: U133a 2.0 Images

Std Affy (cRNA)

NuGEN Ovation cDNA)

AmpTec TR Micro(cRNA) NuGEN FFPE (cDNA)

NuGEN Pico (cDNA)

Affymetrix 100ng

Amptec TR

NuGEN FFPE

NuGEN Pico

RIN 10 RIN 6 RIN 4 RIN 2

RIN 10 RIN 6 RIN 4 RIN 2

Human Gene Array 1.0 ST Images

100ng Affy Prep

AmpTec TR

NuGEN FFPE

NuGEN Pico

RIN 10 RIN 6 RIN 4 RIN 2

Human Exon Array 1.0 ST Images

#120

% present and BackgroundPercent present indicates the number of genes detected above background. Low background is desirable. These data indicate good results for all samples except AmpTec RIN 2.

3’-5’ ratios and Scaling Factor High 3-5’ ratios indicate degradation of input RNA. High Scaling Factor indicates a dim chip or reduced signal. These results show that Oligo[dT] priming is negatively impacted by degradation indicated by high 3-5’ ratios for RIN 6, 4, 2 RNA. Moderate scaling factors increases are noted for NuGEN RIN 6-2 RNAs. Data for AmpTec TR reagents were very good.

0.10.20.30.40.50.60.70.80.9

11.11.2

Affy-

1-EX

Affy-

2-EX

Affy-

3-EX

Affy-

4-EX

Ampt

ec-1

-EX

Ampt

ec-2

-EX

Ampt

ec-3

-EX

Ampt

ec-4

-EX

NuGE

N-FF

PE-1

-EX

NuGE

N-FF

PE-2

-EX

NuGE

N-FF

PE-3

-EX

NuGE

N-FF

PE-4

-EX

NuGE

N-Pic

o-1-

EX

NuGE

N-Pic

o-2-

EX

NuGE

N-Pic

o-3-

EX

NuGE

N-Pic

o-4-

EX

pos_vs_neg_aucmad_residual_meanrle_mean

050

100150200250300350400450500

Affy-

1-EX

Affy-

2-EX

Affy-

3-EX

Affy-

4-EX

Ampt

ec-1-

EX

Ampt

ec-2

-EX

Ampt

ec-3

-EX

Ampt

ec-4

-EX

NuGE

N-FF

PE-1-

EX

NuGE

N-FF

PE-2

-EX

NuGE

N-FF

PE-3

-EX

NuGE

N-FF

PE-4

-EX

NuGE

N-Pic

o-1-E

X

NuGE

N-Pic

o-2-

EX

NuGE

N-Pic

o-3-

EX

NuGE

N-Pic

o-4-

EX

pm_mean

bgrd_mean

Area Under the Curve (AUC) Values, MAD residuals, and RLE MeanAUC is the statistical measure of the percent true positive signal above the negative signal. AUC values must be above 0.8 to be considered good. MAD residual means are values that indicate how a dataset deviates from the expected mean. High values are an indication of problematic data. RLE means values also indicates variability of data. High values indicate problematic data.

Perfect Match and Background MeansThe values of perfect match probes verses background probes results in a direct correlation with signal to noise ratios. The greater the differential, the “cleaner” the signal. High background can negatively affect a true signal. Good results are indicated by AmpTec RIN 10-4, NuGEN FFPE, and NuGEN Pico.

Area Under the Curve (AUC), MAD residuals, RLE MeanThese data suggest that AmpTec RIN 10-4 and all NuGEN Pico and FFPE have

consistently good values.

Differential Gene DetectionNumber of differentially detected genes as a function of Target preparation method, degradation level, and Genechip type determined by X-ray software. Note that differential detection can be skewed by a low call rate.

Percent Venn Diagram

When analyzing degraded RNA using X-Ray software at the gene-level, the Exon 1.0 ST array had more genes detected than the Gene 1.0 ST Array.

Correlation Plots Correlation Plots were generated

using Affymetrix Expression Console software. These plots

indicate similarity between samples. Red indicates higher

similarity.

3’ Array –U133a 2.0 Gene 1.0 ST Array Exon 1.0 St Array

Perfect Match and Background Means These data indicate very good signal to noise (PM mean-BKGD Mean)

for AmpTec RIN 10 and all NuGEN preps.

0.00

10.00

20.00

30.00

40.00

50.00

60.00

70.00

80.00

1-Affy

2-Affy

3-Affy

4-Affy

1-NuG

EN-O

vation

2-NuG

EN-O

vation

3-NuG

EN-O

vation

4-NuG

EN-O

vation

1-Amptec-Enzo

2-Amptec-Enzo

3-Amptec-Enzo

4-Amptec-E

nzo

1-NuG

EN-F

FPE

2-NuG

EN-FFPE

3-NuG

EN-FFPE

4-NuG

EN-F

FPE

1-NuG

EN-Pico

2-NuG

EN-Pico

3-NuG

EN-Pico

4-NuG

EN-P

ico

%PBG Avg

Oligo[dT] Random Priming

0

10

20

30

40

50

60

70

80

1-A

ffy

1-N

uGE

N-O

vatio

n

1-A

mpt

ec-E

nzo

1-N

uGE

N-F

FPE

1-N

uGE

N-P

ico

2-A

ffy

2-N

uGE

N-O

vatio

n

2-A

mpt

ec-E

nzo

2-N

uGE

N-F

FPE

2-N

uGE

N-P

ico

3-A

ffy

3-N

uGE

N-O

vatio

n

3-A

mpt

ec-E

nzo

3-N

uGE

N-F

FPE

3-N

uGE

N-P

ico

4-A

ffy

4-N

uGE

N-O

vatio

n

4-A

mpt

ec-E

nzo

4-N

uGE

N-F

FPE

4-N

uGE

N-P

ico

GAP 3-5-ratioSFB-actin 3-5-ratio

QC Metrics Gene Array 1.0 ST

good arrays have high pm_mean and low bgrd_mean1:RIN=10 2: RIN=6 3:RIN=3 4:RIN=1

0

100

200

300

400

500

600

700

1-A

mpt

ec-G

A.rm

a

2-A

mpt

ec-G

A.rm

a

3-A

mpt

ec-G

A.rm

a

4-A

mpt

ec-G

A.rm

a

1-A

ffy-G

A.rm

a

2-A

ffy-G

A.rm

a

3-A

ffy-G

A.rm

a

4-A

ffy-G

A.rm

a

1-N

uGE

N-F

FPE

-GA

.rma

2-N

uGE

N-F

FPE

-GA

.rma

3-N

uGE

N-F

FPE

-GA

.rma

4-N

uGE

N-F

FPE

-GA

.rma

1-N

uGE

N-P

ico-

GA

.rma

2-N

uGE

N-P

ico-

GA

.rma

3-N

uGE

N-P

ico-

GA

.rma

4-N

uGE

N-P

ico-

GA

.rma

1-N

uGE

N-P

icoZ

-GA

.rma

2-N

uGE

N-P

icoZ

-GA

.rma

3-N

uGE

N-P

icoZ

-GA

.rma

4-N

uGE

N-P

icoZ

-GA

.rma

Preliminary QC results are no substitute for transcript-level bioinformatics. Complete analysis of these data are in progress but not yet available. We have demonstrated that microarray analysis of degraded RNA is possible and usable and Exon Arrays appear to be most forgiving, but more costly.

- No one statistical value demonstrates data quality for Exon, Gene, or 3’ Arrays

- Standard Oligo[dT] target prep methods readily reveal elevated 3’- 5’ ratios for B-actin and GAPDH for degraded RNA. B-actin is more sensitive to degradation than GAPDH.

- AmpTec 3’ to 5’ ratios remain low regardless of RNA degradation and may not be an appropiate indicator of data quality and must be considered with Background Ave and % present.

- All methods including Affy, AmpTec, NuGEN Pico and FFPE generated acceptable Exon Data for RIN values 10, 6 ,4 based on AUC data, Residual Mean, and RLE values. However, when the exact same hybridization mixes were applied to Gene arrays, background means were elevated for the Affy-100ng method.

- The Affy 100ng and AmpTec preps demonstrated much higher Background Means compared to other methods

- Good results were obtained for with RIN 2 RNA using NuGEN Pico and FFPE methods when analyzed by RMA Gene levels for Exon Arrays .