Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse...

19
Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University of California, Los Angeles BIOCOMP’07 Conference, 6/26/07

Transcript of Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse...

Page 1: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Differential Network Analysis inMouse Expression Data

Tova FullerSteve Horvath

Department of Human GeneticsUniversity of California, Los AngelesBIOCOMP’07 Conference, 6/26/07

Page 2: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Outline

• Introduction:– Single versus differential network analysis

• Differential Network construction• Results• Functional Analysis• Conclusion

Page 3: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Goals of Single Network Analysis

• Identifying genetic pathways (modules)• Finding key drivers (hub genes)• Modeling the relationships between:

– Transcriptome– Clinical traits / Phenotypes– Genetic marker data

Page 4: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Validation set 1 Validation set 2

Single Network WGCNA

1 gene co-expression networkMultiple data sets may be used for validation

Page 5: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Goals of Differential NetworkAnalysis

• Uncover differences in modules andconnectivity in different data sets– Ex: Human versus chimpanzee brains (Oldham

et al. 2006)

• Differing toplogy in multiple networks revealsgenes/pathways that are wired differently indifferent sample populations

Oldham MC, Horvath S, Geschwind DH (2006) Conservation and evolution of gene coexpression networks in humanand chimpanzee brains. Proc Natl Acad Sci U S A 103, 17973-17978.

Page 6: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

NETWORK 1

Differential Network WGCNA

2+ gene co-expression networksIdentify genes and pathways that are:

1. Differentially expressed2. Differentially wired

NETWORK 2

Page 7: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

• Single network analysis female BxH mice revealed a weight-related module (Ghazalpour et al. 2006)

• Samples: Constructed networks from mice from extrema ofweight spectrum:– Network 1: 30 leanest mice– Network 2: 30 heaviest mice

• Transcripts: Used 3421 most connected and varying transcripts

BxH Mouse Data

Ghazalpour A, Doss S, Zhang B, Wang S, Plaisier C, Castellanos R, Brozell A, Schadt EE, Drake TA, Lusis AJ, Horvath S (2006) Integrating genetic and network analysis to characterize genes related to mouse weight. PLoS genetics 2, e130

NETWORK 1 NETWORK 2

135 FEMALES

Page 8: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Methods

Compute Comparison MetricsCompute Comparison Metrics•• Difference in expression:Difference in expression: t-test t-test statisticstatistic•• Compare difference in connectivity:Compare difference in connectivity: DiffKDiffK

Identify significantly different genes/pathwaysIdentify significantly different genes/pathwaysPermutation testPermutation test

Functional analysis of significant genes/pathwaysFunctional analysis of significant genes/pathwaysDAVID databaseDAVID databasePrimary literaturePrimary literature

Page 9: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Computing Comparison Metrics

DIFFERENTIAL EXPRESSION

t-test statistic computed for each gene, t(i)

DIFFERENTIAL CONNECTIVITY

K1(i) = k1(i) K2(i) = k2(i) max(k1) max(k2)

DiffK(i): difference in normalizedconnectivities for each gene:

DiffK(i) = K1(i) – K2(i)

Page 10: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Sector Plot

We visualize the comparison metrics via a sectorplot:

• x-axis: DiffK

• y-axis: t statistics

We establish sector boundaries to identifyregions of differentially expressed and/orconnected regions

• |t| = 1.96 corresponding to p = 0.05

• |DiffK| = 0.4

Page 11: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

no.perms: number ofpermutations

For each sector j, wecompare the number ofgenes in unpermuted andpermuted sectors (nobs andnperm)

Permutation test:Identifying significant sectors

!

p j =# times (nobs

j" nperm

j) +1

no.perms+1

NETWORK 1 NETWORK 2

PERMUTE

Page 12: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Sector Plot Results

0.010.001

0.001 0.001X

X X

X

Page 13: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Functional AnalysisSECTOR 3

High t statisticHigh DiffK

Yellow module in leanGrey in obese

(63 genes)

Genes in these sectors have higher connectivity in lean than obese mice:~ pathways potentially disregulated in obesity ~

SECTOR 5Low t statisticHigh Diff K(28 genes)

Page 14: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Sector 3:Functional Analysis Results

DAVID Database• “Extracellular”:

– extracellular region (38% of genes p = 1.8 x 10-4)– extracellular space (34% of genes p = 5.7 x 10-4)

• signaling (36% of genes p = 5.4 x 10-4)• cell adhesion (16% of genes p = 7.7 x 10-4)• glycoproteins (34% of genes p = 1.6 x 10-3)• 12 terms for epidermal growth factor or its related proteins

– EGF-like 1 (8.2% of genes p = 8.7 x 10-4),

– EGF-like 3 (6.6% of genes p = 1.6 x 10-3),– EGF-like 2 (6.6% of genes p = 6.0 x 10-3),– EGF (8.2% of genes p = 0.013)– EGF_CA (6.6% of genes p = 0.015)

Page 15: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Sector 3:Functional Analysis Results

Primary Literature

• Results supported by a study on EGF levelsin mice (Kurachi et al. 1993)– EGF found to be increased in obese mice– Obesity was reversed in these mice by:

• Administration of anti-EGF• Sialoadenectomy

Kurachi H, Adachi H, Ohtsuka S, Morishige K, Amemiya K, Keno Y, Shimomura I, Tokunaga K, Miyake A, Matsuzawa Y, et al.(1993) Involvement of epidermal growth factor in inducing obesity in ovariectomized mice. The American journal of physiology265, E323-331

Page 16: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Sector 5:Functional Analysis Results

DAVID Database

• Enzyme inhibitor activity (p = 2.9 x 10-3)*• Protease inhibitor activity (p = 6.0 x 10-3)• Endopeptidase inhibitor activity (p = 6.0 x 10-3)• Dephosphorylation (p = 0.012)• Protein amino acid dephosphorylation (p = 0.012)• Serine-type endopeptidase inhibitor activity (p =

0.042)

* p values shown are corrected using Bonferroni correction

Page 17: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Itih1 and Itih3• Enriched for all categories shown previously• Located near a QTL for hyperinsulinemia (Almind and Kahn 2004)• Itih3 identified as a gene candidate for obesity-related traits based on

differential expression in murine hypothalamus (Bischof and Wevrick2005)

Serpina3n and Serpina10• Enriched for enzyme inhibitor, protease inhibitor, and endopeptidase inhibitor• Serpina10, or Protein Z-dependent protease inhibitor (ZPI) has been found to

be associated with venous thrombosis (Van de Water et al. 2004)

Sector 5:Functional Analysis Results

Primary Literature

Almind K, Kahn CR (2004) Genetic determinants of energy expenditure and insulin resistance in diet-induced obesity in mice. Diabetes 53, 3274-3285Bischof JM, Wevrick R (2005) Genome-wide analysis of gene transcription in the hypothalamus. Physiological genomics 22, 191-196Van de Water N, Tan T, Ashton F, O'Grady A, Day T, Browett P, Ockelford P, Harper P (2004) Mutations within the protein Z-dependent protease inhibitor gene are associated with venous thromboembolic disease: a new form of thrombophilia. Bjh 127, 190-194

Page 18: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Conclusions

• Differential Network Analysis reveals pathways that areboth differentially regulated and connected in mouse obesity– Genes that are differentially connected may/may not be differentially

expressed

• Primary literature supports biological plausibility of thesepathways in weight related disorders– Sector 3 & EGF pathways: potential EGF causality in obesity– Sector 5 & serine protease pathways: potential link between obesity

and venous thrombosis

• These results help identify targets for validation withbiological experiments

Page 19: Differential Network Analysis in Mouse Expression Data · Differential Network Analysis in Mouse Expression Data Tova Fuller Steve Horvath Department of Human Genetics University

Acknowledgements

Guidance

HORVATH LABSteve HorvathJason AtenJun DongPeter LangfelderAi LiWen LinAnja PressonLin WangWei Zhao

An R tutorial may be found at:http://www.genetics.ucla.edu/labs/horvath/CoexpressionNetwork/DifferentialNetworkAnalysis

Collaboration

LUSIS LABJake LusisAnatole GhazalpourThomas Drake

Funding

UCLA Medical Scientist TrainingProgram (MD/PhD)