Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the...

35
Path Analysis Danielle Dick Boulder 2006
  • date post

    19-Dec-2015
  • Category

    Documents

  • view

    215
  • download

    1

Transcript of Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the...

Page 1: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Path Analysis

Danielle Dick

Boulder 2006

Page 2: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Path Analysis

• Allows us to represent linear models for the relationships between variables in diagrammatic form

• Makes it easy to derive expectation for the variances and covariances of variables in terms of the parameters proposed by the model

• Is easily translated into matrix form for use in programs such as Mx

Page 3: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Example

q

B

on

A

FD

lk

11E

m

1

C

p

Page 4: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Conventions of Path Analysis• Squares or rectangles denote observed variables.

• Circles or ellipses denote latent (unmeasured) variables.

• (Triangle denote means, used when modeling raw data)

• Upper-case letters are used to denote variables.

• Lower-case letters (or numeric values) are used to denote covariances or path coefficients.

Page 5: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Conventions of Path Analysis

• Single-headed arrows or paths (–>) are used to represent causal relationships between variables under a particular model - where the variable at the tail is hypothesized to have a direct influence on the variable at the head.

A –> B

• Double-headed arrows (<–>) are used to represent a covariance between two variables, which may arise through common causes not represented in the model. They may also be used to represent the variance of a variable.

A <–> B

Page 6: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Conventions of Path Analysis

• Double-headed arrows may not be used for any variable which has one or more single-headed arrows pointing to it - these variables are called endogenous variables. Other variables are exogenous variables.

• Single-headed arrows may be drawn from exogenous to endogenous variables or from endogenous variables to other endogenous variables.

Page 7: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Conventions of Path Analysis

• Omission of a two-headed arrow between two exogenous variables implies the assumption that the covariance of those variables is zero (e.g., no genotype-environment correlation).

• Omission of a direct path from an exogenous (or endogenous) variable to an endogenous variable implies that there is no direct causal effect of the former on the latter variable.

Page 8: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Tracing Rules of Path Analysis

• Trace backwards, change direction at a double-headed arrow, then trace forwards.

• This implies that we can never trace through double-headed arrows in the same chain.

• The expected covariance between two variables, or the expected variance of a variable, is computed by multiplying together all the coefficients in a chain, and then summing over all possible chains.

Page 9: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Example

q

B

on

A

FD

lk

11E

m

1

C

p

EXOGENOUSVARIABLES

ENDOGENOUSVARIABLES

Page 10: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Exercises

• Cov AB =

• Cov BC =

• Cov AC =

• Var A =

• Var B =

• Var C =

• Var E

Page 11: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Covariance between A and B

q

B

pn

A

FD

lk

11E

m

1

C

p

q

B

pn

A

FD

lk

11E

m

1

C

p

Cov AB = kl + mqn + mpl

q

B

pn

A

FD

lk

11E

m

1

C

p

Page 12: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Exercises

• Cov AB =

• Cov BC =

• Cov AC =

• Var A =

• Var B =

• Var C =

• Var E

Page 13: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Expectations

• Cov AB = kl + mqn + mpl

• Cov BC = no

• Cov AC = mqo

• Var A = k2 + m2 + 2 kpm

• Var B = l2 + n2

• Var C = o2

• Var E = 1

q

B

on

A

FD

lk

11E

m

1

C

p

Page 14: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Quantitative Genetic Theory

• Observed behavioral differences stem from two primary sources: genetic and environmental

Page 15: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Quantitative Genetic Theory

• Observed behavioral differences stem from two primary sources: genetic and environmental

eg

P

G E

PHENOTYPE

11

EXOGENOUSVARIABLES

ENDOGENOUSVARIABLES

Page 16: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Quantitative Genetic Theory

• There are two sources of genetic influences: Additive and Dominant

ed

P

D E

PHENOTYPE

11

a

A

1

Page 17: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Quantitative Genetic Theory

• There are two sources of environmental influences: Common (shared) and Unique (nonshared)

cd

P

D C

PHENOTYPE

11

a

A

1

e

E

1

Page 18: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

In the preceding diagram…

• A, D, C, E are exogenous variables

• A = Additive genetic influences

• D = Non-additive genetic influences (i.e., dominance)

• C = Shared environmental influences

• E = Nonshared environmental influences

• A, D, C, E have variances of 1

• Phenotype is an endogenous variable

• P = phenotype; the measured variable

• a, d, c, e are parameter estimates

Page 19: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Univariate Twin Path Model

A1 D1 C1 E1

P

1 1 1 1

a d c e

Page 20: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Univariate Twin Path Model

A1 D1 C1 E1

PTwin1

E2 C2 D2 A2

PTwin2

1 1 1 11 1 1 1

a d c e e c d a

Page 21: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Univariate Twin Path Model

A1 D1 C1 E1

PTwin1

E2 C2 D2 A2

PTwin2

MZ=1.0 DZ=0.5

MZ=1.0 DZ=0.25

MZ & DZ = 1.0

1 1 1 11 1 1 1

a d c e e c d a

Page 22: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Assumptions of this Model

• All effects are linear and additive (i.e., no genotype x environment or other multiplicative interactions)

• A, D, C, and E are mutually uncorrelated (i.e., there is no genotype-environment covariance/correlation)

• Path coefficients for Twin1 = Twin2

• There are no reciprocal sibling effects (i.e., there are no direct paths between P1 and P2

Page 23: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Tracing Rules of Path Analysis

• Trace backwards, change direction at a double-headed arrow, then trace forwards.

• This implies that we can never trace through double-headed arrows in the same chain.

• The expected covariance between two variables, or the expected variance of a variable, is computed by multiplying together all the coefficients in a chain, and then summing over all possible chains.

Page 24: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Calculating the Variance of P1

A1 D1 C1 E1

PTwin1

E2 C2 D2 A2

PTwin2

MZ=1.0 DZ=0.5

MZ=1.0 DZ=0.25

MZ & DZ = 1.0

1 1 1 11 1 1 1

a d c e e c d a

Page 25: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Calculating the Variance of P1

A1 D1 C1 E1

PTwin1

E2 C2 D2 A2

PTwin2

MZ=1.0 DZ=0.5

MZ=1.0 DZ=0.25

MZ & DZ = 1.0

1 1 1 11 1 1 1

a d c e e c d a

Page 26: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Calculating the Variance of P1

A1 D1 C1 E1

PTwin1

E2 C2 D2 A2

PTwin2

MZ=1.0 DZ=0.5

MZ=1.0 DZ=0.25

MZ & DZ = 1.0

1 1 1 11 1 1 1

a d c e e c d a

P=a(1)a

Page 27: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Calculating the Variance of P1

A1 D1 C1 E1

PTwin1

E2 C2 D2 A2

PTwin2

MZ=1.0 DZ=0.5

MZ=1.0 DZ=0.25

MZ & DZ = 1.0

1 1 1 11 1 1 1

a d c e e c d a

P=a(1)a + d(1)d + c(1)c + e(1)e

Page 28: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Calculating the Variance of P1

P1 P1A1 A1

a a1

P1 P1D1 D1

d d1

P1 P1C1 C1

c c1

P1 P1E1 E1

e e1

=

=

=

=

1a2

1d2

1c2

1e2

Var P1 = a2 + d2 + c2 + e2

Page 29: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Calculating the MZ Covariance

A1 D1 C1 E1

PTwin1

E2 C2 D2 A2

PTwin2

MZ=1.0 DZ=0.5

MZ=1.0 DZ=0.25

MZ & DZ = 1.0

1 1 1 11 1 1 1

a d c e e c d a

Page 30: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Calculating the MZ Covariance

A1 D1 C1 E1

PTwin1

E2 C2 D2 A2

PTwin2

MZ=1.0 DZ=0.5

MZ=1.0 DZ=0.25

MZ & DZ = 1.0

1 1 1 11 1 1 1

a d c e e c d a

Page 31: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Calculating the MZ Covariance

P1 P2A1 A2

a a1

P1 P2D1 D2

d d1

P1 P2C1 C2

c c1

P1 P2E1 E2

e e

=

=

=

=

1a2

1d2

1c2

CovMZ = a2 + d2 + c2

Page 32: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Calculating the DZ Covariance

A1 D1 C1 E1

PTwin1

E2 C2 D2 A2

PTwin2

MZ=1.0 DZ=0.5

MZ=1.0 DZ=0.25

MZ & DZ = 1.0

1 1 1 11 1 1 1

a d c e e c d a

Page 33: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Calculating the DZ covariance

CovDZ = ?

Page 34: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

Calculating the DZ covariance

P1 P2A1 A2

a a0.5

P1 P2D1 D2

d d0.25

P1 P2C1 C2

c c1

P1 P2E1 E2

e e

=

=

=

=

0.5a2

0.25d2

1c2

CovDZ = 0.5a2 + 0.25d2 + c2

Page 35: Path Analysis Danielle Dick Boulder 2006. Path Analysis Allows us to represent linear models for the relationships between variables in diagrammatic form.

VarTwin1 Cov12

Cov21 VarTwin2

a2 + d2 + c2 + e2 a2 +d2 + c2

a2 + d2 + c2 a2 + d2 + c2 + e2

Twin Variance/Covariance

a2 + d2 + c2 + e2 0.5a2 + 0.25d2 + c2

0.5a2 + 0.25d2 + c2 a2 + d2 + c2 + e2

MZ =

DZ =