A Constrained Instrumental Variable Approach,

A Constrained Instrumental Variable Approach, and its Application to Mendelian Randomization with Pleiotropy

Lai J iang (Lady Davis Ins titute & McGill), Karim Oualkacha (UQAM),

Celia Greenwood (Lady Davis Ins titute & McGill)

Montreal Caus al Inference Works hop, J uly 2016

Outline

1. The problem

2. Solutions and es timators

3. Simulations – s imple example

4. Simulations - an example borrowed from reality

5. Finally …

2

3

The Problem

Instrumental Variables

We are interes ted in the causal effect of X (phenotype) on Y (dis ease)

U: unobserved variable that may confound the as socia tion between X and Y

G: a variable (genotype) we want to use as an ins trumental variable

DAG: U

G X Y

4

Core Conditions for Instrumental Variable analysis

U

G X Y 1) G independent of U 2) G not marginally independent of X 3) G and Y are independent conditional on X and U

5

Potential violations of assumptions in MR

Linkage Disequilibrium (Condition 3 or Condition 1)

Genetic Heterogeneity (Weak IV)

Population Stra tifica tion (Condition 3)

6

G2 U

G1 X Y

G1

G2

G3

X

U

Y

G1

P

X Y

U

Pleiotropy A SNP (G) is associated with another intermediate phenotype (Z, other than X) which also has an effect on the disease Y.

Z

G X Y

U

G X Y

Z U (a) (b)

7

• Condition (3) – Y and G independent given X and U – is violated in (a). • Condition (1) – G and U independent – is violated in (b).

(Original) Problem

9

Estimators

Causal effect estimation: IV assumptions satisfied

One SNP

G X Y

U

δ β

ρ η

In the presence of pleiotropy

Z

G X

U

δ

ω

β

ξ η

ρ

Y

In the presence of pleiotropy

Y

13

Z

G X

U

Potential solutions when there is pleiotropy

1. Naïve Method (TSLS us ing a ll SNPs ) {Biased} [Baum 2003] 2. Adjus t for G for Z, us e Gres as IV {Unbiased} 3. Limited Information Maximum Likelihood. (LIML) {Biased} [Burges s 2013] 4. Stepwise Selection Methods -- s elect subset of G {Incons is tent} [Bai 2008] 5. Inverse Probability Weights [Cole 2008]. 6. Cons tra ined Ins trumental Variables

14

1. Naïve method in practice (#1, #2)

1. Select SNPs s trongly as socia ted with phenotype of interes t X

2. Remove SNPs s trongly as socia ted with pleiotropic phenotypes Z

3. Remove weak SNPs to avoid overfitting & bias

4. If the resulting ins trumental variable G s till has a s trong pleiotropic effect,

cons ider regres s ing G|Z and use Gres ins tead of G (ie. Method #2)

5. TSLS model fitting us ing G (or Gres) -> X -> Y

3. LIML

Limited Information Maximum Likelihood uses ins truments to rectify bias in a regres s ion Y ~ X, a ris ing when X is correla ted with res iduals

• i.e. Pleiotropy leads to correla tion between X and Z

LIML takes into account the covariances of the errors

More s table to weak ins trumental variable bias

• Z is not used

16

4. Stepwise Selection Methods.

Step by s tep methods (such as (Bai, 2008)):

G: Construct a new IV based on a subset S

Select new features into S With respect to X | G

Delete features in S with respect to Z | G

17

Implementation details a re crucial here

1. Specific criteria (s ignificance thres holds ) for s election and deletion?

2. How to s ummarize the information in the final s et S? PCA? PLS? Etc…

3. Stop s trategy?

4. Starting S (previous publis hed s ources )?

5. Backward? Forward? Bidirectional?

5. Inverse Probability Weight Adjusted Regression (IPWAR)

18

19

Constrained Instrumental Variable Method

Constrained Instrumental Variable

20

Y

Z

G X

U


-

21


• When 𝑝 < 𝑛 there is an exact solution

• In fact there are 𝑝 − 𝑘 orthogonal solutions

• 𝐺𝑟𝑟𝑟 = 𝐼𝑛 − 𝑃𝑍 𝐺, the projection of 𝐺 to subspace orthogonal to columns of 𝑍

• We search for a direction in Span(𝐺), 𝐺∗, mos t informative for 𝑋, orthogonal to 𝑍

• 𝐺∗ belongs to Span(𝐺𝑟𝑟𝑟)

• 𝑆𝑝𝑆𝑛 𝐺𝑟𝑟𝑟 = 𝑆𝑝𝑆𝑛 𝐺∗ + 𝑆𝑝𝑆𝑛(𝑐𝑐𝑐𝑝𝑐𝑐𝑐𝑐𝑛𝑐 𝑐𝑜 𝐺∗𝑖𝑛 𝐺𝑟𝑟𝑟 𝑠𝑠𝑠𝑠𝑝𝑆𝑐𝑐)

• Complement is not informative for X

• Hence, CIV has identical performance to s tra ightforward 𝐺𝑟𝑟𝑟


23

What about when 𝑝 > 𝑛?

Penalized Constrained Instrumental Variable

L0 penalty

LASSO penalty

Non-convex Penalties

…...

Penalization

24

Which penalty?

• LASSO (L1) penalty does not a lways yield a sparse solution automatically for this problem.

• Neither do non-convex penalties (e.g. SCAD)

25

L0 penalty

• L0 penalty: 𝑠 0 = ∑ 𝑠𝑗0

𝑗 < 𝜆 directly enforces spars ity

• However, greedy s earch for optimal 𝑠, 𝜆 is 𝑛𝑝-hard in computational complexity

• Alternatively, cons ider smoothed approximate L0 penalties as 𝜎 → 0:

𝑜𝜎 𝑥 = 𝑐𝑥𝑝 −𝑥2

2𝜎2 → �1, 𝑥 = 00, 𝑥 ≠ 0

So 𝑠 0 ≈ 𝑝 − ∑ 𝑜𝜎(𝑠𝑗)𝑗

Implementation

Two methods :

1. If 𝐩 < 𝐧, linear a lgebra solution

2. If 𝐩 > 𝐧, we propose a numerical solution for penalized CIV Start from an initial gues s 𝑠, and an initial penalty 𝜎 = max |𝑠| Iteratively decreas e 𝜎 → 𝜎𝑚𝑚𝑛 a long a chos en s equence Cros s validation to choos e 𝜆 for each 𝜎 bas ed on MSE of (𝑌|𝐺,𝑠)

3. Boots trap for s tandard errors

27

Simulations

28

• Naive TSLS

• Adjusting each G for Z, then TSLS

• LIML

• Stepwise Selection Methods (forward, backward)

• Inverse Probability Weight Adjusted Regression (IPWAR)

• CCA/sparseCCA

• CIV (Constrained Instrumental Variable)

• Smoothed CIV

29

Methods compared

Bias as a function of 𝜉 • N=200 • 20 SNPs • 5 SNPs as s ociated with X. • Among thes e 5: • 2 SNPs als o as s ociated with Z • 𝛽: Effect 𝑋 → 𝑌

Conditional TSLS/ G res iduals CCA, CCA_LASSO LIML Backward/ Forward IPWAR CIV/Smoothed CIV

Bias and standard error 𝝃=1 𝝃=5

Estimate Mean square error

Estimate Mean square error

G residual 1.09 0.01 1.08 0.01

IPWAR 2.00 1.21 15.30 2.20

CIV 1.09 0.01 1.09 0.02

CIV smooth 1.01 0.01 1.05 0.01

SNPs selected over simulations

Method SNP1 SNP2 SNP3 SNP4 SNP5 Others

CIV 0.76 0.76 1.00 1.0 1.0 ~0.2

Smoothed CIV

0.00 0.00 0.29 0.62 0.29 0.0

Note: Applied hard threshold: 0.1*max(estimate)

SNPs selected CIV Smoothed CIV

34

Contrived Example

LDL-HDL-CHD

HDL/LDL Simulation

Bias Bootstrap

variance Lower_CI Upper_CI

Naive -0.090 0.025 -0.145 -0.034

G residuals 0.004 0.027 -0.050 0.059

LIML -0.112 0.027 -0.169 -0.066

IPWAR 0.066 0.025 0.011 0.121

Backward -0.008 0.042 -0.094 0.079

Forward 0.006 0.055 -0.097 0.109

CCA_LASSO -0.090 0.030 -0.147 -0.033

CCA -0.090 0.028 -0.147 -0.032

CIV 0.002 0.031 -0.052 0.057

TRUE -0.0002 0.031 -0.067 0.066

HDL/LDL Simulation

37

Finally…

Comments

• The smoothing CIV selects SNPs as socia ted with X and not Z in a non-greedy s earch for the optimal subset

• G_res , CCA etc s elect SNPs differently and will keep overlapping SNPs

• For 𝑝 > 𝑛 or 𝑝 large • CIV will work • G_res iduals will not work or will become uns table

• 𝑝 > 𝑛 is a very unlikely Mendelian randomization s cenario

Questions

• Other applications of the smoothed L0 penalty? • We create one ins trument

• What about creating s everal ins truments through the s ame kinds of projections (p-k)?

• Links to • Targeted learning • Multiple invalid ins truments (averages of weak ins truments )

(New) problem

• Computationally efficient variable s election subject to linear cons tra ints

Thank you! Acknowledgements: CIHR Ludmer Centre for Neuroinformatics and Mental Health Brent Richards & his group

41

A Constrained Instrumental Variable Approach,

Documents

Transcript of A Constrained Instrumental Variable Approach,