Causal Analysis After Haavelmo - University of...

107
Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II Causal Analysis After Haavelmo James Heckman and Rodrigo Pinto Econ 312, Spring 2019 James Heckman and Rodrigo Pinto Causal Analysis

Transcript of Causal Analysis After Haavelmo - University of...

Page 1: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Causal Analysis After Haavelmo

James Heckman and Rodrigo Pinto

Econ 312, Spring 2019

James Heckman and Rodrigo Pinto Causal Analysis

Page 2: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Economics has forgotten its own past.(Judea Pearl, 2012)

James Heckman and Rodrigo Pinto Causal Analysis

Page 3: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Haavemo’s Contributions to Causality:

Two seminal papers (1943, 1944):

1 Formalized Yule’s credo: Correlation is not causation.(1895 paper on pauperism written when Yule was at UCL)

2 Laid the foundations for counterfactual policy analysis.

3 Distinguished fixing (causal operation) from conditioning(statistical operation).

4 Distinguished definition of causal parameters from theiridentification from data.

5 Formalized Marshall’s notion of ceteris paribus (1890).

Most Important

Causal effects are determined by the impact of hypotheticalmanipulations of an input on an output.

James Heckman and Rodrigo Pinto Causal Analysis

Page 4: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Frisch: “Causality is in the Mind”

“. . . we think of a cause as something imperative which exists in the

exterior world. In my opinion this is fundamentally wrong. If we strip

the word cause of its animistic mystery, and leave only the part that

science can accept, nothing is left except a certain way of

thinking. . . [T]he scientific . . . problem of causality is essentially a

problem regarding our way of thinking, not a problem regarding the

nature of the exterior world.” (Frisch 1930, p. 36, published 2011;

emphasis added)

James Heckman and Rodrigo Pinto Causal Analysis

Page 5: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Haavelmo’s Insights:

1 What are Causal Effects?• Not empirical descriptions of actual worlds,• But descriptions of hypothetical worlds.

2 How are they obtained?• Through Models—idealized thought experiments.• By varying—hypothetically—the inputs causing outcomes.

3 But what are models?• Frameworks defining causal relationships among variables.• Based on scientific knowledge.

James Heckman and Rodrigo Pinto Causal Analysis

Page 6: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

GoalIncorporate Haavelmo’s insights into a causal framework that:

• Complies with his ideas

and also

• Benefits from modern mathematical/statistical language.

James Heckman and Rodrigo Pinto Causal Analysis

Page 7: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Revisiting Haavemo’s Ideas on Causality

• Insight: express causality through a hypothetical modelassigning independent variation to inputs determiningoutcomes.

• Data: generated by an empirical model that shares somefeatures with the hypothetical model.

• Identification: relies on evaluating causal parameters definedin the hypothetical model using data generated by the empiricalmodel.

• New Tools: use the modern language of Directed AcyclicGraphs (DAGs).

• Comparison: how a causal framework inspired by Haavelmo’sideas relates to other approaches (Pearl, 2009).

James Heckman and Rodrigo Pinto Causal Analysis

Page 8: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Plan of the Talk

1 Revisit Haavelmo’s concept of causality using language of DAGs.

• Causal Model• Independence Relationships based on Bayesian Networks• “Fixing” Operator• Fixing: standard statistical tools do not apply

2 Present New Causal framework inspired by Haavelmo’s concept ofhypothetical variation of inputs.

• Hypothetical Model for Examining Causality• Benefits of a Hypothetical Model• Identification: connecting Hypothetical and Empirical Models.

3 Compare Haavelmo’s approach with Pearl’s Do-calculus.

• Haavelmo’s approach: thinks outside the box• Do-calculus: requires complex ad hoc tools to work within the

box

James Heckman and Rodrigo Pinto Causal Analysis

Page 9: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Plan of the Talk

4 Limitations of DAGs for econometric identification:

• Cannot identify the standard IV Model or the Generalized RoyModel.

• Can identify the “front door” model.

5 Review:

• Do-calculus of Pearl (2009) and a Framework Motivated byHaavelmo’s Analysis.

6 Simultaneous Equations:

• Hypothetical Models and Simultaneous Equations.

7 Conclusion:

• Examine Haavelmo’s fundamental contributions.• Causal framework inspired by Haavelmo’s ideas.• Beyond DAGs.

James Heckman and Rodrigo Pinto Causal Analysis

Page 10: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

1. Haavelmo’s Approach to Causality

James Heckman and Rodrigo Pinto Causal Analysis

Page 11: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Goals of a Causal Model

• We use Haavelmo’s insight, linking causality to independentvariation of variables in a hypothetical model.

• Build a causal framework that characterizes the process ofdefining, identifying, and estimating causal relationships.

Task Description Requirements1 Defining Causal Models A Scientific Theory

A Mathematical Framework2 Identifying Causal Parameters Mathematical Analysis

from Known Population Connect Hypothetical ModelDistribution Functions of Data with Data Generating Process

(Identification in the Population)3 Estimating Parameters from Statistical Analysis

Real Data Estimation and Testing Theory

Heckman (2008)James Heckman and Rodrigo Pinto Causal Analysis

Page 12: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Mathematical Frameworkof a Modern Causal Model

Causal Model: defined by 4 components:1 Random Variables that are observed and/or unobserved by

the analyst: T = {Y ,U ,X ,V }.2 Error Terms that are mutually independent: εY , εU , εX , εV .3 Structural Equations that are autonomous: fY , fU , fX , fV .

• By Autonomy we mean deterministic functions that are“invariant” to changes in their arguments (Frisch, 1938).

4 Causal Relationships that map the inputs causing eachvariable:Y = fY (X ,U , εY );X = fX (V , εX );U = fU(V , εU);V = fV (εV ).

Econometric approach explicitly models unobservables that driveoutcomes and produce selection problems.Distribution of unobservables is often the object of study.

James Heckman and Rodrigo Pinto Causal Analysis

Page 13: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Directed Acyclic Graph (DAG) Representation

Model: Y = fY (X ,U, εY );X = fX (V , εX );U = fU(V , εU);V = fV (εV ).

Causal Model Inside the Box

D Y

V

X

M

U

R D Y

V

X

M

U

R

R D Y

VX

R D Y

V

X Y

UV

X Y

UV

X~

Notation:

• Children: Variables directly caused by other variables:Ex: Ch(V ) = {U,X}, Ch(X ) = Ch(U) = {Y }.

• Descendants: Variables that directly or indirectly cause other variables:Ex: D(V ) = {U,X ,Y }, D(X ) = D(U) = {Y }.

• Parents: Variables that directly cause other variables:Ex: Pa(Y ) = {X ,U}, Pa(X ) = Pa(U) = {V }.

James Heckman and Rodrigo Pinto Causal Analysis

Page 14: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Properties of this Causal Framework

• Recursive Property: No variable is a descendant of itself.

Why is it useful?Autonomy + Independent Errors

+ Recursive Property⇒ Bayesian Network Tools Apply

• Bayesian Network: Translates causal links into independencerelations using Statistical/Graphical Tools.

• Statistical/Graphical Tools:

1 Local Markov Condition (LMC): A variable is independent ofits non-descendants conditioned on its parents;

2 Graphoid Axioms (GA): A set of independence and conditionalindependence relationships. Dawid (1979)

James Heckman and Rodrigo Pinto Causal Analysis

Page 15: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Local Markov Condition (LMC) (Kiiveri, 1984; Lauritzen, 1996)

If a model is acyclical, i.e., Y /∈ D(Y ) ∀ Y ∈ T , then any variable isindependent of its non-descendants, conditional on its parents:

LMC : Y ⊥⊥ V \ (D(Y ) ∪ Y )|Pa(Y ) ∀ Y ∈ V .

Graphoid Axioms (GA) (Dawid, 1979)

Symmetry: X ⊥⊥ Y |Z ⇒ Y ⊥⊥ X |Z .Decomposition: X ⊥⊥ (W ,Y )|Z ⇒ X ⊥⊥ Y |Z .

Weak Union: X ⊥⊥ (W ,Y )|Z ⇒ X ⊥⊥ Y |(W ,Z ).

Contraction: X ⊥⊥ W |(Y ,Z ) and X ⊥⊥ Y |Z⇒ X ⊥⊥ (W ,Y )|Z .

Intersection: X ⊥⊥ W |(Y ,Z ) and X ⊥⊥ Y |(W ,Z )

⇒ X ⊥⊥ (W ,Y )|Z .Redundancy: X ⊥⊥ Y |X .

James Heckman and Rodrigo Pinto Causal Analysis

Page 16: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

• Application of these tools generates:Y ⊥⊥ V |(U ,X ), U ⊥⊥ X |V .

James Heckman and Rodrigo Pinto Causal Analysis

Page 17: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Analysis of Counterfactuals: The Fixing Operator

• Fixing: Causal operation sets X -inputs of structural equations to x .

Standard Model Model under Fixing

V = fV (εV ) V = fV (εV )U = fU(V , εU) U = fU(V , εU)X = fX (V , εX ) X = xY = fY (X ,U, εY ) Y = fY (x ,U, εY )

• Importance: Establishes a framework for counterfactuals.

• Counterfactual: Y (x) represents outcome Y when X is fixed at x .

• Linear Case: Y = Xβ + U + εY and Y (x) = xβ + U + εY .

James Heckman and Rodrigo Pinto Causal Analysis

Page 18: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Joint Distributions

1 Model Representation under Fixing:

Y = fY (x ,U, εY ); X = x ; U = fU(V , εU); V = fV (εV ).

2 Standard Joint Distribution Factorization:

P(Y ,V ,U|X = x) = P(Y |U,V ,X = x)P(U|V ,X = x)P(V |X = x).

= P(Y |U,V ,X = x)P(U|V )P(V|X = x)

because U ⊥⊥ X |V by LMC.

= P(Y |U,X = x)P(U|V )P(V |X = x)

because Y ⊥⊥ V |X ,U by LMC.

3 Factorization under Fixing X at x:

P(Y ,V ,U|X fixed at x) = P(Y |U,X = x)P(U|V )P(V).

James Heckman and Rodrigo Pinto Causal Analysis

Page 19: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

• Conditioning X at x affects the distribution of V .

• Fixing X at x does not affect the distribution of V .

James Heckman and Rodrigo Pinto Causal Analysis

Page 20: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Understanding the Fixing Operator(Error Term Representation)

The definition of causal model permits the following operations:

1 Through iterated substitution we can represent all variablesas functions of error terms.

2 This representation clarifies the concept of fixing.

James Heckman and Rodrigo Pinto Causal Analysis

Page 21: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Representing the Models Through Their Error Terms

Standard Model Model under Fixing

V = fV (εV ) V = fV (εV )U = fU(fV (εV ), εU) U = fU(fV (εV ), εU)X = fX (fV (εV ), εX ) X = x

Outcome Equation

Standard Model: Y = fY (fX (fV (εV ), εX ), fU(fV (εV ), εU), εY ).

Model under Fixing: Y = fY (x, fU(fV (εV ), εU), εY ).

James Heckman and Rodrigo Pinto Causal Analysis

Page 22: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Understanding the Fixing Operator

1 Cumulative error distribution function: Qε.

2 Conditioning: For Y = fY (fX (fU(εU), εX ), fU(εU), εY ),

E (Y |X = x) =

∫AfY (fX (fV (εV ), εX ), fU(fV (εV ), εU), εY )

dQε(ε)∫A dQε

A = {ε ; fX (fV (εV ), εX ) = x}

Imposes restrictions on values of error terms.

3 Fixing: (Y = fY (x, εX ), fU(εU), εY ))

E (Y (x)) =

∫fY (x, εX ), fU(fV (εV ), εU), εY )dQε(ε)

Imposes no restriction on values assumed by theerror terms.

James Heckman and Rodrigo Pinto Causal Analysis

Page 23: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Fixing 6= Conditioning

Conditioning: Statistical exercise that considers the dependencestructure of the data-generating process.

Y conditioned on X : Y |X = x

Linear Case: E (Y |X = x) = xβ + E(U|X = x);

E (εY |X = x) = 0.

Fixing: Causal exercise that hypothetically assigns values to inputsof the autonomous equation we analyze.

Y when X is fixed at x ⇒ Y (x) = fY (x ,U , εY )

Linear Case: E (Y (x)) = xβ + E(U);

E (εY ) = 0.

James Heckman and Rodrigo Pinto Causal Analysis

Page 24: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Average Causal Effects: X is fixed at x , x ′ :ATE = E(Y (x))− E(Y (x ′))

James Heckman and Rodrigo Pinto Causal Analysis

Page 25: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Fixing: A Causal (not statistical) Operation

• Problem: Fixing is a Causal Operation defined Outside ofstandard statistics.

• Not Readily Comprehended: Its justification/representationdoes not follow from standard statistical arguments.

• Consequence: Frequent source of confusion in statisticaldiscussions.

• Question: How can we make statistics converse with causality?

• Solution: Think outside the box!

James Heckman and Rodrigo Pinto Causal Analysis

Page 26: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Thinking Outside the Box

• This was Haavelmo’s great insight.

1 New Model: Define a Hypothetical Model with desiredindependent variation of inputs.

2 Usage: Hypothetical Model allows us to examine causality.3 Characteristic: usual statistical tools apply.4 Benefit: Fixing translates to statistical conditioning.5 Formalizes the motto “Causality is in the Mind.”6 Clarifies the notion of identification.

Identification:Expresses causal parameters defined in the hypothetical model usingprobabilities of the empirical model that govern the data-generatingprocess.

James Heckman and Rodrigo Pinto Causal Analysis

Page 27: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

2. Empirical and Hypothetical Causal Models

James Heckman and Rodrigo Pinto Causal Analysis

Page 28: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

The Hypothetical Model

Formalizing Haavelmo’s InsightEmpirical Model: Governs the data-generating process.Hypothetical Model: Abstract model used to examine causality.

The hypothetical model has the following properties:

1 Same set of structural equations as the empirical model.

2 Appends a hypothetical variable that we fix.

3 Hypothetical variable not caused by any other variable.

4 Replaces the input variables we seek to fix by the hypotheticalvariable.

James Heckman and Rodrigo Pinto Causal Analysis

Page 29: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

The Hypothetical Model

• Hypothetical Variable: X replaces the X -inputs of structuralequations.

• Characteristic: X is an external variable; i.e., no parents.

• Usage: Hypothetical variable X enables analysts to examine fixingusing standard tools of probability.

• Notation:

1 Empirical Model: (TE,PaE,DE,ChE,PE,EE) denote thevariable set, parents, descendants, children, probability andexpectation of the empirical model.

2 Hypothetical Model: (TH,PaH,DH,ChH,PH,EH) denote thevariable set, parents, descendants, children, probability andexpectation of the hypothetical model.

James Heckman and Rodrigo Pinto Causal Analysis

Page 30: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

The Hypothetical Model and the Data-Generating Process

The hypothetical model is not a speculative departure from theempirical data-generating process but an expanded version of it.

James Heckman and Rodrigo Pinto Causal Analysis

Page 31: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Example of the Hypothetical Model for Fixing X

The Associated Hypothetical Model

Y = fY (X ,U , εY );X = fX (V , εX );U = fU(V , εU);V = fV (εV ).

Empirical Model Hypothetical Model

D Y

V

X

M

U

R D Y

V

X

M

U

R

R D Y

VX

R D Y

V

X Y

UV

X Y

UV

X~

LMC LMC

Y ⊥⊥ V |(U,X ) Y ⊥⊥ (X ,V )|(U, X )

U ⊥⊥ X |V U ⊥⊥ (X , X )|VX ⊥⊥ (U,V ,X )

X ⊥⊥ (U,Y , X )|V

James Heckman and Rodrigo Pinto Causal Analysis

Page 32: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Benefits of a Hypothetical Model

• Formalizes Haavelmo’s insight of Hypothetical variation;

• Statistical Analysis: Bayesian Network Tools apply(Local Markov Condition; Graphoid Axioms);

• Clarifies the definition of causal parameters;

1 Causal parameters are defined under the hypothetical model;2 Observed data is generated through the empirical model;

• Distinguishes definition from identification;

1 Identification requires us to connect the hypothetical andempirical models.

2 Allows us to evaluate causal parameters defined in thehypothetical model using data generated by the empiricalmodel.

James Heckman and Rodrigo Pinto Causal Analysis

Page 33: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Benefits of a Hypothetical Model

1 Versatility: Targets causal links, not variables.

2 Simplicity: Does not require definition of any statistical operationoutside the realm of standard statistics.

3 Completeness: Automatically generates Pearl’s do-calculus when itapplies (Pinto 2013).

Most Important

Fixing in the empirical model is translated to statistical conditioning inthe hypothetical model:

EE(Y (x))︸ ︷︷ ︸Causal Operation Empirical Model

= EH(Y |X = x)︸ ︷︷ ︸Statistical Operation Hypothetical Model

James Heckman and Rodrigo Pinto Causal Analysis

Page 34: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Identification

• Hypothetical Model allows analysts to define and examinecausal parameters.

• Empirical Model generates observed/unobserved data.

• Identification: Express causal parameters of the hypotheticalmodel through observed probabilities in the empirical model.

• Requires: connecting probability distributions of Hypotheticaland Empirical Models.

James Heckman and Rodrigo Pinto Causal Analysis

Page 35: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

How to Connect the Empirical and Hypothetical Models?

1 By sharing the same error terms and structural equations,conditional probabilities of some variables of the hypotheticalmodel can be written in terms of the probabilities of theempirical model.

2 Conditional independence properties of the variables in thehypothetical model facilitate connecting the hypothetical andempirical models.

James Heckman and Rodrigo Pinto Causal Analysis

Page 36: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Connecting the Hypothetical and Empirical Models

Same error terms and structural equations generate:

1 Distribution of non-children of X (i.e., V ∈ TE \ ChH(X )) are thesame in hypothetical and empirical models.

PH(V |PaH(V )) = PE(V |PaE(V )),V ε(TE \ ChH(X ))

2 Distribution of children of X (i.e. V ∈ ChH(X )) are the same inhypothetical and empirical models whenever X and X areconditioned on x :

PH(V |PaH(V ) \ {X}, X = x) = PE(V |PaE(V ) \ {X},X = x).

James Heckman and Rodrigo Pinto Causal Analysis

Page 37: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Connecting the Empirical and Hypothetical Models

Thus,

1 Distribution of non-descendants of X are the same in thehypothetical and empirical models.

2 Distributions of variables conditional on X and X at the samevalue of x in the empirical model and in the hypothetical modelare the same.

3 Distribution of an outcome Y ∈ TE when X is fixed at x is thesame as the distribution of Y conditional on X = x in Y ∈ TH.

James Heckman and Rodrigo Pinto Causal Analysis

Page 38: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Connecting the Hypothetical and Empirical Models

1 L-1: Let W ,Z be any disjoint set of variables in TE \ DH(X )then:

PH(W |Z ) = PH(W |Z , X ) = PE(W |Z )∀{W ,Z} ⊂ TE\DH(X ).

2 T-1: Let W ,Z be any disjoint set of variables in TE then:

PH(W |Z ,X = x , X = x) = PE(W |Z ,X = x) ∀ {W ,Z} ⊂ TE.

3 C-1: Let X be uniformly distributed in the support of X andlet W ,Z be any disjoint set of variables in TE. Then

PH(W |Z ,X = X ) = PE(W |Z ) ∀ {W ,Z} ⊂ TE.

4 Matching: Let Z ,W be any disjoint set of variables in TE

such that, in the hypothetical model, X ⊥⊥ W |(Z , X ), then

PH(W |Z , X = x) = PE(W |Z ,X = x).

James Heckman and Rodrigo Pinto Causal Analysis

Page 39: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Matching: A Solution that Connectsthe Empirical and Hypothetical Models

Matching

If there exists a variable V not caused by X such thatX ⊥⊥ Y |V , X , then EH(Y |V , X = x) under the hypothetical modelis equal to EH(Y |V ,X = x) under the empirical model.

Obs: LMC for the hypothetical model generates X ⊥⊥ Y |V , X .Thus, by matching, treatment effects EE(Y (x)) can be obtained by:

EE(Y (x)) =

∫EH(Y |V = v , X = x)dFV (v)︸ ︷︷ ︸

In Hypothetical Model

=

∫EE(Y |V = v ,X = x)dFV (v)︸ ︷︷ ︸

In Empirical Model

But if V is not observed, the model is not identified without further assumptions.

James Heckman and Rodrigo Pinto Causal Analysis

Page 40: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Some Remarks on Our Causal Framework

• Statistical relationships among variables emerge asconsequences of applying LMC and GA.

• Causal effects are associated with the causal links replaced byhypothetical variables.

• Our framework allows for the same hypothetical variable tooperate through distinct causal effects (such as mediation).

James Heckman and Rodrigo Pinto Causal Analysis

Page 41: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Models for Mediation Analysis

1. Empirical Model 2. Total Effect of X on Y

X M Y

X M Y

X

3. Indirect Effect of X on Y 4. Direct Effect of X on Y

X M Y

X

X M Y

X

James Heckman and Rodrigo Pinto Causal Analysis

Page 42: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

3. Comparing Haavelmo’s Approachwith Pearl’s (2000) Do-calculus

James Heckman and Rodrigo Pinto Causal Analysis

Page 43: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

The Do-calculus

• Stays within the box.• Attempt: Counterfactual manipulations using the empirical

model.• Goal: Expressions obtained from a hypothetical model.• Tools: Uses causal/graphical/statistical rules outside statistics.• Fixing: Uses do(X ) = x for fixing X at x in the DAG for allX -inputs (does not allow to target causal links separately).• Flexibility: Does not easily define complex treatments, such as

treatment on the treated; i.e.,

EE(Y |X = 1, X = 1)− EE(Y |X = 1, X = 0).

Claim: Identification using the hypothetical model is transparentand does not require additional causal rules, only standard statisticaltools.

James Heckman and Rodrigo Pinto Causal Analysis

Page 44: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

The Do-calculus: Summary

1 Haavelmo approach: Thinks outside the causal box.

2 Do-calculus: Requires complex new statistical rules.

James Heckman and Rodrigo Pinto Causal Analysis

Page 45: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Defining the Do-calculus

What is the do-calculus?

A set of three graphical/statistical rules that convert expressions ofcausal inference into probability equations.

1 Goal: Identify causal effects from non-experimental data.

2 Application: Bayesian network structure; i.e., Directed AcyclicGraph (DAG) that represents causal relationships.

3 Identification method: Iteration of do-calculus rules togenerate a function that describes treatment effects statisticsas a function of the observed variables only (Tian and Pearl2002, Tian and Pearl 2003).

James Heckman and Rodrigo Pinto Causal Analysis

Page 46: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Characteristics of Pearl’s Do-Calculus

1 Information: DAG only provides information on the causalrelationships among variables. LMC and GA give conditionalindependence relationships.

2 Not Suited for examining implications of assumptions fromfunctional forms.

3 Identification: If the information in a DAG is sufficient toidentify causal effects, then:

There exists a sequence of application of the rules of theDo-Calculus that generates a formula for causal effects basedon observational quantities (Huang and Valtorta 2006,Shpitser and Pearl 2006).

James Heckman and Rodrigo Pinto Causal Analysis

Page 47: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Characteristics of Pearl’s Do-Calculus

4 Limitation: Does not allow for additional information outsidethe DAG framework.

i Only applies to the information content of a DAG.ii IV is not identified through Do-calculusiii Why? Requires assumptions outside DAGs: linearity,

monotonicity, separability.

James Heckman and Rodrigo Pinto Causal Analysis

Page 48: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Notation for the Do-calculus

More notation is needed to define these rules:

DAG NotationLet X ,Y ,Z be arbitrary disjoint sets of variables (nodes) in a causalgraph G .

• GX : DAG that modifies G by deleting the arrows pointing to X .

• GX : DAG that modifies G by deleting arrows emerging from X .

• GX ,Z : DAG that modifies G by deleting arrows pointing to Xand emerging from Z .

James Heckman and Rodrigo Pinto Causal Analysis

Page 49: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Examples of DAG Notation

G GX GX GX ,U

V

X

U

Y

V

X

U

Y

V

X

U

Y

V

X

U

Y

James Heckman and Rodrigo Pinto Causal Analysis

Page 50: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Do-calculus Rules

• Assumes the Local Markov Condition and independence of ε.

Let G be a DAG and let X ,Y ,Z ,W be any disjoint sets ofvariables. The do-calculus rules are:

• Rule 1: Insertion/deletion of observations:Y ⊥⊥ Z |(X ,W ) under GX

⇒ P(Y |do(X ),Z ,W ) = P(Y |do(X ),W ).• Rule 2: Action/observation exchange:Y ⊥⊥ Z |(X ,W ) under GX ,Z

⇒ P(Y |do(X ), do(Z ),W ) = P(Y |do(X ),Z ,W ).• Rule 3: Insertion/deletion of actions:Y ⊥⊥ Z |(X ,W ) under GX ,Z(W )

⇒ P(Y |do(X ), do(Z ),W ) = P(Y |do(X ),W ),where Z (W ) is the set of Z -nodes that are not ancestors ofany W -node in GX .

James Heckman and Rodrigo Pinto Causal Analysis

Page 51: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Examples

G G X

V

XZ

U

Y

W

V

XZ

U

Y

W

G X , Y GX , Z(W )

V

XZ

U

Y

W

V

XZ

U

Y

W

James Heckman and Rodrigo Pinto Causal Analysis

Page 52: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Understanding the Rules of Do-Calculus

Let G be a DAG; then, for any disjoint sets of variables X ,Y ,Z ,W :Rule 1: Insertion/deletion of observations

If Y ⊥⊥ Z |(X ,W )︸ ︷︷ ︸Statistical Relation

under GX︸︷︷︸Graphical Criterion

then

Pr(Y |do(X ),Z ,W ) = Pr(Y |do(X ),W )︸ ︷︷ ︸Equivalent Probability Expression

James Heckman and Rodrigo Pinto Causal Analysis

Page 53: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Do-Calculus Exercise

G GX

V

X

U

Y

V

X

U

Y

1 LMC to X under GX generates

X ⊥⊥ (U,Y )|V ⇒ X ⊥⊥ (U,Y )|V .

2 Now if X ⊥⊥ (U,Y )|V holds under GX , then, by Rule 2,

P(Y |do(X ),V ) = P(Y |X ,V ). (1)

∴ E (Y |do(X ) = x) =

∫E (Y |V = v , do(X ) = x)dFV (v)︸ ︷︷ ︸

Using do(X); i.e., Fixing X

=

∫E (Y |V = v ,X = x)dFV (v)︸ ︷︷ ︸

Replace “do” with Standard Statistical Conditioning

by Equation (1)

James Heckman and Rodrigo Pinto Causal Analysis

Page 54: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

4. Limitations of DAGs for Econometric Identification

James Heckman and Rodrigo Pinto Causal Analysis

Page 55: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Failure of Do-CalculusDoes not Generate Standard IV Results

The simplest instrumental variable model consists of four variables:

1 A confounding variable U that is external and unobserved,

2 An external instrumental variable Z ,

3 An observed variable X caused by U and Z ,

4 An outcome Y caused by U and X .

XZ

U

Y

XZ

U

Y

XZ

U

Y

XZ

U

Y

0

1

2

3

XZ

U

Y

4

James Heckman and Rodrigo Pinto Causal Analysis

Page 56: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

4.1 Do-Calculus Non-identification of the IV Model

• Limitation: IV model is not identified by literature that reliesexclusively on DAGs.

• Why? IV identification relies on assumptions outside the scopeof the DAG literature.

• LMC generates the conditional independence relationships:Y ⊥⊥ Z |(U ,X ) and U ⊥⊥ Z .

• TSLS: X �⊥⊥ Z holds; thus, the IV model satisfies thenecessary criteria to apply the method of Two Stage LeastSquares (TSLS).

• Assumption Outside of DAGs: TSLS identifies the IV modelunder linearity.

James Heckman and Rodrigo Pinto Causal Analysis

Page 57: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Do-Calculus and IV

The Do-Calculus applied to the IV Model generates:

1 Pr(Y |do(X ), do(Z )) = Pr(Y |do(X ),Z ) = Pr(Y |do(X )),

2 Pr(Y |do(Z )) = Pr(Y |Z )

Only establishes the exogeneity of the instrumental variable Z .Insufficient to identify Pr(Y |do(X )).

• The instrumental variable model is not identified applying therules of the do-calculus.

• Indeed, in this framework it is impossible to identify the causaleffect of X on Y without additional information.

• The instrumental variable model is identified under furtherassumptions such as linearity, separability, and monotonicity.

• However, these assumptions are outside the scope of thedo-calculus.

James Heckman and Rodrigo Pinto Causal Analysis

Page 58: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

The “do-calculus” does identify the “front-door” model.

Link to Appendix I

James Heckman and Rodrigo Pinto Causal Analysis

Page 59: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Table 1: “Front-Door” Empirical and Hypothetical Models

1. Pearl’s “Front-Door” Empirical Model 2. Our Version of the “Front-Door” Hypothetical Model

T = {U,X ,M,Y } T = {U,X ,M,Y , X}ε = {εU , εX , εM , εY } ε = {εU , εX , εM , εY }Y = fY (M,U, εY ) Y = fY (M,U, εY )X = fX (U, εX ) X = fX (U, εX )

M = fM(X , εM) M = fM(X , εM)U = fU(εU) U = fU(εU)

U

MX Y

U

MX Y

X

Pa(U) = ∅, Pa(U) = Pa(X ) = ∅,Pa(X ) = {U} Pa(X ) = {U}Pa(M) = {X} Pa(M) = {X}

Pa(Y ) = {M,U} Pa(Y ) = {M,U}

Y ⊥⊥ X |(M,U) Y ⊥⊥ (X ,X )|(M,U)

M ⊥⊥ U|X M ⊥⊥ (U,X )|XX ⊥⊥ (M, X ,Y )|U

U ⊥⊥ (M, X )

X ⊥⊥ (X ,U)

PE(Y ,M,X ,U) = PH(Y ,M,X ,U, X ) =

PE(Y |M,U) PE(X |U) PE(M|X ) PE(U) PH(Y |M,U) P(X |U) PH(M|X ) PH(U) PH(X )

PE(Y ,M,U|do(X ) = x) = PH(Y ,M,U,X |X = x) =

PE(Y |M,U) PE(M|X = x) PE(U) PH(Y |M,U) P(X |U) PH(M|X = x) PH(U)

James Heckman and Rodrigo Pinto Causal Analysis

Page 60: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

The “do-calculus” cannot identify the Generalized Roy Model.

Link to Appendix II

James Heckman and Rodrigo Pinto Causal Analysis

Page 61: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

5. Summary of Do-calculus and Haavelmo

James Heckman and Rodrigo Pinto Causal Analysis

Page 62: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Summarizing Do-calculus of Pearl (2009) andHaavelmo’s Framework

• Common Features of Haavelmo and Do-Calculus:

1 Autonomy (Frisch, 1938)2 Errors Terms: ε mutually independent3 Statistical Tools: LMC and GA apply4 Counterfactuals: Fixing or Do-operator is a Causal, not

statistical, Operation.

• Distinctive Features of Haavelmo and Do-Calculus:

Haavelmo Do-calculusApproach: Thinks Outside the Box Applies Complex ToolsIntroduces: Hypothetical Model Graphical RulesIdentification: Connects PH and PE Iteration of RulesVersatility: Basic Statistics Apply Extra Notation/Tools

James Heckman and Rodrigo Pinto Causal Analysis

Page 63: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

6. Hypothetical Models and Simultaneous Equations

James Heckman and Rodrigo Pinto Causal Analysis

Page 64: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

The Simultaneous Equation Model

A system of two equations:

Y1 = gY1(Y2,X1,U1) (7a)

Y2 = gY2(Y1,X2,U2). (7b)

• Variables: TE = {Y1,Y2,X1,X2,U1,U2}.• Assumptions: U1 ⊥⊥ U2 and (U1,U2) ⊥⊥ (X1,X2).

(made only to simplify the argument)• LMC condition breaks down.• Matzkin (2008) relaxes these assumptions and identifies

causal effects for U1 ⊥�⊥ U2 and (U1,U2) ⊥�⊥ (X1,X2).

Fixing readily extends to a system of simultaneous equations for Y1

and Y2, whereas the fundamentally recursive methods based onDAGs do not (Pearl, 2009).

James Heckman and Rodrigo Pinto Causal Analysis

Page 65: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Characteristics of the Simultaneous Equation Model

• Autonomy: the causal effect of Y2 on Y1 when Y2 is fixed aty2 is given by

Y1(y2) = gY1(y2,X ,U1).

• Symmetrically:

Y2(y1) = gY2(y1,X ,U2).

• Define hypothetical random variables Y1, Y2 such that:• Y1, Y2 replaces the Y1,Y2 inputs on Equations (7a) and (7b).• (Y1, Y2) ⊥⊥ (X1,X2,U1,U2); and Y1 ⊥⊥ Y2.• TH = {Y1, Y2,Y1,Y2,X1,X2,U1,U2}.• Assume a common support for (Y1,Y2) and (Y1, Y2).

James Heckman and Rodrigo Pinto Causal Analysis

Page 66: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Counterfactuals of the Simultaneous Equation Model

• Distribution of Y1 when Y2 is fixed at y2 is given by

PH(Y1|Y2 = y2).

• Average causal effect of Y2 on Y1 when Y2 is fixed at y2 andy ′2 values:

EH(Y1|Y2 = y2)− EH(Y1|Y2 = y ′2)

• Notation: EH denotes expectation over the probabilitymeasure PH of the hypothetical model.

James Heckman and Rodrigo Pinto Causal Analysis

Page 67: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Empirical Model for Simultaneous Equations

X2 Y2 Y1 X1

U2 U1

James Heckman and Rodrigo Pinto Causal Analysis

Page 68: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Some Hypothetical Models for Y2 and Y1, Respectively

X Y Y X

U U YY

James Heckman and Rodrigo Pinto Causal Analysis

Page 69: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Completeness Assumption

• Common Assumption: completeness—the existence of atleast a local solution for Y1 and Y2 in terms of (X1,X2,U1,U2):

Y1 = φ1(X1,X2,U1,U2) (8a)

Y2 = φ2(X1,X2,U1,U2). (8b)

• Reduced form equations (see, e.g., Matzkin, 2008, 2013).

• Inherit the autonomy properties of the structural equations.

James Heckman and Rodrigo Pinto Causal Analysis

Page 70: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

If X1 and X2 are disjoint, then use ILS to identify:

∂Y1

∂X2=∂gY1(Y2,X1,U1)

∂Y2

∂Y2

∂X2

∂Y1

∂X2=∂gY1(Y2,X1,U1)

∂X2

=∂gY1(·)∂Y2(·)

∂Y2(·)∂X2

∂Y1

∂X2

∂Y2

∂X2

=

∂φ1(·)∂X2

∂φ2(·)∂X2

=∂gY1(·)∂Y2

But cannot be identified by the rules of the do-calculus.Source: Heckman (2008)

James Heckman and Rodrigo Pinto Causal Analysis

Page 71: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

7. Pearl’s Lament

James Heckman and Rodrigo Pinto Causal Analysis

Page 72: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

An Econometric Awakening? Pearl (2009, page 379)

“In almost every one of his recent articles James Heckman stresses theimportance of counterfactuals as a necessary component of economicanalysis and the hallmark of econometric achievement in the past cen-tury. For example, the first paragraph of the HV article reads: ‘they[policy comparisons] require that the economist construct counterfac-tuals. Counterfactuals are required to forecast the effects of policiesthat have been tried in one environment but are proposed to be ap-plied in new environments and to forecast the effects of new policies.’Likewise, in his Sociological Methodology article (2005), Heckmanstates: ‘Economists since the time of Haavelmo (1943, 1944) haverecognized the need for precise models to construct counterfactuals...The econometric framework is explicit about how counterfactuals aregenerated and how interventions are assigned...’ ”

James Heckman and Rodrigo Pinto Causal Analysis

Page 73: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

An Econometric Awakening? Pearl (2009, page 379)

“And yet, despite the proclaimed centrality of counterfactuals ineconometric analysis, a curious reader will be hard pressed to iden-tify even one econometric article or textbook in the past 40 years inwhich counterfactuals or causal effects are formally defined. Needed isa procedure for computing the counterfactual Y (x , u) in a well-posed,fully specified economic model, with X and Y two arbitrary variablesin the model. By rejecting Haavelmo’s definition of Y (x , u), based onsurgery, Heckman commits econometrics to another decade of divisionand ambiguity, with two antagonistic camps working in almost totalisolation.”

James Heckman and Rodrigo Pinto Causal Analysis

Page 74: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

An Econometric Awakening? Pearl (2009, page 379)

“Economists working within the potential-outcome framework of theNeyman-Rubin model take counterfactuals as primitive, unobservablevariables, totally detached from the knowledge encoded in structuralequation models (e.g., Angrist 2004; Imbens 2004). Even those ap-plying propensity score techniques, whose validity rests entirely on thecausal assumption of ‘ignorability,’ or unconfoundedness, rarely knowhow to confirm or invalidate that assumption using structural knowl-edge....”

James Heckman and Rodrigo Pinto Causal Analysis

Page 75: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

An Econometric Awakening? Pearl (2009, page 379)

“Economists working within the structural equation framework (e.g.,Kennedy 2003; Mittelhammer et al. 2000; Intriligator et al. 1996) arebusy estimating parameters while treating counterfactuals as meta-physical ghosts that should not concern ordinary mortals. They trustleaders such as Heckman to define precisely what the policy implica-tions are of the structural parameters they labor to estimate, and torelate them to what their colleagues in the potential-outcome camp aredoing.... Economists will do well resurrecting the ideas of Haavelmo(1943), Marschak (1950), and Strotz and Wold (1960) and reinvigo-rating them with the logic of graphs and counterfactuals presented inthis book.”

James Heckman and Rodrigo Pinto Causal Analysis

Page 76: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

8. Conclusion

James Heckman and Rodrigo Pinto Causal Analysis

Page 77: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Examined Haavelmo’s fundamental contributions

• Distinction between causation and correlation (first formalanalysis).

• Distinguished definition of causal parameters (though processof creating hypothetical models) from their identification fromdata.

• Explained that causal effects of inputs on outputs are definedunder abstract models that assign independent variation toinputs.

• Clarified concepts that are still muddled in some quarters ofstatistics.

• Formalizes Frisch’s notion that causality is in the mind.

James Heckman and Rodrigo Pinto Causal Analysis

Page 78: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Causal Framework Inspired by Haavelmo’s Ideas

• Contribution: Causal framework inspired by Haavelmo.• Introduce hypothetical models for examining causal effects.• Assigns independent variation to inputs determining outcomes.• Enables us to discuss causal concepts such as fixing using an

intuitive approach.• Fixing is easily translated to statistical conditioning.• Eliminates the need for additional extra-statistical graphical/

statistical rules to achieve identification (in contrast with thedo-calculus).• Identification relies on evaluating causal parameters defined in

the hypothetical model using data generated by the empiricalmodel.• Achieved by applying standard statistical tools to

fundamentally recursive Bayesian Networks.

James Heckman and Rodrigo Pinto Causal Analysis

Page 79: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Beyond DAGs

• We discuss the limitations of methods of identification that rely on thefundamentally recursive approach of Directed Acyclic Graphs.

• Haavelmo’s framework can be extended to the fundamentallynon-recursive framework of the simultaneous equations model withoutviolating autonomy.

• Simultaneous equations are fundamentally non-recursive and fall outsideof the framework of Bayesian causal nets and DAGs.

• Haavelmo’s approach also covers simultaneous causality whereas otherframeworks cannot, except through ad hoc rules such as “shutting down”equations.

• Haavelmo’s framework allows for a variety of econometric methods can beused to secure identification of this class of models (see, e.g., Matzkin,2012, 2013.)

James Heckman and Rodrigo Pinto Causal Analysis

Page 80: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

9. Appendix I

James Heckman and Rodrigo Pinto Causal Analysis

Page 81: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Comparing Analyses Based on the Do-calculuswith Those From the Hypothetical Model

• We illustrate the use of the do-calculus and the hypotheticalmodel approaches by identifying the causal effects of awell-known model that Pearl (2009) calls the “Front-Doormodel.”

• It consists of four variables: (1) an external unobserved variableU , (2) an observed variable X caused by U , (3) an observedvariable M caused by X , and (4) an outcome Y caused by Uand M .

James Heckman and Rodrigo Pinto Causal Analysis

Page 82: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Table 2: “Front-Door” Empirical and Hypothetical Models

1. Pearl’s “Front-Door” Empirical Model 2. Our Version of the “Front-Door” Hypothetical Model

T = {U,X ,M,Y } T = {U,X ,M,Y , X}ε = {εU , εX , εM , εY } ε = {εU , εX , εM , εY }Y = fY (M,U, εY ) Y = fY (M,U, εY )X = fX (U, εX ) X = fX (U, εX )

M = fM(X , εM) M = fM(X , εM)U = fU(εU) U = fU(εU)

U

MX Y

U

MX Y

X

Pa(U) = ∅, Pa(U) = Pa(X ) = ∅,Pa(X ) = {U} Pa(X ) = {U}Pa(M) = {X} Pa(M) = {X}

Pa(Y ) = {M,U} Pa(Y ) = {M,U}

Y ⊥⊥ X |(M,U) Y ⊥⊥ (X ,X )|(M,U)

M ⊥⊥ U|X M ⊥⊥ (U,X )|XX ⊥⊥ (M, X ,Y )|U

U ⊥⊥ (M, X )

X ⊥⊥ (X ,U)

PE(Y ,M,X ,U) = PH(Y ,M,X ,U, X ) =

PE(Y |M,U) PE(X |U) PE(M|X ) PE(U) PH(Y |M,U) P(X |U) PH(M|X ) PH(U) PH(X )

PE(Y ,M,U|do(X ) = x) = PH(Y ,M,U,X |X = x) =

PE(Y |M,U) PE(M|X = x) PE(U) PH(Y |M,U) P(X |U) PH(M|X = x) PH(U)

James Heckman and Rodrigo Pinto Causal Analysis

Page 83: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

• The do-calculus identifies P(Y |do(X )) through four steps,which we now perform.

• Steps 1, 2, and 3 identify P(M |do(X )), P(Y |do(M)), andP(Y |M , do(X )), respectively.

James Heckman and Rodrigo Pinto Causal Analysis

Page 84: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

1 Invoking LMC for variable M of DAG GX (DAG 1 of Table 3)generates X ⊥⊥ M . Thus, by Rule 2 of the do-calculus, weobtain P(M |do(X )) = P(M |X ).

James Heckman and Rodrigo Pinto Causal Analysis

Page 85: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Table 3: Do-Calculus and the Front-Door Model

1. Modified Front-Door Model G X = GM 2. Modified Front-Door Model GM

U

MX Y

U

MX Y

(Y ,M) ⊥⊥ X |U (X ,M) ⊥⊥ Y |U(X ,U) ⊥⊥ M (Y ,U) ⊥⊥ M|X

3. Modified Front-Door Model G X ,M 4. Modified Front-Door Model G X ,M

U

MX Y

U

MX Y

(X ,M) ⊥⊥ (Y ,U) (Y ,M,U) ⊥⊥ XU ⊥⊥ M

James Heckman and Rodrigo Pinto Causal Analysis

Page 86: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

1 Invoking LMC for variable M of DAG GX (DAG 1 of Table 3) generatesX ⊥⊥ M. Thus, by Rule 2 of the do-calculus, we obtainP(M|do(X )) = P(M|X ).

2 Invoking LMC for variable M of DAG GM (DAG 1 of Table 3) generatesX ⊥⊥ M. Thus, by Rule 3 of the do-calculus, P(X |do(M)) = P(X ). Inaddition, applying LMC for variable M of DAG GM (DAG 2 of Table 3)generates M ⊥⊥ Y |X . Thus, by Rule 2 of do-calculus,P(Y |X , do(M)) = P(Y |X ,M).

Therefore, P(Y |do(M)) =∑

x′∈supp(X )

P(Y |X = x ′, do(M)) P(X = x ′|do(M))

=∑

x′∈supp(X )

P(Y |X = x ′,M) P(X = x ′),

where “supp” means support.

3 Invoking LMC for variable M of DAG GX ,M (DAG 3 of Table 3) generates

Y ⊥⊥ M|X .

James Heckman and Rodrigo Pinto Causal Analysis

Page 87: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

4 Thus, by Rule 2 of the do-calculus, P(Y |M, do(X )) = P(Y |do(M), do(X )). In addition,applying LMC for variable X of DAG G X ,M (DAG 4 of Table 3) generates

(Y ,M,U) ⊥⊥ X . By weak union and decomposition, we obtain Y ⊥⊥ X |M. Thus, byRule 3 of the do-calculus, we obtain that P(Y |do(X ), do(M)) = P(Y |do(M)). Thus,P(Y |M, do(X )) = P(Y |do(M), do(X )) = P(Y |do(M)).

5 We collect the results from the three previous steps to identify P(Y |do(X )):

P(Y |do(X ) = x)

=∑

m∈supp(M)

P(Y |M, do(X ) = x) P(M|do(X ) = x)

=∑

m∈supp(M)

P(Y |do(M) = m, do(X ) = x)︸ ︷︷ ︸Step 3

P(M = m|do(X ) = x)

=∑

m∈supp(M)

P(Y |do(M) = m)︸ ︷︷ ︸Step 3

P(M = m|do(X ) = x)

=∑

m∈supp(M)

( ∑x′∈supp(X )

P(Y |X = x ′,M) P(X = x ′)

)︸ ︷︷ ︸

Step 2

P(M = m|X = x)︸ ︷︷ ︸Step 1

.

James Heckman and Rodrigo Pinto Causal Analysis

Page 88: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

• We use the do-calculus to identify the desired causal parameter,using the approach inspired by Haavelmo’s ideas.

• We replace the relationship of X on M by a hypotheticalvariable X that causes M .

• We use PE to denote the probability of the Front-Door modelthat generates the data (Column 1 of Table 2) and PH for thehypothetical model (Column 2 of Table 2).

James Heckman and Rodrigo Pinto Causal Analysis

Page 89: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Lemma 1

In the Front-Door hypothetical model,(1) Y ⊥⊥ X |M ,(2) X ⊥⊥ M ,and(3) Y ⊥⊥ X |(M ,X ).

James Heckman and Rodrigo Pinto Causal Analysis

Page 90: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Proof

By LMC for X , we obtain (Y ,M , X ) ⊥⊥ X |U . By LMC for Y , weobtain Y ⊥⊥ (X , X )|(M ,U). By Contraction applied to(Y ,M , X ) ⊥⊥ X |U and Y ⊥⊥ (X , X )|(M ,U), we obtain(Y ,X ) ⊥⊥ X |(M ,U). By LMC for U , we obtain (M , X ) ⊥⊥ U . ByContraction applied to (M , X ) ⊥⊥ U and (Y ,M , X ) ⊥⊥ X |U , weobtain (X ,U) ⊥⊥ (M , X ). The second relationship of the Lemma isobtained by Decomposition. In addition, by Contraction on(Y ,X ) ⊥⊥ X |(M ,U) and (M , X ) ⊥⊥ U , we obtain(Y ,X ,U) ⊥⊥ X |M . The two remaining conditional independencerelationships of the Lemma are obtained by Weak Union andDecomposition.

James Heckman and Rodrigo Pinto Causal Analysis

Page 91: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Applying these results,

PH(Y |X = x)

=∑

m∈supp(M)

PH(Y |M = m, X = x) PH(M = m|X = x)

=∑

m∈supp(M)

PH(Y |M = m) PH(M = m|X = x)

=∑

m∈supp(M)

( ∑x′∈supp(X )

PH(Y |X = x ′,M = m) PH(X = x ′|M = m)

)PH(M = m|X = x)

=∑

m∈supp(M)

( ∑x′∈supp(X )

PH(Y |X = x ′,M = m) PH(X = x ′)

)PH(M = m|X = x)

=∑

m∈supp(M)

( ∑x′∈supp(X )

PH(Y |X = x ′, X = x ′,M = m) PH(X = x ′)

)PH(M = m|X = x)

=∑

m∈supp(M)

( ∑x′∈supp(X )

PE(Y |M,X = x ′)︸ ︷︷ ︸by T1

(in text)

PE(X = x ′)︸ ︷︷ ︸by Lemma1

)PE(M = m|X = x)︸ ︷︷ ︸

by M1(in text)

.

James Heckman and Rodrigo Pinto Causal Analysis

Page 92: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

• The second equality comes from relationship (1) Y ⊥⊥ X |M ofLemma 1.

• The fourth equality comes from relationship (2) X ⊥⊥ M ofLemma 1.

• The fifth equality comes from relationship (3) Y ⊥⊥ X |(M ,X )of Lemma 1.

• The last equality links the distributions of the hypotheticalmodel with the ones of the empirical model.

James Heckman and Rodrigo Pinto Causal Analysis

Page 93: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

• The first term uses Theorem 1 to equatePH(Y |X = x ′, X = x ′,M = m) = PE(Y |M ,X = x ′).

• The second term uses the fact that X is not a child of X ; thus,by Lemma, PH(X = x ′) = PE(X = x ′).

• Finally, the last term uses Matching applied to M . Namely, theLMC for M generates M ⊥⊥ X |X in the hypothetical model.

• Then, by Matching, PH(M |X = x) = PE(M |X = x).

James Heckman and Rodrigo Pinto Causal Analysis

Page 94: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

• Both frameworks produce the same final identification formula.

• The methods underlying them differ greatly.

• Concept in the framework inspired by Haavelmo is the notion ofa hypothetical model.

Return to main text.

James Heckman and Rodrigo Pinto Causal Analysis

Page 95: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

10. Appendix II

James Heckman and Rodrigo Pinto Causal Analysis

Page 96: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Do-Calculus and the Generalized Roy Model

James Heckman and Rodrigo Pinto Causal Analysis

Page 97: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Generalized Roy Model

The Generalized Roy Model stems from six variables:

1 V : unobserved confounding variable V not caused by anyvariable;

2 X: observed pre-treatment variables X caused by V ;

3 Z: instrumental variable Z caused by X ;

4 T : treatment choice T that caused by Z , V , and X ;

5 U : unobserved variable U caused by T , V , and X ;

6 Y : outcome of interest Y caused by T , U , and X .

James Heckman and Rodrigo Pinto Causal Analysis

Page 98: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Generalized Roy Model

R D Y

VX

A

R D Y

VX

A

D~

D~

Z T Y

UV

XZ D Y

UVX

This figure represents causal relations of the Generalized RoyModel. Arrows represent direct causal relations. Circles representunobserved variables. Squares represent observed variables.

James Heckman and Rodrigo Pinto Causal Analysis

Page 99: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Key Aspects of the Generalized Roy Model

1 T is caused by Z , V ;

2 U mediates the effects of V on Y (that is, V causes U);

3 T and U cause Y ; and

4 Z (instrument) not caused by V , U and does not directly causeY , U .

We are left to examine the cases whether:

1 V causes X (or vice-versa),

2 X causes Z (or vice-versa),

3 X causes T ,

4 X causes U ,

5 T causes U , and

6 X causes Y .

The combinations of all these causal relations generate 144 possiblemodels (Pinto, 2013).

James Heckman and Rodrigo Pinto Causal Analysis

Page 100: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Key Aspects of the Generalized Roy Model (Pinto, 2013)R D Y

VX

A

R D Y

VX

A

D~

D~

Z T Y

UV

XZ T Y

UVX

Z D Y

UVX

Dashed lines denote causal relations that may not exist or, if theyexist, the causal direction can go either way. Dashed arrows denotecausal relations that may not exist, but, if they exist, the causaldirection must comply with the arrow direction.

James Heckman and Rodrigo Pinto Causal Analysis

Page 101: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

“Marginalizing” the Generalized Roy Model

• We examine the identification of causal effects of theGeneralized Roy Model using a simplified model w.l.o.g.

• Suppress variables X and U .

• This simplification is usually called marginalization in the DAGliterature (Koster (2002), Lauritzen (1996), Wermuth (2011)).

James Heckman and Rodrigo Pinto Causal Analysis

Page 102: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Marginalizing the Generalized Roy Model

GZ G

U

X YZ

U

X YZ

This figure represents causal relations of the Marginalized RoyModel. Arrows represent direct causal relations. Circles representunobserved variables. Squares represent observed variables.Note: Z is exogenous; thus, conditioning on Z is equivalent tofixing Z .

James Heckman and Rodrigo Pinto Causal Analysis

Page 103: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Examining the Marginalized Roy Model – 1/4

• Y ⊥⊥ Z in GX , by Rule 1

Pr(Y |do(X ),Z ) = Pr(Y |do(X ))

• Y ⊥⊥ Z , in GX ,Z , by Rule 3

Pr(Y |do(X ),Z ) = Pr(Y |do(X ))

• Y ⊥⊥ Z |X in GX ,Z , by Rule 2

Pr(Y |do(X ), do(Z )) = Pr(Y |do(X ),Z )

GX = GX ,Z = GX ,Z

U

X YZ

James Heckman and Rodrigo Pinto Causal Analysis

Page 104: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Examining the Marginalized Roy Model – 2/4

• Under GX , Y �⊥⊥ X , thus Rule 2 does not apply.

• Under GX ,Z , Y �⊥⊥ X |Z , thus Rule 2 does not apply.

GX = GX ,Z

U

X YZ

James Heckman and Rodrigo Pinto Causal Analysis

Page 105: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Examining the Marginalized Roy Model – 3/4

• GZ ⇒ Y ⊥⊥ Z , thus by Rule 2, Pr(Y |do(Z )) = Pr(Y |Z ).

GZ

U

X YZ

James Heckman and Rodrigo Pinto Causal Analysis

Page 106: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Examining the Marginalized Roy Model – 4/4Modifications

• Under GX ,Z , Y �⊥⊥ (X ,Z ), thus Rule 2 does not apply.

GX ,Z

U

X YZ

James Heckman and Rodrigo Pinto Causal Analysis

Page 107: Causal Analysis After Haavelmo - University of Chicagojenni.uchicago.edu/econ312/Slides/Heckman-Pinto-Causal... · 2019. 4. 2. · 2 Laid the foundations for counterfactual policy

Approach Models Do-Calculus DAG Limitations Comparing Models Lament Conclusion Appendix I Appendix II

Conclusion on Roy

The Do-Calculus applied to the Marginalized Roy Model generates:

1 Pr(Y |do(X ), do(Z )) = Pr(Y |do(X ),Z ) = Pr(Y |do(X )),

2 Pr(Y |do(Z )) = Pr(Y |Z )

These relations only corroborate the exogeneity of the instrumentalvariable Z and are not sufficient to identify Pr(Y |do(X )).

Identification of the Roy ModelTo identify the Roy Model, we make assumption on how Z impactsX , i.e., monotonicity/separability.These assumptions cannot be represented in a DAG.These assumptions are associated with properties of how Z causesX and not only if Z causes X .

Return to main text.

James Heckman and Rodrigo Pinto Causal Analysis