ECO 375: Introduction to Econometrics · 2020-01-30 · ECO375: Introduction to EconometricsBrian...
Transcript of ECO 375: Introduction to Econometrics · 2020-01-30 · ECO375: Introduction to EconometricsBrian...
Dependence Structure between Random VariablesMoving from Correlation to Causation
ECO 375: Introduction to EconometricsMoving from Correlation to Causation
Brian PhelanDePaul University
January 12, 2020
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 1 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Statistics vs. Econometrics
Statistics Emphasizes...Properties of Random VariablesDependence Structure of Random Variables
A random variable is a variable whose outcome determinedin part by chance.
Econometrics Emphasizes...Understanding the causal relationship between randomvariablesUses include:
test economic theoriesestimate economic parameters (e.g. elasicities)Predict economic aggregates (GDP, unemployment, etc)Measure efficacy and distortions of policies (e.g. taxes,minimum wages)
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 2 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Statistics and Econometrics are both interested in...
The relationship between variables
Dependence over time
If it rains today is it more likely to rain tomorrow?
Dependence across variables:
How are height and weight related?
How is the IQ of one twin related to the IQ of the other twin?
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 3 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Scatter Plot
A scatter plot is a good way to see the relationshipbetween random variables
Sample of N observations with the following information:
Individual, Age, Education, Height, Weight, etc.
For each observation, 2 pieces of data (X,Y)
Plot each point for all observations in sample
For example, Height vs. Weight
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 4 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Scatter Plot: Height and Weight of Adult FemalesScatter Plot: Height and Weight of Adult Females
ECO375: Introduction to Econometrics Moving from Correlation 28 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 5 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Scatter Plot: Cigarette Consumption and TaxesScatter Plot: Cigarette Consumption and Taxes
ECO375: Introduction to Econometrics Moving from Correlation 29 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 6 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Scatter Plot: Wins and Home Attendance in BaseballScatter Plot: Wins and Home Attendance in Baseball
ECO375: Introduction to Econometrics Moving from Correlation 30 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 7 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Scatter Plot: IQ of Twins Raised ApartScatter Plot: IQ of Twins Raised Apart
ECO375: Introduction to Econometrics Moving from Correlation 31 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 8 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Correlation: An ExampleCorrelation: An Example
ECO375: Introduction to Econometrics Moving from Correlation 41 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 9 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Correlation: An ExampleCorrelation: An Example
ECO375: Introduction to Econometrics Moving from Correlation 42 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 10 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Correlation: An ExampleCorrelation: An Example
ECO375: Introduction to Econometrics Moving from Correlation 43 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 11 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Correlation: An ExampleCorrelation: An Example
ECO375: Introduction to Econometrics Moving from Correlation 44 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 12 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Correlation: An ExampleCorrelation: An Example
ECO375: Introduction to Econometrics Moving from Correlation 45 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 13 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Correlation: An ExampleCorrelation: An Example
ECO375: Introduction to Econometrics Moving from Correlation 46 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 14 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Limitation
Correlation coefficient is a convenient way to measure astatistical relationship between two variables
It does not however signify anything more than statisticalrelationship
It does not get us any closer to saying whether two thingsare causally related
Correlation does not equal causation
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 15 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Births to Unwed Mothers
Risen from 5% in 1960 to 37% in 2006
Predictive of many child outcomes
Low birth weight, increased mortality, poor performance inschools, etc.
Many potential explanations
Poor performance of male wages & employment, risingdivorce
Is there a magic bullet explanation?
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 16 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
What is Correlated to Births to Unwed Mothers?What is Correlated to Births to Unwed Mothers?
Figure:ECO375: Introduction to Econometrics Moving from Correlation 50 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 17 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
What is Correlated to Births to Unwed Mothers?What is Correlated to Births to Unwed Mothers?
ECO375: Introduction to Econometrics Moving from Correlation 51 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 18 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
What is Correlated to Births to Unwed Mothers?What is Correlated to Births to Unwed Mothers?
ECO375: Introduction to Econometrics Moving from Correlation 52 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 19 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
What is Correlated to Births to Unwed Mothers?What is Correlated to Births to Unwed Mothers?
ECO375: Introduction to Econometrics Moving from Correlation 53 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 20 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Correlation Does Not Equal CausationCorrelation Does Not Equal Causation
ECO375: Introduction to Econometrics Moving from Correlation 55 / 79
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 21 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Correlation Does Not Equal CausationCorrelation Does Not Equal Causation
ECO375: Introduction to Econometrics Moving from Correlation 56 / 79
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 22 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Correlation Does Not Equal Causation
��������� ���� ����������
������������������������� ��� ���������� ���
!"#"�$%&'()$*�+,-,�!)."'#/)0#�%1�23'4(&5#&')�"06�7)0#)'$�1%'�!4$)"$)�7%0#'%5�8�9'):)0#4%0
;)6$<))#�#"035403$7<))$)�(%0$&/)6�=>??@ABC@D�EFCG�7%'')5"#4%0*�HI,JKL�M'NO,HIJOHKP
QRSTURRV�VWXYZ[XYT \URRTR�]̂XT_̀ RSaOOO aOOK aOOa aOOb aOOI aOOc aOOd aOOJ aOOe aOOHaOOO aOOK aOOa aOOb aOOI aOOc aOOd aOOJ aOOe aOOH
aOO�6)"#<$IOO�6)"#<$dOO�6)"#<$eOO�6)"#<$ae,c5f$bO5f$bK,c5f$bb5f$
#g5)':43)0,(%/Brian Phelan DePaul University ECO 375: Introduction to Econometrics 23 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Conflating Correlation and Causation
Above examples purposely silly, but distinguishingcausation from correlation can be very difficult.
For example, there is a very strong correlation betweengraduating college and earning more money later in life(we saw the same with economics in particular), is this acausal relationship?
Why might this be causal?Why might this be spurious correlation?Why does knowing the difference matter?
What about pro-Marriage policies
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 24 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Ceteris Paribus Condition
We want to know what happens to some variable, Y, whenwe increase the variable X by one unit – holding all otherrelevant factors/variables fixed (i.e. ceteris paribus).
If ceteris paribus condition holds, we can estimate causaleffects.
Why does the correlation between college graduation andearnings violate the ceteris paribus condition?The SI program at DePaul claims that students that attendsix or more SI sessions have a 90% pass rate.
This is a conditional mean.Is it causal? Why or why not?
Is there some way to get the ceteris paribus condition tohold?
Multivariate regression will help, but it is not a silver bullet.This will continuously be a challenge.
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 25 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
How to measure Causal Effect?
Ideal approach to measuring causality based on thescientific method.
Build two identical worldsVary one thing in the “experimental” world, but not in the“control” worldCompare outcomes between the two worlds. The differenceis the causal effect due to varying the one variable.
This works in many areas in the natural sciences, is thisfeasible for economic applications?
No, not really. No way to make two exact worlds.So, what can be done?
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 26 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Measuring Causal Effects in Economics
If the scientific method is not feasible in economics, how dowe move forward with understanding causality?Many ways, but the Randomized Control Trial (RCT) is thebest possible way.How does a RCT work?
1 Draw a large sample of individuals2 Randomly assign half to get a treatment and half to not get
it3 Compare outcomes between treated and control groups
How does randomization overcome the ceteris paribuscondition?
new heart drugBecoming more common in economics, but randomizationnot possible in most econ applications
Very expensiveOften unethical
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 27 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Econometrics
Seeks to uncover CAUSAL relationships betweenvariables.
Understand strength of relationship b/t variablesTest TheoriesPredict Aggregates (GDP, Inflation, etc)
Try to replicate random assignment approach, but for manyeconomic applications, cannot randomly assign many“treatments”
teen birth and outcomes of childrenmore education and earningsobesity and health
Standard comparisons of people in different treatmentgroups will not work. What to do?
Multivariate RegressionUse TheoryLook for “Natural Experiments”
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 28 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Using Theory to Organize Empirical Models
Economic theory helps frame question: what affects what?
People/firms/organizations are purposefulFirms maximize profitsPeople maximize happiness/utility
Economic agents maximize objectives subject toconstraints
Consumers have limited budgets, firms must paywages/rent to produce, etc.
Econ theory has strong ideas about what consumers andfirms choose and what affects those choices.
Comparative statics also “predicts” how behavior changeswhen parameters change.
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 29 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Theory of Demand
Model set up (Intermediate Micro)Consumers derive utility from consumption of 2 goods (x, y)
U = u (x, y)x = f (px, py, I);y = f (px, py, I)
Predictions from the model
How demand changes when prices rise(
∂Y∂py
)This model is very clear about what affects what, i.e. thedirection of causality. That’s a start.
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 30 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Building a Model
Before building a statistical model that will allow us topredict the changes in outcomes, we need to assume adirection of causation
What happens to quantity demanded when priceincreases?Hours of study impact gradesYears of education alter earnings ability
Our model will only accurately measure the impact of “x ony” if this assumption is correct
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 31 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Break Variables into 2 Groups
Exogenous (external conditions)
Constraints on behavior“Treatments” Factors that can be altered“Independent” variables
Endogenous outcomesChoice variablesOutcomes of systems“Dependent” variables
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 32 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Optimal Decision Making Guiding Empirical Analyses
In economics, all optimal decisions can be boiled down tomarginal thinking with economic agents taking all actionswhere the marginal benefit exceeds the marginal costs.
So, an empirical analysis of crime (a choice variable)should take into account:
the marginal benefits of crimethe marginal costs of crime (implicit and explicit costs)
What do you think should be included in each?
Gary Becker’s model of crime, has crime a function of:“Wage” for time spent in criminal activity, probability ofgetting caught, probability of getting convicted, penalty ifconvicted, income, wage for time spent in legal work, andage.
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 33 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
An Economic Example
Public finance economists are interested in the productivityof government spending
Two largest components of local spending are schools andpublic safety
Will hiring more police reduce crime?Let y = crime rate (crime per person)Let x = police employed per person
Interested in estimating the slope Dy/Dx how will crimechange when a city hires more police
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 34 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
An Economic Example
Collect data on a cross section of cities
61 cities with populations in excess of 250K
Estimate a simple linear modelYi = β0 + Xiβ1 + εi
In words: b1 will measure relationship between police percapita (X) and crime (Y)
Predict: What is the sign (+ or -) of β1?
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 35 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Police and Violent CrimePolice and Violent Crime
ECO375: Introduction to Econometrics Moving from Correlation 74 / 79Brian Phelan DePaul University ECO 375: Introduction to Econometrics 36 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Highest Violent Crime Rates
Does a positive coefficient mean that more police causesmore crime?Only if the ceteris paribus condition holds, which means...?
As you go from the cities with the most police to the leastpolice, nothing else about the city changes. Is that likely tobe true?
Who are the highest crime cities?Rank of Police
Crime Rate Force Size1 Detroit 102 Memphis 273 Baltimore 64 Nashville 25
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 37 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
USNWR, October 2009
Children fed candy and sweets on a daily basis are morelikely to be convicted of violent crimes as adults, a newstudy finds.
Researchers . . . . looked at data on 17,415 children born in1970 in the United Kingdom.
69% of those who committed violent acts also reportedeating candy daily at age 10 (vs. 42% of those who did notcommit a violent act)
What is the mechanism here?Is the ceteris paribus condition violated?
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 38 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
Basic Statistical Model: OLS
Ordinary least squares (OLS) regression
This model is the base for a vast majority of statisticalanalysis in social sciences
Highly stylized models with tremendous power
Power comes from assumptionsIf assumptions are wrong, model is wrong
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 39 / 40
Dependence Structure between Random VariablesMoving from Correlation to Causation
OLS: Problem
Can always estimate the basic modelYi = β0 + Xiβ1 + εi
But, this does not mean the estimate for β1 is any good (foreither model)
Biggest problem in empirical work is that people generateestimates – assume they are causal – when there aregood reasons to believe they are not
Brian Phelan DePaul University ECO 375: Introduction to Econometrics 40 / 40