Regression With Panel Data

89
Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-1 Panel Data Potential unobserved heterogeneity is a form of omitted variables bias. “Unobserved heterogeneity” refers to omitted variables that are fixed for an individual (at least over a long period of time). A person’s upbringing, family characteristics, innate ability, and demographics (except age) do not change.

Transcript of Regression With Panel Data

Page 1: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-1

Panel Data

• Potential unobserved heterogeneity is a form of omitted variables bias.

• “Unobserved heterogeneity” refers to omitted variables that are fixed for an individual (at least over a long period of time).

• A person’s upbringing, family characteristics, innate ability, and demographics (except age) do not change.

Page 2: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-2

Panel Data (cont.)

• With cross-sectional data, there is no particular reason to differentiate between omitted variables that are fixed over time and omitted variables that are changing.

• However, when an omitted variable is fixed over time, panel data offers another tool for eliminating the bias.

Page 3: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-3

Panel Data (cont.)

• Panel Data is data in which we observe repeated cross-sections of the same individuals.

• Examples:– Annual unemployment rates of each state over

several years

– Quarterly sales of individual stores over several quarters

– Wages for the same worker, working at several different jobs

Page 4: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-4

Panel Data (cont.)

• By far the leading type of panel data is repeated cross-sections over time.

• The key feature of panel data is that we observe the same individual in more than one condition.

• Omitted variables that are fixed will take on the same values each time we observe the same individual.

Page 5: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-5

Panel Data (cont.)

• Some of the most valuable data sets in economics are panel data sets.

• Longitudinal surveys return year after year to the same individuals, tracking them over time.

Page 6: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-6

Panel Data (cont.)

• For example, the National Longitudinal Survey of Youth (NLSY) tracks labor market outcomes for thousands of individuals, beginning in their teenage years.

• The Panel Survey of Income Dynamics provides high-quality income data.

Page 7: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-7

Example: Cross-Industry Wage Disparities

• A great puzzle in labor economics is the presence of cross-industry wage disparities.

• Workers of seemingly equivalent ability, in seemingly equivalent occupations, receive different wages in different industries.

• Do high-wage industries actually pay higher wages, or do they attract workers of unobservably higher quality?

Page 8: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-8

Cross-Industry Wage Differentials (cont.)

• Gibbons and Katz (Review of Economic Studies 1992) exploited panel data to explore these differentials.

• They observed workers in 1984 and 1986.

• They focused on workers who lost their 1984 jobs because of plant closings (on the grounds that plant closings are unlikely to be correlated with an individual worker’s abilities). They looked only at workers who were re-employed by 1986.

Page 9: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-9

Cross-Industry Wage Differentials (cont.)

• Gibbons and Katz estimated wages as

where Xkit are demographic variables, and Dit are a set of dummy variables for being employed in different industries

_11 1 1

_

ln ..

..

Industryit o it k kit it

Industry mm it it

w X X D

D

Page 10: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-10

Cross-Industry Wage Differentials

• Estimating with simple OLS, Gibbons and Katz estimate ’s that are very similar to other estimates of cross-industry wage differentials.

_11 1 1

_

ln ..

..

Industryit o it k kit it

Industry mm it it

w X X D

D

Page 11: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-11

Cross-Industry Wage Differentials (cont.)

• Gibbons and Katz speculated that any unmeasured ability is fixed over time and equally rewarded in all industries.

• Differencing the 1986 and 1984 observations eliminated the vi

_11 1 1

_

ln ..

..

Industryit o it k kit it

Industry mm it i it

w X X D

D v

Page 12: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-12

Cross-Industry Wage Differentials (cont.)

_11986 1 1 1986 1986 1 1986

_1986 1986

_11984 1 1 1984 1984 1 1984

_1984 1984

ln ..

..

ln ..

..

ln

Industryi o i k ki i

Industry mm i i i

Industryi o i k ki i

Industry mm i i i

i

w X X D

D v

w X X D

D v

w

_11 1 1

_

..

..

Industryi k ki i

Industry mm i i

X X D

D

Page 13: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-13

Cross-Industry Wage Differentials (cont.)

• The estimated industry coefficients from the differenced equation are about 80% of the estimated industry coefficients from the levels equation.

• Unobserved worker ability appears to explain relatively little of the cross-industry wage differentials.

Page 14: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-14

A Panel Data DGP

0 1 1 2 2 3 3

2

' '

..1... ; 1...

( ) 0

( )( ) 0 ' '( ) 0 , ,

if OR for all

it i it i t K Kit it

it

it

it i t

jit it

Y X X X Xi n t T

E

VarE i i t tE X j i t

Page 15: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-15

Panel Data DGPs

• Notice that when we have panel data, we index observations with both i and t.

• Pay close attention to the subscripts on variables.

• Some variables vary only across time or across individual.

Page 16: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-16

Panel Data DGP’s (cont.)

0 1 1 2 2 3 3

2

2

3

..1... ; 1...

For example, varies only by individual, and is fixed over time. might be a variable such as race or gender.

varies only by t

it i it i t K Kit it

i

i

t

Y X X X Xi n t T

XX

X

3

1

1

ime, and is fixed across . might be national unemployment.

varies across BOTH individual and time. For example, might refer to wages.

t

it

it

iX

XX

Page 17: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-17

Panel Data DGP’s (cont.)

0 1 1 2 2 3 3

0

..1... ; 1...

.

One of the key features of the DGP is that we allow each individual to have a distinct intercept This intercept includes ALL

it i it i t K Kit it

i

Y X X X Xi n t T

i

aspects of unobserved heterogeneity that are fixed over the length of the panel.

Page 18: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-18

Panel Data DGP’s (cont.)

• In this DGP, the 0i are fixed across samples.

• The unmeasured heterogeneity is the same in every sample.

• This DGP is called the “Distinct Intercepts” DGP.

• It is suitable for panels of states or countries, where the same individuals would be selected in each sample.

Page 19: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-19

Panel Data DGP’s (cont.)

• With longitudinal data on individual workers or consumers, we draw a different set of individuals from the population each time we collect a sample.

• Each individual has his/her own set of fixed omitted variables.

• We cannot fix each individual intercept.

Page 20: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-20

Another Panel Data DGP

0 1 1 2 2 3 3

2

' '

2'

'

..1... ; 1...

( ) 0 ( )

( ) 0 ' ' ( ) 0

( ) 0 ' ( )( ) 0 , ',( ) 0 , ,

if OR

for for all for all

E

it it i t K Kit i it

it it

it i t i

i i i v

it i

jit it

Y X X X X vi n t T

E Var

E i i t t E v

E v v i i Var vE v i i tE x j i t

( ) 0 , ,

( ) 0 , ,

ITHER for all

OR for at least some jit i

jit i

E X v j i t

E X v j i t

Page 21: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-21

Panel Data DGP’s

• In this DGP, we return to a model with a single intercept for all data points, 0

• However, we break the error term into two components:

• When we draw an individual i, we draw one vi that is fixed for that individual in all time periods.

• vi includes all fixed omitted variables.

it i itv

Page 22: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-22

Panel Data DGP’s (cont.)

• In the Distinct Intercepts DGP, the unobserved heterogeneity is absorbed into the individual-specific intercept 0i

• In the second DGP, the unobserved heterogeneity is absorbed into the individual fixed component of the error term, vi

• This DGP is an “Error Components Model.”

Page 23: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-23

Panel Data DGP’s (cont.)

• The Error Components DGP comes in two flavors, depending on .

• If , then the unobserved heterogeneity is uncorrelated with the explanators.

• OLS is unbiased and consistent.

( )jit iE X v

( ) 0jit iE X v

Page 24: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-24

Panel Data DGP’s (cont.)

• If , then the unobserved heterogeneity IS correlated with the explanators.

• OLS is BIASED and INCONSISTENT.

( ) 0jit iE X v

Page 25: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-25

Panel Data DGP’s (cont.)

• Panel data is most useful in the second Error Components case.

• When , OLS is inconsistent.

• Using panel data, we can create a consistent estimator: Fixed Effects.

( ) 0jit iE X v

Page 26: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-26

Fixed Effects (Chapter 16.2)

• The Fixed Effects Estimator

• Used with EITHER the distinct intercepts DGP OR the error components DGP with

• Basic Idea: estimate a separate intercept for each individual

( ) 0ijt iE X v

Page 27: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-27

Fixed Effects (cont.)

• The simplest way to estimate separate intercepts for each individual is to use dummy variables.

• This method is called the least squares dummy variable estimator.

Page 28: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-28

Fixed Effects (cont.)

• We have already seen that we can use dummy variables to estimate separate intercepts for different groups.

• With panel data, we have multiple observations for each individual. We can group these observations.

Page 29: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-29

Fixed Effects (cont.)

• Least Squares Dummy Variable Estimator:

1. Create a set of n dummy variables, D j, such that D j = 1 if i = j.

2. Regress Yit against all the dummies, Xt , and Xit variables (you must omit Xi variables and the constant).

Page 30: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-30

Fixed Effects (cont.)

• The LSDV estimator is conceptually quite simple.

• In practice, the tricky parts are:

1. Creating the dummy variables

2. Entering the regression into the computer

3. Reporting results

Page 31: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-31

Fixed Effects (cont.)

• Suppose we have a longitudinal dataset with 300 workers over 10 years.

• n = 300

• We must create 300 dummy variables and then specify a regression with 300+explanators.

• How do we do this in our software package?

Page 32: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-32

Fixed Effects (cont.)

• Our regression output includes 300 intercepts.

• Usually, we are not interested in the intercepts themselves.

• We include the dummy variables to control for heterogeneity.

Page 33: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-33

Fixed Effects (cont.)

• In reporting your regression output, it is preferable to note that you have included “individual fixed effects.”

• Then omit the dummy variable coefficients from your table of results.

Page 34: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-34

Fixed Effects (cont.)

• At some point, n becomes too large for the computer to handle easily.

• Modern computers can implement LSDV for ever larger data sets, but eventually LSDV becomes computationally intractable.

Page 35: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-35

Fixed Effects (cont.)

• A computationally convenient alternative is called the Fixed Effects Estimator.

• Technically, only this strategy is “Fixed Effects;” using dummy variables is LSDV.

• In practice, econometricians tend to refer to either method as Fixed Effects.

Page 36: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-36

Fixed Effects (cont.)

• The initial insight for the Fixed Effects estimator: if we DIFFERENCE observations for the same individual, the vi cancels out.

Page 37: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-37

Fixed Effects (cont.)

0 1 1 2 2

' 0 1 1 ' 2 2 '

' 1 ' '( ) 0 ( ) 0 0

it it i i it

it it i i it

it it it it it it

Y X X vY X X v

Y Y X X

Page 38: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-38

Fixed Effects (cont.)

• When we difference, the heterogeneity term vi drops out.

• (In the distinct intercepts model, the 0i would drop out).

• By assumption, the it are uncorrelated with the Xit

• OLS would be a consistent estimator of 1

Page 39: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-39

Fixed Effects (cont.)

• If T = 2, then we have only 2 observations for each individual (as in the Gibbons and Katz example).

• Differencing the 2 observations is efficient.

• If T > 2, then differencing any 2 observations ignores valuable information in the other observations for each individual.

Page 40: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-40

Fixed Effects (cont.)

• We can use all the observations for each individual if we subtract the individual-specific mean from each observation.

Page 41: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-41

Fixed Effects (cont.)

0 1 2

0 1 2

1

1

( ) 0 ( ) 0 0

1

1 1

where

Note:

it it i i it

i i i i i

it i it i it i

T

i itt

i i i

Y X X v

Y X X v

Y Y X X

Y YT

v n v vn n

Page 42: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-42

Fixed Effects (cont.)

1

Fixed Effects:

1) Construct

2) Regress

FEit it i

FEit it i

FE FEit it it

y Y Y

x X X

y x

Page 43: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-43

Fixed Effects (cont.)

• The Fixed Effects and DVLS estimators provide exactly identical estimates.

• The n-T-k term of the Fixed Effects e.s.e.’s must be adjusted to account for the extra n degrees of freedom that have been used.

• The computer can make this adjustment.

Page 44: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-44

Fixed Effects (cont.)

• Demeaning each observation by the individual-specific mean eliminates the need to create n dummy variables.

• FE is computationally much simpler.

Page 45: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-45

Fixed Effects (cont.)

• Fixed Effects (however estimated) discards all variation between individuals.

• Fixed Effects uses only variation over time within an individual.

• FE is sometimes called the “within” estimator.

Page 46: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-46

Fixed Effects (cont.)

• Fixed Effects discards a great deal of variation in the explanators (all variation between individuals).

• Fixed Effects uses n degrees of freedom.

• Fixed Effects is not efficient if

• Could we use OLS?

( ) 0it iE X v

Page 47: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-47

Checking Understanding (cont.)

0 1

2

' '

2

'

'

( ) 0 ( )

( ) 0 ' '

( ) 0 ( )( ) 0 ' ( ) 0 ,( ) 0 , ', ( ) 0 ,

if OR

for for all for all for all

Question: is OLS cons

it it i it

it it

it i t

i i v

i i it i

it i it it

Y X v

E Var

E i i t t

E v Var vE v v i i E x v i tE v i i t E X i t

istent and efficient?

Page 48: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-48

Checking Understanding (cont.)

• Because X is uncorrelated with either v or , OLS is consistent in the uncorrelated version of the error components DGP.

• The error terms are homoskedatic.

2 2( ) ( ) it i it vVar Var v

Page 49: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-49

Checking Understanding (cont.)

• However, the covariance between disturbances for a given individual is

' '

2'

2 2

2'

' 2 2

( , ) ([ ] [ ])

( ) 2 ( ) ( )

( )

( , )( , )( )

it it i it i it

i i it it it

i v

it it vit it

it v

Cov E v v

E v E v E

E v

CovCorrVar

Page 50: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-50

Checking Understanding (cont.)

• In the presence of serial correlation, OLS is inefficient.

Page 51: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-51

Random Effects

• When unobserved heterogeneity is uncorrelated with explanators, panel data techniques are not needed to produce a consistent estimator.

• However, we do need to correct for serial correlation between observations of the same individual.

Page 52: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-52

Random Effects (cont.)

• When , panel data provides a valuable tool for eliminating omitted variables bias. We use Fixed Effects to gain the benefits of panel data.

• When , panel data does not offer special benefits. We use Random Effects to overcome the serial correlation of panel data.

( ) 0it iE X v

( ) 0it iE X v

Page 53: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-53

Random Effects (cont.)

• The key idea of random effects:

– Estimate v2 and

2

–Use these estimates to construct efficient weights of panel data observations

Page 54: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-54

Random Effects (cont.)

2

1 1 12

2

:

1

( 1)

1) Estimate the regression using Fixed Effects.2) Construct Fixed Effects residuals,

3) Estimate

4) Estimate the regression using OLS

5) Estimate

it

T n T

it it i

u

u uTs

n T k

s

2 2 2

2 2 2

:

as usual

6) Because

v

vs s s

Page 55: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-55

Random Effects (cont.)

• Once we have estimates of v2 and

2, we can re-weight the observations

optimally.

• These calculations are complicated, but most computer packages can implement them.

Page 56: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-56

Example: Production Functions (Chapter 16.2)

• We have data from 625 French firms from 16 countries for 8 years.

• We wish to estimate a Cobb–Douglas production function:

• Taking logs:

• We estimate using random effects.

1 20i i i iQ L K

0 1 2ln( ) ln( ) ln( ) ln( ) ln( )i i i iQ L K

Page 57: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-57

TABLE 16.1 Random Effects Estimation of a Cobb–Douglas Production Function for a Sample of Manufacturing Firms

Page 58: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-58

Example: Production Functions

• The estimated coefficients of 0.30 for capital and 0.69 for labor are similar to estimates using US data.

• We also get similar results using fixed effects estimation.

Page 59: Regression With Panel Data

TABLE 16.2 Fixed Effects Estimation of a Cobb–Douglas Production Function for a Sample of Manufacturing Firms

Page 60: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-60

Example: Production Functions

• We arrive at similar estimates using either random effects or fixed effects.

• Because only fixed effects controls for unobserved heterogeneity that is correlated with the explanators, the similarity between the two estimates suggests that unobserved heterogeneity is not creating a large bias in this sample.

Page 61: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-61

Example: Production Functions (cont.)

• The fixed effects estimator discards all variation between firms, and must use 624 more degrees of freedom than random effects.

• Moving from RE to FE increases the e.s.e. on capital from 0.0116 to 0.0145

• The e.s.e. on labor moves from 0.0118 to 0.0132

Page 62: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-62

Example: Production Functions (cont.)

• The RE estimator provides more precise estimates

• We would prefer to use RE instead of FE.

• However, RE might be inconsistent if

• We need a test to help determine whether it is safe to use RE.

( ) 0it iE X v

Page 63: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-63

The Hausman Test (Chapter 16.4)

• Hausman’s specification test for error components DGPs provides guidance on whether

• The key idea: if , then the inconsistent RE estimator and the consistent FE estimator converge to different estimates.

( ) 0it iE X v

( ) 0it iE X v

Page 64: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-64

The Hausman Test (cont.)

• If , then the unobserved heterogeneity is uncorrelated with X and does not create a bias.

• RE and FE are both consistent.

• For two consistent estimators to provide significantly different estimates would be surprising.

( ) 0it iE X v

Page 65: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-65

The Hausman Test (cont.)

• We know the FE estimator is consistent even when

• The problem with FE is its inefficiency.

• FE is not as precise as RE.

• Although FE is imprecise, it may provide a good enough estimate to detect a large bias in RE.

( ) 0it iE X v

Page 66: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-66

The Hausman Test (cont.)

• If FE is very imprecise, then the Hausman test has very weak power and cannot rule out even large biases.

• If FE is very precise, then the Hausman test has very good power, but we gain little benefit from switching to the more efficient RE.

Page 67: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-67

The Hausman Test (cont.)

• If FE is somewhat precise, then the Hausman test can warn us away from using RE in the presence of a large bias, but there is still room for substantial efficiency gains in switching to RE.

• With the French manufacturing firms, FE is precise enough to reject the null even though the two estimates are fairly close.

Page 68: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-68

TABLE 16.4 Hausman Specification Test of French Firms’ Error Components’ Correlation with Explanators

Page 69: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-69

The Hausman Test

• Warning: fixed effects exacerbates measurement error bias.

• There is likely to be less variation in X within the experience of a single individual than across several individuals.

• Small measurement errors can become large relative to the within-variation in X.

Page 70: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-70

The Hausman Test (cont.)

• The Hausman Test warns us that RE and FE provide significantly different estimates.

• This difference could arise because of omitted variables bias in RE, caused by

• This difference could ALSO arise because of measurement error biases in FE.

E( X itvi ) 0

Page 71: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-71

Review

• Potential unobserved heterogeneity is a form of omitted variables bias.

• “Unobserved heterogeneity” refers to omitted variables that are fixed for an individual (at least over a long period of time).

• A person’s upbringing, family characteristics, innate ability, and demographics (except age) do not change.

Page 72: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-72

Review (cont.)

• Panel Data is data in which we observe repeated cross-sections of the same individuals.

• By far the leading type of panel data is repeated cross-sections over time.

Page 73: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-73

Review (cont.)

• The key feature of panel data is that we observe the same individual in more than one condition.

• Omitted variables that are fixed will take on the same values each time we observe the same individual.

Page 74: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-74

Review (cont.)

• We learned 3 different DGP’s for panel data.

• In the distinct intercept DGP, across samples we would observe the same individuals with the same unobserved heterogeneity.

• Each i has its own intercept, 0i, that is fixed across samples.

Page 75: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-75

A Panel Data DGP

0 1 1

2

' '

..1... ; 1...

( ) 0

( )( ) 0 ' '( ) 0 , ,

if OR for all

it i it K Kit it

it

it

it i t

jit it

Y X Xi n t T

E

VarE i i t tE X j i t

Page 76: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-76

Review

• Error components DGP’s are suitable when we would draw different individuals across samples.

• When each i is drawn, its unobserved heterogeneity is captured in a vi term.

• We learned two error components DGP, depending on whether the vi is correlated with the Xkit ’s.

Page 77: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-77

Another Panel Data DGP

0 1 1 2 2 3 3

2

' '

2'

'

..1... ; 1...

( ) 0 ( )

( ) 0 ' ' ( ) 0

( ) 0 ' ( )( ) 0 , ',( ) 0 , ,

if OR

for for all for all

E

it it i t K Kit i it

it it

it i t i

i i i v

it i

jit it

Y X X X X vi n t T

E Var

E i i t t E v

E v v i i Var vE v i i tE x j i t

( ) 0 , ,

( ) 0 , ,

ITHER for all

OR for at least some jit i

jit i

E X v j i t

E X v j i t

Page 78: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-78

Review

• If , OLS would be inconsistent

• By estimating a separate intercept for each individual, we can control for the vi

• We learned two equivalent strategies: DVLS and FE.

E( X itvi ) 0

Page 79: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-79

Review (cont.)

• The simplest way to estimate separate intercepts for each individual is to use dummy variables.

• This method is called the least squares dummy variable estimator.

Page 80: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-80

Review (cont.)

• Least Squares Dummy Variable Estimator:

1. Create a set of n dummy variables, D j , such that D j = 1 if i = j

2. Regress Yit against all the dummies, Xt , and Xit variables (you must omit Xi variables and the constant)

Page 81: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-81

Review (cont.)

1

Fixed Effects:

1) Construct

2) Regress

FEit it i

FEit it i

FE FEit it it

y Y Y

x X X

y x

Page 82: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-82

Review (cont.)

• Demeaning each observation by the individual-specific mean eliminates the need to create n dummy variables.

• FE is computationally much simpler.

Page 83: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-83

Review (cont.)

• Fixed Effects (however estimated) discards all variation between individuals.

• Fixed Effects uses only variation over time within an individual.

• FE is sometimes called the “within” estimator.

Page 84: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-84

Review (cont.)

• Because X is uncorrelated with either v or , OLS is consistent in the uncorrelated version of the error components DGP.

• OLS disturbances are homoskedatic. Var( it ) Var(vi it ) v

2 2

Page 85: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-85

Review (cont.)

• However, the covariance between disturbances for a given individual is

' '

2'

2 2

2'

' 2 2

( , ) ([ ] [ ])

( ) 2 ( ) ( )

( )

( , )( , )( )

it it i it i it

i i it it it

i v

it it vit it

it v

Cov E v v

E v E v E

E v

CovCorr

Var

Page 86: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-86

Review (cont.)

• When unobserved heterogeneity is uncorrelated with explanators, panel data techniques are not needed to produce a consistent estimator.

• However, we do need to correct for serial correlation between observations of the same individual.

Page 87: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-87

Review (cont.)

• When , panel data provides

a valuable tool for eliminating omitted variables bias. We use Fixed Effects to gain the benefits of panel data.

( ) 0it iE X v

Page 88: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-88

Review (cont.)

• When , panel data is less convenient than an equal-sized cross-sectional data set. We use Random Effects to overcome the serial correlation of panel data.

E( X itvi ) 0

Page 89: Regression With Panel Data

Copyright © 2006 Pearson Addison-Wesley. All rights reserved. 24-89

Review (cont.)

• The key idea of random effects:

– Estimate v2 and

2

–Use these estimates to construct efficient weights of panel data observations