Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa...

22
Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006

Transcript of Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa...

Page 1: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Estimating Lifetime Earnings Distributions Using Copulas

Lorraine Dearden

Emla Fitzsimons

Alissa Goodman

Greg Kaplan

ESRC Methods Festival

19 July 2006

Page 2: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Motivation

1. To illustrate the use of copulas as a tool for modeling earnings dynamics

2. Example of an application:– estimating the distribution of lifetime

earnings for graduates and non-graduates in the UK

Page 3: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Modelling Earnings Dynamics

Approach 11. Specify a dynamic model for

earnings e.g.

2. Estimate paramers – usually requires T>=3 for identification

3. Simulate earnings paths from estimated model

Approach 21. Estimate conditional cross-

sectional distributions directly:

2. Estimate a mobility process, e.g. with transition matrices

3. Simulate earnings paths using cross-sectional distributions and assumed mobility process

Copula methodology is a formalization of approach 2

, , ,

, , 1 ,

log i a i i a i a

i a i a i a

y z

z z u

1{ (log | )}Ta aF y X

Page 4: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Background on Copulas

• Consider 2 RV’s (X,Y) with continuous joint CDF F(x,y)• Sklar’s Theorem (1959) allows us to decompose joint

distribution into a unique function of marginal distributions:

F(x,y) = C(FX(x), Fy(y)) where C:[0,1]2 [0,1]

• A simple example:– Independence Copula: C(u,v) = uv

• Think of joint distribution as being decomposed into:– Elements relating only to marginal distributions– Elements relating to dependence structure (copula)

Page 5: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

More on Copulas

• In terms of densities:

• Decomposition generalizes to any n-dimensional distribution

• Likelihood function for data from {(xi, yi) }i=1..N

1 1 1

log ( ) log ( ) log ( ( ), ( ))N N N

i i i ii i i

f x f y c F x F y

( , ) ( ) ( ) ( ( ), ( ))X Y X Yf x y f x f y c F x F y

Page 6: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Why copulas?

1. Technique for generating rich dependence structures with few parameters and short panels

2. Allows for marginals to be estimated nonparametrically so that estimation of dependence structure is robust to misspecification of marginals

3. Allows for combination of panel and cross-sectional data, which each type of data employed where it is best suited

4. Allows us to model non-linearities in mobility, such as tail dependence. The degree of linear correlation in wages can vary across the cross-sectional age-specific distribution

Page 7: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

A Copula Model For Earnings

• Aim is to obtain reliable estimate of X-sectional distribution of vector of residuals w={wa}, a=1…T.

• Denote conditional density f(w|X)

where a is lab mkt experience;

T is total # years in labour force;

gender, graduate status X1

, ,log ,i a i i ay g a X w

Page 8: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Copula Decomposition of Earnings

• By copula decomposition, we can write density of vector of earnings residuals as

• Assume a first-order Markov process for :

1

( | ) | , ,..., | , | | ,T

a T aa

f w X c F w a X F w T X X f w a X

1

1 1 1 1 1 11 1

( | ) | , , | 1, | , | ,T T

a a aa a

f w X c F w a X F w a X a X f w a X

1{ }Ta aw

Page 9: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Parametric Assumptions

1. Marginal Distributions– Estimate nonparametrically

2. Copula Function1. Gaussian copula: dependence structure implicit in a bivariate

normal dist• 1 parameter: rho = rank correlation

• Tail dependence (correlation in extremes of dist) approaches 0

• Linear correlation only

2. T-copula: depedence structure implicit in a a bivariate t-distiribution

• 2 parameters: rho + df (degrees of freedom)

• df governs additional dependence in tails of the distribution

Page 10: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

t-copula vs Gaussian copula

Rank correlation is equal in both samples (0.75)

Page 11: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Estimation Procedure

3 stages:1. First stage regression to estimate g(a,X) and generate the

vectors of residuals,

2. Use re-scaled ECDFS to estimate

3. Maximize likelihood with respect to copula parameters

• Estimate separately for males, females, graduates and non-graduates

• g(.) controls for race, region and time effects• All copula parameters allowed to vary by labor market experience• Consistent but not efficient.

, , , 1 , 1 ,1 1

ˆ ˆˆ ˆlog ( ( | ), ( | ) | )T N

i a i a i a i a i aa i

c F w X F w X X

1ˆ ˆ{ ( | )}Ta aF w X

1ˆ{ }Ta aw

Page 12: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Data Sources

(1) Labour Force Survey (LFS): quarterly survey; each hh interviewed for 5 successive qrtrs since spring 1992

Before spring 1997: earnings data in 5th qrtr only Afterwards: earnings data in 1st & 5th qrtrs

(2) British Household Panel Survey (BHPS): annual survey; ongoing panel since 1991

Up to 11 years of earnings obs (to 2001)

Page 13: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Sample Selection

• Sample of 19-60 yr old employees• Trim earnings values above (below) 99th (1st) percentile

for each group• Use residuals from regression of log real weekly wages

on dummies for year, race, region of residence • Convert residuals to annual units

Page 14: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Choosing a copula

1. Pseudo-Likelihood Based Test - Chen and Fan (2005)• Accounts for estimation error in marginal distribution, non-nested

models and fact that none of models under consideration is true DGP

2. Aikaike Information Criterion

t stat p-value

Male Graduates 21.66 0.00Male Non-graduates 30.13 0.00Female Graduates 22.22 0.00Female Non-graduates 35.46 0.00

T-Copula Gauss

Male Graduates 1.77 1.28Male Non-graduates 1.29 1.01Female Graduates 1.78 1.37Female Non-graduates 1.77 1.39

Page 15: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Parameter Estimates

Page 16: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Simulations & Employment Transitions

Dist of lifetime earnings is specified by piecing together sequences of bivariate dists 10,000 draws of earnings paths are made from this lifetime dist (separately

by group)

Moreover, we incorporate labour market mobility into these trajectories

Mobility parameters obtained from 2 employment transition models

(1) Prob of becoming employed at age a given ue at a-1(2) Prob of becoming unemployed at age a given emp at a-1

each estimated using up to 11 years of BHPS data

Also estimate re-entry rank upon re-employment

Page 17: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Lifetime Earnings Distributions0

.01

.02

.03

.04

.05

Pro

po

rtio

n

0 1 2 3Total lifetime earnings, £millions

Males

0.0

1.0

2.0

3.0

4.0

5

Pro

po

rtio

n

0 1 2 3Total lifetime earnings, £millions

Graduates

Non-Graduates

Females

Page 18: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Copula vs ARMA models

Copula framework v flexible and general – a/c for tail dependence– different dependence in different parts of dist; allow this to vary

by age and other X’s– non-parametric marginal distributions

…but at expense of not explicitly allowing for unobserved heterogeneity

Natural to ask whether in moving to copula model, we are

neglecting important features of dynamic process And what features we capture that a linear model can not

Page 19: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Comparison of Models

Strategy 11. Estimate rich ARMA model on BHPS2. Estimate same ARMA model on data simulated from

copula model

Strategy 21. Estimate copula model on BHPS 2. Estimate copula model on data simulated from ARMA

model

• Use BHPS because 3 periods required for identification of ARMA model

Page 20: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Linear Model of Earning Dynamics

, , ,

, , 1 ,

2 2 2, ,~ 0, , ~ 0, , ~ 0,

i t i i t i t

i t i t i t

i i t i t u

w z

z z u

N N u N

Copula Sims BHPS Data Copula Sims BHPS Datarho 0.963 0.931 0.907 0.928V(fixed effect) 0.056 0.077 0.018 0.143V(persistent 0.011 0.041 0.026 0.050V(transitory) 0.027 0.039 0.018 0.040

Graduates Non-Graduates

Page 21: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Copula Estimates on Linear Simulations

• t-copula and gauss give same likelihood – df maximsed at +Inf

• Gaussian ARMA model can not capture the non-linear dependence structure in wages

ModelT-Copula Gauss T-Copula Gauss

Male Graduates 5218 5220 1.19 1.19Male Non-graduates 7001 7003 1.37 1.37

Maximized Likelihood Aikaike Information

Page 22: Estimating Lifetime Earnings Distributions Using Copulas Lorraine Dearden Emla Fitzsimons Alissa Goodman Greg Kaplan ESRC Methods Festival 19 July 2006.

Conclusions and Summary

• Illustrated the use of copulas in modelling earnings dynamics

• Highlighted departures of copula based models from traditional linear ARMA models

• Provided and example of an application for constructing lifetime earnings distributions