Cross Sectional and Panel Data II

CROSS SECTIONAL AND PANEL DATA IIPaul Gompers

Harvard Business School

February 26, 2009

TODAY

Look at a variety of papers that examine panel data.

Start with methodological papers. Differences in differences is common approach to

estimated an effect in panel data when there is an “exogenous” intervention and a contemporaneous non-intervention group.

Look at some of the statistical issues that may plague these tests.

TODAY

A variety of natural experiments that can be viewed as “exogenous” events. Election cycles Lawsuits Oil price shocks State law passage

Key point from these papers. All of them examine a particular issue in a panel

data set. Find an intervention that allows the researcher to

look at pre and post behavior of the firms to answer an economic question.

HOW MUCH SHOULD WE TRUST DIFFERENCES IN DIFFERENCES ESTIMATES?Bertrand, Dufo, and Mullainathan

QJE 2004

MOTIVATION

Lots of papers try to control for events by comparing changes in one group that has been affected by an event to a contemporaneous group has not been affected.

Essentially, the non-affected group becomes the control for the affected group.

Look at differences in the differences between these groups.

Most papers focus solely on whether the intervention is endogenous

Standard errors may be inconsistent.

PROBLEMS

DD estimates are usually OLS run in repeated cross sections.

Potential for significant serial correlation in residuals of OLS regression. DD estimates usually use long time series. Dependent variables in DD are typically highly

positively serially correlated. Treatment variables typically changes very little

within a state over time.

PAPER’S APPROACH

Create a series of placebo laws and look at employment data on female wages. Laws are not real. They randomly assign years

and states for a test. Would only expect significant relationship

between intervention (i.e., the placebo law) and female real wages 5% of the time at typical significance levels.

Also create Monte Carlo simulation. Compare a variety of fixes to deal with the

serial correlation in residuals.

SURVEY OF THE LITERATURE

Looked at 92 papers published in six journals from 1990 to 2000 that utilized differences in differences estimation.

Table I. Review of papers that use DD.

PLACEBO TESTS

Utilize CPS and a sample of women’s wages Table II Look at 1979 to 1999 540,000 reports and (50*21-1050) state year

observations Do 200 independent observations of placebo

laws. Also, because there are only 50 actual states in

the real sample, perform a Monte Carlo simulation where the “states” are generated from the state-level empirical distribution from the CPS with replacement.

Look at Type I and Type II error (inducing a true 2% effect)

PLACEBO TESTS

Look at effect of sample size and time series length of panel. Small T reduces problem of the Type I error. Table III

CORRECTIONS

Look at a variety of parametric corrections To adjust for serial correlation, typical to set an

AR(1) structure to residuals. Table IV.

Typical corrections don’t do very well.

BLOCK BOOTSTRAPPING

Correction for dealing with autocorrelation structure.

Keep all observations from the same state together.

Construct a bootstrap sample by drawing with replacement 50 matrices using the entire time series of observations for each state (Ys,Vs) Run the regression of Y on V. Draw large number of bootstrap samples (200).

Table V. Works well when N is large.

SIMPLE METHOD

Aggregate time series into two periods. Works reasonable well. If all laws passed same time, can do simple

aggregation into average before and after intervention.

If interventions are at different times: Run regressions without intervention variables. Take residuals for intervention states before and after

intervention then average them before and after. Table VI.

With small N, works okay. As N increases, works better. Downside is the power of Type II tests decreases.

EMPIRICAL VARIANCE-COVARIANCE MATRIX

Can estimate the Var-covar matrix by assuming that it is block diagonal with each state having the same autocorrelation process.

Use the states to estimate the Var-covar matrix.

Table VII. Works well with large N. Does poorly with small.

CONCLUSION

Need to worry about serial correlation within the DD sample.

A number of techniques to deal with problem.

Based on the structure of your panel, a different approach may be appropriate.

FIXING MARKET FAILURES OR FIXING ELECTIONS? AGRICULTURAL CREDIT IN INDIA

Shawn Cole

MOTIVATION

India nationalized banks “Reduce poverty, encourage growth”

Private banks controlled by industrial houses

Farmers victim to rural money-lenderIndustrial credit requires co-ordination

At a potential cost?

Inefficiency (lazy bureaucrats)Capture of banks by politicians

...don't worry, I spoke to banks for loans, told agriculturalists for seeds, discussed with firms for fertilisers, ordered weather dept to ensure a good monsoon!

THIS PAPER Provide micro foundations for observed cross-country relationships between bank ownership and performance

Test theories of capture Credit follows election year cycles

Agricultural credit is up to 20% higher in election years than non-election years

Rule out other causes for cycles Captured credit is used strategically targeted

Cycles are twice as large in “swing districts” than in non-swing districts

No evidence politicians reward their supporters Measure costs of distortions:

Default rates are higher in election years The marginal political loan has no measurable effect on

agricultural output Even the average loan does not affect agricultural investment

SETTING Banking in India

In 1969, and 1980, government nationalized largest banks Branch expansion law vastly increases scale and scope of

banking Currently more than 60,000 bank branches Most banks government-owned (ca. 90% assets)

Government intervention increases importance of agricultural lending

Nationalized banks lend more to agriculture All banks required to lend minimum percentage to agriculture

Agricultural credit is important Government banks lend substantial amounts to

agriculture 17% of value of portfolio 40% of loans, or over 20m loans

Private banks lend (less) to agriculture Agriculture is important

60% of labor force works in agriculture 24% of GDP

POLITICIZATION OF LENDING

Politicians promise agricultural credit prior to elections

Informal interaction between bankers and politicians

“State Level Bankers Committees” Comprised of political appointees and bankers Quarterly meetings to monitor lending Staff turns over when government changes

DATA Panel of 412 districts in 19 states, annual

data 1992-1999 Credit Data

“Basic Statistical Returns,” Reserve Bank of India Census of loans Identifies bank, loan amount, location, industry,

interest rate, repayment status Elections Data

Election Commission of India Constituency-level results, aggregated to district-level 32 elections in 19 states

Output Data Planning Commission of India

POLITICAL CYCLES: EMPIRICAL STRATEGY

• “Naïve” OLS: Is credit higher in election years?

• But, elections may be called early by optimizing politicians. Use constitutional schedule to create an instrument for election year

• Stronger test - impose structure of entire election cycle

POLITICAL CYCLES OLS:

District fixed effect: d

Region-year fixed effect: rt Dummy for election year: Est: (Rain in district as additional control): Raindt

Comparing districts in Gujarat, when Gujarat is having an election, to districts in Maharashtra, when Maharashtra is not having an election

Ydt = drtRaindt + Estdt

OLS: EFFECT OF ELECTIONS ON TOTAL CREDIT Table 2

Examine credit by type of bank. Use a variety of estimation techniques.

Sumstat

POLITICAL CYCLES Election cycles in a state are required every five

years Elections may be called early (10 of 32 in sample

are called early) Instrument: a dummy for whether it is a

scheduled election (Khemani, 2004) Does not take credit for elections called during

“booms”

POLITICAL CYCLES: LESSONS

In election years, level of agricultural credit from public sector banks is 6% higher than in non-election years.

No differences for non-agricultural credit (precise)

No differences for private banks (imprecise) Table 3 – Examines by year and loan type

and bank type.

-.2

-.1

0.1

.2Lo

g A

gric

ultu

ral C

red

it

0 -4 -3 -2 -1 0 -4 -3 -2 -1 0 -4 -3 -2 -1 0 -4 -3 -2 -1Years Until Election

Cycles in Agricultural Credit from Public Banks

POLITICAL CYCLES-CONCLUSION Public sector banks increase agricultural credit

by approximately 5-8 percentage points during elections

Private banks too small to “undo” cycle Amount dwarfs legal campaign spending limits

Average constituency has roughly ~$1.4m in agricultural credit from public sector banks

5 percent of $1.4m is $70,000 Implied Subsidy:

Lower interest rate: 3%=$2,100 Outside option (moneylender) or nonpayment: 51-100%, $70,000

Total spending on state elections by candidate is limited to $1,000-$14,000

TARGETED ALLOCATION• Competing theories of targeted allocation of resources:

– Politically close areas, to win elections

– Areas which support majority party (patronage)

• Need measure of the local strength of state governing party (SGP) in previous election– Define SGP as party with more than 50% of seats, or member of

ruling coalition

– Define Margin of Victory in a constituency (Mcdst) as

• Share of votes of SGP minus share of votes of next strongest challenger

• 100% if SGP candidate runs unopposed• –(Share of Winner) if SGP does not field a candidate

MEASURE OF POLITICAL SUPPORT Aggregation

Test for PatronageCredit targeted to areas in which party enjoys more support

Mdst= District average of Mcdst

Test for Swing VoterCredit targeted to areas in which previous election was close

TEST FOR CONSTANT SWING TARGETING

Table 7 Look at amount of credit controlling for years to

election and how close the election was.

-.15

0.1

5Lo

g A

gric

ultu

ral C

red

it

-.5 0 .5Margin of Victory

Targeted Distribution of Credit

-.15

0.1

5Lo

g A

g. C

red

it

-.5 0 .5margin

Four Years Before Scheduled Election

-.15

0.1

5Lo

g A

g. C

red

it

-.5 0 .5margin

Three Years Before Scheduled Election

-.15

0.1

5Lo

g A

g. C

red

it

-.5 0 .5margin

Two Years Before Scheduled Election

-.15

0.1

5Lo

g A

g. C

red

it

-.5 0 .5margin

One Year Befor Scheduled Election

-.15

0.1

5Lo

g A

g. C

red

it

-.5 0 .5margin

Scheduled Election Year

Note: The panels in the figure graph the predicted relationship between agricultural credit levels frompublic sector banks and political support of the state majority party. Each panel gives the relationship for a different year in the electoral cycle.

Targeted Lending Levels Over the Election Cycle

INTERPRETING COEFFICIENTS

Standard deviation of absolute margin of victory is .11 Moving from a district with a margin of victory of

0 to a margin of victory of .11 reduces the size of the cycle by approximately 5 percentage points

-.0

4-.

02

0.0

2.0

4.0

6

-4 -3 -2 -1 0Years Until Scheduled Election

Public Banks

-.4

-.2

0.2

.4


Private Banks

Note: Predicted agricultural credit for a notional district in which the margin of victory in theprevious election was fifteen. Dotted lines give the 95 percent confidence interval.

Figure 3: Cycles in Level of Credit, Non-Swing District-.

050

.05

.1


Public Banks

-.4

-.3

-.2

-.1

0


Private Banks

Note: Predicted agricultural credit for a notional district in which the margin of victory in theprevious election was zero. Dotted lines give the 95 percent confidence interval.

Figure 2: Cycles in Level of Credit, Swing District

TARGETED ALLOCATION (CONCLUDED)

Evidence consistent with “swing voter” models; patronage and programmatic redistribution can be ruled out

Government ownership of banks introduces distortions in credit (but cannot yet make welfare statements)

DO ELECTIONS AFFECT LOAN REPAYMENT? Challenges

Do not observe panel of loans, only district aggregate

Evidence of cycles in repayment

suggest costliness

Use “more than 6 months” late as indicator for defaultMany agricultural loans are for

harvest/season

Table 7

Lending Cycles and Non-Performing Loans

Table 8 Examine being late on loans.

ELECTIONS AFFECT LOAN REPAYMENT

There are cycle in loan repayment rates

Suggests electoral cycle is costly Enforcement is more lax in election years The marginal electoral loan may be more likely

to default Non-performing loans are written off following an

election

DO POLITICAL LOANS AFFECT OUTPUT

Data - Agricultural Output at the state level, 1992-1999 Log real value of agricultural output

Election Schedule serves as instruments for agricultural credit

DO POLITICAL LOANS AFFECT OUTPUT?FIRST STAGE & REDUCED FORM/ IV Table 9

Look at output and credit. Panel A is reduced form and Panel B is IV.

OUTPUT CONCLUSION

No measurable effect on output

TWO

FOUR

CONCLUSION

Combining theories of political cycles and tactical transfers helps identify manipulation of public resources Agricultural lending exhibits substantial lending

cycles Lending is targeted in election years, not in non-

election years Results unlikely to be caused by omitted factors Private banks do not exhibit these distortions

LESSONS

Strong evidence for “Political” view of government involvement in bank credit

Evidence against “Development” view Explains why politicians favor public banks,

and agricultural credit in particular Politicians may care more about re-election

than delivering patronage to core supporters

WHAT DO FIRMS DO WITH CASH WINDFALLS?Blanchard, Lopez-de-Silanes, and Shleifer

JFE 1994

MOTIVATION

How does one provide evidence of investment policy, internal opportunities, and cash flow sensitivity?

What do firms do with cash windfalls when investment opportunities are unchanged? In perfect capital markets, would give money

back to shareholders if holding cash inside firm is tax disadvantaged.

In imperfect capital markets: If firm was capital constrained, should increase

investment in projects it couldn’t do. If managers pursue their own objectives, then could

get very perverse behavior.

DATA Process of identifying cash windfall firms is

quite selective: Looked in WSJ index for mentions of “Antitrust”,

“Patents”, and “Suits” from 1980 to 1986. Find 110 firms that won awards

(Easier today with the web and ability to electronically search news.)

Use four criteria to create sample: Should not affect firm’s marginal Q Award should be significant Require 10K and proxies. Focus only on award winners.

Final sample is 11 firms. Table 1.

SIGNIFICANCE OF AWARD

Table 2 Look at award as fraction of sales and assets.

Table 3 Look at main line of business Look at firm Qs, sales, investment, and debt.

Subtract size of award from Q. These are pretty bad firms with poor investment

opportunities.

MARKET’S REACTION

Do event study. Don’t know the exact amount of

leakage/anticipation of award. Look at CARs centered on award date for firms:

100, 10, and 3 days. Deflate market value change imputed by the

CAR by the size of the award. In general, increase in market cap is small fractio

nof award. Interpretation.

Award is wasted. Award is anticipated.

INVESTMENT

Look at change in investments pre and post award.

Deflate change in investment rate by assets or net award.

Table 6. Small increases in investment on average. Jamesbury is interesting example.

Starts constructing new plant.

ASSET SALES AND ACQUISITIONS

Table 7. Not much asset sales.

Table 8. More troubled firms seem to buy assets. DASA (formerly a telecomm equipment

company) buys oil wells and credit collection business.

UNC quires a communications carrier and starts air services like pilot training.

CHANGE IN FINANCIAL POLICIES

Look at change in debt pre and post-award. Firms seem to add debt after award.

Table 9. Consistent with either commitment to pay out

cash or increase in debt capacity. Look at payout policy.

No increase in dividends. Increase in repurchases – Table 10. Repurchases tend to be targeted at large

investors. Reduction in oversight.

EXECUTIVE COMPENSATION

Look at changes in executive compensation. Cash and stock compensation. Table 11 – Looks like increases in cash primarily.

CONCLUSIONS

Examination looks like cash windfalls are typically used to entrench management.

Increase in pet projects, acquisition. Increase in exec comp. Targeted repurchases.

Methodology is interesting in approach of trying to cleanly identify sample. One more table than data points.

CASH FLOW AND INVESTMENT: EVIDENCE FROM INTERNAL CAPITAL MARKETSOwen Lamont

JF 1997

MOTIVATION

Similar to Blanchard, et al., trying to identify group of firms that have change in cash flow and see how it affects investment.

Want a shock that is unanticipated (exogenous) and its affect on multi-divisional firms that have businesses whose prospects are uncorrelated with the other division’s prospects.

Natural experiment: Oil shock of 1986.

Severe and rapid real price decline in oil prices. Look at diversified oil companies who had other

businesses unrelated to oil.

INTERNAL CAPITAL MARKETS

If external finance is costly, then internal capital markets can allocate money across divisions to overcome some of the financing wedges that exist between internal and external cash.

If capital markets were perfect, would expect no change in investment with a decline in the unrelated segment cash flow.

OIL SHOCK OF 1985

Increase production by Saudi Arabia Figure 1 – Real crude prices. Large change in production and investment plans

for real crude companies. Table I.

Real declines in profitability.

SAMPLE

Look at multi-segment reporting firms. Issue with multi-segment firms.

Firms report segments that are more than 10% of firm revenues.

Reporting is endogenous. Can change from year to year. Does not necessarily represent how firm is organized.

Extract all firms from Compustat in 1985 that had either their primary or secondary SIC code in oil and gas extraction (SIC code 13). Firms classified as oil-dependent if at least 25% of their

cash flow came from oil extraction.

SAMPLE

Look at non-oil segments. Exclude firms with financial services operations

or by-products of oil and gas. First pass is to use judgment. Second test, look at time series of correlation of

profits and investments from Annual Survey of Manufacturing for non-oil segment and real oil prices.

Table II- Sample of firms. Table III – Non-oil segment at firms.

Several firms have multiple non-oil segments in data sample.

TESTS

Examine investment pre- and post-oil shock for non-oil segments. Table V – Raw and industry adjusted investment

for non-oil segments.

CASH FLOW OR COLLATERAL

Change in investment could be due to higher cost of external finance.

Look at firms that have a bond downgrade. Table VII – all done at the industry level for

non-oil segments. Change in investment not driven by bond rating

or shock variable or cash flow.

OVER OR UNDERINVESTMENT

Testing between capital market imperfection (underinvesment after shock) vs. agency costs (overinvestment prior to shock). Look at industry adjusted investment rates. Table VIII

Go from roughly average/median to below industry average/medians.

Table IX Look at profitability. Non-oil segments were less profitable than industry

averages prior to shock and then increase to average after shock.

SUBSIDIES TO NON-OIL DIVISIONS

Table X. Cut deepest for divisions in which CF<I prior to

shock. Also, look at segment investment as a

function of oil cash flows to firm sales. Table XI

OBSERVATIONS

Appears that diversified oil companies subsidize non-oil divisions with free cash flow generated by the oil profits.

This subsidization appears to be the results of agency costs.

More generally, nice natural experiment to provide evidence of investment behavior within firms.

ENJOYING THE QUIET LIFE? CORPORATE GOVERNANCE AND MANAGERIAL PREFERENCESBertrand and Mullainathan

JPE 2003

MOTIVATION

Managers typically own very little of their company stock. 90% own less than 5% (Ofek and Yermack

(2000)) Are managers who are entrenched empire

building (Williamson (1964) vs. quiet life consumers (Hicks (1935))?

Lots of studies look at the relationship between measures of corporate governance and firm performance/activities. But internal governance is endogenous.

Firms choose what level of shareholder protection, etc. they have.

DEALING WITH ENDOGENEITY

Look at enactment of state antitakeover laws. Typically viewed as outside the control of any

one firm. Can look at effect on companies incorporated in

states that pass these lasws. Research design.

Examine both firm level and plant level data. Plant level data is key.

Firms have plants in many states. Anti-takeover rules apply to where firm is incorporated.

STATE ANTI-TAKEOVER LAWS

Good history of first and second generation anti-takeover laws.

Table 1. List of states and years that the laws passed.

Focus on Business Combination Laws Moratorium (3 – 5 years) on specified transaction

between target and a raider holding a certain % of the stock unless board votes

Literature on impact of laws. Stock price reaction – small negative.

Hard to pin down date. Evidence on actual takeover incidence is mixed.

PAPER’S STRATEGY

Use difference in differences comparisons between states that adopt BC laws and those that don’t.

Have data at the firm level and the plant level. Can control for local economic conditions.

DATA

LRD – Longitudinal Research Datafile Census – large probability sample of US

manufacturing firms Have data on individual plants. Matched to Compustat

Issue with Compustat – has only most recent state of incorporation, i.e., it is not historical.

VARIABLES

Average hourly production worker wage Capital stock Return on capital Plant births and deaths Table 2 – Summary stats

REGRESSIONS.

Firm Levelyjklt = tjXjkltBCkt + Estjklt

Plant Levelyijklt = tikXijkltylt(-i)BCkt + Estjklt

WAGES.

Do managers increase wages after BC law? Why would they increase wages above profit

maximizing levels? Reduce turnover. Reduce complaints. Allow for clustering at the state of location level

to adjust for serial correlation. Table 3

Column 5 – look at reverse causality.

PLANT BIRTHS AND DEATHS

Look at death of plants Table 5 – lower rate of plant closings.

Plant births. Table 6 – BC law decreases plant birth by 2%.

Large relative to unconditional plant birth probability of 7%.

No effect on capex or firm size – Tables 7 and 8.

PRODUCTIVITY

Seems as if managers are paying higher wages and being less aggressive.

Does this affect plant level performance. Look at TFP

TFP is the residual from this regression. Second measure of productivity is the return

on capital.

log(output) = tlog(wage billi log(capitali log(materialii

REGRESSIONS

Table 9 Results appear to show lower productivity when

TFP percentiles are measuresd. Marginal effect on return on capital.

OBSERVATIONS

Results consistent with managers that are insulated from takeover “enjoying the quiet life”

Good natural experiment. Lots of interesting variation and results. Exploits location/incorporation differences. Nice data set on plant level variables.

DOES CORPORATE GOVERNANCE MATTER IN COMPETITIVE INDUSTRIESGroud and Mueller

Working paper 2007

IDEA

Notion that market competition weeds out firm inefficiencies.

Would expect that corporate governance is less important when firms face highly competitive markets.

Do the Bertrand Mullainathan analysis, but add in the level of competition in the industry. Expect that in highly concentrated industries, the

Bertrand/Mullainathan result would be particularly strong.

Not so in highly fragmented/competitive industries.

DATA

Main data is Compustat: Exclude utilities – regulated. Use state of location and state of incorporation.

Not necessarily the same thing. Table I – Summary Compute 3-digit Herfindahl index.

Test robustness at 2 and 4 digit. Industry definition is particularly tough.

SIC codes not very good for high technology companies.

Mapping to actual competitors may be problematic.

MAIN APPROACH

Use differences in differences Look at operating performance about how it

depends upon concentration

yijklt =tiXijkltBCktHerfindahljt(BCkt x Herfindahljtjklt

CAVEAT

Not looking at LRD data Looking at Compustat Table III – Look at ROA/assets

BC alone associated with lower ROA Once interaction term put in, all the effect of BC

is due to concentrated industries Standard errors are clustered at the state level to

deal with serial correlation of residuals at the state level.

MEASUREMENT ERROR

Perhaps Herfindahl is measured with error. Imprecise industries Wrong measure of competitiveness.

Use dummy variable for Herfindahl above and below the median or with three Herfindahl dummies. No dependent upon functional form of the

Herfindahl. Table IV.

ROBUSTNESS CHECKS

Use past Herf to control for reverse causality – Table VII.

Look at alternative performance measures – Table VIII.

Look only at manufacturing – Table IX. Alternative correction from Bertrand, Duflo,

and Mallainathan (2004) – Table X.

EVENT STUDY

Try to replicate Karpoff and Malatesta (1989) Difficulty

When is first announcement? What date do you use? What is the market expectation through time?

Look for first announcement of the law. Exhaustive search where possible. Only find news stories about 19 of the 30 laws. But this covers the big states.

EVENT STUDY ISSUES Typical event study methodology assumes

that all events are independent. Solutions:

Typically, just form portfolios of affected and non-affected firms.

Here, authors form equally weighted portfolios of firms that are incorporated in the state and run market model on equally weighted CRSP market portfolio.

Estimate model for days -241 to -41. Get factor loadings. Create predicted returns and subtract from

actual return.

EVENT STUDY

Look at CARs for days -40 to -, -3 to -2, -1 to 0, 1 to 2 and 1 to 10.

Separate out firms based upon Herfindahl in the previous groups.

Table XII. Biggest price drop is in the concentrated

industries. BC laws affect performance of the concentrated

firms that face less competition.

CONTRIBUTION

Interesting extension of prior work. Thoughtful methodology.

Cross Sectional and Panel Data II

Documents

Transcript of Cross Sectional and Panel Data II