Topic 4.2

55
ECON 377/477

Transcript of Topic 4.2

Page 1: Topic 4.2

ECON 377/477

Page 2: Topic 4.2

Topic 4.2

Stochastic Frontier AnalysisPart 2

Page 3: Topic 4.2

Outline

• Distance functions• Cost frontiers• Decomposing cost efficiency• Scale efficiency• Panel data models• Accounting for the production environment• Conclusions

3ECON377/477 Topic 4.2

Page 4: Topic 4.2

Distance functions• Distance functions can be used to estimate the

characteristics of multiple-output production technologies in cases where we have no price information and/or it is inappropriate to assume firms minimise costs or maximise revenues

• Examples arise when an industry is regulated• Input distance functions tend to be used instead

of output distance functions when firms have more control over inputs than outputs, and vice versa

• We consider only input distance functions

4ECON377/477 Topic 4.2

Page 5: Topic 4.2

Distance functions• Assume we have access to cross-sectional data

on I firms• An input distance function defined over M

outputs and N inputs takes the form

where xni is the n-th input of firm i; qmi is the m-th output; and di

I ≥ 1 is the maximum amount by which the input vector can be radially contracted without changing the output vector

5

1 2 1 2( , , , , , )I Ii i i Ni i i Mid d x x x q q q

ECON377/477 Topic 4.2

Page 6: Topic 4.2

Distance functions• The function dI(.) is non-decreasing, linearly

homogeneous and concave in inputs, and non-increasing and quasi-concave in outputs

• The first step in econometric estimation of an input distance function is to choose a functional form for dI(.)

• It is convenient to choose a functional form that expresses the log-distance as a linear function of (transformations of) inputs and outputs

6ECON377/477 Topic 4.2

Page 7: Topic 4.2

Distance functions• For example, if we choose the Cobb-Douglas

functional form then the model becomes

where vi is a random variable introduced to account for errors of approximation and other sources of statistical noise

• This function is non-decreasing, linearly homogeneous and concave in inputs if βn ≥ 0 for all n and if

7

01 1

ln ln lnN M

Ii n ni m mi i

n m

d x q v

1

1.N

nn

ECON377/477 Topic 4.2

Page 8: Topic 4.2

Distance functions

• It is also quasi-concave in outputs if non-linear functions of the first- and second-order derivatives of di

I with respect to the outputs are non-negative

• Econometric estimation would be reasonably straightforward were it not for the fact that the dependent variable is unobserved

8ECON377/477 Topic 4.2

Page 9: Topic 4.2

Distance functions

• Some substitution and re-arrangement enables us to obtain a homogeneity-constrained model

where is a non-negative variable associated with technical inefficiency

• Our decision to express ln diI as a linear

function of inputs and outputs results in a model that is in the form of the stochastic production frontier

9

1

01 1

ln ln lnN M

Ni n ni Ni m mi i in m

x x x q v u

ln Ii iu d

ECON377/477 Topic 4.2

Page 10: Topic 4.2

Distance functions

• This model is discussed in Part 1 of this topic

• It follows that we can estimate the parameters of the model using the ML technique that is also discussed in Part 1

• A radial input-oriented measure of technical efficiency is:

• But there are two common problems in the estimation of distance functions

10

1exp( ).i iI

i

TE ud

ECON377/477 Topic 4.2

Page 11: Topic 4.2

Distance functions

• These problems are:1. The explanatory variables may be correlated with the composite

error term

2. Estimated input distance functions often fail to satisfy the concavity and quasi-concavity properties implied by economic theory

• A solution to the first problem is to estimate the model in an instrumental variables framework

• A solution to the second problem is to impose regularity conditions by estimating the model in a Bayesian framework

11ECON377/477 Topic 4.2

Page 12: Topic 4.2

Cost frontiers

• When price data are available and it is reasonable to assume firms minimise costs, we can estimate the economic characteristics of the production technology (and predict cost efficiency) using a cost frontier

• In the case where we have cross-sectional data, the cost frontier model can be written in the general form

ci ≥ c(w1i, w2i, …, wNi, q1i, q2i, …, qMi)

12ECON377/477 Topic 4.2

Page 13: Topic 4.2

Cost frontiers

• In this equation, ci is the observed cost of firm i, wni is the n-th input price and qmi is the m-th output

• Note that c(.) is a cost function that is non-decreasing, linearly homogeneous and concave in prices

• The implication of the equation is that the observed cost is greater than or equal to the minimum cost

• The first step in estimating the relationship is to specify a functional form for c(.)

13ECON377/477 Topic 4.2

Page 14: Topic 4.2

Cost frontiers

• The Cobb-Douglas cost frontier model is:

where vi is a symmetric random variable representing errors of approximation and other sources of statistical noise and ui is a non-negative variable representing inefficiency

• This function is non-decreasing, linearly homogeneous and concave in inputs if the βn are non-negative and satisfy the constraint

14

01 1

ln ln lnN M

i n ni m mi i in m

c w q v u

1

1.N

nn

ECON377/477 Topic 4.2

Page 15: Topic 4.2

Cost frontiers

• A translog model is obtained in a similar way• Both models can be written in the compact form:

• A measure of cost efficiency is the ratio of minimum cost to observed cost, which can be easily shown to be: CEi = exp(-ui)

• Check CROB (pp. 267-269) where they present annotated SHAZAM output from the estimation of a half-normal translog cost frontier defined over a single output and three inputs

15

ln i Ni i i ic w v u x β

ECON377/477 Topic 4.2

Page 16: Topic 4.2

Decomposing cost efficiency

• When we have data on input quantities or cost-shares, cost efficiency can be decomposed into technical and allocative efficiency components

• One approach involves estimating a cost frontier together with a subset of cost-share equations

• We focus on a slightly different decomposition method, estimating a production frontier together with a subset of the first-order conditions for cost minimisation

16ECON377/477 Topic 4.2

Page 17: Topic 4.2

Decomposing cost efficiency

• Consider a single-output Cobb-Douglas production frontier:

• Minimising cost subject to this technology constraint entails writing out the Lagrangean, and setting the first-order derivatives to zero

• Taking the logarithm of the ratio of the first and n-th of these first-order conditions yields:

for n = 2, …, N

17ECON377/477 Topic 4.2

01

ln ln .N

i n ni i in

q x v u

1 1 1ln lni ini

ni ni n

w x

w x

Allocative efficiency

Page 18: Topic 4.2

Decomposing cost efficiency

• In this equation, ηni is a random error term introduced to represent allocative inefficiency

• It is positive, negative or zero depending on whether the firm over-utilises, under-utilises or correctly utilises input 1 relative to input n

• A firm is regarded as being allocatively efficient if and only if ηni = 0 for all n

• Observe that inputs appear in ratio form

18ECON377/477 Topic 4.2

Page 19: Topic 4.2

Decomposing cost efficiency

• Thus, a radial expansion in the input vector (an increase in technical inefficiency) will not cause a departure from the first-order conditions

• But a change in the input mix (allocative inefficiency) will clearly cause a departure from the first-order conditions

• We can estimate the N equations by ML under the (reasonable) assumptions that the vis, uis and ηnis are iid as univariate normal, half-normal and multivariate normal random variables, respectively

19ECON377/477 Topic 4.2

Page 20: Topic 4.2

Decomposing cost efficiency

• That is,

• Scale economies are measured by

20ECON377/477 Topic 4.2

2~ (0, ),i vv iidN

2~ (0, ),i uu iidN

2 3( , ,..., ) ( , ).i i i Ni iidN η 0 Σ

1

N

nn

r

Page 21: Topic 4.2

Decomposing cost efficiency

• CROB (p. 271) show that the cost function and its associated system takes the form:

where:

and α is a non-linear function of the βn

21ECON377/477 Topic 4.2

1

1ln ln ln ( ) ( ) ( ln )

Nn

i ni i i i in

c w q v r u r A rr r

12 2

1ln exp( )

N N

i n ni n nin n

Ar

Technical efficiency

Allocative efficiency

Page 22: Topic 4.2

Decomposing cost efficiency

• The term ui/r measures the increase in the log-cost due to technical inefficiency

• The term Ai – ln r measures the increase due to allocative inefficiency

• A measure of cost efficiency is the ratio of minimum cost to observed cost:

CEi = CTEi × CAEi

where the component CTEi = exp(–ui/r) is due to technical inefficiency, and the component CAEi = exp(ln r – Ai) is due to allocative inefficiency

22ECON377/477 Topic 4.2

Page 23: Topic 4.2

Decomposing cost efficiency

• We can obtain point predictions for CTEi and CAEi by substituting predictions for ui and ηni into these expressions

• If the technology exhibits constant returns to scale (r = 1), then:

CTEi = TEi = exp(–ui)

CAEi = AEi ≡ exp(–Ai)

• Thus, CEi = TEi × AEi, which is the familiar expression from Topic 2

23ECON377/477 Topic 4.2

Page 24: Topic 4.2

Decomposing cost efficiency

• CROB illustrate the method and present annotated SHAZAM output in Table 10.2 from the estimation of a three-input Cobb-Douglas production frontier and decomposition of cost efficiency into its two components

• For simplicity, they estimate the production frontier in a single-equation framework, although more efficient estimators could be obtained by estimating the frontier in a seemingly unrelated regression framework

24ECON377/477 Topic 4.2

Page 25: Topic 4.2

Scale efficiency

• To measure scale efficiency, we must have a measure of productivity and a method for identifying the most productive scale size (MPSS)

• In the case of a single-input production function, we can measure productivity using the AP

• The MPSS is the point of maximum AP(x)• The first-order condition for a maximum can be

easily rearranged to show that the MPSS is the point where the elasticity of scale is 1 and the firm experiences local constant returns to scale

25ECON377/477 Topic 4.2

Page 26: Topic 4.2

Scale efficiency

• To measure scale efficiency, we set the elasticity of scale to 1 and solve for the MPSS, denoted x*

• Scale efficiency at any input level x is:

• This procedure generalises to the multiple-input case, although a measure of productivity is a little more difficult to conceptualise

26ECON377/477 Topic 4.2

*

*

*

( ) ( )AP( )SE( ) 1.

AP( )

f x f xxx

x x x

Page 27: Topic 4.2

Scale efficiency

• Think of the input vector x as one unit of a composite input, so that kx represents k units of input

• A measure of productivity is the ray average product (RAP):

• Set the elasticity of scale to 1 and solve for the optimal number of units of the composite input, denoted k*

27ECON377/477 Topic 4.2

( )RAP( ) .

xx

f kk

k

Page 28: Topic 4.2

Scale efficiency

• A measure of scale efficiency at input level kx is:

or, if k = 1:

• A solution can be obtained for a translog functional form and the associated measure of scale efficiency derived

28ECON377/477 Topic 4.2

*

* *

( ) ( )RAP( )SE( ) 1,

RAP( )

x xxx

x

f k f kkk

k k k

*

* *

( ) ( )RAP( )SE( ) 1.

RAP( )

x xxx

x

f f k

k k

Page 29: Topic 4.2

Scale efficiency

• If the production frontier takes the translog form

the scale efficiency measure becomes

where

29ECON377/477 Topic 4.2

01 1 1

1ln ln ln ln

2

N N N

i n ni nm ni mi i in n m

y x x x v u

2(1 ( ))SE( ) exp

2

xx

1 1

( ) lnxN N

n nm min m

x

Page 30: Topic 4.2

Scale efficiency

• Now, ε(x) is the elasticity of scale evaluated at x, and

• If the production frontier is concave in inputs, β will be less than zero and the scale efficiency measure will be less than or equal to one

30ECON377/477 Topic 4.2

1 1

.N N

nmn m

Page 31: Topic 4.2

Panel data models

• We now extend discussion of frontier models to the case where panel data are available

• Panel data sets enable us to obtain more efficient estimators of the unknown parameters and more efficient predictors of technical efficiencies

• They often allow us to:o relax some of the strong distributional assumptionso obtain consistent predictions of TEso investigate changes in technical efficiencies

31ECON377/477 Topic 4.2

Page 32: Topic 4.2

Panel data models

• They also enable us to investigate changes in the underlying production technology over time

• A panel data model can be written as:

where a subscript ‘t’ is added to represent time• If we assume the vits and uits are independently

distributed, we can estimate the parameters of this model using the methods described in Topic 4.1

32ECON377/477 Topic 4.2

ln it it it itq v u x β

Page 33: Topic 4.2

Panel data models

• A problem with assuming the uits are independently distributed is that we fail to reap any of the benefits listed above

• Moreover, for many industries the independence assumption is unrealistic – all other things being equal, we expect efficient firms to remain reasonably efficient from period to period, and we hope that inefficient firms improve their efficiency levels over time

• For these reasons, we need to impose some structure on the inefficiency effects

33ECON377/477 Topic 4.2

Page 34: Topic 4.2

Panel data models

• It is common to classify different structures on the inefficiency effects according to whether they are time-invariant or time-varying

• One of the simplest structures we can impose on the inefficiency effects is

uit = ui i = 1, …, I; t = 1, …., T

where ui is treated as either a fixed parameter or a random variable

• These models are known as the fixed effects model and random effects model, respectively

34ECON377/477 Topic 4.2

Page 35: Topic 4.2

Panel data models

• The fixed effects model can be estimated in a standard regression framework using dummy variables

• The estimated model can only be used to measure efficiency relative to the most efficient firm in the sample so our estimates may be unreliable if the number of firms is small

• The random effects model can be estimated using either least squares or ML techniques

35ECON377/477 Topic 4.2

Page 36: Topic 4.2

Panel data models

• The ML approach involves making stronger distributional assumptions concerning the uis

• Estimating models in a random effects framework using the ML method allows us to disentangle the effects of inefficiency and technological change

36ECON377/477 Topic 4.2

Page 37: Topic 4.2

Panel data models

• The likelihood function for this model is a generalisation of the likelihood function for the half-normal stochastic frontier model discussed in Topic 4.1

• Formulas for firm-specific and industry efficiencies are also generalisations of the formulas presented in Topic 4.1

• The hypothesis testing procedures discussed in Topic 4.1 are also applicable

37ECON377/477 Topic 4.2

Page 38: Topic 4.2

Panel data models

• Models with time-invariant inefficiency effects can be conveniently estimated using FRONTIER and LIMDEP

• CROB illustrate this estimation in Table 10.3, which contains annotated FRONTIER output from the estimation of a truncated-normal frontier

• Note that significant differences exist between the first-order coefficient estimates reported in this table and those reported in Table 9.6 where no account is taken of the panel nature of the data

38ECON377/477 Topic 4.2

Page 39: Topic 4.2

Panel data models

• Two models that allow for time-varying technical inefficiency take the form:

where α, β and η are unknown parameters to be estimated

• The Battese and Coelli function involves only one unknown parameter, and is less flexible

39ECON377/477 Topic 4.2

12( ) 1 exp( )f t t t

( ) exp ( )f t t T

Kumbhakar model

Battese and Coelli model

Page 40: Topic 4.2

Panel data models

• A limitation of both functions is that they do not allow for a change in the rank ordering of firms over time

• The firm that is ranked n-th at the first time period is always ranked n-th

• That is, if ui < uj, then

for all t

40ECON377/477 Topic 4.2

( )it iu f t u ( ) j jtf t u u

Page 41: Topic 4.2

Panel data models

• The Kumbhakar and Battese and Coelli models can both be estimated under the assumption that ui has a truncated normal distribution:

• Again, the likelihood function is a generalisation of the likelihood function for the half-normal stochastic frontier model, as are formulas for firm-specific and industry efficiencies

• Hypotheses concerning individual coefficients can be tested using a z test or LR test, but they are usually tested using an LR test if there is more than one coefficient in the test

41ECON377/477 Topic 4.2

2~ ( , ).i uu iidN

Page 42: Topic 4.2

Panel data models

• Null hypotheses of special interest areo H0: α = β = 0 or H0: η = 0 (time-invariant efficiency effects)

o H0: µ = 0 (half-normal inefficiency effects at time period T)

• CROB present annotated FRONTIER output from the estimation of a frontier in Table 10.4

• They are unable to reject both null hypotheses that the technological change effect is zero and η = 0

42ECON377/477 Topic 4.2

Page 43: Topic 4.2

Panel data models

• These hypothesis test results suggest that the model is having difficulty distinguishing between output increases due to technological progress and output increases due to improvements in technical efficiency

• Several more flexible models are discussed in the efficiency literature

• Notably, Cuesta (2000) specifies a model of the form that generalises the Battese and Coelli model and allows the temporal pattern of inefficiency effects to vary across firms 

43ECON377/477 Topic 4.2

Page 44: Topic 4.2

Accounting for the production environment

• The ability of a manager to convert inputs into outputs is often influenced by exogenous variables that characterise the environment in which production takes place

• It is useful to distinguish between non-stochastic variables that are observable at the time key production decisions are made and unforeseen stochastic variables that can be regarded as sources of production risk (events of any type that might lead managers to seek some form of liability insurance)

44ECON377/477 Topic 4.2

Page 45: Topic 4.2

Accounting for the production environment

• The simplest way to account for non-stochastic environmental variables is to incorporate them directly into the non-stochastic component of the production frontier

• In the case of cross-sectional data this leads to a model of the form:

where zi is a vector of (transformations of) environmental variables and γ is a vector of unknown parameters

45ECON377/477 Topic 4.2

ln i i i i iq v u x β z γ

Page 46: Topic 4.2

Accounting for the production environment

• This model has exactly the same error structure as the conventional stochastic frontier model discussed in Topic 4.1

• Thus, all the estimators and testing procedures discussed in that part of the topic are available

• Our predictions of firm-specific technical efficiency now vary with both the traditional inputs and the environmental variables

46ECON377/477 Topic 4.2

Page 47: Topic 4.2

Accounting for the production environment

• The preferred method to deal with observable environmental variables is to allow them directly to influence the stochastic component of the production frontier

• Assume

• and

47ECON377/477 Topic 4.2

ln i i i iq v u x β

2~ ( , ).i i uu N z γ

Page 48: Topic 4.2

Accounting for the production environment

• The inefficiency effects in the frontier model have distributions that vary with zi, so they are no longer identically distributed

• The likelihood function is a generalisation of the likelihood function for the conventional model, as are measures of firm-specific and industry efficiency

• The model has also been generalised to the panel data case

48ECON377/477 Topic 4.2

Page 49: Topic 4.2

Accounting for the production environment

• A simple way to account for production risk is to append another random variable to the frontier model to represent the combined effects of any variables that are unobserved at the time input decisions are made

• If we assume this random variable has a symmetric distribution, then it is difficult to distinguish it from the noise vi

• Alternatively, if we assume it has a non-negative distribution, it is difficult to distinguish it from the inefficiency effect ui

49ECON377/477 Topic 4.2

Page 50: Topic 4.2

Accounting for the production environment

• This suggests that, for all intents and purposes, we can persist with the conventional stochastic frontier model, although we should recognise that the two error components now measure the effects of noise, inefficiency and risk

• But the conventional frontier model has two undesirable risk propertieso The signs of the MPs are the same as the signs of the

associated marginal riskso The model does not permit substitutability between state-

contingent outputs

50ECON377/477 Topic 4.2

Page 51: Topic 4.2

Accounting for the production environment

• One way to overcome the first problem is to assume the composed error term is heteroskedastic

• One way to allow for substitution between state-contingent outputs is to estimate a state-contingent stochastic frontier of the form

where βj is a vector of unknown parameters and vi and ui represent noise and inefficiency, respectively (but not risk)

51ECON377/477 Topic 4.2

ln i i j i iq v u x β

Page 52: Topic 4.2

Accounting for the production environment

• This model is identical to the conventional stochastic frontier model, except the coefficient vector βj is permitted to vary across risky states of nature, j = 1, …, J

• Estimation is complicated by the fact that states of nature are typically unobserved or data are sparse

• This problem can be overcome by estimating the model in a Bayesian mixtures framework, and using this model to identify output shortfalls due to inefficiency and output shortfalls due to adverse conditions

52ECON377/477 Topic 4.2

Page 53: Topic 4.2

Conclusions

• Two other possible methods for estimating multiple-output technologies are not discussed

• First, we can use profit frontiers when input and output prices are available and it is reasonable to assume firms maximise profits

• Methods to estimate profit frontiers are similar to those available for estimating cost frontiers

• Second, we can aggregate multiple outputs into a single output measure using index number methods, and estimate the technology in a conventional single-output framework

53ECON377/477 Topic 4.2

Page 54: Topic 4.2

Conclusions

• The decision to estimate a distance function, cost frontier, profit frontier or single-output production frontier is one of the many decisions facing researchers who want to estimate efficiency using a parametric approach

• Researchers must also make choices concerning functional forms, error distributions, estimation methods and software

• The need to make so many choices is often seen as a disadvantage of the parametric approach

54ECON377/477 Topic 4.2

Page 55: Topic 4.2

Conclusions

• We have two simple pieces of advice:1. Always make decisions on a case-by-case basis

2. Whenever it is possible, explore alternative models and estimation methods and (formally or informally) assess the adequacy and robustness of the results obtained

55ECON377/477 Topic 4.2