Chapter 8 Review Quadratic Functions § 8.3 Graphing Quadratic Equations in Two Variables.
1 QUADRATIC EXPLANATORY VARIABLES We will now consider models with quadratic explanatory variables...
-
Upload
beatrice-heyer -
Category
Documents
-
view
226 -
download
0
Transcript of 1 QUADRATIC EXPLANATORY VARIABLES We will now consider models with quadratic explanatory variables...
1
QUADRATIC EXPLANATORY VARIABLES
uXXY 223221
We will now consider models with quadratic explanatory variables of the type shown. Such a model can be fitted using OLS with no modification.
2
QUADRATIC EXPLANATORY VARIABLES
However, the usual interpretation of a parameter, that it represents the effect of a unit change in its associated variable, holding all other variables constant, cannot be applied. It is not possible for X2 to change without X2
2 also changing.
uXXY 223221
3
QUADRATIC EXPLANATORY VARIABLES
Differentiating the equation with respect to X2, one obtains the change in Y per unit change in X2. Thus, the impact of a unit change in X2 on Y, (2 + 23X2), is a function of X2.
uXXY 223221
2322
2dd
XXY
4
QUADRATIC EXPLANATORY VARIABLES
This means that 2 has an interpretation that is different from that in the ordinary linear model where it is the unqualified effect of a unit change in X2 on Y.
uXXY 223221
2322
2dd
XXY
5
QUADRATIC EXPLANATORY VARIABLES
In this model, 2 should be interpreted as the effect of a unit change in X2 on Y for the special case where X2 = 0. For nonzero values of X2, the coefficient will be different.
uXXY 223221
2322
2dd
XXY
6
QUADRATIC EXPLANATORY VARIABLES
3 also has a special interpretation. If we rewrite the model as shown, 3 can be interpreted as the rate of change of the coefficient of X2, per unit change in X2.
uXXY 223221
2322
2dd
XXY
uXXY 22321
7
QUADRATIC EXPLANATORY VARIABLES
Only 1 has a conventional interpretation. As usual, it is the value of Y (apart from the random component) when X2 = 0.
uXXY 223221
2322
2dd
XXY
uXXY 22321
8
QUADRATIC EXPLANATORY VARIABLES
There is a further problem. We know that the estimate of the intercept may have no sensible meaning if X2 = 0 is outside the data range. If X2 = 0 lies outside the data range, the same type of distortion can happen with the estimate of 2.
uXXY 223221
2322
2dd
XXY
uXXY 22321
9
QUADRATIC EXPLANATORY VARIABLES
We will illustrate this with the earnings function. The table gives the output of a quadratic regression of earnings on schooling (SSQ is defined as the square of schooling).
------------------------------------------------------------------------------. gen SSQ = S*S. reg EARNINGS S SSQ
Source | SS df MS Number of obs = 540-------------+------------------------------ F( 2, 537) = 59.69 Model | 20372.4953 2 10186.2477 Prob > F = 0.0000 Residual | 91637.7357 537 170.647553 R-squared = 0.1819-------------+------------------------------ Adj R-squared = 0.1788 Total | 112010.231 539 207.811189 Root MSE = 13.063
------------------------------------------------------------------------------ EARNINGS | Coef. Std. Err. t P>|t| [95% Conf. Interval]-------------+---------------------------------------------------------------- S | -2.772317 2.119128 -1.31 0.191 -6.935114 1.390481 SSQ | .1829731 .0737308 2.48 0.013 .0381369 .3278092 _cons | 22.25089 14.92883 1.49 0.137 -7.075176 51.57695------------------------------------------------------------------------------
10
QUADRATIC EXPLANATORY VARIABLES
The coefficient of S implies that, for an individual with no schooling, the impact of a year of schooling is to decrease hourly earnings by $2.77.
------------------------------------------------------------------------------. gen SSQ = S*S. reg EARNINGS S SSQ
Source | SS df MS Number of obs = 540-------------+------------------------------ F( 2, 537) = 59.69 Model | 20372.4953 2 10186.2477 Prob > F = 0.0000 Residual | 91637.7357 537 170.647553 R-squared = 0.1819-------------+------------------------------ Adj R-squared = 0.1788 Total | 112010.231 539 207.811189 Root MSE = 13.063
------------------------------------------------------------------------------ EARNINGS | Coef. Std. Err. t P>|t| [95% Conf. Interval]-------------+---------------------------------------------------------------- S | -2.772317 2.119128 -1.31 0.191 -6.935114 1.390481 SSQ | .1829731 .0737308 2.48 0.013 .0381369 .3278092 _cons | 22.25089 14.92883 1.49 0.137 -7.075176 51.57695------------------------------------------------------------------------------
11
QUADRATIC EXPLANATORY VARIABLES
The intercept also has no sensible interpretation. Literally, it implies that an individual with no schooling would have hourly earnings of $22.25, which is implausibly high.
------------------------------------------------------------------------------. gen SSQ = S*S. reg EARNINGS S SSQ
Source | SS df MS Number of obs = 540-------------+------------------------------ F( 2, 537) = 59.69 Model | 20372.4953 2 10186.2477 Prob > F = 0.0000 Residual | 91637.7357 537 170.647553 R-squared = 0.1819-------------+------------------------------ Adj R-squared = 0.1788 Total | 112010.231 539 207.811189 Root MSE = 13.063
------------------------------------------------------------------------------ EARNINGS | Coef. Std. Err. t P>|t| [95% Conf. Interval]-------------+---------------------------------------------------------------- S | -2.772317 2.119128 -1.31 0.191 -6.935114 1.390481 SSQ | .1829731 .0737308 2.48 0.013 .0381369 .3278092 _cons | 22.25089 14.92883 1.49 0.137 -7.075176 51.57695------------------------------------------------------------------------------
12
QUADRATIC EXPLANATORY VARIABLES
The quadratic relationship is illustrated in the figure. Over the range of the actual data, it fits the observations tolerably well. The fit is not dramatically different from those of the linear and semilogarithmic specifications.
------------------------ EARNINGS | Coef. -----------+------------ S | -2.772317 SSQ | .1829731 _cons | 22.25089 ------------------------
-20
0
20
40
60
80
100
120
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
Years of schooling (highest grade completed)
Ho
url
y ea
rnin
gs
($)
quadratic
13
QUADRATIC EXPLANATORY VARIABLES
However, when one extrapolates beyond the data range, the quadratic function increases as schooling decreases, giving rise to implausible estimates of both 1 and 2 for S = 0.
-20
0
20
40
60
80
100
120
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
Years of schooling (highest grade completed)
Ho
url
y ea
rnin
gs
($)
quadratic
------------------------ EARNINGS | Coef. -----------+------------ S | -2.772317 SSQ | .1829731 _cons | 22.25089 ------------------------
14
QUADRATIC EXPLANATORY VARIABLES
In this example, we would prefer the semilogarithmic specification, as do all wage-equation studies. The slope coefficient of the semilogarithmic specification has a simple interpretation and the specification does not give rise to nonsensical predictions outside the data range.
-20
0
20
40
60
80
100
120
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
Years of schooling (highest grade completed)
Ho
url
y ea
rnin
gs
($)
quadratic
------------------------ EARNINGS | Coef. -----------+------------ S | -2.772317 SSQ | .1829731 _cons | 22.25089 ------------------------
15
QUADRATIC EXPLANATORY VARIABLES
In this example, we would prefer the semilogarithmic specification, as do all wage-equation studies. The slope coefficient of the semilogarithmic specification has a simple interpretation and the specification does not give rise to nonsensical predictions outside the data range.
-20
0
20
40
60
80
100
120
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
Years of schooling (highest grade completed)
Ho
url
y ea
rnin
gs
($)
quadratic
------------------------ EARNINGS | Coef. -----------+------------ S | -2.772317 SSQ | .1829731 _cons | 22.25089 ------------------------
16
QUADRATIC EXPLANATORY VARIABLES
The data on employment growth rate, e, and GDP growth rate, g, for 25 OECD countries in Exercise 1.4 provide a less problematic example of the use of a quadratic function.
Average annual percentage growth rates
Employment GDP Employment GDP
Australia 1.68 3.04 Korea 2.57 7.73Austria 0.65 2.55 Luxembourg 3.02 5.64Belgium 0.34 2.16 Netherlands 1.88 2.86Canada 1.17 2.03 New Zealand 0.91 2.01Denmark 0.02 2.02 Norway 0.36 2.98Finland –1.06 1.78 Portugal 0.33 2.79France 0.28 2.08 Spain 0.89 2.60Germany 0.08 2.71 Sweden –0.94 1.17Greece 0.87 2.08 Switzerland 0.79 1.15Iceland –0.13 1.54 Turkey 2.02 4.18Ireland 2.16 6.40 United Kingdom 0.66 1.97Italy –0.30 1.68 United States 1.53 2.46Japan 1.06 2.81
17
QUADRATIC EXPLANATORY VARIABLES
The output from a quadratic regression is shown. gsq has been defined as the square of g.
------------------------------------------------------------------------------. gen gsq = g*g. reg e g gsq
Source | SS df MS Number of obs = 25-------------+------------------------------ F( 2, 22) = 20.15 Model | 15.9784642 2 7.98923212 Prob > F = 0.0000 Residual | 8.7235112 22 .396523236 R-squared = 0.6468-------------+------------------------------ Adj R-squared = 0.6147 Total | 24.7019754 24 1.02924898 Root MSE = .6297
------------------------------------------------------------------------------ e | Coef. Std. Err. t P>|t| [95% Conf. Interval]-------------+---------------------------------------------------------------- g | 1.200205 .3862226 3.11 0.005 .3992287 2.001182 gsq | -.0838408 .0445693 -1.88 0.073 -.1762719 .0085903 _cons | -1.678113 .6556641 -2.56 0.018 -3.037877 -.3183494------------------------------------------------------------------------------
18
QUADRATIC EXPLANATORY VARIABLES
The quadratic specification appears to be an improvement on the hyperbolic function fitted in a previous slideshow. It is more satisfactory than the latter for low values of g, in that it does not yield implausibly large negative predicted values of e. The only defect is that it predicts that the fitted value of e starts to fall when g exceeds 7.
------------------------ e | Coef. -----------+------------ g | 1.200205 gsq | -.0838408 _cons | 22.25089 ------------------------
-4
-3
-2
-1
0
1
2
3
0 1 2 3 4 5 6 7 8
GDP growth rate
emp
loym
ent
gro
wth
rat
equadratic
hyperbolic
19
QUADRATIC EXPLANATORY VARIABLES
Why stop at a quadratic? Why not consider a cubic, or quartic, or a polynomial of even higher order? There are usually several good reasons for not doing so.
-2
-1
0
1
2
3
0 1 2 3 4 5 6 7 8
GDP growth rate
emp
loym
ent
gro
wth
rat
e
cubic
quartic
cubic
quartic
20
QUADRATIC EXPLANATORY VARIABLES
-2
-1
0
1
2
3
0 1 2 3 4 5 6 7 8
GDP growth rate
emp
loym
ent
gro
wth
rat
e
cubic
quartic
cubic
quartic
Diminishing marginal effects are standard in economic theory, justifying quadratic specifications, at least as an approximation, but economic theory seldom suggests that a relationship might sensibly be represented by a cubic or higher-order polynomial.
21
QUADRATIC EXPLANATORY VARIABLES
-2
-1
0
1
2
3
0 1 2 3 4 5 6 7 8
GDP growth rate
emp
loym
ent
gro
wth
rat
e
cubic
quartic
cubic
quartic
The second reason follows from the first. There will be an improvement in fit as higher-order terms are added, but because these terms are not theoretically justified, the improvement will be sample-specific.
22
QUADRATIC EXPLANATORY VARIABLES
-2
-1
0
1
2
3
0 1 2 3 4 5 6 7 8
GDP growth rate
emp
loym
ent
gro
wth
rat
e
cubic
quartic
cubic
quartic
Third, unless the sample is very small, the fits of higher-order polynomials are unlikely to be very different from those of a quadratic over the main part of the data range.
23
QUADRATIC EXPLANATORY VARIABLES
-2
-1
0
1
2
3
0 1 2 3 4 5 6 7 8
GDP growth rate
emp
loym
ent
gro
wth
rat
e
cubic
quartic
cubic
quartic
These points are illustrated by the figure, which shows cubic and quartic regressions with the quadratic regression. Over the main data range, from g = 1.5 to g = 4, the fits of the cubic and quartic are very similar to that of the quadratic.
24
QUADRATIC EXPLANATORY VARIABLES
-2
-1
0
1
2
3
0 1 2 3 4 5 6 7 8
GDP growth rate
emp
loym
ent
gro
wth
rat
e
cubic
quartic
cubic
quartic
R2 for the quadratic specification is 0.647. For the cubic and quartic it is 0.651 and 0.658, relatively small improvements.
25
QUADRATIC EXPLANATORY VARIABLES
-2
-1
0
1
2
3
0 1 2 3 4 5 6 7 8
GDP growth rate
emp
loym
ent
gro
wth
rat
e
cubic
quartic
cubic
quartic
Further, the cubic and quartic curves both exhibit implausible characteristics. The cubic declines even more rapidly than the quadratic for high values of g, and the quartic has strange twists at its extremities.
Copyright Christopher Dougherty 2011.
These slideshows may be downloaded by anyone, anywhere for personal use.
Subject to respect for copyright and, where appropriate, attribution, they may be
used as a resource for teaching an econometrics course. There is no need to
refer to the author.
The content of this slideshow comes from Section 4.3 of C. Dougherty,
Introduction to Econometrics, fourth edition 2011, Oxford University Press.
Additional (free) resources for both students and instructors may be
downloaded from the OUP Online Resource Centre
http://www.oup.com/uk/orc/bin/9780199567089/.
Individuals studying econometrics on their own and who feel that they might
benefit from participation in a formal course should consider the London School
of Economics summer school course
EC212 Introduction to Econometrics
http://www2.lse.ac.uk/study/summerSchools/summerSchool/Home.aspx
or the University of London International Programmes distance learning course
20 Elements of Econometrics
www.londoninternational.ac.uk/lse.
11.07.25