Rational Addiction Evidence from Carbonated Soft Drinks...

30
Rational Addiction Evidence from Carbonated Soft Drinks Xiaoou Liu and Rigoberto Lopez Department of Agricultural and Resource Economics University of Connecticut W.B. Young Building, 1376 Storrs Road Storrs, CT 06269 June 30, 2009 Contributed Paper prepared for presentation at the International Association of Agricultural Economists Conference, Beijing, China, August 16-22, 2009. Copyright 2009 by the authors. All rights reserved. Readers may make verbatim copies of this document for non-commercial purposes by any means, provided that this copyright notice appears on all such copies.

Transcript of Rational Addiction Evidence from Carbonated Soft Drinks...

Rational Addiction Evidence from Carbonated Soft Drinks

Xiaoou Liu and Rigoberto Lopez

Department of Agricultural and Resource Economics

University of Connecticut

W.B. Young Building, 1376 Storrs Road

Storrs, CT 06269

June 30, 2009

Contributed Paper prepared for presentation at the International Association of Agricultural

Economists Conference, Beijing, China, August 16-22, 2009.

Copyright 2009 by the authors. All rights reserved. Readers may make verbatim copies ofthis document for non-commercial purposes by any means, provided that this copyright noticeappears on all such copies.

ABSTRACT

Rational Addiction Evidence from Carbonated Soft Drinks. XIAOOU LIU (Email: liuxi-

[email protected], Department of Agricultural and Resource Economics, Storrs, CT

06269) RIGOBERTO LOPEZ (Professor and Department Head, Department of Agricultural

and Resource Economics, Storrs, CT 06269)

This paper applies the Becker-Murphy (1988) theory of rational addiction to the case of

carbonated soft drinks, using a time-varying parameter model and scanner data from 46 U.S.

cities. Empirical results provide strong evidence that carbonated soft drinks are rationally

addictive, thus opening the door to taxation and regulation. Taking rational addition into

account, estimated demand elasticities are much lower than previous estimates using scanner

data.

Key words: rational addiction, carbonated soft drinks, time-varying parameter model

i

INTRODUCTION

Addiction is commonly defined as excessive physical or psychological dependence to a sub-

stance or activity. From a public policy perspective, addiction is an important issue as

excessive consumption of addictive commodities lead to negative health outcomes. This

chapter is the first to focus on addiction to carbonated soft drinks (herein CSDs) whose ex-

cess consumption can contribute to the obesity epidemic and health related complications.

If a product is addictive, then this provides a rationale for government intervention.

Economists have a long standing interest in modeling addictive processes due to their

important in explaining certain consumption behaviors that seem to contradict the ratio-

nality basis of economics. From an economic perspective, addictions are always associated

with two conditions: reinforcement and tolerance. Reinforcement refers to the fact that

more consumption of addictive commodities at present will lead to more consumption in the

future. Tolerance implies that more consumption at present will lead to lower future utility

of a given amount of future consumption.

In a milestone paper, Becker and Murphy (1988, herein BM) theoretically illustrate that

many addictions previously thought to have been irrational are consistent with consumer

choice theory given stable consumer preferences. The study points out that addictive behav-

ior is rational if an increase in current consumption of an addictive product increases future

consumption of that product. In the BM model, both past and future consumptions raise

the marginal utility of current consumption. Thus, to test a rational addiction hypothesis is

equivalent to ask whether consumption today is dependent on both past consumption and

future consumption.

To test the theoretical model of rational addiction, Becker, Grossman, and Murphy (1994,

herein BGM) examined whether lower past and future prices for cigarettes raise current

cigarette consumption. Furthermore, empirical applications of the BM framework have also

1

been conducted on cigarettes (Baltagi and Griffin, 2001), alcohol (Waters and Sloan, 1995;

Grossman, Chaloupka, and Sirtaian, 1998; Bentzen and Eriksson, 1999; Baltagi and Griffin,

2002), caffeine (Olekalns and Bardsley, 1996), and Richards, Patterson and Tegene (2007)

on nutrient consumption.

This chapter applies the BM rational addiction model to carbonated soft drinks in the

United States. This product provides an excellent case study for the following reasons. First,

addiction is apparent given the fact that CSDs per capita consumption in U.S. is more than

tripled since 1975 (Allshouse, 2004).1 Second, excessive consumption of CSDs has direct

implications for the obesity epidemic and type 2 diabetes in children whose incidence has

paralleled the rise in consumption of CSDs, particularly via the consumptions of excess

calories (Harnack, Stang and Story, 1999 and Apovian, 2004). Third, this chapter is the first

study to test rational addiction nature of carbonated soft drinks, thus adding to the scarce

economic literature on food as an addiction.

The first objective of this study is to present evidence whether or not CSDs are addictive.

This is equivalent to test the hypothesis that if higher past and future prices raise current

consumption of CSDs . The second objective is to assess the potential of price policies that

relate health problems to CSDs consumption, as consumption of an addictive product is

irresponsive to market prices so mechanic adjustments of a market are not efficient to keep

consumption at a healthy level.

To achieve the objectives, this chapter empirically implements the BM framework using a

time-varying parameter model (Kim, 2006) and supermarket scanner data from 46 U.S. cities.

Empirical results reveal that CSDs are rationally addictive as past and future consumptions

signicantly increase current consumption, resulting in signicantly lower price elasticities than

those found in previous studies that do not take addiction into account. Thus, results indicate

1The increasing pattern is observed to parallel to the increases of obesity and type 2 diabetes, especiallyamong children and adolescents.

2

that, because of rational addiction behavior, CSD consumption is highly irresponsive to

changes in market prices and highlights the need of incorporating an inter-temporal linkage

in CSDs demand modeling. Ultimately, the results confirm the candidacy of CSDs for

government intervention given its addictive nature.

LITERATURE REVIEW

The study of rational addiction draws back to the attempts to model habit persistence. For

example, Pollak (1970) postulates the parameters of consumers’ utility function to depend

in a specific way on past consumption and derives a dynamic demand function. He con-

cludes that if goods are habit persistent, consumers’ current preference will depend on the

consumption pattern in the past. Ryder and Heal (1973) find the similar results that con-

sumers’ satisfaction from a given bundle of goods depend not only on that bundle but also

on past consumption.

Before the idea of rational habit formation is first discussed by Spinnewyn (1981), the

studies of habit persistent only focus on the impact of past consumption on current con-

sumption. The idea of considering habit persistence as a rational behavior proves that if

consumers’ taste changes endogenously through habit stocks that depend on past consump-

tion, the current consumption of a rational consumer will depend on the future habit forming.

This scope finally leads to the emergence of studies in rational addiction.

The concept of rational addiction is introduced by Stigler and Becker (1977), and further

developed by Iannaccone (1984, 1986). In a rational addiction framework, although con-

sumers are aware of the cost of addictive consumption, they still make the addiction choice

because gains from addiction exceed costs of being addictive in the future. Based on this

rationality, Becker and Murphy (1988) derive the explicit long-run and short-run demand

functions for addictive products and relate temporarily stressful events to permanent addic-

3

tions. Stronger addiction to a product requires a greater effect of past consumption of the

product on current consumption. This powerful complementariness between past consump-

tion and current consumption cause steady-state consumption levels of an addictive product

to be unstable. Thus, any small deviations from the consumption at an unstable steady

state can lead to large cumulative rises or to rapid falls in consumption over time.

Gruber and Koszegi (2001) criticize BM model by constraining the analysis to time

consistent consumers. The evidence from studies in psychology shows that consumers tend to

discount less for far futures than closer futures and thereby consumers are time-inconsistent.

From this aspect, the rationality hypothesis of an addiction is rejected as the inconsistent

discounting pattern raises the specter of intra-personal conflict over decisions for future.

However, this model cannot be distinguished from BM model empirically.

BGM (1994) is the first empirical study of rational addition. The study concludes that

smoking is rationally addictive by showing that the current cigarettes consumption depends

on both past consumption and future consumption. The estimates show strong evidence of

reinforcement and tolerance effects regarding cigarette consumption. The estimated long-run

price elasticity at the sample mean of BGM data is 1.8 times more than the short-run price

elasticity and reveals the fact that a permanent price reduction have moro significant effect

in long run rather than in short run.

Besides the typically considered addictive products, a variety of empirical studies have

been done on food addiction to respond the concerns raised by harmful health consequence of

excessive food consumption. Cawley (1999) analyzes consumers propensity of food addiction,

and Grootendorst (2004) find that that milk and eggs are also rationally addictive. Richards

et al. (2007) test a rational addiction hypothesis for specific macronutrients in foods and

find a particularly strong addiction to carbohydrates. This study suggests that a price

based policy, or sin tax that change the expected future costs and benefits of consuming

carbohydrate-intensive foods may be effective in controlling excessive nutrient intake.

4

THEORETICAL MODEL

Following Becker, Grossman and Murphy (1994), an infinite lived representative consumer

with wealth A0 is assumed to maximize the sum of his lifetime utilities respect to a budget

constraint, i.e.,

max∞∑

t=1

θt−1U(Ct, Ct−1, Zt),

with C0 = C0 and subject to:∑∞

t=1 βt−1(Zt + ptCt) = A0. U is a quadratic utility function

of the rationally addicted consumer, Ct is the amount of CSDs consumption at time t, Ct−1

is the amount of CSDs consumption at time t− 1, and Zt is the amount of a non-addictive

composite product consumption at time t. The discount factor θ is equal to 11+r

, where r is

real interest rate. Zt is taken as numeraire and pt denotes the price of CSDs. Consumption

Ct is assumed to have no income effect.2

The first-order conditions are:

Uz(Ct, Ct−1, Zt) = λ, (1)

U1(Ct, Ct−1, Zt) + θU2(Ct+1, Ct, Zt+1) = λpt. (2)

Equation (1) shows that marginal utility of non-addictive consumption in each period Uz is

equivalent to the marginal utility of wealth λ. Equation (2) illustrates that the marginal

utility of wealth multiplied by the current price equals the marginal utility of current CSD

consumption, U1, plus the discounted marginal effect of current consumption at the t on the

utility at time t + 1, U2 is summed to be negative because of the harmful health effects of

excessive CSD consumption.

To solve the first-order conditions, Becker et al. (1994) assumed a linear quadratic utility

2This assumption indicates that consumption over time has no effect on the present value of wealth A0.

5

function of Zt, Ct, Ct−1 as:

U(Zt, Ct, Ct−1) = αzZt +αzz

2Z2

t + α1Ct +α11

2C2

t + α2Ct−1 +α22

2C2

t−1

+ α1zCtZt + α2zCt−1Zt + α12CtCt−1.

After solving the first-order condition for Zt and plugging the result in the first-order condi-

tion for Ct, a linear difference equation for current CSDs consumption is obtained as

Ct = β1Ct−1 + θβ1Ct+1 + β2pt (3)

where

β1 =−(α11αzz − α1zα2z)

(α11αzz + α21z) + θ(α22αzz − α2

2z)

β2 =−(αzzλ)

(α11αzz + α21z) + θ(α22αzz − α2

2z)

Equation (3) illustrates that current consumption of a rationally addictive product is a

function of past consumption Ct−1 , future consumption Ct+1, and current price pt. β2 is

negative because of the concavity assumption of utility function. θ appears as a coefficient

for both past and future consumption. To meet the reinforcement and tolerance conditions

of rational addiction, a positive value of β1 is required to capture the phenomena that both

of past and future consumptions raise current consumption. Additionally, the greater of β1

is, the more addictive the product is.

The solution for the difference equation in (3) is:

Ct =1

β1φ1(φ2 − φ1)

∞∑s=1

φs1h(t+ s) +

1

β1φ2(φ2 − φ1)

∞∑s=0

φ−s2 h(t− 2) (4)

+1

φt2

(C0 − 1

β1φ1(φ2 − φ1))∞∑

s=1

φs1h(s)), (5)

6

where

h(t) = β0 + β2pt−1,

φ1 =1−

√(1− 4β2

1θ)

2β1

,

φ2 =1 +

√(1− 4β2

1θ)

2β1

.

with 4β21θ < 1 for stability.

The impact of a temporary CSD price change at time m on CSD consumption at time t

can then be derived as3:

dCt

dpm

∣∣∣∣m>t

=β2φ

m−t1

β1(φ2 − φ1)

[1− (

φ1

φ2

)t

], (6)

dCt

dpm

∣∣∣∣m<t

=β2φ

m−t2

β1(φ2 − φ1)

[1− (

φ1

φ2

)m

], (7)

dCt

dpt

=β2

β1(φ2 − φ1)

[1− (

φ1

φ2

)t

]. (8)

β1 determines the signs of equation (6)-(8). Equation (8) is used to compute a completely

unanticipated own price effect by setting t or m equal to 1, while the fully anticipated price

effect is obtained by setting t or m equal to infinity. For addictive products, anticipated

future price increases reduce current consumption, as consumption at different periods are

compliments.

The permanent price reduction beginning at time t effects on the current consumption

is given by:

dCt

dp∗t=

β2[1− (φ1/φ2)t]

β1(1− φ1)(φ2 − φ1).

3The derived prices effect from equation (5) is a temporary price effect because the derivation of equation(6)-(8) assumes prices in other periods are held constant.

7

By setting t goes to infinity, the equation becomes

dCt

dp∗t=

β2

β1(1− φ1)(φ2 − φ1),

and gives a short-run price effect.4

The effect of a permanent price reduction over all periods on the consumption Ct is

dCt

dp=

β2φ−t2

β1(φ2 − φ1)

[φt

2

φ2 − 1− 1− (φ1/φ2)

t

β1(1− φ1)(φ2 − φ1)

]+

β2[1− (φ1/φ2)t]

β1(1− φ1)(φ2 − φ1). (9)

Equation (9) is used to calculate the long run price effect of the permanent price reduction

by setting t goes to infinity:

dC∞dp

=β2

β1(1− φ1)(φ2 − 1).

In BM’s rational addiction framework, permanent price changes of an addictive product

have a small short-run effect on current consumption, which explains a general perception

that addicts do not respond much to price changes. Long-run demand for addictive products

tends to be more elastic than short-run demand. Furthermore, a permanent price change

shows greater impact on current consumption than a temporary price change.

EMPIRICAL MODEL

For the empirical implementation, this study adopts a time-varying parameter model. The

model allows the estimated coefficients to vary over time by incorporating the information

about consumers choice over time. Thus, the estimated time path of rational addiction coef-

ficient β1 not only provides more information about the changes in degree of addictiveness,

4Becker and Murphy (1988) defines a short-run price effect as the effect of a completely unanticipatedpermanent price reduction (including prices of all future time periods) on current consumption.

8

but also incorporates the changing degrees of uncertainty over time on rationally addictive

preferences. Furthermore, estimates of the time-varying discount factor θ could be used to

test whether consumers have a time-consistent discount factor, as assumed in the rational

addiction framework.

An empirical equation for panel data can be written as follows:

cit = λi + β1tci,t−1 + β1tθtci,t−1 + β2tpit + β4tyit +mit, (10)

where the subscript i denotes the ith cross-sectional unit and the subscript t denotes the tth

period. cit is CSDs consumption in period t, and pit is the retail price, yit is income, and mit

is an unobserved shock. To capture the variation of marginal utility of wealth among cities,

a city level fixed effect variable λi is included, as suggested both by BGM and Hsiao (1986).

λi can be eliminated by taking the first difference of equation (10):

∆Ct = (β1t − β1,t−1∆Ct−1 + (β1tθt − β1,t−1θt−1)∆Ct+1 (11)

+ (β2t − β2,t−1)∆Pt + (β4t − β4,t−1)∆Yt + ∆Mt,

where

∆Ct = [∆c1t, . . . ,∆cit, . . . ,∆cnt]′,

∆Pt = [∆p1t, . . . ,∆pit, . . . ,∆pnt]′,

∆Yt = [∆y1t, . . . ,∆yit, . . . ,∆ynt]′,

∆Mt = [∆m1t, . . . ,∆mit, . . . ,∆mnt]′.

Following Kim (2006), the forward-looking behavior Ci,t+1 is modeled as

∆Ct+1 = β0t + β3tEt(∆Pt+1), (12)

9

where Et(·) refers to the expectation formed conditional on information set at time t.

By plugging equation (12) into equation (12) the change of current consumption becomes

∆Ct = (β1tθt − β1,t−1θt−1)β0t + (β1t − β1,t−1)∆Ct−1 (13)

+ (β2t − β2,t−1)∆Pt + (β1tθt − β1,t−1θt−1)β3t∆Pt+1 + (β4t − βt,t−1)∆Yt + et,

with

βjt = a0j + a1jβj,t−1 + εjt, εjt ∼ i.i.d. N(0, σ2ε,j) (14)

θt = a0,5 + a1,5θt−1 + ε5,t, ε5,t ∼ i.i.d. N(0, σ2ε,5) (15)

for j = 1, 2, 3, 4. The new error terms et is equal to (−(β1tθt − β1,t−1θt−1)β3t(∆Pt+1 −

E(∆Pt+1)) + ∆Mt). The discount factor θt is constrained between 0 and 1.

Furthermore, to model the time-varying variance of unobserved shocks, the model as-

sumes the distribution of et following a GARCH (1,1) process, i.e.,

et|Ψt−1 ∼ N(0, σe,t), (16)

σ2e,t = α0 + α1e

2t−1 + α2σ

2e,t−1, (17)

where Ψt−1 refers to information up to t− 1.

EMPIRICAL ISSUES

The empirical implementations to equation (14) raise three empirical issues: endogeneity,

non-linearity, and heteroscedasticity. This section will propose a solution algorithm to these

issues before conducting an estimation.

10

Endogeneity

Equation (14) implies that the lagged variable ∆Ct−1 is correlated with ∆Mt, even under

the assumption that ∆Mt is not serially correlated. Rivers and Voung (1988) suggests a

two-step procedure to solve this issue. The first step is to regress the endogenous variable

∆Ct−1 against exogenous variables. The estimated regression is used to create a new variable

∆Ct−1. The second step is to estimate the objective function with ∆Ct−1 instead of ∆Ct−1.

The second endogeneity issue is caused by the correlation between regressor ∆Pt+1 and

the disturbance term et, as part of et is constructed to contain ∆Pt+1. Kim (2006) suggests

a general approach to solve this issue by decomposing et into two uncorrelated parts. The

first step to proceed this approach is to formulate ∆Pt+1 with a set of instrument variables

It:

∆Pt+1 = I ′tδt + vt, vt ∼ N(0, σ2v,t) (18)

with

δt = b0 + b1δt−1 + ut, ut ∼ i.i.dN(0,Σu), (19)

σ2v,t = c0 + c1v

2t−1 + c2σ

2v,t−1. (20)

By construction, the relationship between the endogenous regressors ∆Pt+1 and the vector

of IV It is also modeled with the time-varying coefficient δt and an unobserved heteroscedastic

shock vt, which is specified to follow a GARCH (1,1) process. These specifications implement

that uncertainty associated with future prices can vary over time with a time variation in

σt, even in absence of heteroscedasticity in vt.

The second step is to decompose ∆Pt+1 into two components: predicted component and

prediction error component; i.e.,

∆Pt+1 = E(∆Pt+1|Ψt−1) + vt|t−1,

11

vt|t−1 = Ω1/2t|t−1v

∗t , v

∗t ∼ i.i.d.N(0, 1)

where Ψt−1 is information up to time t−1 and Ωt|t−1 is the time-varying conditional variance

for prediction error. Both vt|t−1 and Ωt|t−1 can be obtained by estimating equation (18)-(20)

through a maximum likelihood procedure based on Harvey et al.’s (1992) modified Kalman

filter.

The prediction errors v∗t is then standardized by assuming the covariance structure be-

tween v∗t and et as: v∗t

et

∼ N

0

0

, 1 ρσe,t

ρσe,t σ2e,t

,

where ρ is a constant correlation coefficient. Apply the Cholesky decomposition to the

covariance matrix, resulting in the following relationship between v∗t and et:

v∗t

et

=

1 0

ρσe,t

√1− ρ2σe,t

εt

wt

, εt

wt

∼ i.i.d.N

0

0

, 1 0

0 1

, (21)

Rewrite equation (21) as:

et = ρσe,tv∗t + w∗t , w

∗t ∼ N(0, (1− ρ2)σ2

e,t), (22)

where w∗t is uncorrelated with v∗t .

After the transformations, equation (22) decomposes et in equation (14) into two com-

ponents: v∗t , which is correlated with ∆Pt+1; and w∗t , which is uncorrelated with any of

12

regressors. Plugging equation (22) into equation (14), equation (14) then becomes

∆Ct = (β1tθt − β1,t−1θt−1)β0t + (β1t − β1,t−1)∆Ct−1 (23)

+ (β1tθt − β1,t−1θt−1)β3t∆Pt+1 + (β2t − β2,t−1)∆Pt

+ (β4t − β4,t−1)∆Yt + ρσe,tv∗t + w∗t ,

where v∗t is now considered as an additional regressor. The new disturbance term w∗t is

uncorrelated with any of the regressors.

Nonlinearity and heteroscedasticity

The coefficients β1t, θt, β0,t, β3t in equation (24) are not identified without further implemen-

tations, as they appear in the equation through multiplication. To solve this issue, rewrite

the non-linear equation (24) and (14)-(17) as

∆Ct = f(xt; βt) + ρσe,tv∗t + w∗t w

∗t ∼ N(0, (1− ρ2)σ2

e,t), (24)

with

βt = µ+ Fβt−1 + εt εt ∼ i.i.d.N(0,Σε),

σ2e,t = α0 + α1e

2t−1 + α2σ

2e,t−1,

where βt = [β0t β1t β2t β3t β4t θt]′, Σε is a diagonal matrix, and

f(xt; βt) = (β1tθt − β1,t−1|t−1θt−1|t−1)β0t + (β1t − β1,t−1|t−1)∆Ct−1 (25)

+ (β1tθt − β1,t−1|t−1θt−1|t−1)β3t∆Pt+1 (26)

+ (β2t − β2,t−1|t−1)∆Pt + (β4t − β4,t−1|t−1)∆Yt, (27)

13

where βt−1|t−1 = E(βt−1|Ψt−1), x′t = [1 ∆Ct−1 ∆Pt−1 ∆Yt] and εt and w∗t are independent.5

As suggested in Harvey (1989) and Kim (2006), the problem is solved by applying a linear

approximation to equation (24). The linear approximation takes a taylor series expansion of

the non-linear function f(xt; βt) around βt = βt|t−1 as:6

∆Ct = f(xt; βt|t−1) +∂f(xt; βt|t−1)

∂βt

(βt − βt|t−1) + ρσe,tv∗t + w∗t . (28)

Evaluating the first derivatives in equation (28) and rearranging terms, the linear ap-

proximation to equation (24) takes the form

Yt = X ′tβt + ρσe,tv∗t + w∗t , (29)

where

Yt = ∆Ct + β1,t−1|t−1∆Ct−1 + β2,t−1|t−1∆Pt

+ β4,t−1|t−1∆Yt + (β0,t|t−1θt|t−1 + β3,t|t−1 + β3,t|t−1∆Pt+1θt|t−1)β1,t|t−1

+ (β2,t|t−1β3,t|t−1∆Pt+1 + β1,t|t−1β0.t|t−1)θt|t−1,

Xt =

(β1,t|t−1θt|t−1 − β1,t−1|t−1θt−1|t−1)

β0,t|t−1θt|t−1 + ∆Ct−1 + β3,t|t−1∆Pt+1θt|t−1

∆Pt

(β1,t|t−1θt|t−1 − β1,t−1|t−1θt−1|t−1)∆Pt+1

Yt

(β1,t|t−1β0,t|t−1 + β1,t|t−1β3,t|t−1∆Pt+1)

To deal with the issue of heteroscedasticity in the disturbance terms, Harvey et al. (1992)

5Here assumes βt−1 = βt−1|t−1 because there is no randomness associated with the expectation conditionalon the information set at time t− 1, which is already realized at time t.

6βt|t−1 = E(βt|Ψt−1)

14

suggests to include w∗t in the state-space equation (29) to formulate

Yt = [X ′t 1]

βt

w∗t

+ ρσe,tv∗t = X ′tβt + ρσe,tv

∗t , (30)

where βt

w∗t

=

µ

0

+

F 0

0′6 0

βt−1

w∗t−1

+

εt

w∗t

, εt

w∗t

∼ N

06

0

, Σε 06

0 (1− ρ2)σ2e,t

,

and

βt = A+Bβt−1 + Vt, Vt ∼ (0, Qt),

which is ultimately estimated equation.

ESTIMATION ALGORITHM

The estimation of equations (30) is conducted using the two-step, maximum likelihood esti-

mator procedure outlined by Heckman (1976). The first step is to estimate equation (18)-(20)

using maximum likelihood estimation based on Harvey et al.’s (1992) modified Kalman filter,

and calculate the one-step-ahead forecast errors, v∗t|t−1. The second step is to use a maxi-

mum likelihood method via the Kalman filter to estimate equation (30) with v∗t as a bias

correction term. The prediction errors v∗t for expected prices enter to the estimation as the

bias correction term to capture the changing degrees of uncertainty associated with future

prices into the estimation of a forward-looking behavior in the rational addiction framework.

15

The Kalman filter with Yt and Xt proceeds as follows:7

βt|t−1 = A+Bβt−1|t−1, (31)

Λt|t−1 = BΛt−1|t−1B′, (32)

ηt|t−1 = Yt − X ′tβt|t−1 − ρσe,tv∗t , (33)

Ht|t−1 = X ′tΛt−1|tXt + ρ2σ2e,t, (34)

βt|t = βt|t−1 + Λt|t−1XtH−1t|t−1ηt|t−1, (35)

Λt|t = Λt|t−1 − Λt|t−1XtH−1t|t−1X′tΛt|t−1, (36)

Λt+1|t = BΛt|tB′ + Qt+1. (37)

One problem of processing the above Kalman filer is that e2t−1 is unknown but needed

to calculate σ2e,t, which is employed in the Qt in equations (32), and equation (33). e2t−1 is

approximated by E(e2t−1|Ψt−1), where Ψ is information set at time t− 1. That is,

et−1 = v∗t−1ρσe,t−1 + w∗t−1,

and

et−1 = E(et−1|Ψt−1) + (et−1 − E(et−1|Ψt−1)).

7Variables βt|t−1 = E(βt|Ψt−1); Pt|t−1 = E[(βt − βt|t−1)(βt − βt|t−1)′]; ηt|t−1 = Yt − Yt|t−1; and Yt|t−1 =E(Yt|Ψt−1)

16

Thus,

E(e2t−1|Ψt−1) = E(et−1|Ψt−1)2 + E(et−1 − E(et−1|Ψt−1))

2 (38)

= (v∗t−1ρσe,t−1 + E(w∗t−1|Ψt−1))2 (39)

+ E(w∗t−1 − E(w∗t−1|Ψt−1))2, (40)

where E(w∗t−1|Ψt−1) is the last element of βt−1|t−1 and its mean squared error E(w∗t−1 −

E(x∗t−1|Ψt−1))2 is given by the last diagonal element of Λt−1|t−1.

The main database used in this study comes from information Resources, Inc. (IRI),

which is available in the Food Marketing Center at the University of Connecticut. The

data consists of 20 quarterly, city-level observations from 1988 through 1992 over 46 U.S.

metropolitan areas collected from supermarket scanners for a total of 920 observations (20×

46). The variables used in the estimation are volume sales, price, in-store sale promotions,

population, and median income. Consumption is measured as per capita volume sale and

the price is simply the average retail prices at the city level.

EMPIRICAL RESULTS

Table 1 reports the empirical results of hyper-parameters of the first step estimation given

by equations (18)-(20). The dependent variable is future price changes and the explana-

tory variables are current prices changes, current median income changes, and the change

of in-store promotions. Three hyper-parameters b1,1, b1,2 and b1,3 in equation (19), which

associated with the time-varying coefficients in equation (18), are statistically significant at

1% significant level and carry negative signs with absolute values less than 1.

The variance associated with the random shocks for time-varying coefficient of current

change of in-store promotion, Σv,3 is statistically significant at 1% significant level, while

Σv,1 and Σv,2 are not significantly different from zero. The coefficient of lagged time-varying

17

Table 1: First-step estimates of hyper-parameters

Parameters Estimates Std. Err. Parameters Estimates Std. Err.

b1,1 -0.5988 (0.0814) Σv,1 0.0266 (0.3851)

b1,2 -0.3479 (0.1068) Σv,2 0.0000 (1.0534)

b1,3 -0.2739 (0.1203) Σv,3 1.6982 (0.0483)

c1 0.1080 (0.1917) c2 0.1913 (0.1440)

* This table reports estimates for equation (18)-(20).

variance σ2v,t−1, c2, is estimated to take the value of 0.1913.

As discussed in Hsiao (2003), to test the time-varying specification of the coefficients,

is equivalent to test for the null hypothesis that bi,i = 0 and Σv,i = 0 (i = 1, 2, 3). If

the hypothesis cannot be rejected for a coefficient estimate, the coefficient is reduced to

the standard non-time-varying type. The test statistics χ2 reported in Table 1 implies

that the coefficients of current price change and current in-store promotion change are time

varying, but the coefficient of current change of median income is not distinguishable from

the standard non-time-varying coefficients.

Table 2: First-step estimates of time-variable coefficients

Parameters Mean Std. Err. Std. Dev. Std. Err.

δ1 -0.2337 (0.1984) 0.0394 (0.0031)

δ2 -0.0927 (0.0006) 0.0000 (0.0000)

δ3 -0.2965 (1.3363) 1.7847 (0.0856)

* Standard errors are calculated by delta method.

Conditional on the hyper-parameter estimates reported in Table 1, the time-varying

18

Figure 1: Time-varying response of future price to current price

coefficients are calculated by the proposed Kalman filter with equation (31)-(37). Overall,

18 evaluations are obtained for each time-varying coefficient.8 The mean and standard

deviation of each coefficient are shown in table 2. The coefficients of current price change

δ1 and change of median income δ2 are statistically significantly at 5% significant level. The

standard deviation of δ2 takes value 0, which indicates the non-time-varying property of δ2.

Figure 1 depicts the significant time-varying coefficients δ1 with its 95% confidence interval.

The error correction term v∗t is then calculated based on the estimates to proceed the second

step.

The estimates of hyper-parameters from equation (14)-(15) for the second step are re-

ported in Table 3. The dependent variable is changes in current CSD consumption per

capita, and the explanatory variables are changes in past consumption per capita, changes

8The total number of quarters are reduced to 18 because of ∆Pt−1 and ∆Pt+1 are used as explanatoryvariables.

19

Table 3: First-step estimates of time-variable coefficients

Parameters Estimates Std. Err. Parameters Estimates Std. Err.

a1,0 -0.0025 (0.0055) σε,0 0.0002 (0.1350)

a1,1 -0.9999 (0.0000) σε,1 0.0000 (0.0000)

a1,2 0.4137 (0.0030) σε,2 0.0004 (0.0000)

a1,3 -0.7199 (0.0159) σε,3 0.0002 (0.1441)

a1,4 0.1147 (0.0030) σε,4 0.0001 (0.0001)

a1,5 -0.0001 (0.0105) σε,5 0.0000 (0.0032)

α1 -0.0171 (0.0001)

α2 -0.0116 (0.0206)

ρ -0.9522 (0.0058)

* This table reports the hyper-parameters of the model given by equation (14)-(15)with a bias correction term. Intercepts are not included.

in current consumption per capita, changes in current prices, changes in future consumption

per capita, and changes in current median income. a1,i and δε,i (i = 0, 1, 5) correspond to

the time-varying coefficient βi,t in equation (14).

Table 4 shows the mean and standard deviation estimates for time-variable coefficients

in table 3. The positive estimates of the coefficient for past consumption, β1,t, reveals

the reinforcement feature of CSDs consumption. Meanwhile, the positive and statistically

signicant estimates of the coefficient for future prices, β3,t, indicates the looking-forward

behavior of CSDs consumption. Thus, the hypothesis that CSDs is rationally addictive

is accepted according to these empirical results. Furthermore, the estimated coefficient θt

shows that the discount factor does not vary over the examining period because the standard

deviation cannot be rejected to differ from zero. This evidence conrms the assumption of

20

Table 4: Second-step estimates of time-variable coefficients

Parameters Mean Std. Err. Std. Dev. Std. Err.

β0,t 0.3413 (0.0129) 0.0000 (0.0000)

β1,t 0.2905 (0.0006) 0.0279 (0.0014)

β2,t -0.1692 (0.0040) 0.0002 (0.0000)

β3,t 0.3988 (0.0081) 0.0000 (0.0000)

β4,t 0.0187 (0.0081) 0.0000 (0.0000)

θt 0.2601 (0.0067) 0.0000 (0.0000)

time-consistent consumers made by BM, and further shows that the empirical results are

valid to conclude the rational addiction feature of CSDs consumption.

As shown in BGM, the estimated value of β1,t indicates the degree of rational addiction

to carbonated soft drinks. Figure 2 graphs the time path of β1,t from the 3rd quarter of 1988

to the 3rd quarter of 1992. The greatest value of estimated β1,t always occur in the second

quarter of each year, which indicate that the consumers get most addictive to carbonated

soft drinks at the second quarter. The least addiction occurs in the following third quarter of

each year. The potential explanation is, due to the big stock accumulated and most addicted

to the product in the second quarter, consumers do not need to be as more addictive as other

quarters to support their satisfaction in consuming soft drinks, that is, the autoregressive

nature of the addiction indicated by the lagged value β1,t dominates the seasonality effect

from the disturbance.

The estimated coefficient β2,t illustrates the time-varying response of current consumption

to current price, as shown in Figure 3. The price effect on current consumption is the greatest

in the fourth quarter of 1988 through out the whole sample, while the lowest price effect

21

Figure 2: Time-varying response of current consumption to lagged consumption

happens in the fourth quarter of 1991. The greatest price effect for each year usually occurs

in the second quarter, expect in 1989, which the fourth quarter owns the greatest price effect

on current consumption.

Table 5 reports the computed price elasticities with respect to various price changes

dened in Section 3 at sample means of average retail price and consumption. The values are

calculated based on estimation results reported in table 4. As discussed in BM, permanent

price changes of addictive product may have a small short-run effect on the consumption of

the product. This conclusion could be the potential explanation of a general perception that

addicts do not respond much to changes in price.

Estimates of own price elasticities imply that a 10-percent temporary increase in the

current price causes a 0.223 percent decline in current CSD consumption if the price increase

is anticipated. If the price increase is unanticipated, a 10-percent price increase will lead to

a 0.221 percent decrease in current consumption.

22

Figure 3: Time-varying response of current consumption to current price

Estimate of the short-run response of current consumption to a permanent price change

is 0.0226, that is, a 10-percent decline in CSD prices leads to a short-run increase in con-

sumption of 0.226 percent. The long-run response of consumption to a permanent price

change is 0.0319, which is about 1.4 times of the short-run price elasticity and indicates that

a 10-percent reduction in price will lead to 0.319 percent increase in CSD.

For the estimates of cross price effects, a 10-percent unanticipated reduction in price at

time t causes an 0.018 percent increase in consumption at time t + 1. The corresponding

price reduction on consumption at time t− 1 is 0.061.

Hence, the estimates in time-varying coefficients and demand elasticities have jointly il-

lustrated that CSDs are rationally addictive because past and future changes in consumption

have statistically signicant effects on current consumption and CSDs consumption is highly

inelastic to market prices. The estimated CSDs demand elasticities in this paper are much

lower than previous estimates using scanner data. For example, Kinnucan et al. (2001) es-

23

Table 5: Average price elasticities for CSD as a rationally addictive

Elasticities SE Elasticities SE

Own price Future price

unanticipated -0.0221 (0.0035) unanticipated -0.0018 (0.0002)

anticipated -0.0223 (0.0033) Past price

Short-run -0.0226 (0.0029) unanticipated -0.0061 (0.0007)

Long-run -0.0319 (0.0060)

timate the own price elasticity of CSDs at approximately 0.1372, Yen et al. (2004) at -0.52,

Zheng and Kaiser (2008) at -0.151 (AIDS model) and -0.164 (Rotterdam Model), and Dube

et al. (2004) at ranging from -2.0 to -3.0 for a brand-level analysis.

CONCLUSIONS AND DISCUSSION

This paper is the first to test and confirm that CSDs are rationally addictive. This conclusion

is based on finding that past and future consumptions have significant effects on current

consumption. Taking this into account leads to significantly lower own price elasticities

of demand for CSDs than found in previous studies that do not take rational addition

into account. Given that CSDs are rationally addictive, with serious health consequences

(e.g., obesity) for consumers, the findings of this paper provide a rationale for government

intervention on behalf of public health.

The analysis presented in this paper could be expanded in several worthwhile directions.

The first is to include an analysis of quitting behavior of rational addiction within the cold

turkey effect discussed in Becker and Murphy (1988) if the stock of consumption is lowered

sufficiently. Another one is to include the existing taxation in the model and analyze the

24

effects of taxation on current consumption of CSDs in a rational addiction framework. The

third is to take into account interpersonal externalities as well as external costs in the design

of optimal intervention policies that go beyond price policies.

REFERENCES

[1] Apovian, C (2004), “Sugar-Sweetened Soft Drinks, Obesity, and Type 2 Diabetes”,

Journal of American Medical Association, 292, pp. 978-979.

[2] Baltagi, B., and J. Griffin (2001), ”The Econometrics of Rational Addiction: The Case

of Cigarettes.”, Journal of Business and Economic Statistics, 19, pp. 449-454.

[3] Baltagi, B., and J. Griffin (2002), ”Rational Addiction to Alcohol: Panel Data Analysis

of Liquor Consumption”, Health Economics, 19, pp. 449-454.

[4] Becker, G. and K. Murphy (1988), ”A Theory of Rational Addiction”, Journal of Po-

litical Economy, 96, pp. 675-700.

[5] Becker, G., M. Grossman and K. Murphy (1994), ”An Empirical Analysis of Cigarette

Addiction”, American Economics Review, 84, pp. 396-417.

[6] Bentzen, J., T. Eriksson and S. Valdemar (1999), ”Rational Addiction and Alcohol

Consumption: Evidence from the Nordic Countries”, Journal of Consumer Policy, 22,

pp. 257-279.

[7] Capps, O. and J. Griffin (1998), “Effect of a Mass Merchandiser on Traditional Food

Retailers”, Journal of Food Distribution Research, 29, pp. 1-7.

[8] Cawley, J. (1999), ”Rational Addiction, the Consumption of Calories and Body Weight”,

Ph.D. Dissertation, University of Chicago.

25

[9] Chaloupka, F. (1991), ”Rational Addictive Behavior and Cigarette Smoking”, Journal

of Political Economy, 99, pp. 722-742.

[10] Chaloupka, F., M. Grossman, and J. Tauras (1998), ”The Demand for Cocaine and

Marijuana by Youth”, NBER working paper No. W6411.

[11] Dube. J-P. (2004), ”Multiple Discreteness and Product Differentiation: Demand for

Carbonated Soft Drinks.”, Marketing Science, 23, pp. 66-81.

[12] Grossman, M., F. Chaloupka and I. Sirtalan (1998), ”An Empirical Analysis of Alcohol

Addiction: Results from the Monitoring the Future Panels”, Economic Inquiry, 36, pp.

39-48

[13] Grossman, M. and F. Chaloupka (1998), ”The Demand for Cocaine by Young Adults:

A Rational Addiction Approach”, Journal of Health Economics, 17, pp. 427-474.

[14] Gruber, J. and B. Koszegi (2001), ”Is Addiction ’Rational’? Theory and Evidence”,

Quarterly Journal of Economics,116, pp. 1261-1303.

[15] Harnack, L., J. Stang and M. Story (1999), “Soft Drink Consumption Among US Chil-

dren and AdolescentsNutritional Consequences”, Journal of the American Dietetic As-

sociation, 99, pp. 436-441.

[16] lannaccone, L. (1984), ”Consumption Capital and Habit Formation with an Application

to Religious Participation.”, Ph.D. dissertation, University of Chicago.

[17] lannaccone, L. (1986), ”Addiction and Satiation”, Economics Letters, 21, pp. 95-99.

[18] Kim, C-J., and C. Nelson (2006), ”Estimation of A Forward-Looking Monetary Policy

Rule: A Time-Varying Parameter Model Using Ex Post Data”, Journal of Monetary

Economics, 53, pp. 1949-1966

26

[19] Kim, C-J., J. Piger, and R. Startz (2008), “Estimation of Markov Regime-Switching

Regression Models with Endogenous Switching”, Journal of Econometrics, 143, pp.

263–273.

[20] Kinnucan, H., Y. Miao, H. Xiao and H. Kaiser (2001), ”Effects of Advertising on U.S.

Non-Alcoholic Beverage Demand: Evidence from a Two-Stage Rotterdam Model.”, Ad-

vances in Applied Microeconomics: Advertising and Differentiated Products, In M.R.

Baye and J.P. Nelson, ed.

[21] Olekalnsand, N. and P. Bardsley (1996), ”Rational Addiction to Caffeine: An Analysis

of Coffee Consumption”, Journal of Political Economy, 104, pp. 1100-1104.

[22] Richards, T., P. Patterson and A. Tegene (2007), ”Obesity and Nutrient Consumption:

A Rational Addiction?”, Contemporary Economic Policy, 25, pp. 309-324.

[23] Spinnewyn, F. (1981), ”Rational Habit Formation.”, European Economics Review, 15,

pp. 91-109.

[24] Stigler, G. and G. Becker (1977), ”De Gustibus Non Est Disputandum.”, American

Economics Review, 67, pp. 76-90.

[25] Waters, T. and F. Sloan (1995), ”Why Do People Drink? Tests of The Rational Addic-

tion Model”, Applied Economics, 27, pp. 727-736.

[26] Yen, S., B. Lin, D. Smallwood and M. Andrews (2004), ”Demand for Nonalcoholic

Beverages: the Case of Low-Income Households”, Agribusiness, 20, pp. 309321.

[27] Zheng, Y. and H. Kaiser (2008), ”Estimating Asymmetric Advertising Response: An

Application to U.S. Nonalcoholic Beverage Demand”, Journal of Agricultural and Ap-

plied Economics, 40, pp. 837-849.

27

[28] Zywicki, T., Holt, D., and Ohlhausen, M. (2004), Obesity and Advertising Policy, 12

GEORGE MASON L. REV. 979.

28