How Much Collusion? A Meta-Analysis On Oligopoly Experiments
Optimal Collusion in Oligopoly Supergames: Marginal...
Transcript of Optimal Collusion in Oligopoly Supergames: Marginal...
Optimal Collusion in Oligopoly Supergames:
Marginal Costs Matter
Etienne Billette de Villemeur∗, Laurent Flochel∗∗, Bruno Versaevel�
December 31, 2004
∗ Corresponding author. University of Toulouse 1, IDEI & GREMAQ, 21 allee de Brienne,
31000 Toulouse France ([email protected])
∗∗ University of Lyon 2 & GATE, 93 chemin des Mouilles, BP 167 69131 Ecully Cedex France
�Ecole de Management de Lyon & GATE, 23 avenue Guy de Collongue, 69134 Ecully Cedex
France ([email protected])
1
Optimal Collusion in Oligopoly Supergames:
Marginal Costs Matter
Abstract
In a standard oligopoly supergame with identical Þrms, a necessary condition on the level of
marginal costs is derived for optimal collusion to be sustainable, either in prices or in quantities,
for any degree of product differentiation, and any number of Þrms. Only in the Cournot version
of the model, and with at most three Þrms, the level of marginal costs plays no role in the
sustainability of collusion. Otherwise, for any degree of differentiation and any size of industry,
the marginal cost must be higher than a threshold value for an optimal single-period punishment
to be obtained. When parameter values are such that no optimal single-period punishment can
be implemented, a multi-period punishment can always be constructed that leads Þrms to
sustain collusion at the proÞt-maximizing price or quantity level. In that case, punishments
last longer when the level of marginal costs decreases. By contrast, the length of the punishment
phase does not impact the minimum discount factor for which collusion can be implementable.
More differentiation and/or concentration unambiguously facilitate collusion. When Þrms use
prices, the optimal punishment phase lasts longer than when they use quantities, and the
boundary condition on the discount factor is more stringent.
JEL ClassiÞcation: C72; D43; L13.
Keywords: Collusion ; Product differentiation ; Industry concentration ; Cournot ; Bertrand.
1 Introduction
Antitrust authorities are interested in relying on simple guidelines in order to investigate the
behavior of Þrms. This motivates the understanding of the impact of product differentiation and
industry concentration on the sustainability of collusive agreements in oligopolistic industries.
In a legal context in which collusive agreements cannot be contractually enforced, it is well
known that each Þrm may Þnd it proÞtable to deviate from a tacitly collusive price or quantity
that maximizes industry proÞts. This may not be the case however if a deviation can be
sufficiently and credibly �punished� via lower industry prices or larger quantities in a subsequent
period of time. In order to identify structural conditions for the sustainability of collusive
strategies, the theoretical literature has constructed a class of dynamic models usually referred
to as supergames. These models feature a repeated market game in which Þrms maximize a
ßow of discounted individual proÞts by non-cooperatively choosing a price or a quantity over
an inÞnite number of periods.
One stream of that literature follows Friedman (1971) by considering trigger strategies (com-
monly referred to as �grim� strategies) which call for reversion to the one-shot stage game Nash
equilibrium forever when a deviation is detected in a previous period. A general result is that
collusion can be sustained if the discount factor is above a threshold value. In a duopoly model
with horizontal differentiation, Deneckere (1983, 1984) Þnds that tacitly collusive agreements
in prices are less easily sustained than in quantities unless goods are highly substitutable. This
is sensitive to the two-Þrm assumption, since in an extension of the same model Majerus (1988)
demonstrates that, with three Þrms or more, a collusion in prices is less easily sustained than
in quantities for all degrees of substitutability. Other studies examine the relationship between
alternative formalizations of product differentiation and the sustainability of collusion. They in-
clude papers by Chang (1991) and Ross (1992), who conÞrm in two spatial competition models
3
that collusion is more easily sustained when products are increasingly differentiated.1
A weakness that is common to all models of collusion with trigger strategies (i.e. pun-
ishments of inÞnite duration) is that they rule out the possibility of modulating the level of
punishments. More precisely, by assuming that Þrms revert to the Nash equilibrium of the
one-shot stage game in all periods that follow a deviation, they put an upper bound on the
severity of punishments.
This problem is tackled by a second stream of the literature, which reconsiders the impact
of various model speciÞcations on the sustainability of collusion with a stick-and-carrot penal
code. In the latter mechanism, if a Þrm deviates from the collusive strategy, all Þrms play a
punishment strategy before returning to the collusive price or quantity. In a repeated Cournot
(i.e., quantity-setting) oligopoly model, with identical sellers of a homogenous good and con-
stant marginal costs, Abreu (1986) characterizes the most severe punishments as �optimal� by
proving that they lead Þrms to sustain the most proÞtable collusive strategy. A dual interpre-
tation is to consider the minimum discount factor that sustains a given collusive outcome. Also
in a repeated Cournot model, Wernerfelt (1989) uses a stick-and-carrot framework to Þnd that
more product differentiation may render collusion less sustainable when the number of Þrms is
relatively large.2 In a repeated Bertrand (i.e., price-setting) duopoly model, with spatial differ-
entiation, and constant marginal costs normalized to zero, Hackner (1996) demonstrates that a
symmetric stick-and-carrot structure is an optimal price path, and conÞrms that differentiation
tends to facilitate collusive agreements. It is also demonstrated that, when the punishment
price is constrained to be non-negative, a prolonged price war can sustain the same collusive
1With vertical differentiation, Hackner (1994) Þnds that more differentiation unambiguously implies more
sustainable collusion in a Bertrand setup, with a trigger penal code. In the present paper we concentrate on
horizontal differentiation exclusively.2Although of interest, this ambiguous result is derived from demand assumptions (adapted from Deneckere,
1983) which are not standard (on this see Osterdal, 2003, pp. 54-55).
4
strategy as in the unrestricted case. With two Þrms and constant marginal costs again, but
with another speciÞcation of the horizontal differentiation assumption, Lambertini and Sasaki
(2002) Þnd a qualitatively similar relationship between product substitutability and collusion
sustainability.3 In addition, for all degrees of product differentiation, they show that collusion is
less easily sustainable in Bertrand than in Cournot. This is obtained in a setup where quantities
are constrained to be non-negative but prices may fall below zero.4
In this paper, we show that the level of constant marginal costs matters for collusion sus-
tainability. Our starting point is the intuitive proposition by Lambertini and Sasaki (2001) that
collusion in a repeated oligopoly game with high marginal costs may be more easily sustainable
than with lower marginal costs, all other things remaining equal. The underlying idea is that,
if prices are constrained to be positive, the ability to punish a deviation by charging prices
below marginal costs is limited when these costs are close to zero. We unveil a condition on
the level of marginal costs that is necessary for collusion to be sustainable, either in prices or in
quantities, in a stick-and-carrot setup. This is done with standard demand speciÞcations, for
any degree of product differentiation, and any number of Þrms.
The introduction of positive marginal costs, together with a non-negativity constraint on
prices as in Hackner (1996), leads to several new results. We Þnd that only in the Cournot
version of the model, and with at most three Þrms, the level of marginal costs plays no role in the
3Lambertini and Sasaki (2002) also consider complements. Under non-negative quantity constraints, they
Þnd that longer (i.e., multi-period) punishments are required only when products are close complements and
the quantity is the strategic variable.4Beyond the degree of product differentiation and the number of Þrms, other studies focus on the role of
various cost and demand speciÞcations. For example, Rothschild (1999) demonstrates that collusion sustainabil-
ity depends on Þrms� relative efficiencies. Other examples include Lambertini (1996), Tyagi (1999), and Collie
(2004), who investigate the relationship between assumptions on the form of the demand function and Þrms�
ability to collude in a repeated games with trigger strategies.
5
sustainability of collusion. Otherwise, for any degree of differentiation and any number of Þrms,
the marginal cost must be higher than a threshold value. This comes in addition to the usual
lower bound condition on the discount factor. High marginal costs can thus be included in the
list of factors that facilitate collusion. When parameter values are such that no optimal single-
period punishment can be implemented, we Þnd that a longer � i.e., multi-period � punishment
can always be constructed that leads Þrms to sustain collusion. More precisely, the length of the
punishment phase is demonstrated to be a substitute for the level of marginal costs. By contrast,
it does not impact on the critical threshold value of the discount factor that allow to sustain
collusion in multi-period punishment scheme. A comparative statics analysis establishes that
more differentiation and/or concentration in the industry relax the sustainability conditions on
the marginal cost and the discount factor, and thereby facilitate collusion. Eventually, when
Þrms use prices as opposed to quantities, we Þnd that the optimal punishment phase lasts
longer, and the boundary condition on the discount factor is more stringent. In that sense,
collusion is less easily sustainable in Bertrand than in Cournot.
The remainder of the paper is organized as follows. Section 2 describes the model. In Section
3, we investigate the role of the level of marginal costs on the existence of an optimal single-
period punishment, either in prices or in quantities. In Section 4, we consider situations in
which collusion is not sustainable with a single-period punishment. Conditions of sustainability
of optimal collusion for the Bertrand and Cournot cases are compared in Section 5. Final
remarks appear in Section 6. Computations are in the appendix.
6
2 The Model
Identical Þrms in N = {1, . . . , n} maximize intertemporal proÞts by simultaneously and non-cooperatively choosing either quantities (Cournot supergame) or prices (Bertrand supergame)
in an inÞnitely repeated game over t = 1, 2, ...,∞. The discount factor δ = 1/(1+ r), where r isthe single period interest rate, is common to all Þrms. Each Þrm i incurs a constant marginal
(unit) cost c ∈ [0, 1] to supply a non-negative quantity qi. Over all periods, each consumer hasthe same identical utility function, adapted from Hackner (2000), of the form
U(q, I) =nXi=1
qi − 12
nXi=1
q2i + 2γXi6=j
qiqj
+ I, (1)
which is quadratic in the consumption of q-products and linear in the consumption of the com-
posite I-good (i.e., the numeraire).5 The parameter γ ∈ (0, 1) measures product substitutabilityas perceived by consumers. If γ → 0, the demand for the different product varieties are inde-
pendent and each Þrm has monopolistic market power, while if γ → 1, the products are perfect
substitutes. Consumers maximize utility subject to the budget constraintPpiqi+I ≤ m, where
m denotes income, pi is the non-negative price of product i, and the price of the composite
good is normalized to one. By symmetry, we notePj 6=i qj = (n− 1)qj .
Bertrand In the price-setting version of the model, from (1) the demand function for Þrm
i writes
qi(pi, pj) =
1− pi if pi ≤ �pi(pj)
11+γ(n−1)
³1− 1+γ(n−2)
1−γ pi +γ(n−1)1−γ pj
´if �pi(pj) ≤ pi ≤ �pi(pj)
0 if pi ≥ �pi(pj)
, (2)
5In Hackner (2000), quantities qi are multiplied by a parameter ai, that is a measure the distinctive quality
of each variety i. Here we exclude vertical product differentiation by assuming that ai = 1, all i ∈ N .
7
and the demand for each other symmetric Þrm j writes
qj(pi, pj) =
0 if pi ≤ �pi(pj)
11+γ(n−1)
³1− 1
1−γ pj +γ1−γ pi
´if �pi(pj) ≤ pi ≤ �pi(pj)
1−pj1+γ(n−2) if pi ≥ �pi(pj)
, (3)
with �pi(pj) =1γ [pj − (1− γ)] and �pi(pj) =
11+γ(n−2) (1− γ + γ (n− 1) pj). The piecewise
deÞnition of demand functions is a consequence of the non-negativity constraint we impose
on quantities. Remark however that the two functions (24) and (25) are continuous in their
respective arguments. This implies that proÞt functions are also continuous. One can check
that they are also concave in the relevant decision variable.
Cournot In the quantity-setting version of the model, from (1) Þrm i�s inverse demand
function in each period is
pi(qi, qj) = max{0, 1− qi − γ(n− 1)qj}, (4)
all qj ≥ 0, j 6= i, and the inverse demand for each other symmetric Þrm j is deÞned by
pj(qi, qj) = max {0, 1− γqi − (1 + γ(n− 2))qj} . (5)
Here again, it is straightforward to check that a Þrm�s proÞt function is continuous and the
associated maximization problem is convex.
Penal Codes Both in the Bertrand and Cournot versions of the model, a strategy proÞle
is a set of available actions a, including a collusive price (quantity), which yields joint proÞt
maximization, and a punishment price (quantity), which leads to low per-Þrm proÞts. In the
stage game, Þrms� strategies in prices (quantities) may differ from period to period. An action
path {at}∞t=1 is deÞned as an inÞnite sequence of n-dimensional vector prices (quantities), as
8
chosen by each Þrm in each period. We assume all Þrms initially follow a collusive path by
choosing the collusive action aM . The collusive price and quantity are given by pM = (1 + c) /2
and qM = (1− c) / [2 (1 + γ(n− 1))] respectively. If the collusive strategies are played by allÞrms in all periods, each Þrm earns a sum of proÞts ΠM = πM/ (1− δ), where πM refers to
symmetric single-period collusive proÞts. All Þrms have a short-run incentive to deviate, that
is to lower (increase) its own price (quantity) in order to increase individual proÞts at every
other Þrm�s expense. If such a deviation by one Þrm i in N is detected in period t, all Þrms
switch to the punishment action aP in period t+ 1.
As standard in the literature, we consider two alternative penal codes, namely the �trigger�
penal code and the �stick-and-carrot� penal code. We are interested in investigating the con-
ditions on marginal costs, product differentiation, and size of the industry, in which collusion
is sustainable in a stick-and-carrot setup. We compare them with conditions on the parameter
values for which Þrms collude in equilibrium in the trigger model, we thereby use as a bench-
mark. Both in the trigger and stick-and-carrot cases, we have a collusive equilibrium of the
supergame if all Þrms Þnd it proÞtable to adopt the collusive action whenever the action path
calls for them to do so, and to implement the penal code whenever the action path calls for
them to do so.
¥ Following Friedman (1971), in the trigger penal code, all Þrms revert forever to the stage
game Nash equilibrium in the period that follows any deviation from the collusive strategy. In
this case, each Þrm earns a discounted ßow of proÞts Πdi = πdi + δπ
N/ (1− δ), all i, where πdirefers to a Þrm i�s single-period deviation proÞts, and πN refers to symmetric single period
Nash equilibrium proÞts. For the collusive strategy to be sustainable, the discounted ßow
of proÞts from adhering to the collusive strategy forever must exceed the discounted ßow of
9
proÞts obtained in the case of deviation, i.e. ΠM ≥ Πdi . It follows that collusion is sustainablewith a trigger penal code if and only if the discount factor is higher than a threshold level
δtrigger =¡πdi − πM
¢/¡πdi − πN
¢. Remark that this result holds true for all (γ, c) ∈ [0, 1)2 and
all n ≥ 2.
¥ In a single-period stick-and-carrot penal code a la Abreu (1986) also, all Þrms initially follow
a collusive path by choosing the collusive action aM . If a deviation by any Þrm occurs, all
Þrms switch to the punishment action aP in the next period (the stick). After one period of
punishment, if any deviation from aP is detected, the punishment phase restarts, otherwise all
Þrms resume the collusive behavior by adopting aM forever (the carrot).
The need for a punishment is rooted in the fact that each individual Þrm, assuming that all
other Þrms play the collusive action, has a short-run incentive to lower (increase) its own price
(quantity) to increase individual proÞts at every other Þrm�s expense. Then the choice of a low
(high) punishment price (quantity) aP in the next period renders free-riding less attractive. As
standard in the penal code literature, we say punishments are optimal if, for a given discount
parameter δ, they lead Þrms to obtain the highest level of sustainable collusive proÞts. As
established by Abreu (1986, p. 192), these optimal punishments are also the most severe.
Following Lambertini and Sasaki (2002), we adopt an equivalent interpretation according to
which a penal code is optimal if, given a collusive action aM , it requires the lowest critical
threshold in the discount factor in order to sustain the collusive equilibrium path. Intuitively,
this is because the lower the discount factor, the smaller the present value of future collusive
proÞts, and the higher the incentives to deviate.
10
3 Collusion Sustainability
In this section, we investigate the role of the level of marginal costs on the existence of an optimal
single-period punishment, either in prices or in quantities, in the stick-and-carrot mechanism.
As in the existing literature, we also examine the effects of the degree of product differentiation
and of the number of Þrms.
For each player to have no incentive to deviate, a deviation must be followed by a punishment
that leads the discounted ßow of proÞts to be less than the actualized stream of collusive
equilibrium proÞts. Moreover, for the punishment to be a credible threat, one should verify
that Þrms do implement the punishment strategy. This occurs if individual gains to deviate
from the punishment phase are smaller than the loss incurred by prolonging the punishment
by one more period. The strategy proÞle {aM , aP } is a subgame perfect equilibrium of the
supergame if and only if the two conditions are satisÞed, as made formal in the system of
incentive constraints
πdi (aM )− π(aM ) ≤ δ (π(aM )− π(aP )) , (6)
πdi (aP )− π(aP ) ≤ δ (π(aM )− π(aP )) , (7)
where π(a) denotes each Þrm�s stage proÞt when all Þrms play a, and πdi (a) is Þrm i�s proÞt from
a one-shot best deviation from the action a selected by all Þrms j 6= i, all i, with a = aM , aP .The Þrst condition says that the incentive for the initial deviation must be smaller than what is
lost due to the punishment phase. The second condition says that the incentive to deviate from
the punishment phase must be smaller than the loss incurred by prolonging the punishment by
one more period. We look for an optimal punishment action a∗P and the threshold level of the
discount factor δK such that, if a∗P is prescribed and δ ≥ δK , then Þrms can sustain collusion
11
at aM . The subscript K = B,C refers to the Bertrand or Cournot versions of the model,
respectively.
Characterization of an optimal punishment The optimal punishment action a∗P and the
threshold level of the discount factor δK are such that (6) and (7) hold with equality.6 To see
that, assume Þrst that none of the two inequalities is binding for a = a∗P and δ = δK . Since both
expressions on the right-hand side of the inequality sign are continuous in δ and monotonically
decreasing when the discount parameter is decreasing, there exists a value �δ < δK for which
the system still holds true, a contradiction. Assume now that exactly one inequality is binding.
Recalling that proÞt functions πdi (.) and π(.) are continuous in Þrms� strategies, by changing
slightly the punishment action from a∗P to �aP one can relax the binding constraint and still let
the other inequality hold. Now with δ = δK and �aP , both constraints (6) and (7) are slack,
contradicting the previously established claim. As a result both constraints must be binding
for a = a∗P and δ = δK . It follows that
δK =πdi (aM )− π(aM )π(aM )− π(aP ) . (8)
A Þnal condition is that Þrms have incentives to participate in the game after deviation also.
This occurs when the individual rationality constraint is satisÞed (Lambson, 1987), that is
π(a∗P ) +∞Xτ=1
(δK)τ πM ≥ 0. (9)
In words, the losses incurred by a Þrm in the punishment period must not overbalance the dis-
counted stream of collusive proÞts earned in the following periods. We Þnd that this constraint
is always satisÞed for the speciÞcations of the model (on this see the Appendix, sections 7.1.1.
and 7.1.2).
When Þrms detect a deviation from the collusive strategy aM , they all switch to the pun-
6This is a restatement of Abreu (1986, Theorem 15) in the new context of the present model speciÞcations.
12
ishment phase before returning to collusive proÞts. As a change in parameters (γ, n, c) impacts
each Þrm�s deviation proÞts and collusive proÞts (i.e., πdi (aM ), πdi (aP ), and π(aM )), from (6)
and (7) it also impacts the optimal punishment action a∗P and thereby the lowest discount fac-
tor δK . The following proposition gives conditions on collusion sustainability under an optimal
single-period punishment.
Proposition 1 Optimal collusion is implementable with a single-period punishment if and only
if c ≥ max{0, cK} and δ ≥ δK , where K = B,C. If K = C and n ≤ 3 then cK = 0 for all γ, n,otherwise cK > 0. In all cases δK ≤ δtrigger.
Proof See Appendix 7.1. ¥
There are several results in the proposition. As standard in the literature, we obtain that
the single-period punishment proÞts must be sufficiently valued by a deviating Þrm. This means
that the discount factor must be sufficiently close to 1, i.e. δ ≥ δK . Another result is that thelevel of marginal costs matters for collusion sustainability.
For an intuitive interpretation of this claim, recall Þrst that, in all models of tacit collusion,
a necessary condition for the sustainability of agreements is the existence of a sufficiently severe
punishment mechanism, as formalized in (6). This holds when the discount factor is high
enough and punishment proÞts are low enough. More precisely, for a given discount factor, the
higher the deviation payoffs, and the lower the collusive proÞts, the tougher the punishment
must be for collusion to remain sustainable. In a trigger penal code, a deviation implies that
Þrms stop colluding and revert to the one-shot stage game Nash equilibrium forever. A stick-
and-carrot setup authorizes a shorter and more severe punishment phase that may lead Þrms to
earn negative proÞts for some time without violating the individual rationality constraint (9).
The severity of punishment depends on the extend to which Þrms can charge a non-negative
13
price sufficiently below marginal costs. This leads to identify the following fact: the higher the
level of c, the larger the range [0, c] of below-cost non-negative prices that Þrms can possibly
charge in a punishment phase.
Secondly, note that a more severe punishment is likely to render deviation from the pun-
ishment path more proÞtable. This is captured in the present stick-and-carrot penal code, by
a second necessary condition displayed in (7), which requires that the punishment is enforced
when a previous-period deviation is observed. When punishment proÞts are negative (which
holds true with below marginal cost pricing), the deviator can be better off by interrupting
production temporarily. This occurs when goods are relatively close substitutes and the mar-
ginal cost is sufficiently high.7 We thus have another effect that may potentially countervail
the former one: the higher the marginal cost c, the more severe the maximal punishment, and
the higher the incentive to deviate from a severe punishment path.
Proposition 1 establishes that the latter effect is always dominated by the former one. This
leads to the conclusion that cmust lie above a threshold value cK for collusion to be sustainable.
Remark that this constraint on c is ineffective in the quantity-setting version of the model with
two or three Þrms. This implies that results obtained in the literature with a duopoly and a
constant marginal cost normalized to zero, all other speciÞcations remaining equal, are robust
to the introduction of a positive marginal cost. Otherwise, the constraint on c must be satisÞed
for a single-period punishment to be optimal. This result contrasts sharply with trigger penal
code models, in which one can easily check that the sustainability of collusion is not directly
connected to the level of marginal costs (at least in the linear cost setup).
7When goods are differentiated enough and/or the marginal cost is very low, a Þrm can earn positive proÞts
by deviating from the punishment strategy (on this see Appendices 7.1.1 and 7.1.2).
14
Comparative statics We now turn to the impact of a change in the differentiation parameter
γ or in the number of Þrms n on the threshold values cK and δK .
Proposition 2 For K = B,C and for all γ, n :
(i)∂cK∂γ ≥ 0, ∂δK
∂γ > 0;
(ii)∂cK∂n ≥ 0, ∂δK
∂n > 0.
Proof The sign of derivatives can be obtained by direct computation of expressions given in the
Lemmas 1 and 2 of Appendix 7.1. ¥
This proposition establishes that the threshold values cK and δK evolve in the same direction
when parameter values change. We Þnd that an increase in product differentiation (i.e., a
reduction in γ) facilitates collusive agreements, as it relaxes the constraints on c and δ. This
extends to the present model speciÞcations a previous result unveiled by Chang (1991) with a
trigger penal code, and by Hackner (1996) and Lambertini and Sasaki (2002) in a stick-and-
carrot framework with two Þrms. The driving force behind this claim is that a rise in product
substitutability renders a deviation from the collusive or punishment path individually more
proÞtable. This effect dominates all other changes in the expression of δK as displayed in (8).
This contrasts with the fact that, with a trigger penal code, δtrigger is not monotonic in γ. An
interesting and novel result relates to the level of marginal costs. When product substitutability
increases, individual deviation proÞts rise, but simultaneously punishment proÞts are lowered.
We Þnd that the magnitude of the former effect dominates the latter. In consequence, an
increase in product substitutability makes a deviation from the collusive path more likely. This
results in the strengthening of the constraint on c, that is Proposition 2(i). An increase in
the number of Þrms has similar effects, as claimed in Proposition 2(ii). Again, this is because
an individual deviation becomes most proÞtable, both with price and quantity competition,
15
whereas the reduction in punishment proÞts is relatively limited.
4 Multi-Period Punishments
In this section we focus on situations in which optimal collusion is not sustainable with a
single-period penal code. This occurs when parameter values are such that one of the two
constraints on c and δ we described in Proposition 1 is violated. To do that, we Þrst introduce
a multiple-period extension to our model, before comparing optimal symmetric punishments
with the trigger penal code equilibrium benchmark.
As in Lambertini and Sasaki (2002), consider a multi-period stick-and-carrot penal code
in which, if any deviation from aM by any Þrm is detected, all Þrms switch to a l-period
punishment phase (the stick) during which they play aP,k, with k = 1, . . . , l. In any period of
punishment, if a deviation from the punishment action by a Þrm is detected, the punishment
phase restarts for l more periods, after which all Þrms revert to the initial collusive path aM
forever (the carrot).
For Þrms not to be incentivized to deviate from the collusive path, a deviation should lead
them to earn a discounted ßow of proÞts which is less than their discounted ßow of collusive
equilibrium proÞts. This writes
πdi (aM ) +l−1Xk=1
δkπ(aP,k) + δlπ(aP,l) +
∞Xτ=l+1
δτπ(aM ) ≤∞Xτ=0
δτπ(aM ), (10)
where π(a) denotes each Þrm�s stage symmetric proÞts when all Þrms charge a, and πdi (a) is
Þrm i�s proÞts from a one-shot best deviation from the action a, as selected by all Þrms j 6= i, alli, j. For some parameter values, which satisfy conditions described in the previous section, this
inequality may hold with a single-period punishment, that is l = 1. However, a longer duration
of punishment is needed when parameter values are such that the severity of a single-period
16
punishment is not sufficient to deter deviation, in which case l > 1. In the latter situation,
we consider punishment phases in which the l− 1 Þrst periods of punishment are the toughestpossible. The most severe action, we denote by aP , is such that prices are driven down to zero.
This occurs when, in the Bertrand version of the model, all Þrms charge pP= 0, while in the
Cournot version all Þrms expand their output so that pi(qP , qP ) = 0, all i. Then in the l-th
Þnal period of punishment, Þrms play an optimal strategy aP,l. Remark that, for all l ≥ 1,
when marginal costs are positive Þrms may earn negative proÞts if they play the punishment
action aP,l.
When a Þrm deviates from the collusive strategy, a second incentive constraint must be
satisÞed so that Þrms implement the penal code. In words, the individual gain to deviate from
the punishment phase must be smaller than the loss incurred by prolonging the punishment
phase of l more periods. Formally, this writes
s−1Xk=1
δkπ (aP ) + δsπdi (a
sP ) +
s+l−1Xk=s+1
δkπ (aP ) + δs+lπ (aP,l) +
∞Xτ=s+l+1
δτπM
≤l−1Xk=1
δkπ(aP ) + δlπ(aP,l) +
∞Xτ=l+1
δτπM ,
for any period s in which a Þrm deviates from the penal code, with 1 ≤ s ≤ l (we adopt the
convention thatPs−1k=1 δ
kπ (aP ) = 0 if s = 1).
The strategy proÞle {aM , aP , aP,l} is a subgame perfect equilibrium of the supergame if andonly if (10) and (11) are satisÞed. There are l constraints in (11). The only one that will be
binding is for s = 1, since the incentive to deviate from the penal code is higher in the Þrst
period of punishment than in any subsequent period. The system of inequalities can thus be
rewritten as
πdi (aM )− π(aM ) ≤ δl (π(aM )− π(aP,l)) +¡δ(1− δl−1)/(1− δ)¢ (π(aM )− π(aP )) ,
πdi (aP )− π(aP ) ≤ δl (π(aM )− π(aP,l)) + δl−1 (π(aP,l)− π(aP )) .(12)
17
The optimal punishment action a∗P,l and the threshold level of the discount factor δK,l
are such that (12) holds with equality (the proof proceeds as in the single-period case, by
observing that for any l > 1, both expressions on the right-hand side of the inequality sign are
monotonically decreasing when δ decreases). A reorganization of terms leads to
δK,l =πdi (aM )− π(aM )πdi (aM )− πdi (aP )
, all l > 1, (13)
In this multi-period penal code, the individual rationality constraint for a deviation not to end
the game writes
l−1Xk=1
δkπ(aP ) + δlπ(a∗P,l) +
∞Xτ=l+1
δτπM ≥ 0. (14)
Remark that a shift to a longer punishment phase always results in lower intertemporal pun-
ishment proÞts. This is because π(aP,l) ≥ π(aP ) and π(aM ) ≥ π(aP,l+1). In words, for a givenperiod, the proÞts obtained in a short (i.e., l-period) punishment phase are bounded from below
by the proÞts earned in a longer (l+1-period) phase. The following proposition gives conditions
on collusion sustainability under an optimal multi-period punishment.
Proposition 3 Optimal collusion is implementable with a l-period punishment if and only if
max{0, cK,l} ≤ c ≤ max{0, cK,l−1} and δ ≥ δK,l(c), where K = B,C and l > 1. The threshold
value cK,l is strictly decreasing when l increases, but δK,l(c) is constant in l. Moreover, for all c,
there always exists a length l of punishment phase such that optimal collusion is implementable
if δ ≥ δK,l(c), with δK,l(c) ≤ δtrigger.
Proof Recall that tacit collusion is implementable if and only if (i) there exists a sufficiently
severe punishment mechanism, (ii) there are no incentive to deviate from this punishment, and (iii)
there are no incentives to drop out from the game, i.e. the individual rationality constraint (14) holds
18
true. The Þrst two conditions stated in (12) can be rewritten as
Xl (δ, c) ≤ π(aP,l, c) ≤ Yl (δ, c) , (15)
where the argument c highlights the fact that all expressions depend on the value of the marginal cost
while the lower-bound Xl (δ, c) and the upper-bound Yl (δ, c) of the last-period punishment proÞts
π(aP,l, c) are given by:
Xl (δ, c) ≡ 1
δl−11
1− δ¡πdi (aP , c)− π (aP , c)
¢− δ
1− δ π (aM , c) +1
1− δ π (aP , c) ,(16)
Yl (δ, c) ≡ π(aM , c)− 1
δl¡πdi (aM , c)− π (aM , c)
¢+
1
δl−11− δl−11− δ (π (aM , c)− π (aP , c)) .
(17)
Condition (14) can also be rewritten as Zl (δ, c) ≥ 0, where Zl increases when δ increases. Clearly,since an increase in δ relax the three constraints, if collusion is implementable with a discount factor
δK,l it is implementable for any discount factor δ ≥ δK,l. It is also plain that the lower bound of thediscount factor δK,l depends a priori on the marginal cost c since the inequality (16) must be binding
for a = a∗P,l and δ = δK,l and both Xl (δ, c) and Yl (δ, c) depends on c. In contrast, the impact of
c on collusion sustainability is less straightforward. Since Zl (δ, c) is decreasing in c, an increase in
the marginal costs makes the third constraint more stringent. Moreover, the last-period punishment
proÞts π(aP,l, c) also depend on c.
Note that, while a longer length of punishment results in lower intertemporal proÞts, the proÞts in
the last period of the punishment phase increase as the length l of the punishment phase increases. To
see that, observe that πdi (aP ) > π(aP ) implies that the lower bound of the last period punishment
proÞts Xl (δ) strictly increases with l. Since the inequality (16) must be binding for a = a∗P,l and
δ = δK,l and the critical discount factor δK,l as deÞned by equation (13) does not change with
19
l, it follows that π(a∗P,l+1) > π(a∗P,l). Note also that obvious feasibility constraints impose that
π(aP ) ≤ π(a∗P,l) ≤ π(aM ), all l.The threshold level of marginal cost cK,l is the lowest marginal cost that allow to implement
collusion by the means of a punishment period of length l. It happens to be also the highest marginal cost
that allow to implement collusion by the means of a punishment period of length l+1. It corresponds
indeed to the level of marginal costs such that both feasibility constraints π(aP ) ≤ π(a∗P,l) and
π(a∗P,l+1) ≤ π(aM ) are binding. Formally
π¡aP , cK,l
¢= −cK,lq (aP ) = Xl
¡δK,l, cK,l
¢= Yl
¡δK,l, cK,l
¢,
π¡aM , cK,l
¢= Xl+1
¡δK,l, cK,l
¢= Xl+1
¡δK,l+1, cK,l
¢.
By deÞnition, collusion implementation by the means of a l-period punishment with a∗P,l = aP is a
tantamount to collusion implementation by the means of a l+1-period punishment with a∗P,l+1 = aM .
The get the result that collusion can be implemented by the means of a l-period punishment for any
cK,l ≤ c ≤ cK,l−1, it is thus sufficient to show that, if collusion can be implemented with last periodpunishment proÞts π(aP ) < π(a
∗P,l) < π(aM ), then collusion can also be implemented by the means
of a l-period punishment for a strictly smaller and a strictly larger marginal cost c. Both Xl and Yl
depends continuously on δ and c. Moreover the difference Yl −Xl is monotonic, strictly increasing inδ. It follows that it is always possible to haveXl (δ, c) ≤ Yl (δ,c), possibly by adjusting δ, both after anincrease or a decrease in c. The only limit to this changes in the marginal cost follow from the ability to
match the value Xl¡δK,l (c) ,c
¢= Yl
¡δK,l (c) , c
¢with punishment proÞts π(aP,l, c). One can show
however that the difference π(aP,l, c)−Xl (δ,c) is continuous and increasing aP,l and continuous anddecreasing in c. Indeed,
∂ [π(aP,l, c)−Xl (δ, c)]∂c
=1
δl−11
1− δ¡qdi (aP )− q(aP )
¢− δ
1− δ q(aM )+1
1− δ q(aP )−q(aP,l) < 0.
To see that, observe that since q(aP,l) is decreasing with aP,l it is sufficient to show that the inequality
20
holds for aP,l = aM . Since q(aP ) > qdi (aP ), it is also sufficient to focus on the case l = 2. Thus the
monotonicity of π(aP,l, c)−Xl (δ, c) in c will follow from
δ <q(aP )− qdi (aP )q(aP )− q(aM )
.
As a result π(aP,l, c) − Xl (δ, c) is always decreasing in c since qdi (aP ) ≤ q(aM ).It follows that, if
collusion can be implemented by the means of a l-period punishment for level c of marginal costs, it
can also be implemented for a strictly higher (resp. lower) c as long as aP,l can be increased (resp.
decreased). One also get that cK,l < cK,l−1.
To sum up, if collusion can be implemented by using a l-period punishment for some (δ, c) , one
can deÞne δK,l+1 (c) = δK,l (c) and cK,l+1 < cK,l so that collusion can also be implemented for
any δ ≥ δK,l (c) and cK,l+1 ≤ c ≤ cK,l by using a l + 1-period punishment. The proof proceeds
by induction. Appendix 7.2 focus on the 2-period case and establishes that collusion can indeed be
sustained for δ ≥ δK,2 (c) and cK,2 ≤ c ≤ cK . In particular, the threshold value δK,l (c) = δK,2 (c)is exhibited.
It remains to demonstrate that a trigger penal code can be replicated by a stick-and-carrot penal
code. Observe that, for any particular set of values taken by each of the structural parameters (c, n, γ),
it is always possible to construct a multi-period stick-and-carrot penal code which associates to each
Þrm a stream of proÞts which, evaluated in the Þrst period of punishment, is exactly equal to the
actualized sum of proÞts earned in the punishment phase of the trigger penal code. In our notation,
this replication argument is equivalent to saying that there exists a pair (l, aP,l) such that
l−1Xk=1
δkπ(aP ) + δlπ(aP,l) +
∞Xτ=l+1
δτπM| {z }≡ΠS&C(l,aP,l)
=∞Xτ=1
δτπN| {z }≡Πtrigger
, (18)
where the subscript S&C stands for �stick-and-carrot�, aP is the most severe action (i.e., either a price
equal to zero or a quantity which drives each Þrm�s price down to zero), and l ≥ 1. To see that, it is
21
sufficient to Þnd an integer l such that ΠS&C (l + 1, aM ) = ΠS&C (l, aP ) ≤ Πtrigger ≤ ΠS&C (l, aM )holds true. By continuity of ΠS&C (l, aP,l) in aP,l, there exist a last period punishment value such
that ΠS&C (l, aP,l) = Πtrigger. Rewrite the expression on the left-hand side of the equality side in
(18) for aP,l = aM :
ΠS&C (l, aM ) =δ(1− δl−1)1− δ π(aP ) + δ
lπM +δl+1
1− δ πM . (19)
Recalling that π(aP ) ≤ 0 (by construction) together with πM > 0 makes it clear that ΠS&C¡l, aM
¢is
decreasing when l increases. Moreover, as πM > πN (by deÞnition), it follows thatΠS&C¡l = 1, aM
¢=
(δ/ (1− δ))πM > Πtrigger and liml→+∞ΠS&C¡l, aM
¢= δ/(1− δ)π(aP ) < 0 < Πtrigger. There-
fore there exists a value for l such that ΠS&C (l + 1, aM ) ≤ Πtrigger ≤ ΠS&C (l, aM ) hence a pair
(l, aP,l) such that (18) holds true. ¥
There are several results in this proposition. Firstly, the constraint on the marginal cost
c we identiÞed in Proposition 1 for the single-period punishment case Þnds a counterpart in
the multi-period setup. Secondly, the latter constraint on c is relaxed by increasing the length
of the punishment phase. In other words, a longer duration of punishment is a �substitute�
to a high marginal cost. In contrast, the constraint on the discount factor cannot be relaxed
by the increase in l since δK,l depends on c (as well as n and γ) but not on the length of the
punishment phase l. The last result of Proposition 3 consists in showing that the constraint on
marginal costs can be fully relaxed by sufficiently lengthening the punishment phase. In other
words, if δ ≥ δK,l, there exists a punishment code that allow to implement collusion for any levelof the marginal costs c. This echoes an existing result by Hackner (1996), who demonstrates
that an optimal stick-and-carrot symmetric punishment a∗P always exists (Proposition 4, p.
624) with two Þrms, constant marginal costs normalized to zero, and in a spatial differentiation
framework. Proposition 3 extends this result to the general case, that is with n ≥ 2 Þrms,
constant marginal costs c in [0, 1), and for any value taken by the exogenous differentiation
22
parameter γ in [0, 1).
Corollary 4 δK,l ≥ δK , all l > 1 , K = B,C.
Proof Since δK,l does not depend on l, it is sufficient to show that δK,l=2 ≥ δK . This is doneby comparing values of δK,l=2 and δK as given in Appendices 7.1 and 7.2. ¥
To illustrate, we now consider the simplest possible case of multiple-period punishment, that
is l = 2, to obtain the following result.
Remark 1 For K = B ,C and for all γ, n :
If K = C and n ≤ 4 then cK,l=2 = 0 for all γ, n; otherwise cK,l=2 > 0. Moreover∂cK,l=2∂γ > 0,
∂δK,l=2∂γ > 0,
∂cK,l=2
∂n > 0, and∂δK,l=2∂n > 0.
It remains to compare the role of the nature of competition on the sustainability of collusion.
We tackle this next.
5 Price vs. Quantity
We now compare the threshold values cK and δK , as deÞned in Section 3, in the Bertrand
(K = B) and Cournot (K = C) versions of the model.
Proposition 5 For all γ, n : cB > cC and δB > δC .
This proposition establishes that collusion is less easily sustainable in a price-setting oligopoly
than in a quantity-setting one. This means Þrst that, in a Bertrand setup, a longer punishment
than in the Cournot case may be required. Indeed, since cB > cC , there exist parameters (c, n, γ)
which are such that collusion is not sustainable with a single-period punishment in Bertrand,
while a single-period punishment is sufficient in Cournot. From Proposition 3, we know there
23
exists l such that cC,l ≤ c, hence optimal collusion with be implementable in Bertrand with
a multi-period punishment. Moreover, the constraint on the discount factor is more stringent
when Þrms use prices as opposed to quantities. This claim holds for any degree of product
substitutability, and for all numbers of Þrms.
This can be made intuitive by observing that the incentives to deviate in the Bertrand case
are higher than in the Cournot setup. This is because a deviating Þrm can capture the whole
demand in a price-setting oligopoly. This does not apply in a quantity-setting oligopoly, because
a unilateral expansion in some Þrm�s output cannot eliminate its rivals� demand. By contrast,
the range of punishment possibilities [−cq, πM ) is the same in the Bertrand and Cournot cases,since we use symmetric punishments. In both versions of the model, the severity of punishments
thus depends only on the level of marginal costs.
To illustrate the multi-period case, we now focus on two-period punishment phases. For the
sake of clarity, denote by cK,l=1 and δK,l=1 the threshold values deÞned in the single-period
framework.
Remark 2 For all γ, n : cB,l > cC,l; and for all c, γ, n : δB,l > δC,l0 , l ≥ l0, with l, l0 = 1, 2.
This remark establishes that the results offered in Proposition 4 are still valid in a two-
period context. It can be easily extended to more than two periods. This is because the most
severe punishments π(aP ) = −cq inßicted in the Þrst l − 1 periods are the same in Bertrandand Cournot, with aP = p
P= 0 in the price-setting case, and pi(aP , aP ) = pi(qP , qP ) = 0
in the quantity version. The difference between the Bertrand and Cournot cases is therefore
fully captured by the difference in the unilateral deviation proÞts πdi (aP,l) and the optimal
punishment proÞts π(aP,l) in the last period l, as in the single-period framework.
24
6 Conclusion
We have presented an oligopoly supergame model with differentiated goods and positive mar-
ginal costs, under Bertrand or Cournot behavior. In this setup, we are able to derive quite
general results which are policy relevant. They conÞrm that antitrust authorities should con-
centrate investigations on industries in which Þrms are few and sell highly differentiated goods.
On top of that, we argue that the level of marginal costs should be added to the list of struc-
tural factors that facilitate collusion. Another interesting Þnding is that a longer punishment
phase is a substitute for a low level of marginal costs. However, real-world circumstances may
limit the duration of below cost price wars. In this case, a low level of marginal costs would
constitute an impediment to tacitly collusive agreements.
7 Appendix
In this appendix, we determine conditions of collusion sustainability by computing the threshold
values of c and δ for all parameter values.
7.1 Single-period punishment
The optimal punishment action aP and the minimum threshold δK are deÞned by
πdi (aP )− π (aP ) = πdi (aM )− π (aM ) (20)
and
δK =πdi (aM )− π (aM )π (aM )− π (aP ) . (21)
We check that the individual rationality constraint in the punishment phase is satisÞed by
verifying that the value of discounted proÞts from the punishment period onwards is strictly
25
positive, that is
π(aP ) +δ
1− δ π(aM ) ≥ 0. (22)
Since ∂∂δ
³π(aP ) +
δ1−δπ(aM )
´> 0, it will be sufficient to check in the following that
π(aP ) +δK
1− δKπ(aM ) ≥ 0 (23)
to conclude that the individual rationality constraint is satisÞed for all δ > δK .
Both in the Bertrand and in the Cournot cases, the collusive proÞts, price and individual
quantities are respectively given by πMi = (1− c)2 /4 (1 + γ (n− 1)) , pM = (1 + c) /2 and
qM = (1− c) /2 (1 + γ(n− 1)) .
7.1.1 Bertrand case
We start by recalling the Bertrand related content of Proposition 1, and by displaying threshold
values of c and δ. Then we turn to the proof.
Lemma 6 Collusion can be sustained by a single-period punishment price if and only if c ≥ cBand δ ≥ δB, where:
cB =
c0B if 0 ≤ γ < min½2((n−3)+(n−1)
√2)
2n+n2−7 , n−3+√n2−1
3n−5
¾c1B if min
½2((n−3)+(n−1)
√2)
2n+n2−7 , n−3+√n2−1
3n−5
¾≤ γ
≤ max½n−3+√n2−1
3n−5 ;(n−3)(n2−2n+5)+(n2−1)
√(n2−2n+5)
2(12n−7n2+2n3−11)
¾c2B if γ ≥ max
½n−3+√n2−1
3n−5 ;(n−3)(n2−2n+5)+(n2−1)
√(n2−2n+5)
2(12n−7n2+2n3−11)
¾with
c1B =
�c1B for n = 2
�c1B for n > 3
26
and
δB =
116
(γ(n−3)+2)2(1−γ)(γ(n−2)+1) if 0 ≤ γ ≤ min{n−3+
√n2−1
3n−5 , γ1} and min{0, c0B} ≤ c ≤ 1 for all n
γ2(2n−3)+γ(3−n)−1(2γ−1)(1+nγ−γ) if max
nn−3+√n2−1
3n−5 ; γ2
o≤ γ ≤ 1 and c2B ≤ c ≤ 1 for all n
((2γ−1)(1+γ)−γ2)(γ−2)2(γ2−2)(γ2−4γ+2)+4γ2
√(1−γ)(γ+γ2−1) if
n−3+√n2−13n−5 ≤ γ ≤ γ2 and �c1B ≤ c ≤ 1 for n = 2
γ2 (n−1)2(γ(n−3)+2)2 if γ1 ≤ γ ≤ n−3+√n2−1
3n−5 and �c1B ≤ c ≤ f (γ) for n > 3
where γ1 and γ2 are expressions of n, and f (γ) = (1− γ) / (1 + γ (n− 2)) .
Proof: The demand function is given by
qi(pi, pj) =
1− pi if pi ≤ �pi(pj)
11+γ(n−1)
³1− 1+γ(n−2)
1−γ pi +γ(n−1)1−γ pj
´if �pi(pj) ≤ pi ≤ �pi(pj)
0 if pi ≥ �pi(pj)
,
(24)
and demand for each other symmetric Þrm j is deÞned by
qj(pi, pj) =
0 if pi ≤ �pi(pj)
11+γ(n−1)
³1− 1
1−γ pj +γ1−γ pi
´if �pi(pj) ≤ pi ≤ �pi(pj)
1−pj1+γ(n−2) if pi ≥ �pi(pj)
, (25)
with �pi(pj) =1γ [pj − (1− γ)] and �pi(pj) = 1
1+γ(n−2) (1− γ + γ (n− 1) pj).Firm i�s proÞt writes πi(pi, pj) = (pi − c) qi(pi, pj), with c < 1. This proÞt function is locally
27
concave in each market structure. One can Þnd that the best deviation proÞt for Þrm i writes
πdi (p) =
0 if p ≤ pj
[(1−γ)(1−c)+γ(n−1)(p−c)]24(1+γ(n−1))(1+γ(n−2))(1−γ) if p ≤ p ≤ pj(1−pj)γ2 ((p− c)− (1− γ) (1− c)) if p ≤ p ≤ pj¡1−c2
¢2if p ≥ pj
(26)
where pj− c ≡ − (1−γ)(1−c)
γ(n−1) , pj − c ≡ (1−γ)(1−c)(2+γ(2n−3))2(1−γ)+γ(n−1)(2−γ) , pj − c ≡ (1−c)(2−γ)
2 . Note that
pj> 0⇔ c >
(1− γ)(1 + γ (n− 2)) ≡ f(γ).
We now compute the best deviation to the collusive price. Since pj< pM < pj and pM ≤
pj ⇔ γ ≤ ¡n− 3 +√n2 − 1¢ / (3n− 5) < 1, when all other Þrms charge pM , Þrm i�s maximum
deviation proÞt is given by
πdi (pM ) =
(1−c)2[2+γ(n−3)]2
16(1+γ(n−1))(1+γ(n−2))(1−γ) if γ ≤ n−3+√n2−13n−5 ,³
1−c2γ
´2(2γ − 1) if γ ≥ n−3+√n2−1
3n−5 .
(27)
In order to solve (20), we distinguish several cases that depend on the form of πdi (pP ) and
πdi (pM ).
� Case 1. πdi (pP ) = 0 if 0 ≤ pP ≤ pj and c > f(γ)
The left-hand side term of (20) is a convex parabolic function of p that takes value 0 if either
p = c or p = 1. Since pj< c < 1, we know that it is monotone decreasing for all p ≤ p
j.
Case 1.1. γ ≤ n−3+√n2−13n−5
We Þnd:
(i) πdi (0)− π (0) > (=)¡πdi (pM )− π (pM )
¢⇔ c > (=) �c1B with
�c1B ≡ (17−10n+n2)γ2−8(3−n)γ+8−4(2+γ(n−3))√(1−γ)(1+γ(n−2))
γ2(n−1)2 .
28
Remark that �c1B is a monotone increasing function of γ, with limγ→1�c1B = 1, and that for
γ = n−3+√n2−13n−5 the difference �c1B − f (γ) is negative if n = 2, nil if n = 3 and positive if n > 3.
(ii) πdi
³pj
´− π
³pj
´< (=)
¡πdi (pM )− π (pM )
¢⇔ γ > (=)γ1 with
γ1 ≡2¡(n− 3) + (n− 1)√2¢
2n+ n2 − 7 ,
with γ1 > (=)(n−3+
√n2−1)
3n−5 ⇔ n < (=) 3. We also Þnd that �c1B = f (γ) if γ = γ1.
We conclude that (20) admits a unique solution pP ∈ (0, pj) iff n > 3, γ1 ≤ γ ≤ n−3+√n2−13n−5
and c ≥ �c1B. The threshold value of the discount factor is given by
δ1B ≡γ2 (n− 1)2
(γ(n− 3) + 2)2 ∈ (0, 1).
From (23) , the individual rationality constraint is satisÞed since one can easily check that
π(pP ) +δ1Bπ(pM )
1−δ1B= 0.
Case 1.2. γ ≥ n−3+√n2−13n−5
We Þnd:
(i) πdi (0)− π (0) > πdi (pM )− π (pM )⇔ c > c2B with
c2B ≡(2n− 1) γ2 + (3− n) γ − 1− 2γp(2γ − 1) (1 + γ (n− 1))
(2γ − 1) (1 + γ (n− 1))− γ2 ,
which is a monotone increasing function of γ, with limγ→1c2B = (
√n − 1)/(√n + 1) < 1. For
γ = n−3+√n2−13n−5 we check that �c1B = c2B and c
2B − f (γ) is negative if n = 2, nil if n = 3 and
positive if n > 3.
(ii) πdi
³pj
´− π
³pj
´<¡πdi (pM )− π (pM )
¢⇔ γ > γ2 with
γ2 ≡³(n− 3) ¡n2 − 2n+ 5¢+ ¡n2 − 1¢p(n2 − 2n+ 5)´
2 (2n3 − 7n2 + 12n− 11)
and γ2 > (=)(n−3+
√n2−1)
3n−5 ⇔ n < (=) 3 and c2B = f (γ) if γ = γ2.
29
We conclude that (20) admits a unique solution pP ∈ (0, pj) if γ ≥ max{n−3+√n2−1
3n−5 ; γ2}and c ≥ c2B. The discount factor threshold is given by
δ2B ≡γ2 (2n− 3) + γ (3− n)− 1(2γ − 1) (1 + γ(n− 1)) ∈ (0, 1).
From (23) , the individual rationality constraint is satisÞed since π(pP ) +δ2Bπ(pM )
1−δ2B = 0.
� Case 2 : πdi (pP ) = [(1−γ)(1−c)+γ(n−1)(pj−c)]24(1+γ(n−1))(1+γ(n−2))(1−γ) if sup{0, pj} ≤ pP ≤ pj
The lower bound of pP is sup{0, pj} since pP must be non-negative. The left-hand side termof (20) is a convex parabolic function of pP which takes value 0 for pP =
1+c−γ(1−c(n−2))2+γ(n−3) ≡ ÿp
only. As pj< ÿp < pj , this function is monotone decreasing on [pj , ÿp) and monotone non-
decreasing on [ÿp, pj ].
Case 2.1. γ ≤ n−3+√n2−13n−5
The only admissible solution to (20) is pP − c = 12 (1− c) 2−γ(n+1)2+γ(n−3) , with pP < ÿp < pj for all
parameter values. We Þnd pP > pj ⇔ γ < γ1 and pP > 0⇔ c > c0B with
c0B ≡−2 + γ(n+ 1)2 + γ(3n− 5) .
We check that (i) −2+γ(n+1)2+γ(3n−5) > 0⇔ γ > 2(n+1) <
n−3+√n2−13n−5 and (ii) 2
(n+1) < γ1, all n.
We conclude that there is a solution pP such that pj < pP < pj for all γ < min{γ1, n−3+√n2−1
3n−5 }and c ≥ c0B. We also Þnd
δ3B ≡1
16
(γ(n− 3) + 2)2(1− γ) (γ(n− 2) + 1) ∈ (0, 1).
From (23) ,the individual rationality constraint is satisÞed since π(pP ) +δ3Bπ(pM )
1−δ3B > 0 for all
γ < min{γ1, n−3+√n2−1
3n−5 }.
30
Case 2.2. γ ≥ n−3+√n2−13n−5
The only admissible solution pP to (20) is such that pP < pj for all parameters, pP ≥ pj ⇔[γ ≤ γ2 and c ≥ f(γ)] and pP ≥ 0⇔ c ≥ �c1B, with
�c1B ≡γ(γ − 1) +p(1− γ) (γ + γ2 − 1)
γ +p(1− γ) (γ + γ2 − 1) .
We check that �c1B ≤ f(γ) whenever n−3+√n2−13n−5 ≤ γ ≤ γ2, that �c1B = f(γ) if γ = γ2 and that
�c1B = c0B if γ =
n−3+√n2−13n−5 . We conclude that there exists a solution pP such that pj < pP < pj
whenever n−3+√n2−1
3n−5 ≤ γ ≤ γ2 and c ≥ �c1B, which is a non empty case only for n = 2. Now forn = 2, we Þnd
δ4B ≡(2γ − 1) (γ + 1)− γ2
(γ2 − 2) (γ2 − 4γ + 2) + 4γ2p(1− γ) (γ2 + γ − 1) (γ − 2)2 ∈ (0, 1).From (23) , the individual rationality constraint is satisÞed since π(pP ) +
π(pM )δ4B
1−δ4B > 0.
� Case 3 : πdi (pP ) = (1−pj)γ2 ((pj − c)− (1− γ) (1− c)) if pj ≤ pP ≤ pj
The left-hand side of (20) is increasing when p increases, all p ∈ [pj , pj ].
Case 3.1. γ ≤ n−3+√n2−13n−5
We use πdi¡pj¢ − π ¡pj¢ > πdi (pM ) − π (pM ) ⇔ γ < n−3+√n2−1
3n−5 to conclude that (20) has
no solution in the range [pj , pj ] and for γ ≤ n−3+√n2−13n−5 .
Case 3.2. γ ≥ n−3+√n2−13n−5
There is no admissible solution to (20) in the range [pj , pj ] and for γ ≥ n−3+√n2−13n−5 .
7.1.2 Cournot case
Let us start by recalling the Cournot related content of Proposition 1 before expressing the
threshold values of c and δ.
31
Lemma 7 An optimal single-period punishment quantity qp exists in any Cournot supergame
if δ ≥ δC with γ ∈ (0, 1) for n = 2, 3, or c ≥ cC and n > 3, where
cC =
c0C =
γ(n−1)−23γ(n−1)+2 if
2n−1 ≤ γ ≤ min
n21+
√2
n−1 , 1o
c1C =
³(2+γ(n−1))
√(1+γn−γ)−2(1+γ(n−1))
´³2(1+γ(n−1))+(2+γ(n−1))
√(1+γn−γ)
´ if minn21+√2n−1 , 1o≤ γ.
and
δC =
116(2+γ(n−1))21+γ(n−1) if n < 6 or if n ≥ 6 and 2
n−1 ≤ γ ≤ minn21+
√2
n−1 , 1o
³(2+γ(n−1))
√(1+γn−γ)−2(1+γ(n−1))
´³2(1+γ(n−1))+(2+γ(n−1))
√(1+γn−γ)
´ if n ≥ 6 and minn21+√2n−1 , 1o≤ γ.
Proof: The inverse demand function for Þrm i and all other symmetric Þrms j are given
by
pi(qi, qj) = max{0, 1− qi − γ(n− 1)qj}, all qj ≥ 0, j 6= i,
pj(qi, qj) = max{0, 1− γqi − (1 + γ(n− 2))qj}. (28)
Symmetric proÞts write
π(q) =
(1− q (1 + γ(n− 1))− c)q if q ≤ 1(1+γ(n−1))
−cq if q ≥ 1(1+γ(n−1))
, (29)
One can easily check that the one-shot best deviation proÞt is
πdi (q) =
14 [1− c− γ (n− 1) q]2 if q ≤ 1−c
γ(n−1) ;
0 otherwise.
(30)
Since qM < 1−cγ(n−1) for all parameter values, π
di (qM ) =
(1−c)216
(γ(n−1)+2)2(1+γ(n−1))2 . Now, the optimal
punishment strategy qP is a solution in q of
πdi (q)− π (q) = πdi (qM )− π (qM ) . (31)
There are two cases:
32
� Case 1 : πdi (q) = 14 (1− c− γ (n− 1)) qj)2 if q ≤ 1−c
γ(n−1)
Solution to (31) is qP =12 (1− c) 3γ(n−1)+2
(2+γ(n−1))(1+γ(n−1)) > 0 which is admissible if and only
if qP ≤ 1−cγ(n−1) ⇔ γ ∈ £0,min©2 ¡1 +√2¢ / (n− 1) , 1ª¢, where min©2 ¡1 +√2¢ / (n− 1) , 1ª =
1⇔ n < 6. The price pi(qP , qP ) ≥ 0 iff c ≥ c0C with
c0C ≡γ (n− 1)− 23γ(n− 1) + 2 .
This frontier c0C is upward sloping and intersects c = 0 for γ = 2n−1 . Note that we need
2n−1 < 1 ⇔ n > 3 for the frontier to be above zero. Now we compute the threshold value of δ
to Þnd
δ1C =1
16
(2 + γ (n− 1))21 + γ (n− 1) < 1.
From (23) , the individual rationality constraint is satisÞed since π(qP ) +δ1Cπ(qM )
1−δ1C > 0 ⇔ γ ∈µ2(3−2
√3)
(n−1) ,2(3+2
√3)
(n−1)
¶which is always true since γ ≤ 21+
√2
n−1 < 23+2
√3
n−1 , all n.
� Case 2 : πdi (q) = 0 if q > 1−cγ(n−1)
The only admissible solution to (31) is qP =(1−c)(2+γ(n−1))
³2+√1+γ(n−1)
´4(1+γ(n−1))2 if and only if
γ ∈h21+
√2
n−1 , 1´and n ≥ 6. We Þnd that qp ≤ (1− c) /γ (n− 1) iff c > c1C with
c1C ≡(2 + γ(n− 1))p(1 + γ(n− 1))− 2 (1 + γ(n− 1))2 (1 + γ(n− 1)) + (2 + γ(n− 1))p(1 + γ(n− 1)) .
This frontier c1C is upward sloping and we check that c1C = c
0C for γ = 2
1+√2
n−1 . Now we compute
the threshold value of δ to Þnd:
δ2C =
µγ (n− 1)
(2 + γ (n− 1))¶2< 1.
From (23) , the individual rationality constraint is satisÞed since π(qP ) +δ2Cπ(qM )
1−δ2C = 0.
33
7.2 Two-period punishment
We solve the two-period punishment case which occurs when collusion cannot be sustained by
using a single-period punishment, that is if c is below a threshold cK . We examine in turn the
Bertrand and Cournot cases.
7.2.1 Bertrand
During the Þrst period of punishment, Þrms inßict the most severe punishment, that is pP = 0.
We solve in pP,2 and δ the system
πdi (pM )− π(pM ) = δ2 (π(pM )− π(pP,2)) + δ (π(pM )− π(0)) ,
πdi (0)− π(0) = δ2 (π(pM )− π(pP,2)) + δ (π(pP,2)− π(0)) , (32)
with
πdi (0) =
[(1−γ)(1−c)+γ(n−1)(−c)]2
4(1+γ(n−1))(1+γ(n−2))(1−γ) if 0 ≤ c ≤ f (γ) ,0 if c ≥ f (γ) ,
πdi (pM ) =
(1−c)2[2+γ(n−3)]2
16(1+γ(n−1))(1+γ(n−2))(1−γ) if γ ≤ n−3+√n2−13n−5 ,³
1−c2γ
´2(2γ − 1) if γ ≥ n−3+√n2−1
3n−5 .
and
π(pM ) =(1− c)2
4 (1 + γ (n− 1)) and π(0) =−c
1 + γ (n− 1) .
We start by recalling a partial result (that appears in Proposition 2) before displaying the
thresholds values of c and δ.
Lemma 8 For n ≥ 2, an optimal two-period punishment penal code exists for cB ≥ c ≥
34
cB,l=2 (γ) and δ > δB,l=2 where:
cB,l=2 =
0 if 2n+1 ≤ γ ≤ γ4 and n = 2
c0B,l=2 if γ5 ≤ γ ≤ min©¡n− 3 +√n2 − 1¢ / (3n− 5) , γ6ª and n ≥ 3
ec0B,l=2 if γ6 ≤ γ ≤ ¡n− 3 +√n2 − 1¢ / (3n− 5) and n > 4c1B,l=2 if max
©γ4,¡n− 3 +√n2 − 1¢ / (3n− 5)ª ≤ γ ≤ γ3
c2B,l=2 if max©γ3,¡n− 3 +√n2 − 1¢ / (3n− 5)ª ≤ γ ≤ 1
and for n = 2:
δB,l=2 =
γ(c−1)2(c+1)(γ(c−3)+4(1−4)) if
2n+1 ≤ γ ≤ n−3+√n2−1
3n−5
(γ2+γ−1)(c−1)2(−1+γ)(c−1)2+γ(−2(c−1)2+(2c(c−1)+γ)γ2) if
n−3+√n2−13n−5 ≤ γ ≤ γ3
γ+γ2−1(2γ−1)(1+γ) if γ3 ≤ γ ≤ 1
and for n ≥ 3:
δB,l=2 =
−(c−1)2(n−1)γ(c+1)(γn(3c−1)−4(1−c)+γ(5−7c)) if
2n+1 ≤ γ ≤ min
nn−3+√n2−1
3n−5 , γ6
oand
min{0, c0B,l=2} ≤ c ≤ min{c0B, f (γ)}
γ2(n−1)2(2+γ(n−3))2 if γ1 ≤ γ ≤ n−3+√n2−1
3n−5 and max{f (γ) , �c0B,l=2} ≤ c ≤ �c1B
γn(2γ−1)−3γ(γ−1)−1(2γ−1)(1+γ(n−1)) if n−3+
√n2−1
3n−5 ≤ γ ≤ 1 and max{f (γ) , c2B,l=2} ≤ c ≤ c2B
(c−1)2(γ−1)(γ2(2n−3)+γ(3−n)−1)(1+γ(n−2))D if n−3+
√n2−1
3n−5 ≤ γ ≤ γ3 andc1B,l=2 ≤ c ≤ f (γ) and n = 3, 4
with D = 1+¡¡2(1− 2c) + 3c2¢n2 − 2 ¡3− 7c+ 5c2¢n+ 5− 12c+ 8c2¢ γ4 + (−3 (c− 1)2 n2 +
(15c− 13) (c− 1)n−14+30c−16c2)γ3+((1−c)2n2−9(c−1)2n+14(c−1)2)γ2+2(c−1)2(n−
35
3)γ + c2 − 2c. We verify that δB,l=2 is continuous over the range of γ.
Proof: There are two cases:
� Case 1. πdi (0) = [(1−γ)(1−c)+γ(n−1)(−c)]24(1+γ(n−1))(1+γ(n−2))(1−γ) if 0 ≤ c ≤ f (γ)
We have to distinguish two subcases according to the optimal deviation to the collusive
price.
Case 1.1. πdi (pM ) =(1−c)2[2+γ(n−3)]2
16(1+γ(n−1))(1+γ(n−2))(1−γ) if γ ≤ n−3+√n2−13n−5
Only one solution to (32) is below the collusive price and is positive if c0B,l=2 ≤ c ≤ c0B withc0B,l=2 = 0 for all γ if n = 2 and
c0B,l=2 ≡γ(n−1)((19−14n+3n2)γ2−(24−8n)γ+8)−4(2+γ(n−3))2
√(1−γ)(1+γ(n−2))
(−97+131n−59n2+9n3)γ3+16(n−2)(2n−5)γ2−8(11−5n)γ+16
for n ≥ 3, all γ. We also obtain
δ0B,l=2 =− (c− 1)2 (n− 1) γ
(c+ 1) ((n(3c− 1)− 7c+ 5) γ − 4(1− c)) .
Case 1.2. πdi (pM ) =³1−c2γ
´2(2γ − 1) if γ ≥ n−3+√n2−1
3n−5
System (32) has only one admissible solution pP . Now for n = 2, pP has four roots in c.
Two of them are negative and two are positive, we denote by c0 and c00, with 0 < c0 < c1B < c00.
We show that c0 = 0 for γ = γ4. Since pP < 0 ⇔ γ > γ4 for c = 0, we conclude that
pP > 0 ⇔ c > max {c0, 0} and we deÞne c1B,l=2 ≡ c0. For n = 3, 4, since we have to consider
only γ > γ4, we know that pP < 0 for c = 0. We also Þnd that pP > 0 for c = f(γ) and for
n−3+√n2−13n−5 ≤ γ < γ3. Eventually, for n ≥ 5, (32) has no solution in pP since c1B,l=2 > f(γ) for
all γ ≥ n−3+√n2−13n−5 . We also obtain
δ1B,l=2 =(c− 1)2 (γ − 1) ¡(2n− 3) γ2 + (3− n) γ − 1¢ (1 + γ (n− 2))
D,
36
where D is as expressed above.
� Case 2. πdi (0) = 0 if (1−γ)(1+γ(n−2)) ≡ f (γ) ≤ c ≤ cB
Again we distinguish two subcases.
Case 2.1. πdi (pM ) =(1−c)2[2+γ(n−3)]2
16(1+γ(n−1))(1+γ(n−2))(1−γ) if γ ≤ n−3+√n2−13n−5
Remark that this subcase cannot occur if n = 2, 3. Moreover, for n = 4, we have c1B,l=2 <
f (γ) and collusion can always be sustained with a two-period punishment. For n > 4, only one
solution to (32) is below the collusive price and is positive if c > �c0B,l=2 with
�c0B,l=2 ≡ A−4(2+γ(n−3))2√2(1−γ)(1+γ(n−2))((5−4n+n2)γ2+(2n−6)γ+2)
γ4(n−1)4 ,
where A =¡161− 212n+ 102n2 − 20n3 + n4¢ γ4+16 (n− 3)3 γ3+48 (n− 3)2 γ2+64(n−3)γ+
32.
Let γ6 be deÞned implicitly by �c0B,l=2 = c
0B,l=2 = f (γ6). We also obtain
δ2B,l=2 =γ2 (n− 1)2
(2 + γ (n− 3))2 .
Case 2.2. πdi (pM ) =³1−c2γ
´2(2γ − 1) if γ ≥ n−3+√n2−1
3n−5
Only one solution to (32) is below the collusive price and is positive if c > c2B,l=2 with
c2B,l=2 =B−2γ(2γ−1)(γn−γ+1)
√(4n−5)γ2+(−2n+6)γ−2
((2n−3)γ2+(−n+3)γ−1)2
where B =¡−1− 4n+ 4n2¢ γ4−2 (2n− 1) (n− 3) γ3+¡11− 10n+ n2¢ γ2−(6− 2n) γ+1.We
also obtain
δ3B,l=2 =γn (2γ − 1)− 3γ (γ − 1)− 1(2γ − 1) (1 + γ (n− 1)) .
We also show that cB2 > cB,l=22 (γ) on the range
³max
nγ2,
n−3+√n2−13n−5
o, 1ifor all n ≥ 2.
37
7.2.2 Cournot
We solve in qP,2 and δ the following system
πdi (qM )− π(qM ) = δ2 (π(qM )− π(qP,2)) + δ³π(qM )− π(qP )
´, (33)
πdi (qP )− π(qP ) = δ2 (π(qM )− π(qP,2)) + δ³π(qP,2)− π(qP )
´, (34)
where qPis the Þrst-period punishment quantity which is such that pi
³qP, qP
´= 0 which
yields to π(qP) = −cq
P. We have
πdi (qP ) =
14
³1− c− γ (n− 1) q
P
´2if q
P≤ 1−c
γ(n−1) ⇔ c < 11+γ(n−1) ,
0 otherwise.
The two-period case requires that c < cC(γ). For n = 4, 5, πdi (qP ) = 0 is impossible since
11+γ(n−1) > c
0C . For n ≥ 6, since 1
1+γ(n−1) < c1C for γ = 1, there exists a ùγ such that
11+γ(n−1) <
cC ⇔ γ > ùγ.
There are two cases:
� Case 1. πdi (qP ) = 14
³1− c− γ (n− 1) q
P
´2if 0 < c < min
n1
1+γ(n−1) , cCo
Only one solution to (33) is above the collusive quantity and is below qPif c > c0C,l=2 with
c0C,l=2 ≡γ(n−1)((3(1−n)2)γ2+(8(n−1))γ+8)−4(2+γ(n−1))2
√(1+γ(n−1))
9(n−1)3γ3+32(1−n)2γ2−40(1−n)γ+16 ,
which is below cC for all γ and n > 4. Moreover c0C,l=2 intersects1
1+γ(n−1) for n > 9. We Þnd
also
δ1C,l=2 =γ (1− c)2 (n− 1)
(1 + c) ((n(1 + 3c)− 1 + 3c) γ + 4− 4c) .
� Case 2. πdi (qP ) = 0 if 11+γ(n−1) ≤ c ≤ cC which is possible for n ≥ 6 only.
38
Here again, there is only one solution to (33) above the collusive quantity. This second-
period punishment is below qPif c > c1C,l=2 with
c1C,l=2 =E−4(2+γ(n−1))2
q2(1+γ(n−1))((1−n)2γ2+2(n−1)γ+2)
γ4(n−1)4 ,
where E = (1− n)4 γ4 − 16 (1− n)3 γ3 + 48 (1− n)2 γ2 − 64(1− n)γ + 32. We also obtain
δ2C,l=2 =γ2 (n− 1)2
(γ(n− 1) + 2)2 .
We check that c1C,l=2 ≤ cC for all γ and all n > 4, and that c1C,l=2 > 11+γ(n−1) only for n ≥ 9.
39
8 References
1. Abreu, D. (1986). �Extremal equilibria of oligopolistic supergames�, Journal of Economic
Theory, 39, 191-225.
2. Chang, M.-H. (1991). �The effects of product differentiation on collusive pricing�, Inter-
national Journal of Industrial Organization, 9(3), 453-469.
3. Collie, D. (2004). �Collusion and the elasticity of demand�, Economics Bulletin, 12(3),
1-6..
4. Deneckere, R. (1983). �Duopoly supergames with product differentiation�, Economics
Letters, 11, 37-42.
5. Deneckere, R. (1984). �Corrigenda�, Economics Letters, 15, 385-87.
6. Friedman, J. W. (1971). �A non-cooperative equilibrium for supergames�, Review of
Economic Studies, 38(1), 1-12.
7. Hackner, J. (1994). �Collusive pricing in markets for vertically differentiated products�,
International Journal of Industrial Organization, 12(2), 155-177.
8. Hackner, J. (1995). �Endogenous product design in an inÞnitely repeated game�, Inter-
national Journal of Industrial Organization, 13(2), 277-299.
9. Hackner, J. (1996). �Optimal symmetric punishments in a Bertrand differentiated prod-
ucts duopoly�, International Journal of Industrial Organization,14(5), 611-630.
10. Hackner, J. (2000). �A note on price and quantity competition in differentiated oligopolies�,
Journal of Economic Theory, 77, 181-191.
40
11. Lambertini, L. (1996). �Cartel stability and the curvature of market demand�, Bulletin
of Economic Research, 48, 329-34.
12. Lambertini, L. and Sasaki, D. (2001). �Marginal costs and collusive sustainability�, Eco-
nomics Letters, 72, 117-120.
13. Lambertini, L. and Sasaki, D. (2002). �Non-negative quantity constraints and the dura-
tion of punishment�, Japanese Economic Review, 53, 77-93.
14. Lambson, V. E. (1987). �Optimal penal codes in price-setting supergames with capacity
constraints�, Review of Economic Studies, 54(3), 385-397.
15. Majerus, D. (1988). �Price vs. quantity competition in oligopoly supergames�, Economics
Letters, 27, 293-297.
16. Osterdal, L. P. (2003). �A note on the stability of collusion in differentiated oligopolies�,
Research in Economics, 57, 53-64.
17. Ross, T. W. (1992). �Cartel stability and product differentiation�, International Journal
of Industrial Organization, 10(1), 1-13.
18. Rothschild, R. (1999.) �Cartel stability when costs are heterogeneous�, International
Journal of Industrial Organization, 17(5), 717-734.
19. Tyagi, R. K. (1999). �On the relationship between product substitutability and tacit
collusion�, Managerial and Decision Economics, 20, 293-298.
20. Wernerfelt, B. (1989). �Tacit collusion in differentiated Cournot games�, Economics
Letters, 29(4), 303-306.
41