Market Roll-out and Retail Adoption for New Brands of
Non-durable Goods
by
Bart J. Bronnenberg∗ and Carl F. Mela
2 September 2002.
∗University of California at Los Angeles, e-mail [email protected], and Duke University, [email protected]. We thank Xavier Dreze for comments on an earlier draft. Also, the comments provided by semi-nar participants at Dartmouth College, Duke University, MIT, UCLA, and the INFORMS Conference in Edmonton,Canada, are gratefully acknowledged. Finally, the authors would like to thank IRI for generously supplying the datafor this study.
1
Market Roll-out and Retail Adoption for New Brands of Non-durable Goods
Abstract
Retail distribution is a decisive early factor in new product success. This paper provides a spatialand network-diffusion model of how retail distribution for new brands of repeat purchase goodsspreads across time and markets in the United States. We focus on two aspects of this process,namely (1) the manufacturer roll-out decision, i.e., the sequence in which manufacturers enter localmarkets with new brands, and (2) chains’ decisions to adopt new brand(s). As retail chains’ tradeareas typically consist of multiple markets, these two decisions can be interrelated. For example,after a large chain approves a new brand for distribution in one market, a manufacturer may preferto roll-out to other markets within the trade area of the same chain. Hence, past chain adoption caninfluence market entry by the manufacturer. Conversely, manufacturer market entry decisions caninfluence a chain’s adoption decision via availability of the new brand. Using weekly data from 166chains and 95 domestic markets on the introduction of two very successful new brands in the frozenpizza category, Digiorno and Freschetta, we find that (1) manufacturers choose rollout markets basedon spatial proximity to markets already entered (spatial evolution), (2) manufacturers choose rolloutmarkets based on whether chains in those markets previously adopted elsewhere (selection), and (3)a typical chain adopts new brands based on the adoption decisions of competing chains within itstrade territory (network diffusion). We also find that manufacturers accelerate the rollout of newproducts once the first markets have been entered successfully. Additionally, we find manufacturersfirst introduce brands into markets where they have large category share. This conservative approachis inconsistent with at least one practitioner who claimed that early roll-out targets are marketswith a low brand development index. The combination of network diffusion, spatial effects, andmarket selection creates substantial diffusion or “installed base” effects on the distribution of newproduct distribution. We use our model to explore some implications of the U.S. geography and themultimarket location of U.S. retail chains for lead market selection in new product launch.Keywords: spatial diffusion, network diffusion, retail distribution, new products, launch strategy.
2
1 Introduction
The success of new product programs counts among the most important drivers of economic pros-
perity for manufacturers in the consumer non-durables industry. An important component of a new
product launch is obtaining distribution for a brand. In practice, distribution is obtained in phases,
via a roll-out strategy, i.e., as a sequence of manufacturer decisions of local market introductions.
Given the key role such decisions play in a new product launch, surprisingly little academic research
exists on these multimarket rollouts. In this study, we address this issue. More specifically, we model
how retailer distribution of new brands spreads across multiple markets and multiple retailers.1 The
unique aspect of our study is that we model new brand diffusion across time, geographical space,
and retailer trade areas. With the results of our study, we make recommendations about the choice
of lead markets in the rollout of new repeat purchase goods.
Focusing on the distribution aspect of new product growth abstracts from other important com-
ponents of new product strategy. To better position our study in the context of other work in this
domain we decompose —in Figure 1— new brand sales for repeat purchase goods into four underlying
components. As can be seen from this decomposition, other foci on new product growth exist, in-
cluding product assortment, product trial, and penetration depth. Because the decomposition above
is multiplicative, not additive, all aspects can be argued to be “make or break” factors in the success
of new products. Rather than dismissing these other aspects as beyond the scope of this study, a
justification to focus on the first factor is in order.
First, we focus on distribution because demand and profits for repeat purchase goods are condi-
tional on distribution being non-zero. Second, for many brand managers, obtaining retailer distri-
bution is the primary objective during the early stages of a product’s life cycle. Third, distribution
decisions are interesting in the context of repeat-purchase goods, because the decision by a retail
chain to distribute a non-durable product is often a durable commitment. Fourth, of all marketing
mix elements, distribution is studied the least in the context of consumer packaged goods. Indeed,
extant research in packaged goods focuses on “product,” “promotion,” and “price” but rarely on
“place.” Stated differently, much research on new products takes a consumer-centric perspective
and focuses on value proposition (price and product) and awareness (advertising). While these are
fundamental questions in marketing, the “place” dimension of marketing has remained understudied
1In this paper we use retailer and chain interchangably. That is, our use of retailer does not refer to a specificstore, but rather the entire set of stores operating under a chain name.
3
new brand salesit #carrying storesit=#carrying storesit
#brand SKUsit
#brand SKUsit
#trialsit
distributionproduct assort-
ment and qualityprice andpromotion
Push orientation Pull orientation
(this study)
#trialsit
new brand salesit× × ×
price andadvertising
Distributiondepth
Penetrationbreadth
Penetrationdepth
Distributionbreadth
Figure 1: Fit of the present study in new product research
empirically, notable exceptions notwithstanding (e.g., McLaughlin and Rao 1991, Montgomery 1975,
Reibstein and Farris 1995).
The scarcity of empirical work on distribution has at least two causes. First, the effects of
distribution are most readily observed during the launch of new brands when there is ample temporal
and spatial variation in distribution. However, given that the launch of new brands is a relatively
rare event, good observations of how distribution builds over time are scarce. Second, in order to
analyze the effects of distribution, spatially disaggregated data are necessary. Focusing merely on a
measure of nationally aggregated “all commodity volume” (ACV) distribution obscures the processes
underlying the evolution of product distribution.
Fortuitously, at least several new brands have been launched in the recent past and with the
introduction of multimarket store-level data by data providers such as Information Resources Inc.
(IRI) and AC Nielsen, it is possible to get high-quality information about at least two critical decisions
in the evolution of distribution for new brands. These are (1) the decision by manufacturers to enter
a specific local market (e.g., the New York market, the Los Angeles market, etc.), and (2) the first-
time adoption of a retail chain whose trade areas include such a market (e.g., Kroger or Albertsons,
respectively).
Obviously, market entry and chain adoption are linked and should be analyzed in a single compre-
hensive framework. The retailer’s adoption decision is conditioned on the manufacturer’s entry into
a local market. In turn, a manufacturer’s choice of which local market to enter is likely influenced
by the fact that markets are linked through retailer networks. For instance, a local market may be
4
a good candidate for entry if chains operating within it previously adopted elsewhere in their trade
areas. That is, Los Angeles may be an attractive market to enter for a manufacturer if Vons and
Albertsons (both high share retailers in Los Angeles) adopted the brand previously elsewhere, when
it became available in their trade areas (e.g., in Phoenix, where both chains operate).
We estimate a model of interdependent evolution of market entry and retail adoption using
multimarket store-level data for two highly successful introductions of new brands in the frozen
pizza category, Digiorno and Freschetta. We find the following stylized facts.
1. Market entry by manufacturers
(a) Local market entry is subject to “proximity” effects. Manufacturers seem to fan out from
selected lead markets and gradually move from one area of the United States to the next.
(b) In addition to these proximity effects, local market entry is also subject to a “selectivity”
effect. Manufacturers appear to enter markets of chains who have adopted in the past in
other markets.
(c) For a new brand roll-out in a given category, manufacturers first enter markets in which
their extant brands have high category share. Manufacturers seem to enter markets first
at a relatively slow pace, accelerating to faster market entry later on. Taken together,
this behavior seems conservative.
2. Adoption by retailers given market entry
(a) Adoption by a given retail chain is subject to “network” effects, i.e., past adoption by
other chains in one’s own territory tends to increase the likelihood of adopting.
(b) Adoption by a given retail chain is positively associated with the manufacturer-share in
the chain, the share of the category in the chain, and retailer size. Adoption also tends
to decelerate with time since the brand became available in a chain’s trade area.
3. Using a Monte Carlo experiment to account for spatial as well as network effects on lead market
selection, we find that good lead markets are serviced by large chains whos territories do not
overlap too much. In other words, good lead markets are on a common edge of the trade areas
of geographically separated large retail-chains.
The remainder of this paper is organized as follows. Section 2, summarizes relevant academic
research on diffusion, distribution, and retailer decisions. Section 3 gives an overview of the frozen
pizza industry. Section 4 states the spatial and network diffusion model. Section 5 contains the data
analysis. Section 6 uses the results of the data analysis to make recommendations for the choice of
lead markets and lead retailers. Section 7 concludes.
5
2 Academic research and background
Our work draws from several research streams, including academic research on diffusion, retailer
conduct, spatial statistics and sociology.
Starting with Bass (1969), marketing researchers have been interested in modeling the diffusion
of new products. Studies in this tradition have often sought to explain the category sales of durable
goods at some cross-sectionally aggregated level (e.g., the U.S. level) across time. Under almost all
circumstances, this aggregate is no longer informative about the cross-sectional processes that govern
the diffusion.2 We are interested in how new brands diffuse across markets (space) and competing
retailers (networks). Hence a key feature that distinguishes our paper from extant work on diffusion
is that, in addition to the temporal dimension of diffusion, we model diffusion across geographic
space and retailer networks.
Spatial diffusion hold that the likelihood of innovations spreading from one point to another
is inversely related to distance. This diffusion concept has been applied to research questions in
the atmospheric sciences (Niu and Tiao 1995), epidemiology (Cliff et al 1981), and spatial statistics
(Stoffer 1986). It has as of yet received little attention in marketing. This is not to say that marketing
researchers do not acknowledge the potential importance of spatial diffusion as is apparent in the
work on international diffusion by Dekimpe, Parker and Sarvary (2000), and Putsis et al (1997).
Dekimpe, Parker and Sarvary (2000) make within-country adoption of technologies dependent on
the cross-country diffusion of such technologies. In turn, cross-country adoption is not explicitly
related to pair-wise proximity measures among countries (e.g., social similarity or distance). Putsis
et al (1997) allow local adoption to depend directly on adoption in another country but the cross-
country effects are equally strong for all pairs of countries. Our contribution vis-a-vis these papers
is that we explicitly model the effects of spatial and social proximity.
Network diffusion seeks to formalize the communication links between potential adopters and
traces diffusion of innovations along these links. Network diffusion is closely related to spatial
diffusion (especially with discrete spaces) and has roots in research on innovations (e.g., Valente
1995) and —more broadly— on sociology (e.g., Wasserman and Faust 1994). The contribution of this
paper to this literature is (1) the implementation of network effects in a new product diffusion model
2An exception occurs when the non-linearities in the individual level behavior aid in identification at the aggregatelevel (see e.g., Zenor and Srivastava 1993).
6
and (2) a test of alternative operationalizations of the network of multimarket retailers spanning the
continental United States.
Another, smaller, contrast to extant diffusion models is that our interest is with brand-level
diffusion and not category-level diffusion. However, at the retailer level, multiple brands can (and
do) achieve almost complete penetration of their target markets because retailers stock multiple
brands. As a consequence, we use a diffusion model with multiple brands, each of which can strive
for distribution by all retailers.
The managerially interesting aspect of diffusion in the context of business research on innovations
is that a portion of new product sales is generated through “inexpensive” word-of-mouth, or more
colloquially, that there is a “free lunch” or “windfall” created by the installed base of the product.
This “windfall” is managerially interesting because it depends on a manager’s choice of lead users
(in terms of how many potential users they may convert to adoptees). Sociological research deals
with suitable choices of “opinion leaders” or “lead users.” Specifically, Wasserman and Faust (1994)
formalize networks of potential adopters and seek to find those individuals with the highest degree
of centrality and prestige. The concepts of actor centrality and actor prestige are then used to select
those users that are the most effective “sources” of social contagion, e.g., will help diffuse the product
the quickest (for examples see Strang and Tuma 1993).
An important albeit somewhat stylized difference between the marketing tradition and the soci-
ology tradition in innovation research is that the former has predominantly dealt with diffusion of
new products across time, whereas the latter has predominantly dealt with diffusion across actors
or cross-sectional units. Indeed, most research in the marketing tradition has ignored differences
in users in terms of their efficacy in providing word-of-mouth, whereas the research tradition in so-
ciology often uses purely cross-sectional data. Vandenbulte and Lilien (2001) show that inferences
about what constitutes social “contagion” and simply cross-sectional covariation is confounded. As
perhaps the most clearly identifiable strength of this paper, we combine both streams to model the
spatio-temporal patterns in retailer adoption of new brands.
While the current paper is not about durable goods from a consumer-centric perspective, from
a retailer perspective, the distribution of a repeat purchase good is durable. Therefore we treat the
retailer decision to adopt a brand for distribution as a durable good. McLaughlin and Rao (1991)
study retailer adoption decisions of new brands. They find that retailer acceptance of new products is
7
related to a variety of variables ranging from marketing spending by vendors to product and vendor
status. Montgomery (1975) found that the percentage of competition carrying the new brand is
important in the adoption decision. However, both studies are cross-sectional.
3 The industry
Because our application uses data from the Frozen Pizza category, it helps the interpretation of the
empirical results to briefly discuss the industry and visualize the diffusion patterns of the main new
brands in this category. In addition to the presence of new product launches over our observation
window, we selected frozen pizza industry for our analysis because we have access to managers in-
volved with the new product launch. Finally, the Frozen Pizza category is sizable and manufacturers
have a large direct sales force thereby having greater control over their roll-out strategy and timing.
3.1 General description
Consumers. The frozen pizza (FP) industry accounted for roughly $2.8 billion of sales in the year
2000 in the United States. The category has the highest penetration of all frozen prepared foods:
57 percent of all American households buy frozen pizza. Frozen pizza consumption is highest among
18-44 year olds. There is little difference in consumption rates between single and married people
and no difference between black and white ethnic groups (Holcomb 2000). The market for frozen
pizza was forecasted to grow at a compounded annual rate of 8.9 percent per year between 1997 and
2002 (Holcomb 2000). Frozen pizza consumptions tends to be higher in the Midwest than elsewhere
in the United States. Supermarket sales account for 90 percent of frozen pizza sales.
Products. Prior to 1996, most sales occurred in the so-called “regular” frozen pizzas. The leading
two brands in this market were Tombstone and Tony’s, which were marketed by Kraft and Schwan’s
respectively. The latter manufacturer also produces the Red Baron brand. Each of these two leading
brands offers a variety of recipes and crusts. Prior to 1996 Kraft’s share of the industry was 33.7
percent, whereas Schwan’s had 23.5 percent market share (Holcomb 2000).
Manufacturer actions. From personal interviews, we learned that managers involved in these
launches choose lead markets for new brands very deliberately. Among the decision variables that
are frequently mentioned are transportation cost, low brand development index (BDI)3 for the man-
3The brand development index is measured as the sales of brands relative to all commodity volume scanned in a
8
ufacturer’s other brands accompanied by a high category development index (CDI)4 in local markets,
cannibalization, and funding constraints. Obviously, retailers on these lead markets play a crucial
role in the selection of which markets to target first. According to at least one Kraft manager,
a good lead market should be to some extent isolated, i.e., it should have smaller retail accounts
(predominantly from a cost perspective). In addition, another manager made the observation that
the Digiorno product sought to expand the frozen pizza category by competing with take-out pizza,
which may or may not have played a role in the selection of lead markets.
Retailer decisions. Retailers who adopt a new product do so typically once for all markets in
which they operate. For example, Kroger will approve to distribute a new Kraft brand, e.g., Digiorno,
for all its markets at once. That is to say, an important mile-stone in the diffusion of new products
through the retailer network, is whether the retailer approves the product at the chain level.
3.2 The introduction of Digiorno and Freschetta
Timing. Both Kraft as well as Schwan’s developed and launched a “premium” frozen pizza with
rising crust. First, Kraft introduced the Digiorno brand in 1995, followed a year later by Schwans’
introduction of the Freschetta line. These two introductions are the main focus of our study. Figures
2 and 3 visualize the diffusion of supermarket distribution for these two brands in the continental
United States.
Diffusion patterns. In both instances, brands are launched in a selected number of lead markets.
In the case of Digiorno, the brand is launched in Denver, Seattle, St. Louis and Atlanta. Two of these
markets (Denver and St. Louis) belong to the top-10 metropolitan areas in frozen pizza consumption
per capita (Holcomb 2000). In the case of Freschetta, the brand is launched in Omaha, St. Louis,
Minneapolis, and Kansas City. The first three launch cities belong to the same top-10. The launch
of Freschetta seems more local than that of Digiorno in the sense that Digiorno initially spans a
large area of the US with a select number of “lead markets” and fills in the empty space between
these cities through subsequent introductions. An alternative to this policy is to first create a dense
concentration of lead markets and expand —radially— from this “base.” There appears to be some
indication that initially Freschetta uses such a policy, expanding from north to south. Regardless, in
both cases there appears to be a strong local component to sequential market entry. For instance, in
given market. A high BDI means that relative to other markets the brands sell well.4CDI is the category sales divided by total sales over all categories.
9
(a) September 1995 (b) May 1996
(c) August 1996 (d) April 1998
25% ACV 50% ACV 75% ACV 100% ACV
Figure 2: Spatio-temporal development in retail distribution for Digiorno Pizza
Figure 2, we see that Digiorno expands from the Seattle market to three neighboring markets between
May and August 1996. Likewise, in Figure 3, it can be seen that Freschetta seeks to move south
from Atlanta into the Florida markets between April and October of 1997. Ample other examples
like this can be found in the graphs.
Success of the launches. Both launches were very successful and achieved national distribution
in 1 to 2 years with Freschetta starting later but rolling out faster than Digiorno. Currently, both
brands are important profit generators for their respective parent companies.
4 Model
We develop a model to represent the manufacturer’s decision to introduce a brand in a market, and
conditional on this decision, of the chain’s decision to carry the brand. Our modeling strategy consists
10
(a) January 1997 (b) April 1997
(c) October 1997 (d) July 1998
25% ACV 50% ACV 75% ACV 100% ACV
Figure 3: Spatio-temporal development in retail distribution for Freschetta Pizza
of representing these decisions as hazards with a survival probability following a probit specification.
We use an autoregressive representation of network and spatial effects. Figure 4 shows the main
model components. It shows that the spatial and network dependencies in the model are introduced
by allowing these individual level decisions to depend on similar decisions made by actors at other
points in geographical or social space and/or at earlier points in time. For instance, we allow the
adoption of a new brand by a given retailer to depend on the past adoption decisions of “neighboring”
retailers. Along the same lines, we allow for the entry of a given market to be influenced by past
entry in “neighboring” markets. Below, we first describe and operationalize the models for market
entry and retailer adoption, and then we define the neighborhood effects. Note that the spatial unit
of diffusion is different for entry decisions (markets) than for retailer adoptions (multimarket trade
areas). Therefore their respective definitions of neighborhood effects are different.
11
chain adoption given entry in at
least one market of its trade area
Chains that haveadopted by t-1
Chains adopting at t
Markets that havebeen entered by t-1
exogenous effectson chain adoption
exogenous effectson market entry
network effectson chain adoption
spatial effectson market entry
market selectionbased on retailer
adoption
Adoption in trade-area by
chains
local market entry by the manufacturer
Markets beingentered at t
(a) mental model of diffusion (b) empirical operationalization
time delay
timedelay
timedelay
Figure 4: The main features of the model
4.1 Market entry model
Denote the presence of the brand in a market by yimt, where i = 1, ..., I indexes brands, m = 1, ...,M
indexes markets, and t = 1, ..., T indexes time. The variable yimt is discrete and assumes the value
1 if the brand is present in market m, at time t, and 0 else. Market entry is treated as an absorbing
event because the data do not cover situations where the manufacturer retracts a new brand from a
market. Each market m eventually adopts.
The basic model with which we formalize entry into market m by manufacturer i in week t is a
probit model with an absorbing state.
Pr(yimt = 1) = [Φ (Uimt)]1−yimt−1 , (1)
in which Uimt is the attraction of market m in week t for manufacturer i, and Φ is the CDF of
the standard normal distribution N (0, 1). The power in this specification implements the absorbing
nature of market entry. This is consistent with a hazard interpretation of the probit model. A direct
consequence of this model is that observations of yimt are irrelevant for inferences after the brand
has been introduced in market m.
12
A flexible model of Uimt is obtained using a variance components model that includes fixed and
random effects of brand, market and time variables. Specifically, let
Uimt = Xyimt · θ + βm + γit, (2)
so that in addition to fixed effects Xyimt · θ, we allow for random components at the market level
βm that are common across brands and time, and for random components γit that are common to
markets but vary across brands and time.
Fixed effects. With respect to fixed effects we use the following operationalizations (for a short
overview of all the variables used in the model see Table 1). First, we allow for brand specific
intercepts for the brands considered in this study.
Second, the probability of market entry may be affected by the category development index,
CDIymt. We define the category development index in this study as the weekly category dollars as a
percentage of the total weekly dollars scanned in a market. In the same vein, we believe that the
probability of market entry may be affected by the manufacturer development index, MDIyimt. This
variable is defined as manufacturer i’s dollar share of the category in market m and week t. To the
extent that transportation costs play a role in the entry of markets, we use distance to manufacturing
site DSMyim (in 1000 miles) as the covariate for this component.
Next, we define the spatial effects, SPTyimt as the spatially autoregressive effect of market entry
in neighboring markets. These neighboring markets can be coded in an M ×M matrix Ws whose
rows add to one, and whose entries [m,m′] are 0 if m and m′ are not neighbors and positive if they
are. Next, arraying the market entry variables of t− 1 across markets into the M × 1 vector yit−1,
we define SPTyimt as the mth element of the spatially and temporally lagged (see Bronnenberg and
Mahajan 2001) market entry variables.
SPTyit =
(M×1)Ws·yit−1. (3)
In practical terms, SPTyimt is the weighted average of past entry in neighboring markets. The weights
are defined in a subsequent section. We expect the spatial effects of SPTyimt on entry to be positive,
i.e., we expect entry to be close to previously entered markets.
Another variable of interest in this study is market share of chains who have adopted previously
in another market, PRVyimt. This variable is defined as the sum of market shares in market m of
chains who adopted manufacturer i′s new brand in any market prior to t. Formally, we use an M by
13
K matrix H containing the ACV share of chain k in market m. In other words, in each row m, the
matrix H lists the share of all retailers in market m. Absence of a chain k from market m equates
to an ACV share of 0. Rows of H add to unity, i.e., shares add to 1 within each market. Last, H
can be interpreted as a representation of the geographical structure of U.S. retail chains. Denote the
distribution status of brand i by zikt = 1 if chain k adopted before or in week t and zikt = 0 if the
chain did not adopt up until week t. Array across chains to obtain a K × 1 vector zit. Then, we
define PRVyimt as the mth element of
PRVyit =
(M×1)H · zit−1. (4)
PRVyimt can be interpreted as a weighted average of past adoption among retailers that are in market
m. This measure is between 0 (none of the chains in market m has adopted the brand in the past
anywhere else) and 1 (all chains on market m have already adopted the brand previously in some
other market m′ �= m).
Random effects. Equation (2) contains two random components, βm and γit. These components
allow for market specific and brand-time specific influences on market entry. In turn, each of these
components contains descriptors of markets and brand-time interactions respectively. None of the
random components can contain an intercept because such an intercept would be confounded with
the intercepts in the fixed effects (see Vines, Gilks, and Wild 1996).
For the market specific random effects, we use three relevant market descriptors. We define
the size, ACVym, of the market measured as the average weekly volume sold on a given market in
millions of dollars. Then, we define the market level concentration, HRFym, of the retail-industry as
the Herfindahl index of the market shares based on all-commodity-volume (ACV) for the retailers
in market m. Finally, we use the fraction of the size of the market, SNIym, that is sold through
retail-chains rather than independent stores.5 The variable SNIym is a proxy for how “connected” a
market is in the network of retailers. That is, the larger the market share of independent stores is in
a given market, the less ties will exist across markets because such independent stores do not have
multi-market presence.
Through the brand-time random components we allow for the probability of market entry for
each brand to be time dependent. We define the variable TSIyit as the time since manufacturer i
introduced the brand in the first local market. If, for example, manufacturer 1 first introduces a
5 SNI stands for Shares of Non-Independent chains
14
new brand in week 201 in, say, St.-Louis, MO, then the variable TSIy1t counts the weeks after that
initial introduction for market other than St.-Louis. This variable allows for increasing or decreasing
hazard rates (e.g., Johnson and Kotz 1970, p. 283) for each brand, i.e., allows for acceleration or
deceleration of market entry. To summarize the discussion above, the complete specification of the
market entry model used in this study is as follows:
Uimt = θi + θ3CDIymt + θ4MDIyimt + θ5DSMyim + θ6SPT
yimt+
θ7PRVyimt + βm + γit, i = {1, 2}
(5)
with
βm ∼ N(φ1ACV
ym + φ2HRF
ym + φ3SNI
ym, σ2
β
)γit ∼ N
(λTSIyit, σ
2γ
) . (6)
4.2 Chain adoption model
Similar to the model of market entry, we propose a probit model to represent the probability that
a retail chain adopts the brand. For each chain k, brand i, and week t we let zikt = 1 if the brand
is adopted at or before week t and zikt = 0 if the brand is not adopted. Adoption can only occur
if the brand is made available by the manufacturer in at least one market that belongs to chain k’s
territory. Formally, if we let Ck be the set of markets on which chain k operates, and we write yiCkt
be an indicator variable that assumes the value 1 if brand i is available at week t on at least one
market m in Ck and 0 in all other cases, then adoption by retailer k of manufacturer i’s brand in
week t is modeled as a probit model with an absorbing state.
Pr(zikt = 1) = yiCkt · [Φ (Vikt)]1−zikt−1 , (7)
The multiplication by yiCkt expresses that entry in a retailer’s trade area is necessary for adoption.
The power 1 − zikt−1 implements that adoption is an absorbing state (retailers do not disadopt in
our data). Defining the moment of earliest entry into the trade area of retailer k by tavailk , and the
moment of first time adoption by the retailer by tadoptk , we write
Pr(zikt = 1) =
0 if t < tavailk
Φ(Vikt) if tavailk ≤ t ≤ tadoptk
1 if t > tadoptk
. (8)
15
Thus, the relevant observations for inference fall between (1) the moment of manufacturer entry in
the retailer’s trade area and (2) adoption by the chain somewhere in its trade area (inclusive of the
moment of adoption).
Analogous to the market entry model above, we specify that Vikt contains fixed and random
effects,
Vikt = Xziktµ+ bk (9)
This model is the same as the market entry model (2) except for the fact that the time brand
interaction is absent. Rather than being influenced by the time since introduction, we believe that
retailers k may be influenced in their adoption of brand i by the time that has elapsed since the
manufacturer entered its trade area. This is not a two-way i, t interaction, but a three way interaction
between time, retailer and brand which will be specified in the fixed effects.
Fixed effects. Among the fixed effects are brand specific intercepts µi, i = 1, 2, for each of the
two brands considered in this study.
Second, we allow the time difference between brand-availability and brand-adoption at the retailer
level to be dependent on the category development index, CDIzkt, at the retailer level. This variable
is defined as the weekly retailer level sales of the entire category as a percentage of the retailer’s
total volume (for short reminders of the definitions of this and other variables, see Table 1). We
expect retailers with higher CDIzkt to adopt earlier, i.e., the effect of the category development index
of the retailer on adoption is expected to be positive. Next, we allow for effects of the manufacturer
development index, MDIzikt, at the retailer level. This variable is manufacturer i’s dollar-share of
the category with retailer k. We expect retailers with a higher share of i’s extant brands to adopt a
new brand by manufacturer i earlier than with a lower share of i’s existing brands. As argued before
acceleration or deceleration of the tendency to adopt is controlled by introducing a variable, TSAzikt,
that measures the time since the brand became available in retailer k’s trade area (in weeks). We
have no a priori expectation of the effect of time since availability on adoption.
Central to this study, we introduce a variable that allows the adoptions by chains to be related
to past adoptions by other chains. This variable, DIFzikt, is defined by specifying an autoregressive
effect of adoption by retailers that are “close” in the network of overlapping retail trade areas ( see
e.g., Bronnenberg and Sismeiro 2002). Closeness can be captured in a K × K matrix Wn (to be
defined) whose rows add to one, and whose entries [k, k′] are 0 if k and k′ do not compete in the same
16
geographic markets and positive if they do compete directly. Going back to the definition of DIFzikt,
array the K distribution variables zikt−1 at t− 1 across markets into the K × 1 vector zit−1. Next,
DIFzikt is the kth element of the spatially and temporally lagged (see Bronnenberg and Mahajan
2001) chain adoption variables.
DIFzit =
(K×1)Wnzit−1. (10)
These variables can be interpreted as weighted averages of past adoptions by competing retailers. The
weights capture the degree of influence by each direct competitor in one’s trade area. Consistent with
the central hypothesis in innovation diffusion research, the expectation is that the effect of DIFzikt
on the adoption of brand i by retailer k in week t is positive.
Random effects. The components bk contain the retail chain effects on adoption timing. As
before, the components bk are modeled as a hierarchical model with covariates. We use two retailer-
level covariates. First, we define ACVzk as the total size of the retail chain in $MM aggregated across
all markets in k’s trade area and averaged over weeks. We expect the effect of chain size on adoption
to be positive. Second, the variable HRFzk measures the degree of spatial concentration of a retailer’s
ACV over the markets on which it operates. It is measured as the Herfindahl index of total retailer
volume across markets. For example, Dominicks is a retailer whose volume is very concentrated in
Chicago and is an example of a retailer with a high HRFzk. On the other hand, Kroger is a retailer
whose total volume is spread across many markets in the Eastern United States and is an example
of a retailer with a low HRFzk.
To summarize our retailer adoption model, we use a probit model (8), with the following opera-
tionalization
Vikt = µi + µ3CDIzkt + µ4MDIzikt + µ5TSAzikt + µ6DIFz
ikt + bk, i = {1, 2} (11)
with
bk ∼ N(ψ1ACV
zk + ψ2HRF
zk, σ
2b
). (12)
Obviously, from a diffusion perspective, special interest is with the parameter µ6. If this parameter is
positive this means that retailers tend to be influenced by past adoptions of retailers that are “close”
in the retailer network
17
4.3 The representation of geographic proximity
The spatial variable SPTimt uses a weight matrix Ws. We propose to use a discrete definition of
geographical space. Therefore the weight matrix identifies which locations are neighbors. Ws is also
called a spatial lag operator (see e.g., Anselin 1988). Generally, the rows of Ws add to 1. If so, the
matrix is said to be standardized. In this study, we use a standardized spatial lag operator Ws.
The definition of a set {Vi} of neighbors of market i must follow from a suitable discretization
of geographic space. We use the simple concept of Voronoi polygons (Okabe, et al. 2000 ), which
divides geographic space (e.g., the United States) exhaustively into mutually exclusive areas around
centers (e.g., local markets such as New York, Los Angeles, etc.) whose interior points are closest to
these centers. We define as a neighbor two local markets, whose Voronoi polygons have a common
edge. Thus, a given market can not be a neighbor of itself. For an illustration of Voronoi polygons
to define spatial autoregression in multimarket data using local U.S. markets, see Bronnenberg and
Mahajan (2001). For other illustrations of using geographic neighbors in marketing models, see e.g.,
ter Hofstede, Wedel and Steenkamp (2002).
We specify two alternative definitions for the weights in Ws. Defining the number of markets in
the neighbor-set {Vi} as Ni, these definitions are
ws,N (i, j) =
1/Ni if j ∈ {Vi}
0 elseand ws,acv(i, j) =
ACVj/∑
k∈{Vi}ACVk if j ∈ {Vi}
0 else(13)
The first definition takes the influence of each market on market entry to be equal, while the second
gives higher weight to larger markets.
4.4 The representation of the retailer network and network effects
The U.S. retail industry consists of many different retail chains. Some of these chains compete
directly in the same geographical markets, while others are separated from each other by distance.
Because we wish to measure the role of retailer connectedness in the diffusion of new products, it
will be useful to propose a definition of the degree to which pairs of retailers are “linked.” A useful
definition for such network links utilizes the overlap in trade areas between two retailers as a measure
of link-strength. More precisely, let retailer r operate in a set of markets m ∈ Cr, with Cr being
its retail trade area. Denote the average dollar-sales volume per week by retailer r in market m by
ACVrm. Then, the link-strength of r′ to retailer r is defined by the former’s share of ACV in the
18
latter’s territory. Specifically, a useful definition of the link-strength of retailer r′ to retailer r is
wn,acv(r, r′) =
∑m∈Cr
ACVr′m∑r′′ �=r
∑m∈Cr
ACVr′′m
and wn(r, r) = 0 ∀r. (14)
This measure is between 0 and 1 and adds to 1 over all competitors r′ for any retail chain r. The link
wn(r, r′) is 0 for all pairs of retail-chains whose trade-areas do not overlap and becomes larger with
the degree to which the trade areas of two retailers coincide. This definition also expresses that, for
any given retail-chain, large direct competitors have more influence than small direct competitors.
An alternative definition of the link between retailer r′ and r takes into account the size of the
retailer r to express that the influence of r′ on r is larger in markets where both are large. This
definition implies that greater weight is accorded to competing chains that operate within a chain’s
core (i.e., large share) markets. This interaction can be expressed as
wn,inter(r, r′) =
∑m∈Cr
ACVr′m ·ACVrm∑r′′ �=r
∑m∈Cr
ACVr′′m ·ACVrm
and wn(r, r) = 0 ∀r. (15)
Figure 5 helps to explain these definitions. This figure represents a hypothetical situation with 3
markets, and 3 retail chains. Retailer 1 operates in markets A and C, and it faces competition from
retailer 2 in market A and from retailer 3 in market C. The definition for wn,acv above states that
the relative influence on chain 1 is equal to
wn,acv(retailer 1, retailer 1) = 0
wn,acv(retailer 1, retailer 2) =ACV2A
ACV2A +ACV3C
and (16)
wn,acv(retailer 1, retailer 3) =ACV3C
ACV2A +ACV3C
In other words, the relative influence on retailer 1 by retailers 2 and 3 is proportional to the size of
the latter retailers in the former’s trade area.6 Taking the size of the circles proportional to market
size, retailer 3 is a larger retailer than retailer 2 in retailer 1’s trade area. Our definition for wn,acv
in equation (14) then implies that retailer 3 has more influence on retailer 1’s adoption than retailer
2 has.
In practice, the influence of retailers is not symmetric. Our definition allows for this. For example,
consider two retailers: H-E-B and Albertsons. Albertsons is a retailer with a very large trade area
6 If we use the definition based on the market-specific interactions between chains, then we would find for examplethat wn,inter(retailer 1,retailer 2) =ACV2A·ACV1A/(ACV2A·ACV1A+ACV3C ·ACV1C).
19
RETAILER 1
RETAILER 1
RETAILER 2RETAILER 2
RETAILER 3
RETAILER 3
MARKET A
MARKET B
MARKET C
Figure 5: Overlapping retail territories and market links
that spans the entire West and South of the continental U.S. (see also Figure 5). Because of the
size of its trade area it faces many competitors. H-E-B on the other hand is a high share retailer
located in several Texan markets. Given its geographic concentration it faces far fewer competitors.
Therefore, among all competitors of Albertsons, H-E-B’s share is relatively small because H-E-B is
present in only a few markets of Albertsons’ trade area. On the other hand, Albertson’s is present in
every market on which H-E-B operates. Among H-E-B’s competitors Albertson’s has an ACV share
of 0.258. We take this to imply that if network effects are important, all else equal, the influence of
Albertsons on H-E-B’s decision to adopt a new brand may be higher than vice versa.
All possible combinations for pairs of retailers span a K ×K matrix Wn, as follows:
Wn =
0 wn(1, 2) · · · wn(1,K)
wn(2, 1) 0 · · · wn(2,K)...
.... . .
...
wn(K, 1) wn(K, 2) · · · 0
This sparse matrix, i.e., which contains many wn(k, k
′) = 0, represents the network of retailers as a
sociomatrix with asymmetric links (see, e.g., Wasserman and Faust 1994, ch. 4).
20
4.5 Discussion
The model as operationalized herein presents a testable account of the spatial and temporal patterns
with which two recent national rollouts took place. The model is a reduced form model containing
equations for market entry by manufacturers and retail chain adoption given availability in its trade
area.
The simple structure of the model originates from the use of time lags. The selection effect on
market entry yit (M × 1) at time t is based on the retailer adoption zit−1 (K× 1) at time t− 1. This
assumption follows from noting that the sample rate of the data is high (weekly) compared to the
decision process to enter a market or to adopt a brand for distribution. It is therefore reasonable to
assume that same-week selection effects are absent, because adoption by retailer k in week t can not
cause same-week entry by the manufacturer in the other markets on which k operates. For a similar
point regarding the modeling of events in time and space, see Cressie (1993, p. 450). Likewise,
the spatial effects on yit are captured through spatial dependence on yit−1. Finally, the network
effects relate zit to zit−1. Because our observations include weeks up to and including the moment of
adoption, the model does not contain circularities. Statistically the two equations are even unrelated.
We offer a very selective justification on the variables in the equations (many of these variables
lie at the intersection of the factors enumerated by management and the measures in the data). The
variables in the market entry models are possibly related to launch cost. Indeed, it may be cheaper
for manufacturers to concentrate on markets (1) which are concentrated (fewer retailers), (2) which
are located close to markets that were entered before, or (3) on which many retailers operate who
adopted previously. If the selection effect is important, we additionally expect manufacturers to
enter markets with small shares of independent stores early (which also implies a positive effect of
the variable that measures the market level share of multi-market retailers, SNIkm).
Finally, consistent with the model in Figure 4, the selection effects and the market entry con-
straints on adoption decisions create a feedback loop among market entry and retailer adoption. In
other words, the evolution of retailer distribution in this model is path dependent and makes that
lead market selection matters for the speed of diffusion. We investigate lead-market selection explic-
itly after the estimation results of the model are discussed. We now turn to the empirical analysis
that produces these results.
21
5 Empirical analysis
5.1 Data
The data used in this study consist of sales data from the Frozen Pizza category for a national sample
of 1900 supermarket stores across 166 retailers and 95 local markets. We use the IRI definition for
what constitutes a market. Typically, an IRI market is either a metropolitan area (e.g., New York) or
a region (e.g. West Texas/New Mexico). Retailers are designated by a retailer code. Large retailers
may have region-dependent codes. Because, our interviews with Kraft managers indicated that
even these large retailers first take a central decision in approving the brand chain wide, such region
dependent codes were unified. The sample of stores and chains are subject to attrition and expansion
in our data, which we assume to be independent of the category under investigation. Retailers for
which first time adoption is not observed (i.e., retail chains that adopted prior to entering the sample)
add no information about adoption-timing and were not used in the analysis (there are a few small
retailers for which this holds). Also, independent stores were dropped from the analysis,7 because
they do not constitute a retail chain.
The data span 5 years of weekly store-level sales data. During this time two major new brands
were launched. First, Digiorno was launched in 1995. This Kraft-Foods brand, which established
the “premium” frozen pizza segment, was very successful. A year later Freschetta was launched by
Schwan’s as Digiorno’s direct competitor. Both brands obtained national retail distribution coverage.
As explained, the dependent variables central to this study are (1) the timing of market entry and
(2) the timing of retailer adoption given availability, i.e., entry in at least one market of a retailer’s
trade area. These variables are in principle unobserved to the analyst. Nonetheless, the available
IRI data allow for high quality proxies for the timing of these two decisions. First, we define the
moment of market entry as the week of first time sales in a given market. Second, the moment of
chain adoption is taken as the earliest week across all markets belonging to a retailer’s trade area in
which sales was recorded.8 While there may be small deviations between the first time we observe
sales and the actual decision moments by manufacturers and retailers, the variation in the proxies
7 The data on independents was however used in the computation of several variables. For instance, the variableSNIym measures the percentage of ACV in each market that is moved by chains with multi-market presence.
8 In a few instances, these definitions synchronize the initial market entry by manufacturers with the initial adoptionsby retailers. However, this simultaneity is uncommon and ample empirical observations exist to identify the two stagesin the model.
22
across retailers and markets is clearly beyond any reasonable measure of observation error within
retailers and markets. For instance, across markets, entry occurs on average 62 weeks after the first
market was entered for Digiorno and 28 weeks for Freschetta. Hence, the observed variation in entry
timing is very large compared to the time it takes to make such a decision. As a consequence, we
take the observed entry to be informative about the actual entry decisions by manufacturers. Along
the same lines, the deviation across retailers in adoption timing since availability is 15 weeks for
Digiorno and 17 weeks for Freschetta.9 We therefore believe that these observed adoption times are
informative about retailer adoption timing, because it is unlikely that a delay of 3 months or more
in adoption is coincidental or manufacturer driven.
––– Table 1 about here –––
Table 1 lists some descriptive statistics of the variables used in this study. The descriptive
statistics report on averages and standard deviations over all relevant observations of market entry
yimt and retailer adoption zikt. For instance, the market level category development index, CDIymt,
averages 0.67 and has a standard deviation in the data of 0.46. Hence, on average across markets,
0.67% of all scanned items by IRI are of the frozen pizza category.
5.2 Estimation
We estimate the hierarchical variance components model in equations (5) and (11) using an MCMC
approach. To implement this estimation approach, we specify the full-conditional distributions of
all model parameters and their prior distributions. Appendix A contains the full conditionals, while
appendix B contains the MCMC algorithm. The prior distributions are of two types and are chosen
to be uninformative. For all model parameters other than variance terms, we use IID N (0, 10000)
distributions. The variance terms of the model have an IG(1, 1) prior distribution.
A couple of notes about implementation of the MCMC chain deserve mentioning. Poor mixing,
i.e., the phenomenon that the parameters meander slowly, may occur with models of the type con-
tained in equations (5) and (11). Taking for instance the model in (5), an immediate problem is
9 The large variation in the data also helps to dispell the possibility that the differences in adoption across retailersare manufacturer controlled. This would for instance happen if manufacturers initially make their brand selectivelyavailable to a subset of retailers in a given market. From a cost perspective, it makes little sense to “hold back” newbrands selectively (for as long as multiple quarters) from some retailers, given that many launch costs are forceblymade at the market level (e.g., advertising or transportation).
23
that one can add a constant to all βm and subtract this from the intercept term in the θ vector
without altering the likelihood of the data (Gilks, Richardson, and Spiegelhalter 1996; Vines, Gilks
and Wild 1996). Fixes to this problem exist such as reparametrizing the model with a zero-mean for
βm (across m) at every pass through the sampler (Vines, Gilks, and Wild 1996). Alternatives to this
so-called sweeping are hierarchical centering (Gelfand, Sahu, and Carlin 1995), or block sampling
(Chib and Carlin 1999). In our application, we control for poor mixing by avoiding the use of too
many random factors.10,11
We executed the MCMC chain for 500,000 draws, used the first 50,000 draws for burn-in and
then sampled every 50th draw from the MCMC chain for further analysis. Thus a total of 9,000
draws are used in the computation of the parameter estimates.
We estimated all 4 combinations of models with Ws based on the number of neighbors (Ws,N )
or size of neighbors (Ws,acv), and with Wn based on the local size of rival chains (Wn,acv) or the
interaction of own size with the size of rival chains (Wn,inter). In addition, to contrast our model
to one where we just care about retailer adoption, we estimated the model based on equation (11)
only. In the latter model, all spatial and temporal evolution in market entry is attributed to retailer
adoption through Wn.
5.3 Results
5.3.1 Adoption conditional on market entry. The four operationalizations of the full model
were compared using Bayes Factors. We computed these factors using the method proposed by
Gelfand and Dey (1994) and Raftery (1996). The best operationalization of the weight matrices
uses contiguity relations based on market size (Ws,acv) and the local size of rival chains (Wn,acv). A
close second is the model with spatial lags defined by Ws,N and with Wn,acv. The models with the
retailer network effects based on the interaction between own and rival chains (i.e., using Wn,inter)
performes less on the Bayes factor criterion. Estimation results for the best model are in Table 2.
Market entry The vector θ contains the regression effects of the market entry model. Both
10 We also estimated the model using an orthogonalization by demeaning the covariates of the random componentsin the model. The parameter estimates were found to be identical except for a shift in the intercept to reflect the sweapof the mean. The convergence with sweeping is markedly quicker than without.
11 For instance, estimation of more and more random factors in addition to βm and γit makes the chain mix lessand less efficiently. Contemplate, for instance, a factor δim. Such a factor is very hard to distinguish from βm becausethere are only few brands. Hence, estimates of the sum of βm and δim will be stable, but their separate estimates areconfounded in practice.
24
brands have negative intercepts, because market entry is a relatively low probability event. The
intercept for Freschetta, θ2, is higher than that of Digiorno, θ1 (although the distributions overlap
somewhat). This means that while rollout is slow, Freschetta rolls out faster than Digiorno.
The effect θ3 of the category development index, CDIymt, on market entry yimt is not different from
0. In other words, the two manufacturers do not seem to enter markets in increasing or decreasing
order of category importance (although the lead markets are high-CDI markets).
In contrast, the effect θ4 of the manufacturer development index, MDIyimt, is significant and
positive. Manufacturers have a tendency to first enter markets on which they have a large extant
share. While this seems logical, prima facie, the effect is least unexpected, on reflection. It seems
conservative of a manufacturer to launch a new brand first where it already has high shares with
extant brands. The potential for cannibalization is highest in such markets. On the other hand, if
the new brand is targeted to a new market segment that is not currently served,12 there are potential
reputation benefits in markets with high MDIyimt. This is, in turn, a plausible reason for early entry
among all possible markets.
The distance to the manufacturing site, DSMyim, has no impact on the timing of entry or the
order in which markets are entered (θ5 is not different from 0). This is not unreasonable, because
one would expect the distribution cost to impact local market prices (Anderson and de Palma 1988)
and not necessarily the order of market entry.
The values of θ6 and θ7 represent the spatial and selection effects on market entry. The value
of θ6 is positive, and hence entry of markets depends positively on whether neighboring markets
have been entered in the past. We therefore infer that manufacturers tend to launch brands close
to markets that were already entered. A possible explanation for this entry pattern is more efficient
use of multimarket resources in the distribution channel such as distribution centers, transportation
carriers, etc.
Finally, the effect θ7 of PRVyimt is also positive. This means that markets are more likely entered
if retailers that operate on them have adopted in the past (i.e., in other markets). This effect is
interpreted as a “selection” effect, because it is the manufacturers’ choice to select markets with
chains that have already approved the brand in the past. Importantly, it is this selection effect that
12 There is some support for this condition. According to a Kraft manager, Digiorno was developed and marketed tocompete with take-out pizza and not with frozen pizza brands that were already on the market. The Digiorno slogan“It’s not delivery, it’s Digiorno!” reflects this positioning.
25
creates a feedback from retailer adoption to market entry.
A potential empirical concern with these two effects is their proper separation or discriminant
validity. Such a concern would be especially valid if many retailers are common to two neighboring
markets and, conversely, if two markets that have retailers in common, are also located close to
each other. Namely, if this were the case, the effects of PRVyimt and SPTy
imt would be confounded.
However, taking into account the actual retail structure in the United States, i.e., the M ×K matrix
H, the latter statement is not generally true. Whereas neighboring markets do share retailers,
markets with common retailers do not necessarily have to be spatially close. For example, Safeway is
large in San Francisco and in Washington D.C. (see Figure 5) but these markets are separated by a
large distance. As a result the variables SPTyimt and PRVy
imt are only moderately (0.32) correlated.
Hence, the spatial effects θ6 and selection effects θ7 measure different things.
With respect to the random factors βm and γit, we discuss the effects of their covariates. First
the effect φ1 represents the impact of market size, ACVym, on the random effect βm. The parameter
φ1 is not different from 0, and hence market size does not impact βm and by extension does not
impact the timing of market entry.
Second, because φ2 < 0, we infer that concentration of the retail industry in a given market,
HRFym, has a negative significant impact on market entry. Hence, markets that are not very con-
centrated, i.e., have many retailers, are entered earlier than markets that have one very dominant
retailer. It therefore seems that during early launch among manufacturers avoid reliance on only one
or few retailers. This seems consistent with the Kraft manager who stated that lead markets were
chosen such as to avoid markets with a single large retail chain. This approach seems to make the
manufacturer less dependent on any given retailer in early success of the brand.
The effect of the share of retail chains as opposed to independent stores, SNIym, is positive, that
is, φ3 > 0. This variable measures the strength of connection among markets due to multi-market
presence of retailers. Its large positive effect seems to indicate that manufacturers seek to enter
markets early that are “central” or “well-connected” through retailers.13
Finally, the factor γit has one covariate. The effect, λ, of time since introduction, TSIyit, is positive
and significant. This suggests that the rate at which markets are entered increases with time. This
13 Interestingly, and counter to what we find, in one interview with a Kraft manager, it was suggested that earlymarkets were selected for being relatively “isolated,” so that the cost of retailer incentives would be low. We do notfind this in the data.
26
“acceleration approach” is consistent with a conservative launch strategy (i.e., starting slow).
––– Table 2 about here –––
Chain adoption Given availability, Digiorno and Freschetta are adopted at base rates that are
comparable although Digiorno is adopted slightly faster than Freschetta. However, the intercepts µ2
for Freschetta and µ1 for Digiorno are not significantly different.
Retailers adopt a brand faster if the frozen pizza category is an important one. That is to say, the
effect, µ3, of the category development index, CDIzkt, at the retailer level is positive and significant.
Hence, retailers with a larger revenue share of frozen pizza (as a % of retailer ACV) adopt a new
brand earlier than retailers with a smaller revenue share of this category. One explanation for this
effect is that retailers with a large frozen pizza category are economically more motivated to try
innovations belonging to this category than retailers with a small frozen pizza category (see also
Rogers 1983). Additionally, retailers with a large frozen pizza category are likely to have more
available cooler equipment to shelf a new brand without having to rearrange the entire category.
Retailers also adopt a brand faster if the manufacturer has a large category share with the retailer.
That is to say, µ4, the effect of manufacturer i’s development index with retailer k,MDIzikt, is positive.
Retailers will likely have a tendency to adopt the brand from a high share manufacturer because this
manufacturer appears to be the preferred one among consumers from the retailer’s perspective.
Retailer adoption decelerates with the time since the brand is available in its trade area, TSAzikt,
i.e. µ5 < 0. In other words, the longer it takes for a retailer to adopt, the less likely it is that they
ever will.
Importantly, the network-diffusion effect among retailers, µ6, is positive. This means that retailers
have an increased tendency to adopt the brand if other retailers in their trade area have done so in
the past. This effect accords with what Bass (1969) calls a “word-of-mouth” or installed base effect.
While the network diffusion effect is significant, there are other effects (retailer specific characteristics
such as MDIzikt) that have greater on chain adoption.
Next we discuss the random effects bk. These random effects have two covariates. First, the
retailer level random effects are driven by retailer size (measured by ACVzk). Because ψ1 > 0, we
infer that the larger a retailer, the higher the probability is that it will adopt early. Rogers (1983)
27
notes, supportive of this effect, that early adoption of industrial goods is related to the size of the
adopter.
Finally, the effect ψ2 of concentration of retailers across markets, HRFzk, i.e., whether a retailer
is concentrated in one or two markets (e.g. Jewel in Chicago) of its trade area, or whether its total
size is spread across many markets (e.g., Kroger) is not different from 0.
5.3.2 Adoption confounded with availability The results discussed above are obtained from
the two-stage model where retailer adoption timing is conditioned on availability. Most diffusion
models, even individual level models (e.g., Lattin and Roberts 2000), do not take this condition
into account. In our specific case, the omission of the availability condition leads to very different
inferences for the adoption behavior of retailers. Table 3 shows the estimation results of a model
which ignores availability as a necessary condition for adoption. In such a model, the attribution is
made that manufacturer delays in market entry are in fact retailer delays in adoption. This leads to
the following differences in inferences compared to the conclusions above.
––– Table 3 about here –––
First, and most importantly, the influence of network diffusion (µ6) becomes more than three
times as large as before. Most likely this is because the spatial contiguity of retail-trade areas
can substitute in part for the spatial roll-out patterns of manufacturer entry. In fact, the network
diffusion effect is now larger than the effect of for instance MDIzikt.
Second, when availability is ignored, the adoption rates no longer decrease with time but instead
increase with time. This is likely caused by the fact that markets are entered progressively by
manufacturers, (i.e., λ > 0), and that the timing of entry and the timing of adoption are now
collapsed into one variable.
Third, and finally, instead of being zero, the effect of retailer concentration, ψ2, is now negative
and significant. This means that the degree to which retailers are concentrated in one or few markets
impacts the combination of entry and adoption negatively. This spurious effect can less clearly be
traced back to a single aspect of market entry but rather is a combination of omitting the influence
of connectedness (SNIym) and market concentration (HRFym) at the market level.
28
In sum, by ignoring the marketing actions (launch strategy) of manufacturers, the role of network
diffusion is very substantially overstated. Interestingly, and in a different context Vandenbulte and
Lilien (2001) also find that taking the marketing actions of manufacturers in to account tends to
weaken estimates of the “external” or word-of-mouth influence.14 Hence, it is very important that
efforts be made to include the marketing activity of firms in models of diffusion.
6 The selection of lead markets
With every new product rollout, managers need to select lead markets. To guide such decisions, we
designed a numerical experiment that focuses on the implications of the spatial—, and the selection
effects (SPTyimt and PRVy
imt) and the network effects (DIFzikt) on lead-market attractiveness.
The experiment initiates in each of the M markets a rollout at t = t0 by making the new brand
available in that market. After initialization in a candidate lead market m = 1, ...,M, other markets
are entered, and chains k = 1, ...,K adopt, probabilistically guided by the models (5) and (11). At
the same time, the three variables above, SPTyimt, PRV
yimt and DIFz
ikt, are updated recursively based
on markets entered and retailers that adopt. To isolate the effects of the U.S. geography and the
geographical retail structure, we set the parameters for all other variables to 0, while the effects
of SPTyimt, PRV
yimt and DIFz
ikt were set at 2, 1, and 0.5 respectively (e.g., their rounded empirical
values).
Given the probabilistic simulation of market entry and retailer adoption, not all sample paths
of diffusion for a given lead market m are identical. Indeed the number of possible diffusion paths
through the 166 retailers and 95 markets presents a formidable combinatorial problem. Therefore, we
approximated the variability of sample paths by running for each candidate lead marketm = 1, ...,M
400 replications, � = 1, ..., 400. For each combination of m and �, the number of weeks, Tm�, was
stored at which all 95 markets were entered and minimally 50% of all chains had adopted.15
To report on the findings of the experiment, we order the markets m = 1, ...,M on the mean (of
a left quantile to be chosen)16 of Tm� for each m. For instance, below we report the mean of the
best 10% of the completion times Tm� for each market. Figure 6 visualizes the location of the lead
14 However, this is not universally true, see e.g., Bass, Krishnan and Jain (1994).15 Other criteria were used with similar results.16 We use a left-quantile because, in the spirit of lead market selection, interest is with the “best” diffusion paths
not with the “average” path.
29
markets which performed best on this criterion. To facilitate interpretation, these lead markets are
placed relative to the trade areas of four of the main retailers in the United States.
From both a geographical and a retailer-network standpoint Denver is an attractive lead market.
This is not because of its central location in the United States —many such “central” markets fair
poorly— although that fact does contribute. Rather, Denver is in the trade area of three major
retailers, and is on the edge (convex hull) of two of them, opening up a large set of markets in the
United States (through the selection effect of PRVyimt).
Two of the best geographical lead markets are on the spatial edge of the U.S.. The market named
“Louisiana” is large and is in the trade area of both Kroger and Albertsons. Together, these two
retailers cover the majority of the US markets. Washington DC is an attractive market in our model
because Safeway operates on it and gives the brand potential exposure on both coasts. So, whereas
Washington DC is an “edge-market” in Euclidean space, from a retailer perspective it is a more
“central” market.
Many attractive lead markets are located in the trade area of Cubs Foods. This retailer has
excellent coverage in the central and Northern United States. In addition, the trade areas of retailers
in the East, South, and West of the United States all connect through the trade area of Cubs Foods,
giving it a high degree of what is called “betweenness centrality” (Wasserman and Faust 1994, p.
189-191).
The results for the average of other quantiles of Tm� are reasonably robust. For instance, the
correlation between the average completion time based on the 10th percentile correlates in excess of
0.8 with the average based on the 25th percentile. While the exact ordering of markets on average
completion time may change somewhat, markets tend to perform consistently well or consistently
poor. For example, the unique location of the Denver-market, central both from a geographic as
well as a retailer-network viewpoint, makes it the best market in any quantile of the distribution
of completion times. Other markets that score consistently high are Washington DC, and Omaha.
The Louisiana market tends to loose some ground in other quantiles, i.e., there is more variability
in the completion times than with other markets. This is because of the possibility to take an initial
step into the Florida markets which is an edge both geographically as well as in the retail-network.
A market that gains ground in other quantiles is the El-Paso/Albuquerque market (located directly
below Denver in Figure 6) which is in the Safeway and Albertsons trade-areas.
30
Figure 6: Lead markets and trade areas in the continental United States
A caveat of our simulation results is that the values of the parameters guiding the simulation are
estimated from the observed paths of manufacturer roll-outs and chain adoptions. It is implicitly
assumed that estimated manufacturer policies for the targeting of markets are invariant to the path.
Given the complexity of a formal optimization in a dynamic, spatial, and stochastic setting, the
condition that managers are not very strategic is likely a good approximation.17 Thererfore, the
results of this experiment can be interpreted as an exploration of several reasonable assumptions
about how manufacturers role out new brands and how retailers adopt them in the context of lead-
market selection.
7 Conclusion
We present and estimate a model of diffusion of retail distribution for new brands of repeat purchase
items in the United States. This model studies the important element of retail distribution. It
represents the evolution of retail distribution as an interactive sequence of manufacturer’s decisions
to enter local markets and retail chains’ decisions to adopt the brand.
17 For instance, our discussions with the managers involved in the launch of one of the brands suggest that the rolloutsequence is heuristically determined using the factors we model, as opposed to “optimized.”
31
In the confines of our empirical data, market entry by manufacturers is positively affected (1) by
the existing share of the manufacturer in the market at hand, (2) by spatial proximity to markets
already entered in the past, and (3) by the share of retailers who have adopted in the past (on other
markets). Further, markets entered early tend to contain many retailers (are not concentrated) with
multimarket trade areas (as opposed to independent stores). Finally, markets are entered in an
accelerated fashion, i.e., slow at first and faster later on. These results are suggestive of the fact that
manufacturers in our empirical application seem conservative. Their early focus in the new product
launch is on “home-markets” on which they are already large, and they seem to roll out slowly at
first.
Adoption of the brand by retailers is positively affected (1) by the importance of the category
to the retailer, (2) by the size of the manufacturer in the retailer’s outlets, and (3) by the adoption
of the brand by rival retailers. In addition, adoption is positively associated with retailer size and
negatively with time since availability. Hence retailers seem to adopt because of the importance of
the manufacturer (the combined effect of CDIzkt and MDIzikt) and because of competitive pressures
(the effect of DIFzikt).
Taken together, our paper suggests the presence of spatial—, selection—, and network effects in the
diffusion of new product distribution. These concepts have a broader applicability than the strict
interpretation in which they are applied here. For instance, in terms of future research, the most
obvious empirical extension of this paper is to model the evolution of other key performance variables
in new product launch. Candidates for such other variables are the three remaining ratios in Figure
1, i.e., assortment breadth, consumer trial, and consumer repeat purchase for new brands of repeat
purchase goods. Given the findings in this paper, another plausible extension is to conduct a more
formal dynamic spatial stochastic optimization of roll-out paths for the manufacturer.
We investigate lead market selection when chains approve new brands based on contagion from
other chains, and that manufacturers tend to roll the product out geographically. While we acknowl-
edge other selection criteria for lead markets, we find that markets that are located “central” in the
network of retail-chains, or on the edge of multiple large chains are especially appealing candidates,
all other factors held constant.
A potential limitation of the paper is caused by the fact that we have no data on manufacturer
incentives offered to the retailers. At the same time, the success of the brand with rival retailers
32
seems to be a more important and enduring incentive to adopt a brand than a one-time manufacturer
incentive. In sum, while manufacturer incentives may play a role initializing new product distribution,
it is unlikely that they replace the network effects found in this paper.
Finally, to our knowledge this paper is the first to empirically model spatial or network diffusion
in marketing. We hope that the current paper represents a constructive step towards modeling
many other contexts in which spatial and network dependencies may play a role. Such contexts
range from viral marketing applications and fashions or fads, to the diffusion of new technologies
and new products.
33
8 References
Anderson, Simon P. and Andre de Palma (1988), “Spatial Price Discrimination with
Heterogenous Products,” Review of Economic Studies, 55, 573-592
Anselin, Luc, (1988), “Spatial Econometrics: Methods and Models,” Kluwer Academic
Publishers, August 1988
Bass, Frank M. (1969), “A New Product Growth Model for Consumer Durables,” Man-
agement Science, 15(5), 215-227
––––—, Trichy Krishnan, and Dipak Jain (1994), “Why the Bass Model Fits Without
Decision Variables,” Marketing Science, 13:3, 203-223
Bronnenberg, Bart J. and Vijay Mahajan (2001), “Unobserved Retailer Behavior in Mul-
timarket Data” Joint Spatial Dependence in Market Shares and Promotion Variables,”
Marketing Science, 20:3, 284:299
––––— and Catarina Sismeiro (2002), “Using Multimarket Data to Predict Brand Per-
formance in Markets for which No or Poor Data Exist,” Journal of Marketing Research,
39, February, 1-17
Chib, Siddhartha, and Bradley P. Carlin (1999), “On MCMC Sampling in Hierarchical
Longitudinal Models,” Statistics and Computing, 9 17-26
Cliff, A.D., P. Haggett, J.K. Ord, G.R. Versey (1981), “Spatial Diffusion: An Historical
Geography of Epidemics in an Island Community,” Cambridge University Press, Cam-
bridge, London UK
Cressie, Noel (1993), “Statistics for Spatial Data”, Wiley and Sons, New York, NY
Dekimpe, Marnik G., Philip M. Parker, and Miklos Sarvary (2000), “Global Diffusion
of Technological Innovations: A Coupled Hazard Approach,” Journal of Marketing Re-
search, 37 (February), 47-59
Gelfand, A.E. and D.K. Dey (1994), “Bayesian Model Choice: Asymptotics and Exact
Calculations,” Journal of the Royal Statistical Society, Series B (Methodological), 56:3,
501-514
Gelfand, A.E., S.K. Sahu and B.P. Carlin (1995), “Efficient Parametrizations for Normal
Linear Mixture Models,” Biometrika, 82:3, September, 479-488
Gilks, W.R., S. Richardson, and D.J. Spiegelhalter (1996), “Markov Chain Monte Carlo
in Practice,” Chapman & Hall, New York, NY.
Hofstede, Frenkel ter, Michel Wedel, Jan-Benedict E.M. Steenkamp (2002), “Identifying
Spatial Segments in International Markets,” Marketing Science, 21:2, 160-177
Holcomb, Rodney B. (2000), “Food Industry Overview: Frozen Pizza ,” OSU Food Tech-
nology Facts, FAPC-103, Food and Agricultural Product Center, Oklahoma State Uni-
versity
Johnson, Norman L and Samuel Kotz (1970), “Continuous Univariate Distributions - 2,”
Wiley and Sons, New York, NY
Lattin, James M., and John H. Roberts (2000), “The Application of an Individual Level
Diffusion Model Prior to Launch,” working paper, Stanford Graduate School of Business
34
McLaughlin and Vithala Rao (1991), “Decision Criteria for New Product Acceptance and
Success,” Quorum Books, New York, NY
Niu, X.F. and George C. Tiao (1995), “Modeling Satellite Ozone Data,” Journal of the
American Statistical Association, 90, no 431, 969-983
Montgomery, David B. (1975), “New—Product Distribution, An Analysis of Supermarket
Buyer Decisions, Journal of Marketing Research, 12 (August), 255-264
Okabe, Atsuyuki, Barry Boots, Kokichi Sugihara, and Sung Nok Chiu (2000), “Spatial
Tessellations: Concepts and Applications of Voronoi Diagrams,” 2nd edition, Wiley and
Sons, New York, NY.
Putsis, William P. Jr., Sridhar Balasubramanian, Edward H. Kaplan, and Subrata K. Sen
(1997), “Mixing Behavior in Cross-Country Diffusion,” Marketing Science, 16:4, 354:369
Stoffer, D.S. (1986), “Estimation and Identification of Space-Time ARMAXModels in the
Presence of Missing Data,” Journal of the American Statistical Association, 81, 762-772
Strang, David, and Nancy Brandon Tuma (1993), “Spatial and Temporal Heterogeneity
in Diffusion,” American Journal of Sociology, 99:3, 614-639
Reibstein, David and Paul Farris (1995), “Market Share and Distribution: A General-
ization, A Speculation, and Some Implications,” Marketing Science, 14:3 (part 2), G130-
G202
Rogers, Everett M. (1983), “Diffusion of Innovations,” 3rd edition, Free Press, New York,
NY
Valente, Thomas W. (1995), “Network Models of the Diffusion of Innovations,” Hampton
Press, Cresskill, NJ
Vandenbulte, Christophe, and Gary Lilien (2001), “Medical Innovation Revisited: Social
Contagion vs. Marketing Effort,” American Journal of Sociology, 106:5, 1409-1435
Vines, S.K., W.R. Gilks and P. Wild (1996), “Fitting Bayesian Multiple Random Effects
Models,” Statistics and Computing, 6, 337-346.
Wasserman, Stanley and Katherine Faust (1994), “Social Network Analysis: Methods and
Applications,” Cambridge University Press, New York, NY
Zenor, Michael J. and Rajendra K. Srivastava (1993), “Inferring Market Structure with
Aggregate Data: A Latent Segment Logit Approach,” Journal of Marketing Research, 30
(August), 369-379
35
A Full conditional distributions
A.1 The market entry model.
• [yimt|yimt,rest]
The first distribution to specify is that of the latent probit variables yimt such that yimt > 0
if and only if yimt = 1 and yimt ≤ 0 if and only if yimt = 0. The variables yimt conditional
on the outcomes yimt have truncated Normal distributions. Specifically, we obtain for all
weeks that fall between the moment of global launch of the brand and entry in m, i.e., for
Tglobal launch of i ≤ t ≤ Tenter in m, that
[yimt|yimt, rest] ∼ N (Xyimt · θ + βm + γit, 1)
left-truncated at 0 if yimt = 1
right-truncated at 0 if yimt = 0(A.1)
• [θ|rest]
Define yθimt by yimt − βm − γit. Construct the array yθ and the matrix Xy by stacking over
brands, markets, and time. The full conditional for θ is proportional to
[θ|rest] ∝ [θ|yθ,Xy][θ0]
with [θ|yθ,Xy] ∼ N((Xy′Xy)−1
Xy′yθ, (Xy′Xy)−1)and prior [θ0] ∼ N (0,Vθ0) . This conju-
gate pair leads to the following full conditional
[θ|rest] = N (mθ,vθ)
mθ = vθXy′yθ (A.2)
vθ =(V−1
θ0 +(Xy′Xy
)−1
)−1
• [β|rest]
Define yβimt = yimt−Xyimt·θ−γit. Array all covariates of the hierarchical model for βm in a 1×Pβ
row vector Xβm (in the operationalization below equation (5) Xβm = [ACVym HRFy
m SNIym].
Array these covariates further into the M × Pβ matrix Xβ = [X′
β1 · · ·X′
βm · · ·X′
βM ].′ Finally,
define an index matrix Im of size Ny ×M which maps each observation yimt into m. Then, the
vector β of random factors βm has the following full conditional distribution.
[β|rest] = N (mβ,vβ) (A.3)
mβ = vβ
(I ′
myβ+1
σ2β
Xβφ
)
vβ =
(I ′
mIm +1
σ2β
IM
)−1
• [φ|rest]. The full conditional for φ is proportional to
[φ|rest] ∝ [φ|β,Xβ][φ0]
36
with [φ|β,Xβ] ∼ N
((X′
βXβ
)−1
Xββ,(X′
βXβ
)−1
)and prior [φ0] ∼ N (0,Vφ0) . This conju-
gate pair leads to the following full conditional
[φ|rest] = N (mφ,vφ) (A.4)
mφ = vφXββ
vφ =
(V−1
φ0 +1
σ2β
X′
βXβ
)−1
• [σ2β|rest]
The full conditional distribution for the variance σ2β is proportional to the product of the
distribution [β] = NM
(Xβφ, σ
2βIM
)and the prior [σ2
β] = IG(qβ, rβ),i.e.,
p(σ2β) =
rqββ
Γ(qβ)
(σ2β
)−(qβ+1)
e−rβ/σ2
β
This distribution and the multivariate Normal distribution forms a conjugate pair and results
in
[σ2β|rest] = IG
(qβ +
M
2, rβ +
1
2(β −Xβφ)
′(β −Xβφ)
)(A.5)
• [γ|rest]
Define yγimt = yimt − Xyimt · θ − βm. Array all covariates of the hierarchical model for γit in
a 1 × Pγ row vector Xγit (in the operationalization below equation (5) Xγit = [TSIyit]. Array
these covariates further into the M × Pγ matrix Xγ = [X′
γ11 · · ·X′
γ1T1X′
γ21 · · ·X′
γ2T2].′ Finally,
define an index matrix Iit of size Ny × (T1 + T2) which maps each observation yimt into i, t.
Then, the vector γ of random factors γit has the following full conditional distribution.
[γ|rest] = N (mγ ,vγ) (A.6)
mγ = vγ
(I ′
ityγ+
1
σ2γ
Xγλ
)vγ =
(I ′
itIit +1
σ2γ
IM
)−1
• [λ|rest]. The full conditional for λ is proportional to
[λ|rest] ∝ [λ|γ,Xγ ][φ0]
with [φ|γ,Xγ ] ∼ N((
X′
γXγ
)−1
Xγγ,(X′
γXγ
)−1
)and prior [φ0] ∼ N (0,Vφ0) . This conjugate
pair leads to the following full conditional
[φ|rest] = N (mφ,vφ) (A.7)
mφ = vφXγγ
vφ =
(V−1
φ0 +1
σ2γ
X′
γXγ
)−1
37
• [σ2γ |rest]
The full conditional distribution for the variance σ2γ is proportional to the product of the
distribution [γ] = NM
(Xγφ, σ
2γIT1+T2
)and the prior [σ2
γ ] = IG(qγ, rγ). This distribution and
the multivariate Normal distribution forms a conjugate pair and results in
[σ2γ|rest] = IG
(qγ +
T1 + T22
, rγ +1
2(γ −Xγλ)
′(γ −Xγλ)
)(A.8)
A.2 The chain adoption model.
• [zikt|zikt,rest]
As above, we specify is the distribution of the latent probit variables zikt such that zikt > 0 if
and only if zikt = 1 and zikt ≤ 0 if and only if zikt = 0. The variables zikt|zikt are distributed
truncated Normal. Specifically, for all weeks that fall between the moment of trade area
availability and adoption of the brand by retailer k, i.e., for Tavail,k ≤ t ≤ Tadopt,k, that
[zikt|zikt, rest] ∼ N (Xzikt · µ+ bk, 1)
left-truncated at 0 if zikt = 1
right-truncated at 0 if zikt = 0(A.9)
• [µ|rest]
Define zµikt by zikt − bk. Construct the array zµ and the matrix Xz by stacking over brands,
retailers, and time. The full conditional for µ is proportional to
[µ|rest] ∝ [µ|zµ,Xz][µ0]
with [µ|zµ,Xz] ∼ N((Xz′Xz)−1
Xz′zµ, (Xz′Xz)−1)and prior [µ0] ∼ N (0,Vu0) . This conju-
gate pair leads to the following full conditional
[µ|rest] = N (mµ,vµ) (A.10)
mµ = vµXz′zµ
vµ =(V−1
µ0 +(Xz′Xz
)−1
)−1
• [b|rest]
Define zbikt = zikt − Xzikt · µ. Array all covariates of the hierarchical model for bk in a 1 × Pb
row vector Xbk (in the operationalization below equation (11) Xbk = [ACVzk HRFz
k]. Array
these covariates further into the K × Pb matrix Xb = [X′
b1 · · ·X′
bk · · ·X′
bK ].′ Finally, define an
index matrix Ik of size Nz ×K which maps each observation zikt into k. Then, the vector b of
random factors bk has the following full conditional distribution.
[b|rest] = N (mb,vb) (A.11)
mb = vb
(I ′
kzb+
1
σ2b
Xbψ
)vb =
(I ′
kIk +1
σ2b
IK
)−1
38
• [ψ|rest]. The full conditional for ψ is proportional to
[ψ|rest] ∝ [ψ|b,Xb][ψ0]
with [ψ|b,Xb] ∼ N((X′
bXb)−1
Xbb, (X′
bXb)−1
)and prior [ψ0] ∼ N (0,Vψ0) . This conjugate
pair leads to the following full conditional
[ψ|rest] = N (mψ,vψ) (A.12)
mψ = vψXbb
vφ =
(V−1
ψ0 +1
σ2b
X′
bXb
)−1
• [σ2b |rest]
The full conditional distribution for the variance σ2b is proportional to the product of the distri-
bution [b] = NM
(Xbψ, σ2
bIK)and the prior [σ2
b ] = IG(qb, rb). The full conditional distribution
makes use of this conjugate pair and results in
[σ2b |rest] = IG
(qb +
K
2, rb +
1
2(b−Xbψ)′(b−Xbψ)
)(A.13)
B The MCMC algorithm
The full conditional distributions are all closed form. Hence, the algorithm uniquely consists of Gibbs
steps. Draws from the joint posterior distribution of the parameters are obtained by passing through
the following conditional distributions while updating the parameters on which these depend by the
most recent posterior draws.
1. Market entry model
(a) draw from [yimt|yimt,rest] (see equation A.1)
(b) set yθimt = yimt − βm − γit, draw from [θ|rest] (see equation A.2)
(c) set yβimt = yimt −Xyimt · θ − γit, draw from [β|rest] (see equation A.3).
(d) draw from [φ|rest] (see equation A.4)
(e) draw from [σ2β|rest] (see equation A.5)
(f) set yγimt = yimt −Xyimt · θ − βm, draw from [γ|rest] (see equation A.6)
(g) draw from [λ|rest] (see equation A.7)
(h) draw from [σ2γ|rest] (see equation A.8)
2. Retailer adoption model
(a) draw from [zikt|zikt,rest] (see equation A.9)
(b) set zµikt = zikt − bk, draw from [µ|rest] (see equation A.10)
(c) set zbikt = zikt −Xzikt · µ., draw from [b|rest] (see equation A.11)
(d) draw from [ψ|rest] (see equation A.12)
(e) draw from [σ2b |rest] (see equation A.13)
39
Table 1: Description of the variables
Model Variable name units mean std
category development index CDIymt %cat. salesmt/ACVm 0.67 0.46
manufacturer development index MDIyimt mfr. salesimt/cat.salesmt 0.25 0.14
distance to manufacturing site DSMyim 103 Miles 0.70 0.46
market spatially lagged entry SPTyimt [ ] 0.16 0.24
entry share of previous adopters PRVyimt % market ACV 0.35 0.34
market size ACVym MM$/week 4.19 3.38
retailer concentration in market HRFym [ ] 0.31 0.15
share of multi-market chains SNIym % market ACV 0.88 0.15
time since introduction TSIyit weeks 56.39 37.04
category development index CDIzkt %cat.saleskt/ACVk 0.50 0.37
manufacturer development index MDIzikt mfr. salesikt/cat.saleskt 0.16 0.16
retailer time since availability TSAzikt weeks 34.79 39.64
adoption network lagged adoption DIFzikt [ ] 0.77 0.27
retailer size ACVzk MM$/week 2.20 4.10
market concentration in retailer HRFzk [ ] 0.74 0.30
40
Table 2: Estimation results of the full model
percentile 2.5% 50% 97.5%
symbol variable name Market Entry
θ1 Digiorno DIGy -7.758 -6.331 -5.095
θ2 Freschetta FREy -6.516 -5.192 -4.016
θ3 category development index CDIymt -0.262 0.064 0.392
θ4 manufacturer development index MDIyimt 0.651 1.373 2.093
θ5 distance to the site of manufacturer DSMyim -0.253 0.063 0.368
θ6 spatial proximity SPTyimt 1.174 1.575 1.990
θ7 share of previous adopters PRVyimt 0.530 0.925 1.347
φ1 all commodity volume ACVym -0.080 -0.027 0.023
φ2 Herfindahl index HRFym -2.548 -1.337 -0.203
φ3 share of multimarket chains SNIym 0.971 2.213 3.633
λ time since introduction TSIyit 0.011 0.018 0.025
σ2β variance market component 0.166 0.304 0.540
σ2γ variance time component 0.399 0.651 1.085
Chain Adoption
µ1 Digiorno DIGz -3.044 -2.526 -2.041
µ2 Freschetta FREz -3.148 -2.634 -2.155
µ3 category development CDIzkt 0.332 0.528 0.730
µ4 manufacturer development index MDIzikt 1.435 1.906 2.404
µ5 time since availability TSAzikt -0.010 -0.005 -0.001
µ6 network effect on chain adoption DIFzikt 0.143 0.483 0.827
ψ1 all commodity volume ACVzk 0.036 0.074 0.113
ψ2 Herfindahl index HRFzk -0.229 0.219 0.684
σ2b variance chain component 0.216 0.344 0.531
41
Table 3: Estimation results ignoring market entry
percentile 2.5% 50% 97.5%
symbol variable name
µ1 Digiorno DIGz -4.442 -3.954 -3.450
µ2 Freschetta FREz -4.037 -3.581 -3.086
µ3 category development index CDIzkt 0.206 0.372 0.530
µ4 manufacturer development index MDIzikt 1.154 1.529 1.909
µ5 time since availability TSAzikt 0.004 0.007 0.009
µ6 network effect on chain adoption DIFzikt 1.463 1.738 2.047
ψ1 all commodity volume ACVzk 0.055 0.091 0.125
ψ2 Herfindahl index HRFzk -1.012 -0.466 -0.074
σ2b variance retailer 0.234 0.364 0.545
42
Top Related