Sales Forecasting with Financial Indicators and...

Sales Forecasting with Financial Indicators and Experts’ Input

Vishal Gaur∗, Nikolay Osadchiy†, Sridhar Seshadri‡

October 23, 2008

Abstract

The volume of retail sales is commonly understood to be correlated with the state of the

economy. This information can potentially be employed in demand forecasting, operations

decisions, and risk management. We propose a model in which the total sales of a retailer

is a function of sales forecasts generated by equity analysts, the term of the forecast, and

the return on an aggregate financial market index over the term of the forecast. We test

this model on a panel of 4,698 observations of annual firm-level sales forecasts for 97 retailers

over 10 years, each year containing multiple forecasts of varying terms. We show that the

correlation coefficient of sales forecast error with the financial market return is significant, and

varies across firms depending on the retail segment, the gross margin, and the term of the

forecast. Our model provides results on other parameters for forecasting as well, and a method

for forecast updating. We show that forecast updates from our model provide new information

not contained in the forecast updates by equity analysts, so that a combined forecast leads

to improved forecast accuracy. These results have applications in forecast updating, decision

postponement, production planning, and risk management.

∗Johnson Graduate School of Management, Cornell University, 321 Sage Hall, Ithaca NY 14853-6201; email:

[email protected].†Leonard N. Stern School of Business, New York University, 44 West 4th St, Suite 8-160, New York NY 10012;

email: [email protected].‡Department of Information, Risk, and Operations Management, McCombs School of Business, University of

Texas at Austin, 3.440, 1 University Station, Austin, TX 78712. E-mail: [email protected].

Sales Forecasting with Financial Indicators and Experts’ Input

Abstract

The volume of retail sales is commonly understood to be correlated with the state of the

economy. This information can potentially be employed in demand forecasting, operations

decisions, and risk management. We propose a model in which the total sales of a retailer

is a function of sales forecasts generated by equity analysts, the term of the forecast, and

the return on an aggregate financial market index over the term of the forecast. We test

this model on a panel of 4,698 observations of annual firm-level sales forecasts for 97 retailers

over 10 years, each year containing multiple forecasts of varying terms. We show that the

correlation coefficient of sales forecast error with the financial market return is significant, and

varies across firms depending on the retail segment, the gross margin, and the term of the

forecast. Our model provides results on other parameters for forecasting as well, and a method

for forecast updating. We show that forecast updates from our model provide new information

not contained in the forecast updates by equity analysts, so that a combined forecast leads

to improved forecast accuracy. These results have applications in forecast updating, decision

postponement, production planning, and risk management.

1. Introduction

There is ample evidence in the business press that end-consumer demand is correlated with broad

financial market indicators. For example, in the period 2000-2002, when the NASDAQ composite

index lost more than 60% of its value and the S&P 500 index lost about 40%, retail sales were

documented to be lower than predicted by analysts (Business Week (2001)). A number of retailers,

such as J. C. Penney, Sears, ShopKo, and OfficeMax announced store closings. Bonds of Saks Inc.

and ShopKo Stores were downgraded from investment quality to junk (Kaufman (2001)). Several

retailers, including Kmart, filed for bankruptcy.

More recent examples relate to the 2007-08 crisis of the mortgage-backed securities market. In

September 2007, the CFO of Office Depot announced: “Our small-business customers have changed

their buying habits as a result of this environment, and traffic remains slower than normal in our

retail stores” (Reuters (2007)). Several retailers, such as furniture store Levitz, electronics seller

Sharper Image, and home furnishings retailer Linens ‘n Things have filed for bankruptcy. While

high-end retailers such as Saks Inc. have suffered unexpected declines in sales, Wal-Mart and select

other discount retailers have benefited through stronger-than-expected sales. On the other hand,

during 1998-99 as well as during 2004-05, US financial markets and retail sales were both rising

(BBC (1999)).

These facts suggest that retail sales are correlated with financial market returns. This corre-

lation would not be surprising because financial market returns affect income and wealth, which

further affect household consumption. For example, one mechanism for this relationship is the per-

manent income hypothesis of Friedman (1957). In the recent operations literature, many academic

papers have studied models of optimal inventory decisions and hedging demand risk predicated

on this correlation, see for example, Caldentey and Haugh (2006), Gaur and Seshadri (2005), and

Van Mieghem (2003). However, the extent of this correlation is not known. It is also not known how

the correlation coefficient varies across firms and why. In this paper, we develop and test a model

to examine statistical evidence for this phenomenon and examine its usefulness for forecasting,

operational decision-making, and risk management.

Our study is concerned with the firm-level, dollar-denominated annual sales realized by retailers.

Forecasts of annual sales of firms are generated by equity analysts up to three years in advance of

their fiscal year-end dates, and are updated frequently. We model the joint evolution of the forecast

1

and the financial market using a continuous time stochastic process. We represent the financial

market by one broad market indicator, such as the value weighted market index. Besides financial

market indicators, retail sales can also be correlated with non-financial metrics of the state of the

economy, such as the GDP growth rate. In this paper, we focus on financial market indicators

because they are traded. We further focus on contemporaneous returns rather than lagged returns.

These two features of our model are key because they enable us to apply it for not only forecasting

but also risk assessment and hedging. Our model can easily be generalized to allow a vector of

multiple market indicators, non-financial metrics, or lagged variables.

One output of this model is an analytical expression for the forecast error as a function of the

term of the forecast, financial market return over the term of the forecast, and various volatility

parameters. We use this expression to test various hypotheses about the correlation coefficient

between sales forecast error and financial market return over the term of the forecast. Our data

set is a panel of 4,698 observations across 97 US public retailers and fiscal years 1997-2007, each

year containing multiple forecasts made at different times for each firm. The time period for these

data is useful since it covers a wide range of variation: two periods of high market returns, 1998-

2000 and 2003-2005, and two periods of low or negative market returns, 2001-2002 and 2006-07.

Another output of our model is a method to revise sales forecasts using up-to-date information

on financial market prices. We evaluate the accuracy of these forecasts and compare them to the

forecast revisions issued by equity analysts at the same time epochs. Thus, we determine if analysts

incorporate publicly available historical financial return information in their forecast updates, and

if our model adds new information augmenting their forecasts.

Our results show that the correlation between the sales forecast error and market return is

significant. With an average value of 0.17 across our entire data set, it varies across retail segments

from a high of 0.83 for office supplies stores to not significant or negative for shoe stores and

auto parts and accessories stores. We classify retail segments into two categories, discretionary

purchase and everyday use, analogous to a ‘needs versus wants’ distinction in economics. We

find that retail segments in the discretionary purchase group, such as jewelry, home improvement,

consumer electronics and apparel, have higher correlation coefficient estimates on average than

retail segments in the everyday purchase group, such as discounters and wholesale clubs. We

further classify retailers in each segment into high- and low-gross margin groups, and find that the

2

average correlation coefficient for high-margin retailers is significantly higher than that for low-

margin retailers. Our results not only fit our expectations of retailers, but also show a wide range

of variation across firms. For example, in the jewelry stores segment, Tiffany’s, a high-margin

retailer, has an estimated correlation coefficient of 0.87, whereas Zale Corp., a lower margin retailer

has 0.29. In the discounters and wholesale clubs segment, BJ’s Wholesale Club, a warehouse

club selling durable discretionary purchase items has an estimated correlation coefficient of 0.91,

whereas Wal-mart Stores, a discounter has 0.01, and Dollar General has -0.81. We further test the

hypothesis whether the correlation coefficient increases with the term of the forecast. We find that

its average value across the data set almost doubles as the term of the forecast increases from 12-15

months to 20-24 months.

The main application of our model is to forecast updating. Firm-level sales forecasts are issued

by retailers for planning and by equity analysts for estimating earnings. In both cases, substantial

effort is required to generate and update forecasts. In contrast, financial data are readily available

and highly reliable. Thus, our model, together with financial data, can be used to update sales

forecasts in a cost-efficient way. We test the accuracy of the model for forecast updating by

comparing errors of the forecasts generated by the model vis-a-vis forecasts issued by equity analysts

at the same times. We find that, on average, analysts have lower forecast root mean squared error

than our model. This is not surprising because analysts have access to many types of information

for their forecast updates, whereas our model-based updates are based on only one variable. More

importantly, we find only a weak evidence that analysts take financial market returns into account

in their forecast updates. We conduct a regression of analyst forecast updates on financial returns

over the period of update as per our model, and find that the financial returns are weakly significant

even though they were public information at the times when analysts updated their forecasts. To

further examine the usefulness of our model, we construct a combined model based on analysts’

forecast updates as well as our model’s forecast updates, and find that both variables are statistically

significant in improving forecast accuracy. Thus, a combined forecast obtained from both sources

performs better than the equity analysts. Furthermore, as expected, the improvement is higher for

firms with high correlation coefficients and for longer term forecasts.

Our paper contributes to the operations management literature by investigating empirical sup-

port for recent studies in financial and operational hedging, decision postponement, and forecast

3

evolution. Gaur and Seshadri (2005) examine optimal inventory and financial hedging decisions for

a firm when its demand is correlated with contemporaneous returns on financial market indices.

They present evidence for the existence of such correlations using retail industry-level data for same

store sales growth rates. Caldentey and Haugh (2006) use a continuous time model to study the

optimal operating policy and trading strategy when their payoffs are correlated. Chod et al. (2007)

study the optimal combination of operational flexibility and financial hedging for a two-product

firm whose demand is correlated with financial market indicators, and show when the risk mitiga-

tion strategies are complements or substitutes. Van Mieghem (2008)(Chapter 9) discusses various

types of risk and their mitigation through operational and financial means. Chen et al. (2007) study

inventory and financial hedging decisions in a multi-period model in which demand is correlated

with the prices of traded assets. Besides these research papers, a correlation between operational

payoffs and financial market returns is also often assumed in the real options literature, see for

example, Dixit and Pindyck (1994). In comparison to this research, our paper presents a method

to estimate the correlation between operational payoffs and financial market returns, examines its

drivers, and shows when knowledge of this correlation is most advantageous.

The theoretical framework of our model is similar to the martingale model of the forecast

evolution (MMFE) developed independently by Graves et al. (1986), Graves et al. (1998) and

Heath and Jackson (1994). A continuous time version of MMFE was studied by Sapra and Jackson

(2005). Our model can be viewed as a particular case of the MMFE. The difference is that rather

than using past observations for forecast updates, we find explanatory variables which drive the

update.

This paper uses data for the retailing sector, but our model can also be applied to the manu-

facturing and wholesale trade sectors because changes in consumer demand will eventually affect

demand at the upstream layers of the supply chain. In this respect, our paper complements the

literature on stock returns of industry portfolios along the supply chain. For example, Hong et al.

(2007) study if stock returns of industry portfolios predict future stock market movements, and

Menzly and Ozbas (2006) study cross-industry momentum. While these studies seek to predict fu-

ture industry-specific or aggregate stock market returns using current industry portfolios, we seek

to predict sales using aggregate stock market returns. Hence, our hypotheses rely on consumption

being dependent on financial market returns, and can be extended to examine lead and lag effects

4

among industries.

The paper is organized as follows: we set up our hypotheses in §2; in §3, we formulate a model

of the joint evolution of the forecast and financial indicators and discuss an econometric setup

for its estimation; §4 and §5 contain the description of the data set and the estimation results,

respectively; in §6, we evaluate the forecast accuracy of our model compared to equity analysts; in

§7, we conclude the paper with a discussion of applications, limitations, and further research. The

online appendix to the paper gives detailed firm-by-firm estimates of the correlation coefficient and

results on the forecast accuracy of our model.

2. Hypotheses

Consider the forecast of total dollar denominated annual sales of a retailer and the actual sales

realization for a particular fiscal year. Suppose that the forecast is a consensus estimate given

by several experts or financial analysts incorporating all the information available to them on the

forecast date, and the sales realization is announced by the retailer after the fiscal year-end date. We

denote the difference between the sales realization and the forecast as the sales forecast error, and

the time interval from the forecast date to the fiscal year-end date as the term of the forecast. We

use the terms experts and analysts interchangeably throughout the paper since our mathematical

model is generic and could be applied to situations other than firm-level sales.

We want to capture the causal relationship between the state of economy and spending behavior

of consumers, and the resulting effect on retail sales. We use broad aggregate financial indices as

proxies for the state of the economy because they are correlated with increase in wealth and income

of consumers. For our model, they are preferred to other proxies, such as growth rate of GDP,

since they are traded on a market and offer opportunities for continuous updating of forecasts as

well as hedging against uncertainty. We refer to the return on a broad market index generically

as financial market return. Alternatively to a broad index, one might consider using a return on

a portfolio of retail stocks. Most likely, the return on such a portfolio will be correlated with the

sales forecast error of a retailer since the stock value would tend to go up (down) in response to

higher (lower) sales. However, limiting the choice to sector specific indicators limits their ability

to proxy for the state of the economy and customers’ propensity to consume. A broad indicator

is also preferred if it suffices to explain the forecast error for a wide range of retailers. We might

5

also consider using vector valued processes to capture the evolution of financial information. The

model presented in this paper can easily be modified to accommodate this change.

A large stream of economics literature studies the effect of income and wealth on household

consumption. In the late fifties, Friedman formulated his permanent income hypothesis according

to which consumption does not depend on short term changes in disposable income but rather on

long-term income expectations. This idea was refined by several researchers. For example, Ando

and Modigliani (1963) formulated the income lifecycle hypothesis in which consumption depends

on the future returns on capital. Subsequently, a number of papers attempted to link marginal

propensity to consume to the stock market wealth of households and measure the magnitude of

the effect for different types of households, see Poterba (2000) for a review. These papers find that

rising stock market leads to increase in consumption among equity holding households, and even

non-stockholding households may experience spending increase due to buoyed confidence in their

future income. Poterba (2000) also acknowledges that stock market decline can prompt a sharp

and asymmetric decrease in consumption, admitting limited empirical evidence of this effect. More

recent papers such as Dynan and Maki (2001) use quarterly survey data of households with equity

investments and study how consumption depends on current and lagged stock returns. Dynan and

Maki (2001) find that the consumption amounts to 3.5 cents per dollar of stock wealth and the

effect of lagged stock returns is insignificant after 18 months. Thus, this literature implies that total

consumption, and hence, demand, increases with returns on equity investments of households. We

proxy returns on equity investments of households between the forecast date and the fiscal year-end

date by the return on the financial market index over the term of the forecast. Thus, motivated by

this literature, we set up the following hypothesis:

Hypothesis 1 Sales forecast error of retail firms is positively correlated with the return on a fi-

nancial market index over the term of the forecast.

Another stream of economics literature deals with the problem of determining how a consump-

tion structure changes with income. The fact that the proportion of income spent on everyday

purchase goods, such as food, basic medications, etc., decreases with income and that spent on

discretionary purchase goods, such as luxury goods, increases with income was first observed by

Engel in the 19th century. A substantial body of literature has been developed since then. Modern

econometric techniques have been used to estimate Engel curves for different segments, see for

6

example, Banks et al. (1997). This literature suggests that demand for everyday purchase prod-

ucts is negatively correlated with income, whereas demand for discretionary purchase products is

positively correlated with income. Therefore, we classify retailers into discretionary purchase and

everyday purchase retail segments based on the types of products that they sell. We expect to see

different values of the correlation of sales forecast error with market return across firms depending

on their classification. For example, we expect supermarket stores and pharmacies to have a steady

demand that is relatively insensitive to the financial market. On the other hand, we expect the

demand for luxury goods to be higher in periods of booming economy when consumer have more

disposable income. Our second hypothesis is based on this reasoning:

Hypothesis 2 (a) Retailers in discretionary purchase retail segments have positive correlation

between sales forecast error and financial market return over the term of the forecast. (b) Retailers

in everyday purchase retail segments have negative or insignificant correlation between sales forecast

error and financial market return over the term of the forecast.

Further motivated by the above reasoning, we expect that, within each retail segment, retailers

with higher gross margin would have higher correlation with financial market return because such

retailers would be selling high-ticket items targeting the upper and premium market segments. The

demand for products in these segments would have a greater discretionary or luxury component.

Such demand could be high in periods of high economic growth. In periods of low economic growth,

demand could be depressed due to consumers putting off their purchases or substituting to lower

priced products carried by the same or other firms. Prices could also be depressed due to greater

markdowns of merchandize. Both these reasons would lead to a reduction in dollar denominated

sales. On the contrary, retailers with lower gross margin would benefit due to increase in demand

from consumers who would otherwise have bought products from higher-priced retailers. In this

reasoning, we use gross margin as a proxy for price. This assumption is justifiable in retailing

because this industry is very competitive. Competition implies that cost changes would be passed

on to consumers because abnormally high or low margins cannot be sustained for too long. Thus,

we test the following hypothesis:

Hypothesis 3 In each retail segment, retailers with higher gross margin have a higher correlation

between sales forecast error and financial market return over the term of the forecast.

7

The term of the forecast is a key variable in our model since uncertainty around the forecast

would increase with the term of the forecast. Hence, we look at how the term of the forecast affects

the correlation of sales forecast error with the return on the financial indicator. We expect that

when the forecast horizon is longer, the sales forecast error is more influenced by changes in the

economy. As the forecast horizon gets shorter, the economic uncertainty gradually gets resolved

but the idiosyncratic uncertainty associate with the firm doesn’t get resolved. Therefore, we expect

that the correlation coefficient should be high for long term forecasts and should decrease as the

forecast horizon gets shorter. Thus, we have:

Hypothesis 4 The correlation of sales forecast error with the financial market return increases

with the term of the forecast.

Hypotheses 2-4 focus on three potential predictors of the correlation coefficient between sales

forecast error and financial market return. The predictors are the type of product, the gross margin,

and the term of the forecast. Thus, the results of these hypotheses will be useful in determining

situations in which the correlation coefficient is high. In such situations, the correlation with the

financial market return will have greater impact on forecasting and decision-making in a firm.

The next section sets up the model to estimate the correlation of sales forecast error with

financial market return and thereby test the above hypotheses.

3. Estimation Methodology

3.1 Model

We develop a model of joint evolution of the financial market and the future sales for each company

for each year. Through this analysis, we construct two equations, one for estimating the parameters

of our model and testing the hypotheses, and another for updating forecasts using financial market

information. We omit the firm and year indices for convenience. We shall introduce these indices

later when we specify the estimation procedure and data. Suppose we are in time t = 0, we are

interested in the sales for the next fiscal year, that will start at time T1 and end at time T . Such

sales forecasts are issued by the experts ahead of the selling season. Let Dt, 0 ≤ t ≤ T , be the time

t value of information variable representing the sales realized over the next financial year. At time

8

Figure 1: Forecasting and demand realization timeline

T , DT equals to the actual realized sales D (see Figure 1). We assume that lnDt is a martingale,

that is,

E[ln(Dt)|Fs] = ln(Ds), 0 ≤ s ≤ t ≤ T,

where Ft represents the time t information. We assume that the martingale process is square

integrable. By the martingale representation theorem (Karatzas and Shreve 1991, Thm. 4.15 p.

182),

lnDt = ln D0 +∫ t

0σD(u)dBD(u),

which implies that

d(lnDt) = σD(t)dBD. (1)

Solving (1) gives us

Dt = D0 exp(σDBDt). (2)

The experts forecast the value of DT , which is a random variable. At time t, their forecast is

given by

Ft = E[DT |Ft],

where the length of the period T − t is the term of the forecast. Under this definition, Ft is also a

martingale. Indeed, when t > s, Ft ⊃ Fs and

E[Ft|Fs] = E [E[DT |Ft]|Fs] = E[DT |Fs] = Fs.

The martingale evolution of the forecast is widely assumed in the literature (e.g., Graves et al.

(1986), Heath and Jackson (1994)) and directly corresponds to the unbiasedness of the forecast.

9

Let the stochastic process {Mt} represent the evolution of the financial indicator. We assume

that Mt ∈ Ft. The evolution of Mt is given by

dMt

Mt= µMdt + σMdBM . (3)

The financial indicator can be the price of shares of a company or the value of a portfolio or a

composite market index. We discuss choices of the indicator in §3.2. For simplicity we consider an

one dimensional process {Mt}, but our derivation can be extended to allow {Mt} to be a vector

process.

We assume that {Dt} and {Mt} are correlated processes, that is,

dBDdBM = ρdt.

The coefficients ρ, µD and σD may vary across firms. The coefficients µM and σM relate to the

financial market and are common for all firms. We assume that the values of all these coefficients are

time-invariant. This assumption is made in order to derive a regression equation that is separable

in the term of the forecast. We test this assumption in Hypothesis 4 by estimating our model

independently for different time buckets for the term of the forecast.

Equation (3) has the following solution at time T :

MT = M0 exp(

µMT − 12σ2

MT + σMBM T

). (4)

Given that the financial index at time T has the value MT , we can find the conditional expec-

tation for the future sales. From (2), lnDT = ln D0 + σDBDT. We can derive the value of lnD0

from the experts’ forecast at time zero, F0 = D0 exp(σ2DT/2). Therefore,

E[lnDT |F0,MT ] = lnF0 −12σ2

DT + σDE[BDT|F0,MT ]. (5)

Also, from (4),

BM T =1

σM

(ln

MT

M0− (µM − 1

2σ2

M )T)

M= a.

From the joint normality of (BDT , BM T ),

(BDT , BM T ) ∼ N

µ1

µ2

,

σ11 σ12

σ21 σ22

= N

0

0

, T

1 ρ

ρ 1

.

10

Thus, the conditional distribution of BDT given BM T can be written as (BDT |BM T = a) ∼ N(µ̄, Σ̄),

where

µ̄ = µ1 + σ12σ−122 (a− µ2) = ρa, Σ̄ = σ11 − σ12σ

−122 σ21 = T − Tρ2.

Redefining MTM0

= 1 + r0T , we have the following expression for the conditional expectation in (5):

E[BDT|F0,MT ] =

ρ

σM

(ln(1 + r0T )−

(µM − 1

2σ2

M

)T

).

Finally,

E[lnDT |F0, ST ] = lnF0 +ρσD

σMln(1 + r0T ) + T

ρσD

σM

(−µM +

σ2M

2− σDσM

2ρ

). (6)

Equation (6) is the conditional expectation of the logarithm of sales. The second term in (6)

reflects how the return on the financial index affects the conditional expectation. The last term

reflects the dependency on the term of the forecast T and the volatility correction. The common

effect of the financial market on both these terms is proportional to ρσD/σM . If ρ = 0 the effect of

the financial market vanishes.

The conditional variance of the logarithm of sales is given by:

Var[lnDT |F0,MT ] = σ2DVar[BDT |F0,MT ] = σ2

DT (1− ρ2). (7)

Market information is valuable unless ρ = 0. If |ρ| = 1, we get a perfect forecast of sales once we

know the value of MT . For other values of ρ, the forecast error is partially explained by the return

on the financial indicator. We shall use equations (6) and (7) to estimate the parameters of our

model and test Hypotheses 1-4 regarding the value of ρ.

In reality, we would not know the financial market information all the way up to time T .

Suppose that we know the experts’ forecast F0 and the value of the financial index Mt at some

moment t > 0. Let 1 + r0t = Mt/M0. Then,

E[lnDT |F0,Mt] = lnF0 −12σ2

DT + σDE[BDT |F0, BM t].

Since BD is a martingale, E[BDT |F0, BM t] = E[BDt|F0, BM t] = ρσS

(ln(1 + r0t)− (µM − 1

2σ2M )t

).

Therefore,

E[lnDT |F0,Mt] = lnF0 +ρσD

σM

(ln(1 + r0t)−

(µM − 1

2σ2

M

)t

)− 1

2σ2

DT, (8)

11

and

Var[lnDT |F0,Mt] = σ2D(T − tρ2). (9)

This equation will be useful to update forecasts as we discuss in §6. It will also be useful for

evaluating the benefits of our model and testing whether forecast updates by experts incorporate

observed financial market returns. Note that the effect of volatility in (8) is proportional to the

term of the original forecast T since only partial information has been revealed through the price

of the financial indicator.

3.2 Estimation

We use two ways to estimate (6)-(7). First, we estimate model parameters directly from the ex-

pressions for the conditional mean (6) and variance (7) using the maximum likelihood estimation

(MLE) method. Then, we estimate (6)-(7) using a random coefficients regression model incorpo-

rating parameter heterogeneity and a general covariance structure for errors.

Each observation in our data set is a combination of a forecast by experts, corresponding actual

sales realization, term of the forecast, and corresponding financial market return. Let Fijy denote

the j-th sales forecast by experts for firm i for year y, Diy denote the actual sales realization for firm

i for year y, Tijy denote the term of the forecast corresponding to Fijy, and rijy denote the financial

market return over the time period represented by Tijy. Note that there are several forecasts each

year and Diy is constant for all forecasts j for a given firm-year. In this notation, we write (6)-(7)

as

lnDiy = a + b1 lnFijy +ρσD

σMln(1 + rijy) + Tijy(−

ρσD

σMµM +

12ρσDσM − 1

2σ2

D) + εijy,

where εijy ∼ N(0, σ2

DTijy(1− ρ2)), and intercept a and coefficient b1 are added as controls.

Let zijy(a, b1, ρ, σD) = lnDiy−a−b1 lnFijy− ρσDσM

ln(1+rijy)−Tijy(−ρσDσM

µM + 12ρσDσM− 1

2σ2D).

The likelihood of the observation is

f(εijy|Diy, Fijy, rijy, Tijy) =1√

2πσ2DTijy(1− ρ2)

exp(−zijy(a, b1, ρ, σD)2

2σ2DTijy(1− ρ2)

).

Suppose for now that the observations are independent. We maximize the log-likelihood function

12

with respect to model parameters ρ and σD and controls a and b1 subject to the constraint σD ≥ 0:

lnL(ρ, σD, a, b1|D,F, r,T) =∑i,j,y

ln f(εijy|Dijy, Fijy, rijy, Tijy)

= −12

∑i,j,y

[ln(σ2

DTijy(1− ρ2)) + ln 2π +zijy(a, b1, ρ, σD)2

σ2DTijy(1− ρ2)

]. (10)

We apply the MLE method to estimate firm-specific values of the parameters a, b1, ρ and σD. We

also apply it to various groups of firms to obtain pooled estimators of the parameters.

Our data set contains many experts’ forecasts of sales in each year for each firm, so that the

estimation errors across these forecast would be correlated with each other. We use a subsampling

and bootstrap method to solve the problem of such correlations affecting our parameter estimates.

Specifically, we draw one observation per firm per year without replacement, then estimate the

model and repeat the procedure. The asymptotic variance of the estimators is the variance of

the sample of estimates. Since the number of forecasts for a firm may vary across years, some

observations may be drawn more often then others. To address this issue, we conduct a weighted

MLE by assigning a weight to each observation proportional to the size of the firm-year subsample

from which the observation is drawn.

We use MLE as the primary method because it is asymptotically efficient, allows for direct

estimation of the parameters of the model, allows subsampling and bootstrap, and is fully parame-

terized by the conditional mean and variance. Alternatively to MLE, we can set up a regression

model from equation (6) as follows:

lnDiy = a + b1 lnFijy + b2 ln(1 + rijy) + b3Tijy + εijy. (11)

The advantage of this approach compared to MLE is that it allows us to estimate general covariance

structures for the error term. Its disadvantage is that it does not recognize the common parameters

in the coefficients b1 and b2 and in the variance of the error term.

Coefficients b2 and b3 can vary among firms due to differences in ρ and σD. Thus, we adopt the

random coefficients (RC) method for estimation of (11) by letting b2 = b̄2+δi and b3 = b̄3+γi, where

δi ∼ N(0, σ22), γi ∼ N(0, σ2

3). By this assumption, coefficients b2 and b3 are normally distributed

with means b̄2, b̄3 and standard deviations σ2, σ3. We allow for the random disturbances δi and γi to

be correlated, since coefficients for the same firm have common factors. The correlation structure

is estimated from the data.

13

Model (11) provides a parsimonious way to capture the heterogeneity in slope parameters when

the estimation is done on pooled samples (Greene (2003), Chapter 13). Model (11) can also be

augmented by including extra variables such as the forecast update from one forecast to the next

and lagged returns on the financial index. Thus, we use this model to compare our forecasting

method with forecast updates provided by equity analysts. We also use it to cross-check the MLE

estimates with the results of model (11), where the coefficients ρ and σD are derived from the

estimates of b̄2 and b̄3 and best linear unbiased predictors of δi and γi.

4. Data Description

We use data for US retailers listed on NYSE/NASDAQ/AMEX stock exchanges. We focus on the

retailing sector because retailing is immediate to the consumer demand so that the effect of the state

of economy on the consumer demand will be first observed through retail sales. The US Department

of Commerce assigns NAICS codes 44 and 45 to retailers. We obtained data for all firms with the

NAICS codes shown in Table 1. We excluded the NAICS codes for gas stations, used merchandise

stores, art dealers, manufactured (mobile) home dealers, tobacco stores, nonstore retailers, vending

machine operators, direct selling establishments, and fuel dealers. We included data for firms in the

electronic shopping segment (NAICS = 45411), mail order houses (NAICS = 454113), and florists

(NAICS = 45311) in the firm-wise estimation of parameters, but excluded them from segment-wise

and overall pooled estimation; the first two segments were excluded because they contain dissimilar

firms selling a wide spectrum of products, the third segment was excluded since florists are primarily

service firms that collect orders and network with local stores for execution.

We obtained sales forecast data from the Institutional Brokers Estimate System (I/B/E/S)

summary history file, actual sales revenue and other financial information from Compustat, and

prices of financial indicators from CRSP. I/B/E/S contains analysts’ forecasts of firm-level sales

for fiscal years 1997 onwards. Thus, this is the start date of our data set. Our data span the

fiscal years 1997-2007. For each fiscal year, there are multiple sales forecasts of varying terms made

by analysts. We consider all forecasts made 24 to 12 months before the fiscal year-end date. All

databases were accessed through Wharton Research Data Services (WRDS).

For the firms and time-periods defined above, we obtained data on three variables from the

I/B/E/S summary history file, MEANEST, the mean forecast of sales made by equity analysts,

14

STATPERS, the corresponding forecast date, and NUMEST, the number of analysts included in

the mean. From the Compustat annual files, we obtain yearly sales revenue, cost of goods sold, and

inventory data. Using CRSP, we obtain prices of three composite indices: S&P 500 Index, Equal

Weighted Market Index, and Value Weighted Market Index. The returns on the three indices

are correlated, with the highest correlation of 0.981 between the S&P 500 and the Value Weighted

Market Index. Estimation results are consistent for each choice of the index. The results we present

in the paper are based on the Value Weighted Market Index. This index is a composite of the value

weighted prices and dividends of all stocks traded at the NYSE, the AMEX, and the NASDAQ.

To estimate µM and σM , we use the mean and standard deviation of historical daily returns from

the period 1996-2007, which yields annualized estimates of µM = 0.078 and σM = 0.193 (see Hull

(1993), Chapter 10).

Our initial data set retrieved from I/B/E/S contained 13,798 observations for 265 different firms,

as identified by their unique CUSIP codes. We applied several filters to clean these data. First,

firms with more than 40% deviation of actual sales from the first forecasted value were excluded

as they represented potential mergers or acquisitions. Second, we excluded companies with fewer

than three years of data in Compustat since we use subsampling. There were 209 firms left after

these two filters. We then excluded firms that were covered by fewer than three analysts, firms with

inventory less than 2% of the sales revenue, firms with sales revenue less than 5M$, and overseas

firms traded on US stock exchanges via American Depositary Shares (ADS).

After applying all the above filters, our final data set contains 97 firms. The total number of

observations is 4,698, and since there are multiple forecasts per firm-year, the unique number of

firm-year pairs is 570. The average number of forecasts per firm-year is 8.24. Observations in the

data set represent 36.4% of the total annual U.S. retail sales excluding motor vehicles and parts

dealers in the year 2005 according to the U.S. Census Bureau’s Annual Retail Trade Survey. Table

1 shows descriptive statistics of our data set.

5. Results

Hypotheses tests. We first discuss tests of Hypotheses 1-4 using the estimates of ρ from pooled,

segment-wise and firm-wise MLE. Table 2 presents the results for testing Hypotheses 1-2. The first

four columns present various pooled and segment-wise estimates of the parameters of (10). The

15

final three columns present summary statistics of firm-wise MLE estimates for ρ. Detailed MLE

results for all firms are included in the online appendix.

Hypothesis 1 is based on the reasoning that total consumption is positively correlated with the

contemporaneous financial market return. Since the hypothesis is based on total consumption, we

test it using a pooled estimate of the correlation coefficient ρ for all firms in our data set. The

pooled estimates can be interpreted as the means of ρ and σD across the various firms in our

data set. Since we use subsampling and bootstrap in MLE, these are weighted means, with the

weight of each firm-year being equal to the number of forecasts available for that firm-year. The

estimation results, shown in the first row of Table 2, are that ρ̂ = 0.170 (standard error = 0.020)

and σ̂D = 0.075 (standard error = 0.001). Both estimates are statistically significant at p=0.01.

Thus, Hypothesis 1 cannot be rejected based on this evidence. This inference is also consistent

with the median of the firm-wise estimates of ρ. The median is 0.1960, the minimum is -0.9934

(Great Atlantic and Pacific Tea Co.) and the maximum is 0.9550 (Officemax Inc.). The range of

variation between the minimum and the maximum is striking. We characterize this variation by

showing how it is explained by segment classification, gross margins, and presence of outliers.

To test Hypothesis 2, we classify the thirteen retail segments into two groups, discretionary

purchase and everyday purchase. The discretionary purchase group is composed of the six segments,

home furnishings stores, consumer electronics stores, home improvement stores, apparel stores,

jewelry stores, and department stores; and the everyday purchase group consists of the seven

remaining segments. We base this classification on the median gross margin of each segment shown

in Table 1 and on our subjective assessment of the types of products sold by retailers in each

segment. For example, jewelry, home furnishings, and home improvement stores can be expected

to have a higher component of sales from discretionary purchase items, and discount stores to have

a lower component of sales from discretionary purchase items.

Table 2 shows the estimation results for Hypothesis 2. First, note that the pooled estimates of

ρ for the discretionary purchase and everyday purchase groups are 0.1938 and 0.1153, respectively.

Using a paired t-test, the difference between them is statistically significant at p = .05, supporting

Hypothesis 2. Thus, the discretionary purchase retailers have a higher value of ρ than the everyday

purchase retailers.

Now consider the segment-wise estimates. The highest estimate of ρ is 0.8264 for the office

16

supplies stores. While we classified this segment in the everyday purchase group, the outcome is

intuitive since the customers of office supplies stores tend to be small businesses, whose needs will

be closely linked to the financial market return. The large magnitude of this estimate shows the

usefulness of incorporating financial market returns in sales forecasting for this segment. The second

highest value of ρ is 0.6075 for jewelry retailers. The other segments with high estimates of ρ are

home improvement, supermarkets, pharmacies, and apparel. All these estimates are statistically

significant at p = .01.

Home furnishings, consumer electronics, and department stores were classified in the discre-

tionary purchase group, but have estimates of ρ insignificant or lower than everyday purchase

segments. This is due to the presence of outliers as we discuss in the next paragraph. The remain-

ing segments with low estimates of ρ are everyday purchase segments: auto parts and accessories,

shoe stores, sports and hobby stores, and discounters and wholesale clubs.

The firm-wise estimates of ρ are useful since they show segments for which firm-wise estimates

are consistent with segment-wise estimates, and those in which we have outliers. In the office

supplies segment, there are three firms with ρ̂ = 0.85, 0.95, and 0.96, and median = 0.95. Thus,

the segment-wise estimate is consistent with the firm-wise estimates. In jewelry retailers, there are

four firms, with estimates of ρ as 0.87, 0.29,−0.69, and −0.93. The last two firms were assigned to

the jewelry segment by us due to the nature of their products. They are internet-based retailers

of customized luxury gifts. Thus, the firm-wise median of ρ differs from the segment-wise estimate

for jewelry retailers due to the presence of outliers.1 In home furnishings, one firm is an outlier,

Pier 1 Imports Inc., with ρ̂ = −0.68, while all other firms have positive estimates of ρ. Likewise,

in consumer electronics, Tweeter Home Entertainment Group is an outlier with ρ̂ = −0.64, while

all other firms have ρ̂ > 0.41. In discounters and wholesale clubs, we see that the segment can be

divided into two parts. Wholesale clubs such as BJ’s and Costco have large positive estimates of ρ,

whereas discounters such as Wal-Mart and Target have ρ close to zero. One can argue that these1This difference occurs because the firm-wise median treats each firm as having equal weight, whereas the segment-

wise pooled estimate gives higher weight to firms with more annual observations. For jewelry segment, our data set

has many more observations for Tiffany & Co. (ρ̂ = 0.87) and Zale Corp. (ρ̂ = 0.29) than the remaining two,

internet-based, retailers. Our methodology can be easily replicated to compute pooled estimates that assign equal

weights to all firms. Which weighting is more accurate depends on the outlier firms in a sample. Alternatively, one

may use firm-wise estimates shown in the online appendix.

17

retailers sell different kinds of products, with the wholesale clubs selling much more of durable

goods, which are discretionary purchase items, and discounters selling a larger fraction of everyday

purchase items, whose demand would be less correlated with the financial market return.

We find that a possible explanation for outlier firms is sales coming below analysts’ forecast

consistently year after year. For example, Tweeter Home Entertainment Group, experienced un-

expected sales declines over several years which resulted in March 2007 closing of 49 stores. Pier 1

Imports has similarly shown sales below expectations from 2005 through 2007 due to competitive

pressure, unfavorable exchange rate movements, and change in customer tastes. Therefore, when a

retailer has unexpected sales declines, we may see negative sales forecast errors, and thus, negative

values of ρ even when the economy is doing well. A possible use of our model is to detect such

outliers, since the model can be used to construct confidence intervals for parameters as well as

forecasts.

To test Hypothesis 3, we further classify firms in each segment into high and low gross margin

subgroups. Since we have panel data, we classify firms in two ways, static and dynamic. In the

static classification, we compute the median gross margin of each firm across its annual observations

and the median gross margin of each segment across all the firm-year observations in that segment.

We classify firms into high- and low- gross margin groups by comparing their medians with their

segment medians. For example, if the median gross margin for Best Buy Co. falls below the median

for the consumer electronics segment, then Best Buy would be classified in the low gross margin

group for each observation. In the dynamic classification, we classify firm-years into high- and low-

gross margin groups by comparing the gross margin of each firm in each year with the segment

median for that year. We classify observations for a firm in year y by comparing its gross margin

in year y − 1 with the median gross margin of its segment for year y − 1. For example, in this

method, Best Buy could be classified as a high margin retailer in one year and a low margin retailer

in another year. The static classification has the advantage of being less susceptible to noise in

the estimation of gross margin, whereas the dynamic classification has the advantage of capturing

relative changes in gross margin over time. Results for the static classification are shown in Table

3. Results for the dynamic classification are similar and are shown in the online appendix.

In Table 3, first consider the pooled estimates of ρ for high- and low- gross margin firms

across all segments as shown in the first row. The estimates are 0.192 and 0.133, respectively,

18

and the difference between them supports Hypothesis 3 at p=0.01 according to a paired t-test.

The estimates for different segments show that 8/13 segments have estimates of ρ consistent with

Hypothesis 3, 4/13 are significant in the reverse order, and 1/13, office supplies stores, has no

statistical difference between high and low margin groups. Thus, the result of the hypothesis test

is statistically significant at the aggregate level, but varies across segments. This variation may

possibly be due to the presence of outliers or small sample sizes in some segments. There may also

be classification errors for firms whose gross margin values are close to the median. In the case of

office supplies stores, both high- and low-gross margin groups have large estimates of ρ, 0.95 and

0.91, respectively.

Hypothesis 4 states that the magnitude of the correlation coefficient can depend on the term

of the forecast. To test this hypothesis, we split the observations into three time-buckets according

to the term of the forecast, namely, short (12-15 months), medium (16-19 months) or long (20-24

months) term forecasts. The time buckets have different number of observations because there

are fewer long-term forecasts than short-term forecasts available in the data set. However, the

industry composition within each bucket is approximately the same, allowing us to compare the

results fairly.

Table 4 presents the estimation results. As the term of the forecast increases, estimates of

the correlation coefficient ρ increase from 0.10 for short term to 0.30 for long term forecasts.

Monotonically increasing correlation coefficient ρ suggests that as the term of the forecast increases,

a larger percent of forecast error can be predicted by financial market returns. This result supports

Hypothesis 4. It implies that forecast updating based on our model is more suitable for longer term

forecasts.

Volatility estimates. Our model predicts not only the mean forecast, but also the volatility of

the forecast. This feature is useful for building confidence intervals and planning against uncertainty.

Table 2 reports the pooled and segment-wise estimates of σD. The pooled estimate is 0.075,

significant at p = .01. Retailers in the discretionary purchase group have higher volatility of sales

(σ̂D = 0.085) compared to those in the everyday purchase group (σ̂D = 0.057). From (6), the effect

of financial market return on mean sales is proportional to the product ρσD, thus, the higher the

volatility of sales the higher is the exposure of sales to the financial market return. Among retail

segments, apparel stores have the highest estimate of sales volatility (σ̂D = 0.097) followed by office

19

supplies (σ̂D = 0.092) and consumer electronics (σ̂D = 0.086). Pharmacies have the lowest estimate

of sales volatility (σ̂D = 0.035), followed by home improvement stores (σ̂D = 0.041) and discounters

and wholesale clubs (σ̂D = 0.047). Table 3 reports the variation in σ̂D with gross margin. The

pooled estimates of σD are 0.084 and 0.063 for high and low margin firms, respectively. These

estimates as well as segment-wise estimates for 10/13 segments show that high gross margin firms

have higher volatility than low margin firms. Table 4 reports the variation in σ̂D with the term

of the forecast. The estimates remain around 0.075 for all three time buckets. Constant volatility

across time buckets implies that the variance of the forecast error is linear in time, which supports

the assumption used in deriving our model. In summary, the instantaneous volatility of the forecast

is more for discretionary purchase than everyday purchase retailers. It is also more for high gross

margin retailers than low gross margin retailers. However, it is the same for forecasts of different

terms, ceteris paribus.

Comparison of MLE and RC models. Finally, we present a comparison of estimates obtained

by MLE and RC models. The RC model, (11), gives best linear unbiased predictions (BLUPs) of

the coefficients on financial market return (b2) and the term of the forecast (b3) for each firm. We

also compute these coefficients from firm-by-firm MLE estimates for ρ and σD using (6). Figure 2

plots the estimates obtained by these two models against each other. We observe that the results

for b2 are consistent across the two models. It is, however, difficult to draw a conclusion for the

estimates of b3. Computed by the RC model, they exhibit higher variation than the MLE estimates.

This is not surprising since the confidence interval for b3 is wider in relative terms than that for b2,

suggesting a larger variation in the results for b3.

We also used the RC model to test Hypotheses 1-4 using the estimate for the coefficient of

ln(1 + rijy) in (11). All these results were consistent with those obtained using MLE. In a previous

version of this paper, we tested the hypotheses on a larger data set containing 214 firms, since we

did not apply all the data filters reported in §4. The results on this larger set were also consistent

with those reported in this paper, and the estimates of ρ and σD were similar.

20

6. Comparison with Analysts’ Forecasts

Equations (8)-(9) can be used to update a previous sales forecast and its variance as the price

information for the financial indicator is revealed with the passage of time. Likewise, equity analysts

also update their forecasts over time. Figure 3 presents an example of these forecast updates for

an apparel retailer, Limited Brands, for fiscal year 2006. It shows the initial analysts’ forecast,

made 23 months before the fiscal year-end date, the periodic updates to this forecast, the forecasts

computed by our model using as input the initial analysts’ forecast and the financial market return,

and the actual sales realization. Notice that, in this example, our model forecast progressively grew

more accurate with time, whereas the experts’ forecast diverged.

In this section, we compare our forecast updates with those of equity analysts to address three

questions: Do equity analysts incorporate financial market returns in their forecast updates? Does

our model provide new information that augments the forecast updates by equity analysts? Do

the updates by equity analysts contain new information not contained in our model? We answer

these questions in the affirmative, thus showing that both financial market returns and updates by

equity analysts contain unique information useful for sales forecasting. Finally, we show that using

our model to augment the forecasts by equity analysts improves forecast accuracy.

The regression models used in this section are modifications of equations (8) and (11). All

models are estimated using ordinary least squares as well as the RC method summarized in §4.2.

We use all the observations in our data set for estimation except the first observation in each firm-

year, since the first forecast by equity analysts is used as input in our model. In other words, the

regression models in this section deal with forecast updates represented by observations (i, j, y) for

all j 6= 1.

To address the first question, we use (8) to set up a new regression model as follows:

lnFijy − lnFi1y = a + b1 ln(1 + r̂ijy) + b2tijy + b3Ti1y + εijy, j 6= 1. (12)

Here, Fijy and Fi1y are the j-th and first forecasts for firm-year (i, y), tijy is the time interval

between Fijy and Fi1y, r̂ijy is the return on the financial indicator for this time interval, εijy is the

error term, and a, b1, . . . , b3 are added as controls. Thus, the dependent variable in this model is

the consensus forecast update made by equity analysts, and the explanatory terms are the financial

market return and term of forecast variables from our model. Whereas in (8), we specified forecast

21

updating at the intermediate time instant t according to our model, in (12), we determine if the

forecast updates by equity analysis at the same time instant incorporate historical financial market

returns. Therefore, we are interested in testing the hypothesis whether b1 in (12) is positive.

For the second question, we regress actual sales on the forecast provided by equity analysts and

the update provided by our model. The regression model for this test is an extension of (11) as

follows:

lnDiy = a + b1 lnFijy + b2 ln(1 + rijy) + b3Tijy + b4 ln(1 + r̂ijy) + b5tijy + εijy, j 6= 1. (13)

Here, the dependent variable is the sales realization for firm i in year y, and the explanatory

variables are the experts’ updated forecast, Fijy, the financial market return over the term of

the forecast, rijy, lagged financial market return over the time period from the first forecast to

the updated forecast, r̂ijy, the time elapsed since the first forecast, tijy, and the term of the first

forecast, Ti1y. Note the similarity between this equation and (11); this equation augments (11) with

lagged financial market return and time elapsed since the first forecast because these two variables

are used by our model to update forecasts. The equity analysts know the lagged financial return,

r̂ijy, at the time of making their updated forecast, Fijy. Hence, by putting both variables together

in the regression equation, we test if our model provides information that augments the forecast

updates by equity analysts, i.e., if the coefficient b4 is positive.

To answer the third question, we modify (11) differently from above.

lnDiy = a+b1 lnFmijy+b2 ln(1+rijy)+b3Tijy+b4(lnFijy−lnFi1y)+b5tijy+εijy, j 6= 1. (14)

Here, Fmijy is our model forecast obtained from (8) using as inputs forecast Fi1y, the return on

the market index r̂ijy over the time-period from the first to the updated forecast, and firm-wise

estimates of parameters ρ and σD. We are interested in testing whether coefficient b4 is positive,

i.e., whether the forecast update provided by equity analysts has value beyond the forecast provided

by our model.

Table 5 shows the results of the above three models. From columns (1-2) corresponding to

(12), we find that the estimate for the coefficient of ln(1 + r̂ijy), b1, is 0.0321 using OLS estimation

and 0.0181 using the RC method. While both estimates are positive and the OLS estimate is

statistically significant at p=.01, the RC estimate is not. Thus, we obtain only a weak evidence

that equity analysts incorporate financial returns in their forecast updates. From columns (3-4)

22

corresponding to (13), we find that the coefficient of ln(1 + r̂ijy), b4, is statistically significant at

p=0.01 under both OLS and RC methods. Its estimates are 0.0872 and 0.0891, respectively. This

shows that our model provides information not incorporated by equity analysts in their updated

forecasts. The remaining coefficients in this model are as expected: the experts’ updated forecast,

lnFijy, is significant at p=0.01 with a value close to 1, and the financial market return over the

term of the forecast, ln(1+ rijy), is also significant at p=0.01. Finally, columns (5-6) corresponding

to (14) show that the model forecast lnFmijy, the experts’ forecast update, lnFijy − lnFi1y, and the

financial market return over the term of the forecast, ln(1 + rijy), are all positive and significant at

p=0.01.

In summary, we see that the financial market return and the forecast updates by equity analysts

both contain unique information useful for forecasting. The result about financial market returns

particularly is surprising since they are public information at the time when equity analysts update

their forecasts. Among other things, this result implies that these two information sources can be

used together to construct a combined forecast that should be more accurate than either of them.

We can compute the combined forecast by using (15) and replacing the initial experts’ forecast

with the updated experts’ forecast. We denote the combined forecast as F cijy and define it as

lnF cijy ≡ lnFijy +

ρσD

σM

(ln(1 + r̂ijy)−

(µM − 1

2σ2

M

)tijy

)− 1

2σ2

DTi1y, j 6= 1.

Note that the experts’ updated forecast Fijy replaces the initial forecast Fi1y in this equation. The

rest of the parameters are as in (8). Similarly to (14), we use the firm-wise MLE estimates of

parameters to compute the forecast.

Table 6 compares the forecast accuracy of the experts’ initial forecast Fi1y, the experts’ updated

forecast Fijy, the model’s updated forecast Fmijy, and the combined forecast F c

ijy for each segment.

As noted before, we compare predictions for each observation in the data set except the first one for

each firm-year since the first one is used an input in our model. For each forecast series, we use root

mean squared error (RMSE) as the performance measure. To account for heteroscedasticity, we

first compute percentage forecast errors for all observations, then segment-wise or firm-wise RMSE.

Table 6 shows the segment-wise RMSE. From the column (M/E), we observe that the RMSE values

are larger on average for our model than for the updated forecasts by equity analysts. Thus, the

equity analysts are on average more accurate than our model. This is to be expected since our

model uses information on only one financial indicator whereas equity analysts have access to a

23

much larger information set.

We also observe from the column (M/E0) that the RMSE of model forecast errors are on

average smaller than RMSE of errors of the initial analysts’ forecasts. This is consistent with

previous regressions, and shows that the updates computed by our model are useful in improving

forecast accuracy. Likewise, column (C/E0) shows that the combined forecast is on average better

than the experts’ initial forecast for each segment. The magnitude of improvement here is larger

than seen in (M/E0) since the combined forecast uses updated inputs from both the equity analysts

and the model. Finally, column (C/E) shows that the ratio of RMSE of the combined forecasts

to the RMSE of updated analysts’ forecasts is 0.9969 across all observations. Thus, the combined

forecast is more accurate than the experts’ updated forecast on average. We also observe from the

segment-wise values that the combined forecasts are more accurate on average than the analysts’

updated forecasts for 10/13 segments.

We investigate the reasons for performance differences between the forecasting methods, and

find that there are two possible reasons: the magnitude of ρ, and whether experts incorporate

historical financial returns in their forecasts. First, in comparing the ratio (M/E) across firms and

segments, we see that the model forecasts are closer to the experts’ forecasts for segments with

large magnitudes of ρ, such as electronics stores, jewelry stores, department stores, discounters and

warehouse clubs, and office supplies stores, with the exception of some outlier firms. Second, note

that the (C/E) ratio was greater than one for three segments, jewelry stores, sports and hobby

stores, and office supplies stores. This result is as expected for sports and hobby stores, since they

have a small and insignificant value of ρ in Table 2, but not for jewelry stores and office supplies

stores which have the highest values of ρ across all segments. To investigate this result, we regressed

model (13) individually for each segment. We found that past return was not significant for all

three segments where the combined forecast could not improve over experts. On the contrary, the

coefficient on past return was statistically significant for the remaining ten segments. This might be

an indication that for jewelry stores and office supplies stores, experts do incorporate the financial

information in their updates.

In the online appendix, we present forecast accuracy comparisons between the model and the

experts for each firm. The M/E ratio varies across firms from 0.68 to 1.98, with a median of 1.17.

Thus, for half the firms, the model’s RMSE is at most 17% worse than the analysts’ RMSE. The

24

M/E0 ratio ranges from 0.68 to 1.34, with a median of 0.99. The combined forecast has C/E ratio

ranging from 0.86 to 1.54 with a median of 1.0, and C/E0 ratio ranging from 0.47 to 1.21 with a

median of 0.88. Finally, we note that the performance of our model and combined forecasts could

be improved further based on the results of our hypotheses tests. For example, one could modify

the estimate of ρ based on the term of the forecast and the gross margin of a firm. One could also

remove outlier firms that consistently had sales below forecasts and yielded large negative estimates

of ρ.

7. Discussion

We presented a model in which sales forecast error is a function of the state of the economy

characterized by the return on a financial index. We tested this model using firm-level total annual

sales forecasts and financial data for public US retailers in several NAICS groups for the period

1997-2007. The pooled estimate for the correlation coefficient is positive and statistically significant.

Firms in different retail segments have different correlation coefficients, varying from 0.95 for office

supplies stores to insignificant or negative for some other segments. Within a retail segment, the

correlation coefficient increases with gross margin on average. It also increases with the term of

the forecast. Our model provides results on other parameters for forecasting as well, such as the

volatility of sales forecast error and the adjustment for drift rates. Together, these results can

be applied for forecast updating. We showed that forecast updates from our model provide new

information not contained in the forecast updates by equity analysts. Hence, a combination of

these two inputs has higher average forecast accuracy than equity analysts. This result is most

useful for firms with high correlation coefficients.

Beyond forecast updating, our model can be applied for risk management, calculating the value

of decision postponement, and planning other operational decisions. Several recent research papers

have discussed financial hedging based on demand being correlated with the price of a traded asset.

The correlation information obtained from our model can be used to inform such hedging decisions,

and shows that its value varies depending on the retail segment, the gross margin of a firm, and

the time horizon.

Alternatively, retailers may use operational hedging strategies of at least four types: multiple

25

store formats, flexibility in product assortments, price markdowns, and risk sharing with suppliers.2

Multiple store formats are used by many retailers and provide natural hedging. For example,

Nordstrom Stores and Saks Inc., both high-end retailers, have outlet store chains targeted at a

lower end of the market. High values of ρ for their premium stores will be moderated by low

values of ρ for their outlet stores. Wal-Mart Stores has a discount store chain, with small ρ,

and a warehouse club, with large ρ. Other examples include Gap Stores Inc., Brooks Brothers,

etc. An example of flexibility in product assortment is presented by Tiffany’s, which sells both

high priced and low priced jewelry, and manages changes in demand by customers substituting

downwards or upwards under different market conditions. Some retailers may be unable to utilize

natural hedges if their product market is too focused, for example, office supplies stores or other

category killers. Such retailers may benefit more from price markdowns and risk sharing with

suppliers, both of which have been widely studied in the operations management literature. Thus,

operational hedging methods enable retailers to manage the risk caused by demand for different

products being variously correlated with financial market indicators.

Our model can be used to quantify the value of decision postponement. Long term company-

wide decisions will have greater benefit from our model because they are based on aggregate fore-

casts and because financial information carries more value for updating long term forecasts. Exam-

ples of such decisions include production of retailer’s private brand products, procurement of raw

materials (e.g., fabric procurement for a fashion retailer), and reservation of logistics capacity; see

Swaminathan and Lee (2003) for a comprehensive discussion. The value of postponement generally

comes from two sources: an increase in expected profit, and a decrease in the variance of profit.

It can easily be shown for our model that increase in expected profit through decision postpone-

ment is more for firms with higher correlation and greater demand volatility. The reduction in

profit variance depends mostly on the extent of correlation between the sales forecast error and the

financial market return, and is less sensitive to the magnitude of demand volatility.

Besides the above applications, our model shows that retailers face a spectrum of different

degrees of market risk as quantified by ρ. During a period of growth, a firm may be tempted to

make long-term investments in expanding or changing its product portfolio or market position, only

to find this investment become risky later on. By quantifying the value of ρ for different markets2Van Mieghem (2008: p.333) presents a generic classification of four types of operational hedging strategies. Our

discussion focuses on specific examples of these four types for retailers.

26

and types of firms, our model helps identify the risks faced by firms in such decisions. Our results

are particularly useful here since they are based on a data set that includes two periods of high

growth, 1997-99 and 2002-05, and two periods of low growth, 2000-01 and 2006-07.

Our paper has some limitations that can be addressed in future research. One limitation is that

we assume sales forecast to evolve as a martingale. In practice, forecast updates by analysts may not

conform to this assumption. This does not affect our model since we use the analysts’ forecast as an

initial input and do updates only using financial market indicators. However, this assumption might

become important if updates are based on both financial and firm-specific data. Second, after the

issuance of a forecast, a firm may take decisions regarding prices, product assortment, and supply

chain that can change the correlation coefficient with financial market returns. For example, we

found the correlation coefficient to depend on the term of the forecast. This finding presents an

opportunity for future research incorporating firms’ decisions in the forecasting model. In this

respect, our model could be extended to a multi-period formulation and estimated on within-firm

product-level data. Third, we believe that results similar to ours can be found in upstream layers

of the supply chain, e.g., in manufacturing or wholesale distribution. The empirical investigation

of this phenomenon and the study of its applications to supply chain models would be valuable.

References

Ando, A., F. Modigliani. 1963. The” Life Cycle” hypothesis of saving: aggregate implications and tests. The

American Economic Review 53(1) 55–84.

Banks, J., R. Blundell, A. Lewbel. 1997. Quadratic engel curves and consumer demand. The Review of

Economics and Statistics 79(4) 527–539.

BBC. 1999. Business: The economy us retail sales boom. BBC, 03/11/1999, retrieved on 11/29/2007 .

Business Week. 2001. September retail sales drop stuns street. Business Week, Standard & Poor’s Global

Markets , 10/12/2001 retrieved on 11/29/2007.

Caldentey, R., M. Haugh. 2006. Optimal control and hedging of operations in the presence of financial

markets. Mathematics of Operations Research 31(2) 285.

Chen, X., M. Sim, D. Simchi-Levi, P. Sun. 2007. Risk aversion in inventory management. Operations

Research 55(5) 828.

27

Chod, J., N. Rudi, J. Van Mieghem. 2007. Operational flexibility and financial hedging: complements or

substitutes? Working paper .

Dixit, A.K., R.S. Pindyck. 1994. Investment under uncertainty . Princeton University Press Princeton, NJ.

Dynan, K.E., D.M. Maki. 2001. Does stock market wealth matter for consumption? FEDS Discussion Paper

No. 2001-23 .

Friedman, M. 1957. A theory of the consumption function. Princeton University Press.

Gaur, V., S. Seshadri. 2005. Hedging inventory risk through market instruments. Manufacturing & Service

Operations Management 7(2) 103–120.

Graves, S.C., D.B. Kletter, W.B. Hetzel. 1998. A dynamic model for requirements planning with application

to supply chain optimization. Operations Research 46 S35–S49.

Graves, S.C., H.C. Meal, S. Dasu, Y. Qiu. 1986. Two-stage production planning in a dynamic environment. in

Multi-stage Production Planning and Control, Axsater, S., Schneeweiss, C., Silver, E. (eds). Springer

Verlag, Berlin.

Greene, W.H. 2003. Econometric analysis. Prentice Hall, Upper Saddle River, NJ.

Heath, D. C., P. L. Jackson. 1994. Modeling the evolution of demand forecasts with application to safety

stock analysis in production/distribution systems. IIE Trans. 26(3) 17–30.

Hong, H., W. Torous, R. Valkanov. 2007. Do industries lead stock markets? Journal of Financial Economics

83(2) 367–396.

Hull, J. 1993. Options, futures, and other derivative securities. Prentice Hall, Upper Saddle River, NJ.

Karatzas, I., S. Shreve. 1991. Brownian motion and stochastic calculus. Springer, New York, NY.

Kaufman, L. 2001. Empty aisles and empty stores; what happens when anchor stores weigh anchor? The

New York Times, 02/06/2001, retrieved on 11/29/2007 .

Menzly, Lior, Oguzhan Ozbas. 2006. Cross-industry momentum. SSRN eLibrary .

Poterba, J.M. 2000. ” Stock Market Wealth and Consumption”. Journal of Economic Perspectives 14(2)

99–118.

Reuters. 2007. Office depot falls on gloomy forecast. Reuters, 09/07/2007, retrieved on 11/29/2007 .

Sapra, A., P.L. Jackson. 2005. The martingale evolution of price forecasts in a supply chain market for

capacity. Tech. rep., Working paper, Cornell University.

Swaminathan, J.M., H.L. Lee. 2003. Design for postponement , Handbooks in Operations Research and

Management Science, vol. 11. Elsevier Publishers.

Van Mieghem, J.A. 2003. Commissioned paper: capacity management, investment, and hedging: review and

recent developments. Manufacturing & Service Operations Management 5(4) 269–302.

28

Van Mieghem, J.A. 2008. Operations Strategy: Principles and Practice. Dinesystems.

29

30

Figure 2. Scatter plots of the coefficients b2 and b3 computed using MLE estimates and best linear unbiased predictors of the RC model.

Figure 3. Comparison of the initial forecast, the forecast updates generated by analysts and by the model, and the actual sales for Limited Brands, Inc., for fiscal year 2006.

b 2, (ln(1+r))

-1

-0.5

0

0.5

1

-1 -0.5 0 0.5 1

WMLE

RC

_BL

UP

b 3, (T)

-0.2

-0.15

-0.1

-0.05

0

0.05

0.1

0.15

0.2

-0.2 -0.15 -0.1 -0.05 0 0.05 0.1 0.15 0.2

WMLE

RC

_BL

UP

Limited Brands (rho=0.80), FY 2006

9800

9900

10000

10100

10200

10300

10400

10500

10600

10700

10800

12141618202224

Term of the forecast, months

Sales

Experts

Model

Experts 0

31

Table 1. Data summary, by retail segment. Retailers are grouped according to the first digits of their 2007 NAICS US Code (field NAICS ID).

Retail Segment

ID NAICS

ID Retail Segment Name # of firms

# of firm-years

# of obs.

Total sales

(2005), BB $

Median gross

margin, % Examples of firms 1 441 Automotive Parts and

Acc. Stores 5 22 173 17 46.81 Autozone, Pep Boys

2 442 Home Furnishings Stores

7 54 479 15 42.20 Haverty Furniture, Bed Bath & Beyond

3 443 Consumer Electronics Stores

5 33 262 54 38.65 Circuit City, Best Buy

4 444 Home Improvement Stores

3 23 204 140 31.03 Home Depot, Lowe’s

5 445 Supermarkets 8 43 313 124 30.56 Albertson’s, Safeway

6 446 Pharmacies 3 19 169 108 27.01 CVS, Rite Aid

7 4481 Apparel Stores 30 175 1449 90 39.02 GAP, Limited Brands

8 4482 Shoe Stores 4 23 173 9.2 31.51 Foot Locker, Shoe Carnival

9 4483 Jewelry Stores 4 23 186 5.4 51.92 Tiffany & Co., Zale’s

10 451 Sports and Hobby Stores

9 40 318 18 35.48 Big 5 Sporting Goods, Michael’s

11 4521 Department Stores 5 29 238 43 35.06 JC Penney, Saks Fifth Avenue

12 4529 Discounters and Warehouse Clubs

11 68 606 505 30.07 Costco, Wal-Mart, Dollar Tree

13 45321 Office Supplies Stores 3 18 128 33 27.54 Office Depot, Staples

Total 97 570 4,698 1,162

32

Table 2. Estimates of parameters obtained from segment-wise MLE and firm-wise MLE using equation (10).

Retail Segment

Segment-wise MLE Summary of Firm-wise MLE of

D Intercept,

a Coeff. of ln Fijy, b1 Minimum Median Maximum

Pooled (overall) .1703*** .0751*** -.0270*** 1.002** -.9934 .1960 .9550

(.0204) (.001) (.0086) (.001)

Pooled (discretionary purchase)

0.1938*** 0.0849*** -0.0084 0.9998 -.9316 .3045 .9124

(0.0246) (0.0012) (0.0092) (0.0011)

Pooled (everyday purchase)

0.1153*** 0.0570*** -0.0496*** 1.0044*** -.9934 .0877 .9550

(0.0376) (0.0016) (0.0134) (0.0014)

Auto Parts and Acc. -.1556 .0537*** .2187 .9721* -.0644 .0135 .4706

(.4958) (.0148) (.1416) (.0158)

Home Furnishings -.1204*** .0719*** -.0224 1.001 -.6793 .2693 .7947

(.0452) (.0021) (.0267) (.0036)

Cons. Electronics .0403 .0864*** -.1903*** 1.0199*** -.6385 .4199 .4785

(.0810) (.0037) (.0529) (.0059)

Home Improvement .4392*** .0406*** .0282 .9970 .3226 .4608 .6789

(.1118) (.0059) (.0345) (.0034)

Supermarkets .2904*** .0576*** -.0372 1.0026 -.9934 .1903 .9397

(.0613) (.0022) (.0271) (.0027)

Pharmacies .3153*** .0351*** -.2261** 1.0224** -.1405 .2946 .6656

(.1010) (.0059) (.0977) (.0095)

Apparel Stores .2681*** .0971*** -.0007 .9993 -.8407 .2938 .9124

(.0350) (.0021) (.0106) (.0014)

Shoe Stores .0482 .08146*** -.0560 1.0087 -.3621 -.0830 .1960

(.2093) (.0067) (.0478) (.0064)

Jewelry Stores .6075*** .0774*** -.0738 1.0085 -.9316 -.2032 .8655

(.0928) (.0055) (.0464) (.0064)

Sports and Hobby -.0757 .0635*** -.1904*** 1.0241*** -.8548 -.1629 .6791

(.1000) (.0053) (.0680) (.0093)

Department Stores .1850* .0558*** .3275*** .9620*** -.7827 -.5448 .4557

(.0950) (.0034) (.1167) (.0127)

Discount. and Whse Clubs

-.0686 .0470*** -.0636*** 1.0050*** -.8114 .2856 .9143

(.0546) (.0019) (.0127) (.0013)

Office Supplies .8264*** .0921*** -.5322* 1.0524 .8527 .9526 .9550

(.0628) (.0086) (.3007) (.0320)

Numbers in parentheses give standard errors of corresponding estimates. ***, **, * denote statistical significance at .01, .05, and .10, respectively. For b1, which is the coefficient of ln Fijy, we report statistical significance for the test that b1 = 1.0.

33

Table 3. Segment- and gross margin-wise estimates of parameters obtained from MLE using equation (10), with static classification of firms by gross margin

Retail Segment

Gross margin < Median for segment Gross margin ≥ Median for segment

D Intercept, aCoeff. of ln Fijy, b1 D Intercept, a

Coeff. of ln Fijy, b1

Pooled .1329*** .0628*** -.0441*** 1.0039** .1923*** .0839*** .0059 .9976

(.0363) (.0015) (.0147) (.0015) (.0292) (.0014) (.0144) (.0019) Auto Parts and Acc.

-.7679** .0848*** .4526** .8712*** .4576 .0821** -.1464 -.5037**

(.3742) (.0263) (.3742) (.0263) (.6196) (.0413) (.6196) (.0413) Home Furnishings

-.2636** .0742*** .3393*** .9448*** .0019051 .060449*** -.03331 1.0046

(.0611) (.0040) (.0727) (.0106) (.0647) (.0023) (.0361) (.0047) Cons. Electronics

.3749** .0775*** .1660 .9844 -.2095* .0896*** -.0139 .9944

(.1056) (.0031) (.1284) (.0131) (.1264) (.0063) (.0525) (.0068) Home Improvement

.4499** .0292*** .4722*** .9543*** .5886*** .0481*** .0073 .9987

(.1798) (.0027) (.1270) (.0121) (.1636) (.0076) (.0366) (.0037) Supermarkets .6467*** .0479*** -.2138*** 1.0191*** .0380 .0632*** -.0222 1.0019

(.0662) (.0039) (.0520) (.0051) (.0762) (.0028) (.0303) (.0031) Pharmacies -.1337 .1224 .8542 .9132 .5314*** .0368*** .2220 .9798

(.8928) (.1120) (3.9859) (.4091) (.1052) (.0066) (.1517) (.0145) Apparel .0996* .0883*** -.0980*** 1.0093*** .3952*** .0993*** .1136*** .9865**

(.0568) (.0029) (.0233) (.0029) (.0434) (.0028) (.0427) (.0055) Shoe Stores -.2483 .0871** -.1482 1.0213* .1392 .0916*** -.2251 1.0348

(.7083) (.0411) (.1136) (.0110) (.1804) (.0083) (.1672) (.0239) Jewelry Stores

.2427 .0506*** .1042*** .9851*** .7053*** .0886*** -.1856*** 1.0231**

(.2436) (.0053) (.0393) (.0052) (.1131) (.0084) (.0849) (.0114) Sports and Hobby

.4385** .0755*** .7151*** .9092*** -.0850 .0389*** -.3401*** 1.0426***

(.1929) (.0086) (.2210) (.0286) (.1812) (.0030) (.0596) (.0080) Department Stores

-.4974*** .0389*** .4125*** .9563*** .5901*** .0648*** .3657 .9545

(.1144) (.0039) (.0620) (.0067) (.0923) (.0067) (.2725) (.0304) Discount. and Whse Clubs

.1050 .0349*** -.0793*** 1.0062*** -.2163*** .0561*** -.0970*** 1.0098**

(.0826) (.0023) (.0210) (.0023) (.0755) (.0028) (.0277) (.0035) Office Supplies

.9538*** .0467*** -.0471 .9879*** .9084*** .0773*** .2958 .9666

(.0360) (.0047) (.0286) (.0035) (.0516) (.0180) (.4753) (.0492)

Numbers in parentheses give standard errors of corresponding estimates. ***, **, * denote statistical significance at .01, .05, and .10, respectively. For b1, which is the coefficient of ln Fijy, we report statistical significance for the test that b1 = 1.0.

34

Table 4. Estimates of parameters obtained by MLE using equation (10) on the subsamples with varying term of the forecast.

Term of the forecast

Number of observations D Intercept, a

Coeff. of ln Fijy, b1

Short 2089 .0920*** .0755*** -.0195*** 1.0015**

(.0161) (.0007) (.0058) (.0007)

Medium 1724 .2159*** .0738*** -.0328*** 1.0021*

(.0144) (.0006) (.0104) (.0012)

Long 885 .3009*** .0771*** -.0440*** 1.0035***

(.0118) (.0006) (.0078) (.0009)

Short, medium and long term forecasts are made 12-15, 16-19, 20-23 months, respectively, before each fiscal year end date. Numbers in parentheses give standard errors of corresponding estimates. ***, **, * denote statistical significance at .01, .05, and .10, respectively. For b1, which is the coefficient of ln Fijy, we report statistical significance for the test that b1 = 1.0. Table 5. Estimation results for the information structure of forecast updates by the model and by experts. Past financial market returns and experts’ forecast updates are both significant determinants of actual sales.

Regression equation Equation (12) Equation (13) Equation (14)

Dependent Variable ln Fijy – ln Fi1y ln Diy ln Diy

Estimation method OLS RC OLS RC OLS RC

(1) (2) (3) (4) (5) (6)

ln Fijy 1.0015*** .9914***

(.0008) (.0015)

ln (1+rijy) .0710*** .0737*** .0698*** .0465**

(.0076) (.0275) (.0078) (.0237)

Tijy -.0233*** .0181** -.0169*** .0309***

(.0044) (.0082) (.0045) (.0085)

ln 1 ̂ .0321*** .0181 .0872*** .0891***

(.0068) (.0274) (.0126) (.0336)

tijy -.0102** -.0127 -.0092 .0290*** .0102* .0505***

(.0047) (.0088) (.0060) (.0084) (.0060) (.0076)

Ti1y .0026 .0039

(.0046) (.0041)

ln Fmijy 1.0000*** .9891***

(.0009) (.0015)

ln Fijy – ln Fi1y 1.2764*** 1.0160***

(.0286) (.0247)

Numbers in parentheses give standard errors of corresponding estimates. ***, **, * denote statistical significance at .01, .05, and .10, respectively.

35

Table 6. Comparison of segment-wise RMSE of the initial analysts’ forecast , the updated analysts’ forecast , our model forecast , and combined forecast

RMSE values Ratio of RMSE values

Retail Segment

(E0)

(E)

(M)

(C) (M/E0) (M/E) (C/E0) (C/E)

Auto Parts and Acc. .0621 .0534 .0620 .0529 .9997 1.1611 .8517 .9905

Home Furnishings .1030 .0872 .1025 .0869 .9950 1.1753 .8439 .9968

Cons. Electronics .1190 .1020 .1150 .1016 .9666 1.1283 .8524 .9945

Home Improvement .0840 .0508 .0820 .0487 .9755 1.6127 .5807 .9603

Supermarkets .0778 .0664 .0766 .0657 .9845 1.1544 .8415 .9859

Pharmacies .0705 .0457 .0714 .0446 1.0122 1.5625 .6348 .9793

Apparel Stores .1362 .1139 .1378 .1127 1.0116 1.2104 .8309 .9936

Shoe Stores .1182 .0985 .1156 .0977 .9787 1.1738 .8297 .9956

Jewelry Stores .0922 .0825 .0912 .0857 .9895 1.1052 .9249 1.0336

Sports and Hobby .1036 .0824 .1105 .0825 1.0661 1.3409 .7965 1.0015

Department Stores .0793 .0695 .0767 .0679 .9665 1.1036 .8535 .9738

Discount. and Whse Clubs .0661 .0587 .0666 .0585 1.0079 1.1357 .8842 .9957

Office Supplies .0841 .0727 .0836 .0804 .9939 1.1493 .9557 1.1055

OVERALL .1062 .0886 .1067 .0884 1.0046 1.2036 .8321 .9969

Sales Forecasting with Financial Indicators and...

Documents

Transcript of Sales Forecasting with Financial Indicators and...