Optimal Campaigning in Presidential Elections: The Probability of...
Transcript of Optimal Campaigning in Presidential Elections: The Probability of...
Optimal Campaigning in Presidential Elections:The Probability of Being Florida
David Strömberg∗
IIES, Stockholm University
December 8, 2002
Abstract
This paper delivers a precise recommendation for how presidential candidatesshould allocate their resources to maximize the probability of gaining a majorityin the Electoral College. A two-candidate, probabilistic-voting model reveals thatmore resources should be devoted to states which are likely to be decisive in theelectoral college and, at the same time, have very close state elections. The op-timal strategies are empirically estimated using state-level opinion-polls availablein September of the election year. The model’s recommended campaign strategiesclosely resemble those used in actual campaigns. The paper also analyses the policyeffects of electoral reform. It finds that resources will be more equally distributedunder Direct Vote than under the Electoral College system, and that both systemsdiscipline presidents from extracting political rents to an equal extent.
∗[email protected], IIES, Stockholm University, S-106 91 Stockholm.I thank Steven Brams, Steve Coate, Antonio Merlo, Torsten Persson, Gerard Roland, Tom Romer, HowardRosenthal, Jim Snyder, Jörgen Weibull, and seminar participants at UC Berkeley, Columbia University,Cornell University, the CEPR/IMOP Conference in Hydra, Georgetown University, the Harvard / MITSeminar on Positive Political Economy, IIES, New York University, Stanford University, University ofPennsylvania, Princeton University, and the Wallis Conference in Rochester. Previous versions havebeen circulated under the titles: ”The Lindbeck-Weibull model in the Federal US Structure”, and, ”TheElectoral College and Presidential Resource Allocation”s.JEL-classification, D72, C50, C72, H50, M37.Keywords: elections, political campaigns, public expenditures
1. Introduction
The President of the United States is arguably the worlds most powerful political leader,and the incentives created by his electoral procedure are important. In consequence,it is not surprising that the Electoral College system1 has been under constant debate.According to federal historians, there have been more proposals for constitutional amend-ments to alter or abolish the Electoral College than on any other subject. The issue wasfurther put in focus by the 2000 election featuring a razor thin victory and a presidentelect who lost the popular vote. Still, the effects of the Electoral College system are notwell understood and academic research on the topic is underdeveloped.This paper attempts to fill this void by providing the first empirically reasonable, full-
fledged game-theoretic model of political competition under the Electoral College system.Moreover, it uses the model to provide precise answers to a range of questions: what wouldresource allocations look like if candidates try to maximize the probability of winning,are they doing this, who would gain and who would lose if the president was insteadelected through a direct national vote, which system creates a more equal distribution ofresources, which system will better protect voters from political rent-seeking?A major difference between this paper and earlier work is the complete integration of
theory and empirics, tying together theoretical insights with empirical results on actualcampaigns. This is done by constructing a probabilistic-voting model that can both beexplicitly solved, and directly estimated. The explicit solution draws on use of a CentralLimit Theorem approximation, while the possibility of direct estimation draws on a carefulmodelling of political preferences and election uncertainty. The uncertainty, which makesthe voting-model probabilistic, is estimated by using the errors of a vote forecast.The most closely related theoretical work is Snyder’s (1989) model of two-party com-
petition for legislative seats. Although the equilibrium of this model is not explicitlysolved, some important qualitative features are characterized. Snyder (1989) finds thatequilibrium campaign allocations are higher in districts with close to 50-50 vote shares.Further, if the goal of the parties is to maximize the probability of winning a majorityof seats, then allocations are also higher in districts which are more likely to be pivotal.Finally, more resources will be spent in safe districts of the advantaged party than in thesafe districts of the other party. Snyder’s paper studies one-member districts and thereforedoes not allow for variation in size necessary for the analysis of the Electoral College. Sizeeffects are instead included in Brams and Davis’ (1974) study of presidential campaigning.On the other hand, they abstract from differences in vote-shares by assuming that votesin each state are cast with equal probability for each candidate. Brams and Davis (1974)find that presidential candidates should allocate resources disproportionately in favor oflarge states. Their result is disputed by Colantoni, Levesque and Ordeshook (1975) whoinstead argue that a proportional rule, modified to take into account the closeness of thestate election, predicts actual campaign allocations better. Inspired by these results, Na-gler and Leighley (1992) empirically investigate state-by-state campaign expenditures on
1In this system, a direct vote election is held in each state and the winner of the vote is supposed toget all of that state’s electoral votes. Then all the electoral votes are counted, and the candidate whoreceives most votes wins the election. (The fact that Maine and Nebraska organize their presidentialelections by congressional district is disregarded in this paper.)
2
non-network advertising in 1972 and find these to be higher in states with closer electionsand more electoral votes.A separate theoretical literature has analyzed the policy effects of plurality versus
proportional representation election systems. For example, Persson and Tabellini (1999),and Lizzeri and Persico (2001), find that under plurality rule governments tend to over-provide redistributive spending, relative to public goods, because its benefits can be moreeasily targeted.The model finally reveals a link between all of the above literature and the literature
concerning ”voting power”, that is, the probability that a vote is decisive in an election.The statistical properties of voting power have been analyzed extensively; see Banzaf(1968), Chamberlain and Rothschild (1988), Gelman and Katz (2001), Gelman, King andBoscardin (1998)), and Merrill (1978). The "voting power" analyzed in this paper isslightly different since it is conditional on the candidates’ equilibrium strategies.The estimable probabilistic-voting model developed in this paper is quite general. It
can be used to analyze a range of resource allocation problems in a variety of electoralsettings. As shown in this paper, it can be used to analyze the allocation of campaignresources as well as redistributive spending and political rents under the Electoral andDirect Vote systems. With a minor adjustment, it may also be used to analyze theallocational strategies of parties trying to win a majority of one-member districts, in,say, US Congressional elections. In Strömberg (2002b), the model is adapted to analyzeendogenous voter turnout.Section 2 develops the theoretical model. The probability distribution for election
outcomes suggested by the model is empirically estimated in Section 3. These estimatesare used to confront the model’s predictions with actual campaign efforts in Section 4.Section 5 interprets the equilibrium. Section 6 addresses the effects of a change to a DirectVote system. Finally, Section 7 discusses the results and concludes.
2. Model
Two presidential candidates, indexed by superscript R and D, try to maximize their ex-pected probability of winning the election by selecting the number of days, ds, to campaignin each state s, subject to the constraint
SXs=1
dJs ≤ I,
J = R,D. In each state s, there is an election. The candidate who receives a majority ofthe votes in a state gets all the es electoral votes of that state. After elections have beenheld in all states, the electoral votes are counted, and the candidate who gets more thanhalf those votes wins the election.There is a continuum of voters, indexed by subscript i, a mass vs of which live in state
s. Campaigning in a state increases the popularity of the campaigning candidate amongvoters in that state, as captured by the increasing and concave function u
¡dJs¢.2 The
2This paper does not address the question of why campaigning matters. This is an interesting questionin its own right, with many similarities to the question of why advertisements affect consumer choice.
3
voters also care about some fixed characteristics of the candidates, captured by parametersRi, ηs, and η. The parameter Ri represents an individual-specific ideological preference infavor of candidate R, and ηs and η represent the general popularity of candidate R instate s and nation, respectively. The voters may vote for candidate R or candidate D,and voter i in state s will vote for D if
∆us = u¡dDs¢− u
¡dRs¢ ≥ Ri + ηs + η. (2.1)
At the time when the campaign strategies are chosen, there is uncertainty about thepopularity of the candidates on election day. This uncertainty is captured by the randomvariables ηs and η. The candidates know that the S state level popularity parameters,ηs, and the national popularity parameter, η, are independently drawn from cumulativedistribution functions Gs = N(0, σ2s), and H = N (0, σ2) respectively, but they do notknow the realized values.The distribution of voters’ ideological preferences, Ri, within each state is Fs =
N¡µs, σ
2fs
¢, a normal distribution with mean µs and variance σ2fs. The means of the
states’ ideological distributions may shift over time, but the variance is assumed to remainconstant. The share of votes that candidate D receives in state s is
Fs(∆us − ηs − η).
This candidate wins the state if
Fs(∆us − ηs − η) ≥ 12,
or, equivalently, ifηs ≤ ∆us − µs − η.
The probability of this event, conditional on the aggregate popularity η, and the campaignvisits, dDs , and dRs , is
Gs (∆us − µs − η) . (2.2)
Let es be the number of votes of state s in the Electoral College. Define stochasticvariables, Ds, indicating whether D wins state s
Ds = 1, with probability Gs (·) ,Ds = 0, with probability 1−Gs (·) .
The probability that D wins the election is then
PD (∆us, η) = Pr
"Xs
Dses >1
2
Xs
es
#. (2.3)
However, it is difficult to find strategies which maximizes the expectation of the aboveprobability of winning. The reason is that it is a sum of the probabilities of all possiblecombinations of state election outcomes which would result in D winning. The numberof such combinations is of the order of 251, for each of the infinitely many realizations ofη.
4
A way to cut this Gordian knot, and to get a simple analytical solution to this prob-lem, is to assume that the candidates are considering their approximate probabilities ofwinning. Since the ηs are independent, so are the Ds. Therefore by the Central LimitTheorem of Liapounov, P
sDses − µ
σE
whereµ = µ (∆us, η) =
Xs
esGs (∆us − µs − η) , (2.4)
andσ2E = σ2E (∆us, η) =
Xs
e2sGs (·) (1−Gs (·)) , (2.5)
is asymptotically distributed as a standard normal. The mean, µ, is the expected numberof electoral votes. That is, the sum of the electoral votes of each state, multiplied by theprobability of winning that state. The variance, σ2E, is the sum of the variances of thestate outcomes, which is the e2s multiplied by the usual expression for the variance of aBernoulli variable. Using the asymptotic distribution, the approximate probability of Dwinning the election, conditional on η, is
ePD (∆us, η) = 1− Φ
µ 12
Ps es − µ
σE
¶. (2.6)
The approximate probability of winning the election is
PD (∆us) =
Z ePD (∆us, η)h (η) dη.3 (2.7)
Let the set of allowable campaign visits be
X =
(d ∈ <S
+ :SXs=1
ds ≤ I
).
The Nash Equilibrium strategies¡dD∗, dR∗
¢in the competition between the two candidates
are characterized by
PD¡dD∗, dR
¢ ≥ PD¡dD∗, dR∗
¢ ≥ PD¡dD, dR∗
¢for all dD, dR ∈ X. This game has a unique interior pure-strategy equilibrium characterizedby the proposition below.
Proposition 1. The unique pair of strategies for the candidates¡dD, dR
¢that constitute
an interior NE in the game of maximizing the expected probability of winning the electionmust satisfy dD = dR = d∗, and, for all s and for some λ > 0,
Qsu0 (d∗s) = λ, (2.8)
3The error made from using the approximate probability of winning is discussed in Section 5. Forfurther discussion, see Appendix 6.8 of Stromberg (2002).
5
where
Qs =∂PD
∂∆us. (2.9)
Proof: See Appendix 8.1.Proposition 1 says that presidential candidates, trying to maximize their probability
of winning the election, should spend more time in states with high values of Qs. Thisfollows since u0 (d∗s) is decreasing in d∗s.Note that Qs can be decomposed into two additively separable parts:
Qs = Qsµ +Qsσ (2.10)
= es
Z1
σEϕ (x (η)) gs (−µs − η)h (η) dη
+e2sσ2E
Zϕ (x (η))x (η)
µ1
2−Gs (−µs − η)
¶gs (−µs − η)h (η) dη,
where
x (η) =12
Ps es − µ
σE.
One arises because the candidates have an incentive to influence the expected numberof electoral votes won by D, that is the mean of the normal distribution. The otherarises because the candidates have an incentive to influence the variance in the numberof electoral votes. To evaluate Qs, I now use the structure of the model to estimate theprobability distribution for election outcomes.
3. Estimation
This estimation provides the link between the theoretical probabilistic-voting model aboveand the empirical applications discussed below. In equilibrium, both candidates choosethe same allocation, so that ∆us = 0 in all states. The Democratic vote-share in state sat time t equals
yst = Fst (−ηst − ηt) = Φ
µ−µst − ηst − ηtσfs
¶, (3.1)
where Φ (·) is the standard normal distribution, or equivalently,
Φ−1(yst) = γst = −1
σfs(µst + ηst + ηt) . (3.2)
For now, assume that all states have the same variance of preferences, σ2fs = 1, andthe same variance in state-specific shocks, σ2s.
4 The former assumption implies that themarginal voter density, conditional on the state election being tied, is the same in all
4These assumptions will be removed in Section 6. However, the estimates become imprecise if separatevalues of µst, σfs, and σs are estimated for each state using only 14 observations per state. Therefore,the more restrictive specification will be used for most of the paper.
6
states. Further assume that the mean of the preference distribution, µst, depends on aset of observable variables Xst. Then we get the following estimable equation,
γst = − (Xstβ + ηst + ηt) . (3.3)
The parameters β, σs and σ are estimated using a standard maximum-likelihood estima-tion of the above random-effects model.5
The variables in Xst are basically those used in Campbell (1992). The nation-widevariables are: the Democratic vote share of the two-party vote share in trial-heat pollsfrom mid September (all vote-share variables x are transformed by Φ−1(x)); second quar-ter economic growth; incumbency; and incumbent president running for re-election. Thestate-wide variables for 1948-1984 are: lagged and twice lagged difference from the na-tional mean of the Democratic two-party vote share; the first quarter state economicgrowth; the average ADA-scores of each state’s Congress members the year before theelection; the Democratic vote-share of the two-party vote in the midterm state legislativeelection; the home state of the president; the home state of the vice president; and dummyvariables described in Campbell (1992). After 1984, state-level opinion-polls were avail-able. For this period, the state-wide variables are: lagged difference from the nationalmean of the Democratic vote share of the two-party vote share; the average ADA-scoresof each state’s Congress members the year before the election; and the difference be-tween the state and national polls. Other state-wide variables were insignificant whenstate polls were included. The coefficients β and the variance of the state-level popu-larity shocks, σ2s, are allowed to differ when opinion polls were available and when theywere not. Estimates of equation (3.3) yields forecasts by mid September of the electionyear. The data-set contains state elections for the 50 states 1948-2000, except Hawaii andAlaska which began voting in the 1960 election. During this period there were a totalof 694 state-level presidential election results. Four elections in Alaska and Hawaii wereexcluded because there were no lagged vote returns. Nine elections are omitted becauseof idiosyncrasies in Presidential voting in Alabama in 1948, and 1964, and in Mississippiin 1960; see Campbell (1992). This leaves a total of 681 observations.The estimation results are shown in Table 1. The estimated standard deviation of
the state level shocks after 1984 , bσs, equals 0.077, or about 3% in vote shares. Thisis more than twice as large as that of the national shocks, bσ = 0.033. The estimatedstate-preference means are bµst = Xst
bβ.The average absolute error in state-election vote-forecasts, Φ (bµst), is 3.0 percent and thewrong winner is predicted in 14 percent of the state elections. This precision is comparableto the best state-level election-forecast models (Campbell, 1992; Gelman and King, 1993;
5The model has been extended to include regional swings, see Appendix 6.4 of Strömberg (2002). Inthis specification, the democratic vote-share in state s equals
yst = Fst (ηst + ηrt + ηt) ,
where ηrt denotes independent popularity shocks in the Northeast, Midewest, West, and South. However,taking into account the information of September state-level opinon polls, there are no significant regionalswings. Therefore, the simpler specification without regional swings is used below.
7
Holbrook and DeSart, 1999; Rosenstone, 1983). Given bµst, bσs and bσ, Qs can be calculatedusing equation 2.10.
4. Relation between Qs and actual campaigns
This section compare the equilibrium campaign strategies, based on the above estimates,to actual campaign strategies. The first sub-section will investigate presidential candidatevisits to states in last three months of the 2000 election, and also more loosely discussvisits during the 1988-1996 elections. The second sub-section will study the allocation ofcampaign advertisements across media markets in last three months of the 2000 campaign.
4.1. Campaign visits
If one assumes that u (ds) is of log form, then the optimal allocation, based on equation(2.8) is,
d∗sPd∗s=
QsPQs
, (4.1)
and the number of days spent in each state should be proportional to Qs.The Bush and Gore campaigns were very similar to the model-predicted equilibrium
campaign based on September opinion polls. The actual number of year 2000 campaignvisits, after the party conventions, and Qs, are shown in Figure 4.1.6 Campaign visits byvice presidential candidates are coded as 0.5 visits. The model and the candidates’ actualcampaigns agree on 8 of the 10 states which should receive most attention. Notabledifferences between theory and practice are found in Iowa, Illinois and Maine, whichreceived more campaign visits than predicted, and Colorado, which received less. Perhapsextra attention was devoted to Maine since its (and Nebraska’s) electoral votes are splitaccording to district vote outcomes. Other differences could be because the campaignshad access to information of later date than mid September, and because aspects notdealt with in this paper matter for the allocation. The raw correlation between campaignvisits and Qs is 0.91. For Republican visits the correlation is 0.90 and for Democraticvisits, 0.88. A tougher comparison is that of campaign visits per electoral vote, ds/es,with Qs/es. The correlation between ds/es and Qs/es was 0.81 in 2000.Finally, I look at the 1996, 1992, and 1988 campaigns. For these campaigns, only
presidential visits are available. The correlation between visits and Qs during those yearsare 0.85, 0.64, and 0.76, respectively. But this is mainly a result of presidential candidatesspending more time in large states. For the 1996, 1992, and 1998 elections, the correlationbetween ds/es and Qs/es was 0.12, 0.58, and 0.25 respectively. An explanation for thepoor fit in 1996 and 1988 may be that these elections were, ex ante, very uneven. Theexpected Democratic vote shares in September of 1996, 1992, and 1988 were 56, 50, and46 percent. In uneven races, perhaps the candidates have other concerns, such as affectingthe congressional election outcome, rather than maximizing the probability of winning thepresidential election.
6I am grateful to Daron Shaw for providing me with the campaign data.
8
0 2 4 6 8 10 12
Hawaii
New York
Vermont
South Carolina
Alaska
Alabama
South Dakota
North Dakota
Montana
Maine
Wyoming
Maryland
Indiana
West Virginia
Virginia
Mississippi
Connecticut
Delaware
New Jersey
Minnesota
Nevada
Arizona
New Hampshire
Georgia
New Mexico
North Carolina
Colorado
Arkansas
Kentucky
Iowa
Illinois
Oregon
Louisiana
Washington
Wisconsin
Tennessee
Missouri
Ohio
California
Pennsylvania
Michigan
Florida
percent
∑ s
s
(Utah, Texas, Rhode Island, Oklahoma, Texas, Nebraska, Massachusetts, Kansas, Idaho) ≈ 0 for both series.
Actual campaign visits
Figure 4.1: Actual and equilibrium campaign visits 2000
9
4.2. Campaign advertisements
Appendix 8.3 models the decision of presidential candidates to allocate advertisementsacross Designated Market Areas (DMAs).7 In that model, two presidential candidateshave a fixed advertising budget Ia to spend on am ads in each media market m subject to
MXm=1
pmaJm ≤ Ia,
J = R,D, where pm is the price of an advertisement. Media market m contains a massvms of voters in state s. Voters are now also affected by campaign advertisements ascaptured by the increasing and concave function w(am). A voter i in media market m instate s will vote for D if
u¡dDs¢+ w
¡aDm¢− u
¡dRs¢− w
¡aRm¢ ≥ Ri + ηs + η.
The equilibrium number of visits is still characterized by equation (2.8). In equilibrium,both candidates choose the same advertising strategy. Advertising in media market m isincreasing in Qm
pm, where
Qm =SXs=1
Qsnms
ns.
Qm is the sum of the Qs of the states in the media market weighted by the share of thepopulation of state s that lives in media market m.The advertisement data is from the 2000 election and was provided by the Brennan
Center.8 It contains the number and cost of all advertisements relating to the presidentialelection, aired in the 75 major media markets between September 1 and Election Day. Thedata is disaggregated by whether it supported the Republican, Democrat, or independentcandidate, and by whether it was paid for by the candidate, the party or an independentgroup. The cost estimates, pm, are average prices per unit charged in each particular mediamarket. The estimates are made by the CampaignMedia Analysis Group. Advertisementswere aired in 71 markets, where the data set records a total of 174 851 advertisements,for a total cost of $118 million, making an average price of $680. The Democrats spent$51 million, while Republicans spent $67 million. To measure total campaign efforts, Isum together the advertisements by the candidates, the parties and independent groupssupporting the Democratic or Republican candidate.Figure 4.2 illustrates the predicted and actual advertising. The model and the data
agree on the two media markets where most ads should be aired (Albuquerque - SantaFe, and Portland, Oregon). These two markets have the highest effect on the win prob-ability per advertising dollar. In third place the model puts, Orlando - Daytona Beach- Melbourne, while the data has Detroit (number four in the model). The correlation
7A DMA is defined by Nielsen Media Research as all counties whose largest viewing share is given tostations of that same market area. Non-overlapping DMAs cover the entire continental United States,Hawaii and parts of Alaska.
8The Brennan Center began compiling this type of data for the 1998 elections. According to them,no such data exists elsewhere for any other election. This is a new and unique database.
10
0 76200
7620
Tota
l adv
ertis
emen
ts
Qm/pm
Albuquerque - Santa Fe
Portland, Oregon
DenverLexington
Figure 4.2: Total number of advertisements Sept. 1 to election day and Qm/pm, for the75 largest media markets
between actual campaign advertisement and equilibrium advertisement is 0.75. That fewadvertisements were aired in Denver is consistent with the few candidate visits to Col-orado (recall Figure 4.1). The few advertisements in Lexington are more surprising, sincecandidate visits to Kentucky were close to the equilibrium number.To see why Albuquerque - Santa Fe gives a large effect per advertising dollar, note
that we can decompose Qm
pminto four terms:
Qm
pm=Xs
nms
nm|{z}(o)
esns|{z}(i)
Qs
es|{z}(ii)
1
pm/nm| {z }(iii)
. (4.2)
(o) 95 percent of the population of Albuquerque - Santa Fe live in New Mexico (nms
nm=
0.95), and 5 percent in Colorado. (i) Since New Mexico is a small state with only 1.8million inhabitants, it had a high number of electoral votes per capita. (ii) Since NewMexico had a forecasted Democratic vote-share of 51.8%, it had a very high value of Qs
per electoral vote. This relationship will be discussed in Section 5, see Figure 5.3 for apreview. (iii) At the same time, the average cost of an ad per million inhabitants in themedia market is only $209, compared to the average media-market cost, which is $270.In comparison, the Detroit media market lies entirely in Michigan which had the highestvalue of Qs per electoral vote. However, being a fairly large state, Michigan only has 1.8electoral votes per million inhabitants. Further, the average cost of an ad in Detroit ishigher than in Albuquerque - Santa Fe. Therefore, the marginal impact on the probabilityof winning per dollar is lower than in Albuquerque - Santa Fe.Finally, one can note that since the correlation between price and market size is close
to one (0.92), there is no significant relationship between market size and the number ofads.Via the price, the size is instead captured in the costs. Assuming log utility, equi-librium expenditures, pma∗m, are proportional to Qm. Empirically, the simple correlationbetween advertisement costs, pmam, and Qm is 0.88.
11
5. Interpretation
This section discusses what Qs measures and why it varies across states. A qualified guessis that Qs is approximately the joint ”likelihood” that a state is actually decisive in theElectoral College and, at the same time, has a very close election. I will call states whoare ex post decisive in the Electoral College and have tied state elections decisive swingstates. In the 2000 election, Florida was a decisive swing state. In contrast, neither NewMexico nor Wyoming were decisive swing states. While New Mexico was a swing statewith a very close state-election outcome, it was not decisive in the Electoral College sinceBush would have won with or without the votes of New Mexico. While Wyoming wasdecisive in the Electoral College, since Gore would have won the election, had he wonWyoming, it was not a swing state.The above guess is based on the fact that the probability of being a decisive swing
state replaces Qs in the equilibrium condition of the model without the Central Limitapproximation. Further, Appendix 8.2 shows heuristically, why the analytical expressionfor Qs approximates the probability of being a decisive swing state. In this Appendix,Qs is also shown to approximately equal the "voting power" of state s, multiplied by themarginal voter density conditional on the state election being tied.To investigate whether Qs is indeed an almost exact approximation of the probability
of being a decisive swing state, one million electoral vote outcomes were simulated foreach election 1988-2000 by using the estimated state-preference means, and drawing stateand national popularity-shocks from their estimated distributions.9 Then, the share ofelections where a state was decisive in the Electoral College and at the same time had astate election outcome between 49 and 51 percent was recorded. This provides an estimatewhich should be roughly equal to Qs, up to a scaling factor, see Appendix 8.2.Figure 5.1 contains the simulated shares on the y-axis and values computed from the
scaled, analytic expression of Qs, on the x-axis. Large states are trivially more likely tobe decisive. To check that the correlation between Qs and the simulated values is not justa matter of size, the graph on the right contains the same series divided by the state’snumber of electoral votes. The simple correlation in the diagram to the right is 0.997. Sothe two variables are interchangeable, for practical purposes. The 0.003 difference couldresult on the Qs-side from using the approximate probability of winning the election,and on the simulation-side from using a finite number of simulations and recording stateelection results between 49 and 51 percent, whereas theoretically it should be exactly 50percent.To illustrate how Qs varies across states, I will use the year 2000 election, see Figure
5.2. Based on polls available in mid September, 2000, Florida, Michigan, Pennsylvania,California, and Ohio were the states most likely to be decisive in the Electoral College andat the same time have a state election margin of less than 2 percent. This happened in 2.2percent of the simulated elections in Ohio and 3.4 percent of the simulations in Florida. Incomparison, the scaled, analytic expression for Qs equals 3.5 percent for Florida. (UsingQs, the probability that Florida would be decisive in the Electoral College and have a
9Replace µst by the estimated bµst in equation (3.1), and draw ηst and ηt from their estimated distri-
butions N³0, bσ2s´ and N
³0, bσ2´, respectively to generate election outcomes yst.
12
Pivo
tal a
nd c
lose
per
ele
ctor
al v
ote
Qs per electoral vote0 .0020
.002
Sim
ulat
ed s
hare
piv
otal
and
clo
se e
lect
ions
Qs0 .0550
.047
Figure 5.1: Qs and simulated probability of being a decisive swing state
state margin of victory of 1000 votes may also be calculated. This probability is 1.5 in 10000. The probability that this would happen in any state is .44 percent.)
The analytic expression for Qs explains exactly why some states are more likely tobe decisive swing states. First, Qs is roughly proportional to the number of electoralvotes.10 WhileQsµ and,Qsσ are proportional to electoral votes and electoral votes squared,respectively, see equation (2.10), Qsσ is generally considerably smaller than Qsµ. Thisimplies that candidates should, on average, spend more time in large states. However, forstates of equal size there is considerable variation.To explain differences relative to size, we next study Qs/es. In Figure 5.3, the circular
dots show the share of the simulated elections where a state was decisive in the ElectoralCollege and at the same time had a state-election outcome between 49 and 51 percent, perelectoral vote. The solid line shows Qsµ/es. Its normal form arises because the candidatestry to affect the expected number of electoral votes, see equation (2.10). This part ofQs/esaccounts for most of the variation in the simulated values. It explains why states like NewYork and Texas in a million simulated elections are never decisive in the Electoral Collegeand at the same time have close state elections. But in Florida, Michigan, Pennsylvania,and Ohio this happens quite frequently. The solid line is in fact a normal distribution,multiplied by a constant. It is characterized by three features: its amplitude, its mean,and its variance.The amplitude of Qsµ/es is trivially higher when the national election is expected to
be close. This affects all states in a single election in the same way. It explains why theaverage Qsµ varies between elections. 11
10This can be contrasted to the finding that ”voting power”, that is, the probability that a vote ispivotal in the election is more than proportional to size (Banzaf 1967, Brams and Davis 1974). ”Votingpower” is roughly proportional to Qs, and thus roughly proportional to size. The difference resultsfrom all voters being equally likely to vote for one candidate or the other in their models, while voterspreferences for the candidates in my model are heterogenous and subject to aggregate popularity shocks,see Chamberlain and Rothschild (1981) for a theoretical discussion and Gelman and Katz (2001) forempirical results.11Formally, define eηt to be the national popularity-swing which would give equal expected Electoral
Vote shares, µ (eη) = 12
Ps es. Then Qsµ is larger when eηt is close to zero; See equation (8.6) in the
Appendix.
13
0 0,5 1 1,5 2 2,5 3 3,5
Hawaii
New York
Vermont
South Carolina
Alaska
Alabama
South Dakota
North Dakota
Montana
Maine
Wyoming
Maryland
Indiana
West Virginia
Virginia
Mississippi
Connecticut
Delaware
New Jersey
Minnesota
Nevada
Arizona
New Hampshire
Georgia
New Mexico
North Carolina
Colorado
Arkansas
Kentucky
Iowa
Illinois
Oregon
Louisiana
Washington
Wisconsin
Tennessee
Missouri
Ohio
California
Pennsylvania
Michigan
Florida
percent
(Utah, Texas, Rhode Island, Oklahoma, Texas, Nebraska, Massachusetts, Kansas, Idaho) = 0
Figure 5.2: Joint probability of being pivotal and having a state margin of victory lessthan two percent, based on September 2000 opinion polls.
14
Simulated values, pivotal and close /es
Forecasted democratic vote share
30 40 50 60 700
.0005
.001
.0015
Michigan (18)
PennsylvaniaOhio
California
New YorkTexasWyoming
µ∗s
Qsµ /es
Figure 5.3: Probability of being a decisive swing state per electoral vote.
The mean of Qsµ/es in Figure 5.3 is located slightly above 50%. This position is theresult of a trade-off between average, and timely, influence on state election outcomes.To get the intuition, suppose that the Democrats are ahead by 60-40 in the nationalforecasts. In states with 50-50 forecasts, a candidate’s visit is more likely to influencethe state election outcome. However, states where the Democrats are ahead 60-40 arelikely to have close state elections exactly when the national election is close. Althoughthe candidates are less likely to influence the state outcome, they are more likely to doso when it matters. The mean of Qsµ/es will always lie between 50-50 and the nationalforecast (60-40).Consider the example of the 1996 election. In September, Clinton was ahead by 60-40
in the national forecasts, as well as in Pennsylvania, whereas the forecasted outcome inTexas was 50-50. A visit to Texas was therefore more likely to affect the state outcomethan a visit to Pennsylvania. However, if Texas was a 50-50 state on election day, thenClinton was probably winning by a landslide, and the electoral votes of Texas were prob-ably not decisive in the electoral college. On the other hand, in the unlikely event thatPennsylvania was a 50-50 state on election day, the national election is likely to be close,and the electoral votes of Pennsylvania were likely to be decisive in the electoral college.Visits only matter if the state is a 50-50 state on election day, and the candidates mustcondition their visits on this circumstance.The model shows how to strike a balance between high average and timely influence.
The less correlated the state election outcomes, the more time should be spent in 50-50states like Texas. The reason is that without national swings, the state outcomes are notcorrelated, and a state being a swing state on election day carries no information aboutthe outcomes in the other states. In my estimates maximum attention should typicallybe given to states in the middle, 55-45 in this example. In September of 2000, Gore wasahead by 1.3 percentage points. The maximum Qsµ/es was obtained for states where theexpected outcome was a Democratic vote share of 50.8 percent, as illustrated in Figure
15
5.3.12
The variance inQsµ/es depends on the uncertainty about the election outcome. Betterstate-level forecasts lead to a more unequal allocation of campaign resources as the vari-ance of the normal-shaped distribution of Figure 5.3 decreases. States with forecasted voteshares close to the center of that distribution would gain while states far from the centerwould lose. Better national-level forecasts has a similar effect.14 Note that Wyoming andtwo other states to the left of the center are noticeably above the normal-shaped curve.The reason is that I could not find state-level opinion-poll data for these states, and theforecasts for these states are more uncertain. These states actually lie on a normal-shapedcurve with a higher variance than that drawn in Figure 5.3. These observations illustrateone effect of improved forecasting on the allocation of resources.In Figure 5.3, note also that around its peak, the normal-shaped curve is far from the
simulated probabilities of being a decisive swing state per electoral vote. States to theright of µ∗s, like Michigan and Pennsylvania, generally lie above the curve, while states onthe left, like Ohio, generally lie below. The difference between the simulated values andQsµ/es arises because the candidates also have incentives to influence the variance of theelectoral vote distribution, even if this means decreasing the expected number of electoralvotes, see Qsσ in equation (2.10).To get the intuition of why such behavior is rational, consider the following example
from the world of ice-hockey. One team is trailing by one goal and there is only oneminute left of the game. To increase the probability of scoring an equalizer, the trailingteam pulls out the goalie and puts in an extra offensive player. Most frequently, the resultis that the leading team scores. But the trailing team does not care about this, since theyare losing the game anyway. They only care about increasing the probability that theyscore an equalizing goal, which is higher with an extra offensive player. Therefore, it is
12These points are evident from the analytical form of the mean of equation (5.1), derived in Appendix8.2. The mean equals
µ∗st = −σ2
σ2 + (σE/a)2eηt, (5.1)
whereσ2E = σ2E
¡dD = dR, ηt = eηt¢ ,
at =Xs
esgs (−µst − eηt) .13The mean always lies between a pro-Republican state bias of µst = 0, which corresponds to a 50%forecasted Democratic vote share, and µst = −eηt, which approximately corresponds to the forecastednational Democratic vote share. (If the Democrats are ahead by 60-40 nationally, then a pro-Republicanswing eηt, corresponding to about 10%, is needed to draw the election. Therefore µst = −eηt correspondsto 10% pro-democrat bias in a state, that is, a vote share of 60-40.) The smaller the variance of thenational popularity-swings, σ2, the closer is the mean to 50-50. In the extreme case where this varianceequals zero, then µ∗s = 0. In the extreme case that σ approaches infinity, µ
∗s approaches −eη.
14The variance of the normal-form distribution,
eσ2 = σ2s +1³
1σ2 +
1(σE/a)
2
´ ,depends on the variance in the state, and national, level popularity shocks, see Appendix 8.2.
16
30 40 50 60 70-.0001
0
.0001
.0002
Michigan Pennsylvania
Ohio
Varia
nce
effe
ct, Q
sσ/v
s
Forecasted democratic vote shares
Figure 5.4: Incentive to influence variance
better to increase the variance in goals, even though this decreases net expected goals. Incontrast, if they were allowed, the leading team would like to pull out an offensive playerand put in an extra goalie.Similarly, a presidential candidates who is behind should try to increase variance in
the election outcome. He can do that by spending more time in large states where he isbehind (putting in an extra offensive player), and less time in states where he is ahead(pulling the goalie). A candidate who is ahead should instead try to decrease variancein electoral votes, thus securing his lead, by spending more time in large states where heis ahead (putting in an extra goalie), and less time in states where he is behind (pullingout an offensive player). Both candidates thus spend more time in large states where theexpected winner is leading.To formally see why a trailing candidate increases the variance by spending more time
in states with many electoral votes where he is behind, consider equation (2.5) showingthe variance, conditional on a national shock. The variance in the number of electoralvotes from a state is proportional to these votes squared. Therefore, the effect on thetotal variance, per electoral vote, is larger in large states. Further, the variance in a stateoutcome is higher the closer the expected result is to a tie. By visiting a state where theleading candidate is ahead, the trailing candidate moves the expected result closer to atie, and increases the variance in election outcome. Similarly, decreasing the number ofvisits to a state where the lagging candidate is leading increases the varianceFigure 5.4 illustrates this effect in the year 2000 election. It plots the values of the
analytical expression for Qsσ/es. The lagging candidate (Bush) should put in extra offen-sive visits in states like Michigan and Pennsylvania, at the cost of weakening the defenseof states like Ohio. The leading candidate (Gore) should increase his defense of states likeMichigan and Pennsylvania, at the cost of offensive visits to Ohio. This resounds with theresult by Snyder (1989) that parties will spend more in safe districts of the advantagedparty than in safe districts of disadvantaged party.Another way to use the model is to calculate the best-response to the other candidate’s
actual strategy, even though it differs from the equilibrium. According to the model,
17
either candidate could have increased their probability of winning by about 2 percentcompared to their actual strategies.15 The most important feature of the best responsesis that Bush should spend more than the equilibrium time in California, which the Gorecampaign left unguarded with very few visits. The expected democratic vote-share inCalifornia was 56 percent. Fewer visits by Gore moves the expected democratic vote-share closer to 50.8 percent, evaluated at Bush’s equilibrium strategy. This increases theprobability of California being a decisive swing state, which increases Bush’s incentivesto visit California. In fact, Bush spent more than the equilibrium amount of time inCalifornia. This, on the other hand, moves the expected democratic vote-share closerto 50.8 percent, evaluated at Gore’s equilibrium strategy, increasing Gore’s incentives tovisit California. However, Gore visited California less than the equilibrium number oftimes. Roughly speaking, more visits is a best response to more offensive visits (Bush inCalifornia) or fewer defensive visits (Gore in California) by the opponent. Fewer visits isa best response to fewer offensive visits or more defensive visits by the opponent.
6. Electoral reform
This section will explore the effects of a hotly debated institutional reform, namely, thechange to a direct vote for president. According to federal historians, over 700 proposalshave been introduced in Congress in the last 200 years to reform or eliminate the system.Indeed, there have been more proposals for constitutional amendments to alter or abol-ish the Electoral College than on any other subject. The debate intensified as the 2000presidential election awarded George W. Bush the White House by a razor-thin victory,despite his losing the popular vote by a 337,000-vote margin. In consequence, a Wash-ington Post/ABC News poll performed shortly after the election suggested that about 6in 10 Americans would prefer to abandon the Electoral College and switch to a directpopular vote.The effects of reform on campaigning and economic policy have also been debated.
Small states have voiced fear that, without the Electoral College, candidates might changetheir campaign patterns and shun them altogether. Others have argued that the ElectoralCollege system creates a bias favoring small states. Finally, Lizzeri and Persico (2001),and Persson and Tabellini (1999), have argued that under the Electoral College, thedistribution of targeted programs will be more concentrated, while political rents, andpublic goods provision will be lower than under Direct Vote. To discuss these issues,the model will first be modified to discuss economic policy formation under the ElectoralCollege, and then further modified to discuss policy formation under Direct Vote.
Economic policy Section 4 shows that politicians understand the incentives createdby the Electoral College system. This section explores the consequences if policy is alsoinfluenced by election concerns. To analyze economic policy, one could just re-interpretthe model of Section 4 as describing the incentives to make policy promises to states
15This result depends on the functional form assumed for u (ds) . I can not use log form since this isnot defined for zero visits. Instead I use the exponential function u (ds) = 0.018 ∗ d0.34s , where the twoconstants were estimated in Strömberg (2002).
18
during the campaign. If a state is important for the election outcome, not only shouldthe candidates visit that state, they should also make favorable policy promises to thatstate. However, the route taken in this section is to analyze the incentives of incumbentpresidents to set policy for re-election concerns.Suppose each voter in state s receives utility from policy
us = υ (zs)− r
n+ θ
where zs is redistributive spending per capita, rnis political rents per capita, and θ is the
incumbent’s competence which is drawn from a known normal distribution with meanzero. The function υ is increasing and concave. Political rents describe a conflict ofinterest between the president and the voters. It could entail party financing, extrasalaries, low effort and waste or outright corruption. The voter observes his total utilityfrom policy, but not its separate components. So, the voter does not know whether utilityfrom a government program is high because per capita spending is high, because thepolitical rents are low, or competence is high. The incumbent president’s preferences aredescribed by P + (r) where P is the approximate probability of re-election, definedbelow, and is an increasing and concave function.The timing is the following. First the incumbent sets policy. Then his competence and
popularity shocks are realized. Next, the incumbent president runs against an opponentwith unknown competence. After the election, the winner selects a fixed policy and theutility of the voters is determined by competence, ideology, and popularity.In this setting, voter i in state s will re-elect the (without loss of generality) democratic
incumbent ifE [θ] ≥ Ri + ηs + η0.
The voters observe us and form expectations, E [θ] = us − (υ (z∗s)− r∗) , where z∗s and r∗
denote equilibrium policy. Therefore, the above equation becomes
∆us = υ (zs)− r − (υ (z∗s)− r∗) ≥ Ri + ηs + η.
where η = η0 − θ. The above equation has exactly the same structure as equation (2.1).The same distributional assumptions are now made regarding the exogenous parametersRi, ηs, and η, and the approximate probability of winning, P , is again defined by equations(2.4), (2.5), (2.6), and (2.7).The incumbent chooses economic policy to
maxP + (r)
subject to a fixed budget constraint Xs
nszs = I.
Proposition 2. The incumbent strategy (z∗, r∗) that maximizes P + (r) must satisfy,for all s and for some λ > 0,
Qs
nsυ0 (z∗s) = λ, (6.1)
19
0 (r∗) =1
n
Xs
Qs. (6.2)
where
Qs =∂P
∂∆us
is defined by equations (2.9), (2.4) and (2.5).
In equilibrium, the voters policy expectations are correct and ∆us = 0.Incumbent presidents will provide higher per capita redistributive spending to states
with high Qs. Political rents are lower the larger isP
sQs, which measures the fall inre-election probability due to a small decrease in the utility of all voters.The estimated Qs will in general be different from those relevant for campaigning
decisions. The reason is that decisions regarding economic policy must be taken earlierthan September the election year. The incumbent president must make his policy decisionsbefore the September opinion poll results are known. Therefore, Qs used for evaluatingthe effects on policy will be estimated without opinion poll data.16
Direct Vote Next, suppose the president is elected by a direct national vote. Thenumber of Democratic votes in state s is then equal to
vsFs(∆us − η − ηs).
The Democratic candidate wins the election if he receives more than half of the popularvotes: X
s
vsFs(∆us − η − ηs) ≥1
2
Xs
vs.
The number of votes won by candidate D is asymptotically normally distributed withmean and variance
µv =Xs
vsΦ
∆us − µs − ηqσ2s + σ2fs
, (6.3)
σ2v = σ2v (∆us, η) .
See Appendix 8.4 for the explicit expression for σ2v. The probability of a Democraticvictory is
PD = 1−Z
Φ
µ 12
Ps vs − µvσv
¶dη.
The incumbent allocates spending and sets political rents to maximize PD + (r). Thefollowing proposition characterizes the equilibrium allocation.
16This version of the model has been used in Strömberg (2002a), where it is found that federal civilianemployment is higher when Qs is higher, even using time and state fixed effects and controlling for otherdeterminants of employment.
20
20 30 40 50 60.06
.08
.1
.12
CT
ME
MA
NH
RI
VT
DE
NJ
NY
PA
ILIN
MIOH
WIIA
KS MN
MO
NE
ND
SD
VA
AL
AR
FL
GA
LA
MS
NC
SC
TXKY
MD
OK
TN
WV
AZCO
IDMTNVNM
UT
WY
CA
OR
WA
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCT
MEMEMEMEMEMEMEMEMEMEMEMEMEMEME
MAMAMAMAMAMAMAMAMAMAMAMAMAMAMA
NHNHNHNHNHNHNHNHNHNHNHNHNHNHNH
RIRIRIRIRIRIRIRIRIRIRIRIRIRIRI
VTVTVTVTVTVTVTVTVTVTVTVTVTVTVT
DEDEDEDEDEDEDEDEDEDEDEDEDEDEDE
NJNJNJNJNJNJNJNJNJNJNJNJNJNJNJ
NYNYNYNYNYNYNYNYNYNYNYNYNYNYNY
PAPAPAPAPAPAPAPAPAPAPAPAPAPAPA
ILILILILILILILILILILILILILILILINININININININININININININININ
MIMIMIMIMIMIMIMIMIMIMIMIMIMIMIOHOHOHOHOHOHOHOHOHOHOHOHOHOHOH
WIWIWIWIWIWIWIWIWIWIWIWIWIWIWIIAIAIAIAIAIAIAIAIAIAIAIAIAIAIA
KSKSKSKSKSKSKSKSKSKSKSKSKSKSKS MNMNMNMNMNMNMNMNMNMNMNMNMNMNMN
MOMOMOMOMOMOMOMOMOMOMOMOMOMOMO
NENENENENENENENENENENENENENENE
NDNDNDNDNDNDNDNDNDNDNDNDNDNDND
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
VAVAVAVAVAVAVAVAVAVAVAVAVAVAVA
ALALALALALALALALALALALALAL
ARARARARARARARARARARARARARARAR
FLFLFLFLFLFLFLFLFLFLFLFLFLFLFL
GAGAGAGAGAGAGAGAGAGAGAGAGAGAGA
LALALALALALALALALALALALALALALA
MSMSMSMSMSMSMSMSMSMSMSMSMSMS
NCNCNCNCNCNCNCNCNCNCNCNCNCNCNC
SCSCSCSCSCSCSCSCSCSCSCSCSCSCSC
TXTXTXTXTXTXTXTXTXTXTXTXTXTXTXKYKYKYKYKYKYKYKYKYKYKYKYKYKYKY
MDMDMDMDMDMDMDMDMDMDMDMDMDMDMD
OKOKOKOKOKOKOKOKOKOKOKOKOKOKOK
TNTNTNTNTNTNTNTNTNTNTNTNTNTNTN
WVWVWVWVWVWVWVWVWVWVWVWVWVWVWV
AZAZAZAZAZAZAZAZAZAZAZAZAZAZAZCOCOCOCOCOCOCOCOCOCOCOCOCOCOCO
IDIDIDIDIDIDIDIDIDIDIDIDIDIDIDMTMTMTMTMTMTMTMTMTMTMTMTMTMTMTNVNVNVNVNVNVNVNVNVNVNVNVNVNVNVNMNMNMNMNMNMNMNMNMNMNMNMNMNMNM
UTUTUTUTUTUTUTUTUTUTUTUTUTUTUT
WYWYWYWYWYWYWYWYWYWYWYWYWYWYWY
CACACACACACACACACACACACACACACA
OROROROROROROROROROROROROROROR
WAWAWAWAWAWAWAWAWAWAWAWAWAWAWA
Estim
ated
mar
gina
l vot
er d
ensi
ty
Share independents 1976-88 (Erikson, Wright and McIver)
Figure 6.1: Marginal voter density and share independents
Proposition 3. The incumbent strategy (z, r) that maximizes PD + (r) must satisfy,for all s and for some λ > 0,
QDVs
nsυ0 (zs) = λ, (6.4)
0 (r) =Xs
QDVs . (6.5)
The variableQDVs measures the likelihood of a draw in the national election, multiplied
by the expected marginal voter density, conditional on the national election being tied,multiplied by the number of voters in the state; see Appendix 8.4. Since the likelihoodof a draw in the national election is the same in all states, QDV
s varies across states onlybecause of differences in the share of marginal voters and the number of voters.The allocation under Direct Vote depends crucially on the share of marginal voters,
which in turn depends on the estimated variance in the preference distribution, σ2fs.Therefore, the restriction σfs = 1 is removed in the maximum likelihood estimation ofequation (3.2), as well as the assumption that σs is the same for all states. However, theempirical identification of σfs and σs is not trivial. If the election outcome in a certainstate varies a lot over time, is this because the state has many marginal voters or is itbecause the state has been hit by unusually large shocks shifting voter preferences? Themodel solves this problem by identifying σfs by the response in vote shares to changes thatare common to all states, and observable changes in economic growth at national and statelevel, incumbency variables, home state of the president and vice president. States wherethe vote share outcome covary strongly with economic growth, etc., are thus estimated tohave many marginal voters. Maine is estimated to have the largest share of marginal voterswhile California has the smallest. The estimated share of marginal voters is positivelycorrelated with the share of independent voters as measured by Erikson, Wright andMcIver (1993), as illustrated in Figure 6.1. The variance in the state popularity-shocks,σ2s, is on average larger in southern states and smaller states.
Spending We can now discuss the allocation of spending under the Electoral College(EC) and Direct Vote (DV ) systems. To structure the discussion, assume log utility so
21
that equilibrium per capita spending under DV equals
zDVs =
QDVs
ns1n
PQDV
s
I
n,
whereas the analogous expression for spending under EC is given by just substitutingQs for QDV
s . The effect of reform is shown in Figure 6.2. The series are 1948-2000averages, scaled so that 1 denotes equal per capita spending. States above 1 on the y-axisreceive higher than average per capita spending under under the Electoral College system,whereas states to the right of 1 on the x-axis have higher than average per capita spendingunder under the Direct Vote system. Thus, states below the dashed 45 degree line wouldgain from reform, those above would lose. Some states, like Maine and New Hampshire,are well off under both systems while Mississippi is disadvantaged under both. Otherstates, like Nevada are among the winners in the present system but among the losersunder Direct Vote. The opposite is true for Rhode Island and Kansas.The reasons why certain states would gain or lose can be separated into variation in
(i) electoral size per capita and (ii) influence relative to electoral size.
EC :Qs
ns=
esns|{z}(i)
Qs
es|{z}(ii)
,
DV :QDVs
ns=
vsns|{z}(i)
QDVs
vs| {z }(ii)
.
Figure 6.3 plots the 1948-2000 averages of, (i), electoral votes per capita million andvoter turnout to the left, and, (ii), the probability of being a pivotal swing state perelectoral vote and the share of marginal voters to the right. The latter have been scaledso that 1 denotes equal per capita influence relative to size. Nevada and Delaware wouldlose from reform primarily because of their heavy endowment of electoral votes relativeto popular votes, while Ohio would lose primarily because it is likely to be a pivotalswing state, but does not have many marginal voters. On the winning side, Rhode Islandand Massachusetts would gain because of their many marginal voters, while Kansas andNebraska would gain because of their low probabilities of being decisive swing states inthe present system.Small states have not, on average, been advantaged by the Electoral College system.
Although small states are over-represented in terms of electoral votes, they have moreoften had lop-sided elections and have larger state-level uncertainty, σ2s. On net, Qs/ns isnot correlated with size.17 It is also the case that QDV
s /ns is uncorrelated with state size.So small states, as a group, neither gain nor lose from electoral reform.Which political system creates a more unequal distribution of resources? The extreme
variation in the probability of being a pivotal swing state per electoral vote creates verystrong incentives for unequal distribution under the EC. The average probability that
17I thank Andrew Gelman for pointing this out to me.
22
0 .5 1 1.5 2 2.50
.5
1
1.5
2
2.5
CT
ME
MA
NH
RI
VT
DE
NJ
NY
PA
IL
IN
MI
OH
WI
IA
KS
MN
MO
NE
ND
SD
VAAL
ARFL
GA
LA
MS
NCSC
TX
KYMD
OKTN
WV
AZ
CO
ID
MT
NV
NM
UT
WY
CA
OR
WA
AKHI
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCT
MEMEMEMEMEMEMEMEMEMEMEMEMEMEME
MAMAMAMAMAMAMAMAMAMAMAMAMAMAMA
NHNHNHNHNHNHNHNHNHNHNHNHNHNHNH
RIRIRIRIRIRIRIRIRIRIRIRIRIRIRI
VTVTVTVTVTVTVTVTVTVTVTVTVTVTVT
DEDEDEDEDEDEDEDEDEDEDEDEDEDEDE
NJNJNJNJNJNJNJNJNJNJNJNJNJNJNJ
NYNYNYNYNYNYNYNYNYNYNYNYNYNYNY
PAPAPAPAPAPAPAPAPAPAPAPAPAPAPA
ILILILILILILILILILILILILILILIL
INININININININININININININININ
MIMIMIMIMIMIMIMIMIMIMIMIMIMIMI
OHOHOHOHOHOHOHOHOHOHOHOHOHOHOH
WIWIWIWIWIWIWIWIWIWIWIWIWIWIWI
IAIAIAIAIAIAIAIAIAIAIAIAIAIAIA
KSKSKSKSKSKSKSKSKSKSKSKSKSKSKS
MNMNMNMNMNMNMNMNMNMNMNMNMNMNMN
MOMOMOMOMOMOMOMOMOMOMOMOMOMOMO
NENENENENENENENENENENENENENENE
NDNDNDNDNDNDNDNDNDNDNDNDNDNDND
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
VAVAVAVAVAVAVAVAVAVAVAVAVAVAVAALALALALALALALALALALALALAL
ARARARARARARARARARARARARARARARFLFLFLFLFLFLFLFLFLFLFLFLFLFLFL
GAGAGAGAGAGAGAGAGAGAGAGAGAGAGA
LALALALALALALALALALALALALALALA
MSMSMSMSMSMSMSMSMSMSMSMSMSMS
NCNCNCNCNCNCNCNCNCNCNCNCNCNCNCSCSCSCSCSCSCSCSCSCSCSCSCSCSCSC
TXTXTXTXTXTXTXTXTXTXTXTXTXTXTX
KYKYKYKYKYKYKYKYKYKYKYKYKYKYKYMDMDMDMDMDMDMDMDMDMDMDMDMDMDMD
OKOKOKOKOKOKOKOKOKOKOKOKOKOKOKTNTNTNTNTNTNTNTNTNTNTNTNTNTNTN
WVWVWVWVWVWVWVWVWVWVWVWVWVWVWV
AZAZAZAZAZAZAZAZAZAZAZAZAZAZAZ
COCOCOCOCOCOCOCOCOCOCOCOCOCOCO
IDIDIDIDIDIDIDIDIDIDIDIDIDIDID
MTMTMTMTMTMTMTMTMTMTMTMTMTMTMT
NVNVNVNVNVNVNVNVNVNVNVNVNVNVNV
NMNMNMNMNMNMNMNMNMNMNMNMNMNMNM
UTUTUTUTUTUTUTUTUTUTUTUTUTUTUT
WYWYWYWYWYWYWYWYWYWYWYWYWYWYWY
CACACACACACACACACACACACACACACA
OROROROROROROROROROROROROROROR
WAWAWAWAWAWAWAWAWAWAWAWAWAWAWA
AKAKAKAKAKAKAKAKAKAKAKAKHIHIHIHIHIHIHIHIHIHIHIHI
Direct Vote
Elec
tora
l Col
lege
Figure 6.2: Redistributive spending per capita, relative to national average, under theElectoral College and Direct Vote
Voter turnout.3 .5 .7
2
4
6
8
UTUTUTUTUTUTUTUTUTUTUTUTUTUTUTUT
MNMNMNMNMNMNMNMNMNMNMNMNMNMNMNMN
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDNDNDNDNDNDNDNDNDNDNDNDNDNDNDNDND
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTIAIAIAIAIAIAIAIAIAIAIAIAIAIAIAIA
NHNHNHNHNHNHNHNHNHNHNHNHNHNHNHNHIDIDIDIDIDIDIDIDIDIDIDIDIDIDIDID
WIWIWIWIWIWIWIWIWIWIWIWIWIWIWIWIILILILILILILILILILILILILILILILIL
MTMTMTMTMTMTMTMTMTMTMTMTMTMTMTMT
MAMAMAMAMAMAMAMAMAMAMAMAMAMAMAMAININININININININININININININININ
DEDEDEDEDEDEDEDEDEDEDEDEDEDEDEDE
RIRIRIRIRIRIRIRIRIRIRIRIRIRIRIRI
WVWVWVWVWVWVWVWVWVWVWVWVWVWVWVWV
WYWYWYWYWYWYWYWYWYWYWYWYWYWYWYWY
COCOCOCOCOCOCOCOCOCOCOCOCOCOCOCOMOMOMOMOMOMOMOMOMOMOMOMOMOMOMOMO
MEMEMEMEMEMEMEMEMEMEMEMEMEMEMEME
ORORORORORORORORORORORORORORORORKSKSKSKSKSKSKSKSKSKSKSKSKSKSKSKSWAWAWAWAWAWAWAWAWAWAWAWAWAWAWAWAMIMIMIMIMIMIMIMIMIMIMIMIMIMIMIMIOHOHOHOHOHOHOHOHOHOHOHOHOHOHOHOH
NENENENENENENENENENENENENENENENE
NJNJNJNJNJNJNJNJNJNJNJNJNJNJNJNJ
VTVTVTVTVTVTVTVTVTVTVTVTVTVTVTVT
PAPAPAPAPAPAPAPAPAPAPAPAPAPAPAPANYNYNYNYNYNYNYNYNYNYNYNYNYNYNYNYCACACACACACACACACACACACACACACACA
OKOKOKOKOKOKOKOKOKOKOKOKOKOKOKOK
NMNMNMNMNMNMNMNMNMNMNMNMNMNMNMNM
NVNVNVNVNVNVNVNVNVNVNVNVNVNVNVNV
KYKYKYKYKYKYKYKYKYKYKYKYKYKYKYKY
AKAKAKAKAKAKAKAKAKAKAKAKAK
MDMDMDMDMDMDMDMDMDMDMDMDMDMDMDMDAZAZAZAZAZAZAZAZAZAZAZAZAZAZAZAZ
HIHIHIHIHIHIHIHIHIHIHIHIHI
NCNCNCNCNCNCNCNCNCNCNCNCNCNCNCNC
DCDCDCDCDCDCDCDCDCDCDCDC
FLFLFLFLFLFLFLFLFLFLFLFLFLFLFLFLTNTNTNTNTNTNTNTNTNTNTNTNTNTNTNTN
LALALALALALALALALALALALALALALALA
ARARARARARARARARARARARARARARARAR
TXTXTXTXTXTXTXTXTXTXTXTXTXTXTXTXALALALALALALALALALALALALALALVAVAVAVAVAVAVAVAVAVAVAVAVAVAVAVA
GAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAMSMSMSMSMSMSMSMSMSMSMSMSMSMSMSSCSCSCSCSCSCSCSCSCSCSCSCSCSCSCSC
QsDV/vs : Marginal voter density
Qs/e
s : P
r(dec
isiv
e sw
ing
stat
e)/e
s
Elec
tora
l vot
es p
er c
apita
milli
on
(i) (ii)
.5 1 1.50
.5
1
1.5
2
2.5
UTUTUTUTUTUTUTUTUTUTUTUTUTUTUTUT
MNMNMNMNMNMNMNMNMNMNMNMNMNMNMNMNSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
NDNDNDNDNDNDNDNDNDNDNDNDNDNDNDND
CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCT
IAIAIAIAIAIAIAIAIAIAIAIAIAIAIAIA NHNHNHNHNHNHNHNHNHNHNHNHNHNHNHNH
IDIDIDIDIDIDIDIDIDIDIDIDIDIDIDID
WIWIWIWIWIWIWIWIWIWIWIWIWIWIWIWI
ILILILILILILILILILILILILILILILIL
MTMTMTMTMTMTMTMTMTMTMTMTMTMTMTMT
MAMAMAMAMAMAMAMAMAMAMAMAMAMAMAMAININININININININININININININININ
DEDEDEDEDEDEDEDEDEDEDEDEDEDEDEDE
RIRIRIRIRIRIRIRIRIRIRIRIRIRIRIRI
WVWVWVWVWVWVWVWVWVWVWVWVWVWVWVWV
WYWYWYWYWYWYWYWYWYWYWYWYWYWYWYWY
COCOCOCOCOCOCOCOCOCOCOCOCOCOCOCO
MOMOMOMOMOMOMOMOMOMOMOMOMOMOMOMO
MEMEMEMEMEMEMEMEMEMEMEMEMEMEMEME
OROROROROROROROROROROROROROROROR
KSKSKSKSKSKSKSKSKSKSKSKSKSKSKSKS
WAWAWAWAWAWAWAWAWAWAWAWAWAWAWAWAMIMIMIMIMIMIMIMIMIMIMIMIMIMIMIMI
OHOHOHOHOHOHOHOHOHOHOHOHOHOHOHOH
NENENENENENENENENENENENENENENENE
NJNJNJNJNJNJNJNJNJNJNJNJNJNJNJNJ
VTVTVTVTVTVTVTVTVTVTVTVTVTVTVTVT
PAPAPAPAPAPAPAPAPAPAPAPAPAPAPAPA
NYNYNYNYNYNYNYNYNYNYNYNYNYNYNYNY
CACACACACACACACACACACACACACACACA
OKOKOKOKOKOKOKOKOKOKOKOKOKOKOKOK
NMNMNMNMNMNMNMNMNMNMNMNMNMNMNMNM
NVNVNVNVNVNVNVNVNVNVNVNVNVNVNVNV
KYKYKYKYKYKYKYKYKYKYKYKYKYKYKYKY
AKAKAKAKAKAKAKAKAKAKAKAKAK
MDMDMDMDMDMDMDMDMDMDMDMDMDMDMDMD
AZAZAZAZAZAZAZAZAZAZAZAZAZAZAZAZ
HIHIHIHIHIHIHIHIHIHIHIHIHINCNCNCNCNCNCNCNCNCNCNCNCNCNCNCNCFLFLFLFLFLFLFLFLFLFLFLFLFLFLFLFL
TNTNTNTNTNTNTNTNTNTNTNTNTNTNTNTN LALALALALALALALALALALALALALALALAARARARARARARARARARARARARARARARAR
TXTXTXTXTXTXTXTXTXTXTXTXTXTXTXTXALALALALALALALALALALALALALALVAVAVAVAVAVAVAVAVAVAVAVAVAVAVAVA
GAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAMSMSMSMSMSMSMSMSMSMSMSMSMSMSMS
SCSCSCSCSCSCSCSCSCSCSCSCSCSCSCSC
Figure 6.3: Variables affecting distribution under Electoral College and Direct Vote
23
Ohio is a pivotal swing state is more than twenty times that of Kansas. By comparison,Maine has only twice as high marginal voter density as California. To make things worse,the variation in electoral votes per capita is also higher than the variation in voter turnout.While Wyoming has four times as many electoral votes per capita as California, Minnesotahas less than twice times the voter turnout of Hawaii. Given this, it is not surprising thatthe equilibrium allocation of redistributive expenditures is much less equal under EC.The Lorenz-curve of spending under EC is strictly below that of spending under theDV . This finding is consistent with Persson, and Tabellini (2000) and Lizzeri and Persico(2001) who conclude that spending will be more narrowly targeted under majoritarianelections. Persson and Tabellini (2000) analyze electoral competition in an election withthree electoral districts. Their result is driven by the assumption that the district withthe highest marginal voter density is always decisive in the electoral college (majoritarianelection), while under proportional elections candidates internalize marginal voter densityacross all states, which is more equally distributed. Lizzeri and Persico (2001) use avery different framework, assuming no uncertainty about the election outcome, givencandidate strategies. In consequence, candidates use completely mixed strategies and allstates receive equal expected treatment.
Political rents First a theoretical point: there is no a priori reason to believe thatpolitical rents would depend on the size or number of states. Rents are decreasing inX
s
∂PD
∂∆us=Xs
Qs =Xs
vsesvs
Qs
es,
which measures how the probability of re-election falls when the utility of all voters isdecreased marginally. If Qs was more than proportionally increasing in the number ofelectoral votes, as suggested by Brams and Davis (1974), larger states would contributemore than proportionally to keeping rents low and the best electoral system would be tohave just one state of maximum size (Direct Vote). However, Qs is roughly proportional tothe number of electoral votes, see equation (2.10) and Figure (5.3), so all states contributeproportionally and there is no a priori reason to believe that size and number of statesmatter.Empirically, the ability to discipline politicians and keep political rents low is about
the same for both electoral systems. Figure 6.4 plotsP
sQs andP
sQDVs , multiplied by
an increase in political rents ∆r causing an average 5 percent fall in vote support.18 Thisfall is trivially higher in elections which are ex ante close. Therefore, equilibrium politicalrents are lower in 2000 than in 1984, under both electoral systems. Further, the averagefall is about the same for the two systems and politicians would be disciplined to an equalextent.This depends on two factors, how the probability of re-election depends on a uniform
fall in voter support across all states, and how the fall is likely to be distributed acrossstates. First, the fall in the probability of being elected from a uniform loss of 5 percent
18∆r = 0.05/average ϕ(·)σfs
. The figure describes the marginal incentives for political rents. As a measure
of the probability of losing the election, there is an approximation error since Qs and QDVs measure
marginal changes.
24
Elec
tora
l Col
lege
Direct Vote.1 .2 .3 .4 .5 .6
.1
.2
.3
.4
.5
.6
1992
2000
1968
1964
1984
1948
1940
19561992
1964
19762000
19721944
1948
1952
1976
1944
1948
1972
2000
1984
2000
1984
1964
1944
1960
1968
1988
1956
1976
194019401940
1964
19481948
19401944
1988
19401944
1988
1944
19681968
198819761988
1996
1940
19601976
1980
1956
1952
1988
1996
1944
1948
1984
1964
1972
1968
2000
1972
1960
1944
1948 1996
1984
19881976
1956
1952
1968
1964
1940
1992
1980
1976
1980
2000
1992
1988
19441972
1996
1984
19481956
1960
1940
1952
1964
1984
19601988
1948
2000
1940
1992
1944
1956
1968
1980
1952
1976
1996
1972
1960
19721944
1956
1976
1996
1968
1964
1948
1988
1940
1980
1992
20001952
1964
1976
1980
1968
2000
198419721944
19961956
19881960
1952
1940
1992
1984
1948
1952
1976
1972
1996
1960
1964
2000
1944
1956
1988
1980
1992
1968
1960
1944
1976
1968
2000
1972
1992
1964
1980
19401984
1948
1988
1996
1952200019881976
1948
1968
1972
1980
1964
1944
1952
1996
1984
1960
1940
1956
1960
19841972
1996
19522000
1956
1988
1944
1992
1980
1948
1940
1968
1976
1964
1988
1980
1960
1952
1944
1968
19921948
1984
1956
2000
1996
19401972
19761960
19401944
1948
1984
1956
1972
19641996
1952
1992
1980
1968
1988
19921996
1976
19441984
1964
1968
1940
20001952
1980
1948
19881960
19561992
1940
1988
1968
19602000
1984
1952
1972
19961956
1980
1976
1948
1964
1968
1996
197219841944
1988200019761960
1992
1940
1956
1980
1964
1952
1964
1988
1944
1996
19841972
1968
19601976
1940
19921948
2000
1980
1956
1968
1980
1992
1988
1972
1996
19841940
1960
1944
2000
1956
1952
1948
1964
1976
198419401972
2000
19641996
1968
1948
1980
19881960
19921956
19522000
1972
1964
19441984
1980
1956
1968
1960
19921996
1976
1940
1952
1988
1948
1952
1996
1976
1968
1944
1992
19882000
1964
1960
1956
1940
1980
1984
1948
1980
1964
1992
1940
1960
19721944
1968
1988
1984
1956
1976
1996
1952
1968
1988
1964
1944
20001952
1980
1956
19601976
19921996
19721940
1948
1940
1980
1984
1976
195619481992
1944
1968
1952
19881960
1964
1972
1996
1968
2000
1944
1952
19601976
1980
1956
1940
1988
1996
1972
19921948
1960
1996
1940
19561992
2000
1972
1980
19441984
1988
1968
1952
1976
1980
1968
19561992
1988
1996
1976
19401984
1952
1964
1948
20001960
197219441984
1968
1996
1952
1980
19401972
19762000
1988
19561992
1964
1948
1964
1952
1944
1992
1972
1976
1996
19841940
2000
19481956
19881960
1980
19441972
1992
1980
19481956
20001976
1964
1952
1984
1996
1968
19401944
1960
1980
1996
2000
1948
1988
1964
1972
1968
1952
1940
1992
1984
1976
1956
1980
20001952
1996
1968
19601988
1948
1964
1940
1992
194419841972
19561992
19721984
1968
1944
1948
1960
1996
195220001976
1964
1988
1980
1960
1952
1980
1956
1968
1992
198419721944
2000
1964
1976
19961948
19881976
1992
1960
1956
1988
1944
2000
1972
1968
1996
1980
1984
1952
1948
1964
2000
1984
1968
1988
1952
19761960
1944
19921948
1972
1956
1980
1940
1996
1960
1992
1972
1964
1940
1952
1984
2000
1968
1988
1996
1944
1956
1980
1976
1940
1964
1972
1992
1984
1968
1952
1976
1944
1980
1996
19602000
1956
1988
1944
1976
19961992
1980
20001988
19481956
1964
1960
1968
1972
1952
1984
1976
1992
1980
1972
19481956
2000
1964
1968
19881960
1984
1952
1996
1940
1996
1980
1952
1968
1992
1964
19602000
1956
19441984
1948
19401972
1976
1972
1996
1968
1948
1964
19921956
1988
1980
1960
1984
19762000
1944
1952
19602000
1972
1976
1948
1988
1992
1940
1964
1952
1968
1980
1996
1984
1956199219961964
1944
1968
1972
1952
1960
1980
1948
1984
1956
1940
20001976
1948
1984
1964
1980
196019882000
19961992
1976
1952
1972
1968
1956
1940
19961964
1988
19921948
1984
1976
1952
1960
194019721944
1956
2000
1980
1992
1984
1956
20001952
1948
19601988
1996
1944
1976
1980
1972
1964
1940
1996
1984
1952
1948
1940
20001960
19561992
19721944
1968
1980
1976
1964
19401944
1992
1980
1968
1984
1964
1972
1948
19522000
1996
1988
1956
1960
1984
2000
1980
1960
1968
1992
19721940
1976
19641996
19441944
1960
1968
1984
1964
20001988
1940
1976
1992
1972
1980
1968
19841944
2000
1964
1940
1976
1972
1996
1980
1988
1992
Figure 6.4: Probability that an incumbent would loose the election because of a corruptionscandal (5 percent fall in support) under EC and DV
support in all states is around 33 percent both under EC and DV .19 This similarityis far from obvious since the frequency of quite different events are measured: that theincumbent loses by less than 5 percent in any state that is decisive in the electoral college;and that the incumbent loses the national election by less than 5 percent. Appendix 8.4shows that this response is proportional to average "voting power", that is, the probabilitythat one randomly drawn voter decides the election. The result is therefore consistentwith the finding of Gelman and Katz (2001) that average voting power is about the sameunder EC and DV .However, the fall in support will not be uniform, it will be higher in states with many
marginal voters. If the fall is larger in states (like Florida, Michigan, and Pennsylvaniain 2000) who are likely to be decisive swing states, then the outcome under EC will bemore sensitive to the corruption scandal than under DV . This turns out not to be trueempirically, as these two variables are uncorrelated. A more direct measure of the shareof marginal voters, the share of independent voters in Erikson, Wright and McIver (1993),is also not related to the likelihood of being a decisive swing state.In contrast, Persson and Tabellini (1999) conclude that there will be more corruption
under Direct Vote than if the electorate is divided into three districts, a separate vote isheld in each district, and the candidate with the most districts wins the election. Theyreach their conclusion by assuming that there are more marginal voters in the one districtthat is always decisive. This assumption, translated to our framework, is not valid in thecase of the US as there are not more marginal voters in states more likely to be decisiveswing states.
Election statistics Finally, some argue that razor-thin victories, and presidents with-out a majority of the popular vote are unattractive features of an electoral system. Themodel can be used to estimate the probability of these events. Given that the elections1948-2000 are representative of future elections, this can be done by simulating elections
19This is estimated by simulations.
25
and recording event frequencies. The likelihood of a winning margin less than 1000 votes isabout 50 times higher under the Electoral College system (0.4 percent compared to 0.008percent under Direct Vote). The reason is that a margin of victory of, say, one percent,is equally likely in the two systems, but, since there are 50 states, one percent containson average 50 times more voters under Direct Vote. This is consistent with equal averagevoting power under both systems. Every time a single voter is pivotal under DV , then allvoters in the nation are pivotal. Every time a single voter is decisive under EC, then allvoters in that state are decisive. The latter event occurs 50 times more often but involves1/50 as many voters, so average voting power is the same. The probability of electing apresident without a majority of the popular vote is about 4 percent. This implies that weshould expect this outcome about once in every hundred years. Historically, this eventhas happened around three (perhaps four) times in the last 200 years: 1824 (perhaps),1876, 1888, and 2000. Arguably, however, the outcomes in 1824 and 1876 had to do withpeculiarities in the aggregation of votes.20
The above results support a reform of the Electoral College system in favor of DirectVote. Such a reform would decrease political incentives to allocate resources unequallyacross states for electoral reasons. It would also decrease the probability of razor thinwinning margins and, of course, the probability of electing presidents without supportof a majority of the popular votes. There would be no cost in terms of less disciplineon politicians. Also, small states need not fear reform. As a group, they are neitheradvantaged or disadvantaged by the present system, nor will they be under Direct Vote.
7. Conclusion and discussion
This paper develops a game-theoretic model of how presidential candidates should allocateresources across states in order to get a majority of the electoral votes. After applying itempirically to the US, a large set of questions have been answered.What would resource allocations look like if candidates try to maximize the probability
of winning a majority in the electoral college? Some theoretical results deserves mention-ing. First, more resources should be devoted to states who are likely to be decisive swingstates, that is, states who are decisive in the electoral college and, at the same time, havetied state elections. Second, the probability of being a decisive swing state equals the"voting power" in the state, multiplied by the marginal voter density conditional on thestate election being tied. Third, the probability of being a decisive swing state is roughlyproportional to the number of electoral votes. Fourth, this probability per electoral vote ishighest for states who have a forecasted state election outcome which lies between a drawand the forecasted national election outcome. Fifth, more precise state-election forecastsmake the optimal allocation of resources more concentrated. Sixth, the presidential can-didates who is lagging behind should try to increase variance in electoral votes. This isdone by spending more time in large states where this candidate is behind, and less timein large states where this candidate is ahead.
20In 1824, six of the twenty four States at the time still chose their Electors in the State legislature, sothe popular vote outcome is not known. In the chaotic election of 1876, each State delivered to Congresstwo slates of electors and a special commission accepted slates favoring of the popular vote loser. See "ABrief History of the Electoral College" in http://www.fec.gov/elections.html.
26
The second, fourth and fifth points are new, to the best of my knowledge. Thefirst point shows more precisely what people have conjectured or partly demonstratedearlier, see, for example, Snyder (1989). The third point has not previously been showntheoretically, although (because of the second point) it is closely related to the empiricalfinding of Gelman and Katz (2001) that "voting power" is proportional to size underreasonable assumptions. The sixth point makes more precise the statement of Snyder(1989) that more resources will be spent in safe districts of the advantaged party than inthe safe districts of the other party.More importantly, this paper quantifies these effects and shows that presidential candi-
dates’ actual strategies very closely resemble the optimal strategies. The model is appliedto presidential campaign visits across states during the 1988-2000 presidential elections,and to presidential campaign advertisements across media markets in the 2000 election.The actual allocation of these resources closely resembles the optimal allocation in themodel. In the 2000 election, the correlation between optimal and actual visits by stateis 0.91, and the correlation between optimal and actual advertisement expenditures byadvertising market is 0.88.Who would gain and who would lose if the president was instead elected through
a direct national vote? The gainers and losers from reform are identified as displayedin Figure 6.2. States like Kansas and Nebraska, would gain, while Nevada and Delawarewould lose. Small states are, as a group, not favored by the Electoral College system. Thedistribution of redistributive spending is much more unequal under the Electoral Collegethan under Direct Vote. This is basically because the probability of being a decisive swingstate is extremely unevenly distributed. The same conclusion was reached by Lizzeri andPersico (2001) and Persson, and Tabellini (2000), but for different reasons. Third, unlikePersson and Tabellini (1999), this paper finds that both systems are about equally fit todiscipline politicians from corruption or other activities which benefit them but hurt thevoters.The main difference between this paper and the earlier work is the complete integration
of theory and empirics, tying together theoretical insights with empirical results. Sincethe modelling framework is quite general, this integration can be carried over to the studyof other resource allocation problems in other electoral settings.
27
References
[1] Anderson, Gary M. and Robert D. Tollison, 1991, “Congressional Influence and Pat-terns of New Deal Spending, 1933-1939”, Journal of Law and Economics, 34, 161-175.
[2] Banzhaf, John R., 1968, ”One Man, 3.312 Votes: A Mathematical Analysis of theElectoral College”, Villanova Law Review 13, 304-332.
[3] Brams, Steven J. and Morton D. Davis, 1974, ”The 3/2 Rule in Presidential Cam-paigning.” American Political Science Review, 68(1), 113-34.
[4] Campbell, James E., 1992, Forecasting the Presidential Vote in the States. AmericanJournal of Political Science, 36, 386-407.
[5] Chamberlain, Gary and Michael Rothchild, 1981, ”A Note on the Probability ofCasting a Decisive Vote”, Journal of Economic Theory 25, 152-162.
[6] Cohen, Jeffrey E., 1998, ”State-Level Public Opinion Polls as Predictors of Presiden-tial Election Results”, American Politics Quarterly, Vol. 26 Issue 2, 139.
[7] Colantoni, Claude S., Terrence J. Levesque, and Peter C. Ordeshook, 1975, ”Cam-paign Resource Allocation under the Electoral College (with discussion)”, AmericanPolitical Science Review 69, 141-161.
[8] Erikson, Robert S., Gerald C. Wright and John P. McIver, 1993, "Statehouse Democ-racy", Cambridge University Press.
[9] Gelman, Andrew, and Jonathan N. Katz, 2001, ”How Much Does a Vote Count?Voting Power, Coalitions, and the Electoral College”, Social Science Working Paper1121, California Institute of Technology.
[10] Gelman, Andrew, and Gary King, 1993, ”Why are American Presidential ElectionCampaign Polls so Variable when Votes are so Predictable?”, British Journal ofPolitical Science, 23, 409-451.
[11] Gelman, Andrew, Gary King, and W. John Boscardin, 1998, ”Estimating the Prob-ability of Events that have Never Occurred: when is your vote decisive?”, Journal ofthe American Statistical Association, 93(441), 1-9.
[12] Hobrook, Thomas M. and Jay A. DeSart, 1999, ”Using State Polls to Forecast Pres-idential Election Outcomes in the American States”, International Journal of Fore-casting, 15, 137-142.
[13] Lindbeck, Assar and Jörgen W. Weibull, 1987, ”Balanced-Budget Redistribution asthe Outcome of Political Competition”, Public Choice, 52, 273-97.
[14] Lizzeri, Allesandro and Nicola Persico, 2001, "The Provision of Public Goods underAlternative Electoral Incentives", American Economic Review, 91(1), 225-239.
28
[15] Merrill, Samuel III, 1978, ”Citizen Voting Power Under the Electoral College: AStochastic Voting Model Based on State Voting Patterns”, SIAM Journal on AppliedMathematics, 34(2), 376-390.
[16] Nagler, Jonathan, and Jan Leighley 1992, ”Presidential Campaign Expenditures:Evidence on Allocations and Effects”, Public Choice 72„ 319-333.
[17] Persson, Torsten and Guido Tabellini, 1999, ”The Size and Scope of Government:Comparative Politics with Rational Politicians,” European Economic Review 43,699-735.
[18] Persson, Torsten, and Guido Tabellini, 2000, Political Economics: Explaining Eco-nomic Policy, MIT Press.
[19] Rosenstone, Steven J., 1983, ”Forecasting Presidential Elections”, New Haven: YaleUniversity Press.
[20] Shachar, Ron and Barry Nalebuff, 1999, ”Follow the Leader: Theory and Evidenceon Political Participation”, American Economic Review 89(3), 525-547.
[21] Shaw, Daron R., 1999, ”The Effect of TV Ads and Candidate Appearances onStatewide Presidential Votes, 1988-96, American Political Science Review 93(2), 345-361.
[22] Strömberg, David, 2002, ”Optimal Campaigning in Presidential Elections: The Prob-ability of Being Florida”, IIES working paper 706.
[23] Strömberg, David, 2002a, ”The Electoral College and the Distribution of FederalEmployment”, Stockholm university, in preparation.
[24] Strömberg, David, 2002b, ”Follow the Leader 2.0: Theory and Evidence on PoliticalParticipation in Presidential Elections”, Stockholm university, in preparation.
[25] Snyder, James M., 1989, ”Election Goals and the Allocation of Campaign Resources”,Econometrica 57(3), 637-660.
[26] Wallis, John J., 1996, “What Determines the Allocation of National GovernmentGrants to the States?”, NBER Historical Paper 90.
[27] Wright, Gavin, 1974, ”The Political Economy of New Deal Spending: An Economet-ric Analysis”, Review of Economics and Statistics 56, 30-38.
29
8. Appendix
8.1. Proof of Proposition 1
Since PD¡dD∗, dR
¢is continuous and differentiable, a necessary condition for an interior
NE is∂PD
¡dD∗, dR∗
¢∂dDs
= Qsu0 ¡dD∗s ¢ = λD, (8.1)
∂PD¡dD∗, dR∗
¢∂dRs
= Qsu0 ¡dR∗s ¢ = λR. (8.2)
Therefore,u0¡dD∗s¢
u0 (dR∗s )=
λD
λR, (8.3)
for all s. Suppose that dD∗ 6= dR∗. This means that dD∗s < dR∗s for some s, implying thatλD > λR by equation (8.3). Because of the budget constraint, it must be the case thatdD∗s0 > dR∗s0 for some s
0, which implies λD < λR, a contradiction. Therefore, λD = λR,which implies dD∗s = dR∗s for all s.Uniqueness: Suppose there are two equilibria with equilibrium strategies d and d0
corresponding to λ > λ0. The condition on the Lagrange multipliers implies ds > d0s for alls, which violates the budget constraint. Therefore, the only possibility is λ = λ0, whichimplies ds = d0s for all s.
8.2. Derivation and interpretation of Qs
To see, heuristically, why the analytical expression for Qs approximates the joint proba-bility of a state being decisive in the electoral college at the same time as having a tiedelection, note that state s can only be decisive if the margin of victory in the electoralcollege is less than its electoral votes, es. To a second order approximation, this happenswith probability
Φ
µ 12
Ps0 es0 + es − µ
σE
¶− Φ
µ 12
Ps0 es0 − es − µ
σE
¶≈ 2
µϕ (x (η))
esσE+
e2sσ2E
ϕ (x (η))x (η)
µGs (·)− 1
2
¶¶= 2pECD (η) ,
where the last equality defines pECD (η). However, s is only decisive when D wins na-tionally and in state s, or R wins nationally and in state s. This happens about half ofthe times the margin of victory is less than es, so the probability that a state is ElectoralCollege Decisive is approximately half the above expression. Second, given a nationalshock, η, the state outcome is a tie if Fs(−ηs − η) = 1
2which happens with likelihood
gs (−µs − η) . Conditional on the national shock, the joint likelihood of being decisiveand having a tied election is thus roughly pECD (η) gs (−µs − η), and the unconditionalprobability approximately Qs.21
21A stricter, and longer, version of this argument can be made. Please contact the author for details.
30
To compare Qs with simulated frequencies of being EC decisive and having a stateelection margin of less than two percent, Qs must be scaled. The probability of a winningmargin of less than x percent equals
1
2− x
200≤ Fs(−ηs − η) ≤ 1
2+
x
200.
The probability that the state-level shock falls in this region is approximately
gs (−µs − η)σfsϕ (0)
x
100.
The unconditional probability of being pivotal and having a state-election margin of xpercent is therefore approximately
Qsσfsϕ (0)
x
100.
The scaled values of Qs are very similar to the simulated frequencies of the corre-sponding events. In 3.4 percent of the 1 million simulated elections, Florida was decisivein the Electoral College and had a state margin of victory of less than 2 percent. Thescaled Qs for Florida was 3.5 percent. To get the probability that the state margin ofvictory is within, say 1000, votes, Qs should be scaled by 1000
vs
σfsϕ(0)
. Using this formula,the probability of a state being decisive in the Electoral College, and at the same timehaving an election result with a state margin of victory less than 1000 was 0.00015 inFlorida in the 2000 election. The probability that this would happen in any state was.0044. The probability of a victory margin of one vote in Florida is 0.15 per million, andthe probability of this happening in any state is 4.4 per million. The state where one voteis most likely to be decisive is Delaware, where it is decisive .4 times in a million elections.It is important to understand how the scaling of Qs relates to its interpretation.
Qs =1
0.02Qs
µσfsϕ (0)
0.02
¶| {z }
Pr(EC decisive andmargin of two percent)
ϕ (0)
σfs| {z } .marginal voter density,
conditional on tied state election
(8.4)
or, similarly,
Qs = vsQs
µσfsϕ (0)
1
vs
¶| {z }”votingpower"
ϕ (0)
σfs| {z } .marginal voter density,
conditional on tied state election
(8.5)
When all states are assumed to have the same variance of preferences σfs (until Section6), Qs is proportional to the probability of being EC decisive and having a state marginof victory of less than two percent.
31
nytt
tfi pfi
-.027613 .101768
.000531
.014273
Figure 8.1:
To simplify the analytical form of Qsµ, do a first order Taylor expansion of the meanof the expected number of electoral votes µ (η) around η = eη for which µ (eη) = 1
2
Ps es,
that is the value of the national shock which makes the expected outcome a draw. Withthis approximation
µ (η) =Xs
esGs (−µs − η) ≈ 12
Xs
es − a (η − eη) ,a =
Xs
esgs (−µs − eη) .Since the mean of the electoral votes, µ (η) , is much more sensitive to national shocksthan is the variance σE (η), the latter is assumed fixed
σ2E (η) = σ2E =Xs
e2sGs (−µs − eη) (1−Gs (−µs − eη)) .Then
1
σE (η)ϕ
µ 12
Ps es − µ
σE (η)
¶≈ 1
σEϕ
µη − eησE/a
¶.
This approximation is very good. Figure 8.1 shows the true, tfi, and the approximatedfunctions, pfi. The values are calculated for an interval of four standard deviations cen-tered around eη in the 2000 presidential election. We now have
Qsµ ≈ es1
σE
Zϕ
µη − eησE/a
¶gs (−µs − η)h (η) dη.
Integrating over η,
Qsµ ≈ esω
2πexp
Ã−12
σ2seη2 + (σE/a)2 µ2s + σ2 (eη + µs)2
σ2sσ2 + (σE/a)
2 σ2 + (σE/a)2 σ2s
!(8.6)
where
ω2 =
µ1
(σE/a)2 +
1
σ2s+1
σ2
¶−1.
32
Qsµ is larger when eη is close to zero. This effects all states in the same way, but variesacross elections. To clarify the differences between states with different µs, rewrite
Qsµ ≈ esω
2πexp
Ã−12
c+ (µs − µ∗s)2
eσ2!,
where
µ∗s = −σ2
σ2 + (σE/a)2eη,
and eσ2 = σ2s +1³
1σ2+ 1
(σE/a)2
´ .Qsσ is calculated using numerical integration.
8.3. Campaign advertisements
This Appendix analyses the decision of presidential candidates to allocate advertisementsacross media markets. Two presidential candidates: R and D, select the number of ads,am, in each media market m and the number of visits ds in each state s, subject to
MXm=1
pmaJm ≤ Ia,
SXs=1
dJs ≤ I,
J = R,D.22 Media market m contains a mass vms of voters in state s. Voters reactionsto campaign advertisements are captured by the increasing and concave function w(am).A voter i in media market m in state s will vote for D if
∆uwms = u¡dDs¢− u
¡dRs¢+ w
¡aDm¢− w
¡aRm¢ ≥ Ri + ηs + η.
In each media market m in state s, the individual specific preferences for candidates,Ri, are distributed with cumulative density function Fs. The state and national-levelpopularity-swings are drawn from the same distributions as before.The share D votes in media market m in state s equals
Fs(∆uwms − ηs − η).
D wins the state if Xm
vmsFs(∆uwms − ηs − η) ≥ vs2.
22For simplicity, it is assumed that both parties have the same resources for buying advertisements.The asymmetric equilibrium with unequal resources can be characterized, theoretically and empirically,given assumptions about w (a), the effectiveness of advertisements in changing votes.
33
Define the total (state and national) swing which causes a draw in the state:
ηs¡dD, aD, dR, aR
¢:Xm
vmsFs(∆uwms − ηs¡dD, aD, dR, aR
¢) =
vs2. (8.7)
D wins state s ifηs + η ≤ ηs (·) .
Conditional on the aggregate shock η, and the strategies, dD, aD, dR, aR, this happenswith probability
Gs
¡ηs¡dD, aD, dR, aR
¢− η¢.
The function ηs (·) now plays the same role as ∆us − µs in Section 2, see equation (2.2).The rest of the analysis is in that section, only exchanging ηs (·) for ∆us − µs. The bestreply functions of D is characterized by
Qs∂ηs (·)∂dDs
= λD, (8.8)
SXs=1
Qs∂ηs (·)∂aDm
= µDpm. (8.9)
Similarly, the best reply function of R is characterized by equations (8.8) and (8.9),replacing superscripts D by R. Because of the fixed budget and time constraints, theallocations must be symmetric so that λD = λR = λ, and µD = µR = µ. Differentiatingequation (8.7), and evaluating it in equilibrium yields
∂ηs (·)∂dDs
= u0¡dDs¢,
∂ηs (·)∂aDm
=vms
vsw0¡aDm¢.
Proposition 4. A pair of strategies for the parties¡dD, dR, aD, aR
¢that constitute a NE
in the game of maximizing the expected probability of winning the election must satisfydD = dR = d∗, and aD = aR = a∗, and for all s and for some λ, µ > 0
Qsu0 (d∗s) = λ, (8.10)
Qmw0 (a∗m) = µpm, (8.11)
where
Qm =SXs=1
Qsvms
vs.
Qm is the sum of the Qs of the states in the media market weighted by the share of thevoting population of state s that lives in media market m.Last, assuming that votes per capita
vms
nms= ts
is the same for all media markets within each state,
Qm =SXs=1
Qsnms
ns.
34
8.4. Direct presidential vote
First, the approximate probability of winning the election is derived under DV . Condi-tional on η, the expected vote share of D in state s is
µvs (∆us, η) =
ZFs(∆us − η − ηs)gs (ηs) dηs
= Φ
∆us − η − µsqσ2s + σ2fs
,
and the expected national vote of D is
µv (∆us, η) =Xs
vsµvs (∆us, η) .
Again conditional on η, the variance in D’s votes in state s is,
σ2vs = v2s
Z µΦ
µ∆us − η − ηs − µs
σfs
¶− µvs (∆us, η)
¶2gs (ηs) dηs,
and the variance in the national votes of D is
σ2v =Xs
σ2vs.
The approximate probability of D winning the election is
PD (∆us) = 1−Z
Φ
µ 12
Ps vs − µvσv
¶h (η) dη.
The equilibrium strategies of Proposition 3 depend crucially on
∂PD
∂∆us= QDV
s =∂PD
∂µv
∂µv∂∆us
+∂PD
∂σv
∂σv∂∆us
= QDVsµ +QDV
sσ ,
where
QDVsµ =
vs
σvqσ2s + σ2fs
Zϕ
µ 12
Ps vs − µvσv
¶ϕ
∆us − µs − ηqσ2s + σ2fs
h (η) dη, (8.12)
QDVsσ =
Z µ∂
∂σvΦ
µ 12
Ps vs − µvσv
¶¶∂σv∂∆us
h (η) dη.
Empirically, QDVsµ À QDV
sσ , the size of QDVsσ is negligible compared to QDV
sµ .The interpretation of QDV
sµ is
QDVsµ = vs (pdf of tied election) E [fs | election tied] . (8.13)
35
This follows since, conditional on η, the expected marginal voter density in state s isZ ∞
−∞fs(∆us − η − ηs)gs (ηs) dηs =
1qσ2s + σ2fs
ϕ
∆us − µs − ηqσ2s + σ2fs
.
The probability density function of a tied national election, conditional on η, is
1
σvϕ
µ 12
Ps vs − µvσv
¶,
and the unconditional
pdf of tied election =Z
1
σvϕ
µ 12
Ps vs − µvσv
¶h (η) dη. (8.14)
Therefore, the marginal voter density, conditional on a tied election, is
E [fs | election tied] (8.15)
=
Z ϕ³
12
Ps vs−µvσv
´Rϕ³
12
Ps vs−µvσv
´h (η) dη
1qσ2s + σ2fs
ϕ
∆us − µs − ηqσ2s + σ2fs
h (η) dη.
Inserting equations (8.14) and (8.15) into equation (8.13) yields equation (8.12).To see the relationship to "voting power" note that the probability of a national
election margin of x votes or less equals
pDVx =
ZΦ
µ 12
Ps vs +
x2− µv
σv
¶− Φ
µ 12
Ps vs − x
2− µv
σv
¶h (η) dη (8.16)
≈ x
Z1
σvϕ
µ 12
Ps vs − µvσv
¶h (η) dη.
Voting power is by definition pDV1 . By equations (8.14) and (8.16), pDV
1 ≈ pdf of tiedelection, therefore equation (8.13) may be written
QDVsµ ≈ vs ("voting power") E [fs | election tied] . (8.17)
This is the equivalent to equation (8.5) under the Electoral College.
Rents The response in re-election probability to a uniform fall in vote share supportacross states is proportional to average "voting power". To see this, first define ∆uFs asthe change in voter utility in state s necessary to induce a one percent fall in support:
∂F (∆us)
∂∆us∆uFs = 1,
or equivalently,
∆uFs =1
fs.
36
Next, define the response in the probability of winning representing an equal fall in voteshares as
∂PD
∂∆uFs=
∂PD
∂∆us∆uFs =
Qs
fs
Under EC, voting power equals
pEC1 ≈ Qs
vsfs (0),
see equation (8.5). Average voting power therefore equals the per capita response inre-election probability to a uniform one percent fall in vote share supportX
s
vsvpEC1 ≈ 1
v
Xs
∂P
∂∆uFs.
Similarly, under DV
1
v
Xs
∂P
∂∆uFs=1
v
Xs
QDVs
E [fs | national election tied] ≈ pDV1 ,
since QDVsµ À QDV
sσ , and using equation (8.17).It is reasonable that 1000 votes decide the election about 50 times more often under
the Electoral College. To see this, note that the probability of 1000 votes in any statedeciding the election under EC and DV is approximately
pEC1000 ≈Xs
Qs1000
vs/
µϕ (0)
σfs
¶,
andpDV1000 ≈
Xs
QDVs
1000
v/Xs
vsvE [fs | national election tied] ,
where the latter equation was derived by inserting pDV1000/1000 instead of (pdf tied election)
into equation (8.13), summing over s, and solving for pDV1000. Assume that the marginal
voter density is the same fs = 1 and that all states are of equal size, then vsv= 1
50and
pDV1000 =
1
50pEC1000.
Since Qs/vs is not strongly correlated with vs, the actual relation is not far from this.
8.5. Data definitions and sources
• Dmvote: state Democratic percentage of the two-party presidential vote. Source:1940-1944, ICPSR Study 0019; 1948-1988, Campbell; 1992, 1996, Statistical Ab-stract of the United States 2000; 2000, Federal Election Commission, 2000 OFFI-CIAL PRESIDENTIAL GENERAL ELECTION RESULTS.
• Electoral votes won (by state). Source: National Archives and Records Administra-tion.
37
• National trial-heat poll results. Source: 1948-1996, Campbell (2000); 2000, Gallup.• Second quarter national economic growth, multiplied by 1 if Democratic incumbentpresident and -1 if Republican incumbent president. Source: August or Septemberelection year issue of the Survey of Current Business, U.S. Department of Commerce,Bureau of Economic Analysis.
• Growth in personal state’s total personal income between the prior year’s fourthquarter and the first quarter of the election year, standardized across states ineach year, multiplied by 1 if Democratic incumbent president and -1 if Republicanincumbent president. Source: Survey of Current Business, U.S. Department ofCommerce, Bureau of Economic Analysis.
• Incumbent: 1 if incumbent president Democrat, -1 if incumbent president Republi-can.
• Presinc: 1 if incumbent Democratic president seeking re-election, -1 if incumbentRepublican president seeking re-election.
• President’s home state: 1 if Democratic president home state, -1 if Republican (0.5and -0.5 for large states (New York, Illinois, California). Source: Campbell 1948-1988; 1992-2000: Dave Leip’s Atlas of U.S. Presidential Elections.
• Vice president’s home state: 1 if Democratic president home state, -1 if Republican(0.5 and -0.5 for large states (New York, Illinois, California). Source: Campbell1948-1988; 1992-2000: Dave Leip’s Atlas of U.S. Presidential Elections.
• Average ADA-scores: Average ADA-scores of state’s members in Congress yearbefore election. Source: Tim Groseclose(http://faculty-gsb.stanford.edu/groseclose/homepage.htm).
• Legis: Partisan division of the lower chamber of the state legislature after the previ-ous midterm election. Index is Democratic share of state legislative seats above the50% mark. Two states, Nebraska and Minnesota, held nonpartisan state legislativeelections for all (Nebraska) or part (Minnesota of the period under study. In thecase of Nebraska, the state legislative division was estimated based on the rankingof states of Wright, Erikson, and McIver’s state partisan rankings based on publicopinion data. Using this index, Nebraska was assigned the mean partisan division ofthe state most similar to it on the public opinion index, the nearly equally Republi-can state of North Dakota. The partisan division of the Minnesota legislature in itsnonpartisan years (before 1972) is coded as the mean of its partisan division once itreformed to partisan elections (62% Democratic). Washington D.C. was as havingthe same partisan division as Maryland. Source: 1948-1988, Campbell; 1992-2000,Statistical Abstract of the United States.
• State-level opinion polls. Democratic share of two party vote. Source: Pre-electionissues of the Hotline (www.nationaljournal.com).
38
• Regional dummy variables, see Campbell.• Participation rate: 1948-1988, Shachar and Nalebuff (1999), voting age population1992-2000, and votes cast 2000, Federal Election Commission, votes cast 1992, 1996,Statistical Abstract of the United States.
39