miracles and stats

13
Miracles and Statistics: The Casual Assumption of Independence Author(s): William Kruskal Source: Journal of the American Statistical Association, Vol. 83, No. 404 (Dec., 1988), pp. 929- 940 Published by: American Statistical Association Stable URL: http://www.jstor.org/stable/2290117 . Accessed: 21/02/2011 08:55 Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at . http://www.jstor.org/action/showPublisher?publisherCode=astata. . Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. American Statistical Association is collaborating with JSTOR to digitize, preserve and extend access to Journal of the American Statistical Association. http://www.jstor.org

Transcript of miracles and stats

Page 1: miracles and stats

Miracles and Statistics: The Casual Assumption of IndependenceAuthor(s): William KruskalSource: Journal of the American Statistical Association, Vol. 83, No. 404 (Dec., 1988), pp. 929-940Published by: American Statistical AssociationStable URL: http://www.jstor.org/stable/2290117 .Accessed: 21/02/2011 08:55

Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at .http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unlessyou have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and youmay use content in the JSTOR archive only for your personal, non-commercial use.

Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at .http://www.jstor.org/action/showPublisher?publisherCode=astata. .

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printedpage of such transmission.

JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range ofcontent in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new formsof scholarship. For more information about JSTOR, please contact [email protected].

American Statistical Association is collaborating with JSTOR to digitize, preserve and extend access to Journalof the American Statistical Association.

http://www.jstor.org

Page 2: miracles and stats

Miracles and Statistics: The Casual Assumption of Independence

WILLIAM KRUSKAL*

The primary theme of this address is cautionary: Statistical independence is far too often assumed casually, without serious concern for how common is dependence and how difficult it can be to achieve independence (or related structures). After initial discussion of statistics and religion, the address turns to miracles, especially Hume's critique and Babbage's reply. Stress is given the often tacit or unexamined assumption of independence among witnesses of a putative miracle. Other contexts of multiple testimony are treated, and the address ends with contemporary casual assumptions of independence: nuclear reactor safety, repeated measurements, and so forth. Other topics include prayer, circularity of argument, and the tension between skepticism about testimony and the pragmatic need to accept most of it provisionally. KEY WORDS: Babbage; Hume; Religion; Safety; Testimony.

1. INTRODUCTION

Francis Henry Egerton, Earl of Bridgewater,' died in 1829; in his will he left 8,000 pounds for the writing and publication of works on "the Power, Wisdom, and Good- ness of God, as manifested in Creation." Under eminent auspices, eight clerical and scientific notables each wrote and published a treatise. Charles Babbage, progenitor of the modern computer, felt that the eight treatises did not carry their argument far enough, and that at least one of them incorrectly argued that the pursuits of science are unfavorable to religion. In addition, Babbage had views about miracles, views in part statistical. So in 1837 he published what he called The Ninth Bridgewater Treatise: A Fragment. (A second, revised edition appeared in 1838.)

I begin with Babbage's treatment of miracles in his own context, that of Hume's famous argument against testi- mony for miracles. Then I discuss miracles and testimony more generally, with special emphasis on the ubiquitous, critical, and often tacit assumption of independence among witnesses. I then discuss overcasual assumptions of inde- pendence in other contexts. If I have a moral, it is this: Do not multiply lightly.

Miracles are relevant to statistics in a number of ways. One analogy is that miracles are like apparent outliers that we study and worry over (e.g., see Anscombe 1978; Krus- kal 1960). One might even say that miracles are the ex- treme outliers of nonscientific life.2 It is widely argued of outliers that investigation of the mechanism for outlying may be far more important than the original study that led to the outlier; the discovery of penicillium is often

* William Kruskal is Professor, Department of Statistics, University of Chicago, Chicago, IL 60637. This is a revision of the author's 1982 presidential address to the American Statistical Association. This work was supported by National Science Foundation Grants SES 8303838, DMS 8601732, and DMS 8404941. The author thanks the following, who provided criticisms and suggestions: Keith Baker, George Barnard, Nor- man Bradburn, Stephen Brush, William A. Christian, Jr., Joel E. Cohen, 0. D. Duncan, William B. Fairley, R. W. Farebrother, Donald Fiske, Antony Flew, Joseph Gastwirth, Brian Gerrish, Langdon Gilkey, I. J. Good, R. M. Grant, Brown Grier, M. J. R. Healy, Paul W. Holland, J. Kadane, Benjamin King, Leslie Kish, Neal Koblitz, Samuel Kotz, Norma Kruskal, Martin E. Marty, Edward Offenbacher, David Owen, Stephen Portnoy, I. R. Savage, Eugene Seneta, Stephen M. Stigler, Judith M. Tanur, Ronald Thisted, B. M. Wright, S. L. Zabell, and Arnold Zellner.

given as an example. Just so, for those who take miracles as signs of revelation, the religious import of a miracle may be far more important than the daily life in which it occurred. In the same way, there are misleading apparent outliers and misleading false miracles, the results of poor, prejudiced, or careless observation. If identified as such, these outliers and miracles are worth little attention except for their connections with the psychological vagaries of human observation or with the effects on behavior of for- tuitous coincidences.

2. RELIGION AND STATISTICS

Religion and science generally are two basic realms of human effort that can hardly fail to interact and intersect. Gilkey (1982) expresses that strongly: "new religious forms appear and reappear out of and because of a scientific, technological culture in response, first, to the demand for a credible system of symbols giving structure, meaning, and direction to nature, history, society, and the self, and, second, to the particular sharp anxieties-and even ter- rors-of a technological age" (p. 71). A current example of intersection is the debate about teaching evolution, es- pecially in secondary schools, and Gilkey's article is about that debate. Again, the great physicist Frank (1956) wrote of how all scientific theories are related to "their fitness to support desirable attitudes in the domain of ethics, pol- itics, and religion" (p. 9). Some will be surprised at so normative a statement. I do not propose treating further the broad theme of religion and science, and I certainly will not exhibit aggressive incompetence by dipping deeply into theology.

Now narrow our focus to connections between religion and the mathematical sciences, a part of science in general. There are many such connections without explicit statis- tical aspects; I think of a range from astronomical and calendar calculations to recent attempts (Brams 1980) to- ward applying game theory to biblical stories.

One also finds fatuous or superficial uses of mathemat- ical terminology, for example, by Francis Hutcheson in 1724: "Benevolence is directly as the Moment of Good

? 1988 American Statistical Association Journal of the American Statistical Association

December 1988, Vol. 83, No. 404, Presidential Address

929

Page 3: miracles and stats

930 Journal of the Amercan Statistical Associafion, December 1988

and inversely as their abilities, i.e.-B = MIA" (Thomp- son 1931, pp. 14-15). Again, David Hartley writes in 1749 that W = F2IL, where F = fear and L = love of God, and W = love of the world (see Stephen 1962, Vol. 2, pp. 57-58). Such uses have been roundly criticized and also defended as useful metaphors. In the case of John Craig, whose 1699 calculation of the date of the Second Coming has been frequently derided, a recent brilliant interpre- tation by Stigler (1986) provides a sympathetic recasting. In any case, the argument over apparently superficial use of mathematical expressions continues in bitter argument over membership in the U.S. National Academy of Sci- ences (see Koblitz 1981; Marshall 1986).

If we specialize further, from the mathematical sciences generally to probability and statistics, a number of devel- opments come to mind. As Pearson (1924) said, De Moivre's work on the central limit theorem was theological: "he was determining the frequency of irregularities from the Original Design of the Deity. Without grasping this side of the matter, it is impossible to understand the history of statistics from De Moivre through Derham and Siissmilch to Quetelet, culminating in the modern principle of the stability of statistical ratios" (p. 404). In another place, Pearson (1978) carried the sequence of theologically mo- tivated statisticians through Florence Nightingale, and added that "with a modified meaning we might almost add Fran- cis Galton" (p. 286). Indeed, several chapters of the lec- tures (Pearson 1978), especially those dealing with Thomas Bayes and Richard Price, are suffused with theological motivations and connections.

A different theme is the ancient religious fear of being counted, for example in a census. This appears in several parts of the Old Testament; see Madansky (1986). For general treatments of religion and statistics, see Portnoy (1988) and Bartholomew (1984, 1988).

3. PRAYER

I digress to discuss statistically shaped studies of the efficacy of prayer. These studies are interesting per se, and they bring us near the topic of miracles. For some argue that prayers-at any rate answered prayers-are in a sense miracles, if often small miracles. Yet Tyndall (1871) wrote-and many others have made the point-that no prayer "could call one shower from heaven or deflect towards us a single beam of the sun" without a "distur- bance of natural law, quite as serious as the stoppage of an eclipse" (p. 36).

Perhaps the first recorded empirical comparative trial of prayer was the competition between Elijah and the many priests of Baal (1 Kings 18), although the objectivity of its reporting may be questioned by some. Recall that the priests of Baal built an altar, put on it a sacrificed bullock, and called on Baal to start the fire. Nothing hap- pened. Then Elijah had a bullock put on his altar, poured water over it to make the task dramatically harder, and called on his deity. The response was overwhelmingly fiery. At least one scholar (Dietl 1968) has built on this example and proposed randomized trials as an approach to showing

that miracles really happen-yet another relationship be- tween religion and statistics.

Among relatively recent empirical studies of prayer, the earliest I know was by the great Francis Galton. He rea- soned that the royal family and British clergymen probably generated more British prayers for good health than did others, so he compared longevity of members of the royal family and clergymen with the longevity of others. There are, of course, obvious problems of control and compar- ability, but Galton pushed ahead nonetheless, and came out with negative results that scandalized many contem- poraries, perhaps as much for the method as the conclu- sions. His first publication on the topic was in 1872, and he included the material in his 1883 book (see Galton 1872, 1883). The book also had a more general discussion of what Galton called theocratic intervention and the pos- sibility of ascertaining it. In the second edition, however, the theometric material was omitted, because-said Gal- ton-it offended the sensibilities of a number of readers. I believe that he never returned to the topic in print.34

The Galton (1872) article and other contemporary ex- changes were reprinted in The Prayer-Gauge Debate (Means 1876). [Galton's publication was slightly preceded by Tyn- dall (1871) and Henry Thompson, but they only proposed studies where Galton carried one out.] Pearson's (1924- 1930) massive biography of Galton describes the study (see Vol. 2, pp. 115-117). Haldane (1928, pp. 243-245) pro- vides a slightly distorted account (wrongly substituting children of clergymen for their fathers), and Brush (1974) gives a thorough discussion.5 A recent medical evaluation of prayer by Joyce and Weldon (1965) was double-blind and inconclusive. Another recent study (Cullipp 1969) gave results more encouraging to prayer, but its description leaves me with questions. Of course there is a long liter- ature on the curative effects of faith in various senses; for an overview and entry bibliography, see Frank (1978).

4. MIRACLES

We turn now to miracles themselves. There is a huge literature on miracles, and I have been able only to dip into it selectively with a primary interest in evidence for miracles, especially the validity of testimonial evidence. Begin with David Hume's 1748 definition (see Hume 1975) because Hume was one of the first6 to apply probabilistic ideas to miracles, and because Hume's famous argument against miracles is framed in terms of one probability out- weighing another. Hume defined a miracle as "a transgres- sion of a law of nature by a particular volition of the Deity, or by the interposition of some invisible agent" (Hume 1975, p. 115). See Pomeroy (1962, p. 6) for variant Hume characterizations of a miracle.

Aquinas had earlier said that a miracle is something done by "divine agency beyond the order commonly ob- served in nature." A more recent definition speaks of a "striking interposition of divine power by which the op- erations of the ordinary course of nature are overruled, suspended, or modified." See Flew (1967, p. 346) for a highly relevant treatment.

Page 4: miracles and stats

Kruskal: Miracles and Statistics 931

Every nuance of definitions like the above has been repeatedly examined and analyzed. For example, the idea that a miracle should be striking, remarkable, the Res- urrection in bright color with a full chorus singing the Messiah, has been both underscored and denied many times. Hume himself did not see the need for a striking appearance. He said: The raising of a house or ship into the air is a visible miracle. The raising of a feather, when the wind wants ever so little of a force requisite for that purpose, is as real a miracle, though not so sensible with regard to us. (1975, p. 115)

Other authors might exclude the raising of a feather by insisting that an event must have clear religious signifi- cance to be counted a miracle; for example, see Swinburne (1970, pp. 7-10). For a discussion in the Old Testament context, see Eichrodt (1967, p. 163). Yet it is hard to know what religious significance might mean. Eliade (1981) in- sists that miracles may appear in the most ordinary parts of everyday life. He says, for example, that "Just as I believed in the unrecognizability of miracle, so I also be- lieved in the necessity . . . of the camouflage of the 'ex- ceptional' in the banal, and of the trans-historic in his- torical events" (p. 224). I have difficulty with the unrecog- nizable miracle, for how can that be a sign or evidence or explanation? For a fuller statement of this point see Smart (1964, pp. 48-49).

There are also viewpoints that simply take life or science or man as miracles in themselves. "The greatest miracle is the existence of the laws of nature." Or, "All of life is a miracle. When you look at an apple seed, you are really looking at a miniature fruit tree. This is as big a miracle as the news of the Resurrection." These are some quo- tations from scientists in The God of Science, a book7 of interviews with scientists about religion (Trinklein 1971, pp. 80-81). Prior (1955) says that "Faith . .. is an inward miracle of God's mercy" (p. 8). To be more accurate, the statement is made by a Barthian Protestant in Prior's imag- inary dialogue. Eichrodt (1967) stresses that the Old Tes- tament takes miracles in the "widest possible terms. Even the course of Nature itself counts as a miracle" (pp. 162- 163).

An important problem with Hume's definition is that of explicating the idea of "law of nature." Another is to show that an alleged miracle is in fact willed by the deity, or by an appropriate "invisible agent"-presumably a benign one, say an angel, as opposed to a demon or devil (see Swinburne 1968, 1970, chap. 5).

Yet I avoid such angelic temptations and hasten on to the central question of testimony about the happening of a miracle. For testimony, in particular allegedly indepen- dent testimony from two or more witnesses, forms a cen- tral topic of this exposition.89

Hume's discussion of testimony is cynical, skeptical, and full of wit. He concludes that human testimony is so subject to bias, passion, self-interest, change over time, and so on, that "no testimony for any kind of miracle has ever amounted to a probability, much less to a proof; and that, even supposing it amounted to a proof, it would be op- posed by another proof" from the weakness of testimony

for remarkable events10 (1975, p. 127). Hume goes on to say that experience, and only experience, informs us of the validity of human testimony and informs us of the constancy of the laws of nature. When "these two kinds of experience are contrary, we have nothing to do but substract the one from the other . . . this substraction, with regard to all popular religions, amounts to an entire annihilation; and therefore . . . no human testimony can . . . prove a miracle, and make it a just foundation for any such system of religion" (1975, p. 127).

One can try to formalize this argument in several ways, comparing two explanatory hypotheses about a reported miracle, say the changing of water into wine or the levi- tation of a holy anchorite. Hypothesis H is that the miracle really happened, so the report is correct, putting aside questions of volition by a deity or other invisible agent. Hypothesis H is that the miracle did not happen and that the reporter (perhaps oneself) was mistaken, tricked, or even a liar. A fuller treatment might consider the possi- bility that the report was correct but misinterpreted. There are two sample points, R (the miracle is reported) and R (it is not), to permit an analysis.

A likelihood approach might be tried, but it seems not rich enough to include Hume's argument. Turn instead to a simple, familiar Bayesian structure; after all, miracles usually have a priori improbability. (But keep in mind those everyday miracles stressed by some.) Suppose then that 0 is the a priori probability for H, and 6 = 1 - 0 that for H. Let p be the probability of correct reporting: p = Pr(R I H) = Pr(R I H). Let p- = 1 - p. Then we have a standard 2 x 2 table:

R R

H pO pao 0 H TOa pO7 0

The familiar central calculation is to write the column-wise conditional probability as the unconditional one over the sum, Pr(H I R) = pOl(pO + jiG). Note that if p = p- = 2, then Pr(H I R) = 0 = Pr(H), a natural result. Even more naturally, if the witness is wholly trustworthy (p =

1), then Pr(H I R) = 1. This formulation may be criticized in many ways. For

example, the propensity to lie or be duped is presumably very different if a highly likely event happens than if it does not; a bridge hand of 12 hearts is, I think, much more likely to be reported as 13, than one of 13 reported as 12. Thus the value of p might well be taken as different in the two rows. More important, there is every reason to expect p to be a function of 0; the probability of reporting a bridge hand of 12 hearts as 13 is, I surmise, much greater than reporting a hand with 4 hearts as one of 5 (see Venn 1962, p. 421). These are psychological and historical questions that cannot be neglected. There seems to be remarkably little discussion of reasonable numerical values for 0 and p in various settings."l

At a more fundamental level, we generally do not set up such models until after a surprising event has been reported or at least discussed. Thus there is inherent bias

Page 5: miracles and stats

932 Journal of the American Statistical Association, December 1988

of selection in the very act of model formulation. Similarly, there is something arbitrary about setting up the two-point sample space. Coins do land on their edges sometimes, and more often witnesses have their own in-between sum- maries: neither water nor wine, uncertainties about how dead the revived man was, and so forth. In some cases, it makes little sense to look at the probability of a simple event rather than at a tail probability of decreasingly prob- able events under one hypothesis.

Hume argued, as I interpret him, that

Pr(incorrect observation I R) = Pr(H I R) > Pr(H I R)

= Pr(correct observation I R),

that is, that Pr(H I R) < . A little calculation shows that this is equivalent to 0 < -; that is, the unconditional prob- ability of the miracle is less than the probability of a lie (or mistake, or the like). He argues by anecdote, appeals to experience, and explanation of his cynical view of hu- man nature. [That skeptical"2 position was not new. Indeed it long predates the Christian era; see Grant (1952) for a detailed history of waves of skepticism and belief.] Hume might-but I think did not-have quoted the skeptical words of Shakespeare, who had Gloucester say "What means this noise?/Fellow, what miracle dost thou pro- claim?" (Henry the Sixth, Second Part, act 2, sc. 2). Still Hume put the skeptical case with special force, at a time of developing uncertainties about Christianity, and in terms-however early and rough-of probability.

5. BABBAGE'S REPLY

Hume's argument was met by a storm of protest, much too extensive to report here. [One account is by Brown (1984, pp. 89-91).] But I do want to describe Babbage's answer to Hume. Babbage said, in effect, "David Hume, you may be right for a single observer that 0 is less than p5, but suppose that there are several observers, and in- dependent ones, who agree in reporting the miracle, that is, in R. Isn't it obvious that, for any fixed 0 > 0 no matter how small, we can find an integer n such that agreement of n independent observers on R will produce a conditional probability for H that is arbitrarily close to 1?" Well of course that's right in an algebraic sense provided that p > p, that is, that a correct observation is more probable than an incorrect one.

Indeed, the conditional probability of H, given that all n independent observers agree in reporting that it hap- pened, is

pft 1

~P~0

where we assume 0 < 0 < 1. If p >75, then this probability as a function of n is monotone increasing to the limit unity. Note that this assumes that p is the same for all observers and not a function of n. It also assumes crucially that the observers behave independently. At the opposite extreme, if the observers behave in a wholly dependent way and

give the same answer, the conditional probability is the same as for a single observer.

A complication is that p might decrease as n increases, a phenomenon observable at magic shows. Under some circumstances, the larger the audience the greater the chance of misobservation.

Babbage went through calculations similar to those presented previously and concluded as follows:

... if independent witnesses can be found, who speak truth more frequently than falsehood, it is ALWAYS possible to assign a number of independent witnesses, the improbability of the falsehood of whose con- curring testimony shall be greater than that of the improbability of the miracle itself.

... the whole of this argument applies to independent witnesses. The possibility of the collusion, and the degree of credit to be assigned to witnesses under any given circumstances, depend on facts which have not yet been sufficiently collected to become the subject of mathematical inquiry. Some of those considerations which bear on this part of the subject, the reader will find treated of in the work of Dr. Conyers Mid- dleton, entitled "A Free Inquiry Into the Miraculous Powers Which Are Supposed to Have Subsisted in the Christian Church, From the Earliest Ages Through Several Successive Centuries." (Babbage 1838, pp. 202- 203) 13

Not surprisingly, others criticized"4 Hume along Bab- bage-like lines. Much earlier, Price (1777, p. 418) briefly made the point. Chalmers (1857, pp. 66-69, 130-135) gave a similar argument before Babbage. Chalmers ex- cused his "mathematical style of reasoning" because it was "the best fitted to neutralize the mischievous influence superadded to the skepticism of Hume by the great name of La Place" (p. 66). Chalmers also scolded Laplace on pages 114-116 and 142-143; I shall return to Laplace.

Price and Chalmers are careful to require independence, although I do not see proper discussions of how strong that requirement is. (Indeed, Chalmers sometimes forgets and multiplies too casually, e.g., p. 122.) In the same way the mathematician James R. Young carefully assumes in- dependence-no collusion, in his language-without ex- amining the strength of that assumption. Young's calcu- lations are much like Babbage's; a first claim by Young (1844, pp. 366-367; 1846, p. 80) is that Babbage (1838, pp. 198-199) made a small error in calculation that does affect his main conclusion. In fact, it seems to me that Young misread Babbage and that the two agree.

Young's second claimed advance is more interesting. He points out that both Hume's and Babbage's treatments refer to a specified miracle; in fact, initial reports of mir- acles refer to previously unspecified miracles. "Suppose, for instance," says Young (1844), that a person testifies to the fact that he saw a dead man raised to life; and that ten other persons with every disposition to deceive, but without collusion, testify to the same thing. Now, sup- posing that these ten persons were limited within the very narrow range of only ten fabrications suitable to their pur- poses of deception: the probability that they would all fix upon the particular miracle mentioned is 1/(10)1o'' (pp. 363-365). This development is an early discussion of the familiar statistical problem of selection. It is also an at- tempt to get around the Hume-Babbage requirement p > 2. But of course the primary weakness in Young's ar- gument is the difficulty in establishing the absence of col- lusion.

Page 6: miracles and stats

Kruskal: Miracles and Statistics 933

That some probability is very small says little or nothing by itself, or else every bridge hand would be astonishing. One natural statistical way of formalizing Young's ap- proach is to consider the alternative that, if one of the witnesses in fact saw a dead man raised to life, he would (no matter how much a liar) report it with probability qV > .1. That leads to a likelihood ratio of (V/.1)10 for the sample point in which all report the revival-assuming independence.

The literature on miracles includes much on qualitative consistency among witnesses, for example, among the New Testament Gospels. A widely read nineteenth-century ex- ample is The Life of Jesus by Strauss (1892, e.g., see pp. 742-743). Ernest DeWitt Burton, president of the Uni- versity of Chicago, devoted much of his scholarly energies to reconciling, and (as he put it) finding harmony among, the Gospels (e.g., see Stevens and Burton 1905). More recent related works are McArthur (1965), which discusses possible dependencies among the Gospels,15 and Perrin (1977).

6. OTHER FUNDAMENTAL ISSUES

In serious statistical thought about testimony, as about most other applications, fundamental problems may ques- tion the relevance of probability ideas at all, or point out ambiguities in framing a model. As Venn (1962) put it, "the individual presented himself, and the task was im- posed upon us of selecting the requisite groups or series to which to refer him. . . . we have to select the conditions of frequency out of a plurality of more or less suitable ones" (p. 395). In effect, Venn shows how p may well depend on the nature of the event in question, thus chal- lenging our ability to set up a usable probability space at all. The point had been raised earlier, for example by Chalmers (1857, pp. 140-141) as he chides Hume for shift- ing reference populations.

So the reference set may be questioned, and there are further complications. For example, in surveys with sev- eral or many interviewers it may be sensible to regard them as randomly chosen, thus introducing dependence among the respondents interviewed by the same enumer- ator. In other circumstances, especially with one or two interviewers only, we may decide to regard any interviewer effect as fixed so that there is no stochastic dependence. There is a similar ambiguity in some discussions of covari- ance analysis where nearby plots of ground in agricultural experimentation are taken to have correlated fertilities on one page and considered as bearing unknown fixed dif- ferential fertilities on the next. I think there are episte- mological issues16 here that remain to be sorted out.

7. FRANCE

Before coming to contemporary examples, I turn back to French thought. Starting with Condorcet and Laplace- especially in the latter's famous Philosophical Essay-cal- culations something like those sketched earlier for testi- mony to miracles, decisions about law-court cases, and deliberations of legislatures. Poisson was a major contrib-

utor. In most of this work there was little reference to empirical evidence and great reliance on a priori simpli- fying assumptions: independence, equal probabilities over people, and probabilities of error not depending on cir- cumstances. There was eventually a revulsion against what John Stuart Mill called the "opprobrium of mathematics" (Mill 1952, book 3, chap. 18, sec. 3, p. 353; 1973, p. 538), an expression translated into French by Bertrand (1888, p. 327) as "la scandale des Mathematiques." I hasten to add that there was also work of the era that tried to be more realistic and that dealt carefully with empirical evi- dence; in particular I cite Cournot (1838) and Bienayme (1838). For further discussion see Heyde and Seneta (1977, especially sec. 2.4, pp. 28-34).17

There was, in any case, a turning against direct prob- abilistic analyses of matters legal and legislative; only in recent years has such work-done far more carefully- had a rebirth. Yet even today egregious oversimplifica- tions are made in legal settings. A popular example is the notorious Collins case, in which multiplication of alleged probabilities of characteristics was casually done; see Fair- ley and Mosteller (1974) for citations and a tolerant ex- position. An extensive discussion in the legal literature, with many references, is McCormick (1972, pp. 491-499). See also the exposition by Eggleston (1978), in which the assumption of independence is carefully stated, but as a kind of incantation without due attention to its strength and the difficulty of confirming it.18

8. RECAPITULATION

I pause to recapitulate. Starting with questions about testimony for miracles, we have seen that one central issue is that of independence-or alleged independence-among witnesses. We have traced briefly the historical develop- ment of models for testimony generally, including testi- mony in judicial circumstances. Before turning to contem- porary examples, one may well ask why the assumption of independence is so widespread.

One answer is ignorance. Many otherwise estimable people will write that, for example, "The combination of these two probabilities-that the core will dry out under accident conditions and that safeguard systems will fail to reflood the core-is the product of the two individual probabilities, or 1 in 1 million" (Lapp 1974, p. 72).

Far more important than simple ignorance, in my opin- ion, is seductive simplicity: It is so easy to multiply mar- ginal probabilities, formulas simplify, and manipulation is relatively smooth, so the investigator neglects depen- dence, or hopes that it makes little difference. Sometimes that hope is realized, but more often dependence can make a tremendous difference.

There are intermediate positions in the constant tension between verisimilitude and simplicity.19 A grand theme in the development of stochastic models is the assumption of conditional independence, with the condition often turn- ing on unobserved, hypothetical variables. Two examples are factor analysis and latent structure theory.

In my view there is a central obligation to face squarely

Page 7: miracles and stats

934 Journal of the American Statistical Association, December 1988

what we do and do not know, and to study robustness of conclusion against mistakes in a priori assumptions of in- dependence, conditional or not.20

9. CONTEMPORARY EXAMPLES Over-easy acceptance of independence, and struggles

with the complexities of not assuming it, continue about us. I give a few examples beyond those already mentioned, starting with accidents.

Estimates of accident probabilities in nuclear reactors are important, and much in the news. The difficulties of making those estimates, as with other low-probability ac- cidents, turn on the relatively few serious accidents ob- served and on uncertainties in setting up physical models for events leading to accidents. The temptation to assume independence at various points must be strong; otherwise how can one get on with the job?

Most accident analysts understand, of course, that one cannot safely multiply marginal probabilities. For exam- ple, after explaining multiplication under independence, Bethe (1976) says that one cannot always do that: "There can be 'common mode' failures where one event triggers two or three failures of essential elements of the reactor; in that case the probability is the same as that of the triggering event, and one does not get any benefit from the multiplication of small probability numbers. The prob- ability of such common-mode failures is of course the most difficult to estimate" (p. 25). Exactly so. In fact many accident pathways are in between independence and com- plete dependence. One faces the difficult choice between simplifying assumptions-the simplest is independence- and inserting guesses about many complex conditional probabilities where direct empirical evidence is non- existent or tenuous. Or arbitrary averages of extreme pos- sibilities may be taken; for a critical analysis of that ap- proach, see Lewis (1980, p. 61).

It is not difficult to find a variety of statements at dif- ferent levels of care. Deffeyes and MacGregor (1980), for example, give the classically ingenuous form: "the prob- ability that an ore deposit will be formed . .. is determined by multiplying the probabilities that each essential ingre- dient was present" (p. 68). Swerdlow (1979, pp. 529-530) takes a contemporary historian of astronomy to task for naively multiplying probabilities.

Lorenz (1974) recognizes the need for independence, but passes lightly over judging it: "The improbability of coincidental similarity is proportional to the number of independent traits of similarity, and is, for n such char- acters, equal to 2n-1'' (p. 230). He refers to striking re- semblances in shape of, for example, a shark, a dolphin, and a torpedo. Other examples may readily be found in both ordinary science and exotic areas, for example, es- timating the probablity of extraterrestrial life or the prob- ability of a world war next year.21 Estimating probabilities like the latter two stretches to the limit-perhaps beyond it-the appropriateness of probabilistic ideas.

How important dependence can be is vividly described by Machol (1975) in connection with the sinking of the SS

Titanic. Before the Titanic hit an iceberg and sank in 1912, she was declared unsinkable because of her double-bottom hull and watertight compartments. Unfortunately the ice- berg cut a 300-foot gash on one side that flooded five adjacent compartments. In addition, the radio operator on a nearby ship was asleep, the Titanic was going too fast, and there were insufficient lifeboats.

It is ironic that Machol took the other side in a debate with the late J. Kiefer in an entirely different domain, the taxonomy and classification of mushrooms. Machol, to- gether with R. Singer, had strongly suggested assigning mushroom species to genera in accord with pseudolikeli- hood ratios formed by multiplying marginal probabilities of characteristics (like having amyloid spores or having an intermediate lamellae width). The procedure, which was called by Healy (1979, p. 169) in the medical diagnosis context "idiot's Bayes," is in fact the correct Bayes as- signment procedure if all indicators are assumed indepen- dent. Healy points out that the idiot Bayes procedure sometimes works surprisingly well, but Kiefer (1979) vig- orously argues that responsibility for showing that it works or not is the proposer's rather than the reader's. Kiefer's analytical and critical article appeared in the mycological literature; it contains a full bibliography. Machol replied in 1980.

In the earlier discussion of prayer,5 I mentioned natural continued interest by the Bishops of the Church of Eng- land. A recent episcopal statement about religious belief (House of Bishops 1986) deals in considerable part with miracles and testimony for them. It is explicit about de- pendence and independence, but its usage is, I think, somewhat vague. Thus, for example, it says that "if we bear in mind the dependence of Matthew and Luke on Mark, there are only two independent witnesses in the New Testament to the institution of the Eucharist, namely Mark and Paul" (p. 28). Why Mark and Paul are taken as independent is not made clear. The same vagueness may be found in some historical literature, for example, Garraghan (1946, pp. 294-295 and elsewhere) as cited somewhat misleadingly by Burns (1981, p. 296). Both of these books contain much material relevant to our dis- cussion.

We see then some of the traps in gratuitous or casual assumptions of independence. A grand irony is that in- dependence seems rare in nature, and when we really want it, we go to great pains to achieve it, for example in the production of pseudorandom numbers for selective service lotteries (see Fienberg 1971), and in randomization for allocation of treatments in experiments. (These two cases are not, strictly speaking, aimed at full independence be- cause of their without-replacement structure; the central point is that they are aimed at joint distributions of known form for the sample point. In fact those known forms are usually closely related to random sampling.)

In most real cases there is noticeable dependence be- tween phenomena. We are surprised, for example, when social psychologists working with sentiments of content- ment (see Bradburn 1969, pp. 57-61) find little or no association between putatively polar traits of positive hap-

Page 8: miracles and stats

Kruskal: Miracles and Statistics 935

piness and negative discontent. Similarly (Spence and Helmreich 1978) putatively polar traits of stereotypic mas- culinity (self-confidence and competitiveness) versus ste- reotypic femininity (gentleness and concern for others) turn out to be almost vanishingly correlated given sex. [Cautionary comments: (a) weak correlation is not the same as independence; (b) the averages differ in standard stereotyped ways.]

An almost universal assumption in statistical models for repeated measurements of real-world quantities is that those measurements are independent,22 yet we know that such independence is fragile. Berkson, Magath, and Hurn (1939), for example, showed how laboratory blood-cell counts present surprising correlations. Repeated mea- surements by the same observer are bound to be affected by memory of prior observed values. To avoid that prob- lem some measuring devices have arrangements for blind haphazard resetting of the zero point between successive observations.23 Mahalanobis (1947) suggested carrying this further by using different scales for separate observations, each with variant scale markings-perhaps even nonlin- early related! There is a related substantial literature on process control, in the sense of independent identically distributed observations. Much of that literature deals with shifts in expected value, but some discusses dependence. Page (1978) gives an excellent overview.

So we have come from miracles, through testimony, to inflated or insidious claims of independence and how they can be damaging. I began with Charles Babbage and I end by quoting William Schwenk Gilbert, another wise En- glishman. In Act 2 of The Yeoman of the Guard Fairfax asks, "And thou didst see all this?" Point answers,

Aye, with both eyes at once-this and that. The testimony of one eye is naught-he may lie. But when it is corroborated by the other, it is good evidence that none may gainsay. Here are both present in court, ready to swear to him.

NOTES

1. He was the eighth and last Earl, a renowned eccentric with family money, largely from the building and operation of English canals. Falk (1942) gives a colorful account. The Earl's testamentary intentions were carried out by the President of the Royal Society, assisted by the Arch- bishop of Canterbury and the Bishop of London. See Babbage (1838, p. xxi) and Barnes (1965, p. 857).

2. A philosophical discussion of the relationship between apparent miracles and "irregular or discrepant fact[s]" generally is given by McKinnon (1967, pp. 312-313).

3. It may be an oversimplification to regard answered prayers as small miracles. For example, Chalmers (1842) has an ingenious distinction to explain the apparent inconsistency between the efficacy of prayer (taken for granted) and the observed uniformity of nature, save for the few "well accredited miracles of the Jewish and Christian dispensations" (p. 237). Chalmers pointed out that we are unable to trace causative chains in nature back for more than a few steps, and that prayers are answered by unobservable divine intervention further back in the chain. Real mir- acles-those exceedingly rare events-arise from obvious, visible divine intervention. The same distinction had been made by Chalmers earlier (1857, pp. 149-150). He did not hesitate to repeat a favorite theme, and I note further that he wrote the first Bridgewater Treatise!

Romanes's (1874) book stresses the similarity between answered pray- ers and miracles (especially pp. x, 174-187), treats constructively the dilemma that a proper miracle or an answered prayer seems to tear a

hole in received natural law, and argues more or less quantitatively (pp. 265-266) against Galton's empirical analysis. Part of that analysis shows that clergy live about two years longer than lawyers and doctors; Ro- manes says that only a fraction of prayers by clergy are truly sincere, say half, that only a fraction of those are for clergy, say half again, and that only half of the remaining prayers are for length of life as opposed to other boons. Multiplication gives an eightfold dilution, and multiplying again two years by eight provides an effective advantage of 16 years! Romanes, to be sure, recognizes the gaps in this counterargument and says that he gives it to show "that the statistical method is not a trust- worthy instrument with which to gauge the physical efficacy of Prayer" (p. 266). Romanes's thinking about religion fluctuated, probably under the influence of Charles Darwin. An account of perhaps questionable accuracy is given by Romanes's widow (1908).

4. The omission, according to Pearson (1924-1930, Vol. 3B), was at "the urgent request of the publishers" (p. 448). In his preface to the second edition, Galton describes the omission ambiguously, but plainly says that he finds "nothing to recant" in the omitted chapters.

5. There was a use of the prayer-gauge idea by Stead (1891) in his article about a then-current scandal centering on Albert Edward, Prince of Wales, and alleged cheating at baccarat. Stead calculated that at least a thousand million prayers for the Prince of Wales had been made during his life, only to have him become the object of a virulent press attack over card cheating. Benson (1930, pp. 211-213) gives a lucid account of the incident, but he puts Stead with the attackers when in fact Stead is sympathetic toward the Prince-if not to the effectiveness of prayer. Stead explains the Prince's superficial life as the consequence of boredom and lack of serious responsibilities. Turner (1974, pp. 59-63) describes another controversy focused on the Prince of Wales and prayer-this one about serious illness and his recovery. Turner's article provides valu- able further discussion about the argument over efficacy of prayer.

In recent years there have again been British commentaries on possible relationships between prayer and healing, commentaries published under eminent and established auspices (see Archbishops' Commission 1958; British Medical Association 1956; Ramsey 1971). I have noted no ref- erence in these publications to Chalmers's explanation, Galton's analysis, or the prayer-gauge debate otherwise. There are criticisms-mildly stated yet perhaps deep-of the British Medical Association report by the Archbishops' Commission (especially in the latter's chapter 3).

Debate about miracles continues within the Church of England. In particular, I cite the Church's (1986) Report of the Church of England Bishops; it arose after York Minster was set afire by lightning soon after the consecration there of now-Bishop David Jenkins, who had been outspokenly skeptical about the fundamental Christian miracles. Two descriptive statements are by Thomas (1985) and Kramer (1985, pp. 80, 86-89). A strongly pro-miracle viewpoint is given by Berry (1986) to- gether with many further references. In the context of dependence among witnesses, I shall return later to a 1986 statement by the House of Bish- ops.

6. Not literally the first. Brown Grier (Northern Illinois University) has collected other examples, of which an unusually interesting one is the work of Hooper [1689 (especially chap. 4), 1699] on transmission of human testimony. The centrality and persistence of Hume's argument nonetheless makes it a good point of departure. As Stephen says "The fact that the argument is being answered to this day proves that its efficacy is not exhausted. Every new assault is a tacit admission that previous assaults have not demolished the hostile works" (Stephen 1962, vol. 1, chap. 6, sec. 1, p. 262).

Burns (1981) treats predecessors of Hume; see pages 10, 59-60, 75 (about Annet), 77 (Woolston), 89 (Wollaston), and 90-95. [Annet's treatment (1744, especially p. 64) is striking, although it lacks even the rough probability idea that Hume uses. Annet stresses the shrewd point that, if a miracle worker really had the power to change nature mar- velously, he might ipso facto, and much more simply, deceive the senses of witnesses.] Burns gives a systematic outline of Hume's several argu- ments about miracles; I discuss primarily the argument based on testi- mony and probability, called by Burns the a priori epistemological ar- gument.

7. Marty (1977) calls the Trinklein book "obscure and almost naive." Trinklein, says Marty, approached a "rather random sample of presti- gious . .. scientists . . .. The opinions he heard were as varied as those

Page 9: miracles and stats

936 Journal of the American Statistical Association, December 1988

. . . from 38 randomly selected patrons of any friendly neighborhood tavern" (p. 31).

8. Before going on, however, it is worth mentioning again the pos- sibility of extraordinary events that are not miracles; this may be called, following Hume, the Indian Prince question. A South Indian Prince, says Hume (1975), is told that water becomes hard and solid when the temperature goes down enough. The parochial Prince, continues Hume, ought to be skeptical and require strong testimony, but the alleged aqueous behavior is "not contrary to his experience . . . [but rather] not con- formable to it" (pp. 113-114). He simply had never observed water at low temperatures. The example goes back at least to John Locke in 1690 (see Locke 1959, chap. 9, sec. 5) in terms of the King of Siam. Something like that is the story of meteorites, whose existence was at first denied for years (see Westrum 1978). There is an apocryphal Hume-like an- ecdote about Thomas Jefferson: When informed that two Yale professors had reported a meteorite fall in 1807, Jefferson is said to have remarked that "It is easier to believe that two Yankee professors would lie than that stones would fall from heaven" (Mason 1967, p. 429). Whether ball lightning really exists has been questioned for years (Garfield 1977; Stew- art 1878, p. 403; Uman 1982, p. 535). The Stewart discussion deals explicitly with supposedly independent observers. Of course these ex- amples can be more than matched by many cases of false, tendentious, hallucinatory, and so forth observations of apparently extraordinary events. Most of these would not be called even potential miracles by most miracle enthusiasts, primarily because there is little religious rel- evance in ice, meteorites, ball lightning, and the like.

9. A basic divergence in the literature on miracles corresponds to the age-old debate about free will and divine intervention. To put it crudely: Did the creative divinity start the universe in a grand design and then simply sit back and watch it spin? Or does divinity tinker and interfere from time to time by suddenly introducing miracles?

Babbage (1838, pp. 32-49, 168-171) seems to take the first view; he regards miracles as preordained apparent shifts in natural law, but not real shifts-rather, parts of a more general law. His analog is to a com- puter program that gives the sequence of natural numbers for a thousand years and then suddenly shifts to another sequence in obedience to initial programming. Mill argues that the encompassing law-of-nature concept does not give a reasonable interpretation for miracles (Mill 1969, pp. 473-478). Mill says that a miracle "must be produced by a direct volition, without the use of means; or at least, of any means which if simply repeated would produce it" (pp. 224, 474).

Some thinkers seem to take both viewpoints at different places. An example is Maimonides; see Hartman (1976, pp. 157, 251) and Heller (1958). In Maimonides's (1963) Guide of the Perplexed the general law viewpoint is taken on pages 480-484, but the more traditional interven- tionist position appears on pages 178, 498-499, and 528-529. For theolo- gians who regard creation of the universe as timeless, or outside of time, the problem disappears, but fresh problems may then arise. For example, how can one reconcile a timeless universe and the specifically historical character of Christianity?

10. Hume used the concept of probability in a rather rough way. For discussions of the state of probability at or before Hume see Burns (1981), Van Leeuwen (1963), and Peirce (1958, pp. 305-306). See also Sobel (1987) and Owen (1987) for relevant material, especially on Bayesian interpretations of Hume's statements. Patey (1984) describes the devel- opment of probability concepts.

11. There is a sparse but extended literature presenting algebraic for- mulas for truth and testimony under various conditions. An excellent critical summary is by Zabell (in press).

12. One theme of this essay has been a skepticism about testimony; indeed, statisticians-along with lawyers, historians, and others-by in- clination and training listen with special caution to human testimony. Yet I must comment in passing that all of us take most testimony at face value most of the time. We could not otherwise carry on our lives.

Locke (1959, book 4, chap. 15, par. 6) pointed out in 1690 the dangers in accepting opinions of others: "since there is much more falsehood and error among men, than truth and knowledge . . . men have reason to be Heathens in Japan, Mahometans in Turkey, Papists in Spain, Prot- estants in England, and Lutherans in Sweden" (p. 638). Yet Locke's editor, Alexander Campbell Fraser, writes in a footnote that we "cannot

dispense with the authority of experts and, in many cases, dependence on men of higher intelligence and larger experience than our own is the most reasonable means we can use of the attainment of truth" (p. 638).

However sensitive we are to the vagaries of testimony, we simply must accept most of it, at least provisionally. We have neither time nor energy to check more than a little of what we have been told or read. That is certainly true of daily life. In scientific life, it is impossible for each of us to recapitulate more than a tiny fraction of other peoples' observations. Even in mathematics there is an analog: The most energetic and stringent mathematician cannot check and confirm more than a small fraction of statements and proofs in the mathematical literature. As Mill says, "Au- thority . . . is the evidence on which even the wisest receive all those truths of science, or facts in history or in life, of which they have not personally examined the proofs" (Mill 1969, p. 407). Thus profound, persistent skepticism would make life impossible. I find that paradoxical and puzzling.

13. Hume in fact refers to multiple witnesses at several places in his essay (e.g., Hume 1975, pp. 112, 116, 121) but the references are brief and in passing. Swinburne (1968, 1970) picks up the multiple-witness point and wonders whether Hume has overstringent standards of evi- dence. Middleton's book was widely influential; it is discussed, for ex- ample, by Stephen (1962, Vol. 1, especially pp. 222-230). I do not, however, find the question of independence addressed by Middleton.

14. There were also restatements of Hume's argument, notably by Mill (1885, pp. 217-241; 1969, pp. 470-489), who includes a bitter pas- sage (pp. 236-237; 479-480) on the weakness of testimony for the biblical miracles. Venn (1962, chap. 17) took a dim view of probabilistic calcu- lations about the veracity of joint testimony. A major reason for his skepticism was that "practically our main source of error and suspicion is the possible existence of some kind of collusion. Since we can seldom entirely get rid of the danger, and when it exists it can never be submitted to numerical calculation, it appears to me that combination of testimony ... is yet more unfitted for consideration in Probability than even that of single testimony" (p. 428).

Jean Jacques Rousseau earlier presented a skeptical approach to such calculations, presumably not in terms of explicit probability calculus. In "The Creed of a Savoyard Priest," he asked: "Who will venture to tell me how many eye-witnesses are required to make a miracle credible? What use are your miracles, performed in proof of your doctrine, if they themselves require so much proof?" (Rousseau 1911, p. 263).

A nineteenth-century eloquent supporter of the biblical miracles and their central importance for Christianity was Richard Trench. It is wryly ironic that his views about multiple witnesses should undercut the ar- guments put forward by other critics of Hume. Trench (1860) wrote that "however many they [the witnesses] may be, they are always but a few compared with the multitudes who attest a fact which excludes their fact, namely, the uninterrupted succession of a natural order in the world" (p. 61). Trench's discernment is in any case questionable. He asserts, for example, that people are lighter when awake than when asleep, as first observed by Pliny and verified by "every nurse that has carried a child" (p. 232). Another curious Trench position is suspicion about vari- ation among authoritative accounts of historical happenings, exactly the opposite view to that expressed by most defenders of the biblical miracles (that variation in accounts is natural, human, and increases credibility; p. 233).

15. There is a related literature about surprising degrees of agreement among the Fathers of the Church (see especially Daille 1843). Daille in effect discusses dependence among the views of the Fathers, and even draws analogies to contagion of illness. In the last chapter of his book, however, he seems to go the other way and argues that near unanimity of opinion strengthens the case (see pp. 144-145, 170, 268-270, 349).

16. Alleged circularity of argument is another basic question that arises in various related ways. Perhaps the most fundamental way is the vali- dation of faith via miracles described in a Holy Book; the Book in turn is validated by the faith. As Matthew Tindal, an early English deist, wrote in 1730, "It's an odd jumble to prove the truth of a book by the truth of the doctrines it contains, and at the same time to conclude those doctrines to be true because contained in that book" (Stephen 1962, p. 117). Yet, on the other side of the fence, one may find such boasts of circularity as the following: The New Testament "pretends to contain a

Page 10: miracles and stats

Kruskal: Miracles and Statistics 937

divine revelation; and, in support of this high pretension, it gives a narrative of the very facts which . . . constitute the evidence that this pretension is founded in truth." That statement is by John Cook, pro- fessor of divinity at St. Mary's College, St. Andrews (Cook 1821, pp. 351-352). Variants of Tindal's criticism come from the internal structure of the Bible. Frye (1982) asks, "How do we know that the Gospel story is true? Because it confirms the prophecies of the Old Testament. But how do we know that the Old Testament prophecies are true? Because they are confirmed by the Gospel story .... The two testaments form a double mirror, each reflecting the other" [from the review by Kermode (1982, p. 32)].

A third circular argument is criticized by Mill (1969). Mill says, as part of his recapitulation of Hume, that "the existence of God cannot possibly be proved by miracles, for unless a God is already recognized, the ap- parent miracle can always be accounted for on a more probable hy- pothesis. . .. Once admit a God, and the production by his direct volition of an effect . .. is no longer a purely arbitrary hypothesis . .. [and] must be reckoned with as a serious possibility" (p. 477). In vulgar brevity, God implies miracles implies God.

Mill points this out again when he writes, "All, therefore, which Hume has made out . .. is that . .. no evidence can prove a miracle to anyone who did not previously believe the existence of a being or beings with supernatural power .... If we do not already believe in supernatural agencies, no miracle can prove to us their existence" (Mill 1973, vol. 7, chap. 25, sec. 2, p. 625).

Stephen makes a similar point in discussing Hume and Paley: "When Paley calmly says, if we believe in God, there is no difficulty in believing [in] miracles, Hume's answer is plain. If God is the cause of order, belief in him does not facilitate belief in miracles" (Stephen 1962, vol. 1, chap. 6, sec. 34, p. 287; see also chap. 8, p. 352).

Lewis, in his well-known book on miracles, makes a virtue of the circle, without mention of Stephen or Mill, when he writes "Theology says in effect, 'Admit God and with him the risk of a few miracles, and I in return will ratify your faith in uniformity as regards the overwhelming majority of events" (Lewis 1960, p. 106; p. 109 in 1947 ed.). Lewis's argument was not entirely new. The famous mathematical physicist G. G. Stokes had given a variant: "Admit the Existence of a God, of a personal God, and the possibility of a miracle follows at once" (Stokes 1891, p. 24). For discussion see Mascall (1956, pp. 7-8, 180, 185).

The quotation may remind us not to sneer at what appear obvious circularities of argument. Perhaps we are all forced willy-nilly into cir- cularities that we dignify by calling them "coherences" or "interconsis- tencies." There is, however, this difference, that some of us are willing to examine our own circularities, and some simply deny them without study.

Other discussions of circularity may be found. I cite as examples Lewis (1849, pp. 82-93), Chalmers (1857, p. 385), and Walker (1975, p. 162). The relevance of this subtopic is that it presents a methodological tangle different from those of dependence of testimony and reference set, yet alike in the frequency with which they are ignored.

17. The opprobrium statement seems to have appeared first in the second (1846) edition of Mill (1952), although I do not find it in the 1848 American edition (see Mill 1973, vol. 7, p. 538; vol. 8, app. F). Following the opprobrium statement, there is a brief, pointed discussion of over- simple probability ideas applied to tribunals. For more on Mill's distaste for formalism, including that of French mathematical-political thought, see Schabas (1983). Eddington (1935, pp. 123-125) used the French legal example as the basis for his critique of casual independence.

18. Meier and Zabell (1980, especially pp. 501-502) give a fascinating story of 1867 expert testimony by two great American mathematicians, Benjamin and Charles S. Peirce (father and son), during a trial over a contested will. Apparently the independence assumption was used cas- ually in a context where it could hardly hold.

Casual multiplication in a legal setting has a literary endorsement in Poe's "The Mystery of Marie Roget" (1904). Poe writes that "If the feet of Marie being small, those of the corpse were also small, the increase of probability that the body was that of Marie would not be . . . merely arithmetical, but . . . highly geometrical, or accumulative. Add to all this shoes such as she had been known to wear . . .. Give us, then, flowers in the hat corresponding to those worn by the missing girl, and

we seek for nothing further. If only one flower, we seek for nothing further-what then if two or three, or more? Each successive one is multiple evidence-proof not added to proof, but multiplied by hundreds or thousands" (pp. 285-286).

19. There is also a somewhat aloof mathematical viewpoint that simply avoids the verisimilitude questions emphasized here. Kac (1982), for example, writes that "Statistical independence, once a shadowy partner of gamblers, experimental scientists and statisticians, has achieved the respectability that only an ancient discipline like number theory can bestow" (p. 72).

20. Perhaps one reason for misunderstandings about independence is inadequate training in classrooms and lectures. In a modest probe, I looked at two issues of the Journal of the American Statistical Association and counted 11 reviews of introductory textbooks. I inspected the six of these books in our library and graded their treatments of independence tolerantly: no A's, one B, two C's, one D, and three F's, an unhappy record. Among the problems with treatment of independence in these books, I cite first difficulty in even finding the independence discussions: Indexes were absent or skimpy, and in some cases I gave up after a fruitless search of the index and table of contents together with rapid reading in likely chapters. When treatments of independence were given, they tended to be definitions plus a few simple combinatorial examples. With luck, there might be mention of independence for random variables and observations. In at most one of the books was there anything ap- proaching responsible concern for what happens when there is depen- dence, how to recognize it, what analytic options there are, and so on.

21. An example of extraterrestrial calculation is the correspondence between Donaldson and Pollard (1980). Lyttle (1982) provides an ex- ample of the blithe assumption of independence to calculate the prob- ability of accidental launch of a strategic missile. For a fine treatment of the dependence problem in estimating accident probabilities for li- quified natural gas transportation see Fairley (1977, especially pp. 339- 346). Related problems have arisen in examination of acoustic evidence about the assassination of John F. Kennedy (National Research Council 1982, p. 129). Portnoy and Petersen (1984) criticize a study of biblical texts for casual assumptions of independence in statistical analyses. One could easily go on.

22. An important exception is the position of De Finetti and his fol- lowers. [See De Finetti (1982) for an intensely personal statement with key references.] The truly basic idea for De Finetti is not independence (plus identical distribution) but exchangeability, that is, invariance of the relevant joint distribution under permutations of variables. That option does not affect the primary theme of this paper: to look critically at any simplifying assumption-independence, exchangeability, whatever-and to worry about consequences when the assumption fails.

23. Discussions of such a device for measuring blood pressure are given by Wright and Dore (1970) and Labarthe, Hawkins, and Reming- ton (1973, especially p. 553). In addition to decreasing or avoiding de- pendence among contiguous observations, devices of this kind may also decrease or avoid sources of bias and unwanted variation: integer pref- erences, observer expectations, and so on.

[Received October 1987. Revised May 1988.1

REFERENCES

Annet, Peter (1744), The Resurrection of Jesus, London: M. Cooper. Anscombe, F. J. (1978), "Statistical Analysis, Special Problems of: Out-

liers," in International Encyclopedia of Statistics, eds. W. H. Kruskal and J. M. Tanur, New York: Free Press, pp. 1039-1944.

Archbishops' Commission (1958), The Church's Ministry of Healing, London: Church Information Board.

Babbage, Charles (1838), The Ninth Bridgewater Treatise: A Fragment (2nd ed.), London: John Murray. (The first edition appeared in 1837.)

Barnes, Harry Elmer (1965), An Intellectual and Cultural History of the Western World (3rd ed.), New York: Dover Publications. (First pub- lished in 1937.)

Bartholomew, David J. (1984), God of Chance, London: SCM. (1988), "iProbability, Statistics and Theology" (with discussion),

Journal of the Royal Statistical Society, Ser. A., 151, 137-178. Benson, E. F. (1930), As We Were/A Victorian Peep-Show, London:

Longmans, Green.

Page 11: miracles and stats

938 Journal of the American Statistical Association, December 1988

Berkson, Joseph, Magath, Thomas B., and Hurn, Margaret (1939), "The Error of Estimate of the Blood Cell Count as Made With the Hemo- cytometer," American Journal of Physiology, 128, 309-323.

Berry, R. J. (1986), "What to Believe About Miracles," Nature, 322, July 24, 321-322.

Bertrand, J. (1888), Calcul des Probabilites, Paris: Gauthier-Villars. Bethe, A. H. (1976), "The Necessity of Fission Power," Scientific Amer-

ican, 234, January, 21-31. Bienayme [Jules] (1838), "Probabilite des Jugements et des Temoin-

ages," in Extraits des Proces-Verbaux des Seances, Paris: Societe Phi- lomatique de Paris, pp. 93-96.

Bradburn, Norman M. (1969), The Structure of Psychological Well-Being, Chicago: Aldine.

Brams, Steven J. (1980), Biblical Games: A Strategic Analysis of Stories in the Old Testament, Cambridge, MA: MIT Press.

British Medical Association (1956), "Divine Healing and Co-operation Between Doctors and Clergy," memorandum to the Archbishops' Commission on Divine Healing, London: Author.

Brown, Colin (1984), Miracles and the Critical Mind, Grand Rapids, MI: Eerdmans.

Brush, Stephen G. (1974), "The Prayer Test," American Scientist, 62, 561-563; Correspondence (1975), 63, 6-7.

Burns, R. M. (1981), The Great Debate on Miracles/From Joseph Glanvil to David Hume, Lewisburg, PA: Bucknell University Press.

Chalmers, Thomas (1842), "On the Consistency Between the Efficacy of Prayer-and the Uniformity of Nature" (Discourses on the Christian Revelation Viewed in Connection With Modern Astronomy! To Which We Added Discourses Illustrative of the Connection Between Theology and General Science), in Works (Vol. 7), Glasgow: Collins, pp. 234- 262. (The catalog of the British Museum gives a range of 1836-1842 for the Works; it also appears to say that the Discourses was first published in 1818.)

- (1857), On the Miraculous and Internal Evidences of the Christian Revelation and the Authority of Its Records, New York: Carter. (First published in Edinburgh in 1814.)

Church of England (1986), The Nature of Christian Belief (Report of Church of England Bishops), London: Church House.

Cook, John (1821), An Inquiry Into the Books of the New Testament, Edinburgh: Waugh & Innes.

Cournot, A.-A. (1838), "Sur les Applications du Calcul des Chances a la Statistique Judiciare," Journal de Math6matiques Pures et Appli- quees, 3, 257-334.

Cullipp, Platon J. (1969), "The Efficacy of Prayer: a Triple-Blind Study," Medical Times, 97, 201-204.

Daille, John (Jean) (trans. T. Smith and G. Jekyll) (1843), A Treatise on the Right Use of the Fathers (2nd ed.), London: Bohn. (First pub- lished in 1631.)

De Finetti, Bruno (1982), "Probability and My Life," in The Making of Statisticians, ed. J. Gani, New York: Springer-Verlag, pp. 3-20.

Deffeyes, Kenneth S., and MacGregor, Ian D. (1980), "World Uranium Resources," Scientific American, 238, January, 66-76.

Dietl, Paul (1968), "On Miracles," American Philosophical Quarterly, 5, 130-134.

Donaldson, Thomas, and Pollard, William G. (1980), "Life on Other Planets?" American Scientist, 68, 128-131.

Eddington, Arthur (1935), New Pathways in Science, New York: Mac- millan.

Eggleston, Richard (1978), Evidence, Proof and Probability, London: Weidenfeld & Nicolson.

Eichrodt, Walther (trans. J. A. Baker) (1967), Theology of the Old Testament (Vol. 2), Philadelphia: Westminster. [Published in 1964 as Theologie des Alten Testament (5th ed.), Stuttgart: Ehrenfried Klotz.]

Eliade, Mircea (trans. M. L. Ricketts) (1981), Autobiography, Vol. I: 1907-1937, Journey East, Journey West, San Francisco: Harper & Row.

Fairley, William B. (1977), "Evaluating the 'Small' Probability of a Catastrophic Accident From the Marine Transportation of Liquified Natural Gas," in Statistics and Public Policy, by W. Fairley and F. Mosteller, Reading, MA: Addison-Wesley, pp. 331-353.

Fairley, William B., and Mosteller, Frederick (1974), "A Conversation About Collins," The University of Chicago Law Review, 41, 242-253. [Reprinted, along with other relevant materials, in 1977 in Statistics and Public Policy, Reading, MA: Addison-Wesley, pp. 369-379.]

Falk, Bernard (1942), The Bridgewater Millions/A Candid Family His- tory, London: Hutchinson. [See the (anonymous) review of this book in The Times Literary Supplement, 41, June 13, 294.]

Fienberg, Stephen E. (1971), "Randomization and Social Affairs: The 1970 Draft Lottery," Science, January 22, 255-261.

Flew, Antony (1967), "Miracles," in Encyclopedia of Philosophy (Vol. 5), New York: Macmillan and Free Press, pp. 346-353.

Frank, Jerome D. (1978), "The Medical Power of Faith," Human Nature, 1, August, 40-47.

Frank, Philipp G. (1956), "The Variety of Reasons for the Acceptance of Scientific Theories," in The Validation of Scientific Theories, ed. P. G. Frank, Boston: Beacon, pp. 3-18.

Frye, Northrop (1982), The Great Code: The Bible and Literature, New York: Harcourt, Brace & Jovanovich. [Reviewed by Kermode (1982).]

Galton, Francis (1872), "Statistical Inquiries Into the Efficacy of Prayer," The Fortnightly Review, New Series 12, August, 125-135. [Reprinted in Means (1876, pp. 85-106).]

(1883), Inquiries Into Human Faculty and Its Development, Lon- don: Macmillan. (Second edition published in 1907 by Dent.)

Garfield, Eugene (1977), "When Citation Analysis Strikes Ball Light- ening," in Essays of an Information Scientist (Vol. 2, 1974-1976), Philadelphia: ISI Press, pp. 479-490. (First appeared in Current Con- tents, May 12, 1976.)

Garraghan, Gilbert J. (1946), A Guide to Historical Method, ed. J. De- langlez, New York: Fordham University Press.

Gilkey, Langdon (1982), "The Creationist Controversy: the Interrelation of Inquiry and Belief," Science, Technology, & Human Values, 77, Summer, 67-71.

Grant, Robert M. (1952), Miracle and Natural Law in Graeco-Roman and Early Christian Thought, Amsterdam: North-Holland.

Haldane, J. B. S. (1928), "Science and Theology as Art Forms," in Possible Worlds and Other Papers, New York: Harper & Bros., pp. 237-252. (Reprinted in 1985 in On Being the Right Size and Other Essays, ed. J. M. Smith, New York: Oxford University press.)

Hartman, David (1976), Maimonides: Torah and Philosophic Quest, Phil- adelphia: Jewish Publication Society of America.

Healy, M. J. R. (1979), "Does Medical Statistics Exist?" Bulletin In Applied Statistics, 6, 137-182.

Heller, Joseph (1958), "Maimonides' Theory of Miracles," in Between East and West, ed. A. Alexander, London: East & West Library, pp. 112-127.

Heyde, C. C., and Seneta, E. (1977), I. J. BienaymelStatistical Theory Anticipated, New York: Springer-Verlag.

Hooper, George (1689), A Fair and Methodical Discussion.. ,London: Chiswell.

(1699), "A Calculation of the Credibility of Human Testimony," Philosophical Transactions of the Royal Society, 21, 359-365. (There has been disagreement about the authorship of this paper. It appeared in Hooper's Works, published by Oxford University Press in 1757 and 1855.)

House of Bishops (1986), The Nature of Christian Belief, London: Church House.

Hume, David (1975), "Of Miracles," in An Enquiry Concerning Human Understanding (3rd ed.), ed. L. A. Selby-Bigge, revised by P. H. Nidditch, Oxford, U.K.: Clarendon Press, sec. 10, p. 109-131. (First published in 1748; this is from the posthumous 1777 ed.)

Joyce, C. R. B., and Weldon, R. M. C. (1965), "The Objective Efficacy of Prayer: a Double-Blind Clinical Trial," Journal of Chronic Diseases, 18, 367-377.

Kac, Mark (1982), "The Search for the Meaning of Independence," in The Making of Statisticians, ed. J. Gani, New York: Springer-Verlag, pp. 62-72.

Kermode, Frank (1982), Review of The Great Code: The Bible and Literature, by N. Frye, The New Republic, 186, June 9, 30-33.

Kiefer, J. (1979), "Comments on Taxonomy, Independence, and Math- ematical Models (with reference to a methodology of Machol and Singer)," Mycologia, 71, 343-378.

Koblitz, Neal (1981), "Mathematics as Propaganda," in Mathematics Tomorrow, ed. L. A. Steen, New York: Springer-Verlag, pp. 111- 120.

Kramer, Jane (1985), "Letter From Europe," The New Yorker, 61, April 15, 73-89.

Kruskal, William H. (1960), "Some Remarks on Wild Observations," Technometrics, 2, 1-3.

Labarthe, Darwin R., Hawkins, C. Morton, and Remington, Richard D. (1973), "Evaluation of Performance of Selected Devices for Mea- suring Blood Pressure," The American Journal of Cardiology, 32, 546- 553.

Lapp, Ralph E. (1974), "Nuclear Salvation or Nuclear Folly?" The New York Times Magazine, February 10, 10-73.

Page 12: miracles and stats

Kruskal: Miracles and Statistics

Lewis, C. S. (1960), Miracles/A Preliminary Study (2nd ed.), New York: Macmillan. (First published in 1947.)

Lewis, George Cornewall (1849), An Essay On the Influence of Authority in Matters of Opinion, London: Parker.

Lewis, Harold W. (1980), "The Safety of Fission Reactors," Scientific American, 242, March, 53-65.

Locke, John (1959), An Essay Concerning Human Understanding, ed. A. C. Fraser, New York: Dover. (First published in 1690; Fraser edition published in 1894.)

Lorenz, Konrad Z. (1974), "Analogy as a Source of Knowledge," Sci- ence, 185, July 19, 229-234.

Lyttle, Bradford (1982), "The Apocalypse Equation," Harvard Maga- zine, 89, March-April, 19-20.

Machol, Robert E. (1975), "The Titanic Coincidence," Interfaces, 5, 53- 54.

(1980), "Comments on Taxonomy-A Response," Mycologia, 72, 1222-1224.

Madansky, Albert (1986), "On Biblical Censuses," Journal of Official Statistics, 2, 561-569.

Mahalanobis, P. C. (reported by J. Tucker, Jr.) (1947), "Summary of Lecture on the Combination of Data From Tests Conducted at Dif- ferent Laboratories," ASTM Bulletin, No. 144, 63-65.

Maimonides, Moses (trans. S. Pines) (1963), The Guide of the Per- plexed,Chicago: University of Chicago Press.

Marshall, Eliot (1986), "Academy Membership Fight Goes Public," Sci- ence, 234, 1192-1194; Follow-up (1987), 236, 661-662.

Marty, Martin E. (1977), "Science Versus Religion: An Old Squabble Simmers Down," Saturday Review, 5, December 10, 29-35.

Mascall, E. L. (1956), Christian Theology and Natural Science, New York: Ronald.

Mason, Brian (1967), "Meteorites," American Scientist, 55, 429-455. McArthur, Harvey K. (1965), "A Survey of Recent Gospel Research,"

in New Theology No. 2, eds. M. E. Marty and D. G. Peerman, New York: Macmillan, pp. 201-221. (First published in 1964.)

McCormick, Charles Tilford (1972), Handbook of the Law of Evidence (2nd ed.), ed. E. W. Cleary, St. Paul, MN: West Publishing.

McKinnon, Alastair (1967), "'Miracle' and 'Paradox'," American Phil- osophical Quarterly, 4, 308-314.

Means, John 0. (ed.) (1876), The Prayer-Gauge Debate, Boston: Con- gregational Publishing Society.

Meier, Paul, and Zabell, Sandy (1980), "Benjamin Peirce and the How- land Will," Journal of the American Statistical Association, 75, 497- 506.

Mill, John Stuart (1885), Nature, the Utility of Religion and Theism, London: Longmans, Green. [First published in 1874. For a full version see the 1969 Toronto edition, subtitled "Three Essays on Religion" (Vol. 10 of the Collected Works).]

(1952), A System of Logic/Ratiocinative and Inductive (8th ed.), London: Longmans, Green. [First published in 1843. Second edition 1846. For full scholarly treatment see the 1973 edition (Vol. 7), Uni- versity of Toronto Press.]

(1969), Collected Works, Toronto: University of Toronto Press. (1973), A System of Logic/Ratiocinative and Inductive, Toronto:

University of Toronto Press. National Research Council (1982), "Reexamination of Acoustic Evi-

dence in the Kennedy Assassination," Science, 218, October 8, 127- 133; Correction (1982), 218, November 15, 521.

Owen, David (1987), "Hume Versus Price on Miracles and Prior Prob- abilities: Testimony and the Bayesian Calculation," The Philosophical Quarterly, 37, 187-202.

Page, E. S. (1978), "Process Control," in International Encyclopedia of Statistics, eds. W. H. Kruskal and J. M. Tanur, New York: Free Press, pp. 809-812.

Patey, Douglas Lane (1984), Probability and Literary Form/Philosophic Theory and Literary Practice in the Augustan Age, Cambridge, U.K.: Cambridge University Press.

Pearson, Karl (1924), "Historical Note on the Origin of the Normal Curve of Error," Biometrika, 16, 402-404.

(1924-1930), The Life, Letters and Labours of Francis Galton, Cambridge, U.K.: Cambridge University Press.

Pearson, Karl (1978), The History of Statistics in the 17th and 18th Cen- turies, ed. E. S. Pearson, London: Charles W. Griffin. (Lectures given in 1921-1933.)

Peirce, C. 5. (1958), "4The Laws of Nature and Hume's Argument Against Miracles," Values in a Universe of Chance/Selected Writings of Charles S. Peirce (1839-1 914), ed. P. P. Wiener, Stanford, CA: Stanford Uni- versity Press, pp. 289-321. (Written in 1901.)

Perrin, Norman (1977), The Resurrection According to Matthew, Mark, and Luke, Philadelphia: Fortress.

Poe, Edgar Allan (1904), "The Mystery of Marie Roget," in Works (Vol. - 1), New York: Collier, pp. 248-327. (First published in 1842.) Pomeroy, Ralph S. (1962), "Hume on the Testimony for Miracles,"

Speech Monographs, 29, 1-12. Portnoy, Stephen (1988), "Statistics and Religious Studies," in Encyclo-

pedia of Statistical Sciences (Vol. 8), eds. S. Kotz and N. L. Johnson, New York: John Wiley, pp. 679-683.

Portnoy, Stephen L., and Petersen, David L. (1984), "Biblical Texts and Statistical Analysis: Zechariah and Beyond," Journal of Biblical Lit- erature, 103, 11-21.

Price, Richard (1777), "On the Importance of Christianity, the Nature of Historical Evidence, and Miracles," in Four Dissertations (4th ed.), London: Cadell, Dissertation 4. (First published in 1767.)

Prior, A. N. (1955), "Can Religion Be Discussed?" in New Essays in Philosophical Theology, eds. A. Flew and A. Maclntyre, London: SCM, pp. 1-11. [First appeared in Australian Journal of Philosophy (1942).]

Ramsey, I. T. (1971), "Our Understanding of Prayer," Archbishops' Commission on Christian Doctrine Occasional Paper 1, London: SPCK.

Romanes, Ethel (1908), The Life and Letters of George John Romanes (6th impression), London: Longmans, Green.

Romanes, George J. (1874), Christian Prayer and General Laws, Lon- don: Macmillan. (Includes the Burney Prize essay for 1873.)

Rousseau, Jean Jacques (trans. B. Foxley) (1911), Emile, London: Deut. [First published in 1762. The quotation is from "The Creed of a Sa- voyard Priest" (book 4).]

Schabas, Margaret (1983), "John Stuart Mill to William Stanley Jevons: An Unpublished Letter," The Mill News Letter, 18, No. 2, 24-28.

Smart, Ninian (1964), Philosophers and Religious Truth, London: SCM. Sobel, Jordan Howard (1987), "On the Evidence of Testimony for Mir-

acles: A Bayesian Interpretation of David Hume's Analysis," The Philosophical Quarterly, 37, 166-186.

Spence, Janet T., and Helmreich, Robert L. (1978), Masculinity & Fem- ininityl Their Psychological Dimensions, Correlates, & Antecedents, Austin: University of Texas Press.

Stead, W. T. (1891), "The Prince of Wales," The Review of Reviews, 4, 23-34. [Ascription of authorship comes from Benson (1930, p. 211) and from internal evidence.]

Stephen, Leslie (1962), History of English Thought in the Eighteenth Century (3rd ed.), New York: Harcourt, Brace & World. (First pub- lished in 1876.)

Stevens, William Arnold, and Burton, Ernest DeWitt (1905), A Har- mony of the Gospels for Historical Study (4th ed.), New York: Charles Scribner.

Stewart, Balfour (1878), "Science and a Future State," The Princeton Review, 54, 399-408.

Stigler, Stephen (1986), "John Craig and the Probability of History: From the Death of Christ to the Birth of Laplace," Journal of the American Statistical Association, 81, 879-887.

Stokes, G. G. (1891), Natural Theology, London: Black. Strauss, David Friedrich [trans. G. Eliot (M. Evans)] (1892), The Life

of Jesus, London: Swan Sonnenschein. (First published in 1835, in Tuibingen, as Das Leben Jesu.)

Swerdlow, Noel M. (1979), "Ptolemy on Trial," The American Scholar, 48, 523-531. (Essay-review of R. R. Newton's The Crime of Claudius Ptolemy, Johns Hopkins University Press.)

Swinburne, R. G. (1968), "Miracles," Philosophical Quarterly, 18, 320- 328.

(1970), The Concept of Miracle, London: Macmillan. Thomas, Jo (1985), "Toward the Blunt Bishop, Thatcher Is Irreverent,"

New York Times, May 9. Thompson, Harold William (1931), A Scottish Man of Feeling, London:

Oxford University Press. Trench, Richard Chenevix (1860), Notes on the Miracles of Our Lord

(2nd Amer. ed.), New York: Appleton. Trinklein, Frederick E. (1971), The God of Science, Grand Rapids, MI:

Eerdmans. Turner, Frank M. (1974), "Rainfall, Plagues, and the Prince of Wales:

A Chapter in the Conflict of Religion and Science," Journal of British Studies, 13, 46-65.

Tyndall, John (1871), "Prayer and Natural Law," in Fragments of Science for Unscientific People, London: Longmans, Green, pp. 31-38. [First published in Mountaineering (1861).]

Uman, M. A. (1982), Review of Ball Lightening and Bead Lightening, by J. D. Barry, American Scientist, 80, 535-536.

Page 13: miracles and stats

940 Journal of the American Statistical Association, December 1988

Van Leeuwen, Henry G. (1963), The Problem of Certainty in English ThoughtIl630-1690, the Hague: Nijhoff.

Venn, John (1962), The Logic of Chance (4th ed.), New York: Chelsea Publishing. (First published in 1866; 3rd ed. 1888.)

Walker, D. P. (1975), Spiritual and Demonic Magic/From Ficino to Cam- panella, South Bend, IN: University of Notre Dame Press. (First pub- lished in 1958.)

Westrum, Ron (1978), "Science and Social Intelligence About Anom- alies: The Case of Meteorites," Social Studies of Science, 8, 461- 493.

Wright, B. M., and Dore, C. F. (1970), "A Random-Zero Sphygmo- manometer," The Lancet, 1, February 14, 337-338.

Young, J. R. (1844), An Elementary Treatise on Algebra, Theoretical and Practical (4th ed.), London: Souter & Law.

(1846), Three Lectures Addressed to the Students of Belfast College on Some of the Advantages of Mathematical Study to Which Is Added an Examination of Hume's Argument Against Miracles, London: Sou- ter & Law.

Zabell, S. L. (in press), "The Probabilistic Analysis of Testimony," Journal of Statistical Planning and Inference, 20.