Charlie Gibbons Political Science 236 October 1,...

28
Randomization Inference Charlie Gibbons Political Science 236 October 1, 2008

Transcript of Charlie Gibbons Political Science 236 October 1,...

Page 1: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Randomization Inference

Charlie GibbonsPolitical Science 236

October 1, 2008

Page 2: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Outline

1 ExperimentsAbout experimentsThe framework

2 Testing for no effectAbout these testsTest statistics

These notes are based upon Rosenbaum (2002).

Page 3: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Experiments

Fischer makes four important points about experiments:1 Experiments do not require units to be homogeneous2 Experiments do not require units to be a random sample

from a population of units3 Treatment only needs to be allocated randomly among

experimental units4 Probability (“chance”) enters the analysis only through

random treatment assignment, which the researchercontrols

Page 4: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Experiments

Fischer makes four important points about experiments:1 Experiments do not require units to be homogeneous2 Experiments do not require units to be a random sample

from a population of units3 Treatment only needs to be allocated randomly among

experimental units4 Probability (“chance”) enters the analysis only through

random treatment assignment, which the researchercontrols

Page 5: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Experiments

Fischer makes four important points about experiments:1 Experiments do not require units to be homogeneous2 Experiments do not require units to be a random sample

from a population of units3 Treatment only needs to be allocated randomly among

experimental units4 Probability (“chance”) enters the analysis only through

random treatment assignment, which the researchercontrols

Page 6: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Experiments

Fischer makes four important points about experiments:1 Experiments do not require units to be homogeneous2 Experiments do not require units to be a random sample

from a population of units3 Treatment only needs to be allocated randomly among

experimental units4 Probability (“chance”) enters the analysis only through

random treatment assignment, which the researchercontrols

Page 7: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Experiments

Fischer makes four important points about experiments:1 Experiments do not require units to be homogeneous2 Experiments do not require units to be a random sample

from a population of units3 Treatment only needs to be allocated randomly among

experimental units4 Probability (“chance”) enters the analysis only through

random treatment assignment, which the researchercontrols

Page 8: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Experiments

Fischer and Neymann had different ideas about whatexperiments achieved.

Page 9: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Experiments

Fischer and Neymann had different ideas about whatexperiments achieved.

Fischer believed that the experiment gave an accurate inferencefor those units that had to be treated (consistent with hispreceding points), but not necessarily a sound inference for theentire population. Hence experiments give internal validity, butnot external validity.

Page 10: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Experiments

Fischer and Neymann had different ideas about whatexperiments achieved.

Fischer believed that the experiment gave an accurate inferencefor those units that had to be treated (consistent with hispreceding points), but not necessarily a sound inference for theentire population. Hence experiments give internal validity, butnot external validity.

Neymann believed that units can be chosen randomly from thepopulation to generate external validity.

Page 11: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

The framework

Let there be N units divided into S strata prior to treatment,giving ns units in strata s. Define the following

Zsi = Iunit i in strata s received treatmentZs = (Zs1, . . . Zsns)′

Z = (Z1, . . . , ZS)′

ms =ns∑i=1

Zsi

Note that Z is a random variable with a distributiondetermined by the researcher. The only limitation on thisdistribution is that 0 < Pr(Zsi = 1) < 1 ∀ s, i. Let Ω be the setof all possible realizations of Z.

Page 12: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

The framework

Examples:Fixed margins

ms = ks

K ≡ #Ω =S∏

s=1

(ns

ks

)Binomial randomization

Pr(Zsi = 1) =12

K ≡ #Ω = 2N

Page 13: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

The framework

Examples:Fixed margins

ms = ks

K ≡ #Ω =S∏

s=1

(ns

ks

)Binomial randomization

Pr(Zsi = 1) =12

K ≡ #Ω = 2N

Page 14: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

The framework

Examples:Fixed margins

ms = ks

K ≡ #Ω =S∏

s=1

(ns

ks

)Binomial randomization

Pr(Zsi = 1) =12

K ≡ #Ω = 2N

Page 15: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Testing for no effect

The simplest case of randomization inference is to test for theabsence of an effect. That is, the null hypothesis is that there isno treatment effect. Why?

Page 16: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Testing for no effect

The simplest case of randomization inference is to test for theabsence of an effect. That is, the null hypothesis is that there isno treatment effect. Why?

This sort of test can be performed with very limitedassumptions and may provide a good starting point beforegenerating point estimates.

Page 17: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Testing for no effect

Let rsi be the response of unit i in strata s and r be a vector ofall responses corresponding to Z.

Page 18: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Testing for no effect

Let rsi be the response of unit i in strata s and r be a vector ofall responses corresponding to Z.

What is random under the null hypothesis?

Page 19: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Testing for no effect

Let rsi be the response of unit i in strata s and r be a vector ofall responses corresponding to Z.

What is random under the null hypothesis?The allocation of treatment Z is random and under thecontrol of the researcherThe response vector r is not random—under the nullhypothesis, there is no response to treatment and thusresponses would not change if a different set of treatmentsZ ′ was applied.

Page 20: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Testing for no effect

Let rsi be the response of unit i in strata s and r be a vector ofall responses corresponding to Z.

What is random under the null hypothesis?The allocation of treatment Z is random and under thecontrol of the researcherThe response vector r is not random—under the nullhypothesis, there is no response to treatment and thusresponses would not change if a different set of treatmentsZ ′ was applied.

Page 21: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Testing for no effect

Let rsi be the response of unit i in strata s and r be a vector ofall responses corresponding to Z.

What is random under the null hypothesis?The allocation of treatment Z is random and under thecontrol of the researcherThe response vector r is not random—under the nullhypothesis, there is no response to treatment and thusresponses would not change if a different set of treatmentsZ ′ was applied.

Hence, when doing inference, we will take r as given (i.e., ,condition on it) and compare it to other possible realizations oftreatment.

Page 22: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Test procedure

We have the following procedure for test statistic t(z, r) forfixed set of responses and treatment assignment z:

1 The null hypothesis is assumed, thereby fixing the value ofr.

2 Treatment assignment z is selected at random from Ω.3 Find the test statistic T of the actual experiment.4 Calculate the probability of generating a test statistic

greater than or equal to the one from the actualexperiment using all possible treatment assignments in Ω.

Page 23: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Test procedure

We have the following procedure for test statistic t(z, r) forfixed set of responses and treatment assignment z:

1 The null hypothesis is assumed, thereby fixing the value ofr.

2 Treatment assignment z is selected at random from Ω.3 Find the test statistic T of the actual experiment.4 Calculate the probability of generating a test statistic

greater than or equal to the one from the actualexperiment using all possible treatment assignments in Ω.

Page 24: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Test procedure

We have the following procedure for test statistic t(z, r) forfixed set of responses and treatment assignment z:

1 The null hypothesis is assumed, thereby fixing the value ofr.

2 Treatment assignment z is selected at random from Ω.3 Find the test statistic T of the actual experiment.4 Calculate the probability of generating a test statistic

greater than or equal to the one from the actualexperiment using all possible treatment assignments in Ω.

Page 25: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Test procedure

We have the following procedure for test statistic t(z, r) forfixed set of responses and treatment assignment z:

1 The null hypothesis is assumed, thereby fixing the value ofr.

2 Treatment assignment z is selected at random from Ω.3 Find the test statistic T of the actual experiment.4 Calculate the probability of generating a test statistic

greater than or equal to the one from the actualexperiment using all possible treatment assignments in Ω.

Page 26: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Test procedure

We have the following procedure for test statistic t(z, r) forfixed set of responses and treatment assignment z:

1 The null hypothesis is assumed, thereby fixing the value ofr.

2 Treatment assignment z is selected at random from Ω.3 Find the test statistic T of the actual experiment.4 Calculate the probability of generating a test statistic

greater than or equal to the one from the actualexperiment using all possible treatment assignments in Ω.

This yields a significance level of

Pr(t(Z, r)) =∑z∈Ω

It(z, r) ≥ TPr(Z = z)

Page 27: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Test statistics

In The Lady Tasting Tea, the test statistic that we use is thenumber of cups correctly guessed:

T = Z ′r︸︷︷︸Treated correctly identified

+(1− Z ′) (1− r)︸ ︷︷ ︸

Control correctly identified

= 2Z ′r

Page 28: Charlie Gibbons Political Science 236 October 1, 2008gibbons.bio/courses/ps236/RandomizationInference.pdf · Charlie Gibbons Political Science 236 October 1, 2008. Outline 1 Experiments

Test statistics

In The Lady Tasting Tea, the test statistic that we use is thenumber of cups correctly guessed:

T = Z ′r︸︷︷︸Treated correctly identified

+(1− Z ′) (1− r)︸ ︷︷ ︸

Control correctly identified

= 2Z ′r

When we are calculating the significance of this result againstthe null, we ask, “if treatment were assigned differently, but hisresponses do not change, what’s the probability that his slate ofguesses would have performed at least as well as it did in thiscase?”

For example, “if we had ordered the cups differently, what’s theprobability that his guesses would have gotten at least as manycups correct as he did here?”