
Volume 25, Issue 1, Spring 2015

Comparative Politics Newsletter
The Organized Section in Comparative Politics of the American Political Science Association

Editors: Matt Golder ([email protected]), Sona N. Golder ([email protected])
Editorial Assistant: Charles Crabtree
Editorial Board: Molly Ariotti, Xun Cao, Elizabeth C. Carlson, Amanda Fidalgo, Vineeta Yadav, Joseph Wright

Contents

Letter from the Editors: Sensitive Topics by Matt Golder and Sona N. Golder

I. Symposium: Studying Sensitive Political Phenomena

Can Anonymity Promises Possibly be Credible in Police States? by Jesse Driscoll
Sensitive Data and the French State by David Laitin
Survey Methods for Sensitive Topics by Graeme Blair
Randomized Response: Foundations and New Developments by Daniel Gingerich
Capturing Ethnic Salience on Surveys in Developing Countries by Elizabeth C. Carlson
Lobbying, Corruption, and Non-Responses in Small Samples by Vineeta Yadav

II. Special Topic

The Ethics of Field Experiments in Comparative Politics by Scott Desposato

III. Dataset

Integrating Data on Ethnicity, Geography and Conflict: The Family of Ethnic Power Relations Datasets 2014 by Nils-Christian Bormann, Manuel Vogt, and Lars-Erik Cederman

IV. Other

Contributors
Announcements and Additional Information

Letter from the Editors: Sensitive Topics
by Matt Golder & Sona N. Golder

The Pennsylvania State University

Welcome to the Spring 2015 issue of the Comparative Politics Newsletter. We are very happy to take over as editors, and would like to take this opportunity to thank the previous editors at the Hertie School of Governance in Berlin, Mark Hallerberg and Mark Kayser, for their excellent work over the last three years. We hope to maintain their high standards during our own editorial term.

One of the most important goals of the Comparative Politics Newsletter, in our opinion, is to engender a sense of community among comparative politics scholars around the world. To help with this, we have created a webpage for the Newsletter at http://comparativenewsletter.com. Please take a look at the webpage. You’ll find information about our editorial team at Penn State, but, more importantly, you’ll also find information about the contributors to our upcoming issues. As you’ll see, our next issue will include a symposium on training graduate students in comparative politics. In addition, you’ll find links to previous issues of the Newsletter. To make the website more useful, we also have a section on the Homepage that provides news and announcements that might be relevant to the members of the Comparative Politics Section. If you have anything that you would like to share with others, please contact us and we will be happy to post it on the website. You can contact us through the Contact page or, alternatively, by using our Penn State email addresses ([email protected], [email protected]). So that you do not have to keep checking the website to see if there have been any news updates, you can also follow us on Twitter @cp_newsletter. We hope that by using the webpage and our Twitter account, we can pass along important news and announcements in a much more timely manner than has been possible in the past. If you have any suggestions for improving the utility of the webpage or Newsletter, we’d be very happy to hear them.

In line with previous editorial teams, the Newsletter will include symposia on various substantive and methodological issues, highlight new datasets of broad appeal, and generally inform the community about field-specific developments. With the exception of our inaugural issue, all future issues will also include a new section, tentatively called Letters to the Editors. This section will include selected short comments (less than 300 words) from our readers in response to the material in the previous issue of the Newsletter. These comments might critique or build on the contributions from the previous issue, highlight some line of research or data source that was overlooked, or more generally provide additional information that might be of use to the Section membership. Our hope is that this new section can help engender and sustain more of a dialogue amongst comparative political scientists. We encourage you to submit your thoughts about the content of our current Newsletter using the Contact Page on the Newsletter website.

Our current issue includes a symposium looking at how researchers go about studying sensitive political phenomena, a special topic on the ethics of experiments in comparative politics, and an overview of the family of Ethnic Power Relations (EPR) datasets.

Our opening contribution to the symposium on researching sensitive political phenomena comes from Jesse Driscoll, an Assistant Professor in the School of International Relations and Pacific Studies at the University of California, San Diego. Jesse spent over two years conducting ethnographic research in Tajikistan and Georgia for his forthcoming Cambridge University Press book, Warlords and Coalition Politics in Post-Soviet States. Many of the people Jesse interviewed were former combatants and militia members, some of whom had engaged in violent criminal activity both on behalf of, and against, the state. In his contribution, Jesse describes some of the safety, methodological, and ethical issues that he faced when conducting his research, particularly the tradeoff between promises of anonymity and emergent norms of data replicability.

Our second contribution comes from David Laitin, a Professor in the Department of Political Science at Stanford University. David’s essay looks at the relationship between sensitive data and the French state, a topic covered in his forthcoming Harvard University Press book, Why Muslim Integration Fails in Christian-Heritage Societies (co-authored with Claire L. Adida and Marie-Anne Valfort). Historically, the French state has prohibited the collection or processing of data that reveal the racial or ethnic origins (or political, philosophical or religious opinions, or union membership, or health, or sexual life) of people living in France. This prohibition is currently a matter of hot political debate in France due to social tensions in the immigrant banlieues and growing evidence of ethnic and religious discrimination in the public sphere. The French Prime Minister Manuel Valls recently went so far as to say that “territorial, social and ethnic apartheid” existed in France (de La Baume, 2015). The January 2015 terrorist attacks carried out by French-born radical Islamists on the offices of the satirical magazine Charlie Hebdo, which occurred after David had submitted his piece, have added further impetus to those calling on the French state to authorize the collection of ethnicity data. It’s not clear how much will change, though. On February 5, the French president, François Hollande, stated that collecting ethnic data would be “inutile” (useless) and would not help solve social problems related to unemployment and educational underachievement (Baumard, 2015).

Our third contribution comes from Graeme Blair, a Ph.D. Candidate at Princeton who will be an Assistant Professor at UCLA from 2016. When scholars use survey data, they implicitly assume that respondents have answered the survey questions truthfully. However, this assumption may be violated if respondents have incentives to conceal the truth, something that is likely to happen when the survey addresses sensitive topics. Graeme’s essay provides an overview of various methods — survey administration protections, randomized response techniques, list experiments, and endorsement experiments — that can be used to elicit more truthful responses. He also discusses some of the critiques that can be made of each of these techniques.


Daniel Gingerich, an Associate Professor in the Department of Politics at the University of Virginia, provides our fourth contribution. Daniel offers an in-depth look at how randomized response techniques can be used to study sensitive behavior and attitudes. Among other things, he describes several of the most commonly used randomized response models, outlines key recent developments in the use of these models, and discusses the findings of validation studies that have examined the performance of these models for a variety of sensitive topics both in political science and more generally. He concludes with some practical advice for implementing randomized response surveys in the field.
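To give a concrete sense of how these designs work, here is a minimal simulated sketch of one common randomized response model, the forced-response design, in which a private die roll forces some respondents to answer “yes” or “no” regardless of the truth. The probabilities, function names, and numbers below are our own illustrative assumptions, not details drawn from Daniel’s essay.

    import numpy as np

    rng = np.random.default_rng(42)

    # Forced-response design: each respondent privately rolls a die.
    # On a 6 they must answer "yes", on a 1 they must answer "no",
    # and on 2-5 they answer the sensitive question truthfully.
    # The interviewer never observes the roll.
    P_FORCED_YES = 1 / 6
    P_FORCED_NO = 1 / 6
    P_TRUTHFUL = 1 - P_FORCED_YES - P_FORCED_NO

    def simulate_responses(true_prevalence, n):
        """Simulate observed answers under the forced-response design."""
        truth = rng.random(n) < true_prevalence   # latent sensitive trait
        roll = rng.random(n)
        forced_yes = roll < P_FORCED_YES
        forced_no = roll >= 1 - P_FORCED_NO
        truthful = ~forced_yes & ~forced_no
        return forced_yes | (truthful & truth)    # observed "yes" answers

    def estimate_prevalence(answers):
        """E[answer] = P_TRUTHFUL * pi + P_FORCED_YES, so invert for pi."""
        return (answers.mean() - P_FORCED_YES) / P_TRUTHFUL

    answers = simulate_responses(true_prevalence=0.20, n=5000)
    print(f"observed 'yes' rate:  {answers.mean():.3f}")                # about 0.30
    print(f"estimated prevalence: {estimate_prevalence(answers):.3f}")  # about 0.20

Because no single “yes” reveals anything about an individual respondent, the design protects anonymity, yet the prevalence of the sensitive trait remains identifiable in the aggregate.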

Our last two contributions to the symposium come from two of our colleagues who work on sensitive topics, Elizabeth C. Carlson and Vineeta Yadav. Elizabeth is an Assistant Professor in the Department of Political Science and the Program on African Studies at Penn State. She uses survey and experimental methods to study political behavior and citizen preferences in Africa’s new democracies. Some of her research attempts to identify the conditions under which Ugandan voters prefer a co-ethnic candidate for office, especially when that candidate is dubiously qualified. In her contribution, she reports the results of an experiment in which she examines reported levels of co-ethnic voting when (i) the ethnicity of the enumerator and respondent are matched, and (ii) when questions about co-ethnic voting are asked through a secret ballot embedded within a larger face-to-face interview. She finds that matching the ethnicity of respondents and enumerators exacerbates bias in reported co-ethnic voting, but that providing an opportunity for respondents to self-administer sensitive questions can lead to a more accurate reporting of co-ethnic voting.

Vineeta Yadav is an Assistant Professor in the Department of Political Science at Penn State. Her research focuses on the effects of institutions on economic development, with a particular emphasis on how institutions influence lobbying, corruption, and judicial empowerment. In her contribution, Vineeta points out that many of the survey methods developed for dealing with sensitive questions — randomized response techniques, list experiments, and endorsement experiments — require a large sample size for successful implementation. This can be problematic if one’s population of interest is relatively small. Vineeta goes on to discuss this issue in the context of a business elite survey that she conducted on corruption and lobbying in Brazil and India. She suggests that it is important to take into account the selection process that causes only some respondents to provide answers to sensitive survey questions.

Our inaugural issue of the Comparative Politics Newsletter also includes a special topics contribution by Scott Desposato on ethics and experiments in comparative politics. Scott is an Associate Professor in the Department of Political Science at UC San Diego. In 2013, he organized and hosted the NSF-funded Ethics in Comparative Politics Experiments Conference. The goal of the conference was to identify critical issues with experiments in comparative politics, explore opinion in the field, and propose practical strategies for moving forward. In his contribution, Scott summarizes the central topics that came up during this conference, focusing on the importance of taking context into account, the role of local review, issues to do with deception and consent, and the impact that experiments can have in real political situations. A more detailed account of the proceedings of the conference will be published later this year in the Routledge Series in Experimental Political Science.

Finally, Nils-Christian Bormann, Manuel Vogt, and Lars-Erik Cederman describe the family of Ethnic Power Relations (EPR) datasets. Nils-Christian and Manuel are both post-doctoral researchers in the International Conflict Research group at ETH Zurich; Lars-Erik is Professor of International Conflict Research in the Center for Comparative and International Studies at ETH Zurich. The EPR datasets integrate information on ethnicity, geography, and conflict for over 800 unique ethnic groups in 165 countries from 1946 to 2013. The individual components of the EPR dataset family can be downloaded from the GROWup web portal at http://growup.ethz.ch.

We hope that you enjoy our first issue of the Comparative Politics Newsletter. If you have ideas for possible symposia or special topics, or would like to publicize a dataset of broad appeal, please contact us. Again, you can contact us through the Contact page of our webpage at http://comparativenewsletter.com/contact or simply use our Penn State email addresses ([email protected], [email protected]). We look forward to working with you over the next four years.

Matt and Sona


References

Baumard, Maryline. 2015. “François Hollande estime inutiles les statistiques.” Le Monde.

de La Baume, Maïa. 2015. “French Premier Says ‘Apartheid’ Is Leaving Minorities on the Fringe.” The New York Times.

I. Symposium: Studying Sensitive Political Phenomena

Can Anonymity Promises Possibly be Credible in Police States? Interview Strategies in Post-War Authoritarian Regimes

by Jesse Driscoll
University of California, San Diego

“I get it. I know what you want. I understand. You want to make up a list, on your computer, of all our bad men. ‘Terrorists.’ You want to cross names off the list when they were killed or jailed. To see that we Tajiks can take care of our own. But we can. We did. You’ll see.”

Yuri, Dushanbe, 2007

In the course of researching certain topics — particularly topics related to civil war violence, criminality, and terrorism — the best data tends to come from people who are implicated in violence and criminality. Researchers are required to construct credible protections for terrible people to speak candidly about terrible things that they have done. I put a great deal of energy into worrying about the practicalities of how to protect the identities of these people while conducting research for my forthcoming book, Warlords and Coalition Politics in Post-Soviet States. The book attempts to explain how stable authoritarian regimes emerged after state failure in Central Asia and the Caucasus. This short essay, which draws heavily on material from my book, is meant to illuminate the practical trade-offs between anonymity promises — which are necessary for the researcher to get decent data in the first place, and to credibly present herself as an academic and not as a spy — and emergent norms of data replicability.

I traveled to Central Asia for the first time in 2005 and proceeded to spend nearly 25 uninterrupted months in Tajikistan and Georgia between 2006 and 2008. I employed ethnographic methods — including living in a remote village on the Tajik-Afghan border — to gain rapport with interview subjects. I gradually forged relationships with academics, journalists, expatriate professionals, current and former government employees, and eventually many former militia members in both countries. I conducted over 300 interviews, 173 of which were with former combatants.

At first, I naively tried to avoid implicating my respondents in descriptions of abjectly criminal activities. It quickly became obvious that this was impossible. I settled for a policy of total anonymity for all respondents. I recorded only the subject’s first name, or a pseudonym, and interview date to preserve anonymity. In a few cases, the subject insisted that I record his full name. For my own safety, and that of my respondents, I never complied with these requests.

Many of my respondents used to be employed in the police or the army. At the time of the interview, many were marginally employed. Many of these men had paid severe costs in order to “capture the state” and had succeeded in transforming themselves from paramilitaries into uniformed members of state security services; later they had themselves been purged. Many held bleak and ironic attitudes, and could angrily recall the jarring transition from being “the law” to being “on the wrong side of the law” in a few arbitrary weeks. In early interviews, as a strategy to collect individual-level characteristics to predict which militia members joined the state, which factions ended up with which jobs, and which factions were “weeded out,” I had respondents sketch pictures of the security structures at different periods of time. Many early respondents gamely played along. Though the exercises quickly devolved into incoherence — pages filled with circles, lines, and scribbled names that I realized would never be coded or systematized — the process was intoxicating. Meandering conversation threads gradually gave way to more structured discussions about who was getting what, how and when side-switching between militia commanders was considered, and the like. For a time, I was sure I was getting the real story. For a time, this arcana made me feel cool, like I was a character on The Wire.


But it did not stay cool for very long. I was living in Kyrgyzstan in 2005 when the Andizjan events unfolded across the border in Uzbekistan. There was an attempted prison break that was put down with great violence by Uzbekistan’s government. Many people died. For a few very uncomfortable weeks, it was not clear whether the refugees who had fled to Kyrgyzstan would spread a contagion of ethnic fear across the volatile Ferghana Valley or if the war in Afghanistan was creeping north.1 I was living in Kyrgyzstan’s capital city of Bishkek at the time, many hundreds of kilometers away from the violence. But I recall distinctly the feeling of vulnerability when the internet stopped working correctly. I decided that if I was going to continue operating in an unfriendly authoritarian environment, I needed to adapt. I quickly educated myself about how internet servers work. I stopped assuming my email communications were private. For important topics I began to rely upon pen and paper (see Figure 1). But these realizations gradually transformed my relationship with “the field.” Especially in Tajikistan, it meant proceeding very slowly over months that turned into years. After a few frightening encounters, I was warned by a trusted advisor that my research design was making me look quite a bit like a spy, and that if I ran afoul of the wrong character in the wrong security bureaucracy, it could easily result in my permanent disappearance. The risks to my local contact network were, if anything, even more serious. I was asking the questions that locals had learned not to ask.2

Figure 1: Why I Did Not Write Any Names Down On My Fieldnotes, Or Take Notes On My Computer. Image source: http://xkcd.com/538/.

Certain subject matter was carefully filtered by the respondents themselves. It was very rare for anyone to discuss money in specific denominations. I learned a lot about the banalities of racketeering, price-fixing, money laundering, and other mainstays of the post-Soviet underworld, but specific information about the dispensation of family assets was carefully guarded. Details were not volunteered and I never asked. Resentment towards the current generation of “big men” was tempered by the realities of the embedded favor economy. There are strong norms against taking the side of outsiders against the nation, tribe, or family. I never stopped being an outsider. My field interviews improved substantially when I accepted the limits of my role. I acquired a reputation for being primarily interested in political stories and having no particular local agenda. I also took conscious steps to control the subject matter, intentionally shying away from asking respondents about their own violent activities. The main way I did this was to make it clear that I was studying institutions — militia structures — and not people. I overcame my skepticism of questions like “in your group did you observe …” when I realized that respondents could describe their own actions in the third person, with plausible deniability and emotional distance from actions that were regretted in retrospect.3 But interviews often sprawled, and what I ultimately ended up collecting were life histories. With time, as I learned to ask the right questions, and behind the veil of anonymity that my methods provided, I received rich anecdotes.

1 Akiner (2005) went to Andizjan two weeks after the uprising to conduct interviews and concludes that the demonstrations were a “carefully prepared” attack on the Uzbek government organized by armed militants (part of the “Akromiya movement”) who were multi-national in composition (30-31, 27-29). Her version of events is contradicted by the OSCE (2005), Ilkhamov (2006), Kendzior (2006), Bukharbaeva (2005), and Daly (2005). Although relations between the West and the government of Uzbekistan deteriorated significantly as a result of the Andizjan events, I am certain that most of the readers of this essay have never even heard of Andizjan; Central Asia is very remote from American national interests.

2 A Tajik doctoral student and personal friend was recently placed under house arrest in Tajikistan pending a trial for charges of treason and espionage associated with qualitative observational research (conducting interviews in Badakhshon) on behalf of a non-Tajik Principal Investigator. He was allowed to leave the country and currently resides in Canada, but he may never be allowed to return. Journalists and writers are routinely intimidated in Central Asia. Independent of each other, both of my regular translators in Tajikistan (both aspiring journalists) requested that I omit their names from my book and all future publications.

3 I acknowledge that there is some risk that the leading structure of the interview questions biased respondents towards remembering the worst in their counterparts, which may mean that the book’s argument over-emphasizes the role of criminal sociopathy in small unit organizations. On the other hand, I am absolutely confident that the war brought a lot of sociopaths out of the woodwork in both Georgia and Tajikistan.


Not everyone trusted me at first. Some respondents left the interview not trusting me at all (and who could blame them?!). It is reasonable to second-guess the motives of networked strangers who arrive from faraway states to ask questions about a war. Having extended conversations about post-Soviet security structures occasionally led respondents to ask pointed questions about my actual goals, leading to uncomfortable and conspiracy-laden conversations about whether old Russian phone taps had been replaced with American post-9/11 security assistance. I took notes but did not record interviews (Derluguian, 2005), and I am sure that if I had started taping voices I would have received less access and very different kinds of data. Though there is technically an amnesty law on the books, many respondents became visibly uncomfortable when the conversation turned to certain topics, especially their opinion of Tajikistan’s long-sitting president Emomalii Rakhmon. But strange as it sounds, many of the respondents who assumed I was a spy, or affiliated with the U.S. military, were often more willing to engage with me. I can only speculate why this was the case. Perhaps they saw me as being part of the same fraternity of warrior-defenders, or they believed that we shared the experience of making bad youthful decisions and getting in over our heads.


It is also the case that some of my respondents were sociopathic liars, and that some respondents misrepresented basic facts about events, their beliefs, and their roles. Still, I listened hard, asked follow-up questions when I could, and usually stayed until the person wished to stop talking. But sometimes I got scared. The interview subjects always noticed. The interview ended shortly afterwards. That I conducted so many interviews is evidence that, from my perspective, the participants in these wars were not all thugs, gangsters, and sociopaths.

I could not have created the necessary intimacy if they were. Obviously there were strategic reasons to present distorted and self-serving versions of certain facts, but most respondents seem to remember doing what they thought was appropriate at the time — and some of it was horrible. After a few uncomfortable conversations, I tried to enforce a policy of stopping interviews when subjects divulged personal memories of violence or war crimes. But this was not always possible. Many interviews had a confessional aspect to them. Some men tried to shock me with graphic descriptions of tactics — inflicted or received — in order to test, embarrass, or entice me. As an author, I have learned that there is no catharsis to be gained by seeing these descriptions in print. Lurid descriptions of mutilations, written from the safety of the ivory tower, cheapen the horror of the experiences for participants. The deepest scars — post-traumatic stress and survivor guilt — are invisible. Ethnographers working in conflict zones eventually learn what police detectives and competent military professionals have understood for time immemorial: there are serious mental and emotional costs associated with treating sociopaths as subjects.4

Authoritarianism is more a set of practices than a kind of state, and sometimes fears that originate in authoritarian environments follow researchers home to their “open access” societies. I spent a lot of time worrying about what I would do if my field notes were subpoenaed by the FBI to assist the Russian or Tajik government in dismembering the networks that I accessed for my research. It is clear that even liberal states can, and if they deem it necessary will, deploy the law to place academics in horrible double-binds.5 After returning to the United States, and participating in conversations with a few West Point professors who teach classes specializing in network analysis, I realized that to limit my own liability it was necessary to physically destroy all of my field notes that could be used to reconstruct contemporary networks (including all those pages of circles, sketches, and names described above). Some unlucky PI will likely be forced to choose between “protecting his sources” (i.e., not releasing data that was acquired under explicit promises of confidentiality) and spending time behind bars, and I did not want it to be me. I suspect that in the future every researcher — qualitative or quantitative — who collects data related to contemporary terrorism, criminality, the drug war, or civil war violence in general is going to have to self-define their role with a great deal more self-awareness.

4 I occasionally wonder how my life would be different if I had really read and absorbed E. Valentine Daniel (1996) before going into the field.

5 Human subjects protections promised by Boston College were more or less moot once Northern Ireland police decided that they needed access to the names in the researcher’s archive to prosecute the killing of Jean McConville. For more details, see https://bostoncollegesubpoena.wordpress.com/.



People do not, as a rule, like to be studied. And as Jarvis Cocker observed, “everybody hates a tourist.” People do, as a rule, like the idea that their stories will be recorded for posterity. Ethnographers — by their invasive presence — force these decision heuristics into conflict. As the line between researcher and subject blurs, what occasionally emerges is a crucible for creative, cooperative theory-building. People can tell when their words are being received with empathy (i.e., when they are being treated as subjects) and when their words are being clinically recorded for some other purpose (i.e., when they are being treated as objects). I found that once a subject decides that the researcher is actually listening — and taking anonymity promises seriously — the researcher gets much better data.

References

Akiner, Shirin. 2005. Violence in Andijan, 13 May 2005: An independent assessment. Technical report.

Bukharbaeva, Galima. 2005. Testimony: Statement of Galima Bukharbaeva, Correspondent, Institute for War and Peace Reporting. Technical report, Helsinki Commission Hearing.

Daly, John C. K. 2005. “The Andijan disturbances and their implications.” Central Asia-Caucasus Analyst 29.

Daniel, E. Valentine. 1996. Charred Lullabies: Chapters in an Anthropography of Violence. Princeton, NJ: Princeton University Press.

Derluguian, Georgi M. 2005. Bourdieu’s Secret Admirer in the Caucasus: A World-System Biography. Chicago, IL: University of Chicago Press.

Ilkhamov, Alisher. 2006. “The phenomenology of “Akromiya”: Separating facts from fiction.” China and Eurasia Forum Quarterly 4(2006):39–48.

Kendzior, Sarah. 2006. “Inventing Akromiya: The role of Uzbek propagandists in the Andijan massacre.” Demokratizatsiya: The Journal of Post-Soviet Democratization 14(4):545–562.

OSCE. 2005. Preliminary findings on the events in Andijan, Uzbekistan, 13 May 2005. Technical report, OSCE Office for Democratic Institutions and Human Rights.

Sensitive Data and the French State
by David Laitin

Stanford University

Much attention has been directed at the obligations of scholars in protecting individuals from assaults on their privacy. Indeed, the entire IRB apparatus has been constructed on that concern. Less attention has focused on states and their obligations to protect their citizens from the data collection they sponsor and the use of those data for social analysis. In France, the tables are somewhat turned. There has been little attention to IRB approval of individual research projects, but the rules for the scientific use of state-collected data are a matter of high public concern. This is especially the case concerning data about the ethnic composition of France’s population.

This essay will first review France’s approach toward official ethnic data. It will then discuss the political pressures for change. Finally, it will describe the outlines of the new regime. This emerging regime seeks to balance an ideology of republicanism and a concern for the privacy of its citizens. In so doing, it recognizes the need to understand ethnic processes so that public officials can better address fundamental social, political, and economic problems.

I. The French Republican Tradition1

French republican ideology emphasizes that the state has no interest in knowing the ethnic past of any of its citizens, as from the state’s point of view, all citizens are equally French. The nation, according to the French sociologist Dominique Schnapper, is a “community of citizens” (Schnapper, 1998, 16–17). It is not, she has insisted, an ethnic group.2 In her republican framework, ethnic attachments are irrelevant for national membership. Furthermore, state recognition of ethnic groups undermines national solidarity. As a member of the Constitutional Council (the highest constitutional authority in France) that had authority over issues of national accounting, her views had considerable sway.

Schnapper’s views on citizenship, ethnicity, and the nation, which she partially revised after retiring from the Constitutional Council and returning to research, are deeply shared across the political spectrum in France. They are associated with the radical republican ideals that reached ascendance in France in the early twentieth century when the principal social divide was transformed from class conflict to religious conflict. The early 20th century divide between the seculars and the church reflected battles going back to the Reformation, as France moved back-and-forth between full support of the Catholic Church and then issuing edicts proclaiming toleration for all cults.3 The republican focus on secularism, called laïcité, entailed a reining in of the gains made by the Church in getting full control over public education during the period of the Second Empire (1852-70). As the Second Empire collapsed amid the Paris Commune of 1871, and in a fit of republican frenzy signaling their hatred of the Church, radicals executed the Archbishop of Paris.

1 This section draws from the Appendix of Adida, Laitin and Valfort (2015). Thanks to Patrick Simon for his comments on an earlier draft of this section.

2 She moderates her position in Schnapper (2007).

The Republicans won the first elections of the 3rd Republic in 1879 and began a “grande laïcisation de l’État”. Laws were passed repealing the Sunday closings of businesses, ending Church control over hospitals, and secularizing cemeteries. A law formally separating church and state was passed in 1905, granting freedom of conscience to all, with restrictions that could only be justified by the need for public order. While only four “cults” (Catholic, Lutheran, Calvinist and Jewish) received official recognition, all others were free to practice. For the next century, Republicans made further assaults on Church subventions, though by the 5th Republic (while the religious cleavage still revealed typical voting patterns) there were no more outstanding substantive issues on the republican agenda. But suddenly the issue of laïcité took on new meaning in 1989 with the “foulard,” or “headscarf,” incident, in which confessionals and lays united as “seculars” against the public display of Muslim identity (Bowen, 2006).

One implication of the republican consensus is that the collection of ethnic data by the state has long been an anathema. This has severely restricted careful social analyses of immigration patterns. For example, in an important study seeking to measure intergenerational inequalities linked to immigration led by the state-affiliated research group INSEE (Institut National de la Statistique et des Études Économiques), the researchers could not distinguish between descendants of French nationals born abroad (the “pieds noirs”) and descendants of immigrants from the same country. The authors confessed that “the fact that they cannot be distinguished in our data leads to an underestimation of the negative effects of inadequate social capital among immigrants” (Meurs, Pailhé and Simon, 2006, 676).

The issue of ethnic data and its collection has continued to be debated among state statisticians and social scientists. The flavor of this debate has been nicely captured in a special issue of French Politics, Culture and Society. Its history is convoluted.4 In the grand republican era, official statistics recognized only those who were “French”, those who were “French by acquisition” (though with no legal status differentiating them from the first category), and those who were “foreigners.” However, in the Vichy era (for Jews) and today (for Arabs), republican principles were compromised as the state differentiated them from Catholic-heritage French.


Despite these deviations, considered a stain on republican principles, French law continues to be resolutely republican. A 1978 law prohibited the collection or processing of data that reveal the racial or ethnic origins (or political, philosophical or religious opinions, or union membership, or health, or sexual life) of persons. The French Data Protection Authority (CNIL) was then established to assure more assiduous implementation of the law. In 2005 HALDE (High Authority for Antidiscrimination and Equality) was created and funded by the state, but it was unable to infer general trends of discrimination since it could not collect systematic data on the employment of different ethnic or religious groups.5

3 See Bertossi (2012) for an analysis of the changing foci of republican thinking in France vis-à-vis the immigrant “other” over the past century. The following account relies on Zuber (2008).

4 For a comprehensive discussion of that history, as the state statistical bureaus navigated between a desire for objectivity of categories and one that could accurately depict the legal and historical aspects of the French population, see Simon (1999).

5 HALDE is now institutionally embedded in “Défenseur des Droits,” an independent constitutionally-mandated authority (since 2008) charged with ensuring the protection of citizens’ rights and liberties, and promoting equality (see http://defenseurdesdroits.fr). It appears it has broader investigatory powers than did HALDE.

II. Political Pressures for Change

With the obvious facts of discrimination (supplemented by a number of sociological investigations that skirted legal constraints) in the public realm, INED (Institut National d’Etudes Démographiques) researchers collected more socio-cultural information on the French population. Their survey in 1992, called “Mobilité Géographique et Insertion Sociale,” asked respondents about native language, something that served as a proxy for what they called “ethnic belonging.” This created a storm of protest within INED. Opponents to these surveys argued that the survey instrument made arbitrary judgments. Kurds, for example, were distinguished from Turks, but not Catalans from Spaniards (Blum and Guérin-Pace, 2008). Researchers in INED, as well as INSEE, were split between those who felt that republican ideals could best be met by addressing discrimination through the analysis of ethnic data and those who felt republican ideals would best be met with a state that did not recognize its citizens through an ethnic lens.6

The controversy has not abated. On the one hand, exceptions to the restrictions from 1978 were granted in recognition of the need for a clearer understanding of the diverse nature of the population. A law passed in 2006 (“loi pour l’égalité des chances”) had provisions that opened the possibility for tests of discrimination that would allow researchers to indicate through subtle signals the ethnicity of job or housing applicants. On the other hand, there were public outcries against the relaxation of republican principles. An inflammatory petition published in the leftist newspaper Libération argued that ethnic data were not necessary in the battle against discrimination (Amadieu et al., 2007). Data, the article stipulated, could only provoke inter-ethnic confrontations (“affrontements communautaires”). Patrick Simon and co-authors from their perch as INED researchers responded in Le Monde, bringing the battle of the statisticians into the public realm. Simon then criticized the radical republicans for making a “choice of ignorance” by seeking “equality through invisibility” (Simon, 2008, 8). Amid this brouhaha, the provision in the 2007 law that allowed for a census of the ethnic/racial backgrounds of the French population was subsequently declared unconstitutional by the Constitutional Council.

Having experienced the tensions in the immigrant banlieues in his role as Minister of Interior, Nicolas Sarkozy, on becoming French President, sought to give France the “statistical tools permitting it to measure its diversity.” He demanded that these tools be “objective and uncontestable.” Yazid Sabeg — an Arab born in Algeria, educated by the Jesuits, with a Ph.D. from the Sorbonne, and subsequently a successful businessman — was appointed Commissioner of Diversity and Equality of Opportunity (Smith, 2005). Rather than drafting a new law, he appointed François Héran, at that time Director of INED and president of the European Association of Population Studies, to draft a report addressing the president’s charge (Eeckhout, 2009). Ultimately, the report sought a balance between “tout ethnique” and “l’ethnicité zero.” The balance would be in defining the circumstances and the guarantees that can make the collection of any such data useful and legitimate.7

The trend, then, became greater openness to ethnic data collection. In 2008-2009 a collaboration of INED and INSEE researchers constructed a super-sample of immigrants from diverse backgrounds in order to measure their trajectories of integration into French society (Beauchemin, Hamel and Simon, 2010). Second generation migrants were identified even though 95% of them were French citizens and, by republican standards, should not be distinguished from those French with deeper roots. Furthermore, the religion question was asked. While the data are heavily guarded and released only to members of the scientific community, this was a big step forward for data collection on ethnicities in France.

There is yet another force pushing France down this same path — European research institutions. The European Convention on Human Rights demands fulfillment of anti-discrimination regulations that will likely require France to produce data on its record in combating racial and religious discrimination. Also, EU scientific bureaus provide substantial research funds to academics who collect cross-national data, with ethnic issues getting substantial attention. Being part of Europe, in other words, has slowly pushed France toward a European norm of sociological investigations and surveys on ethnicity. Even Dominique Schnapper, as I pointed out earlier, has accommodated to these European pressures and now argues that it is “impossible, politically and morally, for researchers to renounce their role in the creation of the self-awareness of a democratic society by establishing knowledge that is as objective as possible” (Simon, 2008, 29).

6 Thomas (2012) describes how Chirac’s Nationality Commission (1988) and the subsequent loi Méhaignerie, which changed the nationality law, were able to create an uneasy compromise among the strands of republicanism that ran across France’s political system.

7 Oddly, given the circumstances of the time with the Muslim question dominant in public debate, this report analyzed data collection for six types of ethnic discrimination but not that of religion. Religion, it is evident, was a cleavage too politically hot to handle.

III. A New Regime for Disseminating Secret Data

Access to sensitive data that is considered potentially threatening to individual privacy is now facilitated through a simple bureaucratic process by the French Data Archives for Social Sciences (Réseau Quetelet). The Quetelet network is under the responsibility of the Consultative Committee for Data in the Social Sciences (CCDSHS). It was created by decree in 2001 by the French ministries of the economy, employment, and national education and research. It reports directly to the minister in charge of research, and it has the responsibility to grant access to sensitive micro-data, to provide support for its utilization, and to document (through archiving and documentation) those data so that they are useful for research. Researchers register their scientific projects with the Réseau Quetelet and, if approved for meeting standards of scientific objectivity, can receive columns of data that are withheld from the public.

In a collaborative project with Jens Hainmueller in our new Laboratory of Migration and Integration at Stanford, we sought to re-analyze data from the Trajectoires et Origines dataset. However, the answers to the question of the professed religion of a survey respondent were not publicly available. Since our project was certified as scientifically objective by the Réseau Quetelet, we were quickly sent those data linked to an identification number for each respondent, thereby permitting easy merging.

However, with more sensitive data, strict rules of confidentiality are imposed, requiring accreditation from the Committee on Secret Statistics (Comité des Secrets). This committee, created in 1984, is an organ of the National Council of Statistical Information (CNIS); its mission is to give its opinion on whether these research projects have safeguards to ensure privacy, and whether they have sufficiently important ends to justify any risks to individual privacy. In 2008, there was a modification of the law that allowed individual researchers to have access to a wide range of government statistics so long as their research purposes were truly scientific (or historical). The role of the committee is to assess whether the scientific purpose can be achieved in any way without using the sensitive data that was requested, to monitor the required promise that researchers will destroy the files containing the data shortly after the research project has been completed, and to assess whether the researcher has any personal stake in the findings. The committee meets every three months to review proposals, and researchers must appear before the committee to support their scientific claims.

Heavy fines are imposed for any misuse of the data provided to researchers. If approval is granted, the researcher is given a data box to store all of the results from the statistical models, and this can be done with remote access from selected locations within France. However, the final files with statistical tables must get reviewed by a representative of the Committee on Secret Statistics before downloading. But under no conditions would it be possible to download the raw data, as these data never leave the server and are encrypted. This limit may make it impossible to publish work in those journals that demand a replication dataset.

While the professed religion of survey respondents took only days to receive (though requiring a hard copy application sent to France), we needed to apply to the Comité des Secrets when we sought data on the precise department and year of arrival of immigrants in the French census. We proposed doing a regression discontinuity design on a program called the Contract of Welcoming and Integration (Contrat d’Accueil et d’Intégration), which was rolled out at different times across departments. We wanted to know if those immigrants who were subject to this Contrat integrated into France more successfully than those who were not subject to it (i.e. those who arrived in France months before the Contrat was imposed). Here, clearly, the disaggregation of the data could conceivably allow us to identify subjects as individuals rather than statistical observations, and thus extra provisions were necessary. After waiting several months to get onto the Comité’s calendar, our proposal was accepted, and we now have access to those data.
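As a stylized illustration of that design, the basic comparison is between immigrants who arrived just after and just before their department’s rollout date. All variable names and numbers below are invented for this sketch; the actual analysis uses the restricted census files.

    import numpy as np

    rng = np.random.default_rng(3)

    n = 2000
    # Arrival month relative to the department's rollout of the Contrat
    # (0 = first month in which new arrivals were subject to it).
    arrival = rng.integers(-24, 24, size=n)
    subject = arrival >= 0
    # Toy integration outcome with a 0.10 jump at the cutoff.
    outcome = 0.4 + 0.10 * subject + rng.normal(0, 0.5, size=n)

    # Compare mean outcomes just after vs. just before rollout,
    # within a +/- 6 month bandwidth around the cutoff.
    bandwidth = 6
    window = np.abs(arrival) < bandwidth
    effect = outcome[window & subject].mean() - outcome[window & ~subject].mean()
    print(f"estimated effect of the Contrat: {effect:.3f}")  # about 0.10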


Now that we have been granted permission, what are the implications for publication in political science journals demanding access to the raw data as a condition for publication? There needs to be some accommodation, and a precedent for such an accommodation already exists. In a paper that is now in press at the Journal of Political Economy (JPE), my co-authors and I relied on data considered sensitive by INSEE and thus required an official authorization to get access to them (Algan, Hémet and Laitin, Forthcoming). Indeed, the nature of the data (information about individuals’ national origins) and the very local geographic level at which the study is conducted (housing block) could, if used improperly, violate the privacy of French citizens. As a consequence, these data are not public and we got access to them as part of a convention between INSEE and Sciences Po, the institutional home of my co-authors. We were required to make use of these data within the confines of INSEE and we had to use a secured access point to analyze them. The JPE (after accepting the paper) questioned whether we could publish it without access to the raw data by other researchers that would permit replication. We stressed to the editor that these data are available to any researcher once INSEE agrees to the scientific merits of the project through a remote secured system (called CASD) that has been developed over the past five years in France. The JPE agreed to an exemption to its replication requirement after we agreed to provide all the necessary information explaining how to get access to these data through the CASD.

France has not lost touch with its republican traditions, and its focus on state responsibility towards its citizens rather than individual researcher responsibility to their subjects represents the French statist tradition. Yet French authorities are slowly coming to terms with the need to know more about their diverse populations. Their evolving policy around a high state council of secrets may sound highly bureaucratic to American ears. However, French concern that their citizens not be thought of as “Muslims” or “Africans” or “of immigrant background,” but rather as “French,” is not an ignoble concern. It is nonetheless an approach to the right to privacy that is increasingly cognizant of the need for the objective evaluation of its integration policies, much to the benefit of the social scientific community and public policy attempting to confront discriminatory processes.

References

Adida, Claire L., David D. Laitin and Marie-Anne Valfort. 2015. Why Muslim Integration Fails in Christian-Heritage Societies. Cambridge, MA: Harvard University Press.

Algan, Yann, Camille Hémet and David D. Laitin. Forthcoming. “The social effects of ethnic diversity at the local level: A natural experiment with exogenous residential allocation.” Journal of Political Economy.

Amadieu, Jean-François, Patrick Weil, Dominique Sopo, Samuel Thomas and Mouloud Aounit. 2007. “Engagement républicain contre les discriminations.” Libération 23.

Beauchemin, Cris, Christelle Hamel and Patrick Simon. 2010. Trajectoires et origine. Documents de travail 16, INED and INSEE.

Bertossi, Christophe. 2012. “French republicanism and the problem of normative density.” Comparative European Politics 10(3):248–265.

Blum, Alain and France Guérin-Pace. 2008. “From measuring integration to fighting discrimination: The illusion of ‘ethnic statistics’.” French Politics, Culture and Society 26(1):45–61.

Bowen, John. 2006. Why the French Don’t Like Headscarves: Islam, the State, and Public Space. Princeton, NJ: Princeton University Press.

Eeckhout, Laetitia Van. 2009. “Pour mesurer la diversité, Yazid Sabeg renonce à légiférer.” Le Monde.

Meurs, Dominique, Ariane Pailhé and Patrick Simon. 2006. “The persistence of intergenerational inequalities linked to immigration: Labour market outcomes for immigrants and their descendants in France.” Population 61(5):645–682.

Schnapper, Dominique. 1998. Community of Citizens: On the Modern Idea of Nationality. New Brunswick, NJ: Transaction Publishers.

Schnapper, Dominique. 2007. Qu’est-ce que l’intégration? Paris, France: Gallimard.

Simon, Patrick. 1999. “Nationality and origins in French statistics: Ambiguous categories.” Population 11:193–219.

Simon, Patrick. 2008. “The choice of ignorance: The debate on ethnic and racial statistics in France.” French Politics, Culture and Society 26(1):7–31.

Smith, Craig S. 2005. “What makes someone French?” New York Times.

Thomas, Elaine. 2012. Immigration, Islam, and the Politics of Belonging in France. Philadelphia, PA: University of Pennsylvania Press.

Zuber, Valentine. 2008. “La séparation des églises et de l’État en France et à Genève (1905-1907): Une solution pour deux histoires.” French Politics, Culture and Society 26(1):71–91.



Survey Methods for Sensitive Topics
by Graeme Blair

Columbia University

When comparativists rely on survey data, they implicitly invoke an important assumption: that respondents answered truthfully. This assumption may be violated when there are incentives to conceal the truth. Social pressures, fears of retaliation, and possible legal sanctions may drive respondents to answer questions in the least revealing way rather than entirely honestly. These incentives can operate when the topic is unmistakably sensitive — for example, in surveys about participation in protests or support for an unpopular policy — but they may also affect seemingly innocuous subjects like voter turnout. When there are incentives to conceal, our inferences about respondents (e.g. what proportion of them shared information with a militant) will be biased. Moreover, in some cases it will be difficult to determine the size or even the direction of the misreporting bias.

Incentives to conceal may even affect whether respondents agree to participate in a survey or to offer a response to a sensitive survey question. If the responses of participants — even truthful ones — differ from those of non-participants, then inferences about the population from the survey data (e.g. what proportion of Nigerians shared information with a militant) will suffer from nonresponse bias.

What can we do about these misreporting and nonresponse biases? In what follows, I review four survey techniques used by comparativists to address incentives to conceal truthful responses. (I focus on binary "yes" or "no" questions, though most of these techniques can be extended to other kinds of outcomes, such as numerical responses.) I first review survey administration practices designed to protect sensitive responses. For contexts in which these are insufficient, I review three experimental methods that can be used in addition and that avoid soliciting exact answers to sensitive questions altogether. The experimental methods enable comparativists to ask survey questions that could not otherwise be asked due to ethical concerns and the risk of bias. However, these methods require additional assumptions that are often not testable, necessitating careful design and pilot testing. I conclude with a discussion of common critiques of the experimental techniques.

I. Survey Administration Protections

The first and most common approach is to build trust with respondents by implementing and communicating measures to protect sensitive answers through changes in how the responses are collected and stored. If respondents find these confidentiality measures convincing, they may be more likely to respond, and to respond truthfully. Specific approaches used by researchers include:

• Separating sensitive items from names, contact details, and questions that identify individuals such as exact age or family size (permanently, or via codes that can only be accessed by the researcher);

• Locking up paper surveys or encrypting electronic surveys;

• Ensuring that interviews take place in private locations without bystanders;

• Using interviewers who share the age/social group/gender/etc. of the respondent to address fears of outsiders asking sensitive questions;

• Self-administration on paper; via a recording (Chauchard, 2013); by touch-tone telephone; or on a smartphone, tablet, or computer.

In each case, the sensitive survey questions themselves are not modified. As a result, the great advantage to these measures is the simplicity of interpretation and analysis. The "yes" and "no" answers to a sensitive question can be directly analyzed with standard techniques such as means and logit or probit regressions. No additional assumptions beyond truthful responses are required.
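Because the question itself is unchanged, the analysis is entirely standard. A minimal sketch in R (the data frame survey_data and the variables sensitive and age are hypothetical names, not from the original):

# Sketch: direct analysis of a protected "yes"/"no" item.
prevalence <- mean(survey_data$sensitive, na.rm = TRUE)  # proportion "yes"

# Logit regression of the sensitive response on a covariate
fit <- glm(sensitive ~ age, family = binomial, data = survey_data)
summary(fit)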

II. Randomized Response Technique

The randomized response technique protects respondents by introducing random noise into their responses, so that an observed response could reflect either a truthful answer to the sensitive question of interest or an irrelevant response. Several design variants exist (Blair, Imai and Zhou, 2015). For example, in the forced response design, a randomization device such as a coin or a die directs each respondent either to automatically answer "yes" or "no," or to answer the question of interest truthfully. An example estimating contact with militants in Nigeria illustrates the technique (Blair, Imai and Zhou, 2015):




For this question, I want you to answer yes or no. But I want you to consider the number of your dice throw. If 1 shows on the dice, tell me no. If 6 shows, tell me yes. But if another number, like 2 or 3 or 4 or 5 shows, tell me your own opinion about the question that I will ask you after you throw the dice. [TURN AWAY FROM THE RESPONDENT] Now you throw the dice so that I cannot see what comes out. Please do not forget the number that comes out. [WAIT TO TURN AROUND UNTIL RESPONDENT SAYS YES TO:] Have you thrown the dice? Have you picked it up? Now, during the height of the conflict in 2007 and 2008, did you know any militants, like a family member, a friend, or someone you talked to on a regular basis? Please, before you answer, take note of the number you rolled on the dice.

Individual responses are protected because a "yes" or "no" answer may be a truthful answer or it may indicate that the respondent rolled a 1 or a 6. To identify how many respondents said "yes" to the sensitive question, the "yes" and "no" responses are combined with the known properties of the randomizing device (i.e. a standard die has approximately a 1/6 probability of landing on each side).
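Concretely, under the die design above the share of observed "yes" answers equals (4/6)π + 1/6, where π is the true prevalence, so the estimate follows by inverting that identity. A minimal R sketch (the 0/1 response vector y is hypothetical):

# Sketch: prevalence estimate under the forced response design above.
p_forced_yes <- 1/6                      # die shows 6: forced "yes"
p_truthful   <- 4/6                      # die shows 2-5: truthful answer
pi_hat <- (mean(y) - p_forced_yes) / p_truthful

# Delta-method standard error for the prevalence estimate
se_hat <- sqrt(mean(y) * (1 - mean(y)) / length(y)) / p_truthful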

Noise can also be introduced in other ways. In the mirrored question design, a coin or die rolled by the respondent in private assigns her to answer either the sensitive question or its inverse ("yes" and "no" flipped). In the unrelated question design, respondents answer either the sensitive question or an unrelated innocuous question.

The chief advantage of the randomized response technique is that no individual response can be exactly identified by anyone. In addition, the level of protection — how much can be learned about sensitive individual responses — is directly controlled by the researcher, who chooses the randomizing device (coin, dice, spinner, etc.). The strong protection and control do come at a cost: compliance with the instructions is assumed, and this may be a strong assumption with low-education respondents or in contexts in which saying "yes" is itself sensitive. There are, however, designs and models to adjust for non-compliance.

For design and analysis advice, see Gingerich (2010) and Blair, Imai and Zhou (2015).

III. List Experiments

The list experiment conceals individual responses to a sensitive survey item by aggregating those responses with responses to several other control questions. The respondent replies to a list experiment question with a count of the number of "yes" responses to a list of questions that includes the sensitive item. In this way, each individual's "yes" or "no" response to the sensitive question is hidden within the "yes" and "no" responses to the other questions.

A second experimental group is used to estimate the proportion of respondents who said "yes" to the sensitive item (the quantity of interest). In this group, an identical question is asked, except that the list excludes the sensitive item. The average response in this control group is subtracted from the average response to the original question to identify the proportion of respondents who said "yes" to the sensitive item. An example on estimating support for coalition forces in Afghanistan illustrates the technique (Blair, Imai and Lyall, 2014):

I'm going to read you a list with the names of different groups and individuals on it. After I read the entire list, I'd like you to tell me how many of these groups and individuals you broadly support, meaning that you generally agree with the goals and policies of the group or individual. Please don't tell me which ones you generally agree with; only tell me how many groups or individuals you broadly support.

Sensitive Item Condition:
• Karzai Government
• Foreign Forces
• National Solidarity Program
• Local Farmers

Control Item-Only Condition:
• Karzai Government
• National Solidarity Program
• Local Farmers

Now, please tell me how many of these groups or individuals you broadly support?
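The estimator described above is a simple difference in means across the two randomly assigned conditions. A minimal R sketch (count is the reported number of supported items and treat the 0/1 condition indicator; both names are hypothetical):

# Sketch: difference-in-means estimator for a list experiment.
pi_hat <- mean(count[treat == 1]) - mean(count[treat == 0])

# Standard error of the difference in means
se_hat <- sqrt(var(count[treat == 1]) / sum(treat == 1) +
               var(count[treat == 0]) / sum(treat == 0))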



Several list experiment applications have revealed increased respondent willingness to answer sensitive questions. However, a key disadvantage of the list experiment is that some individual responses are not protected. A respondent in the treatment group who answers that she supports all four groups is identified as a supporter of foreign forces in the example, and a respondent who says no groups is identified as a non-supporter of foreign forces. Neither attitude is protected. As a result, some respondents may answer dishonestly to avoid having their views identified. There are methods to detect and adjust for this behavior (Blair and Imai, 2012), but the lack of complete protection may make all respondents more cautious.

For applications in comparative politics, see Corstange (2009), Gonzalez-Ocantos et al. (2012), and Meng, Pan and Yang (Forthcoming). For design and analysis advice, see Corstange (2009), Imai (2011), Blair and Imai (2012), Kramon and Weghorst (2012), Glynn (2013), Aronow et al. (2013), and Imai, Park and Greene (Forthcoming).

IV. Endorsement Experiments

The endorsement experiment is useful for measuring attitudes toward a political actor such as an elected official (see Rosenfeld, Imai and Shapiro (Forthcoming) for an inversion of the design to measure attitudes toward a policy). The method protects individual attitudes toward the sensitive actor by combining them with attitudes toward one or more public policies. In the endorsement condition, randomly assigned respondents are asked a question about a policy and told that the sensitive political actor endorses the policy. An individual's affect toward the actor is protected because a positive response could reflect affinity for either the policy or the actor.

To separate affect toward the actor from policy preferences, randomly assigned respondents are asked an identical question, but without the endorsement. This identifies policy preferences alone. Affect toward the actor is identified by subtracting average policy preferences from the average response to the endorsement question. This is the "endorsement effect." An example from Lyall, Blair and Imai (2013) that solicits attitudes toward the Taliban illustrates:

Policy-Only Condition: "A recent proposal …"

Endorsement Condition: "A recent proposal by the Taliban …"

Both conditions then continue:

…calls for the sweeping reform of the Afghan prison system, including the construction of new prisons in every district to help alleviate overcrowding in existing facilities. Though expensive, new programs for inmates would also be offered, and new judges and prosecutors would be trained. How do you feel about this proposal? Do you strongly agree, somewhat agree, are you indifferent, do you disagree, or do you strongly disagree with this policy?
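As a rough sketch of the quantity described above, the endorsement effect can be computed as a difference in mean agreement across conditions (responses coded 1-5, higher meaning stronger agreement; agree and endorsed are hypothetical names). Published analyses typically rely instead on item response models such as those in the endorse package:

# Sketch: naive endorsement effect as a difference in means.
effect_hat <- mean(agree[endorsed == 1]) - mean(agree[endorsed == 0])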

The endorsement technique is perhaps the most protective of the methods because the direct sensitive question is not asked. Endorsement questions are also easy for enumerators and respondents to understand. There are two main downsides. First, the level of protection for respondents depends on the policies. In the example above, if most respondents disagree with the prison policy, an answer of "strongly agree" would indicate Taliban support. Second, the indirect nature of the question means that the magnitudes of endorsement effects are not directly interpretable without a behavioral assumption.

Examples of applications in comparative politics include Blair et al. (2013) and Lyall, Blair and Imai (2013). For design and analysis advice, see Bullock, Imai and Shapiro (2011) and Blair, Imai and Lyall (2014).


V. Analysis of the Experimental Methods

Regression and other standard analyses can easily be conducted for the randomized response, list, and endorsement techniques using free software in the R statistical environment with the "rr," "list," and "endorse" packages (Blair, Imai and Zhou, 2015; Blair, Zhou and Imai, 2015b; Blair and Imai, 2010; Shiraito and Imai, 2013). The rr and list packages also enable researchers to use randomized response and list experiment questions as predictors in a regression. Analysis is no more complicated than running a regression in Stata or R. For example, the list experiment regression command is:

ictreg(y.variable ~ x.variable, treat = "treatment.variable", data = my.data)
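For instance, a self-contained sketch that simulates a list experiment and fits it with ictreg() from the list package (the simulated data and variable names are hypothetical; J gives the number of control items):

# Sketch: simulate and analyze a list experiment with the list package.
library(list)

set.seed(1)
n <- 2000
treat <- rbinom(n, 1, 0.5)               # random assignment to conditions
x     <- rnorm(n)                        # a background covariate
z     <- rbinom(n, 1, plogis(-1 + x))    # latent sensitive item (unobserved)
y     <- rbinom(n, 3, 0.4) + treat * z   # count over 3 control items (+ item)

my.data <- data.frame(y, x, treat)
fit <- ictreg(y ~ x, treat = "treat", J = 3, data = my.data)
summary(fit)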

VI. Critiques of the Experimental Methods

I now discuss five critiques that have been leveled at the experimental methods described above.

1. There is still no incentive for respondents to answer truthfully. This is an empirical question, and one for which there is not yet conclusive evidence. Nevertheless, in the small number of validation studies that have taken place, respondents have been more forthcoming with sensitive information, and the estimates of known population parameters were closer to the truth using some of these methods compared to questions without protections (Rosenfeld, Imai and Shapiro, Forthcoming).

2. The list experiment and randomized response technique are confusing to enumerators and respondents. This is undoubtedly a problem. Carefully developing and pretesting instructions for respondents, directly training all enumerators, and identifying points of confusion by asking respondents a practice question can help.

3. Respondents see through the designs and do not comply with them to avoid any risk. An example illustrates this issue: in a list experiment in Afghanistan, zero of 2,754 respondents reported that they supported none or all of the groups mentioned — and these are the two responses that are not protected by the design (Blair, Imai and Lyall, 2014). This problem may be avoided by piloting different designs to find one that provides sufficient protection to encourage participation and compliance.

4. Low power. Each method requires a larger sample size than do direct questions. To improve power, researchers can use the "double list experiment" (Glynn, 2013), ask each control question separately in the list experiment control group (Corstange, 2009), ask multiple policy questions for the endorsement experiment, or combine responses from multiple measurements (see Aronow et al. (2013) on direct questions and list experiments, and Blair, Imai and Lyall (2014) on list and endorsement experiments).

5. Difficult to design. Intensive fieldwork is needed to identify appropriate question designs and instructions. Multiple pilot tests are also often needed to design the control items for list experiments and the policies for the endorsement experiment.

VII. Discussion

By mitigating the biases from incentives to conceal truthful responses, these four methods enable comparativists to ask questions that previously could be asked only after building trust over long periods of time with small pools of respondents. The choice of which method to use will depend on the context, but the choice can be informed by the theoretical differences discussed here and by careful pilot testing. Moreover, the methods are not mutually exclusive. Survey administration protections can be combined with the experimental methods, and multiple experimental methods can be used for measurement (Blair, Imai and Lyall, 2014). Future research should validate these methods in contexts studied by comparativists in order to increase confidence in them and provide more concrete advice on the choice of techniques.

References

Aronow, Peter M., Alexander Coppock, Forrest W. Crawford and Donald P. Green. 2013. "Combining list experiment and direct question estimates of sensitive behavior prevalence." arXiv preprint arXiv:1312.1268.

Blair, Graeme, C. Christine Fair, Neil Malhotra and Jacob N. Shapiro. 2013. "Poverty and support for militant politics: Evidence from Pakistan." American Journal of Political Science 57(1):30–48.

Blair, Graeme and Kosuke Imai. 2010. "list: Statistical methods for the item count technique and list experiment." Available at The Comprehensive R Archive Network (CRAN).

Blair, Graeme and Kosuke Imai. 2012. "Statistical analysis of list experiments." Political Analysis 20(1):47–77.

Blair, Graeme, Kosuke Imai and Jason Lyall. 2014. "Comparing and combining list and endorsement experiments: Evidence from Afghanistan." American Journal of Political Science 58(4):1043–1063.

Blair, Graeme, Yang-Yang Zhou and Kosuke Imai. 2015a. Design and analysis of randomized response technique. Technical report.



Blair, Graeme, Yang-Yang Zhou and Kosuke Imai. 2015b. "rr: Statistical methods for the randomized response technique." Available at The Comprehensive R Archive Network (CRAN).

Bullock, Will, Kosuke Imai and Jacob N. Shapiro. 2011. "Statistical analysis of endorsement experiments: Measuring support for militant groups in Pakistan." Political Analysis 19(4):363–384.

Chauchard, Simon. 2013. "Using MP3 players in surveys: The impact of a low-tech self-administration mode on reporting of sensitive attitudes." Public Opinion Quarterly 77(S1):220–231.

Corstange, Daniel. 2009. "Sensitive questions, truthful answers? Modeling the list experiment with LISTIT." Political Analysis 17(1):45–63.

Gingerich, Daniel W. 2010. "Understanding off-the-books politics: Conducting inference on the determinants of sensitive behavior with randomized response surveys." Political Analysis 18(3):349–380.

Glynn, Adam N. 2013. "What can we learn with statistical truth serum? Design and analysis of the list experiment." Public Opinion Quarterly 77(S1):159–172.

Gonzalez-Ocantos, Ezequiel, Chad Kiewiet De Jonge, Carlos Meléndez, Javier Osorio and David W. Nickerson. 2012. "Vote buying and social desirability bias: Experimental evidence from Nicaragua." American Journal of Political Science 56(1):202–217.

Imai, Kosuke. 2011. "Multivariate regression analysis for the item count technique." Journal of the American Statistical Association 106(494):407–416.

Imai, Kosuke, Bethany Park and Kenneth F. Greene. Forthcoming. "Using the predicted responses from list experiments as explanatory variables in regression models." Political Analysis.

Kramon, Eric and Keith R. Weghorst. 2012. "Measuring sensitive attitudes in developing countries: Lessons from implementing the list experiment." Newsletter of the APSA Experimental Section 3(2):14–22.

Lyall, Jason, Graeme Blair and Kosuke Imai. 2013. "Explaining support for combatants during wartime: A survey experiment in Afghanistan." American Political Science Review 107(4):679–705.

Meng, Tianguang, Jennifer Pan and Ping Yang. Forthcoming. "Conditional receptivity to citizen participation: Evidence from a survey experiment in China." Comparative Political Studies.

Rosenfeld, Bryn, Kosuke Imai and Jacob Shapiro. Forthcoming. "An empirical validation study of popular survey methodologies for sensitive questions." American Journal of Political Science.

Shiraito, Yuki and Kosuke Imai. 2013. "endorse: R package for analyzing endorsement experiments." Available at The Comprehensive R Archive Network (CRAN).

Randomized Response: Foundations and New Developments

by Daniel Gingerich
University of Virginia

I. Overview

Randomized response refers to a family of survey techniques designed to alleviate bias due to social desirability and legal concerns in studies of sensitive behaviors and attitudes. The distinguishing feature of randomized response surveys is the introduction of a randomizing device into the survey response process in order to guarantee the confidentiality of responses to potentially jeopardizing survey items. In a randomized response survey, a respondent's observed response about a sensitive trait is jointly a function of the sensitive trait in question and the private (i.e. known only to the respondent) realization of the randomizing device.

The dependence of the observed response on the privately viewed realization of the randomizing device makes it impossible to deduce the respondent's trait status from the information she divulges to the enumerator. As a consequence, concerns about social approbation or potential legal sanctioning are hopefully alleviated, thereby liberating the respondent to respond freely about the sensitive trait. Indeed, each respondent's trait status remains unknown to all actors involved in the collection and analysis of the survey, so there is no condition under which a court order or other form of government action could be used to compel researchers to reveal the sensitive behaviors of their respondents. While the realization of the randomizing device viewed by any given respondent is unknown to the analyst, the distribution of realizations generated by the device is known prior to conducting the survey. This a priori knowledge makes it possible for the analyst to estimate the proportion of individuals who bear the sensitive trait and to develop a better understanding of the factors that contribute to its incidence.

Randomized response came into being nearly fifty years ago with the publication of a seminal paper written by the statistician Stanley Warner (Warner, 1965). In recent years, applications of the technique have included studies of topics as varied as abortion (Lara et al., 2006), social security fraud (Lensvelt-Mulders et al., 2006), corruption within public bureaucracies (Gingerich, 2010, 2013), the prevalence of xenophobia and anti-Semitism (Krumpal, 2012), cheating by undergraduates (Scheers and Dayton, 1987; Kerkvliet, 1994a; Fox and Meijer, 2008), drug use (Kerkvliet, 1994b; Dietz et al., 2013), demand for pornography and prostitution (De Jong, Pieters and Fox, 2010), sexual attitudes (De Jong, Pieters and Stremersch, 2012), and the impact of anonymity on altruistic behavior in a laboratory setting (List et al., 2004; Franzen and Pointner, 2012). Given the growing interest within comparative politics in sensitive forms of political behavior such as corruption, clientelism, vote brokerage, violent anti-government protest, and support for extremist groups, the technique may be of utility to much future work in the subfield. Moreover, randomized response data lend themselves fairly straightforwardly to multivariate statistical analysis, thereby allowing researchers to better comprehend the factors that drive such sensitive forms of political behavior.

II. Examples of Randomized Response Models

As mentioned above, randomized response models refer to a family of survey questioning strategies as opposed to any specific technique. In order to give a sense of how randomized response surveys work, I concentrate here on describing several variants of randomized response surveys appropriate for studying a sensitive trait that is binary in form. (More comprehensive descriptions of the varieties of randomized response surveys, including techniques appropriate for multivalued or continuous sensitive traits, can be found in Fox and Tracy (1986), Chaudhuri and Mukerjee (1988), Gingerich (2010), and Blair, Imai and Zhou (2015).) The key assumption that allows the analyst to identify the proportion of individuals bearing the sensitive trait from the observed responses generated by these techniques is called honesty given protection (Gingerich, 2010). This assumption requires that, given the privacy guarantee provided by randomized response, respondents answer truthfully as prompted by the randomized response survey design. Throughout the discussion that follows, I will assume that the honesty given protection assumption holds.

Notation. Consider a setting in which each respondent i in a randomly selected sample of size n is queried indirectly about her status on an unobservable sensitive trait, θi ∈ {0 ("absent"), 1 ("present")}. Our initial interest resides in estimating π = E[θi] = P(θi = 1), the proportion of individuals who bear the trait of interest; later we will also consider means of estimating the impact of determinants of the sensitive trait. The observed randomized response of a respondent is denoted by the binary outcome variable Yi ∈ {0, 1}, with n1 = ∑_{i=1}^{n} Yi.

Warner Model. In the original Warner model, a randomizing device prompts respondents to respond "true" or "false" to one of two statements. The two statements are identical, save for the fact that one is in the positive form and the other is in the negative form. The particular statement to which the respondent is directed to respond depends on the private realization of the randomizing device, thereby ensuring confidentiality. Figure 1 gives an example of a question asked using the Warner variant of the randomized response technique. The sensitive trait of interest is whether or not the respondent would be willing to pay a bribe to avoid a traffic ticket. The respondent is given a spinner divided into two regions A and B (with the probability that the spinner lands in region A equal to p, where p ≠ 1/2). Prior to answering the question, she is directed to spin the spinner, observing the section where the spinner lands in private. If the spinner lands in section A, she is directed to respond true or false with respect to statement A (the statement in the positive form). If the spinner lands in section B, she is directed to respond true or false with respect to statement B (the statement in the negative form).

[Figure 1: An Example Warner Randomized Response Item]

The confidentiality of respondents' responses is provided by the fact that an observed response of "true" or "false" could refer to either the statement in its positive form or its negative form, and, ipso facto, could in each case be either an admission of possession of the trait or a denial of possession of the trait.

Let Yi = 1 (Yi = 0) denote the event that respondent i's observed response is "true" ("false"). Given the structure of the Warner response process and the honesty given protection assumption, the probability of each type of observed response is:

P(Yi = 1) = pπ + (1 − p)(1 − π)
P(Yi = 0) = p(1 − π) + (1 − p)π.    (1)

Letting I(x) be an indicator function equal to 1 if x is true and 0 otherwise, the likelihood of the observed data in the sample is written

∏_{i=1}^{n} P(Yi = 1)^{I(Yi = 1)} P(Yi = 0)^{I(Yi = 0)},    (2)




with corresponding log-likelihood equal to

n1 ln P(Yi = 1) + (n − n1) ln P(Yi = 0).    (3)

Maximizing Eq. (3) with respect to the prevalence rate, π, gives the maximum likelihood estimate of the proportion of individuals who bear the sensitive trait:

π̂_W = (n1/n + p − 1) / (2p − 1).    (4)

The above shows that the prevalence of the sensitive trait can be deduced from the observed proportion of affirmative responses to the randomized response question in conjunction with the known properties of the randomizing device (i.e. the probability that the spinner lands in region A).
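As a quick illustration, Eq. (4) amounts to one line of R (y is a hypothetical 0/1 vector of observed "true" responses; the Crosswise Model below uses the same formula, with p set to the known share for whom the innocuous statement is true):

# Sketch: Warner prevalence estimate, Eq. (4).
p <- 0.7                                     # known P(spinner lands in A)
pi_hat_W <- (mean(y) + p - 1) / (2 * p - 1)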

Forced Response Model. A commonly used alternative to the Warner model is the so-called forced response model (Boruch, 1971). In the forced response model, respondents are given a randomizing device that generates one of three different outcomes, say 1, 2, 3, having corresponding probabilities p1, p2, and 1 − p1 − p2. The respondent begins by drawing a realization from the device. If the first outcome obtains, the respondent is directed to answer "true" or "false" to a statement indicating possession of the sensitive trait. If the second outcome obtains, the respondent is directed to simply answer "true." If the third outcome obtains, the respondent is directed to simply answer "false."

In this model, the confidentiality of responses stems from the fact that some proportion of the observed "true" and "false" responses are based solely on the realization of the randomizing device and not on the characteristics or experiences of the respondents. As such, respondents prompted to respond in reference to the sensitive trait will know that their observed "true" or "false" response will not reveal their trait status.

As above, let Yi = 1 (Yi = 0) denote the event that respondent i's observed response is "true" ("false"). The structure of the forced response data generating process implies that the probability of each type of observed response is as follows:

P(Yi = 1) = p1π + p2
P(Yi = 0) = p1(1 − π) + 1 − p1 − p2.    (5)

The log-likelihood of the sample for this model is as in Eq. (3), with the probabilities for the observed responses replaced by those presented above. Maximizing the log-likelihood, the maximum likelihood estimate of the proportion of individuals who bear the sensitive trait for this model is:

π̂_FR = (n1/n − p2) / p1.    (6)

As before, the estimate of the prevalence rate is a simple function of the proportion of affirmative responses and the known properties of the randomizing device.

Crosswise Model. The Crosswise Model refers to a sensitive questioning strategy that is mathematically identical to the Warner version of randomized response described above, but one that utilizes an innocuous statement indicating membership in a non-sensitive group in place of a randomizing device (Yu, Tian and Tang, 2008; Tan, Tian and Tang, 2009). Figure 2 gives an example of a question asked using the Crosswise Model.

[Figure 2: An Example Crosswise Item]

Respondents are presented with two statements and asked how many are true. The first statement is the innocuous statement, which indicates that the respondent's mother was born in October, November, or December. The second statement is the statement of interest, which, staying with our running example, denotes that the respondent would be willing to bribe a police officer in order to avoid a traffic ticket.

Protection is provided by constraining the manner in which respondents are permitted to respond. Respondents can give an answer of "A", which indicates that either both statements are true OR neither statement is true, or they can give an answer of "B", which indicates that exactly one of the two statements is true. As a consequence of merging the responses about bribery with those about one's mother's birthday, it is impossible to discern whether or not a respondent is willing to bribe based on the observed response of either "A" or "B".

In order to use this questioning strategy, one has to choose an innocuous statement with special characteristics. First, the proportion of individuals for whom the statement is true must be known in advance by the analyst. Second, the innocuous statement must be structured so that the proportion of respondents for whom it is true is not equal to 1/2. Third, membership in the group indicated by the innocuous statement must be private information (i.e. it must be known to respondents but not to survey administrators, and it must be known by respondents to be unknown to administrators). Finally, membership in the group indicated by the benign statement must be uncorrelated with the sensitive behavior of interest. As in the example provided in Figure 2, innocuous statements based on the birthdays of friends or relatives are likely to satisfy all these criteria. (To see that this model is mathematically identical to Warner randomized response, let Yi = 1 (Yi = 0) denote the event that respondent i's observed response is "A" ("B") and let p denote the proportion of individuals in the population of interest whose mother was born in October, November, or December. Then the probabilities of the two observed responses are exactly as in Eq. (1).)

Although the Crosswise Model has only been in existence for approximately six years, it has already been applied widely to topics such as cheating and plagiarism (Jann, Jerke and Krumpal, 2011; Höglinger, Jann and Diekmann, 2014), petty bribery and drug use (Gingerich et al., 2014), and tax evasion (Krumpal, 2012; Kundt, 2014). These studies have reported significant benefits to using the Crosswise Model over direct questioning and even over other variants of the randomized response technique.

III. The Protection-Variance Trade-off

Relative to direct questioning, the use of randomized response techniques necessarily entails a substantial reduction in the statistical precision of estimates of sensitive items. In this way, randomized response presents the applied researcher with a fundamental trade-off: greater protection of respondents — and, one hopes, lower bias due to legal or social desirability concerns — versus a loss of statistical precision due to the manner in which the technique intentionally introduces noise into the response process.

The highly transparent structure of randomized response designs makes it straightforward to assess the cost component of this trade-off (i.e. the increase in variance due to the use of a particular randomized response design). For the Warner Model, for instance, the variance of the estimate of the prevalence rate is:

var(π̂_W) = π(1 − π)/n + (1/n)[1/(16(p − 0.5)²) − 1/4],    (7)

where π(1 − π)/n denotes the sampling variance and (1/n)[1/(16(p − 0.5)²) − 1/4] denotes variation due to the randomizing device.

The second component of Eq. (7) captures how the characteristics of the randomizing device affect the precision of the estimate of the prevalence rate. The variance of the estimate increases as the probabilities of being prompted to respond to the statement in the positive form and the statement in the negative form converge to one another (i.e. as p approaches 1/2), and it decreases as those probabilities diverge from one another (i.e. as p approaches 0 or 1). Therein lies the trade-off between protection and variance. As p approaches 1/2, the amount of information contained in any observed response about whether or not the respondent has the sensitive trait gets smaller. This provides the respondent with greater protection. However, and for the exact same reason, the variance of the point estimate explodes as this happens. Given these considerations, a natural question for the applied researcher is how to structure the randomizing device so that the survey provides both a reasonable level of respondent protection and statistical precision. In the Warner model, for instance, how should one set p? If one were willing to assume that honesty given protection would hold at any level of protection, then this problem would have a simple solution: minimize variance by setting the protection level as low as possible (e.g. by setting p very close to 0 or 1 in the Warner Model). Of course, it is unreasonable to believe that the honesty given protection assumption would hold were one to do so.

One potentially useful way to think about the appropriate level of respondent protection is in terms of the minimally sufficient condition for plausible deniability. In other words, the applied researcher could attempt to gauge how low the level of protection could be set before respondents begin to feel that they could no longer plausibly deny possession of the trait based on their observed responses. Identifying this threshold is something that could be explored within the context of focus group sessions. More formally, researchers might perform experiments examining how randomized response prevalence estimates change as a function of the level of protection, then use these findings, taken in conjunction with the relevant variance formulas, to optimally select the protection level afforded to respondents. As it stands now, however, the level of protection in applied work seems to be chosen largely on the basis of intuition and sample size considerations. As such, I believe this is an area of sensitive survey design where further methodological development would be very useful.

IV. Extensions

I now consider several extensions to standard randomized response models. These include the incorporation of responses generated by direct questioning, explanatory models capturing the influence of predictors of the sensitive trait, and item response theory models for modeling multiple randomized response questions.

Incorporating Direct Responses

The working theory that leads one to use a sensitive survey technique like randomized response is typically not that everyone lies under direct questioning, but rather that those who have the sensitive trait of interest generally do, or that they do so at such a rate as to invalidate analyses based on direct questions about the trait. However, even if we believe that evasiveness under direct questioning would be extensive, in any given application it is probably unreasonable to assume that all individuals bearing the sensitive trait would lie or refuse to respond if queried directly. As such, sensitive survey techniques — used in isolation — throw away some potentially very useful pieces of information, namely, the affirmative responses of those respondents who would have been willing to tell us under direct questioning that they bear the sensitive trait.

Here I provide a brief overview of a framework developed in Gingerich et al. (2014) for jointly incorporating randomized responses and direct responses in surveys of sensitive behavior. That paper shows that there is a sense in which the joint approach offers the best of both worlds: the resulting estimator of the prevalence rate (or of the influence of a determinant of the sensitive trait) inherits all of the bias-reducing advantages of randomized response, while at the same time enjoying significant improvements in statistical precision (relative to randomized response alone) due to the incorporation of direct responses. Moreover, the use of the joint response model permits one to estimate a series of diagnostic parameters that reveal the extent to which a sensitive survey technique is truly necessary to study the phenomenon of interest in a given population.

The joint response approach is easy to describe. Each respondent in the sample is first queried about her status on the sensitive trait using a randomized response technique. Then, at a later stage in the survey, each respondent is asked to respond directly to a question about her status, with the explicit option of "choose not to respond directly" offered to her. Thus, the observed outcomes for each respondent consist of a vector Yi = (y^D_i, y^A_i), where y^D_i ∈ {0 ("absent"), 1 ("present"), ∅ ("unwilling to respond directly")} is the observed response when i is queried about the sensitive trait directly and y^A_i ∈ {0, 1} is the observed response when i is queried about the sensitive trait using the randomized response (or crosswise) technique designed to guarantee anonymity. The observed response set is thus an array with six distinct elements, Y = {(0, 0), (0, 1), (1, 0), (1, 1), (∅, 0), (∅, 1)}, with k ∈ Y representing an arbitrary element in this set. Henceforth, I will use the simplification Yi ∈ Y = {1, 2, ..., 5, 6}, where each natural number 1, ..., 6 represents one of the six distinct response combinations.

In addition to honesty given protection, the joint response approach makes an assumption called one-sided lying. This assumption is based on the assumed direction of social desirability bias in sensitive surveys. It holds that individuals who do not bear the sensitive trait never falsely claim that they do. Rather, the set of potential liars is limited to those respondents who do bear the sensitive trait. Let λ^T_θ, λ^L_θ, and 1 − λ^T_θ − λ^L_θ denote the probabilities that, when asked directly, a respondent whose status is θ tells the truth about her willingness to bribe, lies about her willingness, or refuses to answer the question, respectively. One-sided lying specifically implies that λ^L_0 = 0.


Now suppose that the analyst uses either the Warner randomized response model or the Crosswise Model as the sensitive question technique. Then, given the assumptions of honesty given protection and one-sided lying, the probability of each combination of responses in the observed response set is as in Table 1 below. The parameter vector to be estimated is ξ = (π, λ^T_1, λ^L_1, λ^T_0), where the final three elements in this vector can be thought of as diagnostic parameters. These indicate the need (or lack thereof) to use a sensitive questioning technique to study the trait of interest in the target population and are substantively important in their own right. In particular, λ^T_1 expresses the probability that an individual bearing the sensitive trait would be willing to respond honestly under direct questioning.

Table 1: Outcome Probabilities for the Joint Response Model

Yi   Outcome                    Probability
1    (y^D_i = 0, y^A_i = 0)     p λ^T_0 (1 − π) + (1 − p) λ^L_1 π
2    (y^D_i = 0, y^A_i = 1)     (1 − p) λ^T_0 (1 − π) + p λ^L_1 π
3    (y^D_i = 1, y^A_i = 0)     (1 − p) λ^T_1 π
4    (y^D_i = 1, y^A_i = 1)     p λ^T_1 π
5    (y^D_i = ∅, y^A_i = 0)     p (1 − λ^T_0)(1 − π) + (1 − p)(1 − λ^T_1 − λ^L_1) π
6    (y^D_i = ∅, y^A_i = 1)     (1 − p)(1 − λ^T_0)(1 − π) + p (1 − λ^T_1 − λ^L_1) π

The log-likelihood for this model is equal to:

ln L(ξ|Y) = ∑_{k=1}^{6} n_k ln P_Y(k),    (8)

where P_Y(k) is the probability of observing Yi = k and n_k = ∑_{i=1}^{n} I(Yi = k) is the number of respondents exhibiting response category k. Gingerich et al. (2014) describes how to use the E-M algorithm to estimate the parameters of this model.
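As a sketch of what estimation involves, the model can also be fit by maximizing Eq. (8) directly with optim(); the counts below are hypothetical frequencies of the six response types, and the logit and softmax transforms simply keep the parameters in their admissible ranges:

# Sketch: fit the joint response model by maximizing Eq. (8) directly.
counts <- c(420, 180, 60, 140, 120, 80)  # hypothetical counts of Yi = 1..6
p <- 0.75                                # known randomizing-device probability

joint_probs <- function(pi, lT0, lT1, lL1) {
  q <- 1 - p
  c(p * lT0 * (1 - pi) + q * lL1 * pi,   # (yD = 0, yA = 0)
    q * lT0 * (1 - pi) + p * lL1 * pi,   # (yD = 0, yA = 1)
    q * lT1 * pi,                        # (yD = 1, yA = 0)
    p * lT1 * pi,                        # (yD = 1, yA = 1)
    p * (1 - lT0) * (1 - pi) + q * (1 - lT1 - lL1) * pi,   # (yD = NA, yA = 0)
    q * (1 - lT0) * (1 - pi) + p * (1 - lT1 - lL1) * pi)   # (yD = NA, yA = 1)
}

negll <- function(par) {
  pi  <- plogis(par[1])                  # prevalence, constrained to (0, 1)
  lT0 <- plogis(par[2])                  # truth-telling prob. for non-bearers
  w   <- exp(c(par[3], par[4], 0))       # softmax for (lT1, lL1, refusal)
  w   <- w / sum(w)
  -sum(counts * log(joint_probs(pi, lT0, w[1], w[2])))
}

fit <- optim(rep(0, 4), negll)
plogis(fit$par[1])                       # estimated prevalence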

Explanatory Randomized Response

For many social scientists, the real attraction of sensitive survey techniques like randomized response rests with the fact that they can be easily utilized to study the impact of determinants of sensitive behavior. Not surprisingly, the theory and practice of randomized response regression models has a fairly long history (Maddala, 1986; Scheers and Dayton, 1988; Van Der Heijden and van Gils, 1996; Van Der Heijden, Bouts and Hox, 2000; Lara et al., 2006; Lensvelt-Mulders et al., 2006; Gingerich, 2013; Blair, Imai and Zhou, 2015).

From a likelihood perspective, setting up an explanatory model for the sensitive trait simply entails replacing the unconditional probability π with an appropriately parameterized conditional probability,

πi = f(Xi; β),    (9)

where Xi is a vector of background characteristics and a constant, β is a parameter vector, and f is a function from the real line into the interval [0, 1] such as the inverse-logit or probit function. Given the model for the sensitive trait, the analyst inserts πi in place of π in the expressions for the probabilities of each type of observed response.

If the outcome consists of randomized response data only, then the log-likelihood for the sample becomes:

ln L(β|Y, X) = ∑_{i=1}^{n} [I(Yi = 1) ln P(Yi = 1|Xi) + I(Yi = 0) ln P(Yi = 0|Xi)],    (10)

where P(Yi = 1|Xi) and P(Yi = 0|Xi) are defined based on Eq. (1) or Eq. (5), depending on which randomized response method is utilized.
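A minimal sketch of Eqs. (9) and (10) for the Warner design, fitting a two-parameter logit by maximum likelihood (y, x, and the value of p are hypothetical; the rr package cited above provides full-featured versions of such regressions):

# Sketch: ML logit for a Warner-design outcome, Eqs. (9)-(10).
p <- 0.75
negll <- function(beta, y, x) {
  pi_i <- plogis(beta[1] + beta[2] * x)        # Eq. (9): logit model
  pr1  <- p * pi_i + (1 - p) * (1 - pi_i)      # P(Yi = 1 | Xi), from Eq. (1)
  -sum(y * log(pr1) + (1 - y) * log(1 - pr1))  # negative of Eq. (10)
}
fit <- optim(c(0, 0), negll, y = y, x = x)
fit$par                                        # intercept and slope estimates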

Similarly, if the outcome data consist of both randomized responses and direct responses, then the log-likelihood is equal to:

ln L(ξ|Y, X) = ∑_{i=1}^{n} ∑_{k=1}^{6} I(Yi = k) ln P_Y(k|Xi),    (11)

where the P_Y(k|Xi) are defined as in Table 1 (for Warner randomized response or the Crosswise Model) and ξ = (β, λ^T_1, λ^L_1, λ^T_0). The reader is referred to Blair, Imai and Zhou (2015) and Gingerich et al. (2014) for a discussion of estimation and inference with statistical models of this form.

Non-parametric or semi-parametric program evaluation techniques, such as matching, stratification, and propensity score weighting, can also be used to study the impact of drivers of sensitive behavior using randomized response data (Gingerich, 2010). Suppose that the goal of the analyst is to estimate the impact of a directly observed binary treatment variable, Ti ∈ {0, 1}, and that the outcome of interest is measured using the randomized response technique. Assuming that treatment assignment is ignorable given the covariates X, the average treatment effect (i.e. the average unit-level impact of being assigned T = 1 instead of T = 0) can be identified from the data as:

τ = E_X{E[Y|T = 1, X = x] − E[Y|T = 0, X = x]}/c,    (12)

where c = 2p − 1 if the Warner randomized response or Crosswise Models are used, and c = p1 if the forced response model is used. (The notation E_X denotes that the outer expectation is taken with respect to the covariates.) The numerator of the expression above is the average treatment effect calculated using the observed randomized responses as the outcome variable. Thus, the analyst can estimate the average treatment effect for the sensitive trait by first estimating the average treatment effect for the observed reports — using matching, stratification, or propensity score weighting — then dividing this quantity by the constant c.

Item Response Theory Modeling of Multiple Randomized Responses

In applied studies, researchers may be interested in measuring the latent sensitive attitudes of individuals or in developing an understanding of the influence of factors that contribute to such attitudes. A growing body of work on the use of item response theory models with randomized response data provides a framework for accomplishing these goals (Böckenholt and Van der Heijden, 2007; Böckenholt, Barlas and Van Der Heijden, 2009; Fox, 2005, 2010; Fox and Wyrick, 2008).

The settings of interest for these techniques are instances in which the researcher would like to capture an attitude that is sensitive and inherently continuous. An example might be support for the use of torture in order to extract information from suspected terrorists. Rather than ask the respondent directly about this latent trait, since support for torture is an inherently ambiguous concept, the analyst asks the respondent a series of questions about support for particular practices that many would classify as torture: threatening a detainee's family members, sleep deprivation, rectal "feeding," waterboarding, and so on. Since support for each of these practices is also potentially sensitive, these questions are all asked in randomized response format.

Given this structure of questioning, the observed responses consist of binary responses to J different randomized response items, with the response to the jth item denoted by Yij. These items are tied structurally to θi, which denotes the value of the continuous and sensitive latent trait of interest for respondent i. The goal of the analyst is either to estimate θi for all i or to estimate a model of determinants of θi. It is assumed that variation in the latent trait is responsible for variation in a series of J sensitive and unobservable items (e.g. variation in support for torture is responsible for variation in support for the specific practices described above). Let θ̃_ij ∈ {0, 1} denote the unobserved value on the jth sensitive item for respondent i (e.g. whether or not i supports waterboarding). The commonly used normal ogive item response model assumes that the responses on the items are linked to the latent trait through the following relationship:

π̃_ij = P(θ̃_ij = 1|θi) = Φ(a_j θi + b_j),    (13)

where Φ is the standard normal cdf and a_j and b_j are item parameters measuring the discriminating power and difficulty of item j, respectively.

The relationship between the underlying outcomes on the unobservable sensitive items and the observed randomized responses is then expressed via a modification of the equations in (1) or (5), depending on which particular randomized response model is utilized. If the Warner or Crosswise Model is used, for instance, the probabilities of the observed responses are written:

P(Yij = 1) = p π̃_ij + (1 − p)(1 − π̃_ij)
P(Yij = 0) = p (1 − π̃_ij) + (1 − p) π̃_ij.    (14)

In order to identify the parameters of the baseline randomized response IRT model, ξ = (θ, a, b), it is typically assumed that θi is distributed normally with mean zero and unit variance. Alternatively, if the analyst wishes to estimate an explanatory model for the latent sensitive trait, then she may further specify the equation:

θi = X⊤_i β + ε_i,  ε_i ∼ N(0, 1),

in which case the parameters to be estimated are ξ = (β, a, b). See Fox (2010) for a detailed discussion of a Bayesian approach to estimation and inference with models of this kind.

V. Evidence from Validation Studies

The performance of the randomized response technique in uncovering sensitive behaviors has been examined in a number of validation studies in which the true prevalence of the sensitive behavior in the sample is known by researchers in advance. These studies compare randomized response to other questioning methods in terms of the proximity of the prevalence rates generated by each questioning method to the true rate. The overall tenor of the results produced by these studies, with a couple of exceptions, is generally favorable to the use of the method. (All of these studies focus on variants of randomized response developed prior to the Crosswise Model; to the best of my knowledge, the Crosswise Model has not yet been evaluated in a formal validation study.)

Most common are validation studies comparing randomized response to direct questioning. In an early study of this kind, Locander, Sudman and Bradburn (1976) compared the performance of randomized response to three forms of direct questioning for estimating the prevalence of voter registration, library card ownership, bankruptcy declaration, voting in primary elections, and drunken driving arrests. Using a sample of 680 adults in Chicago, the authors found that randomized response produced estimates closest to the true values for the most sensitive items under consideration — bankruptcy and drunk driving — but the small number of observations per outcome prohibited them from reaching a definitive conclusion about the performance of the technique. More definitive findings were produced by Lamb Jr. and Stem Jr. (1978), who compared the performance of direct questioning versus randomized response in estimating the incidence of failing course grades in a convenience sample of 312 upper-division undergraduates. They found that whereas both direct questioning and randomized response only slightly underestimated the proportion of students who failed courses, direct questioning severely underestimated the average number of failed courses and randomized response did a much better job in estimating this quantity.

Several validation studies have been based on public arrest records or court proceedings. In this vein, Tracy and Fox (1981) compared the performance of direct questioning versus randomized response in estimating the frequency of arrests in a sample of 530 arrestees in Philadelphia. The authors found that randomized response did better in estimating the overall mean number of arrests than direct questioning. In a more recent study, Van Der Heijden, Bouts and Hox (2000) compared the performance of computer-assisted self-interviews (CASI), face-to-face direct questioning, and two varieties of randomized response in estimating the prevalence of welfare and unemployment benefit fraud in a sample of 534 Dutch adults previously found guilty of fraud. The authors found that the two randomized response procedures produced prevalence rates much closer to the true value than CASI or face-to-face direct questioning. In a similar investigation, Wolter and Preisendörfer (2013) compared the level of misreporting of past criminal convictions across randomized response and face-to-face direct questioning in a sample of 552 German adults who had all been previously convicted of minor criminal offenses. These authors found that randomized response produced a prevalence estimate closer to the true value than direct questioning, but the difference between the two estimates was negligible.

Two validation studies have examined the incidence of smoking using biomarkers. In the first, Akers et al. (1983) examined the prevalence of teenage smoking using levels of salivary thiocyanate as an indicator of past smoking. They found that the randomized response estimates of smoking prevalence were nearly identical to the direct questioning estimates and that the pattern of direct responses very closely tracked levels of salivary thiocyanate, implying that the use of randomized response was unnecessary to study the incidence of smoking in this population. A very different finding was encountered by Fox, Avetisyan and Palen (2013) when they compared the performance of randomized response versus direct questioning in detecting smoking behavior among 305 outpatients of a pulmonary department in a hospital in the Netherlands. In this study, the true smoking status of each patient was assessed by recording the CO level in exhaled air using a portable CO monitor. The authors found that being a smoker (based on the CO measure) much more powerfully predicted reported smoking when randomized response was utilized than when direct questioning was employed, thereby establishing the value of randomized response questioning for this particular population.

The most systematic and rigorous validation study involving randomized response is provided in a recent paper by Rosenfeld, Imai and Shapiro (Forthcoming). This work examines the performance of four survey methods for asking about sensitive topics: direct questioning, the list experiment, endorsement experiments, and the randomized response technique. In order to compare the questioning methods, the study used a survey of 2,655 voters who participated in the 2011 Mississippi General Election (which featured a controversial anti-abortion referendum). Performance was gauged by comparing the proportion of "no" votes obtained using a given questioning method to the true proportion of "no" votes as recorded in the election (at both the county and state level). Given Mississippi's conservative bent, a "no" vote was the potentially stigmatizing behavior. The study found that randomized response performed better than all the alternatives, producing estimates of the proportion of "no" votes nearly identical to the true proportion (and with minimal variance relative to the other indirect questioning techniques).

3. All of these studies focus on variants of randomized response techniques developed prior to the Crosswise Model. To the best of my knowledge, the Crosswise Model has not yet been evaluated in a formal validation study.

VI. Some Practical Advice

I conclude by offering some practical advice to researchers who are contemplating the use of randomized response survey techniques in their applied work. The general point that needs to be made up front is that successfully implementing a randomized response survey is very hard work. Without careful attention to how these techniques are explained to and understood by respondents, randomized response surveys can easily fail to meet their objectives. Indeed, a number of papers have reported implementation failures in applications of randomized response, especially with the forced response variant of the technique (Edgell, Himmelfarb and Duchan, 1982; Azfar and Murrell, 2009; Holbrook and Krosnick, 2010). With these issues in mind, I have several concrete suggestions for maximizing the chances of success for a randomized response survey in the field.

First, the administration of randomized response surveys should ideally be conducted in a face-to-face format by enumerators who have undergone extensive training in the logic and implementation of the particular technique they will employ. Before responding to the randomized response questions, every respondent should receive an explanation of why the technique is being used and how it works. Thereafter, and prior to the presentation of the randomized response questions of interest, respondents should be presented with several examples of how hypothetical individuals, varying in trait status and their observed realizations of the randomizing device (or the benign group indicator if the Crosswise Model is used), would respond to a particular question according to the randomized response protocol. All this takes time, and survey researchers should be prepared to shorten other components of their survey instruments in order to give these items the attention they require. Indeed, researchers would do well to directly train the enumerators in administering the randomized response component of the survey, and require any survey firm they employ to utilize only the enumerators they have trained to deliver the survey in the field. In my mind, the importance of enumerator-respondent interaction in explaining the randomized response technique generally counsels against its use in telephone, mail, or internet surveys.4

As this overview makes clear, there are a variety of different forms of randomized response models, and there can be multiple ways in which to present questions that are based upon a given variant of the technique. In order to get a sense of which modality of the technique works best with a given population, researchers would also do well to make use of carefully selected focus groups prior to fielding a survey. Focus group sessions can be invaluable in helping the survey researcher to ascertain how framing randomized response questions differently can affect comfort with the technique, and they can also be very helpful in assessing the success of alternative scripts that introduce and explain the technique.

Once a variant of the randomized response technique and its mode of presentation have been selected, researchers should field test the survey instrument in a small-scale pilot study. In the pilot study, enumerators should be accompanied by silent observers from the research team. These individuals view the entire process of survey administration with an eye to detecting any challenges in the delivery of the randomized response component of the survey or any overt signs of respondent discomfort with the technique.

Finally, there are quality checks that can and should be incorporated into the survey instrument itself. These include follow-up questions subsequent to the execution of the randomized response component of the survey that assess respondents' trust in, and understanding of, the technique. By allowing researchers to identify subgroups within the sample for whom the randomized response protocol was unclear or viewed with suspicion, the inclusion of such questions permits robustness exercises that verify whether or not the central findings of a study depend on the potentially problematic responses of such individuals.

4. The one exception to this might be the Crosswise Model, which appears to have had success thus far in on-line surveys (Höglinger, Jann and Diekmann, 2014; Jann, Jerke and Krumpal, 2011; Kundt, 2014).
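In practice, such a robustness exercise can be as simple as re-estimating prevalence after dropping respondents who failed the trust or comprehension follow-ups and checking whether the answer moves. The sketch below assumes hypothetical binary follow-up items and the same hypothetical forced-response device as above; it is not drawn from any particular study.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(1)
n = 500
# Hypothetical respondent-level data: the recorded randomized response
# answer plus binary follow-ups on trust in and understanding of the protocol.
df = pd.DataFrame({
    "rr_answer":  rng.integers(0, 2, n),
    "trusts_rr":  rng.integers(0, 2, n),
    "understood": rng.integers(0, 2, n),
})

P_TRUTH, P_FORCED_YES = 5/6, 1/12   # hypothetical device probabilities

def prevalence(sub):
    """Forced-response prevalence estimate for a subset of respondents."""
    return (sub["rr_answer"].mean() - P_FORCED_YES) / P_TRUTH

full = prevalence(df)
clean = prevalence(df[(df["trusts_rr"] == 1) & (df["understood"] == 1)])
print(f"full sample: {full:.3f}; trusting and comprehending only: {clean:.3f}")
```

If the two estimates diverge substantially, the findings may hinge on respondents who did not engage with the protocol as intended.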

References

Akers, Ronald L., James Massey, William Clarke and Ronald M. Lauer. 1983. "Are self-reports of adolescent deviance valid? Biochemical measures, randomized response, and the bogus pipeline in smoking behavior." Social Forces 62(1):234–251.

Azfar, Omar and Peter Murrell. 2009. "Identifying reticent respondents: Assessing the quality of survey data on corruption and values." Economic Development and Cultural Change 57(2):387–411.

Blair, Graeme, Kosuke Imai and Yang-Yang Zhou. 2015. Design and analysis of randomized response technique. Technical report.

Böckenholt, Ulf and Peter G.M. Van der Heijden. 2007. "Item randomized-response models for measuring noncompliance: Risk-return perceptions, social influences, and self-protective responses." Psychometrika 72(2):245–262.

Böckenholt, Ulf, Sema Barlas and Peter G.M. Van Der Heijden. 2009. "Do randomized-response designs eliminate response biases? An empirical study of non-compliance behavior." Journal of Applied Econometrics 24(3):377–392.

Boruch, Robert F. 1971. "Assuring confidentiality of responses in social research: A note on strategies." The American Sociologist 6(4):308–311.

Chaudhuri, Arijit and Rahul Mukerjee. 1988. Randomized Response: Theory and Techniques. New York, NY: Marcel Dekker.

De Jong, Martijn G., Rik Pieters and Jean-Paul Fox. 2010. "Reducing social desirability bias through item randomized response: An application to measure underreported desires." Journal of Marketing Research 47(1):14–27.

De Jong, Martijn G., Rik Pieters and Stefan Stremersch. 2012. "Analysis of sensitive questions across cultures: An application of multigroup item randomized response theory to sexual attitudes and behavior." Journal of Personality and Social Psychology 103(3):543.

Dietz, Pavel, Heiko Striegel, Andreas G. Franke, Klaus Lieb, Perikles Simon and Rolf Ulrich. 2013. "Randomized response estimates for the 12-month prevalence of cognitive-enhancing drug use in university students." Pharmacotherapy: The Journal of Human Pharmacology and Drug Therapy 33(1):44–50.

Edgell, Stephen E., Samuel Himmelfarb and Karen L. Duchan. 1982. "Validity of forced responses in a randomized response model." Sociological Methods and Research 11(1):89–100.

Fox, James Alan and Paul E. Tracy. 1986. Randomized Response: A Method for Sensitive Surveys. London, UK: Sage Publications.

Fox, Jean-Paul. 2005. "Randomized item response theory models." Journal of Educational and Behavioral Statistics 30(2):189–212.

Fox, Jean-Paul. 2010. Randomized Item Response Models. In Bayesian Item Response Modeling. New York, NY: Springer.

Fox, Jean-Paul and Cheryl Wyrick. 2008. "A mixed effects randomized item response model." Journal of Educational and Behavioral Statistics 33(4):389–415.

Fox, Jean-Paul, M. Avetisyan and J. Van Der Palen. 2013. "Mixture randomized item-response modeling: A smoking behavior validation study." Statistics in Medicine 32(27):4821–4837.

Fox, Jean-Paul and Rob R. Meijer. 2008. "Using item response theory to obtain individual information from randomized response data: An application using cheating data." Applied Psychological Measurement 32(8):595–610.

Franzen, Axel and Sonja Pointner. 2012. "Anonymity in the dictator game revisited." Journal of Economic Behavior and Organization 81(1):74–81.

Gingerich, Daniel W. 2010. "Understanding off-the-books politics: Conducting inference on the determinants of sensitive behavior with randomized response surveys." Political Analysis 18(3):349–380.

Gingerich, Daniel W. 2013. Political Institutions and Party-Directed Corruption in South America: Stealing for the Team. Cambridge, UK: Cambridge University Press.

Gingerich, Daniel W., Virginia Oliveros, Ana Corbacho and Mauricio Ruiz-Vega. 2014. "When to protect? Using the crosswise model to integrate protected and direct responses in surveys of sensitive behavior." Working Paper.

Höglinger, Marc, Ben Jann and Andreas Diekmann. 2014. "Sensitive questions in online surveys: An experimental evaluation of the randomized response technique and the crosswise model." University of Bern Social Sciences Working Paper No. 9.

Holbrook, Allyson L. and Jon A. Krosnick. 2010. "Social desirability bias in voter turnout reports: Tests using the item count technique." Public Opinion Quarterly 74(1):37–67.

Jann, Ben, Julia Jerke and Ivar Krumpal. 2011. "Asking sensitive questions using the crosswise model: An experimental survey measuring plagiarism." Public Opinion Quarterly 76(1):32–49.

Kerkvliet, Joe. 1994a. "Cheating by economics students: A comparison of survey results." The Journal of Economic Education 25(2):121–133.

Kerkvliet, Joe. 1994b. "Estimating a logit model with randomized data: The case of cocaine use." Australian Journal of Statistics 36(1):9–20.

Krumpal, Ivar. 2012. "Estimating the prevalence of xenophobia and anti-semitism in Germany: A comparison of randomized response and direct questioning." Social Science Research 41(6):1387–1403.

Kundt, Thorben Christian. 2014. "Applying 'Benford's Law' to the Crosswise Model: Findings from an online survey on tax evasion." Available at SSRN 2487069.

Lamb Jr., Charles W. and Donald E. Stem Jr. 1978. "An empirical validation of the randomized response technique." Journal of Marketing Research 15(4):616–621.

Lara, Diana, Sandra G. García, Charlotte Ellertson, Carol Camlin and Javier Suárez. 2006. "The measure of induced abortion levels in Mexico using random response technique." Sociological Methods and Research 35(2):279–301.

Lensvelt-Mulders, Gerty J.L.M., Peter G.M. Van Der Heijden, Olav Laudy and Ger Van Gils. 2006. "A validation of a computer-assisted randomized response survey to estimate the prevalence of fraud in social security." Journal of the Royal Statistical Society: Series A (Statistics in Society) 169(2):305–318.

List, John A., Robert P. Berrens, Alok K. Bohara and Joe Kerkvliet. 2004. "Examining the role of social isolation on stated preferences." American Economic Review 94(3):741–752.

Locander, William, Seymour Sudman and Norman Bradburn. 1976. "An investigation of interview method, threat and response distortion." Journal of the American Statistical Association 71(354):269–275.

Maddala, Gangadharrao S. 1986. Limited-Dependent and Qualitative Variables in Econometrics. Cambridge, UK: Cambridge University Press.

Rosenfeld, Bryn, Kosuke Imai and Jacob Shapiro. Forthcoming. "An empirical validation study of popular survey methodologies for sensitive questions." American Journal of Political Science.

Scheers, N.J. and C. Mitchell Dayton. 1987. "Improved estimation of academic cheating behavior using the randomized response technique." Research in Higher Education 26(1):61–69.

Scheers, N.J. and C. Mitchell Dayton. 1988. "Covariate randomized response models." Journal of the American Statistical Association 83(404):969–974.

Tan, Ming T., Guo-Liang Tian and Man-Lai Tang. 2009. "Sample surveys with sensitive questions: A nonrandomized response approach." The American Statistician 63(1):9–16.

Tracy, Paul E. and James Alan Fox. 1981. "The validity of randomized response for sensitive measurements." American Sociological Review 46(2):187–200.

Van Der Heijden, Peter G.M. and Ger van Gils. 1996. Some logistic regression models for randomized response data. In Statistical Modeling: Proceedings of the 11th International Workshop on Statistical Modeling. Orvieto, Italy. pp. 15–19.

Van Der Heijden, Peter G.M., Jan Bouts and Joop Hox. 2000. "A comparison of randomized response, computer-assisted self-interview, and face-to-face direct questioning eliciting sensitive information in the context of welfare and unemployment benefit." Sociological Methods and Research 28(4):505–537.

Warner, Stanley L. 1965. "Randomized response: A survey technique for eliminating evasive answer bias." Journal of the American Statistical Association 60(309):63–69.

Wolter, Felix and Peter Preisendörfer. 2013. "Asking sensitive questions: An evaluation of the randomized response technique versus direct questioning using individual validation data." Sociological Methods and Research 42(3):321–353.

Yu, Jun-Wu, Guo-Liang Tian and Man-Lai Tang. 2008. "Two new models for survey sampling with sensitive characteristic: Design and analysis." Metrika 67(3):251–263.

Capturing Ethnic Salience on Surveys in Developing Countries

by Elizabeth C. Carlson
The Pennsylvania State University

The literature on public opinion in the U.S. demonstrates that social norms against expressing racist attitudes cause misestimates on a variety of important measures, including racial animosity (Wittenbrink, Judd and Park, 1997); support for race-related policies such as school integration (Berinsky, 1999); and intended vote for non-white political candidates (Hopkins, 2009; Terkildsen, 1993). Similar taboos against expressing negative attitudes toward certain ethnic, racial, or religious groups also exist elsewhere and presumably bias survey results in these locations as well. Unfortunately, our ability to identify and combat such biases is often limited in developing country contexts. One well-tested means for decreasing social desirability bias is to allow respondents to report their answers in private — by self-administering the survey on paper or a computer — rather than reporting their opinions out loud to another person (Krysan, 1998; Krysan and Couper, 2003). However, this type of privacy is hard to offer in contexts where many respondents are functionally illiterate and thus cannot self-administer a written survey. To be accessible to the full population, surveys in the developing world are usually conducted orally: an enumerator reads the questions to the respondent and records the responses on her behalf. The choice faced by those conducting surveys on sensitive issues in the developing world is thus between gathering genuine opinions from the subset of the population that can read and write, and selecting a representative sample of the population while acknowledging that the survey mode used to gather the data from such a sample increases the chances that the results will be biased.

Of course, neither of these options is particularly attractive. The question that motivates my own recent work — the conditions under which Ugandan voters prefer a coethnic candidate for office, especially when that candidate is dubiously qualified — cannot be answered under either approach. Since ethnic salience varies across class and education (Eiffert, Miguel and Posner, 2010; Bossuroy, 2011), I cannot reduce my sample to only the literate. But identifying the conditions under which ethnic voting occurs requires, at minimum, an accurate measure of when voters do and do not vote along ethnic lines. Social censure of 'tribalism' is a serious concern if we gather vote preference through oral interviews. Though some Africanists argue that African voters are open about their ethnic loyalties (Eiffert, Miguel and Posner, 2010), most data suggest that voters strongly prefer not to reveal such preferences to an enumerator. Though voting returns in many African elections make it clear that voters support candidates of their own ethnicity, most survey respondents who are asked directly report that they do not consider ethnicity when voting, or even that such preferences are morally wrong (Lindberg and Weghorst, 2010; Posner, 2005). We can therefore expect that, without intervention to reduce bias, survey respondents in African countries will systematically under-report voting for candidates of their own ethnicity on public opinion surveys, affecting our understanding of when such voting is likely.

In my study, I used two simple interventions intended to increase the likelihood that respondents would accurately report their true preferences, while maintaining an oral framework accessible to all members of the population (Carlson, Forthcoming, 2014). In the first intervention, I adapted the delivery of the survey to allow private reporting by illiterate respondents. Chauchard (2013) uses such a technique in India. He recorded questions on MP3 players and allowed respondents to self-report their answer from a menu of options represented with familiar symbols. In Chauchard's (2013) example, however, the entire survey was self-administered. This limits the content of the survey to only certain types of closed-ended questions. My intervention was much smaller: I embedded a single question answered with a secret ballot into a larger face-to-face interview. Such a method also happens to require only pen and paper, which may be an advantage for very low-resource projects. A second strategy, used with mixed results in the U.S. (Hatchett and Shuman, 1975; Campbell, 1981; Finkel, Guterbock and Borg, 1991) and proposed by Africanists such as Adida et al. (2014), is to match the ethnicity of enumerators and respondents. This intervention is based on the assumption that respondents will be more honest about ethnic preferences when they do not need to worry about offending a member of another group. In my results, I find that allowing limited self-administration effectively reduces bias, while ethnic matching exacerbates the problem.

In my study, I asked 800 Ugandans, drawn from 32 villages, to report their preference between two hypothetical candidates whose ethnicity and qualification for office — such as education and record of performance in prior office — were randomly selected from a list of possible traits. A sample vignette is shown in Figure 1 below. In the vignettes, ethnicity was signaled by the candidate's city of origin. Performance was indicated by what the candidate accomplished in prior office. In the sample vignette, the first candidate paved a road. The second candidate did not have any accomplishments to note, which in the Ugandan context signals incompetence or corruption. The dependent variable of interest is the percent of respondents who selected a candidate of their own ethnicity. If the interventions work to reduce social desirability bias, the percent reporting support for a coethnic, regardless of his performance, should increase.

Figure 1: Sample Candidate Pairing

Candidate One. The candidate is from Hoima. He has a Master's Degree in African Development. Before running for president, he was an LCI [village chief]. While he was LCI, he paved the community's major road. If elected president he promises to create new jobs.

Candidate Two. The candidate is from Soroti. He has a university degree in Tourism. Before running for president, he was a Member of Parliament. If elected president he promises to improve government health care. (Note: This candidate is cued as having no record of provision, simply by leaving out mention of such provision.)
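Operationally, a vignette pair like the one in Figure 1 is built by drawing each attribute at random from a trait list. The sketch below is a minimal illustration of that randomization; the trait lists are hypothetical stand-ins, not the study's actual lists.

```python
import random

# Hypothetical trait lists standing in for those used in the study.
ORIGINS = ["Hoima", "Soroti", "Mbale", "Gulu"]   # city of origin signals ethnicity
DEGREES = ["a Master's Degree in African Development",
           "a university degree in Tourism"]
OFFICES = ["an LCI [village chief]", "a Member of Parliament"]
RECORDS = ["While in office, he paved the community's major road.",
           ""]                                   # "" = no accomplishment cued

def draw_candidate(rng):
    """Assemble one candidate vignette from randomly drawn traits."""
    parts = [f"The candidate is from {rng.choice(ORIGINS)}.",
             f"He has {rng.choice(DEGREES)}.",
             f"Before running for president, he was {rng.choice(OFFICES)}."]
    record = rng.choice(RECORDS)
    if record:  # omitting any record is itself a (negative) performance cue
        parts.append(record)
    return " ".join(parts)

rng = random.Random(42)
print("Candidate One:", draw_candidate(rng))
print("Candidate Two:", draw_candidate(rng))
```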

Participants in the experiment were recruited into a 45-minute long survey about local welfare and public goods quality that contained both closed- and open-ended questions and was conducted orally one-on-one with an enumerator. Respondents were then assigned to small groups. Members of each group listened to their assigned vignettes and reported their preference via a secret pen and paper ballot. To enable illiterate respondents to mark their ballot without assistance, the candidates were indicated as a "circle" and a "square" (or "box" as appropriate in the local language). Figure 2 below shows the ballot.

Figure 2: Ballot

There were two key variations in the context of the voting experiment. First, respondents faced two different levels of social exposure: while most respondents cast only a secret ballot, the members of approximately one-third of the groups were also assigned to report their preference aloud to everyone in the group. Second, the groups varied in their ethnic composition: conditional on the ethnic diversity of the local community from which the respondents were drawn, whether a respondent voted only with members of her own ethnic group or in a mixed group was random. The first variation tests whether granting respondents privacy will reduce bias; the second tests whether asking respondents to report ethnic preferences to members of only their own ethnicity does the same.

The first conclusion of the experiment is that privacy works to reduce social desirability bias. Those who cast only secret ballots were significantly more likely to report a preference for a coethnic candidate than those who reported their preferences publicly. Among voters who reported their votes aloud, coethnics had no advantage: 50% of coethnics won their contests, as did 50% of non-coethnic candidates. Taken at face value, this result suggests that Ugandan voters see no advantage in coethnic candidates. Yet, when voters were allowed to report their vote entirely in secret, it became clear that voters do actually hold such a preference. In the secret-ballot condition, coethnic candidates were 6% more likely to win than were non-coethnic candidates. The vote shares of coethnic and non-coethnic candidates in the secret ballot condition are significantly different from their vote shares in the public voting treatment, as well as significantly different from one another. These results indicate that allowing limited self-administration on sensitive questions, rather than requiring oral responses, can reduce bias to the point that it actually changes our conclusions.
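The significance claims here rest on standard comparisons of proportions across conditions. As a minimal illustration, a two-proportion z-test of coethnic win rates across the secret and public conditions might look like the sketch below; the counts are invented for illustration and are not the study's data.

```python
from statsmodels.stats.proportion import proportions_ztest

# Invented counts: coethnic candidate wins out of coethnic contests,
# under the secret-ballot and public-vote conditions respectively.
wins = [168, 75]        # 56% vs. 50% win rates
contests = [300, 150]
stat, pval = proportions_ztest(wins, contests)
print(f"z = {stat:.2f}, p = {pval:.3f}")
```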


Reducing observation also decreases socially undesirable responses outside of the experimental Ugandan context. Since almost all existing data on public opinion in Africa was gathered with in-person interviews, I cannot test the effects of allowing full privacy on respondents' reported vote choice. However, I can leverage variation in the number of people in front of whom respondents reported their opinions. Despite enumerators' efforts to prevent it, about a quarter of respondents on the massive, multi-country Afrobarometer public opinion survey nevertheless complete their interview in front of family or friends. If this happens, it is noted at the end of the survey. I pool the results from Round Four (2011-2012) of the Afrobarometer from every country where the major presidential candidates were of different ethnicities, for a total sample of over 25,000 respondents across 18 countries. Controlling for country fixed effects and individual-level characteristics that might make a respondent more likely to be observed, such as age or gender, I find that as the number of observers increases, the likelihood that a respondent will report a preference for a candidate of their own ethnicity decreases. Those whose interviews were observed by family or neighbors are 10% less likely to report support for a coethnic presidential candidate than those whose surveys were conducted only by an enumerator, at a high level of statistical significance. These results suggest that reducing the number of observers to zero — by allowing complete self-administration, such as with a secret ballot — might increase reported support for coethnic candidates even more.
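A regression of this form can be sketched as a logit with country fixed effects entered as indicator variables. The snippet below uses synthetic data and hypothetical column names purely to show the model structure; it is not the author's code or data.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Synthetic stand-in for the pooled Afrobarometer data described above.
rng = np.random.default_rng(0)
n = 2000
df = pd.DataFrame({
    "n_observers": rng.integers(0, 4, n),            # bystanders at the interview
    "age": rng.integers(18, 80, n),
    "female": rng.integers(0, 2, n),
    "country": rng.choice([f"c{i}" for i in range(18)], n),
})
# Build the outcome so that being observed lowers reported coethnic support.
logit_p = -0.3 * df["n_observers"] + 0.2
df["coethnic_pref"] = (rng.random(n) < 1 / (1 + np.exp(-logit_p))).astype(int)

# Logit with country fixed effects via C(country), mirroring the analysis above.
fit = smf.logit("coethnic_pref ~ n_observers + age + female + C(country)",
                data=df).fit(disp=False)
print(fit.params["n_observers"])  # negative: more observers, less reported support
```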

Though my results indicate that allowing respondents privacy to report their vote preferences will reduce social desirability bias, they also indicate that getting accurate results does not require adapting an entire survey to be self-administered. It is reasonable to be concerned that embedding a secret ballot question in an otherwise oral survey might prime the very issues we are hoping to de-emphasize. There is some evidence in my data that this sort of priming is occurring. In the experiment, approximately 40% of the candidates in the vignettes were assigned to be of the respondent's ethnicity, which is far higher than chance would predict in a country as ethnically diverse as Uganda. By the final round of the experiment, when voters had already heard three vignettes and cast a ballot three times, many respondents seem to have realized they were being asked to reveal ethnic preferences and adjusted their responses accordingly. The votes cast in the third round of the experiment strongly de-emphasized ethnicity regardless of whether votes were cast publicly or privately.

The question is whether the priming that occurred in the private ballot condition produces different answers than the priming that would have occurred without the use of a private ballot: does it, for example, emphasize that such attitudes should be sensitive or secret and exacerbate bias on later questions? I can provide an initial test of this using responses from the post-survey. After participating in the voting experiment, respondents completed a post-survey one-on-one with an enumerator. The questions were potentially sensitive, including two questions explicitly about ethnic salience. On one of these questions — whether the respondent prioritized their ethnic or national identity — the responses of those who voted publicly and privately were the same (67% of respondents in both groups say they prioritize their national identity). On a second question — whether the respondent would ever marry someone of another group — the groups are significantly different, but in a way that suggests that allowing private voting dampened bias even on later questions asked aloud. Those who were allowed to vote privately in the voting experiment were 3% more likely to admit in the post-survey that they would never marry someone of another group. The difference is significant at standard levels and robust to a variety of respondent-level controls. Altering survey protocol to allow self-administration of a single question, therefore, does not seem to worsen bias on later questions, and may even reduce it.

The second conclusion of the study is that while privacy works, ethnic matching does not. If it did, we would expect that those respondents assigned to vote in groups composed of only their own coethnics would be more likely to report support for coethnic candidates than those assigned to groups of mixed ethnicity. Instead, those who voted in fully homogenous groups were significantly less likely to report a preference for a coethnic candidate. Those who voted in a group comprised only of their coethnics selected coethnics 50% of the time, whereas those who voted in ethnically heterogeneous groups selected coethnics 57% of the time. Though the ethnic composition of the group is correlated with the ethnic diversity in the local community (a diverse group cannot be constructed out of a fully homogenous community), the estimated treatment effect is robust to controls for local ethnic diversity. This result can again be replicated outside the experimental context. Using the same pooled Afrobarometer data I described above, with country-level fixed effects, and a variety of controls, I find that respondents are 7% less likely to report support for the candidate of their own ethnicity when they are interviewed by a coethnic enumerator.

One explanation for this finding is that interactions with non-coethnics may prime ethnic conflict or competition for resources in a way that intra-ethnic interaction does not, triggering a defensive reaction against the idea of a non-coethnic in power and generating a preference for coethnic politicians; this explanation has nothing to do with social desirability bias. However, we also know that some Africans are more generous when playing experimental games with coethnic strangers than they are when playing with non-coethnics (Habyarimana et al., 2007). This appears to be because ethnic networks are so dense that no exchange between coethnics can ever be truly anonymous: exposure to coethnics is more consequential than exposure to non-coethnics. If this is the case, matching respondents with coethnic enumerators may actually produce stronger bias in the direction of social norms.


Table 1: Ethnic Diversity and Social Exposure on Reported Preferences for Coethnic Candidates in Uganda

                                     Private voting   Public voting   All votes
Ethnically diverse voting group      0.61 (0.03)      0.58 (0.04)     0.60 (0.02)
Ethnically homogenous voting group   0.53 (0.02)      0.44 (0.04)     0.50 (0.02)
All groups                           0.55 (0.02)      0.50 (0.03)     0.54 (0.01)

The results of my experiment, summarized in Table 1 above, suggest that both mechanisms are in operation: those who were assigned to vote in an ethnically diverse group are more likely to vote for a coethnic whether they vote privately or publicly. However, exposure to social pressure most strongly affects the reported preferences of those in ethnically homogenous groups, indicating that the desire to conform to social norms is stronger among coethnics. This may also explain why observation by family and neighbors, who will almost always be coethnics, has such a strong effect in reducing reported ethnic voting in the survey results presented above. Together, my results provide some guidance for those gathering data on ethnic politics in places where ethnicity is a sensitive subject and surveys must be conducted orally. Though it is very low-cost, and may be effective in some contexts, a strategy of ethnic matching between respondents and enumerators is risky: my results indicate that it exacerbates bias in Uganda and in a pooled African sample. It may also do so in India, the Middle East, and other areas where intra-ethnic social networks are strong and intra-group social sanctions are particularly consequential. The better option is to allow self-administration of sensitive questions so that respondents do not need to report their preferences aloud: my results are just one case in which this strategy has been shown to be effective. What my results contribute is a finding that gathering self-administered data on some of a survey's questions does not require writing an entire survey to be self-administered. Adapting a few of a survey's most sensitive questions can improve data quality with minimal investment and minimal effect on the rest of the survey's data.

References

Adida, Claire, Karen Ferree, Daniel Posner and Amanda Robinson. 2014. "Who's asking? Interviewer coethnicity effects in African survey data." Unpublished Manuscript.

Berinsky, Adam. 1999. "The two faces of public opinion." American Journal of Political Science 43(4):1209–1230.

Bossuroy, Thomas. 2011. "Individual determinants of ethnic identification." Dauphine University Paris IRD Working Paper.

Campbell, Bruce. 1981. "Race-of-interviewer effects among southern adolescents." Public Opinion Quarterly 45:231–244.

Carlson, Elizabeth C. 2014. "Social desirability bias and reported ethnic voting on African surveys." Afrobarometer Working Paper No. 144.

Carlson, Elizabeth C. Forthcoming. "Ethnic voting and accountability: A choice experiment in Uganda." World Politics.

Chauchard, Simon. 2013. "Using MP3 players in surveys: The impact of a low-tech self-administration mode on reporting of sensitive attitudes." Public Opinion Quarterly 77(S1):220–231.

Eiffert, Benn, Edward Miguel and Daniel Posner. 2010. "Political competition and ethnic identification in Africa." American Journal of Political Science 54(2):494–510.

Finkel, Steven, Thomas Guterbock and Marian Borg. 1991. "Race-of-interviewer effects in a pre-election poll: Virginia 1989." Public Opinion Quarterly 55(3):313–330.

Habyarimana, James, Macartan Humphreys, Daniel Posner and Jeremy Weinstein. 2007. "Why does ethnic diversity undermine public goods provision?" American Political Science Review 101(4):709–725.

Hatchett, Shirley and Howard Shuman. 1975. "White respondents and race of interviewer effects." Public Opinion Quarterly 39(4):523–527.

Hopkins, Daniel J. 2009. "No more Wilder effect, never a Whitman effect: When and why polls mislead about black and female candidates." Journal of Politics 71(3):769–781.

Krysan, Maria. 1998. "Privacy and the expression of white racial attitudes." Public Opinion Quarterly 62:506–544.

Krysan, Maria and Mick Couper. 2003. "Race in live and virtual interviews: Racial deference, social desirability and activation effects in attitude surveys." Social Psychology Quarterly 66(4):364–383.

Lindberg, Staffan and Keith Weghorst. 2010. "Are swing voters instruments of democracy or farmers of clientelism? Evidence from Ghana." Working Paper.

Posner, Daniel. 2005. Institutions and Ethnic Politics in Africa. Cambridge: Cambridge University Press.

Terkildsen, Nayda. 1993. "When white voters evaluate black candidates: The processing implications of candidate skin color, prejudice, and self-monitoring." American Journal of Political Science 37(4):1032–1053.

Wittenbrink, Bernd, Charles M. Judd and Bernadette Park. 1997. "Evidence for racial prejudice at the implicit level and its relationship with questionnaire measures." Journal of Personality and Social Psychology 72:262–274.

Lobbying, Corruption, and Non-Responses in Small Samples

by Vineeta Yadav
The Pennsylvania State University

The illegal nature of corruption has posed formidable obstacles for researchers studying this complex phenomenon. In the absence of hard objective data, scholars have most often relied on surveys of citizens, firms, and experts to gather perceptions of corruption (Olken and Pande, 2011; Treisman, 2007). However, survey questions on corruption often suffer from high non-response, and possibly false response, rates because they are considered intrusive, because admitting to participating in corrupt practices is considered socially undesirable, and because respondents may fear retaliation from officials for admitting participation in, or knowledge of, corruption (see, for example, reports documenting the implementation of the European Bank for Reconstruction and Development's [EBRD] 2002 Business Environment and Enterprise Performance Survey [BEEPS]).

As Blair, Imai and Zhou (2015), Krumpal (2013), and Tan, Tian and Tang (2009) discuss, the survey methodological toolkit now includes several techniques — randomized responses, list experiments, endorsement experiments — that can help scholars address the problems, such as non-responses and false responses, that are associated with sensitive survey questions. These various techniques focus on reassuring respondents that their responses will be confidential by using methods that share one key characteristic — they sacrifice the ability of the analyst to identify individual responses. For many research questions, including those related to corruption, sacrificing this ability does not typically impose a high substantive analytical cost. Several recent corruption studies have therefore used these techniques to address some of the challenges posed by sensitivity (Winters and Weitz-Shapiro, 2013; Gingerich, 2010).

However, there are important research agendas where the relationship between the respondent's own behavior and his perception of a collective or aggregate outcome is the central research question. This is most notably the case for the burgeoning literature on lobbying and corruption. The central focus in this literature is on studying how the lobbying practices of an individual special interest group — especially its decision to use money rather than other resources — influence its perception of corruption in that country (Yadav, 2011; Bennedsen, Feldmann and Lassen, 2009; Campos and Giovannoni, 2007). The loss of the ability to identify the corruption level reported by a specific respondent in this case deprives the researcher of the ability to study this question.

The second challenge this specific research agenda poses for analysts is that the sample sizes required for successfully implementing the most commonly used techniques for addressing sensitivity are larger than the entire population sizes of the key elite populations that are at the center of the lobbying-corruption nexus. All of these techniques generally require sample sizes of 1,000 or more to be effective. For example, Gingerich (2010, 364) finds that randomized response techniques underperform compared to direct responses for samples smaller than 1,000 and recommends using samples of 3,000 or more. Corstange (2009, 55) finds that list experiments are not suitable for sample sizes smaller than 1,000 and recommends sample sizes of 2,000 or more. This constraint makes these techniques feasible and appropriate for studying perceptions of corruption among large populations such as voters, firms, and bureaucrats, but not for small elite populations. Unfortunately, several populations that are highly relevant for the study of corruption, such as those of party leaders and business lobbies, are too small for these techniques to be used appropriately.
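The intuition behind these sample-size thresholds is that indirect techniques purchase protection at the cost of statistical efficiency: the randomizing device adds noise, inflating the variance of the prevalence estimator relative to direct questioning. A rough illustration, assuming a hypothetical forced-response design (the device probabilities below are invented, not taken from the cited studies), is sketched here.

```python
import numpy as np

def se_direct(pi, n):
    """Standard error of a direct-question prevalence estimate."""
    return np.sqrt(pi * (1 - pi) / n)

def se_forced_response(pi, n, p_truth=5/6, p_forced_yes=1/12):
    """Standard error of the forced-response estimator. The observed 'yes'
    probability is lam = p_truth*pi + p_forced_yes; dividing the estimate
    by p_truth inflates its sampling variance by 1/p_truth**2."""
    lam = p_truth * pi + p_forced_yes
    return np.sqrt(lam * (1 - lam) / n) / p_truth

for n in (300, 1000, 3000):
    print(f"n={n}: direct SE={se_direct(0.2, n):.3f}, "
          f"forced-response SE={se_forced_response(0.2, n):.3f}")
```

With only a few hundred respondents, the randomized response standard error is already large relative to plausible effect sizes, which is the practical content of the thresholds cited above.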

Considerable empirical evidence shows that the actors most likely to be involved in, and knowledgeable about, corruption in parliament and the executive are not citizens or firms, who rarely interact directly with these bodies, but elite groups in society, such as business interest groups and very large firms, who interact with them frequently (OECD, 2012, 2014; Kalniņš, 2011; Thomas and Hrebenar, 2008; EBRD, 2002). While citizens and firms do interact with the bureaucracy and the legal system, they do so only intermittently (Ayyagari, Beck and Demirguc-Kunt, 2007; International, 2007). In contrast, business lobbies routinely interact with the bureaucracy and the legal system in order to influence policy legislation, implementation, and legal adjudication (OECD, 2012, 2014; Kalniņš, 2011; Thomas and Hrebenar, 2008; EBRD, 2002). Importantly, studies show that in most countries around the world (the U.S. being a notable exception), lobbying is done primarily by formal business associations and chambers rather than by firms themselves or hired third-party lobbyists (OECD, 2012, 2014; Kalniņš, 2011; Thomas and Hrebenar, 2008). Indeed, firms that are members of business associations and chambers often identify such intercession with elected officials, the bureaucracy, or the judiciary as one of the most valuable benefits of association membership (OECD, 2012, 2014; Kalniņš, 2011; Thomas and Hrebenar, 2008; EBRD, 2002).


The problem for corruption scholars is created by the fact that while citizens and firms number in the millions and thousands, the total number of associations, chambers, and other bodies involved directly in lobbying in most countries (with the exception of the U.S. and a handful of wealthy countries) numbers in the hundreds and may cross a thousand for bigger, more sophisticated economies (OECD, 2012, 2014; Kalniņš, 2011; Thomas and Hrebenar, 2008). Very large firms can and often do lobby individually in their own interests. Their numbers, however, are not high enough to boost the population sizes of active business lobbies over the thresholds required for using the survey techniques mentioned previously (Ayyagari, Beck and Demirguc-Kunt, 2007). The small numbers of organized business associations and very large firms therefore render the use of the standard repertoire of techniques for addressing sensitive topics ineffective and ill-advised in most developing countries.

Finally, an additional concern that few extant studies of corruption address is that the sensitivities of respondents, whether masses or elites, may vary for different types of corrupt behavior. The common use of the term "corruption" encompasses a wide range of phenomena, including corruption in parliament, the executive office, political parties, individual politicians, bureaucrats, judges, police, etc. Different corruption sub-types may be subject to different social desirability pressures. We would, therefore, expect the sensitivity of respondents, and consequently their willingness to respond, and to respond honestly, to corruption-related questions, to vary across questions asking for their opinions on these different corruption sub-types. With a few notable exceptions, however, the vast majority of surveys on corruption use questions which do not differentiate between these different corruption sub-types.1 As a result, we have little leverage in understanding which of these sub-types is driving non-responses to a question on "corruption" and on which sub-type of corruption the respondent is in fact expressing his opinion.

The measurement problem this creates is substantial in the cross-country context. Since countries vary in the factors that are believed to influence different sub-types of corruption, non-responses to "corruption" questions may be driven by sensitivity to different sub-types of corruption across (and, of course, within) countries, and responses may be dominated by perceptions of different sub-types of corruption as well. The rate of non-response, as well as the origins and magnitude of the selection bias that non-responses introduce, therefore varies in an opaque way across countries. As a result, whether scholars use survey questions on corruption from individual surveys or indices of corruption that combine information from different surveys, it remains unclear exactly what concept is being examined.

I now briefly illustrate how identifying and collecting data on less sensitive factors that influence non-responses to corruption questions can allow us to address some of these problems. Specifically, I use such data to study what influences respondents to participate in, or avoid answering, corruption questions and to correct for the selection bias that participation decisions introduce in comparing corruption levels and behaviors across countries. By asking narrow questions that query respondents on specific sub-types of corruption, we can study whether patterns of non-responses on sensitive corruption questions are similar or different across sub-types of corruption both within and across countries. I use evidence from a 2005-2006 survey of business lobbies in Brazil and India to illustrate these points.

1. The BEEPS and the UN's International Crime Victimization Surveys do ask questions on distinct corruption behaviors; however, most cross-country surveys do not.

Although Brazil and India both have a reputation for corruption, they have different institutional designs. Existing research tells us that these institutional differences should influence not just the level of aggregate corruption in each country but also the prominence of various corruption sub-types (Graf Lambsdorff, 2005). This suggests that the sensitivity of respondents to questions querying them on different sub-types of corruption may vary across sub-types of corruption within each country and across the two countries, leading to correspondingly different patterns of non-responses and false responses. When I conducted my survey in 2005-2006, the aggregate number of business associations, regional and sectoral chambers, and other bodies involved in policy lobbying at any level was 1,492 in Brazil and 880 in India.2 These total population sizes are too small to effectively implement techniques such as randomized responses, list experiments, or endorsement experiments to deal with corruption-related sensitivity. Therefore, this research agenda required a different approach.

Given the prominence of business lobbying in Brazil and India (dos Santos and de Costa, 2014; Schneider, 2013; Yadav, 2008), one possibility was to use information on the lobbying tactics of businesses — information, media, public protests, money — to gain leverage on non-responses. This is because the choice of lobbying tactics conceivably influences how willing business lobbies are to respond to questions on different types of corruption. In both countries, lobbying was unregulated but not illegal, corporate financial contributions to parties and politicians were legal but capped, and reporting and disclosure laws for financial donors and recipients were poorly enforced (Yadav, 2011). At first blush, these practices do not make collecting information on lobbying seem like a promising avenue. What makes this a viable strategy, however, is that, even though it was unregulated, lobbying did not suffer from a social desirability bias among the key actors (the business establishment, government officials, and elected officials) involved in the policymaking process.

In both countries, these policy players recognized business lobbying as useful for acquiring policy information and expertise, and saw it as an endeavor that was legitimate and essential for good policymaking and representation (dos Santos and de Costa, 2014; Schneider, 2013; Yadav, 2011). This view was driven significantly by the fact that lobbying is practiced openly and actively in democracies such as the U.S., U.K., and Germany, countries that are held up as aspirational economic models. In this context, the fact that the study was being sponsored by a U.S. university certainly reduced the discomfort associated with reporting lobbying behaviors as well.

These trends on lobbying legitimacy suggest that questions to business interest groups about their lobbying tactics are unlikely to provoke high non-response rates. If this is the case, then they may offer leverage on understanding who responds to corruption-related questions in each country and the relationships across non-responses on different sub-types of corruption, they may account for the bias created by respondents selecting out of responding to each corruption sub-type, and, importantly, they may allow us to study how an individual respondent's lobbying tactics influence her perceptions of corruption in each country. In turn, comparing these results across the two countries may elucidate some of the potential problems that cross-country corruption measures might be hiding.

The surveys were given to high-level officials from organized and formal business interest groups from 2005 to 2006.3 The realized sample sizes were 158 in Brazil and 179 in India. Questions asking groups who they lobbied (executive office, political parties, individual legislators, regional and sectoral caucuses) saw non-response rates of 3.2% and 7.2% in Brazil and India. Questions querying how they lobbied these targets (using technical information, financial contributions, media campaigns, public protests) saw non-response rates of 4.4% and 7.2% in Brazil and India. In contrast, questions asking respondents how a certain form of corruption "had an impact on businesses in your sector" drew high, but varying, non-response rates for different corruption sub-types in both countries. Figure 1 below illustrates these trends for eight different sub-types of corruption. Notably, non-response rates vary substantially across questions on different types of corruption within each country, ranging from 19.6% to 44.3% in Brazil and 12.2% to 52.8% in India.

2. There were, and still are, no existing databases that list all of the organizations involved in lobbying in either country. I put these sampling frames together by consulting and compiling listed organizations from various sources myself. This is the technique that other researchers, including Frye (2002) and the World Bank's WBES and BEEPS survey supervisors, have adopted. See Yadav (2011) for more details on the survey and sampling designs.

3. Please see Yadav (2011) for the exact questions that were used to operationalize these concepts. All were modeled after similar questions asked in the BEEPS 2002 surveys conducted by the World Bank and the EBRD.

Given the low non-response rates for lobbying questions, it is possible to model a group's decision to participate in answering questions about each of the eight corruption sub-types listed in Figure 1 as a function of whether that group used technical information, public protests, media campaigns, or money to lobby policymakers on an issue of vital importance to their members, while controlling for other factors that might potentially influence this choice as well. Corruption scales ranged from 1, indicating no impact on the respondent group's members from that specific type of corruption behavior, to 4, indicating a decisive impact on its members. We can now directly address the question of how a lobby's own tactics — especially whether or not it used money for lobbying — influence its perception of different types of corruption by estimating a series of ordered probit models explaining corruption levels, with a selection stage modeling the choice to respond as a function of resource lobbying tactics.

The outcome variable in each ordered probit model is one of the eight corruption sub-types listed in Figure 1: parliamentary corruption, executive corruption, party corruption, patronage, bureaucratic corruption, judicial corruption, corruption in party campaign finance, and corruption in individual campaign finance. The selection stage models the decision to participate in responding to the question on that specific corruption sub-type as a function of that group's tactical choice to use money, information, or other tactics to lobby policymakers, and a few other theoretically relevant variables.
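As a rough sketch of this estimation strategy, one can approximate a selection-corrected ordered probit in two steps: a probit for the decision to respond, followed by an ordered probit for the reported corruption level that includes the inverse Mills ratio from the first stage. The data and variable names below are synthetic and hypothetical, and the published analysis may estimate the two stages jointly rather than in this simplified two-step fashion.

```python
import numpy as np
import pandas as pd
from scipy.stats import norm
import statsmodels.api as sm
from statsmodels.miscmodels.ordinal_model import OrderedModel

# Synthetic stand-in for the group-level survey data; names are hypothetical.
rng = np.random.default_rng(0)
n = 300
df = pd.DataFrame({
    "money": rng.integers(0, 2, n),      # lobbied with money
    "info": rng.integers(0, 2, n),       # lobbied with technical information
    "group_size": rng.normal(0, 1, n),   # stand-in control variable
})
latent = 0.5 - 0.8 * df["money"] + 0.6 * df["info"] + rng.normal(0, 1, n)
df["responded"] = (latent > 0).astype(int)   # answered the corruption question?
df["corruption"] = np.where(
    df["responded"] == 1,
    np.clip(np.round(2 + df["money"] + rng.normal(0, 1, n)), 1, 4),
    np.nan)                                  # 1-4 scale, missing for non-respondents

# Stage 1: probit for the decision to respond, driven by lobbying tactics.
X_sel = sm.add_constant(df[["money", "info", "group_size"]])
probit = sm.Probit(df["responded"], X_sel).fit(disp=False)
xb = np.asarray(X_sel) @ probit.params
df["mills"] = norm.pdf(xb) / norm.cdf(xb)    # inverse Mills ratio

# Stage 2: ordered probit for reported corruption among respondents,
# with the Mills ratio included as a selection-correction regressor.
resp = df[df["responded"] == 1]
ordered = OrderedModel(resp["corruption"], resp[["money", "info", "mills"]],
                       distr="probit").fit(method="bfgs", disp=False)
print(ordered.params)
```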

Applying this strategy to Brazil and India allows us to see how these results may be similar or different across these countries (i.e., whether non-responses are driven by similar factors and whether the use of money for lobbying has a similar effect on participation and reported corruption levels in the two countries). Below, I highlight a few key findings from such an analysis of parliamentary, executive, bureaucratic, and judicial corruption. The full results from this analysis are available at http://sites.psu.edu/vineetayadav/.

Participation Decisions: Results from the selection stage show that, with the exception of bureaucratic corruption in Brazil, the decision to respond to corruption questions and the decision to report a certain level of corruption were highly correlated in both countries. Resource lobbying tactics seemed to rarely affect the decision to respond to any corruption questions in Brazil.

Figure 1: Binary Non-Response Rates by Corruption Type

Note: Data are from a 2005-2006 survey of high-level officials in business lobbies in Brazil (dark grey) and India (light grey). There were 158 respondents in Brazil and 179 respondents in India. CF denotes campaign finance.


Instead, group-specific features, such as the fate of group members under economic reforms, as well as features of a country's overall business environment, such as transparency in policymaking and the legal status of political contributions, influence these decisions. In contrast to Brazil, Indian business groups lobbying with money were significantly less likely to respond to any type of corruption question, and those using information were significantly more likely to respond to questions on all corruption types except executive corruption. Thus, in India, differences in lobbying tactics led to non-respondent samples that differed systematically from respondents, and these samples differed across corruption types.

Lobbying Tactics and Reported Corruption Levels: Results from the outcome stage of the ordered probit models show that accounting for the effects of resource lobbying tactics on the decision to respond to corruption questions influences the results we obtain regarding the effects of resource tactics on reported corruption levels. Resource tactics now only influence bureaucratic corruption levels in India and have no influence on any reported corruption levels in Brazil.

The target of resource lobbying is a significant predictor of corruption levels in both countries. Groups lobbying legislators in Brazil report lower levels of all types of corruption. Groups in India lobbying parties and the executive report higher levels of parliamentary and bureaucratic corruption. In contrast to these results, models that do not account for the respondent's choice to answer find that groups lobbying with technical information report lower parliamentary and executive corruption levels in Brazil and that those lobbying with money report higher parliamentary corruption in India. Collectively, these results suggest that failing to account for respondents' decisions to respond to corruption questions in cross-country analyses of corruption survey data may introduce significant and variable levels of bias into our results regarding corruption.

In conclusion, the evidence presented here suggests that surveys can continue to serve as an effective way of collecting data to answer specific questions on the corruption agenda when researchers are constrained in the choice of techniques they can use due to small sample sizes. In order to do so, however, it is essential to account for respondents' selection into non-response, especially in cross-country contexts. This is because countries may vary considerably in the extent to which their respondents systematically refuse to answer questions and in the reasons why they refuse. The results also highlight how important it is to disaggregate the concept of corruption into its component sub-types and then deal with the sensitivity issues specific to each sub-type. While the discussion here focuses on lobbying and corruption, these issues are pertinent to many situations where small elite populations must be surveyed and where respondents are being queried on their opinions regarding conceptually fuzzy aggregate terms that are composed of distinct, sensitive individual components, such as the quality of governance or the rule of law in non-democracies.


II. Special Topic

The Ethics of Field Experiments in Comparative Politics

by Scott Desposato
University of Zurich and University of California, San Diego

By now most political scientists are probably aware of the controversy surrounding a recent field experiment conducted in Montana by Stanford and Dartmouth faculty (Michelson, 2014; Murphy, 2014; Stanford, 2014; Otani, 2014; Willis, 2014; Dennison, 2014). The scholars sent mailers to a reported 100,000 Montana voters with content regarding candidates for elected judgeships. Many have expressed surprise and indignation over the study: for its potential impact on an election outcome, for features of the mailer's design, and for possible IRB irregularities. But for this audience, the design should be familiar because it is one that has been applied in dozens of comparative politics experiments all over the world. Indeed, some of those comparative politics studies were illegal, lacked local IRB approval, and never informed, consented, or debriefed subjects.1

Experiments are not new to political science, but most of our history with them involves fully informed and consenting undergraduate students playing simple and safe games in campus laboratories in the United States. Over the last 15 years, however, the number of experiments conducted by political scientists has increased exponentially.2 These experiments can involve hundreds of thousands of subjects, sometimes uninformed and unconsenting, in very different contexts, with treatments that may fundamentally affect subjects' lives.

These changes hold great promise for advancing our research — identifying causal mechanisms, testing theories, and generalizing models across contexts. But they are generating ethical challenges, especially for comparative politics. Many of these challenges are new and unique to social science, and they do not have pre-existing solutions in other fields.

1. Ironically, the recent Montana study differs from most similar experiments in that the mailer noted that it was part of a study. Had the researchers not disclosed their affiliations, we probably never would have heard of the study.

2. See empirical analysis of the number and type of experiments in Morton and Williams (2010), Druckman et al. (2006), and Desposato (2015).


To start a dialogue on these new challenges, I held a conference on ethics in comparative politics experiments at UCSD (NSF #1251510). The goal was to identify critical issues, explore opinion in the field, and propose practical strategies for moving forward. The proceedings are to be published later this year in the Routledge Series in Experimental Political Science.

In this essay, I will briefly review some of the issues we examined. We can organize the most common issues scholars are encountering into several categories: context, local review, deception and consent, and impact. The last three usually manifest together in field experiments.

I. Context

The first set of issues is a byproduct of the globalization of experimental political science. Experiments that have no imaginable risks in the United States can cause problems elsewhere in the world. For example, Kim Dionne, Augustine Harawa, and Hastings Honde's chapter shows how standard compensation plans can cause conflict between subjects and non-subjects in contexts of extreme poverty and inequality. Becky Morton and Jonathan Rogers discuss questions of religion, including how simple economic games may violate religious norms in some countries. Eddie Malesky examines the Common Rule exemption for public officials and asks how we should define “officials” in other contexts. For example, are inherited village leadership positions really public officials with no protections (Dionne, Harawa and Honde, 2015; Morton and Rogers, 2015; Malesky, 2015)?

Based on the discussion at the conference, many of these are “easy” issues in that they would be relatively straightforward to fix through low-cost design changes, and most agreed that we should in fact adapt our designs to local context. For example, to avoid religious problems with treatments that look like gambling (which is forbidden in Islam), Morton and Rogers discuss recasting a game to be about choosing traffic routes. To address tensions over compensation, Dionne, Harawa, and Honde propose strategies for explaining sampling and payoffs to subjects. Other contextual questions are not resolved. For example, if subjects frequently encounter violence in their everyday life, can we expose subjects and enumerators to levels of risk that might be common in their country, but which would be unacceptable back home (Driscoll, 2015)?

II. Local Review

Questions of local review are more complex. Many scholars from the United States and Europe who are conducting experiments overseas do not have any local approval of their projects. Yet some countries have strict rules regarding special research visas, government permission, and local ethical review of protocols. In many cases, these requirements seem designed to prevent any research from ever happening; in other cases they are opportunities to extract rents from scholars. At the conference, we focused on the cases of Brazil, China, Ecuador, Malawi, Mexico, and Vietnam, and found widely varying human subjects rules and opinions on how we should proceed.


Some issues were not controversial. Most agreed that compliance would be appropriate if the procedures were straightforward and reasonable. Similarly, most agreed that some form of local review — even if just informal review by local scholars or collaborators — is always appropriate. One benefit of local review is feedback on whether your intervention is contextually appropriate or has any risks you did not imagine.

But when considering cases where formal procedures are onerous or the government agencies incompetent or corrupt, there was clear disagreement on how we should proceed. Some felt that experiments without foreign permission would be acceptable. Others, including scholars from the developing world, were indignant that first-world researchers would fly in on tourist visas, conduct experiments without permission, and slip out of the country, essentially running “under the radar” experiments. One particularly grey area involves authoritarian regimes — where research rules may be designed to protect the regime rather than the subjects, and where compliance would mean either not doing the research or doing only research that is beneficial to an oppressive regime.

Debate about local review also illustrates some of the differences between political science and other fields. Medical experiments, which are often much higher risk, would never proceed overseas without proof of local review; most medical funding agencies would require proof of local approval of the experiment. Political science experiments are generally extremely low risk to subjects, but they may be perceived as threatening to regimes and could be even harder to get approved than medical studies. For the time being, the National Science Foundation and many of our university IRBs are not requiring any local approval of political science experiments.

One possible strategy is for scholars to test the waters of local review and, if it seems like a bottomless pit, to try working with local academics to develop a reasonable and effective review process. An example of this is the case of Brazil, where the University of São Paulo has established an Ethics Committee that will review foreign scholars' projects.

III. Deception, Consent, and Impact

The other most common issues involve deception, consent, and impact. The first two are enduring questions in bioethics, where the limits of consent and deception are continually being explored and debated (see, for example, Koenig (2014)). In political science, there are also long-standing debates about whether deception is ever appropriate, even in laboratory environments. But more recently, and of greatest concern for comparative politics, these three issues manifest together in field experiments.

Field experiments are interventions conducted “in the real world”, where some feature of the natural environment is manipulated by researchers. Often subjects have no idea that they are participating in a study. These designs are especially promising because they allow us to observe the effect of the treatment in a real application, not in an artificial laboratory setting where many features of the experiment may not generalize to the real world. At their best, field experiments provide more generalizable knowledge about how our treatments work and could aid in policy implementation and theory building.

My impression is that there is a significant divide in the field on the ethics of many field experiments. Proponents note that the treatments usually carry only a trivially small risk of harm to individual subjects and are typical of everyday experiences. For example, it is difficult to imagine how a political mailer from a researcher poses a serious risk, especially when citizens receive dozens or even hundreds during campaigns. Other experiments might involve fake job applications to businesses or constituency mail to elected officials. But what is one extra resume or letter to an office that might deal with hundreds daily? And when field experiments involve consequential goods, like health care or clean water, most experiments do not violate the standard-of-care principle: the control group's situation is never made worse; it simply does not receive the treatment. Further, random assignment of public goods is probably fairer than letting a corrupt political process perform the assignment.

Others are concerned with several features of field experiments. One position holds informed consent as paramount; thus any manipulation of subjects without their consent is simply unacceptable. A second source of tension is that while individual risks may be low, aggregate costs and risks may be large. For example, suppose we deceive a subject into just fifteen minutes of looking at a fake resume as part of a study on employment. If we have 5,000 subjects, that means we have used 31 weeks of deception-induced full-time labor.3 A third is that even when risks to subjects are trivially small, people often resent being manipulated without their permission for scholars' own purposes. Angering and upsetting subjects can be considered a form of harm to individuals. Field experiments might also be considered ethically questionable if they harm the broader discipline, for example, if a study results in all experiments being banned by an unfriendly government.4
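Footnote 3 spells out the aggregate-cost arithmetic; as a quick restatement of Desposato's figures (nothing here is new data, just the same calculation made explicit):

```python
subjects = 5_000
minutes_per_subject = 15

total_hours = subjects * minutes_per_subject / 60  # 1,250 hours of subjects' time
work_weeks = total_hours / (8 * 5)                 # 8-hour days, 5-day work weeks
print(f"{work_weeks:.2f} full-time work weeks")    # -> 31.25
```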

Field experiments in the study of illegal activities, including corruption and vote buying, are particularly controversial, as treatments may involve breaking local law.

3. (5,000 subjects × 15 minutes per subject) / (60 minutes per hour × 8 hours per day × 5 work days per week) ≈ 31 weeks.

4. At the same time, too much concern with not making people or governments upset might stifle scholarship and violate norms of academic freedom. But the point illustrates that the choices we make about individual projects can affect our colleagues' research as well.


Scholars have solicited illegal activity by subjects, and committed minor crimes, as part of field experiments. One perspective is that these are critically important topics that deserve our attention, and that deceptive — and sometimes illegal — designs are the only way to study them. For others, these experiments are simply wrong.

Especially disturbing to some are interventions in real elections or other political processes, where a political scientist's research has the potential to change vote shares or even an election outcome — potentially affecting millions of subjects and bystanders without their consent. As Zimmerman (2015) points out, elections are always a zero-sum game, so any impact of a treatment can cause harm. In addition, some are concerned that intervention in real politics may compromise our objectivity and integrity as scientists. On the other hand, one could argue that many campaign field experiments are normatively good — for example, if a political mailer provides factual information to voters or increases turnout, can we say that the intervention is a normative good, regardless of its impact on election results?

IV. Discussion

These are just a few of the perspectives on field experiments. My own opinion is that we can dodge, or at least minimize, many of the controversial cases with modest design changes. As I have argued elsewhere, the most critical issue is that of informed consent. Most controversies fade away when those affected by the study knowingly agree to participate. Some field experiments have designs that are amenable to informed consent, and whenever possible, we should request it. For cases where consent is impractical or ruins a design, there are alternative forms of consent that one might employ (Humphreys, N.d.). One can also minimize problems by treading lightly. For example, one could conduct a power analysis and minimize sample size. Why treat 100,000 subjects when 5,000 would be sufficient for hypothesis testing? We might also adopt a norm that subjects in field experiments be compensated for their time — either directly, or through payment to some public goods provider. Finally, a norm of post-study disclosure to all affected parties provides stronger incentives for researchers to anticipate problems and build solutions into designs.
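To illustrate the sample-size point, here is a minimal power calculation for a two-arm field experiment with a binary outcome such as turnout. The baseline rate and effect size are invented for the example, and the closed-form two-proportion formula is a standard approximation, not anything specific to the Montana study:

```python
from statistics import NormalDist  # standard library, no dependencies

def n_per_arm(p_control, p_treat, alpha=0.05, power=0.8):
    """Approximate sample size per arm for a two-sided two-proportion z-test."""
    z = NormalDist().inv_cdf
    z_a = z(1 - alpha / 2)   # critical value for the two-sided test
    z_b = z(power)           # quantile for the desired power
    p_bar = (p_control + p_treat) / 2
    num = (z_a * (2 * p_bar * (1 - p_bar)) ** 0.5
           + z_b * (p_control * (1 - p_control)
                    + p_treat * (1 - p_treat)) ** 0.5) ** 2
    return num / (p_treat - p_control) ** 2

# Hypothetical numbers: 45% baseline turnout, a 4-point treatment effect
print(round(n_per_arm(0.45, 0.49)))  # -> 2443 per arm (~5,000 total, not 100,000)
```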

There are many other issues I do not have space todiscuss here, some of which are covered in the book. I’llconclude with a few general recommendations to con-sider before conducting your next experiment.

1. Risk is not just about the expected physical harm to an individual subject. Thinking ethically about experiments means thinking about individual subjects, aggregate costs, bystanders, and enumerators. Further, even if the study is safe and no one gets hurt, there can be costs to the discipline from making subjects and governments angry at us. As we have learned recently, people can get very upset about experiments that seem safe and innocuous.

2. Be honest about the incentives of the profession. Acknowledge and accept that we all have a conflict of interest when it comes to ethics and research with human subjects. Protecting subjects and others affected by interventions almost always makes research harder. At the same time, the discipline rewards the studies that are bigger and more exotic. Our own enthusiasm for the research and the professional success it may bring can nudge us toward ignoring real issues. The stakes are highest for those in the most precarious career situations: junior faculty and graduate students. This means that senior faculty should provide leadership on how to ethically conduct experiments, through mentoring, education, and example.

3. Do not use ethics to reject the experimental revolution. Experiments hold great promise for the discipline and should be embraced as one of many tools for studying comparative politics. Most comparative experiments are safe, legal, and have effectively zero risk. The problems we've encountered are rare and, in many cases, easily avoidable.

4. Own it. Do not outsource ethical questions to a third party. I have heard multiple scholars, when asked about the ethics of a project, respond, “Well, it got through IRB!” IRBs are imperfect institutions largely created to protect university access to federal funds, and IRB approval is not perfect absolution. Responsibility for ethics rests with the principal investigator.

5. Be part of the dialogue. There is increasing interest in ethics across the discipline, and there will almost certainly be several roundtables on ethics at Midwest and APSA. These may lead to new APSA Ethics Guidelines for research. This dialogue will affect all of us, and you should be a part of it.

References

Dennison, Mike. 2014. “Montana's top political cop wants action, apology from Dartmouth and Stanford.” The Billings Gazette.

Desposato, Scott. 2015. Introduction. In Ethics and Experiments: Problems and Solutions for Social Scientists and Policy Professionals, ed. Scott Desposato. Routledge Studies in Experimental Political Science. London, UK: Routledge.

Dionne, Kim Yi, Augustine Harawa and Hastings Honde. 2015. The Ethics of Exclusion when Experimenting in Impoverished Settings. In Ethics and Experiments: Problems and Solutions for Social Scientists and Policy Professionals, ed. Scott Desposato. Routledge Studies in Experimental Political Science. London, UK: Routledge.

Driscoll, Jesse. 2015. Prison States and Games of Chicken. In Ethics and Experiments: Problems and Solutions for Social Scientists and Policy Professionals, ed. Scott Desposato. Routledge Studies in Experimental Political Science. London, UK: Routledge.

Druckman, James N., Donald P. Green, James H. Kuklinski and Arthur Lupia. 2006. “The growth and development of experimental research in political science.” American Political Science Review 100(4):627–635.

Humphreys, Macartan. N.d. “Manipulation and consent.” Unpublished Manuscript.

Koenig, Barbara. 2014. “Have we asked too much of consent?” The Hastings Center Report 44(4):33–34.

Malesky, Edward. 2015. Who is a Public Official? In Ethics and Experiments: Problems and Solutions for Social Scientists and Policy Professionals, ed. Scott Desposato. Routledge Studies in Experimental Political Science. London, UK: Routledge.

Michelson, Melissa R. 2014. “Messing with Montana: Get-out-the-vote experiment raises ethics questions.” Western Political Science Association.

Morton, Rebecca B. and Kenneth C. Williams. 2010. Experimental Political Science and the Study of Causality: From Nature to the Lab. New York: Cambridge University Press.

Morton, Rebecca and Jonathan Rogers. 2015. Religion, Experiments, and Ethical Concerns. In Ethics and Experiments: Problems and Solutions for Social Scientists and Policy Professionals, ed. Scott Desposato. Routledge Studies in Experimental Political Science. London, UK: Routledge.

Murphy, Katy. 2014. “Stanford, Dartmouth sidestep legal action against researchers' election mailers.” Mercury News.

Otani, Akane. 2014. “Stanford and Dartmouth's bungled voter experiment spans three states.” Businessweek.

Stanford. 2014. “Stanford responds to voter mailer.”

Willis, Derek. 2014. “Professors' research project stirs political outrage in Montana.” New York Times.

Zimmerman, Brigitte. 2015. Ethical Concerns Surrounding Research Interventions in Democratic Processes. In Ethics and Experiments: Problems and Solutions for Social Scientists and Policy Professionals, ed. Scott Desposato. Routledge Studies in Experimental Political Science. London, UK: Routledge.

III. Dataset

Integrating Data on Ethnicity, Geography, and Conflict: The Family of Ethnic Power Relations Datasets 2014

by Nils-Christian Bormann, Manuel Vogt, & Lars-Erik Cederman
ETH Zürich

Since the early 1990s, empirical research on the consequences of ethnic diversity has been the subject of much academic debate among economists (Alesina, Baqir and Easterly, 1997), comparativists (Ordeshook and Shvetsova, 1994), and IR scholars (Denny and Walter, 2014). Moving away from highly aggregated measures of ethnic diversity, such as the ethno-linguistic fractionalization (ELF) index, political scientists have turned to more detailed group-level datasets that cover linguistic, religious, and racial ethnic groups (Gurr, 1995). This essay introduces the latest release of the Ethnic Power Relations (EPR) data, version 2014, which includes information on over 800 unique ethnic groups in 165 countries between 1946 and 2013. Consisting of five constituent datasets, the EPR Family 2014 provides time-varying information on ethnic groups' access to executive power, relative group size, geographic settlement patterns and involvement in intrastate wars, transnational linkages, and sub-group dimensions such as language and religion. While all variables are coded at the group level, we also provide country-level versions of the data that can be used to study outcomes such as democratization. In addition, the 2014 version features several novelties, such as a new variable on territorial autonomy and, for the first time, comprehensive documentation in the form of the EPR Atlas. In the following, we explain the outstanding features of the EPR Family 2014 by tracing its evolution from earlier versions and by comparing it to alternative data sources on ethnicity.

Figure 1: The structure of the EPR Dataset Family 2014

Note: Bold boxes designate EPR Family datasets; dashed boxes identify datasets to which EPR is linked.

Providing a list of politically relevant ethnic groups, the EPR Core dataset is at the center of the EPR data universe (see Figure 1). EPR defines ethnicity as a subjectively experienced sense of commonality based on a belief in common ancestry and shared culture. An ethnic group is considered politically relevant if at least one political organization has claimed to represent its interests at the national level or if its members are subjected to state-led political discrimination (Cederman, Wimmer and Min, 2010).1 By broadening the notion of “political relevance” to include majority and dominant groups, the EPR group list is more inclusive than the Minorities at Risk (MAR) dataset by Gurr (1995), and it accounts for changes in the set of politically relevant groups when new actors make political claims along new or altered ethnic identity lines.2 For example, EPR considers Whites, Blacks, and Native Americans to be the only politically relevant groups in the United States until 1965. Afterwards, Hispanics, Asian Americans, and Arab Americans become politically relevant as several civil society organizations that make claims on behalf of these groups are founded in the wake of the anti-discrimination legislation of the Johnson administration.

The central feature of all EPR versions is the ordinal scale of ethnic groups' access to executive power, which distinguishes three major categories: (1) one group controls the executive; (2) representatives of an ethnic group share government power; and (3) members of an ethnic group are excluded from the highest ruling body of a state (e.g., a democratic cabinet, a military junta, or a royal court). Each of these three main categories includes sub-classifications of an ethnic group's power status (see Table 1 and the accompanying textbox).

Access to sub-national governments such as federal states does not count towards this power scale, which focuses solely on the access of group representatives to the central government. This classification of groups' power status has undergone an important change in the 2014 version. Previous EPR versions coded regional autonomy within the ordinal power scale for groups excluded from the central government, such as the Naga in India, but not for groups included in the government. In contrast, the EPR Core 2014 dataset removes the regional autonomy category from the relative power scale and provides a new variable that assesses groups' access to executive power at the regional level, independent of their national power status.

1. Based on collaboration with researchers at UCLA, the original version of EPR v. 1.0 covers the period 1946-2005. Collected by the International Conflict Research Group at ETH, the direct predecessor to the current release is the dataset EPR-ETH 2.0, which extends coverage until 2009. A separate version assembled by researchers from UCLA (labeled EPR version 3) also builds directly on the first release of EPR and extends coverage to 2010. While this version introduces information about the “cultural contents” of ethnic groups, it does not offer any spatial extension or any compatibility with the EPR Family 2014.

2. The new AMAR dataset by Birnir et al. (2014) lists more than 1,200 “socially relevant” ethnic groups and provides various sub-groups. However, additional variables are only coded for a small random sample of the larger universe of groups.


Table 1: Power Status, 2013

Power status         N     Percent
One-group rule
  Monopoly          24      3.4%
  Dominant          42      6.0%
Power-sharing
  Senior partner    83     11.9%
  Junior partner   164     23.5%
Excluded
  Powerless        312     44.7%
  Discriminated     67      9.6%
  Self-exclusion     6      0.9%
Total              698      100%

EPR Power Categories

Monopoly: Only representatives from one ethnic group are included in the executive (e.g., Sunni Arabs in Saudi Arabia).

Dominant: The government consists predominantly of members from one ethnic group, but some token members of other groups are included without any real influence (e.g., Sinhalese in Sri Lanka).

Senior Partner: A group shares power but its representatives have more influence than its partners (e.g., Ovambo in Namibia).

Junior Partner: A group shares power but its representatives have less power than its partners (e.g., Kavango in Namibia).

Powerless: A group lacks representation in the executive (e.g., Bosniaks in Croatia).

Discriminated: The state actively and intentionally targets members of an ethnic group in the political realm (e.g., Sahrawis in Morocco).

Self-exclusion: This category describes ethnic groups that control some territory of the state and have declared this region to be independent from the central government (e.g., Armenians in Nagorno-Karabakh).

Representatives of the Punjabi-Sikhs, for example, were included in most of India's ruling coalitions, and they have politically dominated the Punjab since the redrawing of its boundaries in 1966. In earlier versions of EPR, the Punjabi-Sikhs would simply have been coded as being included in the central government, but the 2014 release now codes them as both included in a power-sharing government and controlling their own state. This change enables researchers to investigate governmental and territorial power-sharing for a large number of ethnic groups in pre- and post-war settings (Cederman et al., 2015). The de facto coding of access to executive power at the national and sub-national levels also sets the EPR data apart from data sources that provide information on the de jure provisions of peace agreements in post-conflict states (Hartzell and Hoddie, 2007).
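To illustrate how the status scale is typically used in applied work, here is a minimal pandas sketch that collapses the seven status codes into the standard excluded/included distinction and computes the share of a country's politically relevant population that is excluded in each year. The rows are toy values, and the column names (gwid, year, statusname, size) only approximate the released EPR Core layout, so they should be checked against the codebook:

```python
import pandas as pd

# Toy rows in (roughly) EPR Core's group-year layout; column names and
# sizes are illustrative approximations, not actual EPR values
epr = pd.DataFrame({
    "gwid": [750, 750, 750, 345, 345],   # country codes (e.g., India, Yugoslavia)
    "year": [2013, 2013, 2013, 1990, 1990],
    "group": ["Hindi", "Punjabi-Sikhs", "Naga", "Serbs", "Albanians"],
    "statusname": ["SENIOR PARTNER", "JUNIOR PARTNER", "POWERLESS",
                   "DOMINANT", "DISCRIMINATED"],
    "size": [0.41, 0.02, 0.002, 0.36, 0.08],  # group share of country population
})

# Collapse the ordinal scale into the excluded/included split
excluded = {"POWERLESS", "DISCRIMINATED", "SELF-EXCLUSION"}
epr["excluded_pop"] = epr["size"].where(epr["statusname"].isin(excluded), 0.0)

# Country-year share of the politically relevant population that is excluded
sums = epr.groupby(["gwid", "year"])[["excluded_pop", "size"]].sum()
sums["excluded_share"] = sums["excluded_pop"] / sums["size"]
print(sums["excluded_share"])
```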

A unique feature of the EPR Family is the geo-coding of ethnic groups' settlement patterns in the GeoEPR dataset, which was introduced by Wucherpfennig et al. (2011). Providing shape files for the vast majority of ethnic groups that are territorially concentrated, GeoEPR offers digital, time-variant maps of ethnic groups' home territories. Beyond extending the temporal scope to 2013, GeoEPR 2014 introduces the “statewide” category, which encompasses groups with a presence in virtually every part of a country. These groups previously fell into the “dispersed” category, which now only comprises the subset of minority groups that live in different parts of the country. Beyond serving as useful graphical illustrations, the GeoEPR maps offer the basis for measures of an ethnic group's territorial concentration and open up new avenues for data collection at the ethnic group level. Cederman, Weidmann and Gleditsch (2011), for example, use the GeoEPR maps to derive estimates of group inequality by overlaying them with geo-referenced income data (Nordhaus, 2006). Figure 2 shows an example of the GeoEPR maps: the ethnic geography of Yugoslavia before its dissolution in 1990. GeoEPR also traces changes in groups' settlement patterns due to changes in country borders, as in Yugoslavia, or mass-scale migration within the same country, as occurred after the Russian invasion of Georgia in 2008.

Figure 2: EPR groups in Yugoslavia in 1990 with inequality estimates (left) and G-ECON income data (right)
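The overlay step behind such inequality estimates can be sketched with geopandas. The file names and the income layer below are placeholders (in the published work the income data come from Nordhaus's G-Econ grid), so this shows the shape of the computation rather than a replication:

```python
import geopandas as gpd

# Placeholder inputs: group polygons and a polygonized income grid
groups = gpd.read_file("geoepr_groups.shp")   # assumed columns: group, geometry
income = gpd.read_file("income_cells.shp")    # assumed columns: income_pc, geometry

# In real work, reproject both layers to an equal-area CRS before using .area
pieces = gpd.overlay(groups, income, how="intersection")
pieces["area"] = pieces.geometry.area

# Area-weighted mean per-capita income over each group's settlement area
pieces["w_income"] = pieces["area"] * pieces["income_pc"]
sums = pieces.groupby("group")[["w_income", "area"]].sum()
group_income = sums["w_income"] / sums["area"]

# Relative group wealth (group mean over the overall mean), a common
# ingredient of horizontal-inequality measures
print(group_income / group_income.mean())
```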

The conflict coding of the EPR Family establishes a link to the UCDP/PRIO Armed Conflict Database (ACD) (Gleditsch et al., 2002; Themnér and Wallensteen, 2014).3 The ACD2EPR data classify rebel groups as ethnic if they recruit fighters from a particular ethnic group and make public claims on behalf of that group. Coding individual rebel organizations rather than entire civil wars, the ACD2EPR data offer a more nuanced classification of conflicts, which enables conflict researchers to compare multiple rebel groups that fight on behalf of the same ethnic group, such as the Karen National Union and the Karen National United Party in Burma, with rebel groups that unite several ethnic groups against the government, such as the African National Congress during South Africa's Apartheid regime, which united Asians, Blacks, and Coloreds (Wucherpfennig et al., 2012).

3. For UCDP, see the Uppsala Conflict Data Program (http://www.pcr.uu.se/research/UCDP/). PRIO stands for the Peace Research Institute Oslo (http://www.prio.org/).

Going beyond the confines of the nation-state, the newly integrated Transborder Ethnic Kin (EPR-TEK) dataset identifies all EPR groups with settlements in at least two countries through nominal matching. Thus, groups in different countries are coded as transborder kin if they share the same ethnographic name, including synonyms. The Kurds who live in Turkey, Iran, Iraq, and Syria exemplify such a transnational ethnic group; about half of all groups in the EPR dataset feature a cross-border link. In contrast to the MAR dataset, which only codes whether groups are linked to other groups in neighboring states, the combination of the EPR Core and EPR-TEK data also provides information on the political status of transborder ethnic kin groups. Researchers can therefore differentiate whether groups in one country can obtain support from a transborder group that holds government power or from excluded groups, and thereby trace more specific mechanisms (Cederman et al., 2013).
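In its simplest form, nominal matching of this kind reduces to grouping the EPR list by ethnographic name and flagging names that appear under more than one country code. The sketch below uses hypothetical column names and ignores the synonym handling that the real coding performs:

```python
import pandas as pd

# Toy group list; 'gwid' is a country code, 'group' an ethnographic label
epr = pd.DataFrame({
    "gwid": [640, 630, 645, 652, 220],
    "group": ["Kurds", "Kurds", "Kurds", "Kurds", "Bretons"],
})

# A group has transborder ethnic kin if its name occurs in 2+ countries
kin_counts = epr.groupby("group")["gwid"].nunique()
epr["has_tek"] = epr["group"].map(kin_counts) >= 2
print(epr)  # the four Kurdish segments are flagged; the Bretons are not
```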

The second novel addition to the EPR Dataset Family is the Ethnic Dimensions (EPR-ED) data, which identify the linguistic, religious, and racial segments of all EPR groups.4 This is the first dataset that codes both multiple cleavage dimensions and several segments within a cleavage dimension for ethnic groups. Hence, this approach refrains from representing ethnic groups as always being exclusively linguistic, religious, or racial. In contrast, other researchers often link ethnic groups to one primary category, such as “ethno-linguistic” or “ethno-religious” (Sorens, 2011), thus treating these groups as internally homogeneous and one-dimensional. Drawing on the Ethnologue database on languages (Lewis, 2009), the Joshua Project's coding of religious identities (2011), and many secondary sources, the EPR-ED data enable researchers to go beyond the ethnic group level by constructing estimates of within-group fractionalization and dyadic between-group differences along any combination of linguistic, religious, or racial cleavages.
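For readers who want the fractionalization measure spelled out: given the population shares s_i of a group's linguistic (or religious, or racial) segments, the standard Herfindahl-based index is 1 − Σ s_i², the probability that two randomly drawn members belong to different segments. A minimal implementation, with invented shares:

```python
def fractionalization(shares):
    """1 minus the sum of squared shares: the probability that two
    randomly drawn members fall into different segments."""
    total = sum(shares)  # normalize, in case shares do not sum to 1
    return 1 - sum((s / total) ** 2 for s in shares)

# Invented example: a group split into three linguistic segments
print(fractionalization([0.6, 0.3, 0.1]))  # -> 0.54
```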

The individual components of the EPR dataset family can be downloaded from the GROWup web portal at http://growup.ethz.ch. In addition to featuring a Public Front-End (PFE) for the visual inspection of ethnic groups in time and space, the system provides a user-friendly Research Front-End (RFE), which allows users to assemble and download customized panel datasets composed of EPR-related variables. The data offered via the RFE are pre-aggregated to the level of group-years and country-years. Formatted to facilitate statistical analysis, the selection of variables includes conflict onset and incidence dummies, as well as various peace-years variables and other temporally defined conflict indicators.5

4. The term “race” here refers to ethnic groups' origins in particular world regions, such as Europe, Sub-Saharan Africa, Oceania, etc. These regional origins — expressed at the individual level through certain phenotypical markers (particularly skin color) — have become relevant as social categories in the context of the European colonization of the world and the related process of racial classification.

In summary, the EPR Dataset Family 2014 provides scholars with a wide range of instruments to improve their understanding of politicized ethnicity in general, and ethnic conflict in particular. Thanks to their extensive scope, the integrated datasets facilitate the study of a variety of questions. However, much work remains to be done. In this regard, we want to highlight three promising directions for future data collection. First, it is important to identify the actual political actors that are at the roots of the collective action undertaken by ethnic groups. For this purpose, we plan to launch a new data collection project, the EPR-Organizations Dataset (http://www.r4d.epr.ethz.ch), which will offer information on ethnically based political organizations in all countries of the world. The goal is to break up the monolithic concept of ethnic groups by taking into account the diverse agendas and claims of different organizational representatives, and to study ethnic mobilization processes. Second, broadening the spectrum of political violence covered by EPR beyond civil war, upcoming extensions of the ACD2EPR dataset will link EPR groups to all UCDP actors, including those involved in one-sided violence, non-state conflict, and inter-state war. Finally, going beyond the nominal coding of EPR-TEK, future versions of the dataset will adhere to a claim-based coding that allows for changes in transnational ethnic identifications.

References

Alesina, Alberto, Reza Baqir and William Easterly. 1997. “Public goods and ethnic divisions.” Quarterly Journal of Economics 114(4):1243–1284.

Birnir, Jóhanna K., Jonathan Wilkenfeld, James D. Fearon, David D. Laitin, Ted Robert Gurr, Dawn Brancati, Stephen M. Saideman, Amy Pate and Agatha S. Hultquist. 2014. “Socially relevant ethnic groups, ethnic structure, and AMAR.” Journal of Peace Research 52(1):110–115.

Cederman, Lars-Erik, Andreas Wimmer and Brian Min. 2010. “Why do ethnic groups rebel? New data and analysis.” World Politics 62(1):87–119.

Cederman, Lars-Erik, Kristian Skrede Gleditsch, Idean Salehyan and Julian Wucherpfennig. 2013. “Transborder ethnic kin and civil war.” International Organization 67(2):389–410.

Cederman, Lars-Erik, Nils B. Weidmann and Kristian Skrede Gleditsch. 2011. “Horizontal inequalities and ethnonationalist civil war: A global comparison.” American Political Science Review 105(3):478–495.

Cederman, Lars-Erik, Simon Hug, Andreas Schädel and Julian Wucherpfennig. 2015. “Territorial autonomy in the shadow of conflict: Too little, too late?” American Political Science Review.

Denny, Elaine K. and Barbara F. Walter. 2014. “Ethnicity and civil war.” Journal of Peace Research 51(2):199–212.

Gleditsch, Nils Petter, Peter Wallensteen, Mikael Eriksson, Margareta Sollenberg and Håvard Strand. 2002. “Armed conflict 1946-2001: A new dataset.” Journal of Peace Research 39(5):615–637.

Gurr, Ted Robert. 1995. Minorities at Risk: A Global View of Ethnopolitical Conflicts. United States Institute of Peace.

Hartzell, Caroline A. and Matthew Hoddie. 2007. Crafting Peace: Power-Sharing Institutions and the Negotiated Settlement of Civil Wars. University Park, PA: Penn State Press.

Lewis, M. Paul, ed. 2009. Ethnologue: Languages of the World. Vol. 9. Dallas, TX: SIL International.

Nordhaus, William D. 2006. “Geography and macroeconomics: New data and new findings.” Proceedings of the National Academy of Sciences of the United States of America 103(10):3510–3517.

Ordeshook, Peter C. and Olga V. Shvetsova. 1994. “Ethnic heterogeneity, district magnitude, and the number of parties.” American Journal of Political Science 38(1):100–123.

Sorens, Jason. 2011. “Mineral production, territory, and ethnic rebellion: The role of rebel constituencies.” Journal of Peace Research 48(5):571–585.

Themnér, Lotta and Peter Wallensteen. 2014. “Armed conflicts, 1946–2013.” Journal of Peace Research 51(4):541–554.

Wucherpfennig, Julian, Nils B. Weidmann, Luc Girardin, Lars-Erik Cederman and Andreas Wimmer. 2011. “Politically relevant ethnic groups across space and time: Introducing the GeoEPR dataset.” Conflict Management and Peace Science 28(5):423–437.

Wucherpfennig, Julian, Nils W. Metternich, Lars-Erik Cederman and Kristian Skrede Gleditsch. 2012. “Ethnicity, the state, and the duration of civil war.” World Politics 64(1):79–115.

5. The EPR 2014 data are available in their raw format at http://www.icr.ethz.ch/data/index. The new Research Front-End that allows customized downloads will go online in March 2015.


Editors and Contributors

Matt Golder

Matt Golder is an Associate Professor in the Department of Political Science at The Pennsylvania State University. He received his Ph.D. in 2004 from New York University. His research looks at how political institutions affect democratic representation, with a particular focus on electoral rules and party systems. In addition to articles in leading journals, such as the American Journal of Political Science, the British Journal of Political Science, the Journal of Politics, and Political Analysis, he has also published a textbook on comparative politics, Principles of Comparative Politics. He is currently working on two research projects. The first looks at negative campaigning in a multi-party context, while the second examines the relationship between religious participation, economic development, and political attitudes. In addition to serving as chair of APSA's section on Representation and Electoral Systems (2011-2013), he is also a member of the executive board for the Making Electoral Democracy Work project led by André Blais at the University of Montreal and the advisory board for the Electoral Integrity Project led by Pippa Norris at Harvard University. More information can be found at his website and on his Google scholar profile.

Sona N. Golder

Sona Golder is an Associate Professor in the Department of Political Science at The Pennsylvania State University. She received her Ph.D. in 2004 from New York University. She studies political institutions, with a particular interest in coalition formation. Her early research and first book, The Logic of Pre-Electoral Coalition Formation, examines the determinants and consequences of electoral alliances in parliamentary democracies. She is currently working on a book project that draws together her more recent work on the post-election government formation process in Europe. Her research has been published in many of the discipline's leading journals, including the American Journal of Political Science, the British Journal of Political Science, the Journal of Politics, and Political Analysis. She has also published a textbook on comparative politics, Principles of Comparative Politics. She is also involved in the women in methods group: she was the organizer and host for the 4th Annual Visions in Methodology (VIM) Conference, she serves as a VIM mentor for female graduate students and junior faculty, and she is a member of the diversity committee for APSA's Political Methodology Section. More information can be found at her website and on her Google scholar profile.


Jesse Driscoll

Jesse Driscoll is an Assistant Professor at the School of International Relations and Pacific Studies at the University of California, San Diego. He received his Ph.D. in 2009 from Stanford University and was a Post-Doctoral Academy Fellow at the Harvard Academy for International and Area Studies. He has conducted extensive fieldwork in the Former Soviet Union, and is currently overseeing a number of research projects looking at how individuals living in conditions of acute insecurity respond to survey questions about their behavior. His work has appeared in the Journal of Conflict Resolution, Research and Politics, and the Journal of Survey Statistics and Methodology. His book Warlords and Coalition Politics in Post-Soviet States is forthcoming from Cambridge University Press. His research has been funded by the National Science Foundation.

David Laitin

David Laitin is the James T. Watkins IV and Elise V. Watkins Professor in the Department of Political Science at Stanford University. He conducts research on political culture, ethnic conflict, and civil war, and his field expertise spans Somalia, Nigeria, Catalonia, Estonia, and France. His research has been published in all of the discipline's leading journals. He has also published books with Cambridge University Press and Oxford University Press. He has been a recipient of fellowships from the Howard Foundation, the Rockefeller Foundation, the Guggenheim Foundation, and the Russell Sage Foundation. He is an elected member of the American Academy of Arts and Sciences and the National Academy of Sciences. More information can be found at his website.

Graeme Blair

Graeme Blair is a pre-doctoral fellow at Columbia University through Experiments in Governance and Politics and the Earth Institute. He will receive his Ph.D. in 2015 from Princeton University and will be an Assistant Professor in the Department of Political Science at UCLA from 2016. His research examines both why civilians support armed groups and why groups living near valuable assets like oil fields are often able to force the state to make policy concessions and fiscal transfers. In addition, he has developed new methods for sensitive survey questions. His work has appeared in the American Political Science Review, the American Journal of Political Science, and Political Analysis. He has also won the 2013 Pi Sigma Alpha Award for best paper at the Midwest Political Science Association. More information can be found at his website and on his Google scholar profile.


Daniel Gingerich

Daniel Gingerich is an Associate Professor in the Department of Politics at the University of Virginia. He also directs the Quantitative Collaborative, a University-wide institute designed to disseminate new advances in the quantitative social sciences and foster greater interdisciplinary collaboration. He received his Ph.D. in 2007 from Harvard University. His research focuses on understanding the causes and consequences of corruption and clientelism in Latin America and the Caribbean, as well as on developing new methodologies to study these phenomena. He has published articles in journals such as the British Journal of Political Science, Political Analysis, and the Quarterly Journal of Political Science. His book, Political Institutions and Party-Directed Corruption in South America: Stealing for the Team, was published by Cambridge University Press in 2013. His research has been funded by the National Science Foundation. More information can be found at his website.

Elizabeth C. Carlson

Elizabeth C. Carlson is an Assistant Professor in the Department of Political Science and the Program on African Studies at The Pennsylvania State University. She received her Ph.D. in 2011 from the University of California, Los Angeles and has held pre- and post-doctoral fellowships at Stanford and Yale. She uses survey and experimental methods to study political behavior and citizen preferences in Africa's new democracies, as well as how citizen behavior shapes government performance and accountability. Her work is forthcoming in World Politics and has been funded by the National Science Foundation. She is a member of Experiments in Governance and Politics. More information can be found at her website and on her Google scholar profile.

Vineeta Yadav

Vineeta Yadav is an Assistant Professor in the Department of Political Science at The Pennsylvania State University. She received her Ph.D. in 2007 from Yale University. Her research focuses on the effects of institutions on economic development, with a particular emphasis on how institutions influence lobbying, corruption, and judicial empowerment. In addition to several articles, she has also published two books: Political Parties, Business Groups, and Corruption in Developing Countries and Democracy, Electoral Systems and Judicial Empowerment in Developing Countries. A third book, Corruption in Dictatorships, is forthcoming with Cambridge University Press. She is currently working on two new research projects. The first examines the origins of commercial courts and their effect on economic development and civic rights. The second looks at how the evolution of political learning and support influences institution building and democratic consolidation. More information can be found at her website and on her Google scholar profile.


Scott Desposato

Scott Desposato is an Associate Professor in the Department of Political Science at the University of California, San Diego. Before coming to San Diego, he was an Assistant Professor at the University of Arizona, a Harvard Academy Scholar, and a Research Fellow at Princeton University. He received his Ph.D. in 2001 from the University of California, Los Angeles. His general research interests include democratic institutions, campaigning, mass behavior, and political methodology. Specific projects have examined redistricting in the United States, electoral rules and federalism in Brazil, party-switching by politicians, and statistical methods for studying legislatures. His work has appeared in the discipline's leading journals such as the American Journal of Political Science, the American Political Science Review, and the Journal of Politics. In 2013, he organized the NSF-funded Ethics in Comparative Politics Experiments Conference. More information can be found at his website and on his Google scholar profile.

Nils-Christian Bormann

Nils-Christian Bormann is a post-doctoral researcher in the International Conflict Research (ICR) group at ETH Zurich. Prior to taking his position with the ICR, he was a visiting researcher at the University of Pittsburgh's Department of Political Science, where he was funded by a Doc.Mobility Grant from the Swiss National Science Foundation. He received his Ph.D. in 2014 from ETH Zurich. His research interests include ethnic coalitions and power-sharing, civil wars, ethnic conflict, democratization, electoral rules, and spatial methods. His work has appeared in Electoral Studies. More information can be found at his website.

Manuel Vogt

Manuel Vogt is a post-doctoral researcher in the International Conflict Research (ICR) group at ETH Zurich. He received his Ph.D. in 2013 from ETH Zurich. For the past five years, he has managed the EPR-ETH dataset and coordinated the two updates of the dataset. His research interests include ethnic conflict, (post-conflict) democratization, electoral institutions and party systems, and Latin American and African politics. His work has appeared in Latin American Politics and Society and in several edited volumes. More information can be found at his website.

Lars-Erik Cederman

Lars-Erik Cederman is a Professor of International Conflict Research in the Center for Comparative and International Studies at ETH Zurich. Before coming to Zurich, he was an Associate Professor at Harvard University, an Assistant Professor at the University of California, Los Angeles, and a University Lecturer and Fellow at Somerville College, Oxford University. He received his Ph.D. in 1994 from the University of Michigan. His research interests include computational modeling, international relations theory, nationalism, integration and disintegration processes, and historical sociology. He has published articles in journals such as the American Political Science Review, International Organization, and Proceedings of the National Academy of Sciences. More information can be found at his website and on his Google scholar profile.


Announcements

• REPAL 2015 is a new conference on Latin American political economy. The conference is 7-8 July in Montevideo. For more information, see http://redeconomiapoliticaamlat.com.

• Please see the call for papers announcement for a special issue dedicated to the topic of ‘Political Activism in Europe: Numbers and Trends’ in the online journal PACO: http://siba-ese.unisalento.it/index.php/paco/announcement/view/36.

• The IPSA Committee on Concepts and Methods seeks an editor, or team of editors, for a two-year term. This is an opportunity to make a significant contribution to one of the leading discussion forums for conceptual and methodological issues in Political Science. For more information, please visit http://comparativenewsletter.com/node/16.

About the Section

The Organized Section in Comparative Politics is the largest organized section in the American Political Science Association (APSA), with over 1,300 members. The purpose of the Section is to promote the comparative, especially cross-national, study of politics and to integrate the work of comparativists, area studies specialists, and those interested in American politics. The Section organizes panels for APSA's annual meetings; awards annual prizes for best paper, best article, best book, and best dataset; and oversees and helps finance the publication of the Newsletter. For more information, please visit the Section's website.

About the Newsletter

The goal of the Comparative Politics Newsletter is to engender a sense of community among comparative politics scholars around the world. To this end, the Newsletter publishes symposia on various substantive and methodological issues, highlights new datasets of broad appeal, prints short comments from readers in response to materials in the previous issue, and generally informs the community about field-specific developments. Recent symposia have looked at the varieties of authoritarianism, the global economic crisis, and field experiments. Upcoming issues will focus on graduate training in comparative politics and the politics of space. It is published twice a year — once during the Spring and once during the Fall. The Newsletter is currently edited by Matt Golder and Sona N. Golder at The Pennsylvania State University.

How to Subscribe

Subscription to the APSA-CP Newsletter is a benefit to members of the Organized Section in Comparative Politics of the American Political Science Association. To join the Section, check the appropriate box when joining APSA or renewing your Association membership. You may join APSA online at http://www.apsanet.org/content.asp?contentid=4.

Copyright 2015 American Political Science Association.
Address: 1527 New Hampshire Ave, NW, Washington, DC 20036-1206
Phone: (202) 483-2512 | Fax: (202) 483-2657 | Email: [email protected]
