Distributive Analysis Stata Package V2.3

150
USER MANUAL DASP version 2.3 DASP: Distributive Analysis Stata Package By Abdelkrim Araar, JeanYves Duclos Université Laval PEP, CIRPÉE and World Bank June 2013

description

This is the manual of the Distributive Analysis Stata Package. Features of the DASP Package:Estimate the most popular statistics (indices, curves) used for the analysis of poverty, inequality, social welfare, and equity;Estimate the differences in such statistics;Estimate standard errors and confidence intervals by taking full account of survey design;Support distributive analysis on more than one data base;Perform the most popular distributive decomposition procedures;Check for the ethical robustness of distributive comparisons;Unify syntax and parameter use across various estimation procedures for distributive analysis.

Transcript of Distributive Analysis Stata Package V2.3

Page 1: Distributive Analysis Stata Package V2.3

USERMANUAL

DASPversion2.3

DASP:DistributiveAnalysisStataPackageBy

AbdelkrimAraar,Jean‐YvesDuclos

UniversitéLavalPEP,CIRPÉEandWorldBank

June2013

Page 2: Distributive Analysis Stata Package V2.3

2

TableofcontentsTableofcontents............................................................................................................................. 2 ListofFigures .................................................................................................................................. 6 1  Introduction ............................................................................................................................ 8 2  DASPandStataversions ......................................................................................................... 8 3  InstallingandupdatingtheDASPpackage ........................................................................... 9 3.1  InstallingDASPmodules ................................................................................................ 9 3.2  AddingtheDASPsubmenutoStata’smainmenu ...................................................... 10 

4  DASPanddatafiles ............................................................................................................... 10 5  Mainvariablesfordistributiveanalysis ............................................................................. 11 6  HowcanDASPcommandsbeinvoked? .............................................................................. 11 7  HowcanhelpbeaccessedforagivenDASPmodule? ....................................................... 12 8  ApplicationsandfilesinDASP ............................................................................................. 12 9  BasicNotation ....................................................................................................................... 14 10  DASPandpovertyindices ................................................................................................ 14 10.1  FGTandEDE‐FGTpovertyindices(ifgt). .................................................................... 14 10.2  DifferencebetweenFGTindices(difgt) ...................................................................... 15 10.3  Wattspovertyindex(iwatts). ...................................................................................... 16 10.4  DifferencebetweenWattsindices(diwatts) .............................................................. 16 10.5  Sen‐Shorrocks‐Thonpovertyindex(isst). ................................................................. 17 10.6  DifferencebetweenSen‐Shorrocks‐Thonindices(disst) ......................................... 17 10.7  DASPandmultidimensionalpovertyindices ............................................................. 17 10.8  Multipleoverlappingdeprivationanalysis(MODA)indices ..................................... 19 

11  DASP,povertyandtargetingpolicies ............................................................................... 20 11.1  Povertyandtargetingbypopulationgroups ............................................................. 20 11.2  Povertyandtargetingbyincomecomponents ......................................................... 21 

12  Marginalpovertyimpactsandpovertyelasticities ........................................................ 22 12.1  FGTelasticity’swithrespecttotheaverageincomegrowth(efgtgr). ..................... 22 12.2  FGTelasticitieswithrespecttoaverageincomegrowthwithdifferentapproaches(efgtgro). .................................................................................................................................... 23 12.3  FGTelasticitywithrespecttoGiniinequality(efgtineq). .......................................... 23 12.4  FGTelasticitywithrespecttoGini‐inequalitywithdifferentapproaches(efgtine). 24 12.5  FGTelasticitieswithrespecttowithin/betweengroupcomponentsofinequality(efgtg). ........................................................................................................................................ 25 12.6  FGTelasticitieswithrespecttowithin/betweenincomecomponentsofinequality(efgtc). ........................................................................................................................................ 26 

13  DASPandinequalityindices ............................................................................................ 28 13.1  Giniandconcentrationindices(igini) ........................................................................ 28 13.2  DifferencebetweenGini/concentrationindices(digini) .......................................... 28 13.3  Generalisedentropyindex(ientropy) ........................................................................ 29 13.4  Differencebetweengeneralizedentropyindices(diengtropy) ................................ 29 13.5  Atkinsonindex(iatkinson) ......................................................................................... 30 13.6  DifferencebetweenAtkinsonindices(diatkinson) .................................................. 30 

Page 3: Distributive Analysis Stata Package V2.3

3

13.7  Coefficientofvariationindex(icvar) .......................................................................... 31 13.8  Differencebetweencoefficientofvariation(dicvar) ................................................. 31 13.9  Quantile/shareratioindicesofinequality(inineq) .................................................. 31 13.10  DifferencebetweenQuantile/Shareindices(dinineq) .......................................... 32 13.11  TheAraar(2009)multidimensionalinequalityindex ........................................... 32 

14  DASPandpolarizationindices ......................................................................................... 32 14.1  TheDERindex(ipolder) .............................................................................................. 32 14.2  DifferencebetweenDERpolarizationindices(dipolder) ......................................... 33 14.3  TheFosterandWolfson(1992)polarizationindex(ipolfw) ................................... 34 14.4  DifferencebetweenFosterandWolfson(1992)polarizationindices(dipolfw) .... 34 14.5  TheGeneralisedEsteban,GradinandRay(1999)polarisationindex(ipoger) ...... 34 14.6  TheInaki(2008)polarisationindex(ipoger) ............................................................ 36 

15  DASPanddecompositions ................................................................................................ 39 15.1  FGTPoverty:decompositionbypopulationsubgroups(dfgtg) ............................... 39 15.2  FGTPoverty:decompositionbyincomecomponentsusingtheShapleyvalue(dfgts) 40 15.3  AlkireandFoster(2011)MDindexofpoverty:decompositionbypopulationsubgroups(dmdafg) ................................................................................................................. 42 15.4  AlkireandFoster(2011:decompositionbydimensionsusingtheShapleyvalue(dmdafs) .................................................................................................................................... 42 15.5  FGTPoverty:decompositionbyincomecomponentsusingtheShapleyvalue(dfgts) 43 15.6  DecompositionofthevariationinFGTindicesintogrowthandredistributioncomponents(dfgtgr)................................................................................................................. 45 15.7  DecompositionofchangeinFGTpovertybypovertyandpopulationgroupcomponents–sectoraldecomposition‐(dfgtg2d). ................................................................. 46 15.8  DecompositionofFGTpovertybytransientandchronicpovertycomponents(dtcpov) ..................................................................................................................................... 49 15.9  Inequality:decompositionbyincomesources(diginis) ........................................... 51 15.10  Regression‐baseddecompositionofinequalitybyincomesources ..................... 52 15.11  Giniindex:decompositionbypopulationsubgroups(diginig). ........................... 58 15.12  Generalizedentropyindicesofinequality:decompositionbypopulationsubgroups(dentropyg). ........................................................................................................... 59 15.13  Polarization:decompositionoftheDERindexbypopulationgroups(dpolag) .. 59 15.14  Polarization:decompositionoftheDERindexbyincomesources(dpolas) ....... 60 

16  DASPandcurves................................................................................................................ 60 16.1  FGTCURVES(cfgt). ....................................................................................................... 60 16.2  FGTCURVEwithconfidenceinterval(cfgts). ............................................................. 62 16.3  DifferencebetweenFGTCURVESwithconfidenceinterval(cfgts2d). .................... 62 16.4  LorenzandconcentrationCURVES(clorenz). ............................................................ 62 16.5  Lorenz/concentrationcurveswithconfidenceintervals(clorenzs). ....................... 63 16.6  DifferencesbetweenLorenz/concentrationcurveswithconfidenceinterval(clorenzs2d) .............................................................................................................................. 64 16.7  Povertycurves(cpoverty) ........................................................................................... 64 16.8  Consumptiondominancecurves(cdomc) .................................................................. 65 16.9  Difference/Ratiobetweenconsumptiondominancecurves(cdomc2d) ................. 66 

Page 4: Distributive Analysis Stata Package V2.3

4

16.10  DASPandtheprogressivitycurves ......................................................................... 66 16.10.1  Checkingtheprogressivityoftaxesortransfers .............................................. 66 16.10.2  Checkingtheprogressivityoftransfervstax ................................................... 67 

17  Dominance ......................................................................................................................... 67 17.1  Povertydominance(dompov) ..................................................................................... 67 17.2  Inequalitydominance(domineq) ................................................................................ 68 17.3  DASPandbi‐dimensionalpovertydominance(dombdpov) .................................... 68 

18  Distributivetools .............................................................................................................. 69 18.1  Quantilecurves(c_quantile) ........................................................................................ 69 18.2  Incomeshareandcumulativeincomesharebygroupquantiles(quinsh) ............. 69 18.3  Densitycurves(cdensity) ............................................................................................ 69 18.4  Non‐parametricregressioncurves(cnpe) ................................................................. 71 18.4.1  Nadaraya‐Watsonapproach ................................................................................ 71 18.4.2  Locallinearapproach ........................................................................................... 71 

18.5  DASPandjointdensityfunctions. ............................................................................... 71 18.6  DASPandjointdistributionfunctions ........................................................................ 72 

19  DASPandpro‐poorgrowth .............................................................................................. 72 19.1  DASPandpro‐poorindices .......................................................................................... 72 19.2  DASPandpro‐poorcurves ........................................................................................... 73 19.2.1  Primalpro‐poorcurves ........................................................................................ 73 19.2.2  Dualpro‐poorcurves ........................................................................................... 74 

20  DASPandBenefitIncidenceAnalysis .............................................................................. 75 20.1  Benefitincidenceanalysis ............................................................................................ 75 

21  Disaggregatinggroupeddata ........................................................................................... 79 22  Appendices ........................................................................................................................ 83 22.1  AppendixA:illustrativehouseholdsurveys ............................................................... 83 22.1.1  The1994BurkinaFasosurveyofhouseholdexpenditures(bkf94I.dta) ........ 83 22.1.2  The1998BurkinaFasosurveyofhouseholdexpenditures(bkf98I.dta) ........ 84 22.1.3  CanadianSurveyofConsumerFinance(asubsampleof1000observations–can6.dta) ................................................................................................................................ 84 22.1.4  PeruLSMSsurvey1994(Asampleof3623householdobservations‐PEREDE94I.dta) .................................................................................................................... 84 22.1.5  PeruLSMSsurvey1994(Asampleof3623householdobservations–PERU_A_I.dta) ........................................................................................................................ 85 22.1.6  The1995ColombiaDHSsurvey(columbiaI.dta) ............................................... 85 22.1.7  The1996DominicanRepublicDHSsurvey(Dominican_republic1996I.dta) . 85 

22.2  AppendixB:labellingvariablesandvalues ................................................................ 86 22.3  AppendixC:settingthesamplingdesign .................................................................... 87 

23  Examplesandexercises ................................................................................................... 89 23.1  EstimationofFGTpovertyindices .............................................................................. 89 23.2  EstimatingdifferencesbetweenFGTindices. ............................................................ 95 23.3  Estimatingmultidimensionalpovertyindices ........................................................... 99 23.4  EstimatingFGTcurves. ............................................................................................... 102 23.5  EstimatingFGTcurvesanddifferencesbetweenFGTcurveswithconfidenceintervals ................................................................................................................................... 110 23.6  Testingpovertydominanceandestimatingcriticalvalues. .................................... 114 

Page 5: Distributive Analysis Stata Package V2.3

5

23.7  DecomposingFGTindices. ......................................................................................... 115 23.8  EstimatingLorenzandconcentrationcurves. .......................................................... 118 23.9  EstimatingGiniandconcentrationcurves ............................................................... 124 23.10  Usingbasicdistributivetools ................................................................................ 128 23.11  Plottingthejointdensityandjointdistributionfunction ................................... 134 23.12  Testingthebi‐dimensionalpovertydominance .................................................. 137 23.13  Testingforpro‐poornessofgrowthinMexico ..................................................... 140 23.14  BenefitincidenceanalysisofpublicspendingoneducationinPeru(1994). .... 146 

Page 6: Distributive Analysis Stata Package V2.3

6

ListofFiguresFigure1:Ouputofnetdescribedasp..................................................................................................................9Figure2:DASPsubmenu......................................................................................................................................10Figure3:UsingDASPwithacommandwindow.......................................................................................11Figure4:AccessinghelponDASP....................................................................................................................12Figure5:EstimatingFGTpovertywithonedistribution......................................................................13Figure6:EstimatingFGTpovertywithtwodistributions....................................................................13Figure7:Povertyandthetargetingbypopulationgroups..................................................................21Figure8:DecompositionoftheFGTindexbygroups.............................................................................39Figure9:DecompositionofFGTbyincomecomponents......................................................................44Figure10:SectoraldecompositionofFGT..................................................................................................48Figure11:Decompositionofpovertyintotransientandchroniccomponents...........................50Figure12:DecompositionoftheGiniindexbyincomesources(Shapleyapproach)..............52Figure13:FGTcurves...........................................................................................................................................61Figure14:Lorenzandconcentrationcurves..............................................................................................63Figure15:Consumptiondominancecurves...............................................................................................66Figure16:ungroupdialogbox..........................................................................................................................82Figure17:Surveydatasettings........................................................................................................................87Figure18:Settingsamplingweights..............................................................................................................88Figure19:EstimatingFGTindices...................................................................................................................91Figure20:EstimatingFGTindiceswithrelativepovertylines...........................................................92Figure21:FGTindicesdifferentiatedbygender......................................................................................93Figure22:EstimatingdifferencesbetweenFGTindices.......................................................................96Figure23:EstimatingdifferencesinFGTindices.....................................................................................97Figure24:FGTdifferencesacrossyearsbygenderandzone.............................................................98Figure25:Estimatingmultidimensionalpovertyindices(A)..........................................................100Figure26:Estimatingmultidimensionalpovertyindices(B)..........................................................101Figure27:DrawingFGTcurves.....................................................................................................................103Figure28:EditingFGTcurves........................................................................................................................103Figure29:GraphofFGTcurves.....................................................................................................................104Figure30:FGTcurvesbyzone.......................................................................................................................105Figure31:GraphofFGTcurvesbyzone....................................................................................................106Figure32:DifferencesofFGTcurves..........................................................................................................107Figure33:Listingcoordinates.......................................................................................................................108Figure34:DifferencesbetweenFGTcurves............................................................................................109Figure35:DifferencesbetweenFGTcurves............................................................................................110Figure36:DrawingFGTcurveswithconfidenceinterval.................................................................111Figure37:FGTcurveswithconfidenceinterval....................................................................................112Figure38:DrawingthedifferencebetweenFGTcurveswithconfidenceinterval.................113Figure39:DifferencebetweenFGTcurveswithconfidenceinterval ( 0) ...........................113Figure40:DifferencebetweenFGTcurveswithconfidenceinterval ( 1) ...........................114Figure41:Testingforpovertydominance...............................................................................................115Figure42:DecomposingFGTindicesbygroups....................................................................................116Figure43:Lorenzandconcentrationcurves...........................................................................................119

Page 7: Distributive Analysis Stata Package V2.3

7

Figure44:Lorenzcurves..................................................................................................................................120Figure45:Drawingconcentrationcurves................................................................................................121Figure46:Lorenzandconcentrationcurves...........................................................................................122Figure47:DrawingLorenzcurves...............................................................................................................123Figure48:Lorenzcurves..................................................................................................................................123Figure49:EstimatingGiniandconcentrationindices.........................................................................125Figure50:Estimatingconcentrationindices...........................................................................................126Figure51:EstimatingdifferencesinGiniandconcentrationindices...........................................127Figure52:Drawingdensities..........................................................................................................................128Figure53:Densitycurves.................................................................................................................................129Figure54:Drawingquantilecurves............................................................................................................130Figure55:Quantilecurves...............................................................................................................................130Figure56:Drawingnon‐parametricregressioncurves......................................................................131Figure57:Non‐parametricregressioncurves........................................................................................132Figure58:Drawingderivativesofnon‐parametricregressioncurves........................................133Figure59:Derivativesofnon‐parametricregressioncurves...........................................................133Figure60:Plottingjointdensityfunction.................................................................................................134Figure61:Plottingjointdistributionfunction........................................................................................136Figure62:Testingforbi‐dimensionalpovertydominance...............................................................138Figure63:Testingthepro‐poorgrowth(primalapproach).............................................................141Figure64:Testingthepro‐poorgrowth(dualapproach)‐A............................................................142Figure65:Testingthepro‐poorgrowth(dualapproach)–B..........................................................144Figure66:Benefitincidenceanalysis..........................................................................................................147Figure67:BenefitIncidenceAnalysis(unitcostapproach).............................................................149

Page 8: Distributive Analysis Stata Package V2.3

8

1 IntroductionTheStatasoftwarehasbecomeaverypopulartooltotransformandprocessdata.Itcomeswithalargenumberofbasicdatamanagementmodulesthatarehighlyefficientfortransformationoflargedatasets.TheflexibilityofStataalsoenablesprogrammerstoprovidespecialized“.ado”routinestoaddtothepowerofthesoftware.ThisisindeedhowDASPinteractswithStata.DASP,whichstandsforDistributiveAnalysisStataPackage,ismainlydesignedtoassistresearchersandpolicyanalystsinterestedinconductingdistributiveanalysiswithStata.Inparticular,DASPisbuiltto:

Estimatethemostpopularstatistics(indices,curves)usedfortheanalysisofpoverty,inequality,socialwelfare,andequity;

Estimatethedifferencesinsuchstatistics; Estimatestandarderrorsandconfidenceintervalsbytakingfullaccountofsurvey

design; Supportdistributiveanalysisonmorethanonedatabase; Performthemostpopularpovertyanddecompositionprocedures; Checkfortheethicalrobustnessofdistributivecomparisons; Unifysyntaxandparameteruseacrossvariousestimationproceduresfordistributive

analysis.ForeachDASPmodule,threetypesoffilesareprovided:

*.ado: Thisfilecontainstheprogramofthemodule*.hlp: Thisfilecontainshelpmaterialforthegivenmodule*.dlg:

Thisfileallowstheusertoperformtheestimationusingthemodule’sdialogbox

The *.dlg files in particularmakes theDASP package very user friendly and easy to learn.Whenthese dialog boxes are used, the associated program syntax is also generated and showed in thereviewwindow.Theusercansavethecontentsofthiswindowina*.dofiletobesubsequentlyusedinanothersession.

2 DASPandStataversionsDASPrequires

o Stataversion10.0orhighero adofilesmustbeupdated

Toupdatetheexecutablefile(from10.0to10.2)andtheadofiles,see:http://www.stata.com/support/updates/

Page 9: Distributive Analysis Stata Package V2.3

9

3 InstallingandupdatingtheDASPpackageIngeneral,the*.adofilesaresavedinthefollowingmaindirectories:

Priority Directory Sources1 UPDATES: OfficialupdatesofStata *.adofiles2 BASE: *.adofilesthatcomewiththeinstalledStata software3 SITE: *.adofilesdownloadedfromthenet4 PLUS: ..5 PERSONAL: Personal*.adofiles

3.1 InstallingDASPmodulesa. Unzipthefiledasp.zipinthedirectoryc:b. Makesurethatyouhavec:/dasp/dasp.pkgorc:/dasp/stata.tocc. IntheStatacommandwindows,typethesyntax

netfromc:/dasp

Figure1:Ouputofnetdescribedasp

d. Typethesyntaxnetinstalldasp_p1.pkg,forcereplacenetinstalldasp_p2.pkg,forcereplacenetinstalldasp_p3.pkg,forcereplacenetinstalldasp_p4.pkg,forcereplace

Page 10: Distributive Analysis Stata Package V2.3

10

3.2 AddingtheDASPsubmenutoStata’smainmenuWithStata9,submenuscanbeaddedtothemenuitemUser.

Figure2:DASPsubmenu

ToaddtheDASPsubmenus,thefileprofile.do(whichisprovidedwiththeDASPpackage)mustbecopiedintothePERSONALdirectory.Ifthefileprofile.doalreadyexists,addthecontentsoftheDASP–providedprofile.dofileintothatexistingfileandsaveit.Tocheckifthefileprofile.doalreadyexists,typethecommand:findfileprofile.do.

4 DASPanddatafilesDASPmakes itpossible tousesimultaneouslymore thanonedata file.Theusershould,however,“initialize”eachdatafilebeforeusingitwithDASP.Thisinitializationisdoneby:

1. Labelingvariablesandvaluesforcategoricalvariables;2. Initializingthesamplingdesignwiththecommandsvyset;3. Savingtheinitializeddatafile.

UsersarerecommendedtoconsultappendicesA,BandC,

Page 11: Distributive Analysis Stata Package V2.3

11

5 MainvariablesfordistributiveanalysisVARIABLEOFINTEREST.Thisisthevariablethatusuallycaptureslivingstandards.Itcanrepresent,forinstance, incomeper capita, expenditures per adult equivalent, calorie intake, normalizedheight‐for‐agescoresforchildren,orhouseholdwealth.SIZEVARIABLE.Thisreferstothe"ethical"orphysicalsizeoftheobservation.Forthecomputationofmany statistics, we will indeed wish to take into account how many relevant individuals (orstatisticalunits)arefoundinagivenobservation.GROUPVARIABLE.(ThisshouldbeusedincombinationwithGROUPNUMBER.)Itisoftenusefultofocusone’s analysis on some population subgroup. We might, for example, wish to estimate povertywithinacountry’sruralareaorwithinfemale‐headedfamilies.OnewaytodothisistoforceDASPto focus on a population subgroupdefined as those forwhom someGROUP VARIABLE (say, area ofresidence)equalsagivenGROUPNUMBER(say2,forruralarea).SAMPLINGWEIGHT.Samplingweightsaretheinverseofthesamplingprobability.Thisvariableshouldbesetupontheinitializationofthedataset.

6 HowcanDASPcommandsbeinvoked?Statacommandscanbeentereddirectlyintoacommandwindow:Figure3:UsingDASPwithacommandwindow

Page 12: Distributive Analysis Stata Package V2.3

12

Analternativeistousedialogboxes.Forthis,thecommanddbshouldbetypedandfollowedbythenameoftherelevantDASPmodule.Example:dbifgt

7 HowcanhelpbeaccessedforagivenDASPmodule?TypethecommandhelpfollowedbythenameoftherelevantDASPmodule.Example:helpifgtFigure4:AccessinghelponDASP

8 ApplicationsandfilesinDASPTwomaintypesofapplicationsareprovidedinDASP.Forthefirstone,theestimationproceduresrequireonlyonedatafile.Insuchcases,thedatafileinmemoryistheonethatisused(or“loaded”);itisfromthatfilethattherelevantvariablesmustbespecifiedbytheusertoperformtherequiredestimation.

Page 13: Distributive Analysis Stata Package V2.3

13

Figure5:EstimatingFGTpovertywithonedistribution

Forthesecondtypeofapplications,twodistributionsareneeded.Foreachofthesetwodistributions,theusercanspecifythecurrently‐loadeddatafile(theoneinmemory)oronesavedondisk.Figure6:EstimatingFGTpovertywithtwodistributions

Page 14: Distributive Analysis Stata Package V2.3

14

Notes:1. DASPconsiderstwodistributionstobestatisticallydependent(forstatisticalinference

purposes)ifthesamedatasetisused(thesameloadeddataordatawiththesamepathandfilename)forthetwodistributions.

2. IftheoptionDATAINFILEischosen,thekeyboardmustbeusedtotypethenameoftherequiredvariables.

9 BasicNotationThefollowingtablepresentsthebasicnotationusedinDASP’susermanual.

Symbol Indicationy variableofinteresti observationnumberyi valueofthevariableofinterestforobservationihw samplingweighthwi samplingweightforobservationihs sizevariablehsi sizeofobservationi(forexamplethesizeofhouseholdi)wi hwi*hsihg groupvariablehgi groupofobservationi.wik swik=swiifhgi=k,and0otherwise.n samplesize

Forexample,themeanofyisestimatedbyDASPas :

1

1

n

i ii

n

ii

w yˆ

w

10 DASPandpovertyindices

10.1 FGTandEDE‐FGTpovertyindices(ifgt).Thenon‐normalisedFoster‐Greer‐ThorbeckeorFGTindexisestimatedas

1

1

ni i

in

ii

w ( z y )

P( z; )

w

wherezisthepovertylineand max( ,0)x x .TheusualnormalisedFGTindexisestimatedas ( ; ) ( ; ) /( )P z P z z

Page 15: Distributive Analysis Stata Package V2.3

15

TheEDE‐FGTindexisestimatedas:

1/( ( ; )) ( ; )EDE P z P z for

> 0

Thereexistthreewaysoffixingthepovertyline:

1‐Settingadeterministicpovertyline;2‐Settingthepovertylinetoaproportionofthemean;3‐SettingthepovertylinetoaproportionofaquantileQ(p).

Theusercanchoosethevalueofparameter . Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecan

estimatepovertybyusingsimultaneouslypercapitaconsumptionandpercapitaincome. Agroupvariablecanbeusedtoestimatepovertyatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.1

10.2 DifferencebetweenFGTindices(difgt)ThismoduleestimatesdifferencesbetweentheFGTindicesoftwodistributions.Foreachofthetwodistributions: Thereexistthreewaysoffixingthepovertyline:

1‐Settingadeterministicpovertyline;2‐Settingthepovertylinetoaproportionofthemean;3‐SettingthepovertylinetoaproportionofaquantileQ(p)

Onevariableofinterestshouldbeselected. Conditionscanbespecifiedtofocusonspecificpopulationsubgroups. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged. Alevelfortheparameter canbechosenforeachofthetwodistributions.

InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.2.

Page 16: Distributive Analysis Stata Package V2.3

16

10.3 Wattspovertyindex(iwatts).TheWattspovertyindexisestimatedas

1

1

q

i ii

ni

i

w (ln( z / y )

P( z )

w

wherezisthepovertylineand q thenumberofpoor.

Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecan

estimatepovertybyusingsimultaneouslypercapitaconsumptionandpercapitaincome. Agroupvariablecanbeusedtoestimatepovertyatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

10.4 DifferencebetweenWattsindices(diwatts)ThismoduleestimatesdifferencesbetweentheWattsindicesoftwodistributions.Foreachofthetwodistributions: Onevariableofinterestshouldbeselected. Conditionscanbespecifiedtofocusonspecificpopulationsubgroups. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged. Alevelfortheparameter canbechosenforeachofthetwodistributions.

Page 17: Distributive Analysis Stata Package V2.3

17

10.5 Sen‐Shorrocks‐Thonpovertyindex(isst).TheSen‐Shorroks‐Thonpovertyindexisestimatedas:

1* *gP( z ) HP ( z, )[ G ]

wherezisthepovertylineH istheheadcount, *P ( z, ) thepovertygapestimatedatthelevelof

poorgroupand *gG theGiniindexofpovertygaps( z y ) / z .

Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecan

estimatepovertybyusingsimultaneouslypercapitaconsumptionandpercapitaincome. Agroupvariablecanbeusedtoestimatepovertyatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

10.6 DifferencebetweenSen‐Shorrocks‐Thonindices(disst)ThismoduleestimatesdifferencesbetweentheWattsindicesoftwodistributions.Foreachofthetwodistributions: Onevariableofinterestshouldbeselected. Conditionscanbespecifiedtofocusonspecificpopulationsubgroups. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged. Alevelfortheparameter canbechosenforeachofthetwodistributions.

10.7 DASPandmultidimensionalpovertyindicesThegeneralformofanadditivemultidimensionalpovertyindexis:

Page 18: Distributive Analysis Stata Package V2.3

18

1

1

( , )

( , )

n

i ii

n

ii

w p X Z

P X Z

w

where ( , )ip X Z isindividualI’spovertyfunction(withvectorofattributes ,1 ,,...,i i i JX x x and

vectorofpovertylines 1,..., JZ z z ),determiningI’scontributiontototalpoverty ( , )P X Z .

[1]Chakravartyetal(1998)index(imdp_cmr)

,

1

( , )J

j i ji j

jj

z xp X Z a

z

[2]ExtendedWattsindex(imdp_ewi)

,1

( , ) lnmin( ; )

Jj

i jj i jj

zp X Z a

z x

[3]MultiplicativeextendedFGTindex(imdp_mfi)

,

1

( , )jJ

j i ji

jj

z xp X Z

z

[4]Tsui(2002)index(imdp_twu)

,1

( , ) 1min( ; )

jbJ

ji

j i jj

zp X Z

z x

[5]Intersectionheadcountindex(imdp_ihi)

,1

( , )J

i j i jj

p X Z I z x

[6]Unionheadcountindex(imdp_uni)

,1

( , ) 1J

i j i jj

p X Z I z x

[7]BourguignonandChakravartybi‐dimensional(2003)index(imdp_bci)

Page 19: Distributive Analysis Stata Package V2.3

19

1 2( , )ip X Z C C

where:

1 ,11

1

iz xC

z

and 2 ,22

2

iz xC

z

[8]AlkireandFoster(2011)index(imdp_afi)

,

1

1 1( , , ) ( )

N Jj i j

i j i cji j

z xp X Z w I d d

N J z

where1

J

jj

w J

and id denotesthenumberofdimensionsinwhichtheindividual i isdeprived.

cd denotesthenormativedimensionalcut‐off.

Themodulespresentedabovecanbeusedtoestimatethemultidimensionalpovertyindicesaswellastheirstandarderrors.

Theusercanselectamongthesevenmultidimensionalpovertyindices. Thenumberofdimensionscanbeselected(1to10). Ifapplicable,theusercanchooseparametervaluesrelevanttoachosenindex. Agroupvariablecanbeusedtoestimatetheselectedindexatthelevelofacategorical

group. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith3decimals;thiscanbealsochanged.

UsersareencouragedtoconsidertheexercisesthatappearinSection 23.3

10.8 Multipleoverlappingdeprivationanalysis(MODA)indices TheimodaDASPmoduleproducesaseriesofmultidimensionalpovertyindicesinordertoshow the incidence of deprivation in each dimension. Further, this application estimates theincidenceofmulti‐deprivationinthedifferentcombinationsofdimensions.Inthisapplication,thenumber of dimensions is set to three. Further, themultidimensional poverty ismeasured by theheadcount(unionandintersectionheadcount indices)andtheAlkireandFoster(2011)M0indexfordifferentlevelsofthedimensionalcut‐off.

Page 20: Distributive Analysis Stata Package V2.3

20

Thenumberofdimensionsisthree. AgroupvariablecanbeusedtoestimatetheMODAindicesatthelevelofacategorical

group. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith3decimals;thiscanbealsochanged.

11 DASP,povertyandtargetingpolicies

11.1 Povertyandtargetingbypopulationgroups

Theper‐capitadollarimpactofamarginaladditionofaconstantamountofincometoeveryonewithinagroupk–calledLump‐SumTargeting(LST)–ontheFGTpovertyindexP(k, z; ) ,isasfollows:

P(k, z; 1) if 1LST

f (k, z) if 0

where z isthepovertyline,kisthepopulationsubgroupforwhichwewishtoassesstheimpactoftheincomechange,and ( , )f k z isthedensityfunctionofthegroup k atlevelofincome z .Theper‐capitadollarimpactofaproportionalmarginalvariationofincomewithinagroupk,calledInequalityNeutralTargeting,ontheFGTpovertyindexP(k, z; ) isasfollows:

P(k, z; ) zP(k, z; 1)if 1

(k)INT

zf (k, z)if 0

(k)

Themoduleitargetgallowsto: Estimatetheimpactofmarginalchangeinincomeofthegrouponpovertyofthegroupand

thatofthepopulation; Selectthedesignofchange,constantorproportionaltoincometokeepinequality

unchanged; Drawcurvesofimpactaccordingforarangeofpovertylines; Drawtheconfidenceintervalofimpactcurvesorthelowerorupperboundofconfidence

interval; Etc.

Page 21: Distributive Analysis Stata Package V2.3

21

Figure 7: Poverty and the targeting by population groups

Reference: DUCLOS,J.‐Y.ANDA.ARAAR(2006):PovertyandEquityMeasurement,Policy,andEstimationwithDAD,BerlinandOttawa:SpringerandIDRC.(sec.12.1)

11.2 Povertyandtargetingbyincomecomponents

Proportional change per 100% of component

Assume that total income Y is the sum of J income components, with 1

J

j jj

Y y

and where c is a factor

that multiplies income component jy and that can be subject to growth. The derivative of the normalized

FGT index with respect to j is given by

jj 1, j 1 Jj

P(z, )CD (z, )

where CDj is the Consumption dominance curve of component j. Change per $ of component

The per-capita dollar impact of growth in the thj component on the normalized FGT index of the thk group is as follows:

Page 22: Distributive Analysis Stata Package V2.3

22

j j

jj

P(z, )

yCD (z, )

y

where j

CD is the normalized consumption dominance curve of the component j. Constant change per component Simply we assume that the change concerns the group with component level greater than zero. Thus, this is similar to targeting by the nonexclusive population groups. Themoduleitargetcallowsto: Estimatetheimpactofmarginalchangeinincomecomponentonpoverty; Selecttheoptionnormalisedornonnormalisedbytheaverageofcomponent; Selectthedesignofchange,constant(lumpsum)orproportionaltoincometokeep

inequalityunchanged; Drawcurvesofimpactaccordingforarangeofpovertylines; Drawtheconfidenceintervalofimpactcurvesorthelowerorupperboundofconfidence

interval; Etc.

Reference: DUCLOS,J.‐Y.ANDA.ARAAR(2006):PovertyandEquityMeasurement,Policy,andEstimationwithDAD,BerlinandOttawa:SpringerandIDRC.(sec.12)

12 Marginalpovertyimpactsandpovertyelasticities

12.1 FGTelasticity’swithrespecttotheaverageincomegrowth(efgtgr).

The overall growth elasticity (GREL) of poverty, when growth comes exclusively from growth within a group k (namely, within that group, inequality neutral), is given by:

( , )0

( )

( , ; ) ( , ; 1)1

( , )

zf k zif

F zGREL

P k z P k zif

P z

where z is the poverty line, k is the population subgroup in which growth takes place, ( , )f k z is the

density function at level of income z of group k , and ( )F z is the headcount.

Page 23: Distributive Analysis Stata Package V2.3

23

Araar,AbdelkrimandJean‐YvesDuclos,(2007),Povertyandinequalitycomponents:a micro framework,Working Paper: 07‐35. CIRPEE, Department of Economics,UniversitéLaval.Kakwani, N. (1993) "Poverty and economic growth with application to CôteD’Ivoire",ReviewofIncomeandWealth,39(2):121:139.

ToestimatetheFGTelasticity’swithrespectaverageincomegrowththegrouporthewhole

population; Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecan

estimatepovertybyusingsimultaneouslypercapitaconsumptionandpercapitaincome. Agroupvariablecanbeusedtoestimatepovertyatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

12.2 FGTelasticitieswithrespecttoaverageincomegrowthwithdifferentapproaches(efgtgro).

The overall growth elasticity of poverty is estimated using one approach among the following list:

The counterfactual approach; The marginal approach; The parameterized approach; The numerical approach;

The module efgtgroallowstheestimationofapovertyelasticityorsemi‐elasticitywithrespecttogrowthwiththedifferentapproachesmentionedabove.Formoredetailsontheseapproaches,see:

Abdelkrim Araar, 2012. "Expected Poverty Changeswith Economic Growth andRedistribution,"Cahiersderecherche1222,CIRPEE.

ToestimateaFGTelasticity–semi‐elasticity‐withrespecttoaverageincomegrowthina

grouporinanentirepopulation; Agroupvariablecanbeusedtoestimatepovertyatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Theresultsaredisplayedwith6decimals;thiscanbechanged.

12.3 FGTelasticitywithrespecttoGiniinequality(efgtineq).

The overall growth elasticity (INEL) of poverty, when growth comes exclusively from change in inequality within a group k , is given by:

Page 24: Distributive Analysis Stata Package V2.3

24

( ) ( , ) ( ) ( ) ( ) ( )/ 0

( )

( , ; ) ( ) / ( , ; 1) ( ) ( ) ( )/ 1

( , )

k f k z k z k k C kif

F z IINEL

P k z k z z P k z k k C kif

P z I

where z is the poverty line, k is the population subgroup in which growth takes place, ( , )f k z is the

density function at level of income z for group k , and ( )F z is the headcount. ( )C k is the concentration

coefficient of group k when incomes of the complement group are replaced by ( )k . I denotes the Gini index.

Araar,AbdelkrimandJean‐YvesDuclos,(2007),Povertyandinequalitycomponents:a micro framework,Working Paper: 07‐35. CIRPEE, Department of Economics,UniversitéLaval.Kakwani, N. (1993) "Poverty and economic growth with application to CôteD’Ivoire",ReviewofIncomeandWealth,39(2):121:139.

ToestimateaFGTelasticitywithrespecttoaverageincomegrowthinagrouporinan

entirepopulation; Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecan

estimatepovertybyusingsimultaneouslypercapitaconsumptionandpercapitaincome. Agroupvariablecanbeusedtoestimatepovertyatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

12.4 FGTelasticitywithrespecttoGini‐inequalitywithdifferentapproaches(efgtine).

The overall Gini-inequality elasticity of poverty can be estimated by using one approach among the following list:

The counterfactual approach; The marginal approach; The parameterized approach; The numerical approach;

The module efgtineallowstheestimationofapovertyelasticityorsemi‐elasticitywithrespecttoinequalitywiththedifferentapproachesmentionedabove.Formoredetailsontheseapproaches,see:

Abdelkrim Araar, 2012. "Expected Poverty Changeswith Economic Growth andRedistribution,"Cahiersderecherche1222,CIRPEE.

Page 25: Distributive Analysis Stata Package V2.3

25

ToestimateaFGTelasticity–semi‐elasticity‐withrespecttoinequality; Agroupvariablecanbeusedtoestimatepovertyatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Theresultsaredisplayedwith6decimals;thiscanbechanged.

12.5 FGTelasticitieswithrespecttowithin/betweengroupcomponentsofinequality(efgtg).

ThismoduleestimatesthemarginalFGTimpactandFGTelasticitywithrespecttowithin/betweengroupcomponentsofinequality.Agroupvariablemustbeprovided.ThismoduleismostlybasedonAraarandDuclos(2007):

Araar,AbdelkrimandJean‐YvesDuclos,(2007),Povertyandinequalitycomponents:a micro framework,Working Paper: 07‐35. CIRPEE, Department of Economics,UniversitéLaval.

Toopenthedialogboxofthismodule,typethecommanddbefgtg.

AfterclickingonSUBMIT,thefollowingshouldbedisplayed:

Page 26: Distributive Analysis Stata Package V2.3

26

(g)

12.6 FGTelasticitieswithrespecttowithin/betweenincomecomponentsofinequality(efgtc).

ThismoduleestimatesthemarginalFGTimpactandFGTelasticitywithrespecttowithin/betweenincomecomponentsofinequality.Alistofincomecomponentsmustbeprovided.ThismoduleismostlybasedonAraarandDuclos(2007):

Araar,AbdelkrimandJean‐YvesDuclos,(2007),Povertyandinequalitycomponents:a micro framework,Working Paper: 07‐35. CIRPEE, Department of Economics,UniversitéLaval.

Toopenthedialogboxofthismodule,typethecommanddbefgtc.

Page 27: Distributive Analysis Stata Package V2.3

27

AfterclickingonSUBMIT,thefollowingshouldbedisplayed:

(k)

Incaseoneisinterestedinchangingsomeincomecomponentonlyamongthoseindividualsthatare

effectivelyactiveinsomeeconomicsectors(schemes * * *(k), and inthepapermentionedabove),theusershouldselecttheapproach“Truncatedincomecomponent”.

Page 28: Distributive Analysis Stata Package V2.3

28

13 DASPandinequalityindices

13.1 Giniandconcentrationindices(igini)TheGiniindexisestimatedas

ˆˆ 1

ˆI

where

2 21

21 1

( ) ( )ˆn

i ii

i

V Vy

V

andn

i hh i

V w

and 11 2 n ny y y y .

TheconcentrationindexforthevariableTwhentherankingvariableisYisestimatedas

1 TT

T

ˆIC

ˆ

where ˆT istheaverageofvariableT,

2 21

21 1

( ) ( )ˆn

i iT i

i

V Vt

V

andwheren

i hh i

V w

and 11 2 n ny y y y .

Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecanestimateinequality,forinstancebyusingsimultaneouslypercapitaconsumptionandpercapitaincome.

Toestimateaconcentrationindex,theusermustselectarankingvariable.. Agroupvariablecanbeusedtoestimateinequalityatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.9

13.2 DifferencebetweenGini/concentrationindices(digini)ThismoduleestimatesdifferencesbetweentheGini/concentrationindicesoftwodistributions.Foreachofthetwodistributions:

Onevariableofinterestshouldbeselected; Toestimateaconcentrationindex,arankingvariablemustbeselected; Conditionscanbespecifiedtofocusonspecificpopulationsubgroups;

Page 29: Distributive Analysis Stata Package V2.3

29

Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Boththetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged.

Theresultsaredisplayedwith6decimals;thiscanbechanged.

13.3 Generalisedentropyindex(ientropy)Thegeneralizedentropyindexisestimatedas

i

ini

ii 1

ini i

ii 1

i i in

ii

i 1

y1w 1 if 0,1

ˆ1 w

ˆ1I w log if 0

yw

w y y1log if 1

ˆ ˆw

Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecan

estimateinequalitysimultaneouslyforpercapitaconsumptionandforpercapitaincome. Agroupvariablecanbeusedtoestimateinequalityatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

13.4 Differencebetweengeneralizedentropyindices(diengtropy)Thismoduleestimatesdifferencesbetweenthegeneralizedentropyindicesoftwodistributions.Foreachofthetwodistributions:

Onevariableofinterestshouldbeselected; Conditionscanbespecifiedtofocusonspecificpopulationsubgroups; Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

Page 30: Distributive Analysis Stata Package V2.3

30

13.5 Atkinsonindex(iatkinson)DenotetheAtkinsonindexofinequalityforthegroupkby I(ε) .Itcanbeexpressedasfollows:

ni i

i 1n

ii 1

w yˆˆ ( )ˆ ˆI(ε) whereˆ

w

TheAtkinsonindexofsocialwelfareisasfollows:

1

1n 1 ε

i ini 1

ii 1

ni in

i 1i

i 1

1w (y ) if 1 and 0

w

ξ(ε)

1Exp w ln(y ) ε 1

w

Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecan

estimateinequalitysimultaneouslyforpercapitaconsumptionandforpercapitaincome. Agroupvariablecanbeusedtoestimateinequalityatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

13.6 DifferencebetweenAtkinsonindices(diatkinson)ThismoduleestimatesdifferencesbetweentheAtkinsonindicesoftwodistributions.Foreachofthetwodistributions:

Onevariableofinterestshouldbeselected; Conditionscanbespecifiedtofocusonspecificpopulationsubgroups; Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

Page 31: Distributive Analysis Stata Package V2.3

31

13.7 Coefficientofvariationindex(icvar)Denote the coefficient of variation index of inequality for the group k by CV. It can be expressed as follows:

1n n 22 2

i i ii 1 i 1

2

ˆw y / wCV

ˆ

Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecan

estimateinequalitysimultaneouslyforpercapitaconsumptionandforpercapitaincome. Agroupvariablecanbeusedtoestimateinequalityatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

13.8 Differencebetweencoefficientofvariation(dicvar)Thismoduleestimatesdifferencesbetweencoefficient of variation indicesoftwodistributions.Foreachofthetwodistributions:

Onevariableofinterestshouldbeselected; Conditionscanbespecifiedtofocusonspecificpopulationsubgroups; Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

13.9 Quantile/shareratioindicesofinequality(inineq)Thequantileratioisestimatedas

11 2

2

Q(p )QR(p , p )

Q(p )

whereQ(p) denotesap‐quantileand 1p and 2p arepercentiles.

Theshareratioisestimatedas

GL(p2)-GL(p1)

SR(p1,p2,p3,p4) GL(p4)-GL(p3)

Page 32: Distributive Analysis Stata Package V2.3

32

whereGL(p) istheGeneralisedLorenzcurveand 1p , 2p , 3p and 4p arepercentiles.

Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecan

estimateinequalitysimultaneouslyforpercapitaconsumptionandforpercapitaincome. Agroupvariablecanbeusedtoestimateinequalityatthelevelofacategoricalgroup.Ifa

groupvariableisselected,onlythefirstvariableofinterestisthenused. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

13.10 DifferencebetweenQuantile/Shareindices(dinineq)ThismoduleestimatesdifferencesbetweentheQuantile/Shareindicesoftwodistributions.Foreachofthetwodistributions:

Onevariableofinterestshouldbeselected; Conditionscanbespecifiedtofocusonspecificpopulationsubgroups; Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged; Theresultsaredisplayedwith6decimals;thiscanbechanged.

13.11 TheAraar(2009)multidimensionalinequalityindexThe Araar (2009) the multidimensional inequality index for the K dimensions of wellbeing takes the following form:

1

1K

k k k k ki

I I C

where k is the weight attributed to the dimension k (may take the same value across the

dimensions or can depend on the averages of the wellbeing dimensions). k kI C are respectively

the relative –absolute- Gini and concentration indices of component k . The normative parameter

k controls the sensitivity of the index to the inter-correlation between dimensions. For more

details, see: Abdelkrim Araar, 2009. "The Hybrid Multidimensional Index of Inequality," Cahiers de recherche 0945,

CIRPEE: http://ideas.repec.org/p/lvl/lacicr/0945.html

14 DASPandpolarizationindices

14.1 TheDERindex(ipolder)

Page 33: Distributive Analysis Stata Package V2.3

33

TheDuclos,EstebanandRay(2004)(DER)polarizationindexcanbeexpressedas:1DER( ) f (x) f (y) y x dydx

wheref(x)denotesthedensityfunctionatx.Thediscreteformulathatisusedtoestimatethisindexisasfollows:

ni i i

i 1n

ii 1

w f (y ) a(y )DER( )

w

ThenormalizedDERestimatedbythismoduleisdefinedas:

(1 )DER( )

DER( )2

where:i i 1

j i j j i ij 1 j 1

i i N Ni i

i 1 i 1

2 w w 2 w y w y

a(y ) y 1

w w

TheGaussiankernelestimatorisusedtoestimatethedensityfunction.

Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecan

estimatepolarizationbyusingsimultaneouslypercapitaconsumptionandpercapitaincome.

Agroupvariablecanbeusedtoestimatepolarizationatthelevelofacategoricalgroup.Ifagroupvariableisselected,onlythefirstvariableofinterestisthenused.

Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Boththetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged.

Theresultsaredisplayedwith6decimals;thiscanbechanged.Mainreference

DUCLOS,J.‐Y.,J.ESTEBAN,ANDD.RAY(2004):“Polarization:Concepts,Measurement,Estimation,”Econometrica,72,1737–1772.

14.2 DifferencebetweenDERpolarizationindices(dipolder)ThismoduleestimatesdifferencesbetweentheDERindicesoftwodistributions.Foreachofthetwodistributions:

Onevariableofinterestshouldbeselected; Conditionscanbespecifiedsuchastofocusonspecificpopulationsubgroups; Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged.

Page 34: Distributive Analysis Stata Package V2.3

34

Theresultsaredisplayedwith6decimals;thiscanbechanged.

14.3 TheFosterandWolfson(1992)polarizationindex(ipolfw)TheFosterandWolfson(1992)polarizationindexcanbeexpressedas:

FW 2 2 0.5 Lorenz(p 0.5) Ginimedian

Theusercanselectmorethanonevariableofinterestsimultaneously.Forexample,onecanestimatepolarizationbyusingsimultaneouslypercapitaconsumptionandpercapitaincome.

Agroupvariablecanbeusedtoestimatepolarizationatthelevelofacategoricalgroup.Ifagroupvariableisselected,onlythefirstvariableofinterestisthenused.

Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Boththetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged.

Theresultsaredisplayedwith6decimals;thiscanbechanged.MainreferenceFOSTER,J.ANDM.WOLFSON(1992):“PolarizationandtheDeclineoftheMiddleClass:CanadaandtheU.S.”mimeo,VanderbiltUniversity.

14.4 DifferencebetweenFosterandWolfson(1992)polarizationindices(dipolfw)

ThismoduleestimatesdifferencesbetweentheFWindicesoftwodistributions.Foreachofthetwodistributions:

Onevariableofinterestshouldbeselected; Conditionscanbespecifiedsuchastofocusonspecificpopulationsubgroups; Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

14.5 TheGeneralisedEsteban,GradinandRay(1999)polarisationindex(ipoger)

The proposed measurement of polarisation by Esteban and Ray (1994) is defined asfollows:

Page 35: Distributive Analysis Stata Package V2.3

35

m m

ER 1j k j k

j 1 k 1

P (f , ) p p

where j and jp denote respectively the average income and the population share of

group j. Theparameter 1,1.6 reflectssensitivityof society topolarisation. The first

stepfortheestimationrequirestodefineexhaustiveandmutuallyexclusivegroups, .Thiswillinvolvesomedegreeoferror.Takingintoaccountthisidea,themeasureofpolarisationproposedbyEstebanetal.(1999)isobtainedaftercorrectingthe ERP ( ) indexappliedtothe simplified representation of the original distributionwith ameasure of the groupingerror. Nonetheless,whendealingwithpersonalorspatial incomedistributions, therearenounanimouscriteria forestablishing theprecisedemarcationbetweendifferentgroups.Toaddressthisproblem,Estebanetal.(1999)followthemethodologyproposedbyAghevliandMehran(1981)andDaviesandShorrocks(1989)inordertofindanoptimalpartitionofthedistributionforagivennumberofgroups, * .Thismeansselectingthepartitionthat

minimises the Gini index value of within‐group inequality, *Error G f G (see

Esteban et al., 1999). Themeasureof polarisationproposedbyEsteban et al. (1999) isthereforegivenby:

m m

EGR * 1 *j k j k

j 1 k 1

P (f , , , ) p p G f G

where, 0 isaparameterthatinformsabouttheweightassignedtotheerrorterm.(InthestudyofEstebanetal.(1999),thevalueusedis 1 ).The Statamodule ipoegr.ado estimates the generalised form of the Esteban et al. (1999)polarisationindex.Inadditiontotheusualvariables,thisroutineoffersthethreefollowingoptions:

1. Thenumberofgroups.Empiricalstudiesuse twoor threegroups.Theusercanselect thenumberofgroups.Accordingtothisnumber,theprogramsearchesfortheoptimalincomeinterval for each group and displays them. It also displays the error in percentage, ie:

*G f G*100

G f

;

2. Theparameter ;3. Theparameter .

To respect the scale invariance principle, all incomes are divided by average income i.e.

j j / .Inaddition,wedividetheindexbythescalar2tomakeitsintervalliebetween

0and1when 1 .

Page 36: Distributive Analysis Stata Package V2.3

36

m m

EGR * 1 *j k j k

j 1 k 1

P (f , , , ) 0.5 p p G f G

14.6 TheInaki(2008)polarisationindex(ipoger)

Letapopulationbesplitinto N groups,eachoneofsize 0in .Thedensityfunction,themeanand

thepopulationshareofgroup i aredenotedby ( )if x , i and i respectively. istheoverallmean.

We therefore have that ( ) 1 if x ,1

N

i ii

and1

1

N

ii

. Using Inaki (2008), a social

polarisationindexcanbedefinedas:

1

( ) ( , ) ( , )N

W Bi

P F P i F P i F

where1 2 1( , ) ( ) ( )W i i iP i F f x f y x y dydx

and

1 1 1 1( , ) ( ) 1 ( ) B i i i i i iP i F f x dx f x xdx

ThemoduleStatadspolallowsperformingthedecompositionofthesocialpolarisationindex ( )P F intogroupcomponents. Theusercanselecttheparameteralpha; Theusercanselecttheuseofafasterapproachfortheestimationofthedensityfunction; Standarderrorsareprovidedforallestimatedindices.Theytakeintoaccountthefull

samplingdesign; Theresultsaredisplayedwith6decimalsbydefault;thiscanbechanged; TheusercansaveresultsinExcelformat.

Theresultsshow: Theestimatedpopulationshareofsubgroup i : i ;

Theestimatedincomeshareofsubgroup i : / i i ;

Theestimated ( , )WP i F indexofsubgroup i ;

Theestimated ( , )BP i F indexofsubgroup i ;

Theestimated ( , )W WiP P i F index;

Theestimated ( , )B BiP P i F index;

Theestimatedtotalindex FP

Page 37: Distributive Analysis Stata Package V2.3

37

Toopenthedialogboxformoduledspol,typedbdspolinthecommandwindow.

Example:Forillustrativepurposes,weusea1996Cameroonianhouseholdsurvey,whichismadeofapproximately1700households.Thevariablesusedare:Variables:STRATA StratuminwhichahouseholdlivesPSU PrimarysamplingunitofthehouseholdWEIGHT SamplingweightSIZE HouseholdsizeINS_LEV Educationleveloftheheadofthehousehold

1. Primary;2. ProfessionalTraining,secondaryandsuperior;3. Notresponding.

We decompose the above social polarization index using the module dspol by splitting theCameroonian population into three exclusive groups, according to the education level of thehouseholdhead.Wefirstinitializethesamplingdesignofthesurveywiththedialogboxsvysetasshowninwhatfollows:

Page 38: Distributive Analysis Stata Package V2.3

38

Afterthat,openthedialogboxbytypingdbdspol,andchoosevariablesandparametersasin:

AfterclickingSUBMIT,thefollowingresultsappear:

Mainreferences

1. DUCLOS,J.‐Y.,J.ESTEBAN,ANDD.RAY(2004):“Polarization:Concepts,Measurement,Estimation,”Econometrica,72,1737–1772.

2. TianZ.&all(1999)"FastDensityEstimationUsingCF‐kernelforVeryLargeDatabases".http://portal.acm.org/citation.cfm?id=312266

3. IñakiPermanyer,2008."TheMeasurementofSocialPolarizationinaMulti‐groupContext,"UFAEandIAEWorkingPapers736.08,UnitatdeFonamentsdel'AnàlisiEconòmica(UAB)andInstitutd'AnàlisiEconòmica(CSIC).

Page 39: Distributive Analysis Stata Package V2.3

39

15 DASPanddecompositions

15.1 FGTPoverty:decompositionbypopulationsubgroups(dfgtg)ThedgfgtmoduledecomposestheFGTpovertyindexbypopulationsubgroups.Thisdecompositiontakestheform

1

( ; ) ( ) ( ; ; )G

gP z g P z g

whereG isthenumberofpopulationsubgroups.Theresultsshow:

TheestimatedFGTindexofsubgroup g : ( ; ; )P z g

Theestimatedpopulationshareofsubgroup g : ( )g

Theestimatedabsolutecontributionofsubgroupg tototalpoverty: ( ) ( ; ; )g P z g Theestimatedrelativecontributionofsubgroupg tototal

poverty: ( ) ( ; ; ) / ( ; )g P z g P z

Anasymptoticstandarderrorisprovidedforeachofthesestatistics.Toopenthedialogboxformoduledfgtg,typedbdfgtginthecommandwindow.Figure8:DecompositionoftheFGTindexbygroups

NotethattheusercansaveresultsinExcelformat.InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.7

Page 40: Distributive Analysis Stata Package V2.3

40

15.2 FGTPoverty:decompositionbyincomecomponentsusingtheShapleyvalue(dfgts)

The dfgts module decomposes the total alleviation of FGT poverty into a sum of the contributions generated by separate income components. Total alleviation is maximal when all individuals have an income greater than or equal to the poverty line. A negative sign on a decomposition term indicates that

an income component reduces poverty. Assume that there exist K income sources and that ks denotes

income source .k The FGT index is defined as:

1

1

1

1; ;

niK

ik n

ki

i

yw zP z y s

w

where iw is the weight assigned to individual i and n is sample size. The dfgts Stata module estimates:

The share in total income of each income source k ;

The absolute contribution of each source k to the value of ( 1P );

The relative contribution of each source k to the value of ( 1P );

Note that the dfgts ado file requires the module shapar.ado, which is programmed to perform decompositions using the Shapley value algorithm developed by Araar and Duclos (2008).

Araar A and Duclos J‐Y (2008), “An algorithm for computing the Shapley Value”, PEP and CIRPEE. Tech.‐

Note: Novembre‐2008: http://dad.ecn.ulaval.ca/pdf_files/shap_dec_aj.pdf 

 

Empirical illustration with a Nigerian household survey We use a survey of Nigerian households (NLSS, using 17764 observations) carried out between September 2003 and August 2004 to illustrate the use of the dfgts module. We use per capita total household income as a measure of individual living standards. Household observations are weighted by household size and sampling weights to assess poverty over all individuals. The six main income components are:

source_1: Employment income; source_2: Agricultural income; source_3: Fish-processing income; source_4: Non-farm business income; source_5: Remittances received; source_6: All other income;

The Stata data file is saved after initializing its sampling design with the command svyset.

To open the dialog box for module dfgts, type db dfgts in the command window.

Page 41: Distributive Analysis Stata Package V2.3

41

Figure 1: Decomposition of the FGT index by income components

Indicate the varlist of the six income sources. Indicate that the poverty line is set to 15 000 $N. Set the variable HOUSEHOLD SIZE. Set the variable HOUSEHOLD WEIGHT. Click on the button SUBMIT. The following results appear:

Page 42: Distributive Analysis Stata Package V2.3

42

15.3 AlkireandFoster(2011)MDindexofpoverty:decompositionbypopulationsubgroups(dmdafg)

ThedmdafgmoduledecomposestheMDAlkireandFosterindexofpovertyindexbypopulationsubgroups.Thisdecompositiontakestheform.Theresultsshow: TheestimatedAlkireandFosterindexofeachsubgroup: Theestimatedpopulationshareofsubgroup; Theestimatedabsolutecontributionofsubgroupg tototalpoverty; Theestimatedrelativecontributionofsubgroupg tototalpoverty;

Anasymptoticstandarderrorisprovidedforeachofthesestatistics.

15.4 AlkireandFoster(2011:decompositionbydimensionsusingtheShapleyvalue(dmdafs)

Thedmdafsmoduledecomposes theAlkire andFoster (2011)multidimensional poverty

indices intoa sumof the contributionsgeneratedbyeachof thepovertydimensions. Ituses theShapley characteristic function. The non‐presence of a given factor –dimension‐ is obtained bysettingthelevelofthatdimensiontoitsspecificpovertyline,thusensuringthenon‐contributionofthis dimension to the AF (2011) indices. Note that the dmdafs ado file requires the moduleshapar.ado,which isprogrammed toperformdecompositionsusing theShapleyvaluealgorithmdevelopedbyAraarandDuclos(2008).

Araar A and Duclos J‐Y (2008), “An algorithm for computing the Shapley Value”, PEP andCIRPEE.Tech.‐Note:Novembre‐2008:http://dad.ecn.ulaval.ca/pdf_files/shap_dec_aj.pdf

Page 43: Distributive Analysis Stata Package V2.3

43

15.5 FGTPoverty:decompositionbyincomecomponentsusingtheShapleyvalue(dfgts)

The dfgts module decomposes the total alleviation of FGT poverty into a sum of the

contributions generated by separate income components. Total alleviation is maximal when allindividuals have an income greater than or equal to the poverty line. A negative sign on adecompositiontermindicatesthatanincomecomponentreducespoverty.AssumethatthereexistK incomesourcesandthat ks denotesincomesource .k TheFGTindexisdefinedas:

1

1

1

1; ;

niK

ik n

ki

i

yw zP z y s

w

where iw istheweightassignedtoindividual i and n issamplesize.ThedfgtsStatamodule

estimates: Theshareintotalincomeofeachincomesource k ;

Theabsolutecontributionofeachsource k tothevalueof( 1P );

Therelativecontributionofeachsource k tothevalueof( 1P );

Notethatthedfgtsadofilerequiresthemoduleshapar.ado,whichisprogrammedtoperformdecompositionsusingtheShapleyvaluealgorithmdevelopedbyAraarandDuclos(2008).

Araar A and Duclos J‐Y (2008), “An algorithm for computing the Shapley Value”, PEP andCIRPEE.Tech.‐Note:Novembre‐2008:http://dad.ecn.ulaval.ca/pdf_files/shap_dec_aj.pdf

EmpiricalillustrationwithaNigerianhouseholdsurveyWe use a survey of Nigerian households (NLSS, using 17764 observations) carried out betweenSeptember2003andAugust2004toillustratetheuseofthedfgtsmodule.Weusepercapitatotalhousehold income as a measure of individual living standards. Household observations areweighted by household size and samplingweights to assess poverty over all individuals. The sixmainincomecomponentsare:

source_1:Employmentincome; source_2:Agriculturalincome; source_3:Fish‐processingincome; source_4:Non‐farmbusinessincome; source_5:Remittancesreceived; source_6:Allotherincome;

Page 44: Distributive Analysis Stata Package V2.3

44

TheStatadatafileissavedafterinitializingitssamplingdesignwiththecommandsvyset.Toopenthedialogboxformoduledfgts,typedbdfgtsinthecommandwindow.Figure9:DecompositionofFGTbyincomecomponents

Indicatethevarlistofthesixincomesources. Indicatethatthepovertylineissetto15000$N. SetthevariableHOUSEHOLDSIZE. SetthevariableHOUSEHOLDWEIGHT. ClickonthebuttonSUBMIT.Thefollowingresultsappear:

Page 45: Distributive Analysis Stata Package V2.3

45

15.6 DecompositionofthevariationinFGTindicesintogrowthandredistributioncomponents(dfgtgr)

DattandRavallion(1992)decomposethechangeintheFGTindexbetweentwoperiods,t1andt2,intogrowthandredistributioncomponentsasfollows:

t2 t1 t1 t1 t1 t2 t1 t12 1

var iation C1 C2

P P P( , ) P( , ) P( , ) P( , ) R / ref 1

t2 t2 t1 t2 t2 t2 t2 t1

2 1

var iation C1 C2

P P P( , ) P( , ) P( , ) P( , ) R / ref 2

wherevariation =differenceinpovertybetweent1andt2;C1 =growthcomponent;C2 =redistributioncomponent;R =residual;Ref =periodofreference.

1 1( , )t tP :theFGTindexofthefirstperiod1 1( , )t tP :theFGTindexofthesecondperiod

Page 46: Distributive Analysis Stata Package V2.3

46

),(P 1t2t :theFGTindexofthefirstperiodwhenallincomes 1tiy ofthefirstperiodaremultiplied

by 1t2t /

),(P 2t1t :theFGTindexofthesecondperiodwhenallincomes2t

iy ofthesecondperiodare

multipliedby 2t1t / TheShapleyvaluedecomposesthevariationintheFGTIndexbetweentwoperiods,t1andt2,intogrowthandredistributioncomponentsasfollows:

21

Variation

12 CCPP

),(P),(P),(P),(P2

1C 2t1t2t2t1t1t1t2t

1 ),(P),(P),(P),(P

2

1C 1t2t2t2t1t1t2t1t

2

15.7 DecompositionofchangeinFGTpovertybypovertyandpopulationgroupcomponents–sectoraldecomposition‐(dfgtg2d).

Additivepovertymeasures, like theFGT indices, canbeexpressedasa sumof the

poverty contributionsof thevarious subgroupsofpopulation.Each subgroupcontributesbyitspopulationshareandpovertylevel.Thus,thechangeinpovertyacrosstimedependsonthechangeinthesetwocomponents.Denotingthepopulationshareofgroup inperiod by , the change in poverty between two periods can be expressed as (see Huppi(1991)andDuclosandAraar(2006)):

(06)

Thisdecompositionuse the initialperiodas theone. If the referenceperiod is the

final,thedecompositiontakestheform:

Page 47: Distributive Analysis Stata Package V2.3

47

(06)

Toremovethearbitrarnessinselectingthereferenceperiod,wecanusetheShapley

decompositionapproach,finding:

(07)

where is the average population share and

.TheDASPmoduledfgtg2dperformsthissectoraldecomposition,andthisbyselectingthereferenceperiodoftheShapleyapproach(seethefollowingdialogbox):

Page 48: Distributive Analysis Stata Package V2.3

48

Figure 10: Sectoral decomposition of FGT

=== === === Population 0.005650 0.002463 0.000000 0.002932 0.004317 0.000000 Inactive -0.001724 -0.011681 0.000000 0.016127 0.018726 0.000000 Subsistence farmer 0.012648 -0.014336 0.000000 0.005992 0.013963 0.000000 Crop farmer -0.010387 0.029328 0.000000 0.000700 0.000930 0.000000 Other type of earner 0.000610 -0.000234 0.000000 0.001380 0.002417 0.000000 Artisan or trader 0.001731 -0.000768 0.000000 0.000930 0.001222 0.000000 Wage-earner (private sector) 0.001224 0.000218 0.000000 0.001117 0.001931 0.000000 Wage-earner (public sector) 0.001548 -0.000064 0.000000 Component Component Component Group Poverty Population Interaction Decomposition components

0.000000 0.016124 0.000000 0.010927 0.019477 Population 1.000000 0.444565 1.000000 0.452677 0.008113 0.004839 0.035336 0.003354 0.032340 0.047901 Inactive 0.075856 0.414986 0.046719 0.386852 -0.028134 0.016403 0.021132 0.015083 0.011572 0.024093 Subsistence farmer 0.680885 0.514999 0.653552 0.533956 0.018957 0.014896 0.034911 0.014125 0.024457 0.042625 Crop farmer 0.104402 0.500707 0.167806 0.424391 -0.076316 0.001308 0.060817 0.000923 0.089680 0.108357 Other type of earner 0.006650 0.194481 0.005689 0.293404 0.098923 0.004288 0.014712 0.004666 0.018202 0.023404 Artisan or trader 0.062640 0.097548 0.055795 0.126776 0.029228 0.002164 0.024093 0.002624 0.023087 0.033369 Wage-earner (private sector) 0.026598 0.067271 0.029035 0.111283 0.044012 0.003790 0.012599 0.003927 0.023396 0.026573 Wage-earner (public sector) 0.042971 0.022406 0.041403 0.059094 0.036688 Pop. share FGT index Pop. share FGT index FGT index Group Initial Initial Final Final Difference in Population shares and FGT indices

Parameter alpha : 0.00 Group variable : gse Decomposition of the FGT index by groups

. dfgtg2d exppc exppcz, alpha(0) hgroup(gse) pline(41099) file1(C:\data\bkf94I.dta) hsize1(size) file2(C:\data\bkf98I.dta) hsize2(size) ref(0)

Page 49: Distributive Analysis Stata Package V2.3

49

15.8 DecompositionofFGTpovertybytransientandchronicpovertycomponents(dtcpov)

Thisdecomposestotalpovertyacrosstimeintotransientandchroniccomponents.TheJalanandRavallion(1998)approach

Let tiy betheincomeofindividualiinperiodtand i beaverageincomeovertheTperiodsforthat

sameindividuali,i=1,…,N.Totalpovertyisdefinedas:

T N ti i

t 1i 1N

ii 1

w (z y )

TP( , z)

T w

Thechronicpovertycomponentisthendefinedas:N

i ii 1

Ni

i 1

w (z )CPC( , z)

w

Transientpovertyequals:

TPC( , z) TP( , z) CPC( , z)

Duclos,AraarandGiles(2006)approach

Let tiy be the incomeof individual i inperiodtand i beaverage incomeover theTperiods for

individuali.Let ( , z) bethe”equally‐distributed‐equivalent”(EDE)povertygapsuchthat:

1/( , z) TP( , z)

Transientpovertyisthendefinedas

Ni i

i 1N

ii 1

w ( , z)TPC( , z)

w

where i i i, z 1, z andB1/

T ti i

i t( , z) (z y ) / T

Page 50: Distributive Analysis Stata Package V2.3

50

andchronicpovertyisgivenby

CPC( , z) ( , z) TPC( , z)

Notethatthenumberofperiodsavailableforthistypeofexerciseisgenerallysmall.Becauseofthis,abias‐correctionistypicallyuseful,usingeitherananalytical/asymptoticorbootstrapapproach.Toopenthedialogboxformoduledtcpov,typedbdtcpovinthecommandwindow.Figure11:Decompositionofpovertyintotransientandchroniccomponents

Theusercanselectmorethanonevariableofinterestsimultaneously,whereeachvariable

representsincomeforoneperiod. Theusercanselectoneofthetwoapproachespresentedabove. Small‐T‐bias‐correctionscanbeapplied,usingeitherananalytical/asymptoticora

bootstrapapproach. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged.

References JalanJyotsna,andMartinRavallion.(1998)"TransientPovertyinPostreformRuralChina"JournalofComparativeEconomics,26(2),pp.338:57.Jean‐Yves Duclos & Abdelkrim Araar & John Giles, 2006. "Chronic and Transient Poverty:MeasurementandEstimation,withEvidencefromChina,"WorkingPaper0611,CIRPEE.

Page 51: Distributive Analysis Stata Package V2.3

51

15.9 Inequality:decompositionbyincomesources(diginis)AnalyticalapproachThediginismoduledecomposesthe(usual)relativeortheabsoluteGiniindexbyincomesources.Thethreeavailableapproachesare:

Rao’sapproach(1969) LermanandYitzhaki’sapproach(1985) Araar’sapproach(2006)

Reference(s)

Lerman, R. I., and S. Yitzhaki. "Income Inequality Effects by Income Source: A NewApproachandApplications to theUnitedStates."ReviewofEconomics andStatistics67(1985):151‐56.

AraarAbdelkrim(2006).OntheDecompositionoftheGiniCoefficient:anExactApproach,withanIllustrationUsingCameroonianData,Workingpaper02‐06,CIRPEE.

ShapleyapproachThedsineqsmoduledecomposes inequality indices intoa sumof thecontributionsgeneratedbyseparateincomecomponents.ThedsineqsStatamoduleestimates: Theshareintotalincomeofeachincomesource k ; Theabsolutecontributionofeachsource k totheGiniindex; Therelativecontributionofeachsource k totheGiniindex;

For theShapleydecomposition, therule that isused toestimate the inequality index fora

subset of components is by suppressing the inequality generated by the complement subset ofcomponents. For this, we generate a counterfactual vector of income that equals the sum of thecomponentsofthesubsetplustheaverageofthecomplementsubset. Notethatthedsineqsadofile requires themoduleshapar.ado,which isprogrammed toperformdecompositionsusing theShapleyvaluealgorithmdevelopedbyAraarandDuclos(2008).

Araar A and Duclos J‐Y (2008), “An algorithm for computing the Shapley Value”, PEP andCIRPEE.Tech.‐Note:Novembre‐2008::http://dad.ecn.ulaval.ca/pdf_files/shap_dec_aj.pdf

To open the dialog box for module dsginis, type db dsginis in the command window.

Page 52: Distributive Analysis Stata Package V2.3

52

Figure 12: Decomposition of the Gini index by income sources (Shapley approach)

15.10 Regression‐baseddecompositionofinequalitybyincomesourcesA useful approach to show the contribution of income covariates to total inequality is bydecomposingthelatterbythepredictedcontributionsofcovariates.Formally,denotetotalincome

by y andthesetofcovariatesby 1, 2 , , KX x x x .Usingalinearmodelspecification,wehave:

0 1 1 2 2ˆ ˆ ˆ ˆ ˆ ˆ

k k K Ky x x x x

where 0 and denoterespectivelytheestimatedconstanttermandtheresidual.Twoapproachesforthedecompositionoftotalinequalitybyincomesourcesareused:

1‐ The Shapleyapproach: This approach is based on the expectedmarginal contribution ofincomesourcestototalinequality.

2‐ TheAnalyticalapproach:Thisapproach isbasedonalgebraicdevelopments that expresstotalinequalityasasumofinequalitycontributionsofincomesources.

WiththeShapleyapproach: Theusercanselectamongthefollowingrelativeinequalityindices;

Giniindex Atkinsonindex Generalizedentropyindex Coefficientvariationindex

Theusercanselectamongthefollowingmodelspecifications;

Linear: 0 1 1 2 2ˆ ˆ ˆ ˆ ˆ

K Ky x x x

SemiLogLinear: 0 1 1 2 2ˆ ˆ ˆ ˆ ˆlog( ) K Ky x x x

Page 53: Distributive Analysis Stata Package V2.3

53

WiththeAnalyticalapproach: Theusercanselectamongthefollowingrelativeinequalityindices;

Giniindex Squaredcoefficientvariationindex

Themodelspecificationislinear.

Decomposingtotalinequalitywiththeanalyticalapproach:

Total income equals 0 1 2 K Ry s s s s s where 0s is the estimated constant, ˆk k ks X

and Rs is the estimated residual. As reported byWang 2004, relative inequality indices are not

definedwhen the average of the variable of interest equals zero (the case of the residual). Also,inequalityindicesequalzerowhenthevariableof interestisaconstant(thecaseoftheestimatedconstant).Todealwiththesetwoproblems,Wang(2004)proposesthefollowingbasicrules:

Let 0 1 2ˆ Ky s s s s and 1 2 Ky s s s ,then: 0( ) ( ) rI y cs I y cs

Thecontributionoftheconstant: 0 ˆ( ) ( )cs I y I y

Thecontributionoftheresidual: ˆ( ) ( )Rcs I y I y TheGiniindex:UsingRao1969’sapproach,therelativeGiniindexcanbedecomposedasfollows:

( ) kk

y

I y C

where y istheaverageof y and kC isthecoefficientofconcentrationof ks when y istherankingvariable.TheSquaredcoefficientofvariationindex:AsshownbyShorrocks1982,thesquaredcoefficientofvariationindexcanbedecomposedas:

21

ov( , )( )

Kk

k y

C y sI y

Shapleydecompositions:TheShapleyapproachisbuiltaroundtheexpectedmarginalcontributionofacomponent.Theusercanselectamongtwomethodstodefinetheimpactofmissingagivencomponent.

Withoption:method(mean),whenacomponentismissingfromagivensetofcomponents,itisreplacedbyitsmean.

Withoption:method(zero),whenacomponentismissingfromagivensetofcomponents,itisreplacedbyzero.

Asindicatedabove,wecannotestimaterelativeinequalityfortheresidualcomponent.

Page 54: Distributive Analysis Stata Package V2.3

54

Forthelinearmodel,thedecompositiontakesthefollowingform: ˆ( ) ( ) rI y I y cs ,where

thecontributionoftheresidualis ˆ( ) ( )rcs I y I y .

FortheSemi‐loglinearmodel,theShapleydecompositionisappliedtoallcomponentsincludingtheconstantandtheresidual.

WiththeShapleyapproach,theusercanusetheloglinearspecification.However,theusermustindicatetheincomevariableandnotthelogofthatvariable(DASPautomaticallyrunstheregressionwithlog(y)asthedependentvariable).Example1

Page 55: Distributive Analysis Stata Package V2.3

55

Page 56: Distributive Analysis Stata Package V2.3

56

Page 57: Distributive Analysis Stata Package V2.3

57

Example2

Withthisspecification,wehave 0 1 2xp( )K Ry E s s s s s .Then:

Wecannotestimatetheincomeshare(nolinearform);

Page 58: Distributive Analysis Stata Package V2.3

58

Thecontributionoftheconstantisnil. 01

xp( ). xp( ). xp( )K

k Ek

y E s E s E s

.Addinga

constantwillhavenotanyimpact.

Example3

15.11 Giniindex:decompositionbypopulationsubgroups(diginig).The diginig module decomposes the (usual) relative or the absolute Gini index by populationsubgroups.Let therebeGpopulationsubgroups.Wewish todetermine thecontributionof everyoneofthosesubgroupstototalpopulationinequality.TheGiniindexcanbedecomposedasfollows:

G

g g gWithin Overlapg 1

Between

I I I R

where

Page 59: Distributive Analysis Stata Package V2.3

59

g thepopulationshareofgroupg;

g theincomeshareofgroupg.

I

between‐groupinequality(wheneachindividualisassignedtheaverageincomeofhisgroup).

R

Theresidueimpliedbygroupincomeoverlap

15.12 Generalizedentropyindicesofinequality:decompositionbypopulationsubgroups(dentropyg).

TheGeneralisedEntropyindicesofinequalitycanbedecomposedasfollows:

K

k 1

ˆ (k)ˆˆ ˆI( ) (k) .I(k; ) I( )ˆ

where:B )k( istheproportionofthepopulationfoundinsubgroupk.

B )k( isthemeanincomeofgroupk.

B ;kI isinequalitywithingroupk.

B I is population inequality if each individual in subgroup k is given the meanincomeofsubgroupk, (k) .

15.13 Polarization:decompositionoftheDERindexbypopulationgroups(dpolag)

As proposed byAraar (2008), theDuclos, Esteban andRay index can be decomposed as

follows:

1 1

Between

Within

g g g gg

P R P P

where

1

1

( ) ( ) ( )

( ) ( )

g gg

g g g

a x x f x dxR

a x f x dx

g and g arerespectivelythepopulationandincomesharesofgroup g .

Page 60: Distributive Analysis Stata Package V2.3

60

( )g x denotesthelocalproportionofindividualsbelongingtogroup g andhaving

income x ; P istheDERpolarizationindexwhenthewithin‐grouppolarizationorinequalityis

ignored;

ThedpolasmoduledecomposestheDERindexbypopulationsubgroups.Reference(s)

Abdelkrim Araar, 2008. "On the Decomposition of Polarization Indices: Illustrations with Chinese and Nigerian Household Surveys," Cahiers de recherche 0806, CIRPEE.

15.14 Polarization:decompositionoftheDERindexbyincomesources(dpolas)

As proposed byAraar (2008), theDuclos, Esteban andRay index can be decomposed as

follows:

k kk

P CP

where1

1

( ) ( )kk

k k

f x a x dxCP

and k arerespectivelythepseudoconcentrationindexand

incomeshareofincomesource k .ThedpolasmoduledecomposestheDERindexbyincomesources.Reference(s)

Abdelkrim Araar, 2008. "On the Decomposition of Polarization Indices: Illustrations with Chinese and Nigerian Household Surveys," Cahiers de recherche 0806, CIRPEE.

16 DASPandcurves.

16.1 FGTCURVES(cfgt).FGTcurvesareusefuldistributivetoolsthatcaninteraliabeusedto:

1. Showhowthelevelofpovertyvarieswithdifferentpovertylines;2. Testforpovertydominancebetweentwodistributions;

Page 61: Distributive Analysis Stata Package V2.3

61

3. Testpro‐poorgrowthconditions.FGTcurvesarealsocalledprimaldominancecurves.Thecfgtmoduledrawssuchcurveseasily.Themodulecan: drawmorethanoneFGTcurvesimultaneouslywhenevermorethanonevariableofinterest

isselected; drawFGTcurvesfordifferentpopulationsubgroupswheneveragroupvariableisselected; drawFGTcurvesthatarenotnormalizedbythepovertylines; drawdifferencesbetweenFGTcurves; listorsavethecoordinatesofthecurves; savethegraphsindifferentformats:

o *.gph:Stataformat;o *.wmf:typicallyrecommendedtoinsertgraphsinWorddocuments;o *.eps:typicallyrecommendedtoinsertgraphsinTex/Latexdocuments.

Manygraphicaloptionsareavailabletochangetheappearanceofthegraphs.Toopenthedialogboxofthemodulecfgt,typethecommanddbdfgtinthecommandwindow.Figure13:FGTcurves

InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.4.

Page 62: Distributive Analysis Stata Package V2.3

62

FGTCURVEwithconfidenceinterval(cfgts).ThecfgtsmoduledrawsanFGTcurveanditsconfidenceintervalbytakingintoaccountsamplingdesign.Themodulecan: drawanFGTcurveandtwo‐sided,lower‐boundedorupper‐boundedconfidenceintervals

aroundthatcurve; conditiontheestimationonapopulationsubgroup; drawaFGTcurvethatisnotnormalizedbythepovertylines; listorsavethecoordinatesofthecurveandofitsconfidenceinterval; savethegraphsindifferentformats:

o *.gph:Stataformat;o *.wmf:typicallyrecommendedtoinsertgraphsinWorddocuments;o *.eps:typicallyrecommendedtoinsertgraphsinTex/Latexdocuments.

Manygraphicaloptionsareavailabletochangetheappearanceofthegraphs.InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.5.

16.3 DifferencebetweenFGTCURVESwithconfidenceinterval(cfgts2d).Thecfgts2dmoduledrawsdifferencesbetweenFGTcurvesandtheirassociatedconfidenceintervalbytakingintoaccountsamplingdesign.Themodulecan: drawdifferencesbetweenFGTcurvesandtwo‐sided,lower‐boundedorupper‐bounded

confidenceintervalsaroundthesedifferences; normalizeornottheFGTcurvesbythepovertylines; listorsavethecoordinatesofthedifferencesbetweenthecurvesaswellastheconfidence

intervals; savethegraphsindifferentformats:

o *.gph:Stataformat;o *.wmf:typicallyrecommendedtoinsertgraphsinWorddocuments;o *.eps:typicallyrecommendedtoinsertgraphsinTex/Latexdocuments.

Manygraphicaloptionsareavailabletochangetheappearanceofthegraphs.

InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.5.

LorenzandconcentrationCURVES(clorenz).Lorenzandconcentrationcurvesareusefuldistributivetoolsthatcaninteraliabeusedto:

1. showthelevelofinequality;2. testforinequalitydominancebetweentwodistributions;3. testforwelfaredominancebetweentwodistributions;4. testforprogressivity.

TheclorenzmoduledrawsLorenzandconcentrationcurvessimultaneously.Themodulecan:

Page 63: Distributive Analysis Stata Package V2.3

63

drawmorethanoneLorenzorconcentrationcurvesimultaneouslywhenevermorethan

onevariableofinterestisselected; drawmorethanonegeneralizedorabsoluteLorenzorconcentrationcurvesimultaneously

whenevermorethanonevariableofinterestisselected; drawmorethanonedeficitsharecurve; drawLorenzandconcentrationcurvesfordifferentpopulationsubgroupswheneveragroup

variableisselected; drawdifferencesbetweenLorenzandconcentrationcurves; listorsavethecoordinatesofthecurves; savethegraphsindifferentformats:

o *.gph:Stataformat;o *.wmf:typicallyrecommendedtoinsertgraphsinWorddocuments;o *.eps:typicallyrecommendedtoinsertgraphsinTex/Latexdocuments.

Manygraphicaloptionsareavailabletochangetheappearanceofthegraphs.Toopenthedialogboxofthemoduleclorenz,typethecommanddbclorenzinthecommandwindow.Figure14:Lorenzandconcentrationcurves

InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.8.

16.5 Lorenz/concentrationcurveswithconfidenceintervals(clorenzs).TheclorenzsmoduledrawsaLorenz/concentrationcurveanditsconfidenceintervalbytakingsamplingdesignintoaccount.Themodulecan:

Page 64: Distributive Analysis Stata Package V2.3

64

drawaLorenz/concentrationcurveandtwo‐sided,lower‐boundedorupper‐boundedconfidenceintervals;

conditiontheestimationonapopulationsubgroup; drawLorenz/concentrationcurvesandgeneralizedLorenz/concentrationcurves; listorsavethecoordinatesofthecurvesandtheirconfidenceinterval; savethegraphsindifferentformats:

o *.gph:Stataformat;o *.wmf:typicallyrecommendedtoinsertgraphsinWorddocuments;o *.eps:typicallyrecommendedtoinsertgraphsinTex/Latexdocuments.

Manygraphicaloptionsareavailabletochangetheappearanceofthegraphs.

16.6 DifferencesbetweenLorenz/concentrationcurveswithconfidenceinterval(clorenzs2d)

Theclorenz2dmoduledrawsdifferencesbetweenLorenz/concentrationcurvesandtheirassociatedconfidenceintervalsbytakingsamplingdesignintoaccount.Themodulecan: drawdifferencesbetweenLorenz/concentrationcurvesandassociatedtwo‐sided,lower‐

boundedorupper‐boundedconfidenceintervals; listorsavethecoordinatesofthedifferencesandtheirconfidenceintervals; savethegraphsindifferentformats:

o *.gph:Stataformat;o *.wmf:typicallyrecommendedtoinsertgraphsinWorddocuments;o *.eps:typicallyrecommendedtoinsertgraphsinTex/Latexdocuments.

Manygraphicaloptionsareavailabletochangetheappearanceofthegraphs.

16.7 Povertycurves(cpoverty)Thecpovertymoduledrawsthepovertygaporthecumulativepovertygapcurves.

o Thepovertygapatapercentile p is: ( ; ) ( ( ))G p z z Q p

o Thecumulativepovertygapatapercentile p ,notedby ( ; )CPG p z ,isgivenby:

1

1

( ) ( ( ))

( ; )

ni i i

in

ii

w z y I y Q p

CPG p z

w

Themodulecanthus: drawmorethanonepovertygaporcumulativepovertygapcurvessimultaneously

whenevermorethanonevariableofinterestisselected; drawpovertygaporcumulativepovertygapcurvesfordifferentpopulationsubgroups

wheneveragroupvariableisselected;

Page 65: Distributive Analysis Stata Package V2.3

65

drawdifferencesbetweenpovertygaporcumulativepovertygapcurves; listorsavethecoordinatesofthecurves; savethegraphsindifferentformats:

o *.gph:Stataformat;o *.wmf:typicallyrecommendedtoinsertgraphsinWorddocuments;o *.eps:typicallyrecommendedtoinsertgraphsinTex/Latexdocuments.

Manygraphicaloptionsareavailabletochangetheappearanceofthegraphs.

16.8 Consumptiondominancecurves(cdomc) Consumption dominance curves are useful tools for studying the impact of indirect tax fiscal reforms on

poverty. The thj Commodity or Component dominance (C-Dominance for short) curve is defined as follows:

2

1

1

1

1

( )

2

( , )

( )

| ( ) 1

n jsi i i

in

iij

n ji i i

j in

ii

w z y y

if s

w

CD z s

w K z y y

E y y z f z if s

w

where K( ) is a kernel function and yj is the thj commodity. Dominance of order s is checked by

setting s 1 . Thecdomcmoduledrawssuchcurveseasily.Themodulecan: drawmorethanoneCDcurvesimultaneouslywhenevermorethanonecomponentis

selected; drawtheCDcurveswithconfidenceintervals; estimatetheimpactofchangeinpriceofagivencomponentonFGTindex(CDcurve)fora

specifiedpovertyline; drawthenormalizedCDcurvesbytheaverageofthecomponent; listorsavethecoordinatesofthecurves; savethegraphsindifferentformats:

o *.gph:Stataformat;o *.wmf:typicallyrecommendedtoinsertgraphsinWorddocuments;o *.eps:typicallyrecommendedtoinsertgraphsinTex/Latexdocuments.

Manygraphicaloptionsareavailabletochangetheappearanceofthegraphs.Toopenthedialogboxofthemodulecdomc,typethecommanddbcdomcinthecommandwindow.

Page 66: Distributive Analysis Stata Package V2.3

66

Figure15:Consumptiondominancecurves

16.9 Difference/Ratiobetweenconsumptiondominancecurves(cdomc2d)

Thecdomc2dmoduledrawsdifferenceorratiobetweenconsumptiondominancecurvesandtheirassociatedconfidenceintervalsbytakingsamplingdesignintoaccount.Themodulecan: drawdifferencesbetweenconsumptiondominancecurvesandassociatedtwo‐sided,lower‐

boundedorupper‐boundedconfidenceintervals; listorsavethecoordinatesofthedifferencesandtheirconfidenceintervals; savethegraphsindifferentformats:

o *.gph:Stataformat;o *.wmf:typicallyrecommendedtoinsertgraphsinWorddocuments;o *.eps:typicallyrecommendedtoinsertgraphsinTex/Latexdocuments.

Manygraphicaloptionsareavailabletochangetheappearanceofthegraphs.

16.10 DASPandtheprogressivitycurves

16.10.1 CheckingtheprogressivityoftaxesortransfersThe module cprog allows checking whether taxes or transfers are progressive. Let X be a gross income, T be a given tax and B be a given transfer. The tax T is Tax Redistribution (TR) progressive if :

PR p L p C p 0 0,1X T p

Page 67: Distributive Analysis Stata Package V2.3

67

The transfer B is Tax Redistribution (TR) progressive if : PR p C p L p 0 0,1B X p

The tax T is Income Redistribution (IR) progressive if :

PR p C p L p 0 0,1X T X p

The transfer B is Income Redistribution (IR) progressive if :

PR p C p L p 0 0,1X B X p

16.10.2 CheckingtheprogressivityoftransfervstaxThe module cprogbt allows checking whether a given transfer is more progressive than a given tax. The transfer B is more Tax Redistribution (TR) progressive than a tax T if :

PR p C p C p 2L p 0 0,1B T X p

The transfer B is more Income Redistribution (TR) progressive than a tax T if :

PR p C p -C p 0 0,1X B X T p

17 Dominance

17.1 Povertydominance(dompov)

Distribution1dominatesdistribution2atordersovertherange z,z ifonlyif:

1 2( ; ) ( ; ) ,P P z z for 1s .

ThisinvolvescomparingstochasticdominancecurvesatordersorFGTcurveswith 1s .Thisapplicationestimatesthepointsatwhichthereisareversaloftherankingofthecurves.Saiddifferently,itprovidesthecrossingpointsofthedominancecurves,thatis,thevaluesof and

1 ( ; )P forwhich1 2( ; ) ( ; )P P when:

1 2 2 1( ( ; ) ( ; )) ( ( ; ) ( ; ))sign P P sign P P forasmall .Thecrossingpoints canalsobereferredtoas“criticalpovertylines”.Thedompovmodulecanbeusedtocheckforpovertydominanceandtocomputecriticalvalues.ThismoduleismostlybasedonAraar(2006):

Page 68: Distributive Analysis Stata Package V2.3

68

Araar,Abdelkrim,(2006),Poverty,InequalityandStochasticDominance,TheoryandPractice:Illustration with Burkina Faso Surveys,Working Paper: 06‐34. CIRPEE, Department ofEconomics,UniversitéLaval.

InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.6.

17.2 Inequalitydominance(domineq)

Distribution1inequality‐dominatesdistribution2atthesecondorderifandonlyif:

1 2( ) ( ) 0,1 L p L p p

Themoduledomineqcanbeusedtocheckforsuchinequalitydominance.ItisbasedmainlyonAraar(2006):

Araar,Abdelkrim,(2006),Poverty,InequalityandStochasticDominance,TheoryandPractice:Illustration with Burkina Faso Surveys,Working Paper: 06‐34. CIRPEE, Department ofEconomics,UniversitéLaval.

Intersectionsbetweencurvescanbeestimatedwiththismodule.ItcanalsousedtocheckfortaxandtransferprogressivitybycomparingLorenzandconcentrationcurves.

17.3 DASPandbi‐dimensionalpovertydominance(dombdpov)Lettwodimensionsofwell‐beingbedenotedby 1,2k .Theintersectionbi‐dimensionalFGTindexfordistributionD isestimatedas

2

1 1

1

n k k ki i

i kD n

ii

w ( z y )

P ( Z; A )

w

where 1 2,Z z z and 1 2,A arevectorsofpovertylinesandparameters respectively,

and max( ,0)x x .

Distribution1dominatesdistribution2atorders 1 2,s s overtherange 0, Z ifandonlyif:

1 2 1 2( ; 1) ( ; 1) 0, 0,P Z A s P Z A s Z z z andfor 1 1 2 21, 1s s .

TheDASPdombdpovmodulecanbeusedtocheckforsuchdominance.Foreachofthetwodistributions:

Thetwovariablesofinterest(dimensions)shouldbeselected; Conditionscanbespecifiedtofocusonspecificpopulationsubgroups; Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged.

Page 69: Distributive Analysis Stata Package V2.3

69

Surfacesshowingthedifference,thelowerboundandtheupperboundoftheconfidencesurfacesareplottedinteractivelywiththeGnuPlottool.

Coordinatescanbelisted. CoordinatescanbesavedinStataorGnuPlot‐ASCIIformat.

InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.12.

18 Distributivetools

18.1 Quantilecurves(c_quantile)Thequantileatapercentilepofacontinuouspopulationisgivenby:

1( ) ( )Q p F p where ( )p F y isthecumulativedistributionfunctionaty.For a discrete distribution, let n observations of living standards be ordered such that

1 2 1i i ny y y y y . If )()( 1 ii yFpyF , we define1( ) iQ p y . The normalised

quantileisdefinedas ( ) ( ) /Q p Q p .InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.10.

18.2 Incomeshareandcumulativeincomesharebygroupquantiles(quinsh)

Thismodulecanbeusedtoestimate the incomeshares,aswellas, thecumulative incomesharesbyquantilegroups.Theusercanindicatethenumberofgrouppartition.Forinstance,ifthenumber is five, the quintile income shares are provided. We can also plot the graph bar of theestimatedincomeshares.

18.3 Densitycurves(cdensity)TheGaussiankernelestimatorofadensityfunction )(xf isdefinedby

2

1

( ) 1ˆ ( ) ( ) exp 0.5 ( ) ( )2

i ii ii i in

ii

w K x x xf x and K x x and x

hhw

wherehisabandwidththatactsasa“smoothing”parameter.InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.10.Boundary bias correction: A problem occurs with kernel estimation when a variable of interest is bounded. It may be for instance that consumption is bounded between two bounds, a minimum and a maximum, and that we wish to estimate its density “close” to these two bounds. If the true value of the density at these two bounds is

Page 70: Distributive Analysis Stata Package V2.3

70

positive, usual kernel estimation of the density close to these two bounds will be biased. A similar problem occurs with non-parametric regressions. Renormalisation approach: One way to alleviate these problems is to use a smooth “corrected” Kernel estimator, following a paper by Peter Bearse, Jose Canals and Paul Rilstone. A boundary-corrected Kernel density estimator can then be written as

*i i ii

ni

i 1

w K (x)K (x)f (x)

w

where

h

xx)x(and)x(5.0exp

2h

1)x(K i

i2

ii

and where the scalar )x(K*i is defined as

))x((P)x()x(K i*i

!1s!21)(P

1s2

)0001(l,h

minxB,

h

maxxA:ld)(P)(P)(KlM)x( ss

1B

As1

min is the minimum bound, and max is the maximum one. h is the usual bandwidth. This correction removes bias to order hs. DASP offers four options, without correction, and with correction of order 1, 2 and 3. Refs:

Jones,M.C.1993,simplyboundarycorrectionforKerneldensityestimation.StatisticsandComputing3:135‐146.

Bearse,P.,Canals,J.andRilstone,P.EfficientSemiparametricEstimationofDurationModelsWithUnobservedHeterogeneity,EconometricTheory,23,2007,281–308

Reflection approach: The reflection estimator approaches the boundary estimator by “reflecting” the data at the boundaries:

ri ii

ni

i 1

w K (x)f (x)

w

r x X x X 2min x X 2maxK (x) K K K

h h h

Refs: CwikandMielniczuk(1993),Data‐dependentBandwidthChoiceforaGradeDensityKernel

Estimate.StatisticsandprobabilityLetters16:397‐405

Page 71: Distributive Analysis Stata Package V2.3

71

Silverman,B.W.(1986),DensityforStatisticsandDataAnalysis.LondonChapmanandHall(p30).

18.4 Non‐parametricregressioncurves(cnpe)Non‐parametric regression is useful to show the link between two variables without specifyingbeforehandafunctionalform.Itcanalsobeusedtoestimatethelocalderivativeofthefirstvariablewithrespecttothesecondwithouthavingtospecifythefunctionalformlinkingthem.Regressionswiththecnpemodulecanbeperformedwithoneofthefollowingtwoapproaches:

18.4.1 Nadaraya‐WatsonapproachAGaussiankernelregressionofyonxisgivenby:

( )( | )

( )i i ii

i ii

w K x yE y x y x

w K x

Fromthis,thederivativeof ( | )y x withrespecttoxisgivenby

( | )dy y xE x

dx x

18.4.2 LocallinearapproachThelocallinearapproachisbasedonalocalOLSestimationofthefollowingfunctionalform:

1 1 12 2 2( ) ( ) ( ) ( ) ( ) ( )i i i i iK x y x K x x K x x x v

or,alternatively,of:

1 1 12 2 2( ) ( ) ( ) ( )i i i i i iK x y K x K x x x v

Estimatesarethengivenby:

E y x ,

dyE x

dx

InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.10.

18.5 DASPandjointdensityfunctions.Themodulesjdensitycanbeusedtodrawajointdensitysurface.TheGaussiankernelestimatorofthejointdensityfunction ( , )f x y isdefinedas:

Page 72: Distributive Analysis Stata Package V2.3

72

22ni i

ini 1 x y

x y ii 1

x x y y1 1f (x, y) w exp

2 h h2 h h w

Withthismodule: Thetwovariablesofinterest(dimensions)shouldbeselected; specificpopulationsubgroupcanbeselected; surfacesshowingthejointdensityfunctionareplottedinteractivelywiththeGnuPlottool; coordinatescanbelisted;c coordinatescanbesavedinStataorGnuPlot‐ASCIIformat.

InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.11???

18.6 DASPandjointdistributionfunctions

Themodulesjdistrubcanbeusedtodrawjointdistributionsurfaces.Thejointdistributionfunction ( , )F x y isdefinedas:

ni i i

i 1n

ii 1

w I(x x)I(y y)F(x, y)

w

Withthismodule: Thetwovariablesofinterest(dimensions)shouldbeselected; specificpopulationsubgroupscanbeselected; surfacesshowingthejointdistributionfunctionareplottedinteractivelywiththeGnuPlot

tool; coordinatescanbelisted; coordinatescanbesavedinStataorGnuPlot‐ASCIIformat.

InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.11

19 DASPandpro‐poorgrowth

19.1 DASPandpro‐poorindicesThemoduleipropoorestimatessimultaneouslythethreefollowingpro‐poorindices:

1. TheChenandRavallionpro‐poorindex(2003):

Page 73: Distributive Analysis Stata Package V2.3

73

1 2

1

W ( z ) W ( z )Index

F ( z )

where DW ( z ) istheWattsindexfordistribution 1 2D , and 1F ( z ) istheheadcountfor

indexforthefirstdistribution,bothwithpovertylinesz.

2. TheKakwaniandPerniapro‐poorindex(2000):

1 2

1 1 1 2

P ( z, ) P ( z )Index

P ( z, ) P ( z( / ), )

3. TheKakwani,KhandkerandSonpro‐poorindex(2003):

wheretheaveragegrowthis 2 1 1g( ) / andwhereasecondindexisgivenby:

2 1Index _ Index _ g

Onevariableofinterestshouldbeselectedforeachdistribution. Conditionscanbespecifiedtofocusonspecificpopulationsubgroups. Standarderrorsandconfidenceintervalswithaconfidencelevelof95%areprovided.Both

thetypeofconfidenceintervalsprovidedandthelevelofconfidenceusedcanbechanged. Theresultsaredisplayedwith6decimals;thiscanbechanged. Alevelfortheparameter canbechosenforeachofthetwodistributions.

19.2 DASPandpro‐poorcurvesPro‐poorcurvescanbedrawnusingeithertheprimalorthedualapproach.Theformerusesincomelevels.Thelatterisbasedonpercentiles.

19.2.1 Primalpro‐poorcurvesThechangeinthedistributionfromstate1tostate2iss‐orderabsolutelypro‐poorwithstandardcons if:

+2 1( , ) ( , 1) ( , 1) <0 z 0,zz s P z cons s P z s

Thechangeinthedistributionfromstate1tostate2iss‐orderrelativelypro‐poorif:

+22 1

1

( , ) ( , 1) , 1 <0 z 0,zz s z P z s P z s

Themodule cpropoorp can be used to draw these primal pro‐poor curves and their associatedconfidenceintervalbytakingintoaccountsamplingdesign.Themodulecan:

1 2

1 1 1 21

P ( z, ) P ( z )Index _ g

P ( z, ) P ( z( / ), )

Page 74: Distributive Analysis Stata Package V2.3

74

drawpro‐poorcurvesandtheirtwo‐sided,lower‐boundedorupper‐boundedconfidenceintervals;

listorsavethecoordinatesofthedifferencesbetweenthecurvesaswellasthoseoftheconfidenceintervals;

savethegraphsindifferentformats:o *.gph:Stataformat;o *.wmf:typicallyrecommendedtoinsertgraphsinWorddocuments;o *.eps:typicallyrecommendedtoinsertgraphsinTex/Latexdocuments.

Manygraphicaloptionsareavailabletochangetheappearanceofthegraphs.InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.13.

19.2.2 Dualpro‐poorcurvesLet:

( )Q p :quantileatpercentile p .( )GL p :GeneralisedLorenzcurveatpercentile p .

:averagelivingstandards.Thechangeinthedistributionfromstate1tostate2isfirst‐orderabsolutelypro‐poorwithstandardcons=0if:

+2 1( , ) ( ) ( )>0 0, p ( )z s Q p Q p p F z

orequivalentlyif:

+2 1

1

( ) ( )( , ) >0 0, p ( )

( )

Q p Q pz s p F z

Q p

Thechangeinthedistributionfromstate1tostate2isfirst‐orderrelativelypro‐poorif:

+2 2

1 1

( )( , ) - >0 0, p ( )

( )

Q pz s p F z

Q p

Thechangeinthedistributionfromstate1tostate2issecond‐orderabsolutelypro‐poorif:

+2 1( , ) ( ) ( )>0 0, p ( )z s GL p GL p p F z

orequivalentlyif:

+2 1

1

( ) ( )( , ) >0 0, p ( )

( )

GL p GL pz s p F z

GL p

Thechangeinthedistributionfromstate1tostate2isfirst‐orderrelativelypro‐poorif:

Page 75: Distributive Analysis Stata Package V2.3

75

+2 2

1 1

( )( , ) - >0 0, p ( )

( )

GL pz s p F z

GL p

The module cpropoord can be used to draw these dual pro‐poor curves and their associatedconfidenceintervalbytakingintoaccountsamplingdesign.Themodulecan: drawpro‐poorcurvesandtheirtwo‐sided,lower‐boundedorupper‐boundedconfidence

intervals; listorsavethecoordinatesofthedifferencesbetweenthecurvesaswellasthoseofthe

confidenceintervals; savethegraphsindifferentformats:

o *.gph:Stataformat;o *.wmf:typicallyrecommendedtoinsertgraphsinWorddocuments;o *.eps:typicallyrecommendedtoinsertgraphsinTex/Latexdocuments.

Manygraphicaloptionsareavailabletochangetheappearanceofthegraphs.InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.13

20 DASPandBenefitIncidenceAnalysis

20.1 Benefitincidenceanalysis

Themainobjectiveofbenefitincidenceistoanalysethedistributionofbenefitsfromtheuseofpublicservicesaccordingtothedistributionoflivingstandards.

Two main sources of information are used. The first informs on access of household

memberstopublicservices.Thisinformationcanbefoundinusualhouseholdsurveys.Theseconddeals with the amount of total public expenditures on each public service. This information isusuallyavailableatthenationallevelandsometimesinamoredisaggregatedformat,suchasattheregionallevel.Thebenefitincidenceapproachcombinestheuseofthesetwosourcesofinformationtoanalysethedistributionofpublicbenefitsanditsprogressivity.

Formally,let

iw bethesamplingweightofobservation i ;

iy bethelivingstandardofmembersbelongingtoobservation i (i.e.,percapitaincome);

sie bethenumberof“eligible”membersofobservationi,i.e.,membersthat“need”the

publicserviceprovidedbysectors.ThereareSsectors;

sif bethenumberofmembersofobservationi thateffectivelyusethepublicservice

providedbysectors;

ig bethesocio‐economicgroupofeligiblemembersofobservationi (typicallyclassifiedbyincomepercentiles);

Page 76: Distributive Analysis Stata Package V2.3

76

ic beasubgroupindicatorforobservationi (e.g.,1foraruralresident,and2foranurbanresident).Eligiblememberscanthusbegroupedintopopulationexclusivesubgroups;

srE betotalpublicexpendituresonsector s inarea r .ThereareRareas(theareahere

referstothegeographicaldivisionwhichonecanhavereliableinformationontotalpublicexpendituresonthestudiedpublicservice);

sE betotalpublicexpendituresonsector s

Rs s

rr 1

E E

.

Herearesomeofthestatisticsthatcanbecomputed.

1. Theshareofaginsector s isdefinedasfollows:

ns

i is i 1g n

si i

i 1

w f I(i g)

SH

w f

Notethat:G

sg

g 1

SH 1

.

2. Therateofparticipationofagroupginsectors isdefinedasfollows:

ns

i is i 1g n

si i

i 1

w f I(i g)

CR

w e I(i g)

Thisratecannotexceed100%since s si if e i .

3. Theunitcostofabenefitinsectorsforobservation j ,whichreferstothehousehold

membersthatliveinarea r :

r

ss rj n

sj j

j 1

EUC

w f

where rn isthenumberofsampledhouseholdsinarear.

4. Thebenefitofobservationifromtheuseofpublicsector s is:

s s si i iB f UC

5. Thebenefitofobservation i fromtheuseoftheSpublicsectorsis:

Page 77: Distributive Analysis Stata Package V2.3

77

Ss

i is 1

B B

6. Theaveragebenefitatthelevelofthoseeligibletoaservicefromsectorsandforthose

observationsthatbelongtoagroupg ,isdefinedas:n

si i

s i 1g n

si i

i 1

w B I(i g)

ABE

w e I(i g)

7. Theaveragebenefitforthosethatusetheservice s andbelongtoagroupg isdefinedas:

8. Theproportionofbenefitsfromtheservicefromsectors thataccruestoobservationsthat

belongtoagroupg isdefinedas:

sgs

g s

BPB

E

wheren

s sg i i

i 1

B w B I(i g)

.

These statistics can be restricted to specific socio‐demographic groups (e.g.,. rural/urban) byreplacing I(i g) by I(i c) .

.Thebian.adomoduleallowsthecomputationofthesedifferentstatistics.Somecharacteristicsofthemodule:

o Possibilityofselectingbetweenoneandsixsectors.o Possibilityofusingfrequencydataapproachwheninformationabouttheleveloftotalpublic

expendituresisnotavailable.o Generationofbenefit variablesby the typeofpublic services (ex:primary, secondaryand

tertiaryeducationlevels)andbysector.o Generationofunitcostvariablesforeachsector.o Possibilityofcomputingstatisticsaccordingtogroupsofobservations.o Generationofstatisticsaccordingtosocial‐demographicgroups,suchasquartiles,quintiles

ordeciles.

ns

i is i 1g n

si i

i 1

w B I(i g)

ABF

w f I(i g)

Page 78: Distributive Analysis Stata Package V2.3

78

Publicexpendituresonagivenserviceoftenvaryfromonegeographicaloradministrativeareatoanother. When information about public expenditures is available at the level of areas, thisinformationcanbeusedwiththebianmoduletoestimateunitcostmoreaccurately.Example1Observationi HH

sizeEligibleHHmembers

Frequency Areaindicator Totallevel ofregionalpublicexpenditures

1 7 3 2 1 140002 4 2 2 1 140003 5 5 3 1 140004 6 3 2 2 120005 4 2 1 2 12000

Inthisexample,thefirstobservationcontainsinformationonhousehold1.

Thishouseholdcontains7individuals; Threeindividualsinthishouseholdareeligibletothepublicservice; Only2amongthe3eligibleindividualsbenefitfromthepublicservice; This household lives in area 1. In this area, the government spends a total of 14000 to

providethepublicserviceforthe7usersofthisarea(2+2+3).Theunitcostinarea1equals:14000/7=2000Theunitcostinarea2equals:12000/3=4000Bydefault, theareaindicator issetto1forallhouseholds.Whenthisdefault isused,thevariableRegionalpublicexpenditures(thefifthcolumnthatappearsinthedialogbox)shouldbesettototalpublic expenditures at the national level. This would occur when the information on publicexpendituresisonlyavailableatthenationallevel.Example2Observationi HH

sizeEligiblemembers

Frequency Areaindicator Regionalpublicexpenditures

1 7 3 2 1 280002 4 2 2 1 280003 5 5 3 1 280004 6 3 2 1 280005 4 2 1 1 28000

Theunitcostbenefit(atthenationallevel)equals:28000/10=2800InterestedusersareencouragedtoconsidertheexercisesthatappearinSection 23.14

Page 79: Distributive Analysis Stata Package V2.3

79

21 Disaggregatinggroupeddata TheungroupDASPmodulegeneratesdisaggregateddatafromaggregatedistributiveinformation.Aggregate information isobtained fromcumulative incomeshares (orLorenzcurveordinates)atsomepercentiles.Forinstance:

Percentile(p) 0.10 0.30 0.50 0.60 0.90 1.00Lorenzvalues:L(p) 0.02 0.10 0.13 0.30 0.70 1.00

Theusermustspecifythetotalnumberofobservationstobegenerated.Theusercanalsoindicatethe number of observations to be generated specifically at the top and/or at the bottom of thedistribution,inwhichcasetheproportion(in%)ofthepopulationfoundatthetoporatthebottommustalsobespecified.Remarks:

If only the total number of observations is set, the generated data are self weighted (oruniformlydistributedoverpercentiles).

Ifanumberofobservationsissetforthebottomand/ortoptails,thegenerateddataarenotselfweightedandaweightvariableisprovidedinadditiontothegeneratedincomevariable. Example:Assumethatthetotalnumberofobservationstobegeneratedissetto1900,

but thatwewould like the bottom 10%of the population to be represented by 1000observations.Inthiscase,weightswillequal1/1000forthebottom1000observationsand1/100fortheremainingobservations(thesumofweightsbeingnormalizedtoone).

Thegeneratedincomevectortakesthenameof_yandthevectorweight,_w. The number of observations to be generated does not have to equal the number of

observations of the sample thatwas originally used to generate the aggregated data. Theungroup module cannot in itself serve to estimate the sampling errors that would haveoccurred had the original sample data been used to estimate poverty and/or inequalityestimates.

Theusercanselectanysamplesizethatexceeds(number_of_classes+1),butitmaybemoreappropriateforstatisticalbias‐reductionpurposestoselectrelativelylargesizes.

STAGEIGeneratinganinitialdistributionofincomesandpercentiles

S.1.1:GeneratingavectorofpercentilesStarting from information on the importance of bottom and top groups and on the number ofobservationstobegenerated,wefirstgenerateavectorofpercentiles.

Page 80: Distributive Analysis Stata Package V2.3

80

Examples:Notations:NOBS:numberoftotalobservationsF:vectorofpercentilesB_NOBS:numberofobservationsforthebottomgroupT_NOBS:numberofobservationsforthetopgroup.

ForNOBS=1000spreadequallyacrossallpercentiles,F=0.001,0.002...0.999,1.Toavoidthe value F=1 for the last generated observation, we can simply replace F by F‐(0.5/NOBS).

For NOBS=2800, B_NOBS=1000 and T_NOBS=1000, with the bottom and top groupsbeingthefirstandlastdeciles:

a. F=0.0001,0.0002,...,0.0999,0.1000in0001/1000b. F=0.1010,0.1020,…,0.8990,0.9000in1001/1800c. F=0.9001,0.9002,...,0.9999,1.0000in1801/2800

AdjustmentscanalsobemadetoavoidthecaseofF(1)=1.Theweightvectorcaneasilybegenerated.S.1.2:GeneratinganinitialdistributionofincomesTheusermustindicatetheformofdistributionofthedisaggregateddata.‐Normalandlognormaldistributions:Assumethat x followsalognormaldistributionwithmean andvariance 2 .TheLorenzcurveisdefinedasfollows:

2( ) ( )( )

Ln xL p

and

( )Ln xp

Weassumethat 1 andweestimatethevarianceusingtheproceduresuggestedbyShorrrocksandWan(2008):avalueforthestandarddeviationoflogincomes,σ,isobtainedbyaveragingthe

1m estimatesof 1 1 ( ) 1, , 1kk kp L p k m

wherem isthenumberofclassesandΦisthestandardnormaldistributionfunction(AitchisonandBrown1957;KolenikovandShorrocks2005,Appendix).‐GeneralizedQuadraticLorenzCurve:Itisassumedthat:

2(1 ) ( ) ( 1) ( )L L a p L bL p c p L

Wecanregress (1 )L L on 2( )p L , ( 1)L p and ( )p L withoutanintercept,droppingthelastobservationsincethechosenfunctionalformforcesthecurvetogothrough(1,1).

Wehave 0.52 22

( )2 4

mp n mp np ebQ p

Page 81: Distributive Analysis Stata Package V2.3

81

2

1

4

2 4

e a b c

m b a

n be c

‐BetaLorenzCurve:Itisassumedthat:

log log( ) log( ) log(1 )p L p p

Afterestimatingtheparameters,wecangeneratequantilesasfollows

1

1Q p p p

p p

SeealsoDatt(1998).‐TheSingh‐MaddaladistributionThedistributionfunctionproposedbySinghandMaddala(1976)takesthefollowingform:

1( ) 1

1 ( / )a

q

F xx b

where 0, 0, 1/a b q a are parameters to be estimated. The income ( x ) is assumed to beequaltoorgreaterthanzero.Thedensityfunctionisdefinedasfollows:

( 1) 1( ) / 1 / /

qa af x aq b x b x b

Quantilesaredefinedasfollows:

1/1/( ) 1 1

aqQ p b p

WefollowJenkins(2008)’sapproachfortheestimationofparameters. Forthis,wemaximizethelikelihood function, which is simply the product of density functions evaluated at the averageincomeofeachclass:http://stata‐press.com/journals/stbcontents/stb48.pdfSTAGEIIAdjustingtheinitialdistributiontomatchtheaggregateddata(optional).

ThisstageadjuststheinitialvectorofincomesusingtheShorrocksandWan(2008)procedure.Thisprocedureproceedswithtwosuccessiveadjustments:

Adjustment1:Correctingtheinitialincomevectortoensurethateachincomegrouphasitsoriginalmeanincome.

Adjustment2:Smoothingtheinter–classdistributions.

The generated sample is saved automatically in a new Stata data file (called by defaultungroup_data.dta;namesanddirectoriescanbechanged).TheusercanalsoplottheLorenzcurvesoftheaggregated(whenweassumethateachindividualhastheaverageincomeofhisgroup)andgenerateddata.Dialogboxoftheungroupmodule

Page 82: Distributive Analysis Stata Package V2.3

82

Figure 16: ungroup dialog box

IllustrationwithBurkinaFasohouseholdsurveydataIn thisexample,weusedisaggregateddata togenerateaggregated information.Then,wecompare the density curve of the true data with those of the data generated throughdisaggregationofthepreviouslyaggregateddata.genfw=size*weightgeny=exppc/r(mean)clorenzy,hs(size)lres(1)

Aggregatedinformation:pL(p).1.0233349.2.0576717.3.0991386.4.1480407.5.2051758.6.2729623.7.3565971.8.4657389.9.621357111.00000

0.5

11.

5

0 2 4 6Normalised per capita expenditures

True distribition Log NormalUniform Beta LCGeneralized Quadratic LC SINGH & MADALLA

(without adjustment)Density functions

0.5

11.

5

0 2 4 6Normalised per capita expenditures

True distribition Log NormalUniform Beta LCGeneralized Quadratic LC SINGH & MADALLA

(with adjustment)Density functions

Page 83: Distributive Analysis Stata Package V2.3

83

22 Appendices

22.1 AppendixA:illustrativehouseholdsurveys

22.1.1 The1994BurkinaFasosurveyofhouseholdexpenditures(bkf94I.dta)Thisisanationallyrepresentativesurvey,withsampleselectionusingtwo‐stagestratifiedrandomsampling.Sevenstratawereformed.Fiveof thesestratawereruralandtwowereurban.Primarysamplingunitsweresampledfromalistdrawnfromthe1985census.Thelastsamplingunitswerehouseholds.Listofvariablesstrata Stratuminwhichahouseholdlives

psu Primarysamplingunit

weight Samplingweight

size Householdsize

exp Totalhouseholdexpenditures

expeq Totalhouseholdexpendituresperadultequivalent

expcp Totalhouseholdexpenditurespercapita

gse Socio‐economicgroupofthehouseholdhead

1wage‐earner(publicsector)2wage‐earner(privatesector)3Artisanortrader4Othertypeofearner5Cropfarmer6Subsistencefarmer7Inactive

sex Sexofhouseholdhead1Male2Female

zone Residentialarea1Rural2Urban

Page 84: Distributive Analysis Stata Package V2.3

84

22.1.2 The1998BurkinaFasosurveyofhouseholdexpenditures(bkf98I.dta)Thissurveyissimilartothe1994one,althoughtenstratawereusedinsteadofsevenfor1994.Toexpress 1998 data in 1994 prices, two alternative procedures have been used. First, 1998expendituredataweremultipliedbytheratioof the1994officialpoverty linetothe1998officialpoverty line: z_1994/z_1998. Second, 1998 expenditure data weremultiplied by the ratio of the1994consumerpriceindextothe1998consumerpriceindex:ipc_1994/ipc_1998.Listofnewvariablesexpcpz Totalhouseholdexpenditurespercapitadeflatedby(z_1994/z_1998)

expcpi Totalexpenditurespercapitadeflatedby(ipc_1994/ipc_1998)

22.1.3 CanadianSurveyofConsumerFinance(asubsampleof1000observations–can6.dta)

ListofvariablesX Yearlygrossincomeperadultequivalent.

T Incometaxesperadultequivalent.

B1 Transfer1peradultequivalent.

B2 Transfer2peradultequivalent.

B3 Transfer3peradultequivalent.

B SumoftransfersB1,B2andB3

N Yearlynetincomeperadultequivalent(XminusT plusB)

22.1.4 PeruLSMSsurvey1994(Asampleof3623householdobservations‐PEREDE94I.dta)

Listofvariablesexppc

Totalexpenditures,percapita(constantJune1994solesperyear).

weight Samplingweight

Page 85: Distributive Analysis Stata Package V2.3

85

size Householdsize

npubprim

Numberofhouseholdmembersinpublicprimaryschool

npubsec

Numberofhouseholdmembersinpublicsecondaryschool

npubuniv

Numberofhouseholdmembersinpublicpost‐secondaryschool

22.1.5 PeruLSMSsurvey1994(Asampleof3623householdobservations–PERU_A_I.dta)

Listofvariableshhid

HouseholdId.

exppc

Totalexpenditures,percapita(constantJune1994solesperyear).

size Householdsize

literate Numberofliteratehouseholdmembers

pliterate literate/size

22.1.6 The1995ColombiaDHSsurvey(columbiaI.dta)ThissampleisapartoftheDatafromtheDemographicandHealthSurveys(Colombia_1995)witchcontainsthefollowinginformationforchildrenaged0‐59monthsListofvariableshid Householdidhaz height‐for‐agewaz weight‐for‐agewhz weight‐for‐heightsprob survivalprobabilitywght samplingweightAsset assetindex

22.1.7 The1996DominicanRepublicDHSsurvey(Dominican_republic1996I.dta)

ThissampleisapartoftheDatafromtheDemographicandHealthSurveys(RepublicDominican_1996)witchcontainsthefollowinginformationforchildrenaged0‐59months

Page 86: Distributive Analysis Stata Package V2.3

86

Listofvariableshid Householdidhaz height‐for‐agewaz weight‐for‐agewhz weight‐for‐heightsprob survivalprobabilitywght samplingweightAsset assetindex

22.2 AppendixB:labellingvariablesandvalues Thefollowing.dofilecanbeusedtosetlabelsforthevariablesinbkf94.dta. Formoredetailsontheuseoflabelcommand,typehelplabelinthecommandwindow.

=================================lab_bkf94.do==================================#delim;/*Todropalllabelvalues*/labeldrop_all;/*Toassignlabels*/labelvarstrata"Stratuminwhichahouseholdlives";labelvarpsu"Primarysamplingunit";labelvarweight"Samplingweight";labelvarsize"Householdsize";labelvartotexp"Totalhouseholdexpenditures";labelvarexppc"Totalhouseholdexpenditurespercapita";labelvarexpeq"Totalhouseholdexpendituresperadultequivalent";labelvargse"Socio‐economicgroupofthehouseholdhead";/*Todefinethelabelvaluesthatwillbeassignedtothecategoricalvariablegse*/labeldefinelvgse1"wage‐earner(publicsector)"2"wage‐earner(privatesector)"3"Artisanortrader"4"Othertypeofearner"5"Cropfarmer"6"Subsistencefarmer"7"Inactive";/*Toassignthelabelvalues"lvgse"tothevariablegse*/labelvalgselvgse;labelvarsex"Sexofhouseholdhead";

Page 87: Distributive Analysis Stata Package V2.3

87

labeldeflvsex1Male2Female;labelvalsexlvsex;labelvarzone"Residentialarea";labeldeflvzone1Rural2Urban;labelvalzonelvzone;====================================End======================================

22.3 AppendixC:settingthesamplingdesignTosetthesamplingdesignforthedatafilebkf94.dta,openthedialogboxforthecommandsvysetbytypingthesyntaxdbsvysetinthecommandwindow.IntheMainpanel,setSTRATAandSAMPLINGUNITSasfollows:Figure17:Surveydatasettings

IntheWeightspanel,setSAMPLINGWEIGHTVARIABLEasfollows:

Page 88: Distributive Analysis Stata Package V2.3

88

Figure18:Settingsamplingweights

ClickonOKandsavethedatafile.Tocheckifthesamplingdesignhasbeenwellset,typethecommandsvydes.Thefollowingwillbedisplayed:

Page 89: Distributive Analysis Stata Package V2.3

89

23 Examplesandexercises

23.1 EstimationofFGTpovertyindices“HowpoorwasBurkinaFasoin1994?”

1. Open the bkf94.dta file and label variables and values using the information of Section 22.1.1.Typethedescribecommandandthenlabellisttolistlabels.

2. UsetheinformationofSection 22.1.1.tosetthesamplingdesignandthensavethefile.3. Estimatetheheadcountindexusingvariablesofinterestexpccandexpeq.

a. You should set SIZE to household size in order to estimate poverty over thepopulationofindividuals.

b. Usetheso‐called1994officialpovertylineof41099FrancsCFAperyear.4. Estimate theheadcount indexusing the sameprocedureas aboveexcept that thepoverty

lineisnowsetto60%ofthemedian.5. Usingtheofficialpovertyline,howdoestheheadcountindexformale‐andfemale‐headed

householdscompare?6. Can you draw a 99% confidence interval around the previous comparison? Also, set the

numberofdecimalsto4.AnswerQ.1Ifbkf94.dtaissavedinthedirectoryc:/data,typethefollowingcommandtoopenit:use"C:\data\bkf94.dta",clearIflab_bkf94.doissavedinthedirectoryc:/do_files,typethefollowingcommandtolabelvariablesandlabels:do"C:\do_files\lab_bkf94.do"Typingthecommanddescribe,weobtain:obs: 8,625 vars: 9 31Oct200613:48size: 285,087(99.6%of memoryfree) storagedisplay valuevariable name type format label variablelabel weight float %9.0g Samplingweightsize byte %8.0g Householdsizestrata byte %8.0g Stratuminwhichahouseholdlivespsu byte %8.0g Primarysamplingunitgse byte %29.0g gse Socio‐economicgroupofthehouseholdheadsex byte %8.0g sex Sexofhouseholdheadzone byte %8.0g zone Residentialareaexp double %10.0g Totalhouseholdexpendituresexpeq double %10.0g Totalhouseholdexpendituresperadultequivalentexppc float %9.0g Totalhouseholdexpenditurespercapita Typinglabellist,wefind:zone: 1 Rural 2 Urban

Page 90: Distributive Analysis Stata Package V2.3

90

sex: 1 Male 2 Femalegse: 1 wage‐earner(publicsector) 2 wage‐earner(privatesector) 3 Artisanortrader 4 Othertypeofearner 5 Cropfarmer 6 Foodfarmer 7 InactiveQ.2Youcansetthesamplingdesignwithadialogbox,asindicatedinSection 22.3,orsimplybytypingsvysetpsu[pweight=weight],strata(strata)vce(linearized)Typingsvydes,weobtain

Q.3TypebdifgttoopenthedialogboxfortheFGTpovertyindexandchoosevariablesandparametersasindicatedinthefollowingwindow.ClickonSUBMIT.

Page 91: Distributive Analysis Stata Package V2.3

91

Figure19:EstimatingFGTindices

Thefollowingresultsshouldthenbedisplayed:

Q.4SelectRELATIVEforthepovertylineandsettheotherparametersasabove.

Page 92: Distributive Analysis Stata Package V2.3

92

Figure20:EstimatingFGTindiceswithrelativepovertylines

AfterclickingonSUBMIT,thefollowingresultsshouldbedisplayed:

Q.5Setthegroupvariabletosex.

Page 93: Distributive Analysis Stata Package V2.3

93

Figure21:FGTindicesdifferentiatedbygender

ClickingonSUBMIT,thefollowingshouldappear:

Q.6UsingthepanelCONFIDENCEINTERVAL,settheconfidencelevelto99%andsetthenumberofdecimalsto4intheRESULTSpanel.

Page 94: Distributive Analysis Stata Package V2.3

94

Page 95: Distributive Analysis Stata Package V2.3

95

23.2 EstimatingdifferencesbetweenFGTindices.“HaspovertyBurkinaFasodecreasedbetween1994and1998?”

1. OpenthedialogboxforthedifferencebetweenFGTindices.2. Estimatethedifferencebetweenheadcountindiceswhen

a. Distribution1isyear1998anddistribution2isyear1994;b. Thevariableofinterestisexppcfor1994andexppczfor1998.c. You should set size to household size in order to estimate poverty over the

populationofindividuals.d. Use41099FrancsCFAperyearasthepovertylineforbothdistributions.

3. Estimatethedifferencebetweenheadcountindiceswhena. Distribution1isruralresidentsinyear1998anddistribution2isruralresidentsin

year1994;b. Thevariableofinterestisexppcfor1994andexppczfor1998.c. You should set size to household size in order to estimate poverty over the

populationofindividuals.d. Use41099FrancsCFAperyearasthepovertylineforbothdistributions.

4. Redothelastexerciseforurbanresidents.5. Redothelastexerciseonlyformembersofmale‐headedhouseholds.6. Testiftheestimateddifferenceinthelastexerciseissignificantlydifferentfromzero.Thus,

test:

0 1: ( 41099, 0) 0 : ( 41099, 0) 0H P z against H P z

Set the significance level to 5% and assume that the test statistics follows a normaldistribution.

AnswersQ.1OpenthedialogboxbytypingdbdifgtQ.2 Fordistribution1,choosetheoptionDATAINFILEinsteadofDATAINMEMORYandclickon

BROWSEtospecifythelocationofthefilebkf98I.dta. Followthesameprocedurefordistribution2tospecifythelocationofbkf94I.dta. Choosevariablesandparametersasfollows:

Page 96: Distributive Analysis Stata Package V2.3

96

Figure22:EstimatingdifferencesbetweenFGTindices

AfterclickingonSUBMIT,thefollowingshouldbedisplayed:

Page 97: Distributive Analysis Stata Package V2.3

97

Q.3 Restricttheestimationtoruralresidentsasfollows:

o SelecttheoptionCondition(s)o WriteZONEinthefieldnexttoCONDITION(1)andtype1inthenextfield.

Figure23:EstimatingdifferencesinFGTindices

AfterclickingonSUBMIT,weshouldsee:

Q.4

Page 98: Distributive Analysis Stata Package V2.3

98

Onecanseethatthechangeinpovertywassignificantonlyforurbanresidents.Q.5Restricttheestimationtomale‐headedurbanresidentsasfollows:

o SetthenumberofCondition(s)to2;o SetsexinthefieldnexttoCondition(2)andtype1inthenextfield.

Figure24:FGTdifferencesacrossyearsbygenderandzone

AfterclickingonSUBMIT,thefollowingshouldbedisplayed:

Q.6

Page 99: Distributive Analysis Stata Package V2.3

99

Wehavethat:LowerBound:=0.0222UpperBound:=0.1105Thenullhypothesisisrejectedsincethelowerboundofthe95%confidenceintervalisabovezero.

23.3 Estimatingmultidimensionalpovertyindices“Howmuchisbi‐dimensionalpoverty(totalexpendituresandliteracy)inPeruin1994?”Usingtheperu94I.dtafile,

1. EstimatetheChakravartyetal(1998)indexwithparameteralpha=1and

Var.ofinterest Pov.line a_jDimension1 exppc 400 1Dimension2 pliterate 0.90 1

2. EstimatetheBourguignonandChakravarty(2003)indexwithparameters alpha=beta=gamma=1and

Var.ofinterest Pov.lineDimension1 exppc 400Dimension2 literate 0.90

Q.1Steps: Type

use"C:\data\ peru94_A_I.dta",clear Toopentherelevantdialogbox,type

dbimdp_bci

Choosevariablesandparametersasin

Page 100: Distributive Analysis Stata Package V2.3

100

Figure25:Estimatingmultidimensionalpovertyindices(A)

AfterclickingSUBMIT,thefollowingresultsappear.

Q.2 Toopentherelevantdialogbox,type

dbimdp_cmrSteps: Choosevariablesandparametersasin

Page 101: Distributive Analysis Stata Package V2.3

101

Figure26:Estimatingmultidimensionalpovertyindices(B)

AfterclickingSUBMIT,thefollowingresultsappear.

Page 102: Distributive Analysis Stata Package V2.3

102

23.4 EstimatingFGTcurves.“Howsensitivetothechoiceofapovertylineistherural‐urbandifferenceinpoverty?”

1. Openbkf94I.dta2. OpentheFGTcurvesdialogbox.3. DrawFGTcurvesforvariablesofinterestexppcandexpeqwith

a. parameter 0 ;b. povertylinebetween0and100,000FrancCFA;c. sizevariablesettosize;d. subtitleofthefiguresetto“Burkina1994”.

4. DrawFGTcurvesforurbanandruralresidentswitha. variableofinterestsettoexpcap;b. parameter 0 ;c. povertylinebetween0and100,000FrancCFA;d. sizevariablesettosize.

5. Drawthedifferencebetweenthesetwocurvesanda. save the graph in *.gph format to be plotted in Stata and in *.wmf format to be

insertedinaWorddocument.b. Listthecoordinatesofthegraph.

6. Redothelastgraphwith 1 .AnswersQ.1Openthefilewithuse"C:\data\bkf94I.dta",clearQ.2OpenthedialogboxbytypingdbdifgtQ.3Choosevariablesandparametersasfollows:

Page 103: Distributive Analysis Stata Package V2.3

103

Figure27:DrawingFGTcurves

Tochangethesubtitle,selecttheTitlepanelandwritethesubtitle.Figure28:EditingFGTcurves

AfterclickingSUBMIT,thefollowinggraphappears:

Page 104: Distributive Analysis Stata Package V2.3

104

Figure29:GraphofFGTcurves

Page 105: Distributive Analysis Stata Package V2.3

105

Q.4Choosevariablesandparametersasinthefollowingwindow:Figure30:FGTcurvesbyzone

AfterclickingSUBMIT,thefollowinggraphappears:

Page 106: Distributive Analysis Stata Package V2.3

106

Figure31:GraphofFGTcurvesbyzone

Page 107: Distributive Analysis Stata Package V2.3

107

Q.5 ChoosetheoptionDIFFERENCEandselect:WITHTHEFIRSTCURVE; Indicatethatthegroupvariableiszone; SelecttheResultspanelandchoosetheoptionLISTintheCOORDINATESquadrant. IntheGRAPHquadrant,selectthedirectoryinwhichtosavethegraphingphformatandto

exportthegraphinwmfformat.

Figure32:DifferencesofFGTcurves

Page 108: Distributive Analysis Stata Package V2.3

108

Figure33:Listingcoordinates

Page 109: Distributive Analysis Stata Package V2.3

109

AfterclickingSUBMIT,thefollowingappears:

Figure34:DifferencesbetweenFGTcurves

Q.6

Page 110: Distributive Analysis Stata Package V2.3

110

Figure35:DifferencesbetweenFGTcurves

23.5 EstimatingFGTcurvesanddifferencesbetweenFGTcurveswithconfidenceintervals

“Isthepovertyincreasebetween1994and1998inBurkinaFasostatisticallysignificant?”1) Using the filebkf94I.dta, draw the FGT curve and its confidence interval for the variable of

interestexppcwith:a) parameter 0 ;b) povertylinebetween0and100,000FrancCFA;c) sizevariablesettosize.

2) Using simultaneously the files bkf94I.dta and bkf98I.dta, draw the difference between FGTcurvesandassociatedconfidenceintervalswith:a) Thevariableofinterestexppcfor1994andexppczfor1998.b) parameter 0 ;c) povertylinebetween0and100,000FrancCFA;d) sizevariablesettosize.

3) Redo2)withparameter 1 .AnswersQ.1

Page 111: Distributive Analysis Stata Package V2.3

111

Steps: Type

use"C:\data\bkf94I.dta",clear Toopentherelevantdialogbox,type

dbcfgts

Choosevariablesandparametersasin

Figure36:DrawingFGTcurveswithconfidenceinterval

AfterclickingSUBMIT,thefollowingappears:

Page 112: Distributive Analysis Stata Package V2.3

112

Figure37:FGTcurveswithconfidenceinterval

0.2

.4.6

.8

0 20000 40000 60000 80000 100000Poverty line (z)

Confidence interval (95 %) Estimate

Burkina FasoFGT curve (alpha = 0)

Q.2Steps: Toopentherelevantdialogbox,type

dbcfgtsd2 Choosevariablesandparametersasin

Page 113: Distributive Analysis Stata Package V2.3

113

Figure38:DrawingthedifferencebetweenFGTcurveswithconfidenceinterval

Figure39:DifferencebetweenFGTcurveswithconfidenceinterval ( 0)

-.1

-.0

50

.05

0 20000 40000 60000 80000 100000Poverty line (z)

Confidence interval (95 %) Estimated difference

(alpha = 0)Difference between FGT curves

Page 114: Distributive Analysis Stata Package V2.3

114

Figure40:DifferencebetweenFGTcurveswithconfidenceinterval ( 1)

-.0

4-.

02

0.0

2

0 20000 40000 60000 80000 100000Poverty line (z)

Confidence interval (95 %) Estimated difference

(alpha = 1)Difference between FGT curves

23.6 Testingpovertydominanceandestimatingcriticalvalues.“HasthepovertyincreaseinBurkinaFasobetween1994and1998beenstatisticallysignificant?”1) Usingsimultaneouslyfilesbkf94I.dtaandbkf98I.dta,checkforsecond‐orderpovertydominance

andestimatethevaluesofthepovertylineatwhichthetwoFGTcurvescross.a) Thevariableofinterestisexppcfor1994andexppczfor1998;b) Thepovertylineshouldvarybetween0and100,000FrancCFA;c) Thesizevariableshouldbesettosize.

AnswersQ.1Steps: Toopentherelevantdialogbox,type

dbdompov Choosevariablesandparametersasin

Page 115: Distributive Analysis Stata Package V2.3

115

Figure41:Testingforpovertydominance

AfterclickingSUBMIT,thefollowingresultsappear:

23.7 DecomposingFGTindices.“WhatisthecontributionofdifferenttypesofearnerstototalpovertyinBurkinaFaso?”

1. Openbkf94I.dtaanddecomposetheaveragepovertygapa. withvariableofinterestexppc;b. withsizevariablesettosize;c. attheofficialpovertylineof41099FrancsCFA;d. andusingthegroupvariablegse(Socio‐economicgroups).

2. Dotheaboveexercisewithoutstandarderrorsandwiththenumberofdecimalssetto4.

Page 116: Distributive Analysis Stata Package V2.3

116

AnswersQ.1Steps: Type

use"C:\data\bkf94I.dta",clear Toopentherelevantdialogbox,type

dbdfgtg

Choosevariablesandparametersasin

Figure42:DecomposingFGTindicesbygroups

AfterclickingSUBMIT,thefollowinginformationisprovided:

Page 117: Distributive Analysis Stata Package V2.3

117

Q.2UsingtheRESULTSpanel,changethenumberofdecimalsandunselecttheoptionDISPLAYSTANDARDERRORS.

AfterclickingSUBMIT,thefollowinginformationisobtained:

Page 118: Distributive Analysis Stata Package V2.3

118

23.8 EstimatingLorenzandconcentrationcurves.“HowmuchdotaxesandtransfersaffectinequalityinCanada?”Byusingthecan6.dtafile,

1. Draw the Lorenz curves for gross income X and net income N. How can you see theredistributionofincome?

2. Draw Lorenz curves for gross income X and concentration curves for each of the threetransfersB1,B2 andB3 and the taxT. Whatcanyousayabout theprogressivityof theseelementsofthetaxandtransfersystem?

“WhatistheextentofinequalityamongBurkinaFasoruralandurbanhouseholdsin1994?”Byusingthebkf94I.dtafile,

3. DrawLorenzcurvesforruralandurbanhouseholds

a. withvariableofinterestexppc;b. withsizevariablesettosize;c. andusingthegroupvariablezone(asresidentialarea).

Q.1Steps: Type

use"C:\data\can6.dta",clear Toopentherelevantdialogbox,type

dbclorenz

Choosevariablesandparametersasin

Page 119: Distributive Analysis Stata Package V2.3

119

Figure43:Lorenzandconcentrationcurves

AfterclickingSUBMIT,thefollowingappears:

Page 120: Distributive Analysis Stata Package V2.3

120

Figure44:Lorenzcurves

Q.2Steps:

Choosevariablesandparametersasin

Page 121: Distributive Analysis Stata Package V2.3

121

Figure45:Drawingconcentrationcurves

AfterclickingonSUBMIT,thefollowingappears:

Page 122: Distributive Analysis Stata Package V2.3

122

Figure46:Lorenzandconcentrationcurves

Q.3Steps: Type

use"C:\data\bkf94I.dta",clear

Choosevariablesandparametersasin

Page 123: Distributive Analysis Stata Package V2.3

123

Figure47:DrawingLorenzcurves

Figure48:Lorenzcurves

Page 124: Distributive Analysis Stata Package V2.3

124

23.9 EstimatingGiniandconcentrationcurves“ByhowmuchdotaxesandtransfersaffectinequalityinCanada?”Usingthecan6.dtafile,

1. EstimatetheGiniindicesforgrossincomeXandnetincomeN.2. Estimate the concentration indices for variables T andN when the ranking variable is

grossincomeX.“ByhowmuchhasinequalitychangedinBurkinaFasobetween1994and1998?”Usingthebkf94I.dtafile,

3. EstimatethedifferenceinBurkinaFaso’sGiniindexbetween1998and1994

a. withvariableofinterestexpeqzfor1998andexpeqfor1994;b. withsizevariablesettosize.

Q.1Steps: Type

use"C:\data\can6.dta",clear Toopentherelevantdialogbox,type

dbigini

Choosevariablesandparametersasin

Page 125: Distributive Analysis Stata Package V2.3

125

Figure49:EstimatingGiniandconcentrationindices

AfterclickingSUBMIT,thefollowingresultsareobtained:

Q.2Steps:

Choosevariablesandparametersasin

Page 126: Distributive Analysis Stata Package V2.3

126

Figure50:Estimatingconcentrationindices

AfterclickingSUBMIT,thefollowingresultsareobtained:

Q.3Steps: Toopentherelevantdialogbox,type

dbdigini

Choosevariablesandparametersasin

Page 127: Distributive Analysis Stata Package V2.3

127

Figure51:EstimatingdifferencesinGiniandconcentrationindices

AfterclickingSUBMIT,thefollowinginformationisobtained:

Page 128: Distributive Analysis Stata Package V2.3

128

23.10 Usingbasicdistributivetools“WhatdoesthedistributionofgrossandnetincomeslooklikeinCanada?”Usingthecan6.dtafile,

1. DrawthedensityforgrossincomeXandnetincomeN.‐ Therangeforthexaxisshouldbe[0,60000].

2. DrawthequantilecurvesforgrossincomeXandnetincomeN.‐ Therangeofpercentilesshouldbe[0,0.8]

3. Drawtheexpectedtax/benefitaccordingtogrossincomeX.‐ Therangeforthexaxisshouldbe[0,60000]‐ Usealocallinearestimationapproach.

4. EstimatemarginalratesfortaxesandbenefitsaccordingtogrossincomeX.‐ Therangeforthexaxisshouldbe[0,60000]‐ Usealocallinearestimationapproach.

Q.1Steps: Type

use"C:\data\can6.dta",clear Toopentherelevantdialogbox,type

dbcdensity Choosevariablesandparametersasin

Figure52:Drawingdensities

Page 129: Distributive Analysis Stata Package V2.3

129

AfterclickingSUBMIT,thefollowingappears:

Figure53:Densitycurves

0.0

0001

.000

02.0

0003

.000

04.0

0005

f(y)

0 12000 24000 36000 48000 60000y

X N

Density Curves

Q.2Steps: Toopentherelevantdialogbox,type

dbc_quantile Choosevariablesandparametersasin

Page 130: Distributive Analysis Stata Package V2.3

130

Figure54:Drawingquantilecurves

AfterclickingSUBMIT,thefollowingappears:Figure55:Quantilecurves

01

0000

200

003

0000

Q(p

)

0 .2 .4 .6 .8Percentiles (p)

X N

Quantile Curves

Page 131: Distributive Analysis Stata Package V2.3

131

Q.3Steps: Toopentherelevantdialogbox,type

dbcnpe Choosevariablesandparametersasin

Figure56:Drawingnon‐parametricregressioncurves

AfterclickingSUBMIT,thefollowingappears:

Page 132: Distributive Analysis Stata Package V2.3

132

Figure57:Non‐parametricregressioncurves

05

000

100

001

5000

200

00E

(Y|X

)

0 12000 24000 36000 48000 60000X values

t b

(Linear Locally Estimation Approach | Bandwidth = 3699.26 )

Non parametric regression

Q.4Steps: Choosevariablesandparametersasin

Page 133: Distributive Analysis Stata Package V2.3

133

Figure58:Drawingderivativesofnon‐parametricregressioncurves

AfterclickingSUBMIT,thefollowingappears:

Figure59:Derivativesofnon‐parametricregressioncurves

-1-.

50

.51

dE

[Y|X

]/dX

0 12000 24000 36000 48000 60000X values

t b

(Linear Locally Estimation Approach | Bandwidth = 3699.26 )

Non parametric derivative regression

Page 134: Distributive Analysis Stata Package V2.3

134

23.11 Plottingthejointdensityandjointdistributionfunction“WhatdoesthejointdistributionofgrossandnetincomeslooklikeinCanada?”Usingthecan6.dtafile,

4. EstimatethejointdensityfunctionforgrossincomeXandnetincomeN.o Xrange:[0,60000]o Nrange:[0,60000]

5. EstimatethejointdistributionfunctionforgrossincomeXandnetincomeN.o Xrange:[0,60000]o Nrange:[0,60000]

Q.1Steps: Type

use"C:\data\can6.dta",clear Toopentherelevantdialogbox,type

dbsjdensity

Choosevariablesandparametersasin

Figure60:Plottingjointdensityfunction

AfterclickingSUBMIT,thefollowinggraphisplottedinteractivelywithGnuPlot4.2:

Page 135: Distributive Analysis Stata Package V2.3

135

0

10000

20000

30000

40000

50000

60000

0 10000

20000 30000

40000 50000

60000

0 5e-010 1e-009

1.5e-009 2e-009

2.5e-009 3e-009

Joint Density Function

f(x,y)

Dimension 1

Dimension 2

Q.2Steps: Toopentherelevantdialogbox,type

dbsjdistrub

Choosevariablesandparametersasin

Page 136: Distributive Analysis Stata Package V2.3

136

Figure61:Plottingjointdistributionfunction

AfterclickingSUBMIT,thefollowinggraphisplottedinteractivelywithGnuPlot4.2:

0 10000

20000 30000

40000 50000

60000

0 10000

20000 30000

40000 50000

60000 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

1

Joint Distribution Function

F(x,y)

Dimension 1Dimension 2

Page 137: Distributive Analysis Stata Package V2.3

137

23.12 Testingthebi‐dimensionalpovertydominanceUsing the columbia95I.dta (distribution_1) and the dominican_republic95I.dta (distribution_2)files,

1. Draw the difference between the bi‐dimensional multiplicative FGT surfaces and theconfidenceintervalofthatdifferencewhen

Var.ofinterest Range alpha_jDimension1 haz:height‐for‐age ‐3.0/6.0 0Dimension2 sprob:survival

probability0.7/1.0 0

2. Testforbi‐dimensionalpovertyusingtheinformationabove.

Answer:Q.1Steps: Toopentherelevantdialogbox,type

dbdombdpov

Choosevariablesandparametersasin

Page 138: Distributive Analysis Stata Package V2.3

138

Figure62:Testingforbi‐dimensionalpovertydominance

AfterclickingSUBMIT,thefollowinggraphisplottedinteractivelywithGnuPlot4.2:

-3-2

-1 0

1 2

3 4

5 6

0.78 0.8 0.82 0.84 0.86 0.88 0.9 0.92 0.94 0.96 0.98 1

-0.4-0.3-0.2-0.1

0 0.1 0.2 0.3 0.4 0.5

Bi-dimensional poverty dominance

DifferenceLower-boundedUpper-bounded

Dimension 1

Dimension 2

Q.2

Page 139: Distributive Analysis Stata Package V2.3

139

To make a simple test of multidimensional dominance, one should check if the lower‐boundedconfidenceintervalsurfaceisalwaysabovezeroforallcombinationsofrelevantpovertylines–orconversely.

o Forthis,clickonthepanel“Confidenceinterval”andselecttheoptionlower‐bounded.o ClickagainonthebuttonSubmit.

AfterclickingSUBMIT,thefollowinggraphisplottedinteractivelywithGnuPlot4.2:

-3-2

-1 0

1 2

3 4

5 6 0.78 0.8 0.82 0.84 0.86 0.88 0.9 0.92 0.94 0.96 0.98 1

-0.35-0.3

-0.25-0.2

-0.15-0.1

-0.05 0

0.05

Bi-dimensional poverty dominance

Lower-bounded

Dimension 1

Dimension 2

Page 140: Distributive Analysis Stata Package V2.3

140

23.13 Testingforpro‐poornessofgrowthinMexicoThe three sub‐samples used in these exercises are sub‐samples of 2000 observations drawnrandomlyfromthethreeENIGHMexicanhouseholdsurveysfor1992,1998and2004.Eachofthesethreesub‐samplescontainsthefollowingvariables:strata Thestratumpsu Theprimarysamplingunitweight Samplingweightinc Incomehhsz Householdsize

1. Usingthefilesmex_92_2mI.dtaandmex_98_2mI.dta,testforfirst‐orderrelativepro‐poornessofgrowthwhen:

Theprimalapproachisused. Therangeofpovertylinesis[0,3000].

2. Repeatwiththedualapproach.3. Byusingthefilesmex_98_2mI.dtaandmex_04_2mI.dta,testforabsolutesecond‐orderpro‐

poornesswiththedualapproach.

4. Usingmex_98_2mI.dtaandmex_04_2mI.dta,estimatethepro‐poorindicesofmodule

ipropoor. Parameteralphasetto1. Povertylineequalto600.

Answer:Q.1Steps: Toopentherelevantdialogbox,type

dbcpropoorp

Page 141: Distributive Analysis Stata Package V2.3

141

Choosevariablesandparametersasin(selecttheupper‐boundedoptionfortheconfidenceinterval):

Figure63:Testingthepro‐poorgrowth(primalapproach)

AfterclickingSUBMIT,thefollowinggraphappears

Page 142: Distributive Analysis Stata Package V2.3

142

-.15

-.1

-.05

0.0

5

0 600 1200 1800 2400 3000Poverty line (z)

Difference Upper bound of 95% confidence intervalNull horizontal line

(Order : s=1 | Dif. = P_2( (m2/m1)z, a=s-1) - P_1(z,a=s-1))

Relative propoor curve

Q.2Steps: Toopentherelevantdialogbox,type

dbcpropoord Choosevariablesandparametersas in(with the lower‐boundedoption for theconfidence

interval):Figure64:Testingthepro‐poorgrowth(dualapproach)‐A

Page 143: Distributive Analysis Stata Package V2.3

143

AfterclickingSUBMIT,thefollowinggraphappears

-.4

-.2

0.2

.4

0 .184 .368 .552 .736 .92Percentiles (p)

Difference Lower bound of 95% confidence intervalNull horizontal line

(Order : s=1 | Dif. = Q_2(p) /Q_1(p) - mu_2/mu_1 )

Absolute propoor curves

Q.2Steps:

Page 144: Distributive Analysis Stata Package V2.3

144

Toopentherelevantdialogbox,typedbcpropoord Choosevariablesandparametersas in(with the lower‐boundedoption for theconfidence

interval):Figure65:Testingthepro‐poorgrowth(dualapproach)–B

AfterclickingSUBMIT,thefollowinggraphappears

Page 145: Distributive Analysis Stata Package V2.3

145

02

46

0 .184 .368 .552 .736 .92Percentiles (p)

Difference Lower bound of 95% confidence intervalNull horizontal line

(Order : s=2 | Dif. = (GL_2(p) - GL_1(p) ) / GL_2(p) )

Absolute propoor curves

Q.4Steps: Toopentherelevantdialogbox,type

dbipropoor Choosevariablesandparametersas.

Page 146: Distributive Analysis Stata Package V2.3

146

AfterclickingSUBMIT,thefollowingresultsappear:

23.14 BenefitincidenceanalysisofpublicspendingoneducationinPeru(1994).

1. Usingtheperedu94I.dtafile,estimateparticipationandcoverageratesoftwotypesofpublicspendingoneducationwhen:

‐ Thestandardoflivingisexppc‐ Thenumberofhouseholdmembers thatbenefit fromeducation is fr_prim for the

primarysectorandfr_secforthesecondaryone.‐ Thenumberof eligiblehouseholdmembers isel_prim for theprimary sector and

el_secforthesecondaryone.‐ Socialgroupsarequintiles.

Page 147: Distributive Analysis Stata Package V2.3

147

Answer:Typedbbianinthewindowscommandandsetvariablesandoptionsasfollows:Figure66:Benefitincidenceanalysis

AfterclickingonSubmit,thefollowingappears:

Page 148: Distributive Analysis Stata Package V2.3

148

2. To estimate total public expenditures on education by sector at the national level, the

followingmacroinformationwasused:‐ Pre‐primary and primary public education expenditure (as% of all levels), 1995:

35.2%‐ Secondarypubliceducationexpenditure(as%ofalllevels),1995:21.2%‐ Tertiarypubliceducationexpenditure(as%ofalllevels),1995:16%‐ Publiceducationexpenditure(as%ofGNP),1995=3%‐ GDPpercapita:about3800.

Usingthisinformation,thefollowingvariablesaregenerated

capdrop_var1;gen_var1=size*weight*3800;quisum_var1;quigenpri_pub_exp=0.03*0.352*`r(sum)';quigensec_pub_exp=0.03*0.212*`r(sum)';quigenuni_pub_exp=0.03*0.160*`r(sum)';capdrop_var1;

‐ Totalpublicexpendituresonprimarysector:pri_pub_exp‐ Totalpublicexpendituresonsecondarysector:sec_sec_exp‐ Totalpublicexpendituresonuniversitysector:uni_pub_exp

Estimatetheaveragebenefitsperquintileandgeneratethebenefitvariables.

Answer:Setvariablesandoptionsasfollows:

Page 149: Distributive Analysis Stata Package V2.3

149

Figure67:BenefitIncidenceAnalysis(unitcostapproach)

AfterclickingonSubmit,thefollowingappears:

Page 150: Distributive Analysis Stata Package V2.3

150