Intro to Causalitytoni/Courses/Fairness/Lectures/... · Quick Note on Fairness and Causality •...

Intro to Causality David Madras October 22, 2019

Simpson’s Paradox

The Monty Hall Problem

1. Three doors: 2 have goats behind them, 1 has a car (you want to win the car)

2. You choose a door, but don’t open it

3. The host, Monty, opens another door (not the one you chose), and shows you that there is a goat behind that door

4. You now have the option to switch your door from the one you chose to the other unopened door

5. What should you do? Should you switch?

What’s Going On?

Causation != Correlation

• In machine learning, we try to learn correlations from data
  • “When can we predict X from Y?”

• In causal inference, we try to model causation
  • “When does X cause Y?”

• These are not the same!
  • Ice cream consumption correlates with murder rates
  • Ice cream does not cause murder (usually)

Correlations Can Be Misleading

https://www.tylervigen.com/spurious-correlations

Causal Modelling

• Two options:
  1. Run a randomized experiment
  2. Make assumptions about how our data is generated

Causal DAGs

• Pioneered by Judea Pearl
• Describes the (stochastic) generative process of the data

Causal DAGs

• T is a medical treatment
• Y is a disease
• X are other features about patients (say, age)

• We want to know the causal effect of our treatment on the disease.

Causal DAGs

• Experimental data: randomized experiment
  • We decide which people should take T

• Observational data: no experiment
  • People chose whether or not to take T

• Experiments are expensive and rare
• Observations can be biased
  • E.g. What if mostly young people choose T?

Asking Causal Questions

• Suppose T is binary (1: received treatment, 0: did not)
• Suppose Y is binary (1: disease cured, 0: disease not cured)
• We want to know “If we give someone the treatment (T = 1), what is the probability they are cured (Y = 1)?”

• This is not equal to P(Y = 1 | T = 1)
  • Suppose mostly young people take the treatment, and most were cured, i.e. P(Y = 1 | T = 1) is high
  • Is this because the treatment is good? Or because they are young?

Correlation vs. Causation

• Correlation: P(Y = 1 | T = 1)
  • In the observed data, how often do people who take the treatment become cured?
  • The observed data may be biased!!

Correlation vs. Causation

• Let’s simulate a randomized experiment
  • i.e. cut the arrow from X to T
  • This is called a do-operation

• Then, we can estimate causation: P(Y = 1 | do(T = 1))

Correlation vs. Causation

• Correlation: P(Y = 1 | T = 1)

• Causation: P(Y = 1 | do(T = 1)), where treatment is independent of X
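For reference, the two quantities can be written side by side. This is the standard back-door adjustment, assuming the DAG used in these slides (X affects both T and Y, T affects Y, and X is fully observed). The slide's own equations are not in this transcript, so the notation below is a reconstruction and not a quotation.

```latex
\begin{align*}
\text{Correlation:}\quad
  P(Y=1 \mid T=1) &= \sum_x P(Y=1 \mid T=1, X=x)\, P(X=x \mid T=1) \\
\text{Causation:}\quad
  P(Y=1 \mid do(T=1)) &= \sum_x P(Y=1 \mid T=1, X=x)\, P(X=x)
\end{align*}
```

The two expressions differ only in how X is weighted. Observational data weights each stratum by P(X = x | T = 1), the mix of people who chose the treatment. The do-operation weights by P(X = x), the mix in the whole population.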

Inverse Propensity Weighting

• Can calculate this using inverse propensity scores
  • Rather than adjusting for X, sufficient to adjust for P(T | X)

Inverse Propensity Weighting

• Can calculate this using inverse propensity scores
  • These are called stabilized weights
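A minimal sketch of inverse propensity weighting on simulated data may make the idea concrete. Everything in it is an assumption for illustration: the toy model (a binary X for young/old, the probabilities 0.8, 0.2, 0.7, 0.3 and the treatment effect of 0.1) is invented, and the weight P(T = 1) / P(T = 1 | X) is the usual textbook form of a stabilized weight, not necessarily the exact expression shown on the slide.

```python
import random


def simulate(n, rng):
    """Draw (x, t, y) triples from a toy confounded model (all numbers are made up)."""
    rows = []
    for _ in range(n):
        x = 1 if rng.random() < 0.5 else 0                  # 1 = young, 0 = old
        t = 1 if rng.random() < (0.8 if x else 0.2) else 0  # young people take T more often
        p_cure = (0.7 if x else 0.3) + 0.1 * t              # treatment adds 0.1 for everyone
        y = 1 if rng.random() < p_cure else 0
        rows.append((x, t, y))
    return rows


def naive_estimate(rows):
    """P(Y=1 | T=1): cure rate among people who happened to take the treatment."""
    treated = [y for _, t, y in rows if t == 1]
    return sum(treated) / len(treated)


def ipw_estimate(rows):
    """Estimate P(Y=1 | do(T=1)) by reweighting treated units by 1 / P(T=1 | X)."""
    # Estimate the propensity P(T=1 | X=x) separately in each stratum of X.
    propensity = {}
    for value in (0, 1):
        stratum = [t for x, t, _ in rows if x == value]
        propensity[value] = sum(stratum) / len(stratum)
    p_treated = sum(t for _, t, _ in rows) / len(rows)

    weighted_sum = 0.0
    weight_total = 0.0
    for x, t, y in rows:
        if t == 1:
            # Stabilized weight; the constant p_treated cancels in the ratio below.
            w = p_treated / propensity[x]
            weighted_sum += w * y
            weight_total += w
    return weighted_sum / weight_total


rows = simulate(50_000, random.Random(0))
naive = naive_estimate(rows)
ipw = ipw_estimate(rows)
print(f"naive P(Y=1 | T=1)     = {naive:.3f}")
print(f"IPW   P(Y=1 | do(T=1)) = {ipw:.3f}")
```

In this toy model the true interventional cure rate is 0.5 * 0.8 + 0.5 * 0.4 = 0.6, while the naive rate is inflated because the treated group is mostly young.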

Matching Estimators

• Match up samples with different treatments that are near to each other
• Similar to reweighting
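One simple way to realise this is nearest-neighbour matching on a single covariate. The sketch below is an illustration under invented assumptions: age is the only confounder, younger people are more likely to be treated, and the true treatment effect is 1.0. It estimates the effect on the treated by pairing each treated person with the control closest in age.

```python
import bisect
import random


def simulate(n, rng):
    """Toy data: (age, t, y) with a true treatment effect of 1.0 (hypothetical numbers)."""
    rows = []
    for _ in range(n):
        age = rng.uniform(20, 80)
        p_treat = 0.9 - 0.8 * (age - 20) / 60      # 0.9 at age 20, 0.1 at age 80
        t = 1 if rng.random() < p_treat else 0
        y = 1.0 * t - 0.05 * age + rng.gauss(0, 0.5)
        rows.append((age, t, y))
    return rows


def naive_difference(rows):
    """Difference in mean outcome between treated and control, ignoring age."""
    treated = [y for _, t, y in rows if t == 1]
    control = [y for _, t, y in rows if t == 0]
    return sum(treated) / len(treated) - sum(control) / len(control)


def matched_effect(rows):
    """Average of (treated outcome - outcome of the nearest-age control)."""
    controls = sorted((age, y) for age, t, y in rows if t == 0)
    control_ages = [age for age, _ in controls]
    diffs = []
    for age, t, y in rows:
        if t != 1:
            continue
        # The nearest control is one of the two neighbours of the insertion point.
        i = bisect.bisect_left(control_ages, age)
        candidates = [j for j in (i - 1, i) if 0 <= j < len(controls)]
        j = min(candidates, key=lambda k: abs(control_ages[k] - age))
        diffs.append(y - controls[j][1])
    return sum(diffs) / len(diffs)


rows = simulate(4_000, random.Random(0))
naive = naive_difference(rows)
matched = matched_effect(rows)
print(f"naive difference : {naive:.2f}")
print(f"matched estimate : {matched:.2f}")
```

Matching compares like with like, so the age gap between the two groups no longer leaks into the estimate. Controls are reused here (matching with replacement), which is one of several possible design choices.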

Review: What to do with a causal DAG

The causal effect of T on Y is P(Y | do(T)).

This is great! But we’ve made some assumptions.

Simpson’s Paradox, Explained

(Figure: causal DAG with nodes Size, Trmt (treatment), and Y (outcome).)
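A worked example helps show how the paradox arises and how adjustment resolves it. The slide's own numbers are not in this transcript. The counts below are the kidney stone figures commonly used to illustrate Simpson's paradox, and reading Size as stone size and Trmt as a choice between treatments A and B is an assumption on my part, as is the premise that Size influences both the treatment choice and the outcome.

```python
# (patients cured, patients treated) by stone size and treatment.
# Illustrative counts from the widely cited kidney stone example.
data = {
    ("small", "A"): (81, 87),
    ("large", "A"): (192, 263),
    ("small", "B"): (234, 270),
    ("large", "B"): (55, 80),
}


def stratum_rate(size, trmt):
    """Cure rate within one stone-size group."""
    cured, n = data[(size, trmt)]
    return cured / n


def overall_rate(trmt):
    """Cure rate pooled over sizes: the 'correlation' view, P(Y=1 | Trmt)."""
    cured = sum(c for (_, t), (c, _) in data.items() if t == trmt)
    n = sum(k for (_, t), (_, k) in data.items() if t == trmt)
    return cured / n


def adjusted_rate(trmt):
    """Back-door adjustment: sum over sizes of P(Y=1 | Trmt, Size) * P(Size)."""
    total = sum(n for _, n in data.values())
    rate = 0.0
    for size in ("small", "large"):
        p_size = sum(n for (s, _), (_, n) in data.items() if s == size) / total
        rate += stratum_rate(size, trmt) * p_size
    return rate


for trmt in ("A", "B"):
    print(
        f"treatment {trmt}: small={stratum_rate('small', trmt):.3f} "
        f"large={stratum_rate('large', trmt):.3f} "
        f"pooled={overall_rate(trmt):.3f} adjusted={adjusted_rate(trmt):.3f}"
    )
```

Treatment A does better within each size group yet worse in the pooled data, because A was given mostly to the harder large-stone cases. Weighting each stratum by the population share of that size removes the reversal.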

Monty Hall Problem, Explained

Boring explanation: (content not preserved in this transcript)

Monty Hall Problem, Explained

Causal explanation:
• My door location is correlated with the car location, conditioned on which door Monty opens!

• This is because Monty won’t show me the car

• If he’s guessing also, then correlation disappears

(Figure: causal DAG in which Car Location and My Door both point into Monty’s Door, i.e. the opened door.)

https://twitter.com/EpiEllie/status/1020772459128197121
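The claim can be checked by exact enumeration. The sketch below computes the probability that switching wins, given that the opened door showed a goat, for two hosts: one who knows where the car is and never reveals it, and one who opens one of the other two doors at random. The function name and the uniform-choice assumptions are mine.

```python
from fractions import Fraction
from itertools import product


def p_switch_wins(host_knows: bool) -> Fraction:
    """P(switching wins | the opened door showed a goat), by exact enumeration."""
    doors = range(3)
    win = Fraction(0)
    total = Fraction(0)
    for car, mine in product(doors, doors):
        prior = Fraction(1, 9)                      # car and first pick are uniform and independent
        others = [d for d in doors if d != mine]
        # A knowing host only opens goat doors; a guessing host opens either other door.
        options = [d for d in others if d != car] if host_knows else others
        for opened in options:
            p = prior / len(options)
            if opened == car:
                continue                            # car revealed: excluded by the conditioning
            total += p
            switched_to = next(d for d in doors if d not in (mine, opened))
            if switched_to == car:
                win += p
    return win / total


print("host knows  :", p_switch_wins(True))
print("host guesses:", p_switch_wins(False))
```

With a knowing host, the opened door depends on both the car location and the first pick, so conditioning on it links the two and switching wins two times out of three. With a guessing host the advantage of switching goes away.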

Structural Assumptions

• All of this relies on our assumptions about the DAG that generated our data being correct

• Specifically, we assume that there are no hidden confounders
  • Confounder: a variable which causally affects both the treatment (T) and the outcome (Y)
  • No hidden confounders means that we have observed all confounders

• This is a strong assumption!

Hidden Confounders

• Cannot calculate P(Y | do(T)) here, since U is unobserved

• We say in this case that the causal effect is unidentifiable
  • Even in the case of infinite data and computation, we can never calculate this quantity

(Figure: causal DAG with nodes X, T, Y, and the unobserved confounder U.)

What Can We Do with Hidden Confounders?

• Instrumental variables
  • Find some variable which affects only the treatment

• Sensitivity analysis
  • Essentially, assume some maximum amount of confounding
  • Yields a confidence interval

• Proxies
  • Other observed features give us information about the hidden confounder

Instrumental Variables

• Find an instrument: a variable which only affects treatment
  • Decouples treatment and outcome variation

• With linear functions, solve analytically
  • But can also use any function approximators
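For the linear case, the analytic solution is short enough to sketch. The simulation below assumes a hypothetical linear model in which a hidden confounder U drives both T and Y, while an instrument Z affects only T and is independent of U. The coefficients and the true effect of 2.0 are invented. The estimator is the ratio cov(Z, Y) / cov(Z, T).

```python
import random


def covariance(a, b):
    """Population covariance of two equal-length lists."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b)) / len(a)


rng = random.Random(0)
z, t, y = [], [], []
for _ in range(20_000):
    u = rng.gauss(0, 1)                          # hidden confounder (never used by the estimators)
    zi = rng.gauss(0, 1)                         # instrument: affects T only
    ti = zi + u + rng.gauss(0, 1)
    yi = 2.0 * ti + 3.0 * u + rng.gauss(0, 1)    # true causal effect of T on Y is 2.0
    z.append(zi)
    t.append(ti)
    y.append(yi)

naive_slope = covariance(t, y) / covariance(t, t)    # ordinary regression of Y on T
iv_slope = covariance(z, y) / covariance(z, t)       # instrumental-variable estimate

print(f"naive regression slope: {naive_slope:.2f}")
print(f"IV estimate           : {iv_slope:.2f}")
```

The naive slope absorbs the confounder's influence. The instrument moves T without touching U, so the part of Y that varies with Z reflects only the effect passing through T.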

Sensitivity Analysis

• Determine the relationship between strength of confounding and causal effect
• Example: Does smoking cause lung cancer? (we now know, yes)
  • There may be a gene that causes lung cancer and smoking
  • We can’t know for sure!
  • However, we can figure out how strong this gene would need to be to result in the observed effect
  • Turns out: very strong

(Figure: causal DAG with nodes X, Gene, Smoking, and Cancer.)

Sensitivity Analysis

• The idea is: parametrize your uncertainty, and then decide which values of that parameter are reasonable
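One published parametrization, offered here as an illustration and not necessarily the one the lecture had in mind, works on the risk-ratio scale. As I understand the bounding-factor result of Ding and VanderWeele, a confounder whose association with the exposure is rr_eu and with the outcome is rr_ud can inflate an observed risk ratio by at most rr_eu * rr_ud / (rr_eu + rr_ud - 1). The E-value is the smallest common strength that could fully explain away an observation. The observed risk ratio of 10 below is a hypothetical number, not a figure from the slides.

```python
import math


def bounding_factor(rr_eu, rr_ud):
    """Largest factor by which a confounder of this strength can inflate a risk ratio."""
    return rr_eu * rr_ud / (rr_eu + rr_ud - 1.0)


def e_value(rr_observed):
    """Confounder strength (on both arms) needed to fully explain an observed RR > 1."""
    return rr_observed + math.sqrt(rr_observed * (rr_observed - 1.0))


observed_rr = 10.0  # hypothetical observed risk ratio, for illustration only

for strength in (2.0, 5.0, 10.0, 20.0):
    worst_case = observed_rr / bounding_factor(strength, strength)
    print(f"confounder strength {strength:>4}: causal RR is at least {worst_case:.2f}")

print(f"E-value for RR={observed_rr}: {e_value(observed_rr):.2f}")
```

The parameter here is the assumed strength of the unmeasured confounder. The analyst's job is then to argue which strengths are plausible, which mirrors the smoking example: only a very strong gene could account for a large observed effect.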

Using Proxies

• Instead of measuring the hidden confounder, measure some proxies (V = f_prox(U))
  • Proxies: variables that are caused by the confounder
  • If U is a child’s age, V might be height

• If f_prox is known or linear, we can estimate this effect

(Figure: causal DAG with nodes X, T, U, Y, and V.)

Using Proxies

• If f_prox is non-linear, we might try the Causal Effect VAE
  • Learn a posterior distribution P(U | V) with variational methods
  • However, this method does not provide theoretical guarantees
  • Results may be unverifiable: proceed with caution!

(Figure: causal DAG with nodes X, T, U, Y, and V.)

Causality and Other Areas of ML

• Reinforcement Learning
  • Natural combination: RL is all about taking actions in the world
  • Off-policy learning already has elements of causal inference

• Robust classification
  • Causality can be a natural language for specifying distributional robustness

• Fairness
  • If the dataset is biased, ML outputs might be unfair
  • Causality helps us think about dataset bias, and mitigate unfair effects

Quick Note on Fairness and Causality

• Many fairness problems (e.g. loans, medical diagnosis) are actually causal inference problems!
• We talk about the label Y; however, this is not always observable
  • For instance, we can’t know if someone would repay a loan if we don’t give one to them!
• This means if we just train a classifier on historical data, our estimate will be biased
  • Biased in the fairness sense and the technical sense

• General takeaway: if your data is generated by past decisions, think very hard about the output of your ML model!
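A small simulation illustrates the loan example. It assumes a hypothetical population in which each applicant has a creditworthiness score in [0, 1] equal to their repayment probability, and a past policy that approved only scores above 0.6. Both the score model and the threshold are invented. Repayment is observed only for approved applicants.

```python
import random


def simulate_loans(n, rng, approval_threshold=0.6):
    """Return (outcomes for everyone, outcomes visible in the historical data)."""
    population = []   # would-repay outcome for every applicant (not observable in practice)
    observed = []     # outcome only for applicants the past policy approved
    for _ in range(n):
        score = rng.random()                  # hypothetical creditworthiness in [0, 1]
        would_repay = rng.random() < score    # repayment probability equals the score
        population.append(would_repay)
        if score > approval_threshold:
            observed.append(would_repay)
    return population, observed


population, observed = simulate_loans(50_000, random.Random(0))
true_rate = sum(population) / len(population)
observed_rate = sum(observed) / len(observed)

print(f"repayment rate over all applicants  : {true_rate:.3f}")
print(f"repayment rate in historical records: {observed_rate:.3f}")
```

A model trained only on the historical records sees a repayment rate well above the population's, and it never sees outcomes for the people who were rejected. That is the sense in which labels generated by past decisions are biased.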

Feedback Loops

• Takes us to part 2… feedback loops
• When ML systems are deployed, they make many decisions over time
• So our past predictions can impact our future predictions!
• Not good

Unfair Feedback Loops

• We’ll look at “Fairness Without Demographics in Repeated Loss Minimization” (Hashimoto et al., ICML 2018)
• Domain: recommender systems
• Suppose we have a majority group (A = 1) and a minority group (A = 0)
• Our recommender system may have high overall accuracy but low accuracy on the minority group
• This can happen due to empirical risk minimization (ERM)

• Can also be due to repeated decision-making

Repeated Loss Minimization

• When we give bad recommendations, people leave our system
• Over time, the low-accuracy group will shrink
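The dynamic can be sketched with a deterministic toy model. This is not the model from the paper. It assumes two groups whose ideal model parameters differ (0.0 and 1.0), a single parameter fit by minimizing average squared loss over current users, retention that falls linearly with a group's loss, and equal new arrivals for both groups each round. All of these numbers are hypothetical.

```python
def simulate_feedback(rounds=20, n_major=800.0, n_minor=200.0):
    """Track the minority share of users under repeated ERM with loss-driven churn."""
    mu_major, mu_minor = 0.0, 1.0     # each group's ideal model parameter (made up)
    arrivals = 50.0                   # new users per group per round (made up)
    minority_share = []
    for _ in range(rounds):
        total = n_major + n_minor
        minority_share.append(n_minor / total)
        # ERM for squared loss: the population-weighted mean of the group optima.
        theta = (n_major * mu_major + n_minor * mu_minor) / total
        loss_major = (theta - mu_major) ** 2
        loss_minor = (theta - mu_minor) ** 2
        # Users leave in proportion to the loss they experience.
        n_major = n_major * (1.0 - 0.5 * loss_major) + arrivals
        n_minor = n_minor * (1.0 - 0.5 * loss_minor) + arrivals
    return minority_share


history = simulate_feedback()
print("minority share, first rounds:", [round(s, 3) for s in history[:5]])
print("minority share, last round  :", round(history[-1], 3))
```

Even with equal arrivals, the model sits close to the majority's optimum, the minority experiences higher loss and leaves faster, and the next round's fit moves further toward the majority.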

Distributionally Robust Optimization

• Upweight examples with high loss in order to improve the worst case
• In the long run, this will prevent clusters from being underserved

• This ends up being equal to (equation not preserved in this transcript)
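To illustrate the upweighting idea, the sketch below uses one simple member of the distributionally robust family: the average loss over the worst alpha-fraction of examples (often called CVaR). This is a stand-in chosen for brevity and is not necessarily the objective used in the paper. The two candidate models and their per-example losses are invented.

```python
import math


def mean_loss(losses):
    """Ordinary ERM objective: every example counts equally."""
    return sum(losses) / len(losses)


def worst_fraction_loss(losses, alpha):
    """Average loss over the worst alpha-fraction of examples (all weight on high losses)."""
    k = max(1, math.ceil(alpha * len(losses)))
    worst = sorted(losses, reverse=True)[:k]
    return sum(worst) / k


# Hypothetical per-example losses for two candidate models on 100 users.
model_a = [0.1] * 90 + [2.0] * 10    # great for 90 users, poor for a cluster of 10
model_b = [0.4] * 100                # mediocre but even across all users

for name, losses in (("A", model_a), ("B", model_b)):
    print(
        f"model {name}: mean loss={mean_loss(losses):.2f} "
        f"worst-20% loss={worst_fraction_loss(losses, 0.2):.2f}"
    )
```

Average loss prefers model A, which underserves a small cluster. The worst-case objective prefers model B, because it only looks at the examples with the highest loss.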

Conclusion

• Your data is not what it seems
• ML models only work if your training/test set actually looks like the environment you deploy them in
  • This can make your results unfair
  • Or just incorrect

• So examine your model assumptions and data collection carefully!