Propensity Score Matching Using SAS Enterprise Guide

Propensity Score Matching Using SAS Enterprise Guide.

Ian Morton

Newtyne SAS User Gathering (N-SUG1)

Wednesday 19th May 2010.

What is the problem ? I have a fictitious dataset containing:

a set of background characteristics (about customers who bought a product);

an indicator of the year they bought the product (e.g. year 1 or year 2); and

an outcome (e.g. they bought the product). I want an estimate of the difference between whether

they bought it or not from one year to the next But I want the same or very similar customers; I don’t want complications of the “case mix” biasing the

answer. I will provide real examples later.

How do you do it ?

Featured in Allison, P. D. (1999) Logistic Regression Using the SAS System SAS Institute and Wiley, North Carolina.

Propensity Score Matching - two steps: scoring and then matching.

Terminology – year 1 (controls), year 2 (treatments)

Process flow – part 1

Parameterised code

SAS dataset Excel, import data, list data and create format

980 bought in year 1 and 1,020 bought in year 2

Obs var1 var2 var3 var4 var5 var6 var7 var8 var9 var10 var11 Product result 1 F 3 1 2 1 7 1 1 1 1 1 product

2 Bought

2 M 2 3 2 2 3 1 1 1 5 1 product 2

Not bought

3 M 2 3 2 1 6 1 1 1 1 1 product 2

Bought

4 F 3 1 2 1 7 1 1 1 2 1 product 2

Bought

5 M 2 3 2 1 5 1 1 1 1 1 product 2

Bought

.

.

.

. . .

.

.

. 2000 M 2 3 2 1 5 1 1 1 1 1 product

1 Not bought

Propensity score Use logistic regression to estimate probability

(propensity score) of the treatment In SAS (Analyse, regression, logistic) Provides a probability (propensity score) for all

students; treatment and control. The logistic equation is below, where Ti is the

treatment status, Xi are the observations and h(Xi) is made up of the covariates (age, gender, etc).

i

i

Xh

Xh

iie

eXT

11Pr

Matching Coca-Perraillon, M. (2007) Local and

Global Optimal Propensity Score Matching SAS Global Forum 2007, Orlando, Florida, April16th - 19th 2007.

Need to match treatments to controls by propensity score.

There are different matching methods Once a match has been found for the

treatments in the controls, use information on the latter's outcome for inference.

Process flow – part 2

Results 980 treatments reduced to 303 treatments The controls that match these treatments

are: Obs var1 var2 var3 var4 var5 var6 var7 var8 var9 var10 var11 result

1 F 3 1 1 2 6 1 1 2 2 1 Bought

2 M 3 3 1 1 3 1 1 1 2 1 Bought

3 F 1 1 1 1 7 1 1 1 2 1 Bought

4 F 1 1 1 1 6 1 1 1 2 1 Bought

.

.

.

.

.

.

.

.

.

303 F 1 1 1 1 7 1 1 1 2 1 Not Bought

The 303 treatments and matched controls have similar characteristics and allow further assessment

Uses of the method Justice

Want to know if programmes and interventions are successful in reducing reconvictions of offenders over time;

don’t want complications getting in the way, so need the same offender characteristics between cohorts.

Example: Ministry of Justice (2010) Evaluating the use of judicial mediation in Employment Tribunals Ministry of Justice Research Series 7/10.

Medical In a case control study need to match cases to controls by say gender, age, smoking

status. Example: Foster, E. M. (2003) Propensity Score Matching: An Illustrative Analysis of

Dose Response. Medical Care 41 10 1183-1192. Propensity score matching and then counter-factual inference

The Scottish Funding Council wanted an estimate of the drop-out rate of students who studied outside Scotland, if they had actually studied in Scotland.

An estimate of what would have been the outcome if customers who bought product 2 had actually bought product 1.

Example: Rosenbaum, P. R. and Rubin, D. B. (1983) The central role of the propensity score in observational studies for causal effects Biometrika 70 1 41-56.

Reference

Morton, I.D., Penny, K., Ashraf, M.Z. and Duffy, J.C. (In Press) The Use of Propensity Score Matching in Comparing Student Outcomes Sent to Journal of the Operational Research Society

Propensity Score Matching Using SAS Enterprise Guide

Business

Transcript of Propensity Score Matching Using SAS Enterprise Guide