Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27,...

27
Diabetes Registries for CER, Surveillance, and other Research Purpose Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011

Transcript of Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27,...

Page 1: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Diabetes Registries for CER, Surveillance,

and other Research Purpose Jay Desai

HealthPartners Research Foundation

EDM Forum Methods SymposiumWashington, DC

October 27, 2011

Page 2: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

TeamJay Desai (HealthPartners Research Foundation)Patrick O’Connor (HealthPartners Research

Foundation)Greg Nichols (Kaiser Permanente Northwest)Joe Selby (KPNC PCORI)Pingsheng Wu (Vanderbilt University)Tracy Lieu (Harvard Pilgrim)

Page 3: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Diabetes DatalinkHMO Research Network

Approximately 11 million member population

SUrveillance, PREvention, and ManagEment of Diabetes Mellitus (SUPREME-DM)Twelve participating health systemsAlmost 1.3 million diabetes cases

Funded by AHRQ

Page 4: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Diabetes Registries for CER, Surveillance, and other Research Purposes

The Gold Standard problem

Sensitivity, Specificity, Predictive Positive Value

Confidence and Understanding Understand contribution of EHD sources Variation in EHD data sources

Population Representativeness

Case Retention

Page 5: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

The Gold Standard ProblemBiological gold standard for diabetes identification:

Elevated blood glucose levels

With good care management glucose levels may be below the threshold for diabetes diagnosis

Remission due to substantial weight loss, bariatric surgery

Comparative validityMedical record documentationSelf-reportClaims-based diagnosis codes

Page 6: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Diabetes Case Identification versus True Diabetes Population

False Negatives

(C)

False Positives

(B)

Actual Yes Actual No

Case ID Yes A B A + B

Case ID No C D C + D

A + C B + D

Sensitivity = A/(A+C)

A

Page 7: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Diabetes Case Identification versus True Diabetes Population

False Negatives

(C)

False Positives

(B)

Actual Yes Actual No

Case ID Yes A B A + B

Case ID No C D C + D

A + C B + D

Specificity = B/(B+D)

A

(D)

Page 8: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Diabetes Case Identification versus True Diabetes Population

False Negatives

(C)

False Positives

(B)

Actual Yes Actual No

Case ID Yes A B A + B

Case ID No C D C + D

A + C B + D

Sensitivity = A/(A+C)

A

PPV = A/(A+B)

Example:90% Sensitivity99% Specificity5% Prevalence

81% PPV

Page 9: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Tailoring Diabetes Case Definition to Specific Research QuestionsHigh Sensitivity High Predictive Positive

ValueMaximize inclusion of

potential casesObservational studiesSurveillancePopulation-based quality

metricsCER

Attenuate results but may have broader generalizability

Maximize identification of ‘true’ cases

Studies involving subject interventions

Registries that guide clinical interactions

Accountability tied to providers or systems

Intervention studiesCER

Restricted case identification Potential selection bias

Page 10: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Building the DataLink Registry

Initial registry construction

Broad sweep:Any indication of diabetes from any

electronic health data source

Page 11: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Datalink: Prevalent Diabetes Case Definition

Enrolled beginning in 2005Look back at data beginning from 2000

Within a 2-year time period…At least 2 face-to-face outpatient diabetes diagnoses,

orAt least 1 inpatient diabetes diagnosis, orAt least 1 anti-glycemic pharmacy dispense or claim

(excluding metformin, thiazolidinediones, or exenatide when no other criteria are met), or

At least 2 elevated blood glucose levels (HbA1c, fasting plasma glucose, random plasma glucose) or one elevated OGTT

Page 12: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Building Confidence in Case IdentificationVary time frames using same case

identification criteriaShorter time frames: more confident but

capture fewer casesLonger time frames: less confident but may

capture more casesWhat is ideal, especially with no gold standard?

Periodic recapture

Page 13: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Building Confidence in Case Identification

Prioritizing case identification criteriaAssign probabilities of ‘true case’

Prioritize data sourcesMore independent data sources identification…

more confidence.

Page 14: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Pharmacy Dispense(68%)

Outpatient & InpatientDiagnoses

(92%)

Laboratory Results(63%)

2008-9 Diabetes Case Identification at a DataLink Health System

?%

?%

?%?%

Page 15: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Dynamic and Static CohortsDynamic Cohort

Cumulative case identification over multiple years

Enter as new case or care system member

Leave due to death or disenrollment

Static Cohort Identification over

defined time period and followed

Enter…none. Prevalence0123456789

108.2

7.7

Dynamic (2000-9)Static (2008-9)

Figure 1. Dynamic & Static Diabetes Prevalence at one DataLink

Health System (2008-9)

Page 16: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Differential Use and Characteristics of Electronic Health Data Sources

There is wide variation across health systems regarding the primary source for case identification.

There may be selection bias associated with specific data sources.

This could affect case-mix and therefore results.

Page 17: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Initial Case Identification: At least 2 elevated blood glucose levels (lab)

Page 18: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Initial Case Identification: At least 1 diabetes drug pharmacy claims

Page 19: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Data Sources: Added ValueInsurance and pharmacy claims are routinely

used for diabetes case identification.Numerous validation studies against medical

record or self-report

What is the added case identification value of clinical data found in EMR’s?

Page 20: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Step-wise Contributions of Electronic Health Data Sources to Diabetes Case Identification (2008-9) at One DataLink Health System

A B C D E F G

2 Outpatient Claims Dx

1 Inpatient Claims Dx

Only

A + B 1 Diabetes Drug Claim Only

C + D 2 Elevated

Blood Glucose

Tests Only

E + F

Prevalence (N) 7.0%(12,916)

0.1%(111)

7.1%(13,02

7)

0.4%(678)

7.4%(13,70

5)

0.2%(422)

7.7%(14,12

7)

% Cases 91% +1% +5% +3%

Page 21: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Patient Characteristics Based on Qualifying Case Identification EHD (2008-9) at One DataLink

Health SystemA B C D E F G

Outpatient Claims Dx

Inpatient Claims Dx

Only

A + B Diabetes Drug Claim Only

C + D Blood Glucose Lab Test

Only

E + F

Prevalence (N) 7.0%(12,916)

0.1%(111)

7.1%(13,02

7)

0.4%(678)

7.4%(13,70

5)

0.2%(422)

7.7%(14,12

7)

% Cases 91% 1% 5% 3%

Select Patient Characteristics

Female (%) 49 60 49 79 51 50 51

18-44 years (%) 12 26 12 58 14 8 14

HbA1c < 8 (%) 81 91 81 91 81 98 82

LDL-c < 100 (%) 74 71 74 46 73 56 72

Current smoker (%)

11 11 11 14 11 15 11

Page 22: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Population RepresentativenessCER studies?

Assess relative effectiveness of various treatments and systems of care in defined patient populations.

Uninsured No: if population defined based on insurance claims Probably: if population defined based on EMR

Units of analysis Patient, Provider, Clinic, Health System

Large multi-site registries more likely to provide representative ‘units of analysis’ HIE potential to include smaller, less integrated systems

Page 23: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Percent Retention of 2002 Incident Diabetes Cohort at One DataLink Health System

Page 24: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

Comparing Selected Characteristics of 2006 Incident Cohort by Retention Status through 2010:Baseline characteristics

Retained Cohort Lost to disenrollment

Female 51% 50%

18-44 years 15% 27%

45-64 years 52% 55%

65+ years 32% 16%

Current smoker 15% 19%

BMI ≥ 30 kg/m2 60% 62%

HbA1c < 8% 86% 78%

LDL-c < 100 mg/dl 51% 43%

SBP < 140 mmHg 80% 79%

DBP < 90 mmHg 94% 90%

Page 25: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

SummaryNo realistic EHD gold standard for many

conditions.

When designing a registry,Think multi-purposeMaximize case capture so that a variety of case

definitions can be derived depending on specific study needs.

Consider developing several case definitions with different levels of confidence [sensitivity & PPV].

Page 26: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

SummaryFor CER studies we are interested in defined

patient populations, providers, clinics, health systems…

The greater the diversity of health systems participating in a disease registry the better…the more representative.

EMR-derived registries may include uninsured and be most representative.

Page 27: Jay Desai HealthPartners Research Foundation EDM Forum Methods Symposium Washington, DC October 27, 2011.

SummaryCohorts developed using insurance claims have

substantial attrition due to disenrollment over time.

Important to include demographic and clinical characteristics of retained population compared to those loss-to-follow-up

CER studies requiring long follow-up to outcomes may be challenging if based on secondary use of EHD dataImprove as health systems get regionally connected so

patients can be tracked across systems (HIE’s)?