Ottar Bjornstad Center for Infectious Disease Dynamics, Penn State University

35
Some challenges to make current data-driven (‘statistical’) models even more relevant to public health Ottar Bjornstad Center for Infectious Disease Dynamics, Penn State University

description

Some challenges to make current data-driven (‘statistical’) models even more relevant to public health. Ottar Bjornstad Center for Infectious Disease Dynamics, Penn State University. Focus on time series analysis of incidence analysis. 400k. 0-1y. 5-10y. 1-3y. 10-15y. 350k. 3-5y. 15- y. - PowerPoint PPT Presentation

Transcript of Ottar Bjornstad Center for Infectious Disease Dynamics, Penn State University

Page 1: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Some challenges to make current data-driven (‘statistical’) models even more

relevant to public health

Ottar Bjornstad Center for Infectious Disease Dynamics,

Penn State University

Page 2: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Focus on time series analysis of incidence analysis

0

50k

100k

150k

200k

250k

300k

350k

400k

‘44 ‘55

Quarterly measles incidence 44-55

0-1y1-3y3-5y

5-10y10-15y

15- y

Page 3: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Outline:

~ 1993

then –> now

current challenges

mostly anecdotal personal reflections

Page 4: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

~ 1993

Page 5: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

~1993 maturing* mathematical formalism incorporating: (cf yesterdays talks by Mick and Val)

– Seasonal forcing

– Age-structure & non-homogenous mixing

– Spatial diffusion & metapopulation dynamics

– Plausible scenarios of scaling of transmission with pop size

– Stochasticity

But, early days w.r.t. letting these general models loose on data …

* for directly transmitted persistent (SI), fully immunizing (SIR) or fully non-immunizing (SIS)

Page 6: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

~1993 Early days w.r.t. letting these general models loose on data …

… because of many challenges

The obvious:

–Absence of data on key state variables (eg susceptible)

–Disparity between key state variables and observed quantities (eg incidence is not prevalence)

The less obvious:

–Weariness regarding whether a detailed quantitative match to data should be a critical characteristic of mathematical models

Page 7: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

then –> now

Page 8: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Today’s expectation for match (1)

eg TSIR forecast for E&W measles (E&W)

IncidenceTSIR forecast

Page 9: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

),(~1

1

1

ttt

tttt

tttt

INegBinI

IS

IBSS

S - Susceptibles, B - Births , I - Infected and Infective, - epidemic intensity, - correction for time discretization, β - seasonal transmission rate

Host dynamics

Transmission dynamics

Stochasticity

TSIR: Discrete time ‘piecewise constant’ B-D process (cf Chain-binomial)If l is small relative to S, then the chain can be approximated by an unconstrained B-D process; The conditional distribution of It+1 is the sum of It Geometric distributions -> NegBin with clumping It

Page 10: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Today’s expectation for match (2)

eg age-structured TSIR forecast for rubella (South Africa)

Metcalf et al 2013

* difference in heat intensity is due to underreporting

*

Page 11: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Yesteryear’s expectations

Kot and Schaffer (1985) JTB:

“One way of resolving the problem is to view the motion in phase space,i.e. in a vector space whose axes are the state independent variables.However, for most real world ecological and epidemiological systems, thisrequirement is not easily met. It is often difficult even to enumerate all ofthe state variables, much less to follow their magnitudes over time. Put another way, the variables studied in nature are generally embedded in more complex systems. As a practical matter, it is unlikely that population dynamicists will ever be able to write down the complete governing equations for any natural system.”

ID complexities: age-structured mixing, age-specific seasonality in transmission, spatial heterogeneity, heterogeneities in susceptibility, etc, etc

Page 12: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Journey from there to here (1)

-> Many ‘Obvious’ challenges were painstakingly resolved along the way

Page 13: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Journey from there to here (1)

-> Many ‘Obvious’ challenges were painstakingly resolved along the way

1) Perhaps models may have some qualitative relevance?- Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c).

Page 14: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Journey from there to here (1)

-> Many ‘Obvious’ challenges were painstakingly resolved along the way

1) Perhaps models may have some qualitative relevance?- Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c).

2) If we can somehow reconstruct the unobserved susceptible class, would it be egregiously ambitious to compare model simulations and data? - Semiparametric models with smart embedding (Ellner &c) - Susceptible reconstruction (Bobashev &c; Finkenstadt &c)

Page 15: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Journey from there to here (1)

-> Many ‘Obvious’ challenges were painstakingly resolved along the way

1) Perhaps models may have some qualitative relevance?- Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c).

2) If we can somehow reconstruct the unobserved susceptible class, would it be egregiously ambitious to compare model simulations and data? - Semiparametric models with smart embedding (Ellner &c) - Susceptible reconstruction (Bobashev &c; Finkenstadt &c)

3) A seasonal chain-binomial model can in fact be recast as a non-autonomous autoregressive regression: I dear you! - Time-series SIR ver 1 (Finkenstadt & Grenfell) and TSIR ver 2

Page 16: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Journey from there to here (1)

-> Many ‘Obvious’ challenges were painstakingly resolved along the way

1) Perhaps models may have some qualitative relevance?- Nonparametric forecasting to distinguish cycles from chaos (Suigihara &c). - Nonparametric Lyapunov exponent estimators (Ellner &c).

2) If we can somehow reconstruct the unobserved susceptible class, would it be egregiously ambitious to compare model simulations and data? - Semiparametric models with smart embedding (Ellner &c) - Susceptible reconstruction (Bobashev &c; Finkenstadt &c)

3) A seasonal chain-binomial model can in fact be recast as a non-autonomous autoregressive regression: I dear you! - Time-series SIR ver 1 (Finkenstadt & Grenfell) and TSIR ver 2

4) Why in the world does the TSIR seem to fit measles in E&W? - ‘Emergent simplicity’ (Grenfell); Dynamic homogeneity (Earn &c)

Page 17: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Journey from there to here (2)

5) We believe! Real dynamics can be predicted by simple mechanistic models (that incorporates key idiosyncrasies) - POMP et al (King &c)

- Hierarchical models with observation process (Cauchemez &c). - Age-structured TSIR (Metcalf &c).….. (cf Simon’s talk)

Page 18: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Journey from there to here (2)

5) We believe! Real dynamics can be predicted by simple mechanistic models (that incorporates key idiosyncrasies) - POMP et al (King &c)

- Hierarchical models with observation process (Cauchemez &c). - Age-structured TSIR (Metcalf &c).….. (cf Simon’s talk)

Lessons from last 20 years: - ‘All models are wrong …’ Some much less than we expected.- Emergent simplicity once key idiosyncrasies are identified - ?Tactical/strategical? models may be more relevant than we expected.

[The prevailing notion that computation was the important driver in the field is wrong (Cambridge MRCs BUGS has been around since 20 years)]

Page 19: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Some current challenges

Page 20: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Some critical issues are:

(i) use nonlinear stochastic modeling to identify all potentially undesirable side effects of intervention-induced reduction in circulation.

-Rubella and CRS (cf Jess’ talk)

-Chikenpox vaccine and increased shingles incidence

-Whooping cough and the role of natural antigen circulation in maintaining immune memory. The possibility of long-term vaccine failure.

More case law!

Page 21: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

1920 1940 1960 1980 2000

Pertussis in Massachusetts

Years

Re

port

ed

pe

rtus

sis

case

s (/

yr.)

10

10

01

00

010

00

0

Cas

es (

/mo.

)

1990 1995 2000 2005

200

400 Recent cases

Vaccine introduced

The first decades of vaccine induced control was extremely successful …

Mass-vaccination introduced in most rich countries in mid ‘40s - early ’50s

Page 22: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

1920 1940 1960 1980 2000

Pertussis in Massachusetts

Years

Re

port

ed

pe

rtus

sis

case

s (/

yr.)

10

10

01

00

010

00

0

Cas

es (

/mo.

)

1990 1995 2000 2005

200

400 Recent cases

Vaccine introduced

The first decades of vaccine induced control was extremely successful …… Then even in very high cover areas throughout the developed world (e.g. Massachusetts with consistent >95% cover) the disease re-emerged!

Mass-vaccination introduced in most rich countries in mid ‘40s - early ’50s

Page 23: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Re-emergence is associated with a completely different core group

age class

dens

ity0.

000

.05

0.1

00

.15

0-1 5-6 10-15 15-60

pre-vaccine era

age class

dens

ity0.

000

.05

0.1

00

.15

0-1 5-6 12-13 20-21 30-31 40-41 50-51

vaccine era

Massachusets age-incidence patterns

Lavine, King and Bjornstad. 2011. PNAS

Page 24: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

The ‘anamnestic’ 4 compartment SIR model – re-exposure helps maintain immune memory

S – suceptible

I – Infected

R – Highly immune

W – Waning: resistant to infection and will get boosted or loose immunity depending on competing rates

- force of infection

- boosting coeffiecient

- recovery rate

- rate of loss of immunity

- rate of loss of immunity

Lavine, King and Bjornstad. 2011. PNAS

Page 25: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

As long as the anamnestic response is at least 10x greater than the naïve response:

Pre-vaccination prediction

Age

Page 26: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Post-vaccination prediction:

Lavine, King and Bjornstad. 2011. PNAS

Page 27: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

0.0 0.2 0.4 0.6 0.8 1.0

Vaccine coverage

Pro

po

rtio

n in

fect

ed Total incidence

0.0

008

0.0

011

0.0

014

? 0.5? 20? 5000

Natural immune boosting in pertussis dynamics and the potential for long-term vaccine failure

Inci

denc

e

Vaccine coverage

‘SIR’

‘SIS’

Page 28: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Pro

po

rtio

n ca

ses

0.0

00

.01

0.0

20.

03

0.0

4

Eff ect of booster v accine at a g e 15

1 3 5 7 9 11 13 15 17 19 21 23

Boost at 15No boost

Predicted public health consequences of a booster vaccine at age 15 …

… The booster may push circulation towards adults of childbearing age and increase perinatal infection and increase severe disease.

(cf CRS but different mechanism)

Page 29: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

Some critical issues are:

(ii) robust forecasting in the face of rapidly changing demographies and vaccination schedules

Page 30: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

(ii) robust forecasting in the face of rapidly changing demographies and vaccination schedules

e.g. Measles Incidence in China: 3 provinces

From Matt Ferrari

Page 31: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

(ii) robust forecasting in the face of rapidly changing demographies and vaccination schedules; *

* Perreti et al PNAS 2013 110:5253-5257 (‘Model-free forecasting outperforms the correct mechanistic model for simulated and experimental data’) is not the way to go

Page 32: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

(iii) probabilistically projecting possible/probable build-up of ‘susceptible pockets’ in the face of imperfect vaccination programs

Page 33: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

(iii) probabilistically projecting possible/probable build-up of ‘susceptible pockets’ in the face of imperfect vaccination programs

Burkina Faso: 2009

Malawi: 2010

•>135,000 cases following 10 years of low incidence

France, 2011

0

200

400

600

800

1000

1200

1986

Jul-

86

1987

Jul-

87

1988

Jul-

88

1989

Jul-

89

1990

Jul-

90

1991

Jul-

91

1992

Jul-

92

1993

Jul-

93

1994

Jul-

94

1995

Jul-

95

1996

Jul-

96

1997

Jul-

97

1998

Jul-

98

MES / AÑO

CA

SOS

1986

-96

0

2000

4000

6000

8000

10000

12000

CA

SOS

1997

-98

CAMPAÑA MAYO 87SARAMPIÓN 9M - 14 A

CAMPAÑA JUNIO 92M-M-R 1 - 10 A

Sao Paolo: 1997

Eg Measles (from Matt Ferrari)

Page 34: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

(iv) Important challenge:

We need accurate parametric anchoring of mechanistic models

-7000

-6000

-5000

-4000

-3000

-2000

-1000

2 e-04 4 e-04 6 e-04 8 e-04 1 e-03

2 e-04

4 e-04

6 e-04

8 e-04

1 e-03

intra.beta

inte

r.b

eta

Inference for mechanistic models usually reveal severe multicollinearity among parameters:-Various parameter combinations can fit observed data equally well,-but will not make the same out of sample predictions

Log-likelihood

Intrastage β

Inte

rsta

ge β

Eg 2-stage PDV model (cf Klepac et al. 2009)

Page 35: Ottar Bjornstad  Center for Infectious Disease Dynamics, Penn State University

- ‘All models are wrong …’ Some much less than we expected.- Emergent simplicity once key idiosyncrasies are identified. -?Tactical/strategical? models may be more relevant than we expected.

- We have an enormous arsenal of model fitting tools.- Multicollinearity makes anchoring of estimates a critical challenge.

Current modeling challenges:- Study unanticipated Public health consequences - Consequences of rapidly changing demographics- Understand build-up of susceptible pockets

Thank you!