Mathematical Modeling of Genetic Regulatory … Modeling of Genetic Regulatory Networks Hidde de...

45
Mathematical Modeling of Genetic Regulatory Networks Hidde de Jong Projet HELIX INRIA Rhône-Alpes 655, avenue de l’Europe Montbonnot, 38334 Saint Ismier CEDEX Email: [email protected]

Transcript of Mathematical Modeling of Genetic Regulatory … Modeling of Genetic Regulatory Networks Hidde de...

Mathematical Modeling ofGenetic Regulatory Networks

Hidde de JongProjet HELIX

INRIA Rhône-Alpes655, avenue de l’Europe

Montbonnot, 38334 Saint Ismier CEDEX

Email: [email protected]

2

Overview

1. Genetic regulatory networks

2. Modeling and simulation of genetic regulatory networks

3. Modeling and simulation approaches:

l differential equations

l stochastic equations

4. Conclusions

3

Genes and proteinsv Genes code for proteins that are essential for development

and functioning of organism: gene expression

DNA

RNA

transcription

protein

translation

protein andmodifier molecule

post-translational modification

4

Regulation of gene expression

v Regulation of gene expression on several levels

v Gene expression controlled by proteins produced by othergenes: regulatory interactions

transcriptional regulation

translational regulation

regulation ofpost-translational modification

5

repressors

repressor complex

activator

gene 1

gene 3

gene 2

Genetic regulatory network

v Genetic regulatory network consists of set of genes, proteins,small molecules, and their mutual regulatory interactions

v Development and functioning of organisms cell emerges frominteractions in genetic regulatory networks

6

Bacteriophage λ infection of E. coli

v Response of E. coli to phage λinfection involves decisionbetween alternativedevelopmental pathways:lytic cycle and lysogenyPtashne, 1992

7

Genetic regulatory network phage λ

v Choice between alternative developmental pathways controlledby network of genes, proteins, and mutual regulatoryinteractions

McAdams & Shapiro, 1995

8

v Most genetic regulatory networks are large and complexCells have many components that can interact in complex ways

v Dynamics of large and complex genetic regulatory processeshard to understand by intuitive approaches alone

v Mathematical methods for modeling and simulation arerequired:

l precise and unambiguous description of network of interactions

l systematical derivation of behavioral predictions

v Practical application of mathematical methods requires user-friendly computer tools

Computational approaches

9

Mathematical modeling approachesv Mathematical modeling has developed since the 1960s and is

currently attracting much attentionBower and Bolouri, 2001; Hasty et al., 2001; McAdams and Arkin, 1998;Smolen et al., 2000; de Jong, 2002

v Two approaches to computer modeling and simulationdiscussed in this session:

l differential equations

l stochastic equations

v Jean-Luc Gouzé will discuss class of piecewise-lineardifferential equations central to this project in more detail

10

Differential equation models

v Cellular concentration of proteins, mRNAs, and other moleculesat time-point t represented by continuous variable xi(t) ∈ R≥0

v Regulatory interactions modeled by kinetic equations

where fi(x) is rate law

v Rate of change of variable xi is function of other concentrationvariables x = [x1,…, xn]´

v Differential equations are major modeling formalism inmathematical biology

Segel, 1984; Kaplan and Glass, 1995; Murray, 2002

xi = fi(x), 1 ≤ i ≤ n,.

11

Negative feedback system

v Gene encodes a protein inhibiting its own expression:negative feedback

v Negative feedback important for homeostasis, maintenance ofsystem near a desired state

Thomas and d’Ari, 1990

gene

mRNA

protein

-

12

Model of negative feedback systemgene

mRNA

protein

x1 = mRNA concentration

x2 = protein concentration

x1 = κ1 f (x2) - γ1 x1

x2 = κ2 x1 - γ2 x2

κ1 , κ2 > 0, production rate constants

γ1, γ2 > 0, degradation rate constants

.

.

f (x2) = , θ > 0 thresholdθ n

θ n + x2n

x2

f (x2 )

θ0

13

Steady state analysis

v No analytical solution of nonlinear differential equationsdescribing feedback system

v System has single steady state at x = 0

v Steady state is stable, that is, after perturbation system willreturn to steady state (homeostasis)

.

x2

x1

0

x2 = 0 .

x1 = 0 .

x1 = 0 : x1 = f (x2)κ1γ1

x2 = 0 : x1 = x2γ2κ2

.

.

14

Transient behavior after pertubation

v Numerical simulation of differential equations shows transientbehavior towards steady state after perturbation

Initial values x1 (0), x2 (0) correspond to perturbation

x1

x2

t0

x2

x1

0

x2 = 0 .

x1 = 0 .

15

Positive feedback system

v Gene encodes a protein activating its own expression:positive feedback

v Positive feedback important for differentiation, evolutiontowards one of two alternative states of system

gene

mRNA

protein

+

16

Model of positive feedback systemgene

mRNA

protein

+

x1 = mRNA concentration

x2 = protein concentration

x1 = κ1 f (x2) - γ1 x1

x2 = κ2 x1 - γ2 x2

κ1 , κ2 > 0, production rate constants

γ1, γ2 > 0, degradation rate constants

.

.

f (x2) =x2

θ n + x2n

n

x2

f (x2 )

θ0

17

Steady state analysis

v No analytical solution of nonlinear differential equationsdescribing feedback system

v System has three steady states

v Two stable and one unstable steady state. System will tend toone of two stable steady states (differentiation)

x2

x1

0

x2 = 0 .

x1 = 0 .

x1 = 0 : x1 = f (x2)κ1γ1

x2 = 0 : x1 = x2γ2κ2

.

.

18

Transient behavior after pertubation

v Depending on strength of perturbation, transient behaviortowards different steady states

x1

x2

t0

x2

x1

0

x2 = 0 .

x1 = 0 .

x1

x2

t0

19

Model of time-delay feedback system

v Time to complete transcription and translation introduces time-delay in differential equations

v Time-delay feedback systems may exhibit oscillatory behavior

gene

mRNA

protein

-

x1 = mRNA concentration

x2 = protein concentration

x1 = κ1 f (x2) - γ1 x1

x2 = κ2 x1 - γ2 x2

.

.

x1(t ) = x1(t - τ1) , τ1 > 0 time-delay

x2(t ) = x2(t - τ2) , τ2 > 0 time-delay

τ

τ

τ

τ

20

More complex feedback systems

v Gene encodes a protein activating synthesis of another proteininhibiting expression of gene: positive and negative feedback

v Interlocking feedback loops give rise to models with complexdynamics: numerical simulation techniques necessary

gene a

mRNA b

protein A

-

protein B

gene b

+

mRNA a

21

Application of differential equations

v Differential equations have been used to model a variety ofgenetic regulatory networks:

l circadian rhythms in Drosophila (Leloup and Goldbeter, 1998)

l λ phage infection of E. coli (McAdams and Shapiro, 1998)

l segmentation of early embryo of Drosophila (Reinitz and Sharp, 1996)

l cell division in Xenopus (Novak and Tyson, 1993)

l Trp synthesis in E. coli (Santillán and Mackey, 2001)

l induction of lac operon in E. coli (Carrier and Keasling, 1999)

l developmental cycle of bacteriophage T7 (Endy et al., 2000)

l ...

22

Simulaton of phage ? infection

v Kinetic model of the phage ? network underlying decisionbetween lytic cycle and lysogenyMcAdams & Shapiro, 1995

23

Simulaton of phage ? infection

v Time evolution of promoter activity and protein concentrationsin (a) lysogenic and (b) lytic pathways

McAdams & Shapiro, 1995

24

Evaluation of differential equations

v Pro: general formalism for which powerful analysis andsimulation techniques exist

v Contra: numerical techniques are often not appropriate due tolack of quantitative knowledge

value of parameters and evolution of concentrations are not known

v Contra: implicit assumptions of continuous and deterministicchange of concentrations may not be valid on molecular level

25

Gene expression is discrete process

v Gene expression is result of large number of discrete events:chemical reactions

0 1 2 3 4 n-1 n0 1 2 3 4 n-1 nDNA

0 1 2 3 4 n-1 n

RNA polymerase

DNA + RNAP → DNA0 • RNAP

0 1 2 3 4 n-1 nDNA

DNAi • RNAP → DNAi+1 • RNAP

26

Gene expression is stochastic process

v Gene expression is stochastic process: random timeintervals τ between occurrence of reactions

v Time interval τ has probability distribution

P(τ )

τ

0 1 2 3 4 n-1 n0 1 2 3 4 n-1 nDNA

RNA polymerase

27

Differential equations are abstractions

v Differential equation models make continuous anddeterministic abstraction of discrete and stochastic process

l xi(t) ∈ R≥0 is continuous variable

l xi = fi(x) determines change in xi at t

v Abstraction may not be warranted when modeling generegulation on molecular level: low number of molecules

v Therefore, more realistic stochastic models of gene regulation

.

.

28

Stochastic variables

v Stochastic variables Xi describe number of molecules ofproteins, mRNAs, etc.

l Xi(t) ∈ N≥0 is discrete variable

l P(Xi(t)) is probability distribution describing probability that at time-point t cell contains Xi molecules of i

.

P(Xi(t))

Xi(t)

29

Stochastic master equations

v Stochastic master equations describe evolution of state X =[X1,…, Xn]´ of regulatory system

l m is the number of reactions that can occur in the system

l αj ∆t is the probability that reaction j will occur in [t, t +∆t] given thatthe system is in state X at t

l βj ∆t is the probability that reaction j will bring the system in state Xfrom another state in [t, t +∆t]

van Kampen, 1997

P(X (t +∆t)) = P(X (t )) (1 - ∑ αj ∆t ) + ∑ βj ∆tj = 1 j = 1

mm

30

Stochastic simulation

v For ∆t → 0 we obtain

v Analytical solution of master equations is not possible

v Stochastic simulation by predicting a sequence of reactionschanging the state of the system, starting from initial state X0

Stochastic simulation uses stochastic variables τ and ρτ = time interval until occurrence of next reaction

ρ = type of reaction

Gillespie, 1977

P(X (t )) = ∑ (βj - αj P(X (t )))j = 1

m∂∂t

31

Reactions in gene expression

v Five possible reactions in gene expression are considered

0 1 2 3 4 n-1 n

RNA polymerase

0 1 2 3 4 n-1 nDNA + RNAP → DNA0 • RNAP

DNAi • RNAP → DNAi+1 • RNAP0 1 2 3 4 n-1 n 0 1 2 3 4 n-1 n

DNAn • RNAP → DNA + RNAP0 1 2 3 4 n-1 n 0 1 2 3 4 n-1 n

DNA + R → DNA • R0 1 2 3 4 n-1 n 0 1 2 3 4 n-1 n

repressor

DNA • R → DNA + R0 1 2 3 4 n-1 n 0 1 2 3 4 n-1 n

1

3

2

5

4

32

Simulation of gene expression

v Stochastic simulation from initial state

0 1 2 3 4 n-1 n 0 1 2 3 4 n-1 nDNA + RNAP → DNA0 • RNAP

DNAi • RNAP → DNAi+1 • RNAP0 1 2 3 4 n-1 n 0 1 2 3 4 n-1 n

DNAi • RNAP → DNAi+1 • RNAP0 1 2 3 4 n-1 n 0 1 2 3 4 n-1 n

reaction 1 chosen

reaction 2 chosen

reaction 2 chosen

33

Stochastic outcome of simulation

v Simulation starting from same initial state will generally lead todifferent results

0 1 2 3 4 n-1 n 0 1 2 3 4 n-1 nDNA + R → DNA • R

DNA • R → DNA + R0 1 2 3 4 n-1 n 0 1 2 3 4 n-1 n

DNA + RNAP → DNA0 • RNAP0 1 2 3 4 n-1 n 0 1 2 3 4 n-1 n

reaction 4 chosen

reaction 5 chosen

reaction 1 chosen

34

Stochastic simulation and master equation

v Repeating stochastic simulations allows approximation ofP(X (t )) in master equation to be given

X1

t

P(X1(t1))

X1(t1)

t1

35

Application of stochastic equations

v Stochastic equations have been used to model genetic andother regulatory systems:

l λ phage infection of E. coli (Arkin et al., 1998)

l chemotactic signalling in E. coli (Morton-Firth and Bray, 1998)

l ...

36

Stochastic analysis of phage ? infection

v Stochastic model of ?lysis-lysogenydecision networkArkin et al., 1998

37

Stochastic analysis of phage ? infection

v Time evolution of Cro and CIdimer concentrations

v Due to stochastic fluctuations,under identical conditions cellsfollow one or other pathway withsome probability

Arkin et al., 1998

38

Comparison with deterministic approach

v Deterministic models can beseen as predicting averagebehavior of cell populationGillespie, 2000

v However, analysis of averagebehavior may obscure thatone part of populationchooses one pathway ratherthan another

Arkin et al., 1998

39

Evaluation of stochastic equations

v Pro: more realistic models of gene regulation

v Contra: required information on regulatory mechanisms onmolecular level usually not available

reaction schemas and values of parameters τ and ρ are not orincompletely known

v Contra: stochastic simulation is computationally expensive

large networks cannot currently be handled

40

Conclusions

v Computer tools for modeling and simulation will be necessaryto understand genetic regulatory processes

v Variety of approaches available, representing geneticregulatory systems on different levels of abstraction

v Choice of approach depends on aim of analysis and onavailable information:

l knowledge on reaction mechanisms

l quantitative data on model parameters and gene expression levels

v Serious applications are beginning to emerge

41

Literature

A. Arkin et al., Stochastic kinetic analysis of developmental pathway

bifurcation in phage λ-infected Escherichia coli cells, Genetics,149:1633-1648, 1998

J.M. Bower and H. Bolouri, Computational Modeling of Genetic andBiochemical Networks, MIT Press, 2001

T.A. Carrier and J.D. Keasling, Investigating autocatalytic gene expressionsystems through mechanistic modeling, J. Theor. Biol., 201:25-36, 1999

J.L. Cherry and F.R. Adler, How to make a biological switch, J. Theor. Biol.,203:117-133, 2000

42

Literature

D. Endy et al., Computation, prediction, and experimental tests of fitness forbacteriophage T7 mutants with permuted genomes, P. Nat. Acad. Sc.USA, 97(10):5375-5380, 2000

J. Hasty et al., Computational studies of gene regulatory networks: innumero molecular biology, Nat. Rev. Genet., 2(4):268-279, 2001

H. de Jong, Modeling and simulation of genetic regulatory systems: Aliterature review, J. Comput. Biol., 9(1): 69-105, 2002

N. van Kampen, Stochastic Processes in Physics and Chemistry, Elsevier,1997

43

Literature

D. Kaplan and L. Glass, Understanding Nonlinear Dynamics, SpringerVerlag, 1995

A.D. Keller, Model genetic circuits encoding autoregulatory transcriptionfactors, J. Theor. Biol., 172:169-185, 1995

J.-C. Leloup and A. Goldbeter, A model for circadian rhythms in Drosophilaincorporating the formation of a complex between the PER and TIMproteins, J. Biol. Rhythms, 13(1):70-87, 1998

B. Lewin, Genes VI, Oxford University Press, 199

44

Literature

H. McAdams and A. Arkin, Simulation of prokaryotic genetic circuits, Annu.Rev. Biophys. Biomol. Struct., 27:199-224, 1998

C. Morton-Firth and D. Bray, Predicting temporal fluctuations in anintracellular signalling pathway, J. Theor. Biol., 192:117-128, 1998

H. McAdams and L. Shapiro, Circuit simulation of genetic networks,Science, 269:650-656

B. Novak and J. Tyson, Modeling the cell-division cycle: M-phase trigger,oscillations and size control, J. Theor. Biol., 165:101-134, 1993

M. Ptashne, A Genetic Switch: Phage ? and Higher Organisms,

J. Reinitz and D. Sharp, Gene circuits and their uses, in: J. Collado-Videset al. (eds), Integrative Approaches to Molecular Biology, MIT Press,253-272,1996

45

Literature

M. Santillán and M.C. Mackey, Dynamic regulation of the tryptophanoperon: A modeling study and comparison with experimental data,P. Nat. Acad. Sc. USA, 98(4):1364-1369, 2001

L. Segel, Modeling Dynamic Phenomena in Molecular and Cellular Biology,Cambridge University Press, 1984

P. Smolen et al., Modeling transcription control in gene networks: Methods,recent results, and future directions, Bull. Math. Biol., 62:247-292, 2000

S.H. Strogatz, Nonlinear Dynamics and Chaos: With Applications toPhysics, Biology, Chemistry, and Engineering, Perseus Books, 1994

R. Thomas and R. d’Ari, Biological Feedback, CRC Press, 1990