Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up...

48
Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge

Transcript of Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up...

Page 1: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Linear Algebraic Approaches to Metabolic Systems Analysis:

Adventures for Undergrads… and Up

NIMBIOS Workshop

June 20, 2014

Terrell L. Hodge

Page 2: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

NOTE:

The ideas discussed in these slides provide but one in-road into the use of algebraic (particularly linear algebraic, here) methods for the representation and analysis of metabolic and other biochemical reaction networks. This is a fast-moving field and there is far more research that has been done on this topic that is much newer.

These slides complement undergraduate curriculum materials in Chapter 8 of Modern Concepts and Methods in Modern Biology, but we will also soon start with and use other materials for today’s presentation and exercises (posted in the NIMBIOS

blog).

Page 3: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Outline of Workshop Topics

Metabolic Pathways Stoichiometric Matrices Null Spaces: Extreme Pathways Left Null Spaces: Extreme Pool Maps Going Further: SVDs and More

Page 4: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Metabolic Pathways

Context Examples One

Mathematical Approach

[diagram from Wikipedia]

Page 5: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

The Meaning of “Life”?

Living organisms/systems are, thermodynamically, open systems that tend to maintain a steady state.

Steady state: “All rates of flows in the system are constant, so the system does not change with time.” [A steady state is a stable state!]

Eventual state of closed thermodynamic system is equilibrium.

Equilibrium: Death for a living system. (However, individual reactions in a system may be close to equilibrium.) [Death is also a very stable state.]

Page 6: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Metabolic pathways[Voet & Voet]

“Metabolism is the overall process through which living systems acquire and utilize free energy to carry out their various functions.”

Metabolism is enacted through metabolic pathways: chains of “consecutive enzymatic reactions that produce specific products for use by an organism”.

The metabolites in a metabolic pathway are usually taken to be the substrates, intermediates, and reactants in this chain of reactions.

Page 7: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Example: Glycolysis[diagram from Voet & Voet]

‘Glykos’ = ‘sweet’, ‘lysis’ = ‘loosening’ (Greek)

Widely shared mechanism of life forms for energy extraction

Overall reaction:

GLU + 2NAD+ + 2ADP + 2Pi 2NADH + 2PYR + 2ATP + 2H2O + 4H+

Page 8: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Stoichiometric Matrix S

Metabolic system with m metabolites, n reactions

Dynamic mass balance equation dC/dt = Sv

S = (sij) is integer-valued m by n matrix

– sij = 0 if metabolite i not involved in reaction j

– sij < 0 if metabolite i is a substrate in

reaction j (|sij| moles (units) consumed in reaction j)

– sij > 0 if metabolite i is a product of

reaction j (|sij| moles formed in reaction j)

metabolites

s11r …

e

a …

c

t …

i

o …

n

s …

s1n

s21 s2n

: :

: :

sm1 smn

Page 9: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Example: Glycolysis[diagram from Voet & Voet]

Page 10: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Example: Glycolysis[diagram from Voet & Voet]

Page 11: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Dynamic Mass-Balance Equation(s) dC/dt = Sv

Metabolic system with m metabolites, n reactions Ci = [Xi] := concentration in the metabolic system of

Xi := metabolite i, for i= 1,..,m

dCi/dt = si1v1 + si2v2 + ... + sinvn, where

vj= rate of reaction j := flux of reaction j

Page 12: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Example: Glycolysis[diagram from Voet & Voet]

Dynamic Mass-Balance

First two reactions, again.

Exercise: Repeat for extended two-reaction system, and/or first three reactions.

Page 13: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

A Linear Algebraic Perspective

Sv = 0Null Space of S

N(S)

xTS = 0Left Null Space of S

N(ST)

Picture modified from[Famili and Palsson]

Page 14: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

A Linear Algebraic Perspective

Sv = 0Null Space of S

N(S)

xTS = 0Left Null Space of S

N(ST)

Picture modified from[Famili and Palsson]

Focus here.

Page 15: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Null Space of S

Sv = 0: steady-state solutions to dynamic mass balance equation dC/dt = Sv

N(S):={y in Rn | Sy = 0}; a vector v in N(S) is a flux vector for the metabolic system (steady state)

Vectors in N(S) give dependencies among columns of S (i.e., reactions)

Page 16: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Example: Glycolysis[diagram from Voet & Voet]

Page 17: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Example: Glycolysis[diagram from Voet & Voet]

Page 18: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Metabolic Pathways Exercise, Step 1: Build Your Own Stoichiometric Matrix

A ‘toy’ example from [Schilling and Palsson].

Page 19: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Metabolic Pathways Exercise, Step 2: Read off the “Pathways”

Figures from [Schilling & Palsson]

Page 20: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Metabolic Pathways Exercise: Drawing “Paths”

Figure from [Schilling & Palsson]

Page 21: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Metabolic Pathways Exercise, Step 3: Change Basis, Read Pathways

Figures from [Schilling & Palsson]

Page 22: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Metabolic Pathways Exercise: Drawing “Paths” Again

Figure from [Schilling & Palsson]

Page 23: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

From the Null Space of S to Extreme Pathways

Null space of S: Standard methods yield mathematically valid basis of N(S), but resulting vectors may not be biologically valid total flux vectors.

Base-changing: Aim for “biologically valid” basis of N(S); does such necessarily exist? Even if so, what about uniqueness?

Next: Convex hulls, extreme pathways, and examples.

Page 24: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Biologically “Good”Flux Vectors

In the convex hull flux cone(S), there is an analogue of a basis for N(S), only better: a generating set of ‘independent’ flux vectors P = {p1,…, pt}, unique up to taking scalar multiples, and for which every w in flux cone(S) is a unique non-negative linear combination of vectors in P.

Image from [Schilling, Letscher, Palsson]

Page 25: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Example: ExtremePathways (Expas)[Schilling, Schuster, Palsson & Heinrich]

Figures and tables from [Schilling, Schuster, Palsson & Heinrich]

b1

b2

b3

Basis (transposed) for N(S):

f1

f2

f3

f4

Expas: systematically independent basis P(transposed) for convex flux cone:

p1 = f1 := b1 – b2,, p2 = f2 := b1, p3 = f3 := b3 – b2 , p4 = f4 := b3

Look here!

Page 26: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Expas Example: Human Red Blood Cell(HRBC) [Wiback & Palsson]

Model accounts for 39 metabolites and 32 internal metabolic reactions, as well as 19 external ones (12 pri-mary exchange and 7 currency exchange fluxes).

Resulting flux cone(S) has |P| = 54; further partitioning into ‘types’ yields 39 expas of interest (36 Type I, 39 Type II).

Page 27: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

HRBC: Some of the Type I Expas[Wiback & Palsson]

Page 28: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

HRBC: More Type I and II Expas[Wiback & Palsson]

Outcomes include: Unique and mathematically

precise description of pathways, including key ‘historical pathways’, but extending to many ‘less intuitive’ paths that reflect network properties

Opportunity to predict system effects of enzyomapathy and other ‘load’ capacities on individual reactions

Another not-so-subtle point of the last few slides: consider advantages of good mathe-matical framework, such as linear algebra.

Page 29: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Enzyomapathies in HRBC [Çakir, Tacer & Ülgen]

Human red blood cell model with 44 metabolites and 39 reactions

Investigates 5 (of about 20 known) enzyomapathies (in this case, enzyme deficiencies) using metabolic pathway analysis

Follows work including [Wiback & Palsson], but using EFMs (elementary flux modes)

One aim: identify targets for drug intervention for diseases caused by enzyme alterations/dysfunction

Page 30: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.
Page 31: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Recall: A Linear Algebraic Perspective

Sv = 0Null Space of S

N(S)

xTS = 0Left Null Space of S

N(ST)

Picture modified from[Famili and Palsson]

Focus here.

Page 32: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Left Null Space of S

N(ST) = {x in Rm | STx = 0} = N(-ST)

= {x in Rm | xTS = 0} * v in N(ST) is a potential “pool map”, defining

a conservation relationship** Vectors in N(ST) give dependencies among

rows of S (metabolites)

* Rm consists of column vectors. **Details in the Appendix.

Page 33: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Conservation and Pool Maps Exercise, Step 1: Build Your Own Dynamic Mass-Balance Equations

A ‘toy’ example from [Nikolaev, Burgard, & Maranas]

Page 34: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Conservation and Pool Maps Exercise, Step 1: Build Your Own Dynamic Mass-Balance Equations (Solution)

Note differencefrom previous sign conventions; can use –S in place of prior S.

Example from [Nikolaev, Burgard, & Maranas]

Page 35: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Conservation and Pool Maps Exercise, Step 2: Find Conserved Cycles

Note differencefrom previous sign conventions; can use –S in place

of prior S.

Diagram from [Nikolaev, Burgard, & Maranas]

Page 36: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Conservation and Pool Maps Exercise, Step 2: Find Conserved Cycles

Note differencefrom previous sign conventions; can use –S in place

of prior S.

Diagram from [Nikolaev, Burgard, & Maranas]

Page 37: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Alternate Perspective: Reaction Maps to Compound Maps

Metabolites Nodes Reactions Arrows S, N(S) Substrates Tails of Edges Products Heads of Edges

Reactions Nodes Metabolites Arrows -ST, N(-ST) Substrates Edges Entering Nodes Products Edges Exiting Nodes

e.g., as in [Famili and Palsson]

Page 38: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Left Null Space of S: Compound Maps and Extreme Pool Maps

As before, basis of N(-ST) lacks uniqueness and may not be biologically interesting

As before, compute convex basis, call resulting (unique) vectors extreme pool maps (extreme pools)

Page 39: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Example: Extreme Pool Maps in Glycolysis[Nikolaev, Burgard & Maranas]

Glycolysis represented with 11 metabolites (16 if include ATP, ADP, NAD+, NADH, H20), and

13 reactions. Flux cone(-ST) has a

systematically independent basis with |P| = 8 vectors, so there are 8 extreme pool maps.

Diagram from [Nikolaev, Burgard & Maranas]

Page 40: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Example: Extreme Pool Maps in Glycolysis[Nikolaev, Burgard & Maranas]

Glycolysis offered as both a rich and sufficiently small real-life system for direct computation and analysis of extreme pools.

However, paper considers alternative methods to elucidate and analyze extreme pools for larger systems, paralleling alternate ‘flux coupling’ methods for extreme paths.

Page 41: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Metabolic Pathway Analysis, Extreme Pathways, and Extreme Pools: Some Consequences

Yields mathematically precise definition of metabolic pools and pathways that take a systems/network approach

Yields ‘unique’ generating set, with properties similar to vector space bases (‘minimality’ and ‘spanning’)

Gives geometrically and graphically appealing interpretations Algorithms and programs exist for computing extreme paths

and extreme pools Linear algebra framework provides accessible mathematical

framework that is rich in computational power and is a base for many other mathematical structures

Extends current biological ‘intuition’, suggests mechanisms for understanding how living systems maintain steady states and fight or fall to disease, as well as proper design of medical interventions.

Page 42: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Linear Algebra Again: The Four Fundamental Subspaces and S

Sv = 0Null Space of S

N(S)

xTS = 0Left Null Space of S

N(ST)

Picture modified from[Famili and Palsson]

Focus here.

Page 43: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Singular Value Decomposition (SVD):Learning More From S [Price, et. al.]

Recall convex basis of expas P = {p1,.., pt} for flux cone(S). Set P to be the matrix with columns p1,..pt.

Find SVD(P) = UVT. Analysis allows for comparison of extreme pathways for

different metabolic systems, and may assist in identifying key branch points (targets for regulation).

Diagram from [Price, et. al.]

‘p’ above = ‘t’ below.

Page 44: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Singular Value Decomposition (SVD): Learning More From S [Price, et. al.]The column vectors of U (‘modes’) give information re: flux variability within the cone. The singular values measure variance in directions given by the corresponding U vectors.

Diagrams from [Price, et. al.]

Page 45: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

Some Issues in Use of Extreme Pathways/Extreme Pool Maps

In small to medium systems, expas/expools can be calculated, but giving their biological interpretations is not automated!

Scaling to genome-level an issue: Implementation of original algorithms for computing flux cones (like expa) problematic for large systems: computational round-off error for large S, combinatorial explosion and NP completeness issues arise…

Perspective may be enhanced by comparison with other linear algebra/convex analysis/linear programming methods*, e.g., EFMs, FCA, MCCA and MCPI, FluxAnalyzer…

Dynamic and regulatory information are not, in general, treated in MPA (metabolic pathway analysis) approach.

Biologists are not out of jobs: good biological data and physical approaches to flux determination (isotope labeling, etc.) still important.

*Still other approaches/enhancements exist, making use of aspects of probability and statistics, etc…

Page 46: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

References

Schilling and Palsson, The underlying pathway of biochemical reaction networks.

Famili and Palsson, The convex basis of the left null space of the stochiometric matrix leads to the definition of metabolically meaningful pools.

Schilling, Schuster, Palsson and Heinrich, Metabolic pathway analysis: Basic concepts and scientific applications in the post-genomic era.

Çakir, Tacer and Ülgen, Metabolic pathway analysis of enzyme-deficient human red blood cells.

Suthers, Burgard, Dasika, Nowroozi, Van Dien, Keasling, Maranas, Metabolic flux elucidation for large-scale models using 13C labeled isotopes.

Price, Reed, Papin, Famili and Palsson, Analysis of metabolic capabilities using singular value decomposition of extreme pathway matrices.

Schilling, Letscher, and Palsson, Theory for systematic definition of metabolic pathways and their use in interpreting metabolic function from a pathway-oriented perspective.

Wiback and Palsson, Extreme pathway analysis of human red blood cell metabolism.

Nikolaev, Burgard, and Maranas, Elucidation and Structural Analysis of Conserved Pools for Genome-Scale Metabolic Reconstructions.

Page 47: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.

References

Voet and Voet, Biochemistry (3rd Ed.)

Bell and Palsson, expa, a program for calculating extreme pathways in biochemical reaction networks.

Becker, Feist, Mo, Hannum, Palsson, and Herrgard, Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox.

Schuster, Dandekar, and Fell, Detection of elementary flux modes in biochemical networks: a promising tool for pathway analysis and metabolic engineering.

Klamt, Stelling, Ginkel and Giles, FluxAnalyzer: exploring structure, pathways, and flux distributions in metabolic networks on interactive flux maps.

Palsson, Representing Reconstructed Networks Mathematically: The Stochiometric Matrix (lecture series).

Systems Biology Research Group, http://gcrg.ucsd.edu/

Schuster, Fell, and Dandekar, A general definition of metabolic pathways useful for systematic organization and analyis of complex metabolic networks.

Schuster and Hilgetag, On elementary flux modes in biochemical reaction systems at steady state.

Schuster, Hilgetag, Woods and Fell, Reaction routes in biochemical reaction systems: algebraic properties, validated calculation procedure and example from nucleotide metabolism.

Page 48: Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up NIMBIOS Workshop June 20, 2014 Terrell L. Hodge.