Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up...
-
Upload
geoffrey-gallagher -
Category
Documents
-
view
214 -
download
1
Transcript of Linear Algebraic Approaches to Metabolic Systems Analysis: Adventures for Undergrads… and Up...
Linear Algebraic Approaches to Metabolic Systems Analysis:
Adventures for Undergrads… and Up
NIMBIOS Workshop
June 20, 2014
Terrell L. Hodge
NOTE:
The ideas discussed in these slides provide but one in-road into the use of algebraic (particularly linear algebraic, here) methods for the representation and analysis of metabolic and other biochemical reaction networks. This is a fast-moving field and there is far more research that has been done on this topic that is much newer.
These slides complement undergraduate curriculum materials in Chapter 8 of Modern Concepts and Methods in Modern Biology, but we will also soon start with and use other materials for today’s presentation and exercises (posted in the NIMBIOS
blog).
Outline of Workshop Topics
Metabolic Pathways Stoichiometric Matrices Null Spaces: Extreme Pathways Left Null Spaces: Extreme Pool Maps Going Further: SVDs and More
Metabolic Pathways
Context Examples One
Mathematical Approach
[diagram from Wikipedia]
The Meaning of “Life”?
Living organisms/systems are, thermodynamically, open systems that tend to maintain a steady state.
Steady state: “All rates of flows in the system are constant, so the system does not change with time.” [A steady state is a stable state!]
Eventual state of closed thermodynamic system is equilibrium.
Equilibrium: Death for a living system. (However, individual reactions in a system may be close to equilibrium.) [Death is also a very stable state.]
Metabolic pathways[Voet & Voet]
“Metabolism is the overall process through which living systems acquire and utilize free energy to carry out their various functions.”
Metabolism is enacted through metabolic pathways: chains of “consecutive enzymatic reactions that produce specific products for use by an organism”.
The metabolites in a metabolic pathway are usually taken to be the substrates, intermediates, and reactants in this chain of reactions.
Example: Glycolysis[diagram from Voet & Voet]
‘Glykos’ = ‘sweet’, ‘lysis’ = ‘loosening’ (Greek)
Widely shared mechanism of life forms for energy extraction
Overall reaction:
GLU + 2NAD+ + 2ADP + 2Pi 2NADH + 2PYR + 2ATP + 2H2O + 4H+
Stoichiometric Matrix S
Metabolic system with m metabolites, n reactions
Dynamic mass balance equation dC/dt = Sv
S = (sij) is integer-valued m by n matrix
– sij = 0 if metabolite i not involved in reaction j
– sij < 0 if metabolite i is a substrate in
reaction j (|sij| moles (units) consumed in reaction j)
– sij > 0 if metabolite i is a product of
reaction j (|sij| moles formed in reaction j)
metabolites
s11r …
e
a …
c
t …
i
o …
n
s …
s1n
s21 s2n
: :
: :
sm1 smn
Example: Glycolysis[diagram from Voet & Voet]
Example: Glycolysis[diagram from Voet & Voet]
Dynamic Mass-Balance Equation(s) dC/dt = Sv
Metabolic system with m metabolites, n reactions Ci = [Xi] := concentration in the metabolic system of
Xi := metabolite i, for i= 1,..,m
dCi/dt = si1v1 + si2v2 + ... + sinvn, where
vj= rate of reaction j := flux of reaction j
Example: Glycolysis[diagram from Voet & Voet]
Dynamic Mass-Balance
First two reactions, again.
Exercise: Repeat for extended two-reaction system, and/or first three reactions.
A Linear Algebraic Perspective
Sv = 0Null Space of S
N(S)
xTS = 0Left Null Space of S
N(ST)
Picture modified from[Famili and Palsson]
A Linear Algebraic Perspective
Sv = 0Null Space of S
N(S)
xTS = 0Left Null Space of S
N(ST)
Picture modified from[Famili and Palsson]
Focus here.
Null Space of S
Sv = 0: steady-state solutions to dynamic mass balance equation dC/dt = Sv
N(S):={y in Rn | Sy = 0}; a vector v in N(S) is a flux vector for the metabolic system (steady state)
Vectors in N(S) give dependencies among columns of S (i.e., reactions)
Example: Glycolysis[diagram from Voet & Voet]
Example: Glycolysis[diagram from Voet & Voet]
Metabolic Pathways Exercise, Step 1: Build Your Own Stoichiometric Matrix
A ‘toy’ example from [Schilling and Palsson].
Metabolic Pathways Exercise, Step 2: Read off the “Pathways”
Figures from [Schilling & Palsson]
Metabolic Pathways Exercise: Drawing “Paths”
Figure from [Schilling & Palsson]
Metabolic Pathways Exercise, Step 3: Change Basis, Read Pathways
Figures from [Schilling & Palsson]
Metabolic Pathways Exercise: Drawing “Paths” Again
Figure from [Schilling & Palsson]
From the Null Space of S to Extreme Pathways
Null space of S: Standard methods yield mathematically valid basis of N(S), but resulting vectors may not be biologically valid total flux vectors.
Base-changing: Aim for “biologically valid” basis of N(S); does such necessarily exist? Even if so, what about uniqueness?
Next: Convex hulls, extreme pathways, and examples.
Biologically “Good”Flux Vectors
In the convex hull flux cone(S), there is an analogue of a basis for N(S), only better: a generating set of ‘independent’ flux vectors P = {p1,…, pt}, unique up to taking scalar multiples, and for which every w in flux cone(S) is a unique non-negative linear combination of vectors in P.
Image from [Schilling, Letscher, Palsson]
Example: ExtremePathways (Expas)[Schilling, Schuster, Palsson & Heinrich]
Figures and tables from [Schilling, Schuster, Palsson & Heinrich]
b1
b2
b3
Basis (transposed) for N(S):
f1
f2
f3
f4
Expas: systematically independent basis P(transposed) for convex flux cone:
p1 = f1 := b1 – b2,, p2 = f2 := b1, p3 = f3 := b3 – b2 , p4 = f4 := b3
Look here!
Expas Example: Human Red Blood Cell(HRBC) [Wiback & Palsson]
Model accounts for 39 metabolites and 32 internal metabolic reactions, as well as 19 external ones (12 pri-mary exchange and 7 currency exchange fluxes).
Resulting flux cone(S) has |P| = 54; further partitioning into ‘types’ yields 39 expas of interest (36 Type I, 39 Type II).
HRBC: Some of the Type I Expas[Wiback & Palsson]
HRBC: More Type I and II Expas[Wiback & Palsson]
Outcomes include: Unique and mathematically
precise description of pathways, including key ‘historical pathways’, but extending to many ‘less intuitive’ paths that reflect network properties
Opportunity to predict system effects of enzyomapathy and other ‘load’ capacities on individual reactions
Another not-so-subtle point of the last few slides: consider advantages of good mathe-matical framework, such as linear algebra.
Enzyomapathies in HRBC [Çakir, Tacer & Ülgen]
Human red blood cell model with 44 metabolites and 39 reactions
Investigates 5 (of about 20 known) enzyomapathies (in this case, enzyme deficiencies) using metabolic pathway analysis
Follows work including [Wiback & Palsson], but using EFMs (elementary flux modes)
One aim: identify targets for drug intervention for diseases caused by enzyme alterations/dysfunction
Recall: A Linear Algebraic Perspective
Sv = 0Null Space of S
N(S)
xTS = 0Left Null Space of S
N(ST)
Picture modified from[Famili and Palsson]
Focus here.
Left Null Space of S
N(ST) = {x in Rm | STx = 0} = N(-ST)
= {x in Rm | xTS = 0} * v in N(ST) is a potential “pool map”, defining
a conservation relationship** Vectors in N(ST) give dependencies among
rows of S (metabolites)
* Rm consists of column vectors. **Details in the Appendix.
Conservation and Pool Maps Exercise, Step 1: Build Your Own Dynamic Mass-Balance Equations
A ‘toy’ example from [Nikolaev, Burgard, & Maranas]
Conservation and Pool Maps Exercise, Step 1: Build Your Own Dynamic Mass-Balance Equations (Solution)
Note differencefrom previous sign conventions; can use –S in place of prior S.
Example from [Nikolaev, Burgard, & Maranas]
Conservation and Pool Maps Exercise, Step 2: Find Conserved Cycles
Note differencefrom previous sign conventions; can use –S in place
of prior S.
Diagram from [Nikolaev, Burgard, & Maranas]
Conservation and Pool Maps Exercise, Step 2: Find Conserved Cycles
Note differencefrom previous sign conventions; can use –S in place
of prior S.
Diagram from [Nikolaev, Burgard, & Maranas]
Alternate Perspective: Reaction Maps to Compound Maps
Metabolites Nodes Reactions Arrows S, N(S) Substrates Tails of Edges Products Heads of Edges
Reactions Nodes Metabolites Arrows -ST, N(-ST) Substrates Edges Entering Nodes Products Edges Exiting Nodes
e.g., as in [Famili and Palsson]
Left Null Space of S: Compound Maps and Extreme Pool Maps
As before, basis of N(-ST) lacks uniqueness and may not be biologically interesting
As before, compute convex basis, call resulting (unique) vectors extreme pool maps (extreme pools)
Example: Extreme Pool Maps in Glycolysis[Nikolaev, Burgard & Maranas]
Glycolysis represented with 11 metabolites (16 if include ATP, ADP, NAD+, NADH, H20), and
13 reactions. Flux cone(-ST) has a
systematically independent basis with |P| = 8 vectors, so there are 8 extreme pool maps.
Diagram from [Nikolaev, Burgard & Maranas]
Example: Extreme Pool Maps in Glycolysis[Nikolaev, Burgard & Maranas]
Glycolysis offered as both a rich and sufficiently small real-life system for direct computation and analysis of extreme pools.
However, paper considers alternative methods to elucidate and analyze extreme pools for larger systems, paralleling alternate ‘flux coupling’ methods for extreme paths.
Metabolic Pathway Analysis, Extreme Pathways, and Extreme Pools: Some Consequences
Yields mathematically precise definition of metabolic pools and pathways that take a systems/network approach
Yields ‘unique’ generating set, with properties similar to vector space bases (‘minimality’ and ‘spanning’)
Gives geometrically and graphically appealing interpretations Algorithms and programs exist for computing extreme paths
and extreme pools Linear algebra framework provides accessible mathematical
framework that is rich in computational power and is a base for many other mathematical structures
Extends current biological ‘intuition’, suggests mechanisms for understanding how living systems maintain steady states and fight or fall to disease, as well as proper design of medical interventions.
Linear Algebra Again: The Four Fundamental Subspaces and S
Sv = 0Null Space of S
N(S)
xTS = 0Left Null Space of S
N(ST)
Picture modified from[Famili and Palsson]
Focus here.
Singular Value Decomposition (SVD):Learning More From S [Price, et. al.]
Recall convex basis of expas P = {p1,.., pt} for flux cone(S). Set P to be the matrix with columns p1,..pt.
Find SVD(P) = UVT. Analysis allows for comparison of extreme pathways for
different metabolic systems, and may assist in identifying key branch points (targets for regulation).
Diagram from [Price, et. al.]
‘p’ above = ‘t’ below.
Singular Value Decomposition (SVD): Learning More From S [Price, et. al.]The column vectors of U (‘modes’) give information re: flux variability within the cone. The singular values measure variance in directions given by the corresponding U vectors.
Diagrams from [Price, et. al.]
Some Issues in Use of Extreme Pathways/Extreme Pool Maps
In small to medium systems, expas/expools can be calculated, but giving their biological interpretations is not automated!
Scaling to genome-level an issue: Implementation of original algorithms for computing flux cones (like expa) problematic for large systems: computational round-off error for large S, combinatorial explosion and NP completeness issues arise…
Perspective may be enhanced by comparison with other linear algebra/convex analysis/linear programming methods*, e.g., EFMs, FCA, MCCA and MCPI, FluxAnalyzer…
Dynamic and regulatory information are not, in general, treated in MPA (metabolic pathway analysis) approach.
Biologists are not out of jobs: good biological data and physical approaches to flux determination (isotope labeling, etc.) still important.
*Still other approaches/enhancements exist, making use of aspects of probability and statistics, etc…
References
Schilling and Palsson, The underlying pathway of biochemical reaction networks.
Famili and Palsson, The convex basis of the left null space of the stochiometric matrix leads to the definition of metabolically meaningful pools.
Schilling, Schuster, Palsson and Heinrich, Metabolic pathway analysis: Basic concepts and scientific applications in the post-genomic era.
Çakir, Tacer and Ülgen, Metabolic pathway analysis of enzyme-deficient human red blood cells.
Suthers, Burgard, Dasika, Nowroozi, Van Dien, Keasling, Maranas, Metabolic flux elucidation for large-scale models using 13C labeled isotopes.
Price, Reed, Papin, Famili and Palsson, Analysis of metabolic capabilities using singular value decomposition of extreme pathway matrices.
Schilling, Letscher, and Palsson, Theory for systematic definition of metabolic pathways and their use in interpreting metabolic function from a pathway-oriented perspective.
Wiback and Palsson, Extreme pathway analysis of human red blood cell metabolism.
Nikolaev, Burgard, and Maranas, Elucidation and Structural Analysis of Conserved Pools for Genome-Scale Metabolic Reconstructions.
References
Voet and Voet, Biochemistry (3rd Ed.)
Bell and Palsson, expa, a program for calculating extreme pathways in biochemical reaction networks.
Becker, Feist, Mo, Hannum, Palsson, and Herrgard, Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox.
Schuster, Dandekar, and Fell, Detection of elementary flux modes in biochemical networks: a promising tool for pathway analysis and metabolic engineering.
Klamt, Stelling, Ginkel and Giles, FluxAnalyzer: exploring structure, pathways, and flux distributions in metabolic networks on interactive flux maps.
Palsson, Representing Reconstructed Networks Mathematically: The Stochiometric Matrix (lecture series).
Systems Biology Research Group, http://gcrg.ucsd.edu/
Schuster, Fell, and Dandekar, A general definition of metabolic pathways useful for systematic organization and analyis of complex metabolic networks.
Schuster and Hilgetag, On elementary flux modes in biochemical reaction systems at steady state.
Schuster, Hilgetag, Woods and Fell, Reaction routes in biochemical reaction systems: algebraic properties, validated calculation procedure and example from nucleotide metabolism.