Constructing a Quantitative Spatio-temporal Atlas of Gene...

1

Constructing a Quantitative Spatio-temporal Atlas of Gene

Expression in the Drosophila Blastoderm

Charless C. Fowlkes1,2,, Cris L. Luengo Hendriks1,3, Soile V. E. Keränen1,4, Gunther H. Weber1,5,

Oliver Rübel1,5, Min-Yu Huang1,5, Sohail Chatoor1,3, Angela H. DePace1,4, Lisa Simirenko1,4, Clara

Henriquez1,4, Amy Beaton4, Richard Weiszmann4, Susan Celniker1,3, Bernd Hamann1,5, David W.

Knowles1,3, Mark D. Biggin1,4, Michael B. Eisen1,4*, Jitendra Malik1,6*

1 Berkeley Drosophila Transcription Network Project 2 School of Information and Computer Science, University of California, Irvine, CA 92697, USA 3 Life Sciences Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA 4 Genomics Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA 5 Institute for Data Analysis and Visualization (IDAV) and Department of Computer Science, University of California, Davis, CA 95616, USA 6 Department of Electrical Engineering and Computer Science, University of California, Berkeley, CA 94720, USA * Corresponding Authors Email: [email protected], Phone: 510-486-5214, Fax: 786-549-0137 Email: [email protected], Phone: 510-642-7597, Fax: 510-642-5775

2

SUMMARY To fully understand animal transcription networks, it is essential to accurately measure the spatial

and temporal expression patterns of transcription factors and their targets. We describe a registration

technique that takes image-based data from hundreds of Drosophila blastoderm embryos, each co-

stained for a reference gene and one of a set of genes of interest, and builds a model VirtualEmbryo.

This model captures in a common framework the average expression patterns for many genes in

spite of significant variation in morphology and expression between individual embryos. We

establish the method’s accuracy by showing that relationships between a pair of genes’ expression

inferred from the model are nearly identical to those measured in embryos co-stained for the pair.

We present a VirtualEmbryo containing data for 95 genes at six time cohorts. We show that known

regulatory interactions within the network can be recovered from this dataset and predict hundreds

of new interactions.

3

INTRODUCTION The output of animal transcription networks are dynamic, three-dimensional patterns of gene

expression. The expression level of each gene can vary radically between neighboring cells and

these intricate spatial variations play a major role in determining animal body plans. It is a grand

challenge to decipher the transcriptional information encoded in animal genomes to the point where

we can model and predict such patterns. Developing techniques that accurately characterize and

analyze gene expression patterns in the context of changing morphology is a critical step towards

this goal.

Spatial patterns of protein and mRNA expression are being systematically recorded and catalogued

by various approaches over a range of spatial and temporal resolutions in several animal systems,

e.g., Drosophila (Simin et al., 2002; Tomancak et al., 2002; Myasnikova et al., 2001; Tomancak et

al, 2007), Xenopus (Pollet et al., 2005), zebrafish (Kudoh et al., 2001), the ascidian Ciona

intestinalis (Imai et al., 2006; Tassy et al., 2006), mouse embryos (Neidhardt et al., 2000; Visel et

al., 2004), and mouse brains (Carson et al., 2005; Davidson et al., 1997; Gong et al., 2003, Lein et

al., 2007). These data sets, however, do not provide a comprehensive quantitative description of

gene expression in a whole developing embryo at the spatio-temporal resolution needed for detailed

computational modeling of animal transcription networks in three dimensions. For example, in situ

datasets constructed from images of alkaline phosphatase-based probes have provided only

qualitative, 2D descriptions of gene expression patterns that lack cellular resolution and critical

morphological detail (e.g., Tomancak et al., 2002) Kumar et al., 2002; Peng et al., 2006). Perhaps

the most comprehensive and automated expression atlas construction efforts to date are of mouse

brain (Carson et al., 2005; Davidson et al., 1997; Gong et al., 2003, Lein et al., 2007) imaged via

4

serial sectioning and registered using anatomical features. These techniques, however, currently

yield relatively low temporal and spatial resolution (along the sectioning axis) and limited

quantitation of gene expression.

To address the need for highly quantitative spatial expression data, we have developed methods,

based on fluorescence microscopy, that measure relative concentrations of gene products in three

dimensions over an entire Drosophila blastoderm embryo at the resolution of individual cells

(Luengo Hendriks et al., 2006; Keränen et al., 2006) along with tools for interactively visualizing

such data (Rübel et al., 2006; Weber et al., 2007). These methods have already revealed previously

unknown features of blastoderm gene expression and morphology, such as an intricate, dynamic

pattern of nuclear densities that change over time along both body axes (Luengo Hendriks et al.,

2006; Keränen et al., 2006). These results illustrate the importance of, and were dependent upon,

capturing data for the whole embryo at cellular resolution.

A serious difficulty encountered in all current strategies for quantitating spatially resolved gene

expression is that it is not possible to label the expression of more than a few gene products in a

given animal or tissue (e.g., Kosman et al., 2004; Levsky et al., 2002). Yet, even simple portions of

animal transcription networks can comprise tens of regulators and hundreds of target genes (e.g.,

Stathopoulos et al., 2002; Liang and Biggin, 1998; Oliveri and Davidson, 2004). A single cis-

regulatory module (CRM) may often be bound by five, ten or even more co-localized regulators

(see e.g., Yuh et al., 2001). Therefore, to analyze how spatial and temporal changes in transcription

factors correlate with changes in expression of their targets, it will be necessary to simultaneously

5

compare the expression levels of many more gene products than is possible with conventional

microscopy.

In this paper, we present a computational technique that overcomes this limitation by compositing

independent expression measurements made from images of hundreds of different embryos into a

common spatio-temporal atlas in which the average expression patterns of many gene products can

be studied simultaneously. Our technique involves two key components, outlined in Figure 1.

The first is a spatial registration algorithm that uses a reference gene expression pattern common to

all labeled embryos to help identify equivalent corresponding cells or nuclei across images of

multiple embryos at the same stage of development. The resulting correspondences are used to map

expression measurements for other genes, whose expression was labeled in only a subset of

embryos, onto a common model.

The second component is a dynamical morphological template that specifies the average locations

of cells or nuclei over time, providing temporal correspondences between nuclei in embryos imaged

at different developmental time points (Figure 1). This morphological template consists of an

average number of nuclei whose three-dimensional (3D) motions track changes in the average

blastoderm shape and nuclear packing density between each of six temporal cohorts during the 50

minutes prior to gastrulation (Fowlkes and Malik, 2006; Keränen et al., 2006).

Once correspondences have been established among embryos within and between cohorts,

expression measurements are combined into a single composite model, termed a VirtualEmbryo,

6

which describes the average patterns of expression for many genes at multiple time points (Figure

1).

In developing our method, we measured significant geometric variability between individual

embryos at the same developmental stage both in their size, shape, number and positions of nuclei

and in the locations of gene expression patterns. We show that our registration methods correctly

take this geometric variation into account by quantitatively analyzing the relationship between pairs

of genes, a measure which is independent of the exact spatial location of cells. We find that the

relationship between a pair of genes’ expression in individual embyos co-stained for the pair is

similar to the average co-expression recorded in the model, thus demonstrating that the

VirtualEmbryo accurately describes average patterns of gene expression present in individual

embryos. Finally, to establish the utility of this comprehensive, quantitative spatial expression data,

we carry out a statistical analysis for a set of 17 regulatory factors and 95 target genes which is able

to recover many known regulatory interactions from the literature and predicts many new ones.

RESULTS

Data acquisition pipeline

Our previously established pipeline was used to obtain quantitative measurements of gene

expression in individual embryos (Luengo Hendriks et al., 2006). Briefly, embryos were fixed and

fluorescently stained to label the mRNA expression patterns of two genes and nuclear DNA. One of

the genes labeled was either eve or ftz, which were used as fiduciary markers for subsequent spatial

registration. Embryos were manually staged into one of six temporal cohorts based on the extent of

cell membrane invagination (e.g., stage 5:0-3% membrane invagination) and imaged by two-photon

7

microscopy. Using the DNA marker, the location and extent of each blastoderm nucleus was

automatically determined and its location was recorded along with expression levels of the two

genes in the nucleus and surrounding cytoplasm.

The resulting data structure, which we call a PointCloud, contains 3D coordinates of the center of

mass of each nucleus, average fluorescence within each nucleus and the surrounding cytoplasm,

estimates of nuclear and cytoplasmic volumes, neighborhood relationships between nuclei, and

experimental meta-data associated with a given embryo. This provides a compact representation of

the image data for each embryo that is readily amenable to further computational analysis, while

faithfully capturing the blastoderm morphology and gene expression patterns measured.

PointClouds were generated for 1822 embryos, including data for 95 genes over the 50 minutes

prior to the onset of gastrulation.

Spatial registration

Analysis of the PointCloud data suggests that even embryos at approximately the same

developmental stage vary quite significantly in their size, shape, number and distribution of nuclei,

and in the relative spatial locations of gene expression patterns (e.g., Figure 2). This geometric

variability in the data describing individuals must include true biological variations among embryos

as well as measurement errors and physical deformations introduced by our methods.

For example, the PointCloud data showed that the embryos imaged ranged in overall egg-length

from 301µm to 502µm (mean=398µm, std dev=29.4µm). Although some of this size variation

certainly results from the fixation, staining and mounting procedures the embryos were subjected to,

8

much of it likely represents true biological variation since egg length and blastoderm surface area

showed a marked correlation with the total number of peripheral nuclei (r = 0.62 and r=0.64, Figure

2A,B). While our counts of nuclear number (which exclude yolk nuclei and pole cells) are also

subject to some small error on the order of a few percent (Luengo Hendriks et al, 2006), these errors

are too small to explain the measured variation and should not correlate with egg length or surface

area. Thus significant biological variation in the shapes of individual embryos must exist prior to

any experimental manipulation. Indeed, the variation in embryo size we measured is comparable to

reports for embryos that have not been fixed and stained (Warren, 1924; Azevedo et al, 1996) and

our automated counts of nuclei number are comparable to those derived from manual counting of

nuclei in a few embryos (Zalokar and Erk, 1976; Turner and Mahowald, 1976). Although a

correlation between embryo size and nuclei number among members of a population has not been

noted previously, our observation is consistent with the results of embryo ligation experiments,

which have shown that nuclear divisions are controlled by local nuclear densities (reviewed by

Edgar et al, 1986). Thus, our PointCloud data can be used to detect many aspects of biological

variation which have not been realistically measurable before.

Such large geometric variation makes comparing gene expression among individuals non-trivial, as

some technique more sophisticated than simply overlaying embryos is required to identify

equivalent corresponding cells in different embryos. Unlike the adult animal, the blastoderm lacks

distinctive morphological features that identify particular cells or tissues. Instead, nuclei are

primarily distinguished by the expression levels of transcription factors and other regulators that

control development, including the genes whose expression we measured. Therefore, our spatial

9

registration method seeks to identify corresponding nuclei in different embryos which have similar

gene expression profiles.

Broadly, our spatial registration method works as follows. We first use data from all PointClouds in

each temporal cohort to build a morphological template with a fixed number of nuclei arranged to

match the average measured nuclear distribution and mean embryo shape. The template also

specifies the mean locations of the ftz and eve marker gene expression boundaries (Figure 1). We

compute a smooth deformation of each individual PointCloud to warp both the extracted marker

gene boundaries and the overall PointCloud shape into alignment with the template (see Figure 3).

For each nucleus, a best match in the template is identified, establishing correspondences for all

cells across all embryos in the cohort. Once detailed correspondences among embryos have been

found, estimates of the average locations of marker genes specified in the template are refined and

the spatial registration process repeated until further iterations provide no significant changes in the

correspondence. We now describe this process in more detail (see also Experimental Procedures).

An initial estimate of the deformation required to bring PointClouds of a given cohort into register

with the template was computed by automatically identifying the anterior-posterior (A-P) axis and

scaling the PointClouds to the average egg length for that cohort. To establish the dorsal-ventral (D-

V) orientation, the location of the ventral midline was judged by eye using gene expression and

morphological markers. While this coarse alignment step did bring the reference genes into closer

alignment by factoring out gross variations in overall embryo size, significant differences remained

in the locations of expression patterns, as well as residual differences in PointCloud shape and the

distribution of nuclei. For example, Figure 2C, D shows the standard deviation in the A-P locations

10

of ftz expression boundaries before and after scaling embryos to the average egg length. Measuring

expression boundaries with respect to egg length removes up to half the apparent variability in some

stripe boundary locations but standard deviations after scaling are still near 30% the average stripe

width.

To determine correspondences in a way that factors out this remaining non-rigid geometric

variation, we carried out a fine registration step in which the embryos in each temporal cohort were

registered onto the morphological template using the boundaries of either ftz or eve expression

domains. These two markers provided high contrast boundaries (up to four-fold change in

fluorescence between neighboring nuclei) whose locations were easy to extract by identifying

maxima of the gradient magnitude along the blastoderm surface. At a given coordinate around the

embryo’s D-V circumference, these genes provide 14 well-defined A-P locations spaced along the

trunk region (Figures 1 and 3). To match stripe boundaries extracted from each PointCloud to the

template, we performed an alignment using dynamic programming along the A-P axis at each of 40

D-V circumference coordinates spread uniformly around the circumference of the embryo. This

matching allowed for the possibility of missing edge points or falsely detected edges arising from

non-specific staining (see Experimental Procedures).

After matching points along stripe boundaries were identified, a non-rigid 3D coordinate

transformation, or warp, was computed that brought the corresponding points into alignment with

the template (Figure 3). We represented this coordinate transformation using a non-parametric

deformation model, the regularized thin-plate spline (TPS) (Duchon, 1977; Meinguet, 1979;

Wahba, 1990). The TPS is the higher-dimensional generalization of the cubic spline and has been

11

shown effective for modeling variation in biological forms (Bookstein, 1989). We found that in

practice, the TPS yielded quantifiably better registration results than other splines we tested (see

Experimental Procedures). Once all PointClouds had been warped into alignment with the template,

each nucleus in the morphological template is assigned to the nearest PointCloud nucleus. This

many-to-one matching, which allows multiple nuclei in the template to correspond to the same

detected nucleus, is appropriate since the total number of nuclei varies across embryos. Averaging

the marker gene expression measurements from individuals onto the template gives an improved

estimate of the template stripe boundary locations. The entire process of deriving correspondences

was repeated using this improved template until no further significant change was observed between

iterations.

The warping of individual PointClouds during fine registration not only establishes more accurate

correspondences between cells, but also provides an estimate of the geometric variation among

PointClouds, excluding that component due to isotropic variation in size (which was removed by

coarse alignment). Supplementary Figure 1 shows example deformations required to bring

individual PointClouds into alignment with the template during fine registration. Supplementary

Figure 2 shows a principal components analysis (Mardia et al., 1979) of these spatial deformations.

The figure indicates the directions along which nuclei were displaced relative to the average shape

but the magnitude of the displacement has been greatly magnified for the purposes of visualization.

The dominant mode of this geometric variation, which explained more than 45% of the variance in

displacement fields for all cohorts, was an anisotropic “bulging” about the blastoderm center. We

also measured overall shifts along the A-P axis in the location of the trunk domain specified by the

marker genes (8% of variance), curving of the embryo axis (6% of variance), and more subtle

12

variations in the asymmetry between anterior and posterior ends and dorsal and ventral sides of the

PointClouds (Supplementary Figure 2, rows 2, 3, 4).

Temporal correspondence

To track the temporal dynamics of gene expression, it is necessary to estimate correspondences

between the nuclei in successive temporal cohorts. While there are no nuclear divisions during the

50 minute period spanned by our cohorts, we previously showed using both fixed and live embryo

data that there are changes in the locations of both nuclei and gene expression patterns (Keränen et

al., 2006). Careful measurements of several genes, including eve and ftz, showed that expression

pattern domains move relative to the lattice of nuclei, and thus our marker gene boundaries cannot

be used to identify corresponding nuclei over time (Keränen et al., 2006). Furthermore, based on

nuclear tracking in live imaging and nuclear density measurements from fixed material, we found

that, as well as moving basally and elongating, nuclei flow from the anterior and posterior poles

towards the dorsal surface (Keränen et al., 2006). Previous models of the Drosophila blastoderm

have tacitly assumed that a unique spatial coordinate identifies the same cellular/nuclear location at

different times (e.g. Jaeger et. al. 2004; Lucchetta et. al. 2005; Lott et. al. 2007). Our results clearly

indicated this assumption is not valid as localized contractions or expansions of the blastoderm

surface mean some nuclei consistently travel as far as three cell diameters during stage 5 (Keränen

et al., 2006).

In order to take these nuclear movements into account in establishing correspondences across cohort

templates, we developed a method to infer nuclear movements from fixed material (Fowlkes and

Malik, 2006). The average embryo shape and nuclear density pattern from each cohort was used to

13

constrain a numerical model that predicted the direction and distance each nucleus needed to move

through space to account for the measured changes in density and shape between cohorts. Based on

the behavior of nuclei observed in live imaging (Keränen et al., 2006), our model assumed that

nuclear movements were smooth and small, and that the total number of nuclei was conserved. The

resulting morphological template time-series, which specified the locations of 6078 nuclei for each

of the six temporal cohorts, was used for spatially registering and compositing expression data as

described above. This provided the needed correspondences between nuclei at different time points

(Figure 1).

Compositing expression levels

Before averaging expression levels for each gene onto the morphological template, it was necessary

to put fluorescence measurements from different embryos onto a common scale. While our data

does provide an accurate measure of relative mRNA expression levels of a gene between cells

within an individual embryo (Luengo Hendriks et al., 2006), our fixation, and staining methods

result in variation in the absolute fluorescence between different embryos. In particular, the

absolute degree of fluorescence varied significantly across embryos stained in different batches,

presumably due to variable reaction time and efficiency and other experimental artifacts.

To mitigate this error, we normalized mRNA expression levels for each gene across multiple

batches by assuming that the total expression level of a gene was the same between embryos in the

same cohort, and that the fluorescence levels measured in different batches were related by a single

multiplicative factor. Once expression levels were normalized, we averaged peak expression levels

for each gene across all embryos within each temporal cohort in order to estimate the degree of total

14

up- or down-regulation between cohorts (see Experimental Procedures). As Supplemental Figure 3

shows, the relative temporal changes in averaged fluorescence levels measured for each gene were

reasonably consistent across staining batches. These expression time-courses matched general

expectations based on other data, suggesting that absolute differences in fluorescence provide a

useful estimate of relative changes in expression levels between cohorts.

To remove any remaining differences in the quantitation of each gene between embryos within a

cohort, we chose a scale and offset for each embryo that minimized the average standard deviation

in expression between embryos, subject to the constraint that the maximum average expression level

matched the estimated time course. The scaled expression data for each gene from each PointCloud

was then transferred to the corresponding nuclei in the appropriate cohort template and averaged

together.

The atlas

The result of registration and compositing is a VirtualEmbryo that describes the average dynamics

of gene expression and morphology in the blastoderm. Figure 4 shows example lateral views of the

final average expression patterns recorded in the VirtualEmbryo for several genes displayed in

cylindrical projection. This “unrolling” of the blastoderm surface allows one to see the entire

pattern of expression laid flat. As the embryos are approximately bilaterally symmetric, we show

only the left side from dorsal midline to ventral midline. Supplementary Figure 3 shows the

complete set of mRNA patterns for 95 different genes estimated from 1822 embryos and over 11

million cells. The genes analyzed include many of the known early acting transcription factors that

specify patterning prior to gastrulation in Drosophila. For 23 of these factors, we have combined

15

mRNA data from 25 or more embryos spanning the entire 50 minutes leading up to gastrulation.

For the remaining 72 genes, which are known or putative targets of these early transcription factors,

we have collected mRNA data from a smaller number of embryos falling primarily in the three

temporal cohorts just prior to gastrulation.

In addition to the average description present in the VirtualEmbryo, we also provide the individual

raw PointClouds, files recording the nuclear correspondences identified between each individual

PointCloud and the VirtualEmbryo, and associated meta-data. We refer to this comprehensive

dataset as a “gene expression atlas”. Our VirtualEmbryo and the initial atlas release are publicly

available through a web-based interface hosted at http://bdtnp.lbl.gov/. [[For purposes of review,

we have enclosed a pdf file containing screen shots of this part of our website as it will appear when

our work is published. A previous release containing of a smaller set of individual PointClouds is

currently available for viewing at the link given. ]]

Evaluating registration quality

It is nearly impossible to judge by eye if the correspondence between nuclei in two different

embryos is “correct” as most blastoderm nuclei lack any identifying morphological features. Since

our method for determining correspondences is based on finding equivalent nuclei using gene

expression data, one objective criterion we have used to evaluate the accuracy of our registration is

to measure the extent to which nuclei from different individuals identified as corresponding have

similar expression profiles. A second evaluation criterion is to ask how accurately the mean

expression data in the VirtualEmbryo quantitatively captures the relationships between

transcriptional regulator and target gene expression patterns as measured in individual embryos co-

16

stained for the target and the regulator. This evaluation speaks directly to the utility of the

VirtualEmbryo for modeling transcriptional regulation.

Registration decreases the apparent variation between PointClouds.

In principle, a perfect registration method would remove all of the differences between individuals

that are attributable to geometric variation. Any remaining variation would then constitute measured

expression differences between otherwise equivalent cells. Such non-geometric variability could

arise from measurement error or from genuine regulatory variability, such as stochasticity in the

response of targets to their regulators (see e.g., Gregor et al., 2007). Quantifying non-geometric

variability is important as it limits the degree to which any registration method can possibly reduce

the apparent expression variation among embryos. Although the true amount of regulatory

variability between embryos is uncertain, below we describe an experimentally derived upper-

bound and measure how close our registration results are to achieving this limit.

First we examined the effect of registration on the apparent variation in expression levels for

reference and non reference genes. Figure 5A,B shows the mean and standard deviation in

expression levels at a given time point (stage 5: 50-75%) for three different genes along a lateral A-

P strip (n=100, 25, and 8 embryos for ftz, slp1 and gt respectively). For comparison, the variation in

expression is plotted when nuclear correspondences have been determined by coarse alignment

alone, i.e., assuming nuclei at the same relative spatial location along the A-P axis correspond

(Figure 5A). We quantified the apparent inter-embryo variation in expression for each gene in a

cohort by averaging the standard deviation across all corresponding nuclei in each PointCloud

containing data for the gene (Figure 5A,B, top right of each panel; Supplemental Table 1). The

17

apparent inter-embryo variation is lower when correspondences are derived from fine registration

rather than coarse alignment alone. Furthermore, after fine registration our estimates of the average

patterns become less “blurry” and more like those observed in individual embryos, particularly for

highly modulated pair-rule patterns (e.g., ftz and slp1).

The decrease in apparent variability of our registration marker (here ftz) suggests that our fine

registration accurately identified corresponding ftz boundaries and warped them into alignment.

More importantly, the apparent inter-embryo variation also decreased for “held out” expression

patterns (e.g., slp1 and gt) which were not used during the fine registration process. This indicates

that our deformation model (TPS warping) makes good predictions about how all nuclei should be

shifted based only on how the nuclei near ftz or eve boundaries are shifted. This holds true both for

trunk patterns that are close to the marker expression stripes (e.g., slp1) and for cephalic patterns

(e.g., anterior gt) that are relatively far from the registration marker. Supplementary Table 1 shows

the change in apparent variability between coarse alignment and fine registration for 23 early

factors. The decrease in variation holds true for most of these genes and time-points. One general

exception is D-V patterning genes, suggesting that the precise locations of D-V expression

boundaries are fairly uncorrelated with our A-P registration markers. Interestingly, the importance

of fine registration generally increases as patterns become more sharply localized over the course of

stage 5.

Expression relationships are similar in co-stained embryos and registered PointClouds

While the above analysis shows that fine registration yields better correspondences than coarse

alignment alone, it does not address what fraction of the remaining variation in expression between

18

individuals after fine registration is due to remaining geometric variability and what fraction is

attributable to non-geometric causes. To experimentally establish a baseline on the non-geometric

variability measured among embryos, we directly estimated the relation in expression levels

between pairs of genes in individual embryos co-stained for both genes. This measurement is

invariant to spatial deformations of the individual embryos so any variability measured in the co-

expression across co-stained embryos must arise from non-geometric sources. We can directly

compare the variability in this relation to that inferred by registering data from pairs of embryos

where each embryo was stained for only one of the two genes in order to judge how much

geometric variability remains after registration. The average co-expression relationship estimated

from directly co-stained embryos also provides a “gold standard” against which we can compare the

average co-expression recorded in the VirtualEmbryo inferred from the registered pairs.

Figure 5C shows the relation in expression levels of eve and gt within a three cell wide lateral strip

at the anterior boundary of eve stripe 2 in embryos co-stained for both eve and gt. Each point in the

plot gives the joint expression level measured in a single nucleus from one of 44 embryos at stage

5:9-75%. It is known that gt plays a key role in determining the anterior boundary of eve stripe 2.

This regulatory relationship is revealed in the tight anti-correlation between the two genes’

expression in this part of the embryo (Figure 5C). The blue curve in Figure 5D shows the average

relation between the two factors estimated by binning nuclei based on the level of gt expression and

then computing the average level of eve for each bin. The variability in the relation between the two

genes was quantified by computing the standard deviation for eve expression in each bin. We

measured a maximum standard deviation of 0.21 (relative to a maximal eve expression of 1). This

measure of local variability is comparable in magnitude to similar measurements made on the

19

relation of bcd and hb (Gregor et al., 2007). Because this variation is measured within individual

embryos, it cannot be due to geometric variation and thus is not removable by our registration

method. Instead, this variation is likely due to some combination of variability in the regulation of

eve by gt, spatial variations in other unmeasured regulatory factors that also influence eve

expression, and error in our measurements of mRNA concentrations.

Having set an upper bound for non-geometric variation, we then used the co-stained embryo data to

quantify the effect of geometric variation between embryos on the apparent relationships between

regulator and target expression in coarsely registered embryos. Pairs of co-stained embryos were

selected at random and the level of eve in one nucleus was compared to the level of gt in a nucleus

at a corresponding spatial location in the other embryo. The red curve in Figure 5D shows that the

resulting estimate of co-expression in coarsely aligned embryos is significantly different from that

derived from individual co-stained nuclei. Furthermore the apparent variability in the level of eve

as a function of gt is significantly larger. In contrast, if we sample pairs of nuclei from different

embryos that were identified as corresponding by the fine registration process (in this case using

eve as the reference gene) the resulting curve (green in Figure 5D) is very similar to the direct co-

stain curve and has a similarly small apparent variability. This implies that while the exact location

of eve stripe two varies significantly from one embryo to the next (we measured similar variability

to that reported for ftz in Figure 2) the pattern of gt is shifted in a correlated manner. Supplemental

Table 1, which specifies the change in the apparent variability of individual genes before and after

fine registration, thus characterizes the extent to which expression patterns of individual genes are

correlated with those of our registration markers.

20

Since most genes are spatially correlated with our registration marker in individuals, this suggests

that the composite VirtualEmbryo data should accurately capture average regulatory relationships

between individual genes as well as between each gene and the registration marker. We verified

this experimentally as displayed in Figure 5E, which shows the co-expression levels for eve and gt

estimated from a VirtualEmbryo constructed using embryos stained for either eve and ftz or gt and

ftz. In this experiment, embryos were registered using ftz as the marker gene and pairs of

corresponding nuclei from different embryos were used to estimate the level of eve as a function of

gt. As the green curve in Figure 5E shows, even though gt and eve are never observed in the same

embryo, the mean functional relationship inferred from the virtual co-expression measurements is

quite close to that inferred from co-stained embryos. The resulting average estimate deviates from

the co-stain estimate by less than 7% of the maximum expression level and has a maximum

standard deviation of 0.30. In contrast, using only coarse alignment to register these same embryos

yields an average relation with a different shape (red curve, Figure 5D) and larger variability (max

std dev = 0.36). Further analysis suggests our estimates of the variability after registration may be

quite conservative (see Experimental Procedures).

In summary, the results of testing our two evaluation criteria suggest that the registration process

successfully factors out a large fraction of the geometric sources of variability, leaving an average

estimate of expression which is quite close to that measured in individual embryos.

Inferring regulatory interactions from spatio-temporal expression data

One of our chief motivations for developing the blastoderm expression atlas is to help determine

which transcription factors regulate which target genes. It has generally been shown that the

21

expression patterns of regulators frequently correlate, at least in some portion of an animal, with

those of their target genes. For example, the anti-correlation of gt expression with eve expression

across the anterior border of eve stripe 2 described previously (Fig. 5). In principle, it ought to be

possible to infer regulatory relationships by searching for such correlations. It is unlikely that the

resulting inferences will always be correct, and cannot be taken on their own to indicate that a

transcription factor directly binds and regulates a target gene. They should, however, provide a

significant constraint on possible models for the network, which when combined with other classes

of data such as genome-wide in vivo binding data and in vitro DNA specificity data, will lead to a

deeper understanding of how combinations of factors act together to generate specific

transcriptional outputs.

To demonstrate the utility of the composite expression atlas for inferring regulatory interactions, we

performed a regression analysis to predict regulatory interactions based solely on the spatial

expression data. Since the major transcriptional regulators in the Drosophila blastoderm are known,

we searched for a subset of these potential regulators whose combination best predicted the spatio-

temporal dynamics of each of the 95 genes whose expression we have recorded. Let M(x,t) be the

level of mRNA transcription of a given target in a blastoderm nucleus x at time t and P(x,t) be a

vector where the ith entry specifies the protein concentration of the ith potential regulator. We

sought to identify a set of regulators P and a function F that predicted the target expression M as a

function of the regulator concentrations for all nuclei and cohorts, i.e. M = F(P) for all x,t.

We used a generic form for F consisting of a sigmoid applied to a linear combination of the protein

concentrations (with weighting vector a) and a constant offset b. This linear model was explicitly

22

choosen to be as simple as possible, e.g., ignoring complexities such as possible heteromeric

interactions between transcription factors. To fit the parameters a and b to our observed data, we

performed least squares minimization over all nuclei at all time points for which we had data and

characterized the goodness of fit for each target (see Experimental Procedures).

Because the individual components of blastoderm expression patterns (e.g. each stripe) are often

controlled by distinct cis-regulatory modules (CRMs) (e.g. Fujioka et. al. 1999), we automatically

segmented each target output pattern into individual expression domains and fit each such “module”

independently, assuming that expression elsewhere was zero. This let us use a simple form for the

regulatory function F while still allowing, say KR to repress eve stripe 5 but activate eve stripe 3.

Even for this simple linear model, there are still a large number of potential regulators, which may

result in overfitting. In preliminary experiments, models fit using the complete set of regulators

typically assigned small weights to a large number of factors, limiting their explanatory power. In

contrast, the presence or absence of even a single regulator in vivo has been shown to often have

dramatic effects on the patterns of target genes (see e.g. Carroll 1989; Clyde 2006). Thus, to make

fits more interpretable, we restricted the model space by only considering those weighting vectors a

with 6 or fewer non-zero entries. We selected the best 6 regulators from a pool of 17 early acting

transcription factors by exhaustive feature selection, choosing that set of 6 regulators which best

predicted the each target pattern (largest R2 ).

Because protein expression patterns differ both spatially and temporally from mRNA expression

patterns, we added additional PointCloud data to the atlas from 302 embryos stained with antibodies

23

to BCD, HB, GT, or KR proteins (see Experimental Procedures). Expression patterns for these

proteins are shown in Supplemental Figure 5. Since we lacked protein data for the remaining 13

early transcription factors, we approximated their protein patterns by the distribution of their mRNA

with a fixed temporal delay of two cohorts (roughly 16 minutes), i.e. )16,(),( !" txMtxPii

. For

those strictly zygotic factors for which we measured both mRNA and protein expression data, we

have found that this temporal delay of the mRNA pattern gave the closest match to the gene’s

protein pattern (e.g., Luengo et al, 2006). In addition, using this two cohort delay to approximate

protein patterns yielded the best average quality fit across all genes on held out data in our

regression analysis.

Figure 6 A and B shows the correlation coefficients of each of 17 regulators (columns) determined

by the fitting process for each target eve and gt stripe (rows). Green indicates predicted activators,

red indicates repressors, and black indicates unused regulators (zero entries). Supplemental Figure 6

shows similar fits for all 238 “modules” for the 95 genes in the atlas and Figure 6C summarizes the

distribution of R2 values for all modules.

Broadly speaking, the R2 goodness-of-fit values demonstrate that we can fit much of the expression

data quite well with our relatively simple linear model. Of the 238 modules, 202 (85%) were fit with

an R2 value of 0.5 or greater. This suggests that this small set of 17 regulators contain enough

spatial information to generate the wide variety of cell expression profiles that are apparent by the

end of stage 5.

24

In addition, the regression analysis contains many correct predictions. For example, the analysis

correctly predicts HB as an activator of eve stripe 2 and KR, KNI and GT as repressors (Small et al,

1992; Arnosti et al, 1996). Similarly, for gt stripe 5 the analysis correctly predicts repression by HB

and Huckebein (Eldon and Pirrotta, 1991). Interestingly, the analysis predicts regulation of A-P

target genes by D-V regulators. For example the gt stripes clearly have a D-V pattern which is

picked out by the regression (regulation by SNA and BRK).

Not surprisingly, this model also has some clear failures. For example, BCD does not appear as an

activator in many cases, including for one of its best characterized targets, eve stripe 2. This is not

too surprising since the method will tend to pick regulators whose protein expression changes

sharply near boundaries of the target pattern, while BCD has a graded expression pattern. Another

limitation is that the expression “modules” that we automatically identified may not correspond to

the output of distinct CRMs. For example in eve, stripes 4 and 6 are both controlled by the same

regulators acting via a single CRM and stripes 3 and 7 both by another CRM (Clyde 2003). This

may in part explain the relatively poor quality of fits in Figure 6A to these stripes. The addition of

such known biological information to the model, as well as addition features, such as cooperative or

competitive interactions between pairs of factors acting on CRMs, will no doubt allow even more

precise fits to target expression patterns.

Finally, the comparison of the prediction’s of the regression analysis to results in the literature

underlines the well known difficulty in correctly divining regulatory interactions within this

complex network. For example, our analysis predicts SLP as a regulator of several gt stripes and yet

in slp loss of function mutant embryos, gt expression is not affected (Eldon and Pirrotta, 1991).

25

Such loss of function genetic experiments, however, cannot rule out the possibility of functionally

redundant regulatory interactions, as revealed in other cases by more detailed experiments (e.g.

Laney and Biggin, 1996; Manoukian and Krause, 1992; Walter et al, 1994), and thus cannot

disprove predictions of our regression analysis. This and other complexities of the system suggest

that picking apart network interactions will require careful consideration of multiple datasets.

DISCUSSION

Our work establishes the first spatio-temporal quantitative atlas of gene expression and morphology

for a whole embryo at cellular resolution. By using registration to bring quantitative gene expression

data for many genes into a common spatio-temporal frame, our work opens the way for quantitative

analyses of the large networks of interactions between developmental regulators and their targets.

Our initial regression study demonstrates the utility of this dataset for such analyses. The

correspondences established between cells in different embryos also provide a unique tool for

exploring variations in morphology and gene expression between individuals, beyond those already

noted in this paper. We anticipate that the composite VirtualEmbryo expression and

correspondence data derived here will allow novel insights into many aspects of the system-level

biology of developmental regulatory networks.

Accuracy of Registration

In general, registration techniques are designed to establish correspondences by factoring out a

certain class of variations between individuals (typically geometric variation in size and shape etc.)

while maintaining other variations of interest. Characterizing the performance of a particular

algorithm, however, is conceptually difficult since the actual nature of the variations under study is

26

seldom known in advance. Our analysis suggests that the registration method presented here comes

close to separating geometric variability from the regulatory variability with which promoters in

different cells respond to similar concentrations of transcription factors, at least to within the

accuracy afforded by our measurement techniques. Factoring out geometric variability will be

important in isolating and characterizing differences in regulatory mechanisms, both within and

between closely related species.

From a practical perspective, our results suggest that our registration procedure yields a

VirtualEmbryo containing average expression data that is nearly as accurate as could be obtained

from averaging directly co-stained embryos, and is thus sufficient for many types of analyses of

regulatory interactions. It is worth noting that in a high-throughput setting, our method is far more

efficient than direct staining. To measure pair-wise interactions between N genes directly requires

imaging on the order of N2 embryos, stained for each possible pair of genes. In contrast our method

only requires staining and imaging N embryos.

We feel confident that the registration methods outlined here can be extend to later developmental

stages as well as other organisms with segmented body plan architectures (e.g., vertebrates). We

have also explored a more generic technique, “shape context matching” (Fowlkes et. al., 2005),

which is applicable to general patterns of expression. Although the blastoderm has a very simple

morphology, it exhibits the key challenges in building cellular resolution expression atlases:

biological and experimentally induced variability across animals, temporal changes in morphology

and gene expression, and ambiguity in cell identity. In later developmental stages that have complex

morphologies, the locations of nuclei and cell membranes provide far more information about the

27

identity of a particular cell (unlike the fairly homogeneous blastoderm). In these cases,

morphological features will likely be more useful in constraining the correspondence step. On the

other hand, deformation models used for registration will have to be more flexible and localized to

track complicated anatomical structures (e.g., legs or gut).

Variation between individuals

By identifying corresponding cells, our method allows the direct comparison of the locations and

expression profiles of homologous cells in different individuals. This provides a novel tool for

understanding biological variability arising from genetic, environmental and stochastic sources

within a population. Indeed, as a natural outcome of the development of our registration method, we

have already measured several important aspects of variation between individual blastoderm

embryos. Although some of the variation measured must represent experimental error, as discussed

earlier, a significant percent of the measured differences clearly reflect real biological differences

between embryos.

For example, we have estimated a quantitative upper-bound on the degree of regulatory variability

in the relationship between gt and its repression of the target CRM eve stripe 2. The values we

measure are largely consistent with the earlier work of Gregor et al. who made a similar estimate for

the variability in HB concentration as a function of BCD (Gregor et al., 2007).

We have also provided analogous upper-bounds on the degree of geometric variability. In particular,

we discovered there is a significant correlation between the number of nuclei and size of the

embryo. Although this has not previously been reported, it is consistent with earlier results from

28

embryo ligation experiments suggesting that the number of nuclei and nuclear divisions is

determined by some mechanism that senses, in different parts of the embryo, local nuclear densities

(Edgar et al, 1986). It is also consistent with the result of manipulation experiments in echinoderm

and vertebrate embryos that suggest the ratio of cytoplasm to nuclear material regulates the number

of cells at the mid-blastula transition (reviewed by Edgar et al, 1986). Our extensive measurements

imply that even among wild type embryos that have not been experimentally manipulated, even

modest quantitative changes in egg size likely influence locally the number of nuclear

divisions/nuclear loss events.

This novel decomposition of variability into geometric and regulatory components highlights the

importance of registration methods not only as a technical tool for compositing data but also as a

fundamental approach to understanding and describing the accuracy with which developmental

regulatory systems function.

Predicting regulatory interactions from comprehensive spatio-temporal expression data

The closest work to ours is that of (Myasnikova et al., 2001) and (Spirov et al., 2002) who

registered spatial profiles of protein concentrations along the A-P axis of the Drosophila blastoderm

using images of the lateral surface of embryos which had been flattened before imaging. While their

data only provides a 1D picture that largely disregards the blastoderm morphology, it has been

widely adopted for use in modeling pattern formation due to its quantitative nature (see e.g.,

Janssens et al., 2006; Perkins et al., 2006; Alves et al. 2006; Ludwig et al., 2005; Aegerter-Wilmsen

et al., 2005). Our approach expands this quantitative picture of pattern formation with an explicit

29

description of changing morphology and comprehensive coverage of many more spatially patterned

genes, in full 3D.

We have demonstrated a technique for analyzing such 3D spatio-temporal expression data in order

to uncover regulatory relationships between transcription factors and their targets. While our model

of regulation is intentionally quite simple, it is capable of explaining many target patterns quite well

with only a few regulators (as witnessed by high R2 values for most of the targets). We are also able

to recover many interactions proposed in the literature. While our model does not capture many

potential subtleties of regulation such as co-operative or competitive interactions between multiple

bound factors, post-transcriptional and translational control mechanisms, phosphorylation, etc.,

clearly it could be extended and made more accurate by including these processes (see also e.g.,

Clyde 2003; Struffi 2004; Rivera-Pomar 1996; Ronchi 1993; Simpson-Brose 1994).

Our long-term goal is to construct high fidelity VirtualEmbryos containing protein and mRNA

expression data thousands of genes with the quantitative accuracy necessary to provide firm

grounding for a new generation of pattern formation models that take into account features such as

3D diffusion and transport, nuclear movement, and interaction between A-P and D-V patterning

systems. The availability of accurate 3D quantitative data at cellular resolution should provide far

more constraints on potential models of the regulatory structures underlying animal development.

A visualization tool

One of the challenges of complex quantitative datasets is the difficulty of exploring them. In our

case, the data represent dynamic 3D information for many genes, which is fundamentally difficult to

30

visualize all at once. Since many biologists lack the mathematical and computational skills needed

to work directly with the data files provided, the BDTNP has built a comprehensive set of tools for

interactive visualization and analysis of the atlas, called PointCloudXplore, that run on Windows,

OS X, or UNIX operating systems (Rübel et al., 2006; Frankel, 2007). These tools allow the user to

explore VirtualEmbryo expression data using a simple graphical interface.

For example, the entire surface of the VirtualEmbryo can be viewed in cylindrical projection with

colored surface plots whose height represents gene expression levels (Figure 7). This allows the

expression of several selected genes from one or more stages to be compared across all cells of the

embryo at once. Abstract visualizations are also available that allow gene expression profiles for

specified groups of cells to be compared using 2D and 3D scatter plots (see Supplemental Figure 7),

parallel coordinates, cluster analysis, and bar graphs (Rübel et al., 2006; Rübel, Weber, Huang,

DePace, Fowlkes, Luengo Hendriks, Keränen, Hagen, Bethel, Eisen, Knowles, Biggin, and

Hamann, unpublished data). This tool has been instrumental in informing the development of our

methods and in some of the discoveries made to date. Its availability greatly extends the utility of

our gene expression atlas. [[If the reviewers wish, we can make a version of PCX and the

VirtualEmbryo available to them]]

EXPERIMENTAL PROCEDURES

Imaging individual embryos

Individual embryos were fixed and fluorescently stained to label the mRNA and/or protein

expression patterns of two genes and nuclear DNA, mounted on microscope slides and imaged

31

using protocols in (Luengo Hendriks et al., 2006). Additional antisense RNA probes were

generated from PCR-products of cDNAs in Drosophila Gene Collection I and Drosophila Gold

Collection (for the list of used probes, see the online database). For protein stains, the primary rabbit

antibodies against BCD, KR, and GT were generated by BDTNP, the guinea pig antibodies against

HB and KR were gift from J. Reinitz. The primary antibodies were detected using Alexa546-,

Alexa555-, or Alexa610-conjugated secondary antibodies (Molecular probes, 1:500). Embryos were

usually stained for either eve or ftz and one of the other 93 genes shown in Supplemental Figure 3.

Photomultiplier gains were set so that all the pixels of interest fell within the 12 bit dynamic range.

These settings were recorded and later used to recover the absolute fluorescence level using a

calibrated power-law fit.

Expression values in the PointClouds were corrected for attenuation using a modified version of the

method described in (Luengo Hendriks et al., 2006). The mean intensity of the DNA stain within

the nucleus was assumed to be constant for all nuclei, but to account for different attenuation at

different wavelengths, an offset was computed for the DNA intensity that made the background of

the mRNA expression pattern equal to that DNA intensity (up to a scaling). The corrected

expression levels were then computed by dividing the measured mRNA intensity by the offset DNA

intensity.

Spatial registration

For each temporal cohort, we used the template consisting of eve or ftz stripe boundary points

sampled on a cylindrical grid at each of the 14 stripe boundaries and 40 points uniformly spread in

angle around the D-V axis. We extracted edges of each reference gene pattern in cylindrical

32

coordinates by finding local maxima in the response of an anisotropic Gaussian derivative filter

which was elongated by a factor of three along the D-V direction. At a fixed threshold, this filter

typically detected all the stripe boundaries but also yielded some outliers and an occasional missed

detection due to variability in the staining.

To deal robustly with these errors, we performed an optimization along each of the 40 A-P strips to

match the 14 points in our template with the edges in the PointCloud. This 1D alignment between

the template coordinates and the detected edges was carried out using dynamic programming on a

cost function that depended on the polarity of the edge being matched (whether it is the anterior or

posterior edge of a stripe) and the total displacement necessary to align the model along the strip.

While this enforced consistency of matches along the A-P axis (e.g., two stripes in a template

cannot be matched to the same stripe in the embryo), neighboring A-P strips could still be

inconsistent. Before estimating a warping, we performed a post-processing step using a global

quadratic cost to prune matches that were inconsistent with their neighbors in either the A-P or D-V

directions.

Let m be a matching that specifies for the ith point, located at it in our template, the corresponding

boundary point, )(im detected at location )(ims in an individual embryo. We evaluate the cost:

!"

"""=

jji

jjmiim

i

tt

tstsmQ

2

2)()( )()(

)(

and drop those points i for which iQ exceeds a fixed threshold. We also use the total quadratic cost,

!i

iQ , to evaluate whether a reasonable match was found. A large total cost is indicative of an

33

embryo with a poorly defined marker gene expression pattern. In such cases, the algorithm

automatically defaults to using the coarse alignment.

Modeling deformations

We would like to find a smooth warping 33: !"!u which maps our set of detected boundary

points{ }...,, )3()2()1( mmmsss to their respective targets{ }...,,

321ttt . We treat each coordinate of the

function ),,( 321 uuuu = separately, and solve for the regularized multivariate spline that minimizes

the functional:

dxxxx

xutsuuJ

lkj lkj

i

j

j

i

jm

ii ! "!=

##

$

%

&&

'

(

)))

)+*=

3

1,,

23

2)( )()()( +

Although iu is infinite-dimensional, the optimal solution can be specified in closed form with

coefficients given by the solution of a compact system of linear equations (Wahba, 1990).

The regularized TPS provides a free parameter! that trades off the fidelity of the warping u at the

marker points with the smoothness of the interpolation throughout the remainder of the embryo.

We set this parameter by cross-validation, choosing the value that minimized the apparent inter-

embryo variability on non-marker genes. We also used this same validation technique to compare

TPS to other deformation models. For example, we found that TPS removed 10-20% more

variability than Gaussian radial basis function (RBF) splines at the optimal parameter settings.

Aligning expression levels

We used embryos from different developmental time points that were fixed and hybridized in a

single batch in order to estimate the temporal progression of the 99th percentile expression level.

34

When more than one hybridization was available for a given gene, we estimated a scale parameter

for each batch in order to minimize the squared error relative to the mean. Since the absolute

expression level between different genes is not calibrated in a meaningful way, we scaled

expression so that the maximum average expression recorded for each gene over the entire time

interval was 1.0. These scaled measurements were smoothed using Gaussian process regression

(Rasmussen and Williams, 2006) to yield the smoothed expression time courses plotted as dotted

lines in Supplementary Figure 3. We used a squared exponential covariance function with

characteristic length scale of 3 cohorts (roughly 30 minutes) and independent noise model with

standard deviation of 0.3. Once the max expression level for each gene and cohort was estimated, a

gain and offset was chosen for each embryo that minimized the variability within the cohort while

matching the time course max. As noted by (Gregor et al., 2007) these optimal gains and offsets

can be computed by singular value decomposition.

Bounding geometric variability remaining after registration.

One possible concern with this three-way analysis is that the measurement error could be correlated

with batches of embryos used in the experiment. For example, if the co-stained batch happened to

have higher variability due to measurement error than the registered batch, the level of variability

measured in co-stained embryos would no longer be a lower bound on level of apparent variability

in registered embryos. We measured the average standard deviation after registration for each of

the three batches and found values of 0.116/0.0806 for the eve/gt co-stain, 0.177/0.137 for eve/ftz,

and 0.129/0.142 for ftz/gt. Since the single gene variability within the co-stained batch was

significantly lower for both eve (0.116 < 0.177) and gt (0.0806< 0.142) we conclude that our

estimates of the geometric variability remaining after registration are potentially quite conservative.

35

Fitting regulatory functions

In our experiments, we used a generic form for F consisting of a sigmoid applied to a linear

combination of the protein concentrations (with weighting vector a) and a constant offset b,

bPaT

e

baPF+

+=1

1),|(

The sigmoid provides a saturating, non-linear response that constrains the transcription rate to lie

between 0 and a maximum (scaled to 1) and can be motivated in part from thermodynamic

considerations (Mjolsness 2007, Bintu et. al. 2005). The parameter vector a determines the

steepness of the response while b sets the offset at which transcription reaches half of the maximum

value (normalized to 1). We also considered a similar model for F which included quadratic terms

(products of all pairs of protein concentrations). While such a model is inherently more flexible,

thus providing higher quality fits (not shown) and explicitly captures non-linear interactions

between factors, it results in a far greater number of parameters that are significantly more difficult

to interpret. For example, the response of a target may be non-monotonic with respect to the

concentration of an individual regulator. For this reason, we used the more parsimonious linear

model in the experiments described here.

36

To fit the parameters a and b to our observed data, we perform least squares minimization over all

nuclei at all time points for which we have data

2

,)),|),((),(min! "=

j

jjjjiba

batxPFtxMC

We characterized the goodness of fit for each target using )(

12

MVar

CR != which measures the

extent to which the model explains the variance in expression across measured nuclei and time

points.

37

REFERENCES

Aegerter-Wilmsen, T., Aegerterb, C., Bisselinga, T. (2005). Model for the robust establishment of

precise proportions in the early Drosophila embryo. J. Theor. Biol., 234, 13-19.

Alves F, Dilao R. (2006) Modeling segmental patterning in Drosophila: Maternal and gap genes.

J Theor Biol. 241(2):342-59.

Arnosti DN, Barolo S, Levine M, Small S (1996) The eve stripe 2 enhancer employs multiple

modes of transcriptional synergy. Development (Cambridge, England) 122(1): 205-214.

Azevedo, R., French, V., Partridge, L. (1996). Thermal evolution of egg size in Drosophila

Melanogaster. Evolution. 50, 2338-2345.

Bintu, L., Buchler, N. E., Garcia, H. G., Gerland, U. Hwa, T., Kondev, J. and Phillips, R. (2004)

Transcriptional regulation by the numbers: models. Curr Opin Genetics Dev 15(2): 116-124

Bookstein, F.L., (1989). Principal Warps: Thin-Plate Splines and Decomposition of Deformations.

IEEE Trans. Pattern Analysis and Machine Intelligence, 11(6), 567-585.

Carson, J.P., Ju, T., Lu, H.C., Thaller, C., Xu, M., Pallas, S.L., Crair, M.C., Warren, J., Chiu, W.,

Eichele, G. (2005). A Digital Atlas to Characterize the Mouse Brain Transcriptome. PLoS

Computational Biology 1(4), e41.

38

Christiansen, J., Yang, Y., Venkataraman, S., Richardson, L., Stevenson, P., Burton, N., Baldock,

R., Davidson, D. (2006). EMAGE: a spatial database of gene expression patterns during mouse

embryo development. Nucleic Acids Res. 34, D637-41.

Davidson, D., Bard, J., Brune, R., Burger, A., Dubreuil, C., Hill, W., Kaufman, M., Quinn, J., Stark,

M., Baldock, R. (1997). The mouse atlas and graphical gene-expression database. Seminars in Cell

& Developmental Biology 8(5), 509-517.

Duchon, J. (1977). Splines Minimizing Rotation-Invariant Semi-Norms in Sobolev Spaces. In

Constructive Theory of Functions of Several Variables, W. Schempp and K. Zeller, eds. (Berlin:

Springer-Verlag), 85-100.

Edgar, B.A., Kiehle, C.P., Schubiger, G. (1986). Cell cycle control by the nucleo-cytoplasmic ratio

in early Drosophila development. Cell. 44(2), 365-72.

Eldon, E.D. and Pirrotta, V. (1991) Interaction of the Drosophila gap gene giant with maternal and

zygotic pattern-forming genes. Development 111, 367-378.

Fowlkes, C. C., Luengo Hendriks, C.L., Keränen, S. V. E., Biggin, M.D., Knowles, D.W., Sudar,

D., Malik, J. (2006). Registering Drosophila Embryos at Cellular Resolution to Build a

Quantitative 3D Atlas of Gene Expression Patterns and Morphology. Proc. of CSB 2005 Workshop

on BioImage Data Minning and Informatics, Palo Alto, CA.

39

Fowlkes, C. C., Malik, J. (2006). Inferring nuclear movements from fixed material. Report no.

UCB/EECS-2006-142. EECS Department: University of California, Berkeley.

(http://www.eecs.berkeley.edu/Pubs/TechRpts/2006/EECS-2006-142.pdf)

Frankel, F. (2007). Expressing Genes, American Scientist 95, 69-71.

Fujioka M, Emi-Sarker Y, Yusibova GL, Goto T, Jaynes JB. (1999) Analysis of an even-skipped

rescue transgene reveals both composite and discrete neuronal and early blastoderm enhancers, and

multi-stripe positioning by gap gene repressor gradients. Development. 126(11):2527-38.

Gregor, T., Tank, D.W., Wieschaus, E.F., and Bialek, W. (2007). Probing the limits to positional

information. Cell 130(1), 153-164.

Gong, S., Zheng, C., Doughty, M.L., Losos, K., Didkovsky, N., Schambra, U.B., Nowak, N.J.,

Joyner, A., Leblanc, G., Hatten, M.E., Heintz, N. (2003). A gene expression atlas of the central

nervous system based on bacterial artificial chromosomes. Nature 425, 917-925.

Imai, K. S., Levine, M., Satoh, N., Satou, Y. (2006). Regulatory blueprint for a chordate embryo.

Science 312 (5777), 1183-7.

Janssens. H., Hou, S., Jaeger, J., Kim, A., Myasnikova, E., Sharp, D., Reinitz, J. (2006).

Quantitative and predictive modeling of transcriptional control of the Drosophila melanogaster even

skipped gene. Nat. Genetics, 38, 1159-65.

40

Jaeger J, Surkova S, Blagov M, Janssens H, Kosman D, Kozlov KN, Manu, Myasnikova E,

Vanario-Alonso CE, Samsonova M, Sharp DH, Reinitz J. (2004) Dynamic control of positional

information in the early Drosophila embryo. Nature 430(6997):368-71.

Keränen, S. V. E., Fowlkes, C. C., Luengo Hendriks, C. L., Sudar, D., Knowles, D. W., Malik, J.,

and Biggin, M. D. (2006). 3D morphology and gene expression in the Drosophila blastoderm at

cellular resolution II: dynamics. Genome Biology 7, R124

Kudoh, T., Tsang, M., Hukriede, N.A., Chen, X., Dedekian, M., Clarke, C.J., Kiang, A., Schultz, S.,

Epstein, J., Toyama, R., Dawid, I. (2001) A gene expression screen in zebrafish embryogenesis.

Genome Res. 11, 1979-1987

Kosman, D., Mizutani, C. M., Lemons, D., Cox, W. G., McGinnis, W., Bier, E. (2004). Multiplex

Detection of RNA Expression in Drosophila Embryos. Science 305, 846-846

Kumar, S., Jayaraman, K., Panchanathan, S., Gurunathan, R., Marti-Subirana, A., Newfeld, S.

(2002) BEST: a novel computational approach for comparing gene expression patterns from early

stages of Drosophila melanogaster development. Genetics, 162 (4), 2037-47.

Laney, J.D. and Biggin, M.D. (1996) Redundant control of Ultrabithorax by zeste involves functional

levels of zeste binding at the Ultrabithorax promoter. Development 122, 2303-2311.

41

Lein, E.S., Hawrylycz, M.J., Ao, N., Ayres, M., Bensigner, A., Bernard, A., Boe, A.F., Boguski,

M.S., Brockway, K.S., Byrnes, E.J., et al. (2007). Genome-wide atlas of gene expression in the

adult mouse brain. Nature 445, 168-176.

Levsky, J.M., Shenoy, S.M., Pezo, R.C., Singer, R.H. (2002). Single-Cell Gene Expression

Profiling. Science 297, 836-840.

Liang, Z., Biggin, M.D. (1998). Eve and ftz regulate a wide array of genes in blastoderm embryos:

the selector homeoproteins directly or indirectly regulate most genes in Drosophila. Development

125(22), 4471-82.

Lott SE, Kreitman M, Palsson A, Alekseeva E, Ludwig MZ. (2007) Canalization of segmentation

and its evolution in Drosophila. Proc Natl Acad Sci U S A. 104(26):10926-31.

Lucchetta EM, Lee JH, Fu LA, Patel NH, Ismagilov RF. (2005) Dynamics of Drosophila embryonic

patterning network perturbed in space and time using microfluidics. Nature 434(7037):1134-8.

Ludwig, M., Palsson, A., Alekseeva, E., Bergman, C., Nathan, J., Kreitman, M. (2005). Functional

Evolution of a cis-Regulatory Module. PLoS Bio. 3(4), 588-598.

Luengo Hendriks, C. L. , Keränen, S. V. E. , Fowlkes, C. C., Simirenko, L., Weber, G. H.,

Henriquez, C., Kaszuba, D. W., Hamann, B., Eisen, M. B., Malik, J., Sudar, D., Biggin, M. D., and

42

Knowles, D. W. (2006). 3D morphology and gene expression in the Drosophila blastoderm at

cellular resolution I: data acquisition pipeline. Genome Biology, 7, R123.

Manoukian, A. S., and Krause, H. M. 1992. Concentration-dependent activities of the even-skipped

protein in Drosophila embryos. Genes & Dev. 6: 1740-1751.

Mardia, K., Kent, J., Bibby, J. (1979) Multivariate Analysis (London: Academic Press)

Meinguet, J. (1979). Multivariate Interpolation at Arbitrary Points made Simple. J. Applied Math.

Physics (ZAMP) 5, 439-468.

Mjolsness, E. (2007). On cooperative quasi-equilibrium models of transcriptional regulation. J

Bioinform Comput Biol. 5(2B):467-90.

Myasnikova, E., Samsanova, A., Kozlov, K., Samsonova, M., Reinitz, J. (2001). Registration of the

expression patterns of Drosophila segmentation genes by two independent methods. Bioinformatics

17(1), 3-12.

Neidhardt, L., Gasca, S., Wertz, K., Obermayr, F., Worpenberg, S., Lehrach, H., Herrmann, B.G.

(2000). Large-scale screen for genes controlling mammalian embryogenesis, using high-throughput

gene expression analysis in mouse embryos. Mech Dev 98, 77-94.

43

Oliveri, P., Davidson, E.H. (2004). Gene regulatory network controlling embryonic specification in

the sea urchin. Curr Opin Genet Dev. 14(4), 351-60.

Peng, H., Long, F., Eisen, M., Myers, E. (2006). Clustering gene expression patterns of fly embryos.

Proc. IEEE 2006 International Symposium on Biomedical Imaging (ISBI 2006), 1144-1147.

Perkins TJ, Jaeger J, Reinitz J, Glass L (2006) Reverse Engineering the Gap Gene Network of

Drosophila melanogaster . PLoS Comput Biol 2(5): e51

Pollet, N., Muncke, N., Verbeek, B., Li, Y., Fenger, U., Delius, H., Niehrs, C. (2005). An atlas of

differential gene expression during early Xenopus embryogenesis. Mech Dev. 122(3), 365-439.

Rasmusen, C., Williams, C. (2006). Gaussian Processes for Machine Learning. (Boston: MIT

Press).

RiverA-Pomar R, Niessing D, Schmidt-Ott U, Gehring WJ, Jackle H. (1996) RNA binding and

translational suppression by bicoid. Nature. 379(6567):746-9.

Ronchi E, Treisman J, Dostatni N, Struhl G, Desplan C. (1993) Down-regulation of the Drosophila

morphogen bicoid by the torso receptor-mediated signal transduction cascade. Cell. 74(2):347-55.

Rübel, O., Weber, G.H., Keränen, S.V.E., Fowlkes, C.C., Luengo Hendriks, C.L., Shah, N.Y.,

Biggin, M.D., Hagen, H., Knowles, D.W., Malik, J., Sudar, D., Hamann, B. (2006).

44

PointCloudXplore: Visual analysis of 3D gene expression data using physical views and parallel

coordinates. In B.C. Santos, T. Ertl and K. Joy (eds.), Data Visualization 2006 (Proceedings of the

Eurographics/IEEE-VGTC Symposium on Visualization, Lisbon, Portugal, May 2006).

Simin, K., Scuderi, A., Reamey, J., Dunn, D., Weiss, R., Metherall, J.E., Letsou, A. (2002).

Profiling patterned transcripts in Drosophila embryos. Genome Res. 12, 1040-47.

Simpson-Brose M, Treisman J, Desplan C. (1994) Synergy between the hunchback and bicoid

morphogens is required for anterior patterning in Drosophila. Cell. 78(5):855-65.

Small S, Blair A, Levine M (1992) Regulation of even-skipped stripe 2 in the Drosophila embryo.

The EMBO journal 11(11): 4047-4057.

Spirov, A., Kazansky, A., Timakin, D., Reinitz, J. (2002). Reconstruction of the Dynamics of

Drosophila Genes Expression from Sets of Images Sharing a Common Pattern. Real-Time Imaging

8, 507-518.

Stathopoulos, A., van Drenth, M., Erives, A., Markstein, M., Levine, M. (2002). Whole-genome

analysis of dorsal-ventral patterning in the Drosophila embryo. Cell. 111(5), 687-701.

Struffi P, Corado M, Kulkarni M, Arnosti DN. (2004) Quantitative contributions of CtBP-dependent

and -independent repression activities of Knirps. Development. 131(10):2419-29.

45

Tassy, O., Daian, F., Hudson, C., Bertrand, V., Lemaire, P. (2006). A quantitative approach to the

study of cell shapes and interactions during early chordate embryogenesis. Current Biol. 16, 345-58.

Tomancak, P., Beaton, A., Weiszmann, R., Kwan, E., Shu, S., Lewis, S.E., Richards, S., Ashburner,

M., Hartenstein, V., Celniker, S.E., Rubin, G.M. (2002). Systematic determination of patterns of

gene expression during Drosophila embryogenesis. Genome Biol. 3(12), R88, 1-14.

Tomancak, P., Berman, B. P., Beaton, A., Weiszmann, R., Kwan, E., Hartenstein, V., Celniker, S.

E., Rubin, G. M. (2007) Global analysis of patterns of gene expression during Drosophila

embryogenesis. Genome Biol. 8(7), R145

Turner, F.R., and Mahowald, A.P. (1976) Scanning Electron Microscopy of Drosophila

Embryogenesis. Developmental Biol. 50, 95-108.

Visel, A., Thaller, C., Eichele, G. (2004). GenePaint.org: An atlas of gene expression patterns in the

mouse embryo. Nucleic Acids Res. 32, D552-D556.

Wahba, G. (1990). Spline Models for Observational Data. SIAM.

Walter, J., Dever, C., and Biggin, M.D. (1994) Two homeodomain proteins bind with similar

specificity to a wide range of DNA sites in Drosophila embryos. Genes and Dev. 8, 1678-1692

Warren, DC. (1924). Inheritance of Egg Size in Drosophila Melanogaster. Genetics 9(1), 41-69

46

Weber, GH, Rübel, O., Huang, M-Y., DePace, A.H., Fowlkes, C. C., Keränen, S.V.E., Luengo

Hendriks, C.L., Hagen, H., Knowles, D.W., Malik, J., Biggin, M. D., and Hamann, B. Visual

exploration of three-dimensional gene expression using physical views and linked abstract views, to

appear in: IEEE Transactions on Computational Biology and Bioinformatics, to appear

Yuh, C.H., Bolouri, H., Davidson, E.H. (2001). Cis-regulatory logic in the endo16 gene: switching

from a specification to a differentiation mode of control. Development 128(5), 617-29

Zalokar, M., Erk, I. (1976) Division and Migration of Nuclei during Early Embryogenesis of

Drosophila melanogaster. J. Micro. Biol. Cellularie 25, 97-106

47

ACKNOWLEDGEMENTS This work is part of a broader collaboration by the BDTNP. We are grateful for the frequent advice,

support, criticisms, and enthusiasm of its members. We thank Thomas Gregor and William Bialek

for insightful discussions. Work conducted by the BDTNP is funded by a grant from NIGMS and

NHGRI, GM704403, at Lawrence Berkeley National Laboratory under Department of Energy

contract DE-AC02-05CH11231.

48

FIGURE LEGENDS

Figure 1. Data from hundreds of individually imaged embryos is averaged into a composite

VirtualEmbryo. Top: Each individual embryo is stained for nuclei, a common marker gene (red)

and a gene of interest (second color). Center: Within each temporal cohort, the marker gene is used

to guide spatial registration on to a morphological template; temporal correspondences between

cohorts are provided by a model of typical nuclear movements. Bottom: Once correspondences

across embryos have been established, expression measurements are averaged and composited to

create a model VirtualEmbryo in which the expression of many genes can be compared.

Figure 2. Variations in blastoderm size, number of nuclei, and gene expression stripe locations

between embryos. Panels A and B show scatter plots in which embryo length (A) and surface area

(B) are both correlated with the number of nuclei (y axis), demonstrated here with a linear fit (red

line). Because experimental errors in nuclear count should not correlate with errors in determining

surface area or egg length, these correlations likely represent true biological variability among

embryos. Panels C and D show the variability in the location of ftz gene expression stripe locations

before (C) and after (D) embryos are scaled to the average cohort egg length just prior to

gastrulation (stage 5:75-100% cell membrane invagination). The plots are orthographic projections

in which the anterior of the embryo is to the left, the dorsal midline to the top and the center of mass

of the embryo is at the origin. Each line specifies the average stripe boundary location with error

bars showing one standard deviation in the A-P coordinate. Prior to scaling embryos to a common

length, standard deviations for ftz stripe locations are as large as 11.1µm, or 62% of the average ftz

49

stripe width. After scaling by egg length the variation is reduced but is still significant (std dev up

to 5.4µm, or 30% of average stripe width).

Figure 3. Fine registration and compositing expression values. A marker gene expression pattern

(red) is used to identify corresponding nuclei in different embryos and perform fine spatial

registration. Each individual PointCloud is warped to bring extracted marker gene boundaries (red

lines) into alignment with a standard morphological template (black lines). Individual per-nucleus

measurements (here both red and green) are then composited onto the template to produce an

estimate of average expression.

Figure 4. Examples of average temporal patterns of mRNA expression recorded in the

VirtualEmbryo for several gap (kni,gt,hb) and pair-rule (eve,ftz,slp1) genes. Temporal cohorts,

staged by percent membrane invagination, are arranged from left to right with each row

corresponding to a different gene. Each rectangle shows a lateral view of the blastoderm in a half

cylindrical projection with dorsal at top, anterior to the left (see also Figure 7).

Figure 5. Fine registration removes additional measured inter-embryo geometric variability

and produces average regulatory relationships comparable to those measured in individual

co-stained embryos.. The mean and standard deviations for anterior-posterior expression profiles

taken from a lateral strip before (Panel A) and after (Panel B) fine registration for three genes: ftz

(n=100), slp1 (n=25) and gt (n=8). The inter-embryo variability decreases significantly with fine

registration, both for the marker gene (ftz) but also for “held out” expression patterns (slp,gt). Inset

numbers for each gene give the standard deviation averaged over the entire embryo. Panel C shows

50

the co-expression of gt and eve near the dorsal surface along the anterior edge of eve stripe 2 in

embryos co-stained for eve and gt (n=47). Panel D shows the regulatory effect of gt on eve (2)

inferred from co-stained (blue curve) embryos binning nuclei based on the expression level of gt

and computing the mean and standard deviation of eve expression for each bin. The red and green

curves show the regulatory relation estimated by sampling pairs of nuclei in similar spatial locations

after coarse alignment (red) and fine registration based on eve (green). Panel E shows the

regulatory effect inferred from nuclei in different batches of embryos stained for either eve (n=35)

or gt (n=28) which were identified as corresponding based on coarse alignment (red) or fine

registration (green) using ftz. The co-stain estimate (blue curve) is repeated for comparison. The

inferred relation based on registered embryos is quite close to the true co-expression measured in

co-stained embryos. The variation in the regulatory relation across embryos in the virtual co-

expression estimate is significantly smaller than the coarse registration and comes closer to the

lower bound set by the level of variability quantified in individuals.

Figure 6. Regulatory relationships inferred from composite spatio-temporal expression data.

Panel A shows the coefficients of each of 17 regulators (columns) determined by fitting each target

eve stripe (rows). Each row contains only 6 non-zero entries corresponding to the selected regulators

for that target. Green indicates activation, red indicates repression, black indicates no interaction.

The right most column indicates the constant offset b. Quality of fit (R2 ) values are specified in

brackets. Panel B shows the coefficients for the individual components of the gap gene gt, ordered

by A-P location. Panel C shows the distribution of R2 fit values for all “modules” in the atlas (see

Supplementary Figure 6).

51

Figure 7. PointCloudXplore allows visualization of quantitative 3D expression data.

Expression of the gap genes fkh, gt, hb, kni, and Kr is shown for the stage 5: 4-8% cohort. Panel A

shows a 3D model of the blastoderm surface (anterior left, dorsal down) with each nucleus colored

according to the expression level of the 5 genes. Panel B shows a cylindrical projection of the entire

blastoderm (anterior left, ventral center, dorsal upper and lower edges). The heights of each surface

plot indicate the average expression level of the gene recorded at that point on the VirtualEmbryo,

making readily visible the quantitative changes in expression of these gap genes along both the A-P

and D-V axes.

Constructing a Quantitative Spatio-temporal Atlas of Gene...

Documents

Transcript of Constructing a Quantitative Spatio-temporal Atlas of Gene...