· Web viewSupplemental Information TomatoNet: An genome-wide co-functional network for unveiling...

46
Supplemental Information TomatoNet: An genome-wide co-functional network for unveiling complex traits of tomato, a model crop for fleshy fruits Hyojin Kim, Bong Suk Kim, Jung Eun Shim, Sohyun Hwang, Sunmo Yang, Eiru Kim, Anjali S. Iyer-Pascuzzi, and Insuk Lee Contents Supplemental Methods Supplemental Results Supplemental References Supplemental Figure 1 Supplemental Figure 2 Supplemental Figure 3 Supplemental Figure 4 Supplemental Table 1 Supplemental Table 2 Supplemental Table 3 Supplemental Table 4 Supplemental Table 5 Supplemental Table 6 Supplemental Table 7 Supplemental Table 8 Supplemental Table 9 1

Transcript of  · Web viewSupplemental Information TomatoNet: An genome-wide co-functional network for unveiling...

Supplemental Information

TomatoNet: An genome-wide co-functional network for unveiling complex traits of

tomato, a model crop for fleshy fruits

Hyojin Kim, Bong Suk Kim, Jung Eun Shim, Sohyun Hwang, Sunmo Yang, Eiru Kim, Anjali S. Iyer-Pascuzzi, and Insuk Lee

Contents

Supplemental Methods Supplemental Results Supplemental References Supplemental Figure 1 Supplemental Figure 2 Supplemental Figure 3 Supplemental Figure 4 Supplemental Table 1 Supplemental Table 2 Supplemental Table 3 Supplemental Table 4 Supplemental Table 5 Supplemental Table 6 Supplemental Table 7 Supplemental Table 8 Supplemental Table 9

1

Supplemental Methods

Reference genes and gold standard co-functional gene pairs for training a network of

tomato genes

Construction of TomatoNet is summarized in Supplemental Figure 1 and Supplemental

Table 1. We used 34,727 protein coding genes (http://solgenomics.net/) by The Tomato

Genome Consortium for the network construction. A set of gold-standard co-functional gene

pairs for training a tomato gene network was generated by pairing genes that share annotation

by either KEGG (Kanehisa et al., 2014) or MAPMAN (Thimm et al., 2004) pathway

databases. Pairs of genes that are annotated by either pathway database but no overlapped

annotation then would be gold-standard negative gene pairs. The resultant sets of gold

standard positive and negative pairs between tomato genes contain 190,125 and 24,299,376

pairs, respectively, for 6,999 genes (20.15 % of the coding genes).

Benchmarking and integrating gene networks

To integrate networks derived from heterogeneous data types that differ in data-intrinsic

score schemes, we re-evaluated the individual networks using a unified scoring scheme, log

likelihood scores (LLS) (Lee et al., 2004) of network links, based on a Bayesian statistics

framework. LLS was calculated as follows:

LLS=ln( P ( L|E )/ P(⌐ L∨E)P( L)/ P(⌐ L) )

,where P(L|E) and P(⌐L|E) represent the frequencies of positive (L) and negative (⌐L) gold

standard links observed in the given evidences (E), while P(L) and P(⌐L) represent the prior

expectations (i.e., the total frequencies of all positive and negative gold standard gene pairs,

respectively). We used ‘0.632 bootstrapping’ for all LLS calculations to avoid over-training

phenomenon.

Networks derived from different data types were integrated by naïve Bayes with weighted

sum (WS) scheme (Lee et al., 2004), which enables handling correlations among the networks

by optimizing two free parameters: T for the LLS threshold of network links to be integrated,

and D for the average dependence among networks:

2

WS=Lo+∑n=0

∞ LiD ∙i

, for all L ≥T

where, L represents a LLS, L0 represents the maximum score among multiple LLSs for a

unique gene pair, and i is the rank order index among secondary scores.

Inferring co-functional networks by co-expression analysis of tomato genes

Genes that operate in the same cellular processes tend to be co-expressed across various

experimental conditions. We inferred co-functional gene pairs in tomato from diverse sets of

expression samples available from a public depository of microarray and RNA-seq data, the

Gene Expression Omnibus (GEO) (Barrett et al., 2013) database. We tested a total of 1,473

expression samples for 64 GEO data series that contain at least 12 samples, and found that 12

series comprising 276 samples show co-expression that correlates with the probability of gold

standard functional gene pairs (Supplemental Table 2). Microarray data based on

Affymetrix gene chip were normalized by MAS5, which was known to be more optimal for

co-expression analysis (Frost et al., 2012). Microarray data based on other platforms were

obtained as GEO ‘soft format’ data and analyzed. For RNA-seq data, short read data were

processed into reads per kilo-base per million (RPKM) values using edgeR software (Lim et

al., 2007). All the preprocessed expression data were analyzed for co-expression by Pearson’s

correlation coefficient (PCC). Twelve co-expression networks inferred from the 12 GSEs

were benchmarked by log likelihood score, and then integrated into a single co-expression

network for tomato (SL-CX of Supplemental Table 1) using the weighted sum method

described above.

Inferring co-functional networks by phylogenetic profiling method

Genes belong to the same pathways tend to be inherited together during speciation.

Therefore, we may infer co-functional gene pairs based on similarity between profiles of

presence or absence of orthologs of the target protein across species, which is represented as

the phylogenetic profile. We previously found that network inference using the phylogenetic

profiling method is most effective with profiles within the domains of life (Shin and Lee,

2015). We thus constructed phylogenetic profiles of 34,727 coding genes of tomato for three

domains of life, Archaea, Bacteria, and Eukaryota, based on –log(E-value) of blastp hit 3

scores. We then calculated mutual information between gene pairs with each of three profile

sets and found that a profile set based on 122 Archaea and one based on 1,626 Bacteria show

a correlation between mutual information score and co-functional likelihood score. These two

networks were integrated into a single network based on phylogenetic profiling method in

tomato (SL-PG of Supplemental Table 1).

Inferring co-functional networks by genomic neighborhood method

There are two approaches of gene neighborhood identification, distance-based gene

neighborhood (DGN) and probability-based gene neighborhood (PGN), and their inferred

network links are complementary (Shin et al., 2014). Therefore, we inferred functional

associations using each of the methods and integrated them into a single tomato network

based on gene neighborhood (SL-GN of Supplemental Table 1). The principle behind these

approaches is that two tomato genes of which bacterial orthologs are close to each other in

bacterial genomes have a higher inclination for being functionally associated. Gene

neighborhood analysis for tomato was conducted for 122 Archaeal bacteria and 1,626

Eubacteria genomes. The detailed methods of both gene neighborhood methods were

described in our earlier work (Shin et al., 2014).

Inferring co-functional networks from associalogs of other species

If orthologs of two tomato genes are functionally associated in other species, they are likely

to be co-functional in tomato as well. This co-functional link transferred from other species

by orthology-relationship is called associalog (Robinson et al., 2010). Due to the pervasive

paralogs in the tomato genome, we identified orthologs of tomato in other species based on

bidirectional best hit (BBH). Based on the orthology-based transfer of functional networks,

we could infer a total of 15 networks of tomato genes from seven different species networks

including not only plants but also animals: AraNet v2 (Lee et al., 2015b), YeastNet v3 (Kim

et al., 2014), HumanNet (Lee et al., 2011), WormNet v3 (Cho et al., 2014), RiceNet v2 (Lee

et al., 2015a), FlyNet (Shin et al., 2015), and an unpublished gene network for zebrafish

(Supplemental Table 1).

4

Assessment of networks and network-based predictions

Tomato genes were ranked by sum of log likelihood scores of their links to known member

genes for each UniProtGOBP term. To reduce high variation of assessment scores by

pathways with only a few member genes, we excluded GOBP pathways with less than five

member genes. The exclusion of the large pathway terms is helpful to avoid functional bias

toward dominantly annotated pathways (Lee et al., 2007). Thus, we also excluded GOBP

pathways with more than 300 member genes. Therefore, we used only 813 GOBP pathways

that contain between five and 300 member genes. The functional links of the validation data

set were generated by pairing tomato genes that share annotations of 813 UniProtGOBP

terms. The final validation set contained 635,894 function links, of which only 4.4% (27,901

links) overlap with the training data, indicating independence of the validation data from the

training data. Then the retrieval rate of true positives and false positives were calculated to

measure precision for the given genome coverage. We also generated receiver operating

characteristic (ROC) curve, which was then summarized as area under the ROC curve (AUC)

to measure prediction capacity of TomatoNet for each GOBP term. If the member genes for a

GOBP pathway are well connected in the network, they will give a network edge score to

each other and rank them high, resulting in a high AUC. As a null model, we performed the

ROC analysis with randomized GOBP pathways in which original member genes were

replaced with random tomato genes.

Network algorithms to prioritize genes for complex traits of tomato

The major goal of TomatoNet was to provide network-based prediction tools for crop

scientists to prioritize genes for economic traits. We hence implemented two network-based

gene prioritization methods in a web-based software (www.inetbio.org/tomatonet). The first

gene prioritization option, pathway-centric prediction (see Supplemental Figure 1, STEP 4-

1), is based on the principle of guilt-by-association. The prediction system takes user-input

genes for a known trait, which serve as guide genes. If the given guide genes are well

connected by TomatoNet and elucidate a pathway (i.e., a connected network) for the trait,

other genes that connect to the guide genes are likely to be new members of the pathway and

involved in the trait. To prioritize candidate genes for the trait, we ranked other genes based

on the strength of network connection to the guide genes. TomatoNet takes into account edge

5

weights of all connections to the guide genes as a sum of log likelihood scores. If a group of

genes for the trait of interest are connected as a pathway, pathway-centric prediction would

be an effective method of network-based gene prioritization.

However, for some traits, few guide genes are known, and pathway-centric prediction would

be less effective. Therefore, we implemented an alternative network-based prediction option,

context-centric prediction (see Supplemental Figure 2, STEP 4-2), which is based on gene

expression data. Transcriptome profiles for the trait-relevant context provides several

hundred genes that respond to the relevant trait. For example, we may observe differential

expression of many tomato genes that regulate biotic stress resistance upon pathogen

inoculation. We hypothesize that a gene that is connected to many of the differential

expressed genes (DEGs) in a trait-relevant context is likely associated with the trait. If a gene

that is connected to many of the DEGs upon pathogen inoculation, this might suggest that the

gene is likely involved in defense responses to the pathogen. For the analysis, we defined

subnetworks for each gene of TomatoNet, in which a gene of interest becomes a hub of all

connected neighbor genes. The association of each hub gene with the given trait was

measured by significance of overlap between neighbor genes of the hub and DEGs for the

trait-relevant context, based on Fisher’s exact test. We demonstrated the effectiveness of

these two network-based prediction methods in predicting genes related to fruit development

and resistance to a bacterial pathogen.

Network analysis of a fruit development and ripening pathway by TomatoNet

The genes associated with fruit development and ripening were compiled from the FR

database (Yue et al., 2015) that contains fruit development and ripening genes for various

plant and crop species, collected by manual curation of the literature. We obtained a total of

211 tomato FR genes, of which 96 genes are particularly related to the ripening process

(Supplemental Table 3). Significance of the connected network of 131 FR genes by

TomatoNet was measured by Z-test based on the number of within-group edges among FR

genes compared to the distribution of the number of within-group edges for 1000 sets of

randomly sampled 131 tomato genes.

Analysis of differential expressed genes for fruit development and ripening

6

FR-related DEGs were compiled from the tomato gene expression atlas using RNA-seq data

for different stages during fruit development: flower bud, flower, 1 cm fruit, 2 cm fruit, 3 cm

fruit, mature green fruit, breaker, 10 days after breaker stages (Tomato Genome, 2012). Short

read data were aligned by Rsubread package (Lee et al., 2008) and then converted into read

count data by featureCount (Liao et al., 2014) software. The probability of differential

expression was calculated by DESeq2 (Love et al., 2014) software using read counts between

two pairs of conditions: (i) gene expression at 10 days after breaker stage vs. those at flower

stage to find DEGs related to the entire process of fruit development and ripening; (ii) gene

expression at 10 days after breaker stage vs. those at mature green fruit to find DEGs

particularly related to the ripening process. For context-centric prediction, only the top 100

up-regulated DEGs from (i) and (ii) settings were used to predict fruit development-ripening

genes or fruit ripening genes, respectively.

TomatoNet prediction of candidate genes for tomato defense responses to R.

solanacearum

Of the 156 differentially expressed probesets that were identified in a microarray analysis of

tomato shoots after infection with R. solanacearum (Ishihara et al., 2012), 128 had Sol gene

IDs on the Sol Genomics Network (solgenomics.net). 119 of these 128 Sol gene IDs were

unique. These 119 genes were divided into 109 genes that were upregulated after infection

with R. solanacearum, and 10 that were downregulated (Ishihara et al., 2012) (Supplemental

Table 7). Each list of 109 and 10 genes was input into the pathway-centric algorithm of

TomatoNet. Eleven genes, none of which were in the list of input genes, were selected from

the top 100 predicted from pathway-centric analysis for experimental validation through

qRT-PCR (Supplemental Table 8).

Plant growth and R. solanacearum infection

Seeds of HA7996 (H7996) and WVa700 (WVa) were planted in soil of 36 cell square pots of

insert (Metromix, WA) and grown in 30 °C, 16 h light and 8 h dark periods of 60 %

humidity. Tomato plants at the three-leaf stage were root inoculated with Ralstonia

solanacearum, K60 strain (1.0 x 108 cfu/ml). The shoots of ten plants were harvested at 48

hours post inoculation. K60 was grown on Casamino Peptide Agar (CPG) plates for two days

7

at 28 °C and bacterial colonies were scraped from plates and resuspended in sterilized ddH2O.

Bacterial concentration was determined by O.D. 600 = 0.1 as 108 CFU/ml.

RNA isolation and cleanup.

Total RNA from leaves or roots were isolated with Trizol according to the manufacturer’s

protocol (Invitrogen, CA). Approximately 100 mg of plant tissue was used to isolate RNA. 1

ml of Trizol was added to the ground tissue, the slurry was vortexed for 15 sec, and then

incubated at room temperature for 5 min. Chloroform (200 µl) was added to the mixture and

the sample was mixed for 15 sec, and incubated for 5 min at room temperature. The sample

was then centrifuged at 14,000 x g for 15 min at 4°C. The aqueous phase was recovered and

500 µl of isopropanol was added, and the samples were mixed gently by hand. After 10 min

incubation at room temperature, the sample was centrifuged at 14,000 x g for 10 min at 4° C.

The supernatant was removed and the pellet containing RNA was washed by 5 min spin at

10,000 rpm at 4°C in 1ml of ice-cold 70% ethanol. The pellet was dried at room temperature,

resuspended in 30 µl of DEPC-treated water. The isolated total RNA was further DNase

treated and cleaned with the column to remove residual genomic DNA using Norgen RNA

isolation kit according to manufacturer’s procedures. Total RNAs were eluted with 30 µl of

RNase-free water and were stored at -80° C until use.

Generation of cDNA.

cDNA was synthesized from the purified total RNA with the first cDNA kit (NEB, MA)

according to the manufacture’s protocol. Briefly, 1.0µg of total RNA was added with 1 µl of

oligo d(T) (200 pg/µl) and ddH2O to a final volume of 8 µl. The total RNA was incubated at

70° C for 5 min and the tubes were placed on ice for 5 min. 10 µl of reaction buffer and 2 µl

of M-MuLV reverse transcriptase (NEB, MA) was added to the tubes to a final volume of

20µl. For the negative control, the reaction above was prepared without the addition of the

reverse transcriptase. The samples were incubated for 60 min at 42°C, and 5 min at 85°C.

Real-time PCR.

8

Quantitative RT-PCR was performed by adding 1µl of cDNA, 5 µl of AB master mix (AB

biosystem, CA), forward and reverse primers at 10µM (0.5 µl each), and the volume was

adjusted to 10µl with DNase Free water. Two negative control reactions were prepared, one

without the addition of primers and another without template. Primers were used to this study

are listed in Supplemental Table 9. GAPDH primers were used as an internal control. The

Roche Light Cycler was used (Roche, CA) and the following protocol was used for

amplification: 50°C for 2 min and 95°C for 2 min followed by 40 cycles of 95°C for 15 sec

and 60°C for 1 min. PCR efficiency of the primers ranged from 95 % to 105 %. All

experiments were performed in triplicate and bar graphs were generated using the delta Ct

method and expressed as a fold change relative to the internal control and the mock-

inoculated control plant.

9

Supplemental Results

Literature survey of the eleven novel candidate genes for immune response against a

bacterial pathogen Ralstonia solanacearum

Two genes, an E3 ubiquitin ligase (E3UL) and a receptor like kinase (RLK), were

significantly upregulated more than 2-fold in the resistant variety, H7996. Seven of the genes

were repressed more than 2-fold in the susceptible line West Virginia 700 (WVa) or the

resistant line HA. These included Mitogen Activated Protein Kinase 3 (MAPK3), Ethylene

Response Factor (ERF), Phytophthora-Inhibited Protease 1 (PIP1), TolB, Temperature

Induced Lipocalin (TIL), and Vacuolar Sorting Receptor 6/7 (VSR6/7) in WVa, and a kinase

in HA (Supplemental Figure 4). Of these, MAPK3 was of particular interest because it also

showed a slight increase in expression in the resistant variety and is known to have a

significant role in plant immunity in Arabidopsis (Pitzschke et al., 2009; Tena et al., 2011)

(Meng and Zhang, 2013). Indeed, consistent with the prediction from TomatoNet, Virus

Induced Gene Silencing (VIGS) of MAPK3 in a resistant tomato line led to an increase in

bacterial colonization in stems (Chen et al., 2009). Two additional genes that were strongly

repressed in WVa also have known roles in plant defense pathways. These include ERF and

PIP1. Ethylene promotes defense responses to biotrophic pathogens and is known to function

in defense pathways to R. solanacearum (Chen et al., 2009; Ishihara et al., 2012). PIP1

encodes a Pathogenesis-Related (PR) protein that accumulates to high levels after infection

with bacterial, oomycete, and fungal pathogens (Shabab et al., 2008; Tian et al., 2007; Zhao

et al., 2003). The strong down-regulation of PIP expression in WVa suggests strong

suppression of defense responses in the susceptible line, consistent with its phenotype. The

other two genes examined, PM23 and DH, were also repressed in both WVa and H7996, and

have no known role in defense or disease pathways. Together, these data show that

TomatoNet was able to predict genes with known roles in resistance or disease pathways, as

well as predict novel candidates for these pathways.

10

Supplemental References

Barrett, T., Wilhite, S.E., Ledoux, P., Evangelista, C., Kim, I.F., Tomashevsky, M., Marshall, K.A., Phillippy, K.H., Sherman, P.M., Holko, M., et al. (2013). NCBI GEO: archive for functional genomics data sets--update. Nucleic acids research 41:D991-995.

Chen, Y.Y., Lin, Y.M., Chao, T.C., Wang, J.F., Liu, A.C., Ho, F.I., and Cheng, C.P. (2009). Virus-induced gene silencing reveals the involvement of ethylene-, salicylic acid- and mitogen-activated protein kinase-related defense pathways in the resistance of tomato to bacterial wilt. Physiol Plant 136:324-335.

Cho, A., Shin, J., Hwang, S., Kim, C., Shim, H., Kim, H., Kim, H., and Lee, I. (2014). WormNet v3: a network-assisted hypothesis-generating server for Caenorhabditis elegans. Nucleic acids research 42:W76-82.

Frost, A., Elgort, M.G., Brandman, O., Ives, C., Collins, S.R., Miller-Vedam, L., Weibezahn, J., Hein, M.Y., Poser, I., Mann, M., et al. (2012). Functional repurposing revealed by comparing S. pombe and S. cerevisiae genetic interactions. Cell 149:1339-1352.

Ishihara, T., Mitsuhara, I., Takahashi, H., and Nakaho, K. (2012). Transcriptome analysis of quantitative resistance-specific response upon Ralstonia solanacearum infection in tomato. PLoS One 7:e46763.

Kanehisa, M., Goto, S., Sato, Y., Kawashima, M., Furumichi, M., and Tanabe, M. (2014). Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic acids research 42:D199-205.

Kim, H., Shin, J., Kim, E., Kim, H., Hwang, S., Shim, J.E., and Lee, I. (2014). YeastNet v3: a public database of data-specific and integrated functional gene networks for Saccharomyces cerevisiae. Nucleic acids research 42:D731-736.

Lee, I., Blom, U.M., Wang, P.I., Shim, J.E., and Marcotte, E.M. (2011). Prioritizing candidate disease genes by network-based boosting of genome-wide association data. Genome Res 21:1109-1121.

Lee, I., Date, S.V., Adai, A.T., and Marcotte, E.M. (2004). A probabilistic functional network of yeast genes. Science 306:1555-1558.

Lee, I., Lehner, B., Crombie, C., Wong, W., Fraser, A.G., and Marcotte, E.M. (2008). A single gene network accurately predicts phenotypic effects of gene perturbation in Caenorhabditis elegans. Nat Genet 40:181-188.

Lee, I., Li, Z., and Marcotte, E.M. (2007). An improved, bias-reduced probabilistic functional gene network of baker's yeast, Saccharomyces cerevisiae. PLoS One 2:e988.

Lee, T., Oh, T., Yang, S., Shin, J., Hwang, S., Kim, C.Y., Kim, H., Shim, H., Shim, J.E., Ronald, P.C., et al. (2015a). RiceNet v2: an improved network prioritization server for rice genes. Nucleic acids research 43:W122-127.

Lee, T., Yang, S., Kim, E., Ko, Y., Hwang, S., Shin, J., Shim, J.E., Shim, H., Kim, H., Kim, C., et al. (2015b). AraNet v2: an improved database of co-functional gene networks

11

for the study of Arabidopsis thaliana and 27 other nonmodel plant species. Nucleic acids research 43:D996-1002.

Liao, Y., Smyth, G.K., and Shi, W. (2014). featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30:923-930.

Lim, W.K., Wang, K., Lefebvre, C., and Califano, A. (2007). Comparative analysis of microarray normalization procedures: effects on reverse engineering gene networks. Bioinformatics 23:i282-288.

Love, M.I., Huber, W., and Anders, S. (2014). Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15:550.

Meng, X., and Zhang, S. (2013). MAPK cascades in plant disease resistance signaling. Annu Rev Phytopathol 51:245-266.

Pitzschke, A., Schikora, A., and Hirt, H. (2009). MAPK cascade signalling networks in plant defence. Curr Opin Plant Biol 12:421-426.

Robinson, M.D., McCarthy, D.J., and Smyth, G.K. (2010). edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26:139-140.

Shabab, M., Shindo, T., Gu, C., Kaschani, F., Pansuriya, T., Chintha, R., Harzen, A., Colby, T., Kamoun, S., and van der Hoorn, R.A. (2008). Fungal effector protein AVR2 targets diversifying defense-related cys proteases of tomato. Plant Cell 20:1169-1183.

Shin, J., and Lee, I. (2015). Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling. PLoS One 10:e0139006.

Shin, J., Lee, T., Kim, H., and Lee, I. (2014). Complementarity between distance- and probability-based methods of gene neighbourhood identification for pathway reconstruction. Mol Biosyst 10:24-29.

Shin, J., Yang, S., Kim, E., Kim, C.Y., Shim, H., Cho, A., Kim, H., Hwang, S., Shim, J.E., and Lee, I. (2015). FlyNet: a versatile network prioritization server for the Drosophila community. Nucleic acids research 43:W91-97.

Tena, G., Boudsocq, M., and Sheen, J. (2011). Protein kinase signaling networks in plant innate immunity. Curr Opin Plant Biol 14:519-529.

Thimm, O., Blasing, O., Gibon, Y., Nagel, A., Meyer, S., Kruger, P., Selbig, J., Muller, L.A., Rhee, S.Y., and Stitt, M. (2004). MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J 37:914-939.

Tian, M., Win, J., Song, J., van der Hoorn, R., van der Knaap, E., and Kamoun, S. (2007). A Phytophthora infestans cystatin-like protein targets a novel tomato papain-like apoplastic protease. Plant Physiol 143:364-377.

12

Tomato Genome, C. (2012). The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485:635-641.

Yue, J., Ma, X., Ban, R., Huang, Q., Wang, W., Liu, J., and Liu, Y. (2015). FR database 1.0: a resource focused on fruit development and ripening. Database (Oxford) 2015.

Zhao, Y., Thilmony, R., Bender, C.L., Schaller, A., He, S.Y., and Howe, G.A. (2003). Virulence systems of Pseudomonas syringae pv. tomato promote bacterial speck disease in tomato by targeting the jasmonate signaling pathway. Plant J 36:485-499.

13

Supplemental Figure 3. Overview of TomatoNet construction and network-based

prediction methods. Individual gene networks were inferred from different data types and

network inference algorithms (STEP 1). The inferred networks were evaluated using

reference co-functional links derived from pathway annotations (STEP 2), and then integrated

into a single network (STEP 3), which turns out to be more accurate and comprehensive. The

integrated tomato gene network is used to predict new genes for the trait of interest with two

alternative network algorithms (STEP 4): pathway-centric prediction and context-centric

prediction. TomatoNet web server can run both network-based predictions with different

types of user-input genes.

14

Supplemental Figure 2. Benchmarking of component functional networks derived from

18 distinct data types and the integrated network, TomatoNet. The x-axis of the graph

indicates the coverage of the ~35000 tomato coding genes by networks and the y-axis is log

likelihood score (LLS) of functional coupling. Bootstrapping sampling was used to avoid

overtraining for each network. The integrated tomato network provides 721,626 links

mapping approximately 65% to 34,727 tomato-related genes. Overall, the integrated data

outperforms all individual component networks. TomatoNet w/o SL-CX: TomatoNet with

excluding functional links derived from SL-CX.

15

Supplemental Figure 3. TomatoNet is highly predictive for pathways in tomato. (A)

Assessment of TomatoNet based on precision-recall analysis, in which we measured

precision by percentage of tomato gene pairs that share UniProtGOBP annotations (y-axis)

for given coverage of the genome by the network (x-axis). A random model was generated by

pairing randomly selected two genes out of all tomato genes. TomatoNet shows substantially

higher precision for the entire range of the genome coverage. For example, TomatoNet links

show over 50% precision for gene pairs that share UniProtGOBP annotations for the network

covering ~20% of coding genome, while randomized model shows ~5% precision for the

same genome coverage. (B) Since the network connects genes that belong to the same GOBP

pathways, we may be able to predict tomato genes for each GOBP pathway by guilt-by-

association (GBA). To assess the GBA-based predictions for GOBP pathways by TomatoNet,

we employed receiver operating characteristic (ROC) analysis, which can be summarized as

area under the ROC curve (AUC) scores for each GOBP pathway. We found that the median

AUC of GOBP pathway predictions by TomatoNet is approximately 0.7; 425 of 813 (~52%)

tested GOBP pathways were predicted with an AUC of 0.7 or higher. This demonstrated that

TomatoNet is highly predictive for the majority of the pathways in tomato. We considered

only 813 GOBP terms with no less than five member genes. Random analysis was done by

using 813 gene sets of the same size of random tomato genes. As expected, the majority of

the AUCs by random samples show ~0.5.

16

Supplemental Figure 4: qRT-PCR analysis of the 11 genes predicted by TomatoNet to

have roles in tomato immune responses to R. solanacearum. A) H7996. Of the 11 genes,

two were upregulated in the resistant cultivar H7996 more than 2-fold at 48 hpi, while eight

of the other nine were significantly repressed 48 hpi, B) WVa. Nine of the 11 genes were

significantly repressed in WVa 48 hpi. Stars indicate significance at p < 0.05 for a given gene

compared to its mock-inoculated plant.

Supplemental Table 1. TomatoNet and component networks inferred from 18 distinct data types17

Network Description Genes Links

TomatoNet Integrated network 22,549 721,626Component networks

SL-CX By co-expression of tomato genes 17,167 170,530

SL-GN By gene neighborhood of two bacterial orthologs of tomato genes in prokaryotic genomes 3,209 36,109

SL-PG By phylogenetic profile similarity across species 3,980 121,494

AT-CC By co-citation of (A. thaliana) orthologs in Pubmed articles 3,598 19,727

AT-CX By co-expression of arabidopsis orthologs 10,237 132,506

AT-HT By high-throughput arabidopsis orthologous 1,903 2,650

AT-LC By literature curated arabidopsis orthologous 1,294 1,615

CE-CX By co-expression of worm (C. elegans) orthologs 2,097 14,665

DM-CX By co-expression of fly (D. melanogaster) orthologs 2,205 26,097

DR-CX By co-expression of zebrafish (D. rerio) orthologs 1,842 24,100

HS-CX By co-expression of human orthologs 1,317 10,842

HS-HT By high-throughput human orthologous PPIs 1,578 5,624

HS-LC By literature curated human orthologous PPIs 2,768 15,776

SC-CX By co-expression of yeast (S. cerevisiae) orthologs 2,113 48,149

SC-GT By genetic interactions of yeast orthologs 1,514 24,574

SC-HT By high-throughput yeast orthologous PPIs 2,044 36,576

SC-LC By literature curated yeast orthologous PPIs 1,952 14,316

OS-CX By co-expression of rice (O. sativa) orthologs 8,300 77,496

18

Supplemental Table 2. Gene expression data used for inferring co-expression networks.

GSE ID Data type GSE Title # Samples

GSE19326 microarray Expression data fromvarious tomato plant tissues 67

GSE22304 microarray Expression data in response to abiotic stresses in tomato at flowering stage 24

GSE22803 microarray

Expression data from rootsand first two leaves of tomato seedlings growing on regular MS medium or MS medium supplemented with multi-wall carbon nanotubes(0, 50, 100, 200 ug/ml) or activated carbon (50 mg/ml)

20

GSE23562 RNA-seqSmall RNA profiling Tomato,wild tomato relative and seriesof introgression lines

14

GSE28564 microarrayGlobal gene expression analysis of normal and ripening inhibitor (rin) mutant tomatoes during ripening

12

GSE30270 microarrayTranscriptomic and metabolic responses of mycorrhizal roots to nitrogen patches under field conditions

30

GSE35020 microarray Microarray analysis of gene expression under different light conditions 20

GSE39894 microarray

Comparative transcriptomeanalysis of responses to water deficit in Solanum lycopersicumand S. pimpinellifolium roots

24

GSE41135 microarray

Expression analysis oftomato plants TYLCV resistant, susceptible and resistant line silenced in the hexose transporter (LeHT1) gene before and 7 days after inoculation of tomato with Tomato yellow leaf curl virus (TYLCV).

16

GSE43492 microarray The effects of a novel agrochemical in alleviating salinity stress 12

GSE45774 RNA-seq Transcriptome Sequencing in wild and domesticated tomato species 19

GSE49289 RNA-seqTranscriptome profiling of tomato fruits of a FRUITFULL-suppressed line and rin mutant by next generation RNA sequencing

18

GSE: Gene expression sample series of GEO database

19

Supplemental Table 3. Fruit development and fruit ripening (grey for ripening) genes in tomato by the FR database

No. FDR gene list Symbol Description by FR Database 1.0

1 Solyc00g136560.2 During fruit development and ripening

2 Solyc01g006560.2 During fruit ripening

3 Solyc01g008710.2 MAN4 Fruit ripening

4 Solyc01g009170.2 Fruit ripening

5 Solyc01g010970.2 In early stage of fruit

6 Solyc01g056340.2 Det1 During fruit ripening

7 Solyc01g057270.2 During tomato fruit development and ripening

8 Solyc01g060140.2 During tomato fruit development and ripening

9 Solyc01g065980.2 ERF6 During fruit ripening

10 Solyc01g067710.2 nhx3 During fruit development

11 Solyc01g067890.2 dxs1 During fruit ripening

12 Solyc01g079620.2 During fruit ripening

13 Solyc01g080010.2 Xegip Fruit development

14 Solyc01g081610.2 Post-ripening

15 Solyc01g091160.2 ARG1 At the mature green stage

16 Solyc01g091170.2 ARG2 At the mature green stage

17 Solyc01g095080.2 Tomato ripening stages

18 Solyc01g096750.1 In tomato developmental processes

19 Solyc01g096810.2 EIL3 Fruit ripening

20 Solyc01g097290.2 Early fruit

21 Solyc01g097340.2 During fruit development

22 Solyc01g097980.2 During fruit development

23 Solyc01g098190.2 nhx4 During fruit development

24 Solyc01g099190.2 LOXB Mature fruit

25 Solyc01g099630.2 XTH1 During fruit growth and ripening

26 Solyc01g104950.2 XYL2 Fruit development and ripening

27 Solyc01g105230.2 During tomato fruit development and ripening

28 Solyc01g107730.2 During fruit development

29 Solyc01g109790.2 During fruit development and ripening

30 Solyc02g021470.2 During fruit development

31 Solyc02g021650.2 DDB1 Mature green

32 Solyc02g037530.2 Fruit initiation

33 Solyc02g065750.1 Fruit development

34 Solyc02g065770.2 Fruit development and ripening

35 Solyc02g067180.2 Tomato fruit ripening

36 Solyc02g070580.1 SD1 During fruit maturation

20

37 Solyc02g071730.2 Fruit development

38 Solyc02g077560.2 ARF3 During fruit development

39 Solyc02g079220.2 ht1 During tomato fruit development

40 Solyc02g079500.2 During fruit ripening

41 Solyc02g080280.1 During fruit ripening

42 Solyc02g080800.2 DHS Fruit softening and development

43 Solyc02g081170.2 ChrC Red-ripe

44 Solyc02g083950.2 wus Early developmental stages

45 Solyc02g084720.2 TBG6 Fruit development

46 Solyc02g085500.2 During fruit development

47 Solyc02g085870.2 During fruit development

48 Solyc02g086930.2 During fruit ripening

49 Solyc02g089160.2 Flower and fruit production

50 Solyc02g089200.2 Fruit and floral development

51 Solyc02g090730.2 Fruit development

52 Solyc02g090890.2 During fruit development

53 Solyc02g091970.2 Through fruit development

54 Solyc02g092980.2 cycD3 During fruit development

55 Solyc02g093150.2 During fruit development and ripening

56 Solyc03g006860.2 Frk1 Early in fruit development

57 Solyc03g006880.2 During fruit ripening

58 Solyc03g007960.2 CrtR-b2 During fruit ripening

59 Solyc03g025560.2 During fruit development and ripening

60 Solyc03g026270.1 Early developmental stages

61 Solyc03g026280.2 CBF1 Mature green

62 Solyc03g031860.2 Psy1 Fruit ripening

63 Solyc03g043880.2 Through fruit development

64 Solyc03g082420.2 HSP21 Fruit ripening

65 Solyc03g083910.2 Plant and fruit development

66 Solyc03g093130.2 XTH3 During fruit growth and ripening

67 Solyc03g093610.1 ERF1 During fruit ripening and softening

68 Solyc03g111720.2 During fruit ripening

69 Solyc03g114340.2 DXR Fruit ripening

70 Solyc03g114840.2 In fruit maturation

71 Solyc03g118290.2 ARF2 During fruit development

72 Solyc03g120500.2 IAA6 During fruit initiation and development

73 Solyc03g123630.2 PMEU1 During fruit ripening

74 Solyc03g123760.2 Fruit development and ripening

75 Solyc04g009440.2 Salt stress

21

76 Solyc04g009800.2 Tomato ripening

77 Solyc04g009900.2 From 8 to 20 days post anthesis

78 Solyc04g040190.1 Fruit development

79 Solyc04g051190.2 During fruit development

80 Solyc04g056270.2 ER66 During fruit development and ripening

81 Solyc04g056280.2 CDKC In developing tomato fruit

82 Solyc04g056600.2 NHX2 During fruit development

83 Solyc04g071880.2 During fruit early development

84 Solyc04g076880.2 Fruit ripening

85 Solyc04g077240.2 Through fruit development

86 Solyc04g078470.2 During fruit development

87 Solyc04g078840.2 During different developmental stages

88 Solyc04g078900.2 CYP707A1 Fruit set

89 Solyc04g081000.2 During ovary growth

90 Solyc04g081240.2 During fruit development

91 Solyc04g082840.2 cdkB2 In developing tomato fruit

92 Solyc05g005710.2 spdsyn During fruit ripening

93 Solyc05g007190.2 SUT2 Fruit and seed development

94 Solyc05g008060.2 Fruit set

95 Solyc05g012020.2 In fruit maturation and during fruit ripening

96 Solyc05g014280.2 vis1 Fruit ripening

97 Solyc05g015750.2 LeMADS5 In fruit maturation

98 Solyc05g052050.1 During fruit ripening

99 Solyc05g053550.2 CHS2 Fruit development

100 Solyc05g054410.2 TBP1 Seed and fruit development

101 Solyc06g005150.2 APX2 Flower and fruit development

102 Solyc06g008820.2 NHX1 During fruit development

103 Solyc06g036260.2 Pigmentation of flowers and fruits

104 Solyc06g049050.2 EXP2 Early tomato fruit growth

105 Solyc06g051400.2 During fruit maturation

106 Solyc06g051800.2 EXP1 Fruit ripening

107 Solyc06g053620.2 PPCK2 From 8 to 20 days post anthesis

108 Solyc06g053710.2 ETR4 During fruit development

109 Solyc06g059740.2 During fruit ripening

110 Solyc06g061000.2 Fruit development

111 Solyc06g065630.2 FZY During fruit ripening

112 Solyc06g066440.2 Hxk2 Fruit development

113 Solyc06g068090.2 PLDa1 Flower development and fruit development and ripening

114 Solyc06g068860.2 Post-ripening

22

115 Solyc06g069430.2 During fruit ripening

116 Solyc06g073190.2 FRK2 Early tomato seed development

117 Solyc06g073320.2 During fruit development and ripening

118 Solyc06g073720.1 EIL1 During ripening

119 Solyc06g073730.1 EIL4 Fruit ripening

120 Solyc06g074730.2 In tomato developmental processes

121 Solyc06g076520.1 Post-harvest fruit

122 Solyc06g076920.2 During fruit development and ripening

123 Solyc07g006900.1 At 4 days after anthesis

124 Solyc07g007600.2 During fruit early development

125 Solyc07g021630.2 Through fruit development

126 Solyc07g042260.2 Fruit set and development

127 Solyc07g043310.2 GABA-TP1 During fruit development and ripening

128 Solyc07g049530.2 ACO1 Post-harvest fruit

129 Solyc07g049690.2 HPL During fruit ripening

130 Solyc07g055290.2 Rab11a During the development and ripening

131 Solyc07g055920.2 TAGL1 During fruit development and ripening

132 Solyc07g056570.1 Fruit set

133 Solyc07g056580.2 ETR2 During fruit ripening

134 Solyc07g064170.2 PME1.9 Post harvest

135 Solyc07g064180.2 PME2.1 Post harvest

136 Solyc08g005660.1 During fruit development and ripening

137 Solyc08g005680.2 During fruit development and ripening

138 Solyc08g060810.2 EBF2 During fruit ripening

139 Solyc08g061130.2 Fruit development and ripening

140 Solyc08g065790.2 vpe Fruit development

141 Solyc08g066330.1 During early tomato fruit development

142 Solyc08g075540.2 Fruit development and ripening

143 Solyc08g077230.2 During fruit development and ripening

144 Solyc08g080090.2 During fruit ripening

145 Solyc08g080640.1 NP24 Ripening of tomato fruit

146 Solyc08g081190.2 Post-anthesis development and during ripening

147 Solyc08g081530.2 MDHAR During fruit development and ripening

148 Solyc08g081540.2 ACS1A During fruit ripening

149 Solyc08g082250.2 Cel8 Fruit ripening

150 Solyc08g082630.2 During fruit development

151 Solyc09g010080.2 lin5 During fruit development and ripening

152 Solyc09g010090.2 LIN7 During fruit ripening

153 Solyc09g010210.2 Cel2 Fruit abscission

23

154 Solyc09g011920.2 Fruit development and ripening

155 Solyc09g061280.2 krp2 Tomato fruit developing

156 Solyc09g065640.2 EOL1 Full ripe stage of fruit and fruit ripening

157 Solyc09g074520.2 Anthesis stages

158 Solyc09g074830.2 wee1 Early fruit development

159 Solyc09g075440.2 NR During fruit development

160 Solyc09g075820.2 During tomato fruit development

161 Solyc09g082690.2 Tomato fruit ripening

162 Solyc09g082990.2 In red fruits

163 Solyc09g089610.2 ETR6 During fruit ripening

164 Solyc09g090100.2 Flowering and fruit development

165 Solyc09g091510.2 CHS1 Fruit development

166 Solyc09g091550.2 During fruit development

167 Solyc09g091780.2 krp1 Tomato fruit developing

168 Solyc10g006880.2 During fruit ripening

169 Solyc10g007600.2 Expressed at 15 days after flowering

170 Solyc10g009110.1 During fruit development

171 Solyc10g047030.1 XYL1 Fruit development and ripening

172 Solyc10g052470.1 Very early stages of tomato fruit development

173 Solyc10g054590.1 Fruit and seed development

174 Solyc10g074720.1 cdkB1 In developing tomato fruit

175 Solyc10g078550.1 gdh1 During fruit ripening

176 Solyc10g079240.1 10, 20, 33 DPA

177 Solyc10g080210.1 Tomato fruit ripening process of softening

178 Solyc10g081120.1 ARF1 Fruit development and ripening

179 Solyc10g081470.1 The onset of fruit ripening

180 Solyc10g081650.1 CRTISO Fruit maturation

181 Solyc10g083790.1 During fruit development

182 Solyc10g085150.1 CPT4 During fruit development and ripening

183 Solyc10g086250.1 During fruit development and ripening

184 Solyc10g086260.1 ANT1 During fruit development and ripening

185 Solyc11g010570.1 Flower abscission

186 Solyc11g011260.1 During fruit-set and development

187 Solyc11g017010.1 SUT1 Fruit development

188 Solyc11g066270.1 XTH6 During fruit growth and ripening

189 Solyc11g069190.1 ARF4 During fruit development and ripening

190 Solyc11g069380.1 During fruit ripening

191 Solyc11g069500.1 During fruit ripening

192 Solyc11g071810.1 During fruit ripening

24

193 Solyc11g072630.1 MAPK4 Green mature and post-harvest

194 Solyc12g005950.1 COP1 Fruit development and ripening

195 Solyc12g006380.1 ODD Expressed at 15 days after flowering

196 Solyc12g006520.1 Fruit development

197 Solyc12g006530.1 Fruit development

198 Solyc12g008840.1 Fruit ripening and softening

199 Solyc12g009300.1 Fruit set and development

200 Solyc12g009560.1 EBF1 During fruit ripening

201 Solyc12g009570.1 cipk During fruit development

202 Solyc12g011330.2 ETR1 During fruit ripening

203 Solyc12g015860.1 FPS1 During early fruit development

204 Solyc12g035520.1 During tomato fruit development and ripening

205 Solyc12g038390.1 During fruit ripening

206 Solyc12g044880.1 Fruit ripening

207 Solyc12g056600.2 Breaker stage

208 Solyc12g088220.1 Through fruit development

209 Solyc12g095860.1 During the early development of tomato fruit

210 Solyc12g099200.1 During fruit ripening

211 Solyc12g099340.1 During tomato fruit development and ripening

25

Supplemental Table 4. Five validated fruit development and ripening genes among top 100 context-centric predictions using 100 up-regulated genes of ‘10 days after breaker stage’ compared to ‘flower stage’

Rank Gene ID Symbol TF DEG

PMID Validation method

Description

8 Solyc03g082420.2 HSP21 15879560 Overexpression [FR Database 1.0 base] It plays a role in plant development under normal growth conditions, in addition to its protective effect under stress conditions. Also, it affects fruit color changes.[Arabidopsis ortholog GOBP] protein folding, response to heat, response to high light intensity, heat acclimation, response to hydrogen peroxide

53 Solyc05g014280.2 vis1 DEG

12586896 Northern; Southern

[FR Database 1.0 base] It plays a role in pectin depolymerization to contribute to juice viscosity.

54 Solyc07g055920.2 TAGL1 TF 20335407; 21203447; 19880793; 19891701

RNAi [FR Database 1.0 base] It regulates tomato flower and fruit development and ripening.[AgriGO] regulation of transcription, DNA-dependent

76 Solyc01g065980.2 ERF6 TF 22111515 Southern [FR Database 1.0 base] It affects on trans-lycopene and beta-carotene accumulation.[Uniprot2GOBP] regulation of transcription, DNA-templated,heat acclimation,positive regulation of transcription, DNA-templated,ethylene-activated signaling pathway,cell death,transcription, DNA-templated[AgriGO] regulation of transcription, DNA-dependent[Arabidopsis ortholog GOBP] cell death, response to ethylene, response to cytokinin, response to jasmonic acid, ethylene-activated signaling pathway, heat acclimation, positive regulation of transcription & DNA-templated, response to other organism

91 Solyc05g015750.2 LeMADS5 TF 20946942 Northern; Southern

[FR Database 1.0 base] It plays crucial roles in organ and cell differentiation.[AgriGO] regulation of transcription, DNA-dependent[Arabidopsis ortholog GOBP] cell fate specification, regulation of transcription & DNA-templated, flower development, specification of floral organ identity, carpel development, ovule development, meristem development, specification of floral organ number

26

Supplemental Table 5. Five validated fruit ripening genes among top 100 context-centric predictions using 100 up-regulated genes of ‘10 days after breaker stage’ compared to ‘mature green stage’

Rank Gene ID Symbol TF DEG PMID Validation method

Description

17 Solyc05g014280.2 vis1 DEG 12586896

Northern; Southern

[FR Database 1.0 base] It plays a role in pectin depolymerization to contribute to juice viscosity.

27 Solyc03g082420.2 HSP21 15879560

Overexpression [FR Database 1.0 base] It plays a role in plant development under normal growth conditions,

in addition to its protective effect under stress conditions. Also, it affects fruit color changes.[Arabidopsis ortholog GOBP] protein folding, response to heat, response to high light intensity, heat acclimation, response to hydrogen peroxide

72 Solyc01g067890.2 dxs1 20591838

Western [FR Database 1.0 base] It is involved in the MEP pathway to increase monoterpene β-phellandrene and decrease two sesquiterpenes in trichomes.[AgriGO] metabolic process[Arabidopsis ortholog GOBP] sulfur amino acid metabolic process, glycine catabolic process, unsaturated fatty acid biosynthetic process, phosphatidylglycerol biosynthetic process, oxidoreduction coenzyme metabolic process, vitamin metabolic process, cellular amino acid biosynthetic process, aromatic amino acid family metabolic process, aromatic amino acid family biosynthetic process, lipoate metabolic process, coenzyme biosynthetic process, nucleotide metabolic process, response to light stimulus, jasmonic acid biosynthetic process, leaf morphogenesis, regulation of proton transport, chlorophyll metabolic process, chlorophyll biosynthetic process, carotenoid biosynthetic process, regulation of lipid metabolic process, isopentenyl diphosphate biosynthetic process & methylerythritol 4-phosphate pathway, cysteine biosynthetic process, secondary metabolic process, cell differentiation, oxylipin biosynthetic process, regulation of protein localization, sulfur compound biosynthetic process, positive regulation of transcription & DNA-templated, protein autophosphorylation

89 Solyc08g060810.2 EBF2 19903730

RNAi [FR Database 1.0 base] It negatively regulates the ethylene signalling pathway via mediating the degradation of EIN3/EIL proteins to trigger fruit ripening.[Arabidopsis ortholog GOBP] ubiquitin-dependent protein catabolic process, response to ethylene, negative regulation of ethylene-activated signaling pathway, regulation of circadian rhythm

97 Solyc10g085150.1 CPT4 23134568

Clone [FR Database 1.0 base] It is involved in the synthesis of long-chain polyisoprenoids.[Uniprot2GOBP] metabolic process[Arabidopsis ortholog GOBP] ubiquinone biosynthetic process, dolichol biosynthetic process

27

Supplemental Table 6. 12 implicated genes for fruit development and ripening among top 100 candidates by context-centric prediction using 100 up-regulated genes in ‘10 days after breaker stage’ compared to ‘mature green stage’. Annotations related to fruit development and ripening are highlighted.

Rank Gene ID Symbol

TF Description

15 Solyc02g092860.2 Uniprot2GOBP oxidation-reduction processAgriGO oxidation reductionArabidopsis ortholog GOBP response to ethylene, heat acclimation, response to karrikin

27 Solyc04g082960.1 Arabidopsis ortholog GOBP respiratory burst involved in defense response, ethylene-activated signaling pathway, response to chitin, intracellular signal transduction

41 Solyc01g104050.2 Uniprot2GOBP phosphorylation, ethylene biosynthetic process, response to ethylene, intracellular signal transduction, protein phosphorylation, abscisic acid-activated signaling pathwayAgriGO protein amino acid phosphorylationArabidopsis ortholog GOBP ethylene biosynthetic process, response to ethylene, abscisic acid-activated signaling pathway, intracellular signal transduction

45 Solyc02g087210.2 Uniprot2GOBP signal transduction, response to cyclopentenone, response to ethylene, toxin catabolic process, protein targeting to membrane, response to wounding, salicylic acid mediated signaling pathway, hyperosmotic salinity response, response to jasmonic acid, jasmonic acid mediated signaling pathway, response to auxin, abscisic acid-activated signaling pathway, response to water deprivation, heat acclimation, regulation of plant-type hypersensitive responseArabidopsis ortholog GOBP protein targeting to membrane, signal transduction, toxin catabolic process, response to water deprivation, response to wounding, response to ethylene, response to auxin, response to abscisic acid, abscisic acid-activated signaling pathway, response to jasmonic acid, salicylic acid mediated signaling pathway, jasmonic acid mediated signaling pathway, response to chitin, heat acclimation, regulation of plant-type hypersensitive response, response to cyclopentenone, hyperosmotic salinity response

48 Solyc06g071920.2 Uniprot2GOBP oxidation-reduction process, glucose metabolic processAgriGO oxidation reductionArabidopsis ortholog GOBP glucose catabolic process, gluconeogenesis, glycolytic process, pentose-phosphate shunt, water transport, hyperosmotic response, response to oxidative stress, cytoskeleton organization, Golgi organization, aerobic respiration, response to temperature stimulus, response to heat, response to salt stress, response to sucrose, fruit development, proteasomal protein catabolic process, response to endoplasmic reticulum stress, response to hydrogen peroxide, response to cadmium ion, seed development, response to redox state

69 Solyc06g073830.1 Arabidopsis ortholog GOBP response to wounding, response to fungus, jasmonic acid biosynthetic process, response to ethylene, abscisic acid-activated signaling pathway, response to jasmonic acid, ethylene-activated signaling pathway, response to chitin, intracellular signal transduction

75 Solyc11g066440.1 AgriGO oxidation reductionArabidopsis ortholog GOBP cell death, embryo development ending in seed dormancy, defense response to bacterium & incompatible interaction, flower development, leaf morphogenesis, thylakoid membrane organization, fruit development, vegetative to reproductive phase transition of meristem, chlorophyll catabolic process, iron-sulfur cluster assembly, cell differentiation, positive regulation of transcription & DNA-templated, ovule development

76 Solyc01g065980.2 ERF6 TF Uniprot2GOBP regulation of transcription, DNA-templated, heat acclimation, positive regulation of transcription, DNA-templated, ethylene-activated signaling pathway, cell death,transcription, DNA-templatedAgriGO regulation of transcription, DNA-dependentArabidopsis ortholog GOBP cell death, response to ethylene, response to cytokinin, response to jasmonic acid, ethylene-activated signaling pathway, heat acclimation, positive regulation of transcription & DNA-templated, response to other organism

80 Solyc04g077980.1 TF Arabidopsis ortholog GOBP respiratory burst involved in defense response, response to oxidative stress, signal transduction, response to cold, response to water deprivation, response to wounding, response to mechanical stimulus, response to fungus, response to high light intensity, response to salt stress, jasmonic acid biosynthetic process, response to ethylene, response to auxin, response to abscisic acid, abscisic acid-activated signaling pathway, response to jasmonic acid, ethylene-activated signaling pathway, photoprotection, response to chitin, photosynthesis, multicellular organism growth, intracellular signal transduction, hyperosmotic salinity response, negative regulation of transcription & DNA-templated

87 Solyc06g068460.2 TF AgriGO regulation of transcription, DNA-dependentArabidopsis ortholog GOBP MAPK cascade, response to molecule of bacterial origin, respiratory burst involved in defense response, regulation of transcription & DNA-templated, protein targeting to membrane, detection of biotic stimulus, response to wounding, response to mechanical stimulus, response to fungus, response to absence of

28

light, jasmonic acid biosynthetic process, salicylic acid biosynthetic process, response to ethylene, abscisic acid-activated signaling pathway, response to salicylic acid, response to jasmonic acid, systemic acquired resistance & salicylic acid mediated signaling pathway, salicylic acid mediated signaling pathway, jasmonic acid mediated signaling pathway, ethylene-activated signaling pathway, response to chitin, regulation of hydrogen peroxide metabolic process, regulation of plant-type hypersensitive response, detection of bacterium, endoplasmic reticulum unfolded protein response, regulation of defense response, negative regulation of defense response, intracellular signal transduction, defense response to bacterium, negative regulation of programmed cell death, regulation of multi-organism process, regulation of defense response to virus by host, regulation of immune response, defense response to fungus

89 Solyc01g104740.2 Arabidopsis ortholog GOBP transcription & DNA-templated, protein folding, response to heat, response to water deprivation, response to high light intensity, response to ethylene, response to abscisic acid, ethylene-activated signaling pathway, response to endoplasmic reticulum stress, response to hydrogen peroxide, positive regulation of transcription & DNA-templated

100 Solyc07g026650.2 aco5 AgriGO oxidation reductionArabidopsis ortholog GOBP ethylene biosynthetic process, xylem development, cell wall macromolecule metabolic process, cellular response to potassium ion starvation, cellular response to iron ion, cellular response to fatty acid, cellular response to nitric oxide

29

Supplemental Table 7A. List of 109 up-regulated genes in Ishihara et al. (2012)

Solyc01g005470.2 Solyc03g095770.2 Solyc06g050930.2 Solyc09g082270.2Solyc01g008620.2 Solyc03g098730.1 Solyc06g071280.2 Solyc09g089910.1Solyc01g081310.2 Solyc03g113220.2 Solyc06g071810.1 Solyc09g090070.1Solyc01g086680.2 Solyc03g114600.2 Solyc06g075690.2 Solyc09g090470.2Solyc01g095080.2 Solyc03g114890.2 Solyc06g076300.2 Solyc09g090730.1Solyc01g095170.2 Solyc03g115920.2 Solyc07g040710.2 Solyc09g090980.2Solyc01g097240.2 Solyc03g115930.1 Solyc07g045030.2 Solyc09g090990.2Solyc01g098590.2 Solyc03g117860.2 Solyc07g049530.2 Solyc09g097960.2Solyc01g100010.2 Solyc04g005050.1 Solyc07g049660.2 Solyc10g012370.2Solyc01g107390.2 Solyc04g009440.2 Solyc07g055710.2 Solyc10g050880.1Solyc01g107780.2 Solyc04g009860.2 Solyc07g056200.2 Solyc10g052880.1Solyc01g107820.2 Solyc04g016470.2 Solyc08g006470.2 Solyc10g055200.1Solyc01g109140.2 Solyc04g040180.2 Solyc08g007790.2 Solyc10g055760.1Solyc01g112220.2 Solyc04g048900.2 Solyc08g008280.2 Solyc10g079860.1Solyc02g061770.2 Solyc04g064870.2 Solyc08g028780.1 Solyc10g083290.1Solyc02g064830.2 Solyc04g071890.2 Solyc08g068700.1 Solyc10g083690.2Solyc02g069800.1 Solyc04g074950.2 Solyc08g075540.2 Solyc10g085010.1Solyc02g080070.2 Solyc04g078290.2 Solyc08g075550.2 Solyc11g011330.1Solyc02g082920.2 Solyc04g078660.1 Solyc08g080130.2 Solyc11g011340.1Solyc02g082930.2 Solyc05g005460.2 Solyc09g011590.2 Solyc12g006380.1Solyc02g086270.2 Solyc05g041910.2 Solyc09g011630.2 Solyc12g007030.1Solyc02g090490.2 Solyc05g050130.2 Solyc09g011870.1 Solyc12g008960.1Solyc02g092860.2 Solyc05g050800.2 Solyc09g014990.2 Solyc12g014010.1Solyc02g093050.2 Solyc05g056400.2 Solyc09g057960.1 Solyc12g042480.1Solyc03g025670.2 Solyc06g009040.2 Solyc09g061840.2 Solyc12g045030.1Solyc03g025720.2 Solyc06g009110.2 Solyc09g075020.2Solyc03g033840.2 Solyc06g034370.1 Solyc09g075820.2Solyc03g080190.2 Solyc06g036310.2 Solyc09g082240.2

Supplemental Table 7B. List of 10 down-regulated genes in Ishihara et al. (2012)

Solyc11g072310.1 Solyc11g045100.1 Solyc02g084950.2 Solyc03g111130.1

Solyc05g007210.2 Solyc06g075660.2 Solyc08g005680.2 Solyc03g111140.2

Solyc04g080540.2 Solyc01g096940.2    

30

Supplemental Table 8. TomatoNet predicted genes used for qRT-PCR validation

Ranka ID Gene and abbreviation Inputb

1 Solyc02g077040.2 Phytophthora-inhibited protease (PIP1), a cysteine protease UP

6 Solyc01g057080.1 Ethylene Response Factor (ERF), a transcription factor UP

7 Solyc09g008010.2 Probable receptor-like kinase (Kinase) UP

13 Solyc03g112340.1 E3 Ubiquitin protein ligase (E3UL) UP

14 Solyc06g005170.2 Mitogen Activated Protein Kinase 3 (MAPK3) UP

16 Solyc02g081970.2 Vacuolar Sorting Receptor (VSR6/7) UP

17 Solyc08g066210.2 Receptor like kinase (RLK) UP

47 Solyc06g008620.1 TolB-protein related (TolB) UP

3 Solyc12g010320.1 Temperature induced lipocalin (TIL) DN

35 Solyc01g006510.2 L-idonate 5-dehydrogenase (DH) DN

62 Solyc01g107380.2 Fragment of seed maturation protein (PM23) DN

Gene and abbreviation are from Sol Genomics Network: solcyc.solgenomics.net

a. Rank in pathway prediction method

b. Refers to the input gene list used for prediction. UP, up regulated DEGs; DN down regulated DEGs;

31

Supplemental Table 9. List of qRT-PCR primers

ID Gene Direction Seq (5' --- 3')

Solyc01g057080.1 ERF For CCTTCCATGTTACCGCCTAA

Rev AGCATGTTGTTGAGCAGGTG

Solyc08g066210.2 RLK For ACCTTGCACCTCTGTTGCTT

Rev CAGTGTTCCCTTCTCCGTGT

Solyc06g005170.2 MAPK3 For CTGAGCTTCTTGGCACTCCT

Rev TGGATTCACATGAGGGAACA

Solyc03g112340.1 E3UL For TGTAATGGACCGGGCTCTAC

Rev ACCTTGCACCTCTGTTGCTT

Solyc02g077040.2 PIP1 For CGAACAACCAGCAGCAGTTA

Rev GCCGCAATACCAACAGAAAT

Solyc01g107380.2 PM23 For TTGGCTCGTCTGTTGTTTTG

Rev TTAACCCACATGGCAAGGAT

Solyc01g006510 DH For AATCGACGCGAGTTTTGACT

Rev GCAGCTGGAGTGAGAGGAAC

Solyc12g010320.1 TIL For TGCTTTGATTGGTCAGCCTA

Rev GGCGTCTTGTGGAGCTTACT

Solyc02g081970.2 VSR6/7 For GTTCGGTAAACAACGGAGGA

Rev GCCATCCCCTTTAAAACCAT

Solyc09g008010.2 Kinase For GAACCATGGGATATGCTGCT

Rev TGCTCACTCCTAGGCCTGTT

Solyc06g008620.1 TolB For TGGATTGATACGATGCCAAA

Rev ATATCCTCCGAAGCCCTGAT

32