2 acidity in apple - Crop Science Society of America · 2 acidity in apple 3 Baiquan ... 4 to...
Transcript of 2 acidity in apple - Crop Science Society of America · 2 acidity in apple 3 Baiquan ... 4 to...
1
Genes encoding aluminum-activated malate transporter II and their association with fruit 1
acidity in apple 2
Baiquan Ma1,3
, Liao Liao1, Hongyu Zheng
1,3, Jie Chen
2,3, Benhong Wu
2, Collins Ogutu
1,3, Shaohua Li
1, 3
Schuyler S. Korban4, Yuepeng Han
1� 4
5
6
1Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden of 7
the Chinese Academy of Sciences, Wuhan, 430074, P.R. China 8
2Beijing Key Laboratory of Grape Sciences and Enology, and CAS Key Laboratory of Plant Resources, 9
Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China 10
3Graduate University of Chinese Academy of Sciences, 19A Yuquanlu, Beijing, 100049, P.R. China 11
4Department of Biology, University of Massachusetts Boston, Boston, MA 02184 USA 12
13
14
�Corresponding author: [email protected] 15
Tel/Fax: 86-27-8751-0872 16
17
18
19
20
21
Page 1 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
2
Abstract: A gene encoding aluminum-activated malate transporter (ALMT) was previously reported as a 1
candidate for the Ma locus controlling fruit acidity in apple. In this study, we found that apple ALMT genes 2
can be divided into three families and the Ma1 gene belongs to the ALMTII family. Duplication of ALMTII 3
genes in apple is related to the polyploid origin of the apple genome. Divergence in expression has 4
occurred between the Ma1 gene and its homologues in the ALMTII family and only the Ma1 gene is 5
significantly associated with malic acid content. The Ma locus consists of two alleles Ma1 and ma1. Ma1 6
resides in the tonoplast, and its ectopic expression in yeast was found to significantly increase the influx of 7
malic acid into yeast cells, suggesting it may function as a vacuolar malate channel. In contrast, ma1 8
encodes a truncated protein due to a single nucleotide substitution of G with A in the last exon. As this 9
truncated protein resides within the cell membrane, it is deemed non-functional as a vacuolar malate 10
channel. The frequency of the Ma1Ma1 genotype is very low in apple cultivars, but high in wild relatives, 11
which suggests that apple domestication may be accompanied by selection at the Ma1 gene. In addition, 12
variations in malic acid content of mature fruits were also observed between accessions with the same 13
genotype in the Ma locus. This suggests that the Ma gene is not the only genetic determinant of fruit 14
acidity in apple. 15
Key words: Apple, malic acid, domestication, association mapping, gene duplication 16
17
18
Page 2 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
3
Introduction 1
Apple (Malus × domestica Borkh.) is one of the most important fruit trees in temperate regions, and 2
displays a juvenile period of 6 to 10 years or even longer. The apple genome is highly heterozygous due 3
to self-incompatibility. Genetic improvement of apple using conventional breeding methods is hampered 4
by this long juvenility phase. To improve efficiency of apple breeding efforts, it is critical to accurately 5
identify young seedlings possessing desirable horticultural traits early using molecular-marker assisted 6
selection. In the past few decades, several studies have been conducted to identify molecular markers 7
that are closely linked to either major genes or quantitative trait loci (QTL) controlling economically 8
important traits in apple, including plant architecture (Kenis and Keulemans 2007; Segura et al. 2008), 9
disease resistance (Xu and Korban 2002; Belfanti et al. 2004; Le Roux et al. 2010), bud dormancy (Brunel et 10
al. 2002; Dyk et al. 2010), and fruit quality (Espley et al. 2009; Mellidou et al. 2012; Longhi et al. 2012 , 11
2013; Khan et al. 2012, Zhang et al 2012). 12
Genetic mapping in apple, as well as in other fruit trees, is usually undertaken using experimental 13
populations derived from bi-parental crosses. However, establishing and maintaining bi-parental crosses 14
and progenies of fruit trees are very difficult and costly. In addition, genetic resolution of QTL maps is 15
often limited due to occurrence of only limited numbers of meiotic events within a bi-parental mapping 16
population (Flint-Garcia et al. 2003). These negative factors together with the long juvenility phase 17
significantly contribute to low efficiency of linkage mapping, particularly for fruit quality traits. As a 18
result, despite numerous linkages mapping studies conducted for a variety of fruit tree species, only few 19
genes have been cloned using linkage-based mapping (Khan and Korban 2012). 20
Recently, an alternative approach, known as association mapping, has become increasingly popular 21
in pursuing genetic mapping studies in plants as it has demonstrated promise in overcoming the 22
limitations of linkage mapping (Rafalski, 2010). Association mapping, also referred to as linkage 23
disequilibrium (LD) mapping, identifies genetic polymorphisms that are significantly associated with 24
phenotypic variation. Compared to QTL mapping, association mapping offers several advantages, 25
including: (1) reduced time frame due to the use of an existing natural population instead of a progeny 26
resulting from a bi-parental cross; (2) availability of greater allele numbers; and (3) higher mapping 27
resolution as it exploits historical and evolutionary recombination events at the population level 28
(Flint-Garcia et al. 2005; Yu and Buckler 2006). Association analysis is divided into genome wide 29
association study (GWAS) and candidate gene association analysis. In GWAS, a high density of single 30
Page 3 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
4
nucleotide polymorphisms (SNPs) across the genome is often used to test potential loci associated with 1
phenotypic variation. Candidate gene association analysis relies on developing molecular tags of potential 2
candidate genes for detecting their association with phenotypic variation. 3
To date, association mapping has been successfully used to identify candidate genes responsible for 4
complex traits of interest in a variety of plants such as maize, pine, and Arabidopsis (Thornsberry et al. 5
2001; Gonzalez-Martinez et al. 2007; Ehrenreich et al. 2009; Li et al. 2013). With the release of whole 6
genome sequences of several fruit species including grape (Jaillon et al. 2007), apple (Velasco et al. 2010), 7
peach (Verde et al. 2013), pear (Wu et al. 2013), and citrus (Xu et al. 2013), several commercial SNP arrays 8
have been developed (Chagné et al. 2012; Khan et al. 2012; Myles et al. 2010; Verde et al. 2012), paving 9
the way for pursuing association mapping studies. For example, association mapping has recently been 10
reported in apple (Kumar et al. 2012, 2013). In addition, association mapping combined with QTL 11
mapping has also been conducted in both apple and grapevine (Fournier-Level et al. 2010; Dunemann at al. 12
2012; Khan et al. 2013; Longhi et al. 2013). 13
Fruit quality is a highly desirable trait in fruit breeding efforts. Organic acids together with sugars 14
and aromas contribute to the taste of fresh apple fruits. In mature apple fruits, malic acid is the 15
predominant organic acid, accounting for about 90% of the total organic acid content (Wu et al. 2007; 16
Zhang et al 2012). To date, several studies have been carried out to identify QTLs responsible for apple 17
fruit acidity, and a major QTL (designated Ma) has been mapped onto the top of linkage group (LG) 16 18
(Maliepaard et al. 1998; Liebhard et al. 2003; Kenis et al. 2008). More recently, a gene encoding 19
aluminum-activated malate transporter (ALMT) was identified as a potential candidate for the Ma locus 20
(Bai et al. 2012; Khan et al. 2013). Apple is a diploid, but has a polyploidy origin (Velasco et al. 2010). It 21
is unclear whether the homologues of the Ma1 gene are also responsible for fruit acidity although several 22
QTLs other than the Ma locus have been reported in apple (Liebhard et al. 2003; Kenis et al. 2008). The 23
Overall knowledge on genetic basis of the biosynthesis and accumulation of malic acid in apple fruits is still 24
limited. 25
Recently, we initiated the measurement of fruit quality characteristics in apple germplasm, and 26
found that fruit acidity is likely to have undergone selection during apple domestication (Ma et al. 2015). 27
The malic acid database of apple germplasm also provides an opportunity to investigate genetic basis of 28
fruit acidity using association mapping method. Since duplicated genes often share redundant functions 29
(Qian et al. 2010), we investigated the effect of homologues of the Ma1 gene on fruit acidity using 30
Page 4 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
5
candidate gene-based association mapping. The subcellular localization and functionality of the Ma1 gene 1
were also investigated. This study not only provide an insight into the role of the Ma1 gene in 2
determining fruit acidity in apple, but it also demonstrates that candidate gene-based association mapping 3
is an efficient strategy for identifying genes involved in complex traits in apple, and perhaps in other fruit 4
species as well. 5
6
Materials and Methods 7
Plant material 8
The apple collection used in this study is maintained at Xingcheng Institute of Pomology of the 9
Chinese Academy of Agricultural Sciences, Xingcheng, Liaoning, China. The collection consists of 352 10
cultivars and/or accessions that were reported in our previous study (Ma et al. 2015) and one Chinese wild 11
accession Lenghaitang (Additional data 1). 12
For the accession Lenghaitang, leaf sample was collected in the spring season and fruit sample was 13
collected at mature stage in 2010. The maturity was assessed by checking the colour of the peel and a 14
confirmation of the seed colour changing to brown. Each accession had three replicates, consisting of ten 15
fruits. Fruit skins were peeled off and then the flesh was cut into small sections. Both fruit and leaf 16
samples of ‘Lenghaitang’ were immediately frozen in liquid nitrogen, and then stored at -75 °C until use. 17
Leaf samples of the rest 352 accessions were the same as described in our previous study (Ma et al. 2015). 18
19
Measurements of malic acid contents in apple fruits 20
The measurement of malic acid content was only conducted for the accession Lenghaitang according 21
to our previous protocol (Ma et al. 2015). Briefly, five grams of frozen fruit were ground into powder in 22
liquid nitrogen, homogenized with 20 mL ddH2O, and incubated at room temperature for 5 min. The 23
mixture was then centrifuged at 12,000 rpm for 15 min at 4 °C. The supernatant was filtered successively 24
through solid phase extraction (Waters Spe-pak C18 Vac cartridges) and 0.22 μM millipore membrane. The 25
filtered fluid was used for measurement of malic acid content using high performance liquid 26
chromatography (HPLC), and the standard malic acid was purchased from Sigma (St. Louis, MO, USA). The 27
malic acid contents of the rest 352 accessions were measured in our previous study (Ma et al. 2015). 28
29
Development of molecular tags of ALMT genes and their utility in genotyping of individuals in an apple 30
Page 5 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
6
collection 1
Genomic DNA sequences of ALMT genes were retrieved from the draft genome of cv. Golden 2
Delicious (Velasco et al. 2010) to design and develop SSR and SNP markers. SSRs were identified using 3
the program of Perfect Microsatellite Repeat Finder 4
(http://sgdp.iop.kcl.ac.uk/nikammar/repeatfinder.html), and primers flanking SSR loci were designed 5
using Primer 3.0. In addition, a SNP in the coding region of Ma1 was used to develop a cleaved amplified 6
polymorphic sequence (CAPS) marker, and the corresponding primer sequences are as follows: 7
5’-AGAGTTGGTGTGGAATGTGC-3’ and 5’-TTGCCTTCTCACTCAGCTCT-3’. Amplification was performed using 8
the following program: 95 °C 5 min, 35 cycles of 30 s at 95 °C, 30 s at 60 °C, 45 s at 72 °C, along with a final 9
extension of 10 min at 72 °C. 10
A total of 5 μL of PCR products was mixed with an equal volume of loading buffer (98% formamide, 11
0.05% bromophenol blue, 0.05% xylene cyanol, and 10 mM NaOH), denatured at 95 °C for 5 min, and then 12
immediately chilled on ice. Denatured PCR products were resolved on 8% denaturing polyacrylamide gel 13
in 1X Tris-acetate-EDTA (TAE) buffer (40 mM Tris-acetate, 20 mM acetic acid, and 1 mM EDTA). 14
Electrophoresis was carried out at 1200 V for 40 min. Bands were visualized using silver staining, and 15
their sizes were estimated using a 25 bp DNA ladder standard (Promega, Madison, WI). 16
17
Analysis of population structure and marker-trait association 18
Seventeen SSR markers distributed along different chromosomes (Table S1) were randomly selected 19
to genotype individuals using polyacrylamide gel electrophoresis, as described above. Genotyping data 20
were collected to estimate the most likely K value (Q matrix) using STRUCTURE 2.2 software (Pritchard et 21
al. 2000). Genetic stratification was analyzed using a variant of the Markov chain Monte Carlo (MCMC) 22
algorithm. Patterns of genetic structure were calculated in multiple runs, wherein each run consisted of 23
500,000 iterations with a burn-in period of 50,000. The parameter K was set from 1 to 15, and 10 24
replicates were performed for each value of K. A modal value of ΔK was used to assess the most likely K 25
corresponding to Q matrix (Evanno et al. 2005). An individual was classified into a cluster when the 26
assignment probability was greater than 0.75. In addition, a microsatellite analyzer (MSA) software was 27
used to estimate relative kinship (K matrix) (Dieringer and Schlštterer 2003). 28
A total of 353 individuals were used to detect association between molecular markers and malic acid 29
content. Association analysis was performed with the software package TASSEL 3.0 using a mixed linear 30
Page 6 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
7
model (MLM) (Yu et al. 2006; Bradbury et al. 2007). This method utilized both the population structure Q 1
matrix and a K matrix. The critical p value for analyzing the significance of molecular markers was 2
calculated based on the false discovery rate (FDR) for malic acid content (Benjamini and Hochberg 1995; 3
Benjamini and Yekutieli 2001), which was found to be highly stringent. The criterion for marker-trait 4
association was set at P ≤0.005. In addition, the haplotype association test was performed using the PLINK 5
software, as described by Purcell et al. (2007), and the criterion for haplotype-trait association was set at 6
P ≤0.01. 7
8
RNA isolation and quantitative real-time PCR (qRT-PCR) 9
Eight cultivars, i.e. Aifeng, Belle de Boskoop, Zhumary, Jincui, Xinnongjin, McIntosh Red, Annuo, and 10
Yanhongmei, were selected for qRT-PCR assay and their fruit samples collected at 30, 60, and 90 days after 11
full bloom (DAFB). Each accession had three replicates, consisting of five fruits. Fruit flesh tissues of each 12
replicate were cut into small sections, mixed, and then subjected to total RNA extraction. Total RNA 13
extraction was performed using RNAprep pure Plant kit (Tiangen, Beijing, China) according to the 14
manufacturer’s instructions, and then adjusted to 500 ng/μL using NanoDrop™ Lite Spectrophotometer. 15
RNA extractions were treated with DNase I (Takara, Dalian, China) to remove any contamination of 16
genomic DNA. Approximately, 3 μg total RNA per sample was used for cDNA synthesis using TransScript 17
One-Step gDNA Renoval and cDNA Synthesis SuperMix (TRANS, Beijing, China) according to 18
manufacturer's instructions. A pair of primers 5’-GACTTGGGCTTCAACAGCTC-3’/5’-TTTTCGAGGATCCGAATGAC-3’ 19
was designed to investigate expression profiles of Ma1, a candidate gene of the Ma locus on LG16 of 20
apple. qRT-PCR was carried out in a total volume of 20 µL reaction mixture containing 10.0 µL of 2× SYBR 21
Green I Master Mix (Takara, Dalian, China), 0.2 µM of each primer, and 100 ng of template cDNA. 22
Amplifications were performed using Applied Biosystems® 7500 Real-Time PCR Systems (Applied 23
Biosystems, USA). The amplification program consisted of 1 cycle of 95 °C for 3 min, followed by 40 cycles 24
of 95 °C for 30 sec, and 60 °C for 34 sec. The fluorescent product was detected at the second step of each 25
cycle. Melting curve analysis was performed at the end of 40 cycles to ensure proper amplification of 26
target fragments. An actin gene was used as a constitutive control along with the following primer 27
sequences: 5’-TGACCGAATGAGCAAGGAAATTACT-3’ and 5’-TACTCAGCTTTGGCAATCCACATC-3’. All analyses were 28
repeated three times using biological replicates. 29
30
Page 7 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
8
Subcellular localization using a green fluorescent protein (GFP) 1
Subcellular localization of two alleles Ma1 and ma1 were conducted. The allele ma1 contains an 2
early stop codon due to a single nucleotide mutation, and its coding region is 252 bp shorter in length 3
compared to that of the allele Ma1. Therefore, two pairs of primers, 4
5’-GCGTCATGAATGGCGGCCAAAATCGGGTCC-3’/5’-CGTTCTTCAACCGCAAACTC-3’ and 5
5’-GCGTCATGAATGGCGGCCAAAATCGGGTCC-3’/5’-CTGATATTGGTCGTTTTAAAAGAC-3’, were designed to amplify the 6
coding regions of Ma1 and ma1, respectively, using cDNA templates from leaf tissues of apple cv. Golden 7
Delicious. 8
PCR products were purified, inserted into the TA cloning vector pEASY-T1, and inserts were validated 9
by direct sequencing. Plasmid DNAs were extracted and digested with two restriction enzymes, BspHI 10
and XbaI. DNA fragments were recovered and inserted into the expression vector pCAMBIA1302::GFP 11
under the control of cauliflower mosaic virus (CaMV) 35S promoter, thus generating constitutively 12
expressed fusion proteins Ma1-GFP and ma1-GFP. The two constructs were individually transformed 13
into epidermal cells of onion (Allium cepa) using a modified Agrobacterium infection method as described 14
by Sun et al. (2007). GFP-dependent fluorescence was detected at 12-36 h following transfection using a 15
D-FL EPI-Fluoresecence Attachment Instructions (Nikon, Japan). 16
17
Construction of yeast expression vectors 18
A pair of primers, 5’-CTCTTGTACGCTGTTCCGCT-3’/5’-ATGAGATCCAACCTTGGCC-3’, was designed to amplify 19
cDNA fragments of Ma1 using cDNA templates from fruits of cv. Golden Delicious. cDNA fragments were 20
cloned into the pEASY-T1 vector, and plasmid DNAs were used as PCR templates to generate full coding 21
sequences of Ma1 allele using a pair of primers 22
5’-GCGTGATCAATGGCGGCCAAAATCGGGTCC-3’/5’-GCGGCGGCCGCTTAGTTCTTCAACCGCAAAC-3’. The full coding 23
sequences were also cloned into the pEASY-T1 vector. Plasmid DNAs were then extracted, digested with 24
BclI and NotI, and ligated into a BamHI/NotI-digested mediated vector. DNA fragments covering the 25
TDH3 promoter, full coding regions of either Ma1 or ma1 alleles, and terminator sequences were then 26
amplified with a pair of primers 27
5’-GGTACCCTGCCATTTCAAAGAATACGTAAATAA-3’/5’-GTCGACATCATGTAATTAGTTATGTCACGCT-3’, and subsequently 28
inserted into the pEASY-T1 vector. 29
The cloned vector was digested with two enzymes, KpnI and XhoI. DNA fragments were retrieved, 30
Page 8 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
9
and introduced into the KpnI/SalI site of an integrative yeast expression vector pRS406, generating a 1
construct pRS406-Ma1. The integration vectors pRS406 and pRS406-Ma1 were linearized with the 2
restriction enzyme StuI, and then integrated into the ura3-52 locus of Saccharomyces cerevisiae WAT11 3
strain to generate yeast strains WAT11/pRS406 and WAT11/pRS406-Ma1, based on the PEG/LiAc method 4
(Gietz and Woods 2002). 5
6
Yeast collection and malic acid content measurements 7
Yeast strains were inoculated into 250 mL YPDA liquid medium, and incubated overnight at 30°C with 8
shaking at 250 rpm. Yeast cells were collected by centrifugation, and resuspended in 5 mL ddH2O. One mL 9
yeast solution was transferred into a 1.5 mL eppendorf tube, and 1.0 g of acid washed glass beads 10
(425-600 µm, Sigma) was added to mechanically disrupted yeast cells using grinder mill Scientz-48 11
(Zhejiang, China). Cellular fragments were sedimented by centrifugation at 12,000 rpm for 30 min at 4°C, 12
and the supernatant was collected and transferred to a new 1.5 mL Eppendorf tube. The supernatant was 13
subsequently filtered through solid phase extraction (Waters Spe-pak C18 Vac cartridges) and 0.22 μM 14
millipore membrane. The filtered fluid was used for measurement of malic acid content using HPLC. All 15
analyses were repeated three times using biological replicates. One-way analysis of variance (ANOVA) was 16
conducted using SPSS statistics 17.0 software (SPSS Inc. Released 2008. SPSS Statistics for Windows, 17
Version 17.0. Chicago: SPSS Inc). 18
19
Results 20
Genes encoding aluminum-activated malate transporter in the apple genome 21
The DNA coding sequence of ALMT family in Arabidopsis (Kovermann et al. 2007) was used as a 22
query sequence to identify homologous genes by Blast Searches of the reference genome sequence of 23
apple (Velasco et al. 2010). A total of 22 homologues were identified on chromosomes 1, 2, 3, 6, 11, 13, 14, 24
15, and 16, or among the unanchored sequences (Fig. 1a). Phylogenetic analysis revealed that these 25
homologues are grouped into three gene families, designated ALMTI, ALMTII and ALMTIII (Fig. 1b). The 26
ALMTI family includes eight homologues (MDP0000160829, MDP0000553830, MDP0000226551, 27
MDP0000873233, MDP0000227375, MDP0000851070, MDP0000186135, and MDP0000269351), whereas 28
the ALMTIII family includes six homologues (MDP0000392887, MDP0000261132, MDP0000315809, 29
MDP0000255420, MDP0000144432, and MDP0000230322). The ALMTII family consists of eight 30
Page 9 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
10
homologues (MDP0000791995, MDP0000259751, MDP0000312421, MDP0000252114, MDP0000768640, 1
MDP0000244249, MDP0000299697, and MDP0000290997), one of which (MDP0000252114) was 2
designated Ma1 gene in previous report (Bai et al. 2012). Thus, the Ma1 gene and its closely related 3
homologues in the ALMTII family were chosen to investigate their association with fruit acidity in apple. 4
Additionally, five more ALMT like genes (MDP0000217912, MDP0000241893, MDP0000315808, 5
MDP0000271850, and MDP0000196725), besides the 22 ALMT genes, were also reported in the apple 6
genome (Khan et al. 2013). However, MDP0000241893, MDP0000315808, and MDP0000271850 are 7
almost identical in DNA sequence to MDP0000261132, MDP0000255420, and MDP0000791995, 8
respectively. MDP0000196725 has a very short open reading frame (468 bp in size), and are supposed to 9
be a pseudogene. MDP0000217912 has low levels of sequence identity to all the 22 ALMT genes, and was 10
separated into an outgroup (data not shown). Thus, these additional five ALMT like genes were excluded 11
from the ALMT gene family in this study. 12
13
Association between ALMTII genes and fruit acidity in apple 14
Molecular tags, including 7 SSRs and 1 CAPS, were developed for the eight apple ALMTII genes (Table 15
1). These gene tags were subsequently used to screen a collection of 353 Malus accessions, and 16
genotyping data were collected for association mapping analysis (additional data 1). The malic acid 17
content in mature fruits ranged from 0.53 to 22.74 mg/g FW among the apple collection, with an average 18
of 4.85 mg/g FW. To avoid false associations between markers and traits of interest, the genetic structure 19
of the apple collection was evaluated. The STRUCTURE results indicated a peak ΔK value of K=3 (Fig. S1). 20
This suggested that there were three genetically distinct clusters present in this collection. Thus, the 21
corresponding Q and Kinship matrices were calculated, and used in the TASSEL 3.0 software. Candidate 22
gene-based association mapping was performed using a mixed linear model (MLM) method. As a result, 23
only the Ma1 gene (MDP0000252114) was significantly associated with malic acid content (P=6.07E-04), 24
accounting for ~7.58% of the observed phenotypic variation (Table 1). Molecular tag of the Ma1 gene 25
(WBG90, 1.4 kb downstream of the stop codon) revealed three genotypes, (TC)17(TC)17, (TC)20(TC)17, and 26
(TC)20(TC)20, in the apple collection (Fig. 2a, 2b-I). Mean values of malic acid contents in mature fruits 27
displayed the following pattern among the three genotypes: (TC)17(TC)17 > (TC)20(TC)17 > (TC)20(TC)20 (Fig. 3). 28
In contrast, no significant associations were observed between the homologues of the Ma1 gene and 29
malic acid contents (Table 1, P≥0.14). 30
Page 10 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
11
Since an A/G SNP 252 bp upstream of the stop codon of the Ma1 gene has been reported in previous 1
study (Bai et al. 2012), we also developed a CAPS marker to detect the A/G polymorphism (Fig. 2b-II). This 2
CAPS marker revealed three genotypes, designated AA, AG, and GG, in the apple collection. The allele ‘A’ 3
contained a GACC sequence resulting in a premature stop codon, while the ‘G’ allele containing a GGCC 4
sequence had a complete coding sequence (the same hereinafter). Mean values of malic acid contents in 5
mature fruits showed significant differences among the three genotypes and displayed the following 6
patterns: GG > AG > AA (Fig. 3). 7
The combination of a SNP locus and an SSR motif were evaluated, and four haplotypes, designated 8
HapA through HapD, were identified (Table 2). All haplotypes, except for HapC were significantly 9
associated with malic acid content. HapA and HapD showed the highest effects on malic acid content in 10
apple fruits. 11
12
Expression profiling of ALMTII genes in apple fruits at different stages 13
RT-PCR was initially conducted to investigate the expression profiles of ALMTII genes in fruits of four 14
cultivars, Yanhongmei, Belle de Boskoop, Zhumary, and Aifeng (Fig. 4). These four cultivars have similar 15
ripening periods, with maturation occurring about 90 days after full bloom, but show a great variation in 16
malic acid content. Of the eight ALMTII genes, the Ma1 gene was expressed during fruit development in all 17
tested cultivars. However, six ALMTII genes (MDP0000312421, MDP0000791995, MDP0000244249, 18
MDP0000259751, MDP0000290997, and MDP0000768640) were not expressed in all tested cultivars. One 19
ALMT gene (MDP0000299697) was weakly expressed during early stage of fruit development. 20
Furthermore, we evaluated transcript levels of the Ma1 gene using qRT-PCR. As mentioned above, 21
three genotypes were detected using a CAPS marker developed from the A/G polymorphic site present in 22
the last exon of the Ma1 gene. Thus, fruits of eight cultivars representing different genotypes were 23
selected for gene expression analysis. Transcript accumulation of the Ma1 gene was detected in fruits at 24
different developmental stages and in all genotypes, but showed different patterns during fruit 25
development (Fig. 5). For example, during fruit development, increased transcript accumulation was 26
observed in cvs. Xinnongjin, Jincui, and Yanhongmei, while slight reduction in transcript levels was 27
detected in cvs. Belle de Boskoop, Annuo, and McIntosh Red. In contrast, transcript levels were relatively 28
stable during fruit development in cvs. Aifeng and Zhumary. 29
Finally, we evaluated the relationship between transcript levels of Ma1 and malic acid content. 30
Page 11 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
12
Although transcript levels for cvs. Aifeng and Zhumary were quite similar, malic acid contents were 1
significantly different. Similarly, transcript accumulation patterns were very similar between cvs. 2
Xinnongjin and Yanhongmei, while malic acid contents of cv. Yanhongmei were significantly higher than 3
those of cv. Xinnongjin. Furthermore, transcript levels of the Ma1 gene in fruits of cv. Aifeng were slightly 4
higher than those of cv. Belle de Boskoop, yet malic acid content of cv. Aifeng was lower than that of cv. 5
Belle de Boskoop. 6
7
Subcellular localization of Ma1 and its functional analysis by ectopic expression in yeast 8
The TMpred program (http://www.ch.embnet.org/software/TMPRED_form.html) predicted that Ma1 9
consists of seven transmembrane domains (TMα1 to TMα7), and a single nucleotide mutation that results 10
in a premature stop codon located between TMα6 and TMα7 (Fig. 2c). To investigate effects of single 11
nucleotide mutation on gene functionality, subcellular localization analysis of two alleles of the Ma1 gene 12
was conducted. Allele containing a GACC sequence was designated as ma1, while the other containing a 13
GGCC sequence was designated as Ma1. Two constructs, 35S: Ma1-GFP and 35S: ma1-GFP, were 14
transiently transformed into onion epidermal cells using agro-infection (Fig. 6). It was found that the 15
Ma1-GFP fusion protein resided in the tonoplast, as the nucleus was surrounded with fluorescence, while 16
the ma1-GFP fusion protein was targeted to the cell membrane. This finding indicates that the C-terminal 17
amino acid sequence of Ma1 plays an important role in directing the protein to a specific subcellular site. 18
The Ma1 gene is closely related to AtALMT9 in Arabidopsis (Fig. 1b). AtALMT9 serves as a vacuolar 19
malate channel in Arabidopsis (Kovermann et al. 2007). Thus, the functionality of the Ma1 gene was 20
investigated via ectopic expression in yeast. When yeast lines were incubated in a standard YPDA liquid 21
medium, no differences in malic acid contents were observed between transgenic lines expressing Ma1 or 22
ma1 and control yeast lines carrying an empty vector (Table 3). However, in the presence of 0.02 g/mL 23
malic acid added to the YPDA liquid medium, malic acid contents increased significantly in transgenic lines 24
expressing Ma1 when compared to control lines carrying ma1 or the empty vector. However, no 25
significant difference in malic acid contents was detected between transgenic lines expressing ma1 and 26
control lines carrying the empty vector. This result suggests that Ma1, similar to AtALMT9, may also 27
function as a channel to increase the influx of malic acid, but ma1 is likely nonfunctional. In other words, 28
Ma1 is potentially associated with accumulation of malic acid in apple fruits. 29
30
Page 12 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
13
Discussion 1
Duplication of ALMTII genes in apple and their divergence in expression 2
Gene duplication, which can arise from polyploidization and/or segmental duplication, is a major 3
driving force for recruitment of genes in plants. Apple is a diploid, with an autopolyploidization origin (Han 4
et al., 2011; Velasco et al. 2010). In this study, eight ALMTII genes have been identified in the apple 5
genome. Of these ALMTII genes, seven are located on chromosomes 1, 6, 13, 14, and 16, and one is 6
among the unanchored sequences. Chromosomes 6 and 14 are homologous pairs (Han et al., 2011; 7
Velasco et al. 2010) and both contain an ALMTII gene on the bottom chromosome. Similarly, 8
chromosomes 13 and 16 are also homologous pairs and both contain a cluster of two ALMTII genes on the 9
top chromosome. These results suggest that duplication of ALMTII genes in apple is related to whole 10
genome duplication during the process of speciation, and the tandem duplication on chromosomes 13 11
and 16 is likely to have occurred in the ancestor of domesticated apples. Since chromosomes 1 and 7 are 12
known to be homologous pairs (Han et al., 2011; Velasco et al. 2010), the ALMTII gene among unanchored 13
sequences (MDP0000791995) is likely located on chromosome 7. 14
Most of polyploid-derived duplicated genes show a rapid divergence in expression (Blanc and Wolfe 15
2004). This phenomenon is also observed for the apple ALMTII genes. For example, two duplicate gene 16
pairs, Ma1 /MDP0000312421 and MDP0000299697/MDP0000290997, have acquired divergent expression 17
patterns. The Ma1 gene is expressed in fruit during the entire process of fruit development, while 18
MDP0000312421 is not expressed in fruit. Similarly, MDP0000299697 is weakly expressed in fruit at early 19
stage of fruit development, whereas MDP0000290997 is not expressed in fruit. However, it is unclear 20
whether or not the two duplicate gene pairs, MDP0000259751/MDP0000791995 and 21
MDP0000768640/MDP0000244249 diverge in expression although none of them showed expression in 22
fruit. 23
In summary, duplication of ALMTII genes in apple is related to the polyploid origin of the apple 24
genome. Divergence in expression has occurred between the Ma1 gene and its closely related 25
homologues, which is consistent with our findings above that only the Ma1 gene out of the ALMTII family 26
is associated with fruit acidity in apple. 27
28
Characterization of the Ma1 gene and complexity of the genetic basis of fruit acidity in apple 29
Our study indicates that the Ma1 gene is significantly associated with fruit acidity in an apple 30
Page 13 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
14
collection, which confirms previous studies (Bai et al. 2012; Khan et al. 2013). To gain further insight into 1
the role of Ma1 in determining fruit acidity, the functionality of this gene was investigated. Subcellular 2
localization has indicated that Ma1 resides in the tonoplast, and ectopic expression in yeast has further 3
demonstrated its potential functionality as a vacuolar malate channel. Ma1 consists of a TMD that is 4
formed by seven putative transmembrane α-helices, and the last putative transmembrane α-helix at the C 5
terminus seems to be essential for directing it to a specific subcellular compartment or membrane. A 6
single nucleotide substitution of G with A in the last exon results in a premature stop codon, which 7
contributes to a truncated protein lacking the last putative transmembrane α-helix. The truncated protein 8
resides in the cell membrane instead of the tonoplast, thus, losing the functionality as a vacuolar malate 9
channel. As malic acid is stored in the vacuole, the single nucleotide mutation must have an influence on 10
accumulation of malic acid in fruit tissues. Besides the A/G SNP locus, seven other SNPs in coding regions 11
were also identified in eight cultivars, which were subjected to qRT-PCR analysis (data not shown). More 12
studies are needed to address if these SNPs have an effect on gene function. 13
Linkage mapping studies have shown that the Ma locus on LG16 can explain 22.8-36% of the 14
observed variation in fruit acidity (Kenis et al. 2008; Liebhard et al. 2003). However, our association 15
mapping study indicates that the Ma1 gene accounts for ~ 7.46% of the observed phenotypic variation for 16
malic acid content in mature fruits. This result strongly suggests that the Ma locus on LG16 is not the only 17
determinant of fruit acidity in apple. Moreover, it is revealed that apple cv. Belle de Boskoop contains two 18
homozygous alleles ma1ma1; however, it has lower pH values in fruits at different stages of development. 19
In addition to the Ma1 gene, a major QTL for fruit acidity was detected on linkage group 8 (Liebhard et al. 20
2003; Zhang et al. 2012). These results suggest the likely presence of major gene(s), other than the Ma1 21
gene, in the apple genome that are also responsible for fruit acidity. The apple accessions used in this 22
study, such as cv. Belle de Boskoop, are highly valuable for identifying these additional major genes 23
responsible for apple fruit acidity. 24
In earlier studies, it was reported that Ma1 has a strong additive effect in increasing fruit acidity, and 25
it is incompletely dominant over ma1 (Xu et al. 2012), while other studies have indicated that Ma1 is 26
completely dominant over ma1 (Iwanami et al. 2012). Based on mean values of malic acid contents in 27
mature fruits of the apple collection used in this study, it is revealed that the following allelic dominance 28
effects are observed, wherein Ma1Ma1 > Ma1ma1> ma1ma1. Thus, it seems that Ma1 is incompletely 29
dominant over ma1. 30
Page 14 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
15
1
Variability in levels of fruit acidity within a collection and its implications to domestication of apple 2
Findings of this study not only confirms malic acid as the predominant organic acid in mature apple 3
fruits (Wu et al. 2007), but that wide variations in malic acid content of mature fruits are also observed 4
between cultivars from different geographic regions. Apple cultivars originating from the European 5
countries such as Germany, England, and France produce acidic fruits with average malic acid contents of 6
7.20, 5.80, and 5.42 mg/g FW in mature fruits, respectively. In contrast, apple cultivars from Asian 7
countries such as China, Japan, and North Korea produce sweet fruits, with average malic acid contents of 8
3.32, 3.15, and 2.95 mg/g FW in mature fruits, respectively. These observed differences may be attributed 9
to the following reasons. First, people in Asian countries prefer sweet and low-acid apples, while 10
Europeans prefer more acidic and/or tart apples. Second, cider is a popular drink in Europe and many 11
apple cultivars have been developed for cider production. Levels of malic acid are very critical in cider 12
processing as it plays an important role in maintaining a consistently sharp and crispy taste 13
(Pereira-Lorenzo et al. 2009). In general, fruits of apple cultivars used for cider processing contain high 14
levels of malic acid contents. Thus, these findings suggest that fruit acidity is an important determinant for 15
fresh consumption as well as processing in apple breeding programs (Crisosto et al 2002; Juniper and 16
Mabberley 2006; Wen et al. 2013; Khan et al. 2014). 17
As mentioned above, the Ma1 gene plays an important role in determining fruit acidity, and three 18
genotypes, AA (ma1ma1), AG (Ma1 ma1), and GG (Ma1Ma1), are identified in the apple collection based 19
on the A/G SNP locus of the Ma1 gene. Most of wild apple relatives have GG genotype (58%), followed by 20
AG (27%) and AA (15%) genotypes (Fig. 7a). In contrast, most apple cultivars have AA genotype (58%), 21
followed by AG (38%) and GG (4%) genotypes. On average, mature fruits of domesticated cultivars have 22
lower malic acid contents compared to the wild relatives (Fig. 7b). These results suggest that apple 23
domestication may be accompanied by selection of the Ma1 gene that plays an important role in 24
determining fruit acidity. In addition, it is worthy to note that the observed higher levels of fruit acidity in 25
wild species may be related to natural selection as higher levels of malic acid could contribute to fruit 26
survival by circumventing damage resulting from both biotic and abiotic stresses such as pests, pathogens, 27
heat, and drought (Fernie and Martinoia 2009). 28
Variations in malic acid content of mature fruits are also detected between accessions with the same 29
genotype in the Ma locus. For example, among accessions with the same genotype, wild relatives have 30
Page 15 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
16
generally the highest value of malic acid content in mature fruits, followed by European cultivars and Asian 1
cultivars (Fig. 7b). Asian cultivars have higher frequency of GG genotype (8%) when compared with 2
cultivars from Europe (2%) or other countries (3%). Among accessions with the same GG genotype, 3
however, Asian cultivars have lower value of malic acid content in mature fruits than cultivars from Europe 4
and other countries (Fig. 7b). These results further confirm our findings above that gene(s) other than 5
Ma1 are also responsible for fruit acidity in apple. 6
Apple was first domesticated in the Tian Shan Mountains of central Asia, wherein the wild species M. 7
sieversii is known to be widely distributed (Dzhangaliev 2003). Following sequencing of the apple genome, 8
M. sieversii has been identified as the main contributor to the genome of the cultivated apple (Velasco et 9
al. 2010). Thus, it is reasonable to speculate that sweet apples were initially developed in Central Asia, and 10
then introduced to Europe along the famous Silk Road. Hybridization between M. sylvestris and sweet 11
apples has been undertaken to develop cultivars for cider production and/or fresh fruit consumption 12
(Wagner and Weeden 2000). 13
14
Relationship between the origin of the cultivated apple and the population structure 15
As mentioned above, the initial progenitor of the cultivated apple, M. × domestica, was initially 16
domesticated from M. sieversii (Dzhangaliev 2003). Subsequently, the introgression of wild species into 17
the cultivated apple must have occurred as a distinct event during domestication. In East Asia, interspecific 18
hybridization between M. sieversii and another wild species, M. baccata, contributed to the development 19
of a new hybrid species, M. × asiatica, which has been grown as a local landrace since ancient time 20
(Pereira-Lorenzo et al. 2009). In Europe, interspecific hybridization between the initial progenitor of M. × 21
domestica and the European crabapple M. sylvestris has been reported to be an important event in the 22
evolution of domesticated apple cultivars that are more closely related to M. sylvestris than to M. sieversii 23
(Richards et al. 2008; Cornille et al. 2012). In short, M. sieversii and M. sylvestris are the two predominant 24
contributors to the M. × domestica gene pool (Velasco et al. 2010; Cornille et al. 2012), and the 25
introgression of other wild species such as M. baccata into the domesticated apple also contributes to the 26
diversity of apple. 27
The apple accessions used in this study predominantly originated from Europe and Asia, and they are 28
clustered into three groups based on the result of population structural analysis (Fig. S2). All accessions, 29
except Hongroupingguo (Malus sieversii) and Tukumanpingguo(Malus sieversii) , in the second group 30
Page 16 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
17
(marked with predominant green color) are cultivars. The first group (marked with predominant red color) 1
is composed of cultivars and most wild species such as M. baccata. The third group (marked with 2
predominant blue color) consists of cultivars and several wild species such as Malus sylvestris and Malus 3
prunifolia (Willd.) Borkh. Thus, our finding suggests that these apple collections are divided into three 4
groups based on different origin and evolution of the domesticated apple. 5
6
7
Acknowledgements 8
This work was supported by the National Natural Science Foundation of China (Grant Nos. 31420103914 9
and 31372048) and the National Basic Research Program of China (Grant No. 2011CB100600). 10
11
12
Page 17 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
18
References 1
Bai Y, Dougherty L, Li M, Fazio G, Cheng L, Xu K ( 2012) A natural mutation-led truncation in one of the two 2
aluminum-activated malate transporter-like genes at the Ma locus is associated with low fruit acidity in 3
apple. Mol Genet Genomics 287: 663-678 4
Belfanti E, Silfverberg-Dilworth E, Tartarini S, Patocchi A, Barbieri M, Zhu J, Vinatzer BA, Gianfranceschi L, 5
Gessler C, Sansavini S (2004) The HcrVf2 gene from a wild apple confers scab resistance to a transgenic 6
cultivated variety. Proc Natl Acad Sci USA 101: 886-890 7
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate:a practical and powerful approach to 8
multiple testing. J R Statist Soc B 57: 289–300 9
Benjamini Y, Yekutieli D. (2001). The control of the false discovery rate in multiple testing under 10
dependency. Ann Stat 29: 1165-1188 11
Blanc G, Wolfe KH (2004) Functional divergence of duplicated genes formed by polyploidy during 12
Arabidopsis evolution. Plant Cell 16: 1679-1691 13
Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES (2007) TASSEL: software for 14
association mapping of complex traits in diverse samples. Bioinformatics (Oxford, England) 23: 2633–2635 15
Brunel N, Leduc N, Poupard P, Simoneau P, Mauget JC, Viémont JD (2002) KNAP2, a class I KN1-like gene is 16
a negative marker of bud growth potential in apple trees (Malus domestica [L.] Borkh.). J Exp Bot 53: 17
2143-2149 18
Chagné D, Crowhurst RN, Troggio M, Davey MW, Gilmore B, Lawly C, Vanderzande S, Hellens RP, Kumar S, 19
Cestaro A, Velasco R, Main D, Rees JD, Lezzoni A, Mockler T, Wilhelm L, de Weg EV, Gardiner SE, Bassil N, 20
Peace C (2012) Genome-wide SNP detection, validation, and development of an 8K SNP array for apple. 21
PLoS One 7: e31745 22
Cornille A, Gladieux P, Smulders MJ, Roldán-RuizI, Laurens F, Cam BL, Nersesyan A, Clavel J,Olonova M, 23
Feurence L, Gabrielyan I, Zhang X-G, Tenaillon M I, Giraud T (2012) New insight into the history of 24
domesticated apple: secondary contribution of the European wild apple to the genome of cultivated 25
varieties. PLoS Genet 8: e1002703 26
Crisosto CH, Crisosto GM (2002) Understanding American and Chinese consumer acceptance of ‘Redglobe’ 27
table grapes. Postharvest Biol Tech 24:155-162 28
Dieringer D, Schlštterer C (2003) Microsatellite analyser (MSA): a platform independent analysis tool for 29
Page 18 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
19
large microsatellite data sets. Mol Ecol Notes 3: 167-169 1
Dunemann F, Ulrich D, Malysheva-Otto L, Weber WE, Longhi S, Velasco R, Costa F (2012) Functional allelic 2
diversity of the apple alcohol acyl-transferase gene MdAAT1 associated with fruit ester volatile contents in 3
apple cultivars. Mol Breeding 29:609-625 4
Dyk M-M, Soeker MK, Labuschagne IF, Rees DJG (2010) Identification of a major QTL for time of initial 5
vegetative budbreak in apple (Malus x domestica Borkh.). Tree Genet Genomes 6: 489-502 6
Dzhangaliev AD (2003) The wild apple tree of Kazakhstan. Hortic Rev John Wiley & Sons, Inc pp 63–303 7
Ehrenreich IM, Hanzawa Y, Chou L, Roe JL, Kover PX, Purugganan M-D (2009) Candidate gene association 8
mapping of Arabidopsis flowering time. Genetics 183: 325-335 9
Espley RV, Brendolise C, Chagné D, Kutty-Amma S, Green S, Volz R, Putterill J, Schouten HJ, Gardiner SE, 10
Hellens RP, Allan AC (2009) Multiple repeats of a promoter segment causes transcription factor 11
autoregulation in red apples. Plant Cell 21: 168-183 12
Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software 13
STRUCTURE: a simulation study. Mol Ecol 14: 2611-2620 14
Fernie A-R, Martinoia E (2009) Malate. Jack of all trades or master of a few? Phytochemistry 70: 828-832 15
Flint-Garcia SA, Thornsberry JM, Buckler ES (2003) Structure of linkage disequilibrium in plants. Annu Rev 16
Plant Biol 54: 357-374 17
Flint-Garcia SA, Thuillet AC, Yu J, Pressoir G, Romero SM, Mitchell SE, Doebley J, Kresovich S, Goodman 18
M-M, Buckler ES (2005) Maize association population: a high-resolution platform for quantitative trait 19
locus dissection. Plant J 44: 1054-1064 20
Fournier-Level A, Lacombe T, Le Cunff L, Boursiquot JM, This P (2010) Evolution of the VvMYbA gene family, 21
the major determinant of berry colour in cultivated grapevine (Vitis vinifera L.). Heredity 104: 351-362 22
Gietz RD, Woods RA (2002) Transformation of yeast by lithium acetate/single-stranded carrier 23
DNA/polyethylene glycol method. Methods Enzymol 350:87-96 24
Gonzalez-Martinez SC, Wheeler NC, Ersoz E, Nelson CD, Neale DB (2007) Association genetics in Pinus 25
taeda L. I. Wood property traits. Genetics 175: 399-409 26
Han Y, Zheng D, Vimolmangkang S, Khan MA, Beever JE, Korban SS (2011) Integration of physical and 27
genetic maps in apple confirms whole-genome and segmental duplications in the apple genome. J Exp Bot 28
Page 19 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
20
62: 5117-5130 1
Iwanami H, Moriya S, Kotoda N, Mimida N, Takahashi-Sumiyoshi S, Abe K (2012) Mode of inheritance in 2
fruit acidity in apple analysed with a mixed model of a major gene and polygenes using large complex 3
pedigree. Plant Breeding 131: 322-328 4
Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, 5
Vezzi A, Legeai F, Hugueney P, Dasilva C, Horner D, Mica E, Jublot D, Poulain J, Bruyère C, Billault A, 6
Segurens B, Gouyvenoux M, Ugarte E, Cattonaro F, Anthouard V, Vico V, Del Fabbro C, Alaux M, Di Gaspero 7
G, Dumas V, Felice N, Paillard S, Juman I, Moroldo M, Scalabrin S, Canaguier A, Le Clainche I, Malacrida G, 8
Durand E, Pesole G, Laucou V, Chatelet P, Merdinoglu D, Delledonne M, Pezzotti M, Lecharny A, Scarpelli C, 9
Artiguenave F, Pè ME, Valle G, Morgante M, Caboche M, Adam-Blondon AF, Weissenbach J, Quétier F, 10
Wincker P (2007) The grapevine genome sequence suggests ancestral hexaploidization in major 11
angiosperm phyla. Nature 449: 463-467 12
Juniper BE, Mabberley DJ (2006) The story of the apple. Timber Press Portland, Oregon 13
Kenis K, Keulemans J (2007) Study of tree architecture of apple (Malus × domestica Borkh.) by QTL analysis 14
of growth traits. Mol. Breeding 19: 193-208 15
Kenis K, Keulemans J, Davey MW (2008) Identification and stability of QTLs for fruit quality traits in apple. 16
Tree Genet. Genomes 4: 647-661 17
Khan M-A, Korban SS (2012) Association mapping in forest trees and fruit crops. J Exp Bot 63: 4045-4060 18
Khan MA, Han Y, Zhao Y F, Korban SS (2012) A high-throughput apple SNP genotyping platform using the 19
GoldenGate™ assay. Gene 494: 196-201 20
Khan MA, Zhao YF Korban SS (2013) Identification of genetic loci associated with fire blight resistance in 21
Malus through combined use of QTL and association mapping. Physiol Plant 148: 344-353 22
Khan MA, Olsen KM, Sovero V, Kushad MM, Korban SS (2014) Fruit quality traits have played critical roles 23
in domestication of the apple. Plant Genome doi: 10.3835/plantgenome2014.04.0018 24
Khan SA, Chibon PY, de Vos RC, Schipper BA, Walraven E, Beekwilder J, van Dijk T, Finkers R, Visser RG, van 25
de Weg EW, Bovy A, Cestaro A, Velasco R, Jacobsen E, Schouten HJ (2012) Genetic analysis of metabolites 26
in apple fruits indicates an mQTL hotspot for phenolic compounds on linkage group 16. J Exp Bot 63: 27
2895-2908 28
Khan SA, Beekwilder J, Schaart JG, Mumm R, Soriano JM, Jacobsen E, Schouten HJ (2013) Differences in 29
acidity of apples are probably mainly caused by a malic acid transporter gene on LG16. Tree Genet 30
Page 20 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
21
Genomes 9: 475-487 1
Kovermann P, Meyer S, Hörtensteiner S, Picco C, Scholz-Starke J, Ravera S, Lee Y, Martinoia E (2007) The 2
Arabidopsis vacuolar malate channel is a member of the ALMT family. Plant J 52: 1169-1180 3
Kumar S, Chagné D, Bink MC, Volz RK, Whitworth C, Carlisle C (2012) Genomic selection for fruit quality 4
traits in apple (Malus × domestica Borkh.). PLoS One 7: e36674 5
Kumar S, Garrick DJ, Bink MC, Whitworth C, Chagné D, Volz RK (2013) Novel genomic approaches unravel 6
genetic architecture of complex traits in apple. BMC Genomics 14: 393 7
Le Roux PM, Khan MA, Broggini GA, Duffy B, Gessler C, Patocchi A (2010) Mapping of quantitative trait loci 8
for fire blight resistance in the apple cultivars 'Florina' and 'Nova Easygro'. Genome 53: 710-722 9
Li H, Peng Z, Yang X, Wang W, Fu J, Wang J, Han Y, Chai Y, Guo T, Yang N, Liu J, Warburton M L, Cheng Y, Hao 10
X, Zhang P, Zhao J, Liu Y, Wang G, Li J, Yan J (2013) Genome-wide association study dissects the genetic 11
architecture of oil biosynthesis in maize kernels. Nat Genet 45: 43-50 12
Liebhard R, Kellerhals M, Pfammatter W, Jertmini M, Gessler C (2003) Mapping quantitative physiological 13
traits in apple (Malus × domestica Borkh.). Plant Mol Biol 52: 511-526 14
Longhi S, Moretto M, Viola R, Velasco R, Costa F (2012) Comprehensive QTL mapping survey dissects the 15
complex fruit texture physiology in apple (Malus x domestica Borkh.). J Exp Bot 63: 1107-1121 16
Longhi S, Hamblin MT, Trainotti L, Peace CP, Velasco R, Fabrizio C (2013) A candidate gene based approach 17
validates Md-PG1 as the main responsible for a QTL impacting fruit texture in apple (Malus x domestica 18
Borkh). BMC Plant Biol 13: 37 19
Ma BQ, Chen J, Zheng HY, Fang T, Collins O, Li SH, Han YP, Wu BH.(2015) Comparative assessment of sugar 20
and malic acid composition in cultivated and wild apples. Food Chem 172:86-91. 21
Maliepaard C, Alston FH, van Arkel G, Brown LM, Chevreau E, Dunemann KM, Gaediner S, Guilford P, van 22
Heusden AW, Janse J, Laurens F, Lynn JR, Manganaris AG, den Nijs APM, Periam N, Rikkerink E, Roche P, 23
Ryder C, Sansavini S, Schmidt H, Tartarini S, Verhaegh JJ, Vrielink-van Ginkel M, King GJ (1998) Aligning 24
male and female linkage maps of apple (Malus pumila Mill.) using multi-allelic markers. Theor Appl Genet 25
97: 60-73 26
Mellidou I, Chagné D, Laing WA, Keulemans J, Davey MW (2012) Allelic variation in paralogs of 27
GDP-L-galactose phosphorylase is a major determinant of vitamin C concentrations in apple fruit. Plant 28
Physiol 160: 1613-1629 29
Page 21 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
22
Meyer S, Mumm P, Imes D, Endler A, Weder B, Al-Rasheid KA, Geiger D, Marten I, Martinoia E, Hedrich R 1
(2010) AtALMT12 represents an R-type anion channel required for stomatal movement in Arabidopsis 2
guard cells. Plant J 63:1054-62 3
Myles S, Chia JM, Hurwitz B, Simon C, Zhong GY, Buckler E, Ware D (2010) Rapid genomic characterization 4
of the genus vitis. PLoS One 5: e8219 5
Pereira-Lorenzo S, Ramos-Cabrer AM, Fischer M (2009) Breeding Apple (Malus x Domestica Borkh). 6
Breeding Plantation Tree Crops: Temperate Species pp 33-81 7
Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype 8
data. Genetics 155: 945-959 9
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly 10
MJ, Sham PC (2007) PLINK: a tool set for whole-genome association and population-based linkage 11
analyses. Am J Hum Genet 81: 559-575 12
Qian W, Liao BY, Chang AY, Zhang J (2010) Maintenance of duplicate genes and their functional redundancy 13
by reduced expression. Trends Genet 26: 425-430 14
Rafalski JA (2010) Association genetics in crop improvement. Curr Opin Plant Biol 13: 1-7 15
Richards CM, Volk GM, Reilley AA, Henk AD, Lockwood DR, Reeves PA, Forsline PL (2008) Genetic diversity 16
and population structure in Malus sieversii, a wild progenitor species of domesticated apple. Tree Genet 17
Genomes 5: 339-347 18
Segura V, Cilas C, Costes E, (2008) Dissecting apple tree architecture into genetic, ontogenetic and 19
environmental effects: mixed linear modelling of repeated spatial and temporal measures. New Phytol 178: 20
302-314 21
Sun W, Cao ZY, Li Y, Zhao YX, Zhang H (2007) A simple and effective method for protein subcellular 22
localization using Agrobacterium-mediated transformation of onion epidermal cells. Biologia 62: 529-532 23
Thornsberry JM, Goodman MM, Doebley J, Kresovich S, Nielsen D, Buckler ES (2001) Dwarf8 24
polymorphisms associate with variation in flowering time. Nat Genet 28: 286-289 25
Velasco R, Zharkikh A, Affourtit J, Dhingra A, Cestaro A, Kalyanaraman A, Fontana P, Bhatnagar SK, Troggio 26
M, Pruss D, Salvi S, Pindo M, Baldi P, Castelletti S, Cavaiuolo M, Coppola G, Costa F, Cova V, Dal Ri A, 27
Goremykin V,Komjanc M, Longhi S, Magnago P, Malacarne G, Malnoy M, Micheletti D, Moretto M, 28
Perazzolli M, Si-Ammour A, Vezzulli S, Zini E, Eldredge G, Fitzgerald LM, Gutin N, Lanchbury J, Macalma T, 29
Mitchell JT, Reid J, Wardell B, Kodira C, Chen Z, Desany B, Niazi F, Palmer M, Koepke T, Jiwan D, Schaeffer S, 30
Page 22 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
23
Krishnan V, Wu C, Chu VT, King ST, Vick J, Tao Q, Mraz A, Stormo A, Stormo K, Bogden R, Ederle D, Stella A, 1
Vecchietti A, Kater MM, Masiero S, Lasserre P, Lespinasse Y, Allan AC, Bus V, Chagné D, Crowhurst RN, 2
Gleave AP, Lavezzo E, Fawcett JA, Proost S, Rouzé P, Sterck L, Toppo S, Lazzari B, Hellens RP, Durel CE, Gutin 3
A, Bumgarner RE, Gardiner SE, Skolnick M, Egholm M, Van de Peer Y, Salamini F, Viola R (2010) The 4
genome of the domesticated apple (Malus × domestica Borkh.). Nat Genet 42: 833-839 5
Verde I, Bassil N, Scalabrin S, Gilmore B, Lawley CT, Gasic K, Micheletti D, Rosyara UR, Cattonaro F, 6
Vendramin E, Main D, Aramini V, Blas AL, Mockler TC, Bryant DW, Wilhelm L, Troggio M, Sosinski B, 7
Aranzana MJ, Arús P, Iezzoni A, Morgante M, Peace C (2012) Development and evaluation of a 9K SNP 8
array for peach by internationally coordinated SNP detection and validation in breeding germplasm. PLoS 9
One 7: e35668 10
Verde I, Abbott AG, Scalabrin S, Jung S, Shu S, Marrono F, Zhebentyayeva T, Dettori MT, Grimwood J, 11
Cattonaro F, Zuccolo A, Rossini L, Jenkins J, Vendramin E, Meisel LA, Decroocq V, Sosinski B, Prochnik S, 12
Mitros T, Policriti A, Cipriani G, Dondini L, Ficklin S, Goodstein DM, Xu P, Fabbro CD, Aramini V, Copetti D, 13
Gonzalez S, Horner DS, Falchi R, Lucas S, Mica E, Maldonado J, Lazzari B, Bielenberg D, Pirona R, Miculan M, 14
Barakat A, Testolin R, Stella A, Tartarini S, Tonutti P, Arús P , Orellana A, Wells C, Main D, Vizzotto G, Silva H, 15
Salamini F, Schmutz J, Morgante M, Rokhsar, DS (2013) The high-quality draft genome of peach (Prunus 16
persica) identifies unique patterns of genetic diversity, domestication and genome evolution. Nat Genet 45: 17
487-494 18
Wagner I, Weeden NF (2000) Isozyme in Malus sylvestris, Malus × domestica and in related Malus species. 19
Acta Horticulturae 538: 51-56 20
Sun W, Cao ZY, Li Y, Zhao YX, Zhang H (2007) A simple and effective method for protein subcellular 21
localization using Agrobacterium-mediated transformation of onion epidermal cells. Biologia 62: 529-532. 22
Wen YQ, Cui J, Zhang Y, Duan CQ, Pan QH (2013) Comparison of organic acid levels and L-IdnDH expression 23
in Chinese-type and European-type grapes. Euphytica 196: 63-76 24
Wu J, Gao H, Zhao L, Liao X, Chen F, Wang Z, Hu X (2007) Chemical compositional characterization of some 25
apple cultivars. Food Chem 103: 88-93 26
Xu K, Wang A, Brown S (2012) Genetic characterization of the Ma locus with pH and titratable acidity in 27
apple. Mol Breeding 30: 899-912 28
Xu, Korban SS (2002) A cluster of four receptor-like genes resides in the Vf locus that confers resistance to 29
apple scab disease. Genetics 162: 1995-2006 30
Xu Q, Chen LL, Ruan X, Chen D, Zhu A, Chen C, Bertrand D, Jiao WB, Hao BH, Lyon MP, Chen J, Gao song, 31
Page 23 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
24
Xing F, Lan H, Chang JW, Ge X, Lei Y, Hu Q, Miao Y, Wang L, Xiao S, Chen YJ, Xu J, Liu JH, Luo OJ, Tang Z, Guo 1
WW, Kuang H, Zhang HY, Roose ML, Nagarajan N, Deng XX, Ruan Y (2013) The draft genome of sweet 2
orange (Citrus sinensis). Nat Genet 45: 59-66 3
Yu J, Pressoir G, Briggs WH, Vroh BiI, Yamasaki M, Doebley JF, McMullen MD, Gaut BS, Nielsen DM, Holland 4
JB, Kresovich S, Buckler ES (2006) A unified mixed-model method for association mapping that accounts 5
for multiple levels of relatedness. Nat Genet 38: 203-208 6
Yu JM, Buckler ES (2006) Genetic association mapping and genome organization of maize. Curr Opin 7
Biotech 17: 155-160 8
Zhang J, Baetz U, Krügel U, Martinoia E, De Angeli A. (2013) Identification of a probable pore-forming 9
domain in the multimeric vacuolar anion channel AtALMT9. Plant Physiol 163:830-843 10
Zhang Q, Ma BQ, Li H, Chang YS, Han YY, Li J, Wei GC, Zhao S, Khan MA, Zhou Y, Gu C, Zhao XZ, Han ZH, 11
Korban SS, Li SH, Han YP.(2002) Identification, characterization, and utilization of genome-wide simple 12
sequence repeats to identify a QTL for acidity in apple. BMC Genomics 13:537. 13
14
Page 24 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
25
Table legends 1
Table 1 Candidate gene-based association analysis of Ma1 homologues and malic acid content in apple 2
mature fruits. *: Negative and positive numbers indicate upstream or downstream, respectively, of start or 3
stop codons. The names with prefix ‘MDP0000’ represent accession numbers of apple genes deposited in 4
Genome Database for Rosaceae (https://www.rosaceae.org). 5
6
Table 2 Correlations between haplotypes in the Ma1 gene and malic acid contents in mature fruits of an 7
apple population. 8
9
Table 3 Functional analysis of the Ma1 gene based on expression in yeast lines. *Ma1 containing a GGCC 10
sequence at the A/G polymorphic site and has a complete coding sequence. N/D, Not determined. 11
Means with different letters within the same column are significantly different at the 0.01 level of 12
probability. 13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Page 25 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
26
Figure legends 1
Figure 1. Figure 1. Genes encoding aluminum-activated malate transporter in the apple genome. (a) 2
Chromosomal distribution of the apple ALMT genes, whose accession numbers are retrieved from the 3
Genome Database for Rosaceae (GDR) database (http://www.rosaceae.org/). The dashed line indicates 4
anchored sequences. Genes belonging to the ALMTII family are highlighted in black background. Accession 5
numbers in brackets are identical in coding DNA sequences to their neighbours. (b) Phylogentic tree 6
derived from amino acid sequences of ALMT genes in apple and Arabidopsis. The numbers near branches 7
indicate bootstrap values calculated from 1,000 replicate analyses. 8
9
Figure 2. Characterization of the Ma1 gene responsible for fruit acidity in apple. (a) Genomic structure of 10
the Ma1 gene. SNP and WBG90 represent A/G polymorphic and microsatellite (TC)n loci, respectively. (b) 11
An example of genotyping of the apple population using two molecular tags of the Ma1 gene. I, a 12
polyacrylamide gel of SSR marker WBG90, and the polymorphic bands are indicated with arrows. II, an 13
agarose gel electrophoresis of PCR products digested with HaeIII enzyme. PCR products containing GGCC 14
sequences can be digested, while PCR products with GACC sequences cannot be digested. (c) The TMD of 15
the Ma1 gene is predicted to be formed by six putative transmembrane a-helices (TMa1–TMa6) with the N 16
terminus being located in the vacuole. The star indicates the A/G polymorphic locus, with the allele ‘A’ 17
corresponding to a premature stop codon. 18
19
Figure 3. Mean values of malic acid contents in mature fruits of different genotypes that are classified 20
based on either the A/G SNP or the microsatellite (TC)n locus of the Ma1 gene in apple. Different 21
lowercase letters indicate significant differences among genotypes (T-test, least significant difference test 22
at P < 0.01). Error bars show the SE of the mean. 23
24
Figure 4. Analysis of expression profiles of the eight apple ALMT genes using RT-PCR. S1, S2, and S3 25
indicate 30, 60, and 90 days after full bloom, respectively. 1, 2, 3, and 4 represent cv. Yanhongmei, Bell de 26
Boskoop, Zhumary, and Aifeng, respectively. No RT-PCR products of the expected sizes were obtained for 27
five ALMT genes (MDP0000312421, MDP0000791995, MDP0000244249, MDP0000259751, 28
MDP0000290997, and MDP0000768640). 29
30
Figure 5. Expression profiles of the Ma1 gene in fruits of different apple varieties. The genotypes at the 31
Page 26 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
27
A/G SNP locus are as follows: AA in cv. Aifeng and Belld de Boskoop; AG in cv. Zhumary, Jincui and 1
Xinnongjin; GG in cv. McIntosh Red, Annuo and Yanhongmei. All the eight cultivars show similar ripening 2
periods, with maturation occurring at approximately 90 days after full bloom. 3
4
Figure 6. Subcellular localization of Ma1 -GFP by transient expression in epidermal cells of onion. A, 5
Fluorescence of the GFP protein is detected throughout the whole cell, including both cytoplasm and 6
nucleus. B, Cells stained with the membrane dye Dil. C, Fluorescence of the ma1-GFP fusion protein is 7
targeted to the cell membrane. D, Fluorescence of the Ma1-GFP is targeted to the tonoplast, which 8
surrounds the nucleus (marked by arrows). E, Fluorescence of the AtALMT12-GFP fusion protein that was 9
previously localized at the cell membrane (Meyer et al. 2010). F, Fluorescence of the AtMT9-GFP fusion 10
protein that was previously localized at the tonoplast (Zhang et al. 2013). Ma1 and ma1 represent two 11
different alleles harboring GGCC and GACC sequences, respectively, at the A/G polymorphic site 252 bp 12
upstream of the stop codon. 13
14
Figure 7. Characterization of the apple collection used in this study. (a) Distribution of three genotypes (AA, 15
AG, and AG) among cultivars from different geographic regions and their wild relatives. Others include 16
United States, New Zealand, and Australia. The genotypes are classified based on the A/G SNP locus of the 17
Ma1 gene. (b) Mean values of malic acid contents in mature fruits of three genotypes (AA, AG, and AG) 18
among cultivars from different geographic regions and their wild relatives. Different lowercase letters 19
indicate significant differences among genotypes (T-test, least significant difference test at P < 0.01). 20
21
22
23
24
25
26
27
28
29
30
Page 27 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
28
Supplemental data 1
Table S1. SSR markers used for assessment of the structure of an apple population. *WBGCAS is a newly 2
developed marker in this study. E-SSR and G-SSR correspond to expressed and genomic microsatellites, 3
respectively. 4
5
Table S2. RT-PCR primers of ALMTII genes in apple. 6
7
Figure S1. Determination of the optimal value of K and inferred population structure of the apple 8
collection. The blue dot represents the ad hoc procedure described in Pritchard et al. (2000), and the 9
blue star indicates the second order of statistics (ΔK) based on Evanno et al. (2005). 10
11
Figure S2. Bayesian population assignment test based on 17 microsatellite loci with the software 12
STRUCTURE 2.2. Then numbers represent the apple accessions as listed in additional data 1. 13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page 28 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
29
Additional Data 1
Additional data 1. Genotype of apple germplasm based on the screening of two molecular markers of the 2
Ma1 gene 3
4
Additional data 2. Additional data 2: Genotyping data of SSR markers used for analysis of pulation 5
structure of apple germplasm. Note: the missing data are indicated with "-9". The numbers represent 6
genotypic values. 7
8
9
Page 29 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
1
1
2
Figure 1. Genes encoding aluminum-activated malate transporter in the apple genome. (a) Chromosomal 3
distribution of the apple ALMT genes, whose accession numbers are retrieved from the Genome Database 4
for Rosaceae (GDR) database (http://www.rosaceae.org/). The dashed line indicates anchored sequences. 5
Genes belonging to the ALMTII family are highlighted in black background. Accession numbers in brackets 6
are identical in coding DNA sequences to their neighbours. (b) Phylogentic tree derived from amino acid 7
sequences of ALMT genes in apple and Arabidopsis. The numbers near branches indicate bootstrap values 8
calculated from 1,000 replicate analyses. 9
10
11
12
13
14
15
16
17
18
19
20
Page 30 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
2
1
2
3
4
5
6
7
8
9
10
11
12
13
14
Figure 2. Characterization of the Ma1 gene responsible for fruit acidity in apple. (a) Genomic structure of 15
the Ma1 gene. SNP and WBG90 represent A/G polymorphic and microsatellite (TC)n loci, respectively. (b) 16
An example of genotyping of the apple population using two molecular tags of the Ma1 gene. I, a 17
polyacrylamide gel of SSR marker WBG90, and the polymorphic bands are indicated with arrows. II, an 18
agarose gel electrophoresis of PCR products digested with HaeIII enzyme. PCR products containing GGCC 19
sequences can be digested, while PCR products with GACC sequences cannot be digested. (c) The TMD of 20
the Ma1 gene is predicted to be formed by six putative transmembrane a-helices (TMa1–TMa6) with the N 21
terminus being located in the vacuole. The star indicates the A/G polymorphic locus, with the allele ‘A’ 22
corresponding to a premature stop codon. 23
24
25
26
27
28
29
30
Page 31 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
3
1
2
3
4
5
6
7
8
9
10
11
12
13
Figure 3. Mean values of malic acid contents in mature fruits of different genotypes that are classified 14
based on either the A/G SNP or the microsatellite (TC)n locus of the Ma1 gene in apple. Different 15
lowercase letters indicate significant differences among genotypes (T-test, least significant difference test 16
at P < 0.01). Error bars show the SE of the mean. 17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page 32 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
4
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
Figure 4. Analysis of expression profiles of the eight apple ALMT genes using RT-PCR. S1, S2, and S3 21
indicate 30, 60, and 90 days after full bloom, respectively. 1, 2, 3, and 4 represent cv. Yanhongmei, Bell de 22
Boskoop, Zhumary, and Aifeng, respectively. No RT-PCR products of the expected sizes were obtained for 23
five ALMT genes (MDP0000312421, MDP0000791995, MDP0000244249, MDP0000259751, 24
MDP0000290997, and MDP0000768640). 25
26
27
28
29
30
31
32
33
34
35
Page 33 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
5
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
Figure 5. Expression profiles of the Ma1 gene in fruits of different apple varieties. The genotypes at the 26
A/G SNP locus are as follows: AA in cv. Aifeng and Belld de Boskoop; AG in cv. Zhumary, Jincui and 27
Xinnongjin; GG in cv. McIntosh Red, Annuo and Yanhongmei. All the eight cultivars show similar ripening 28
periods, with maturation occurring at approximately 90 days after full bloom. 29
30
Page 34 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
6
1
2
3
4
5
6
7
8
9
10
11
12
13
14
Figure 6. Subcellular localization of Ma1 -GFP by transient expression in epidermal cells of onion. A, 15
Fluorescence of the GFP protein is detected throughout the whole cell, including both cytoplasm and 16
nucleus. B, Cells stained with the membrane dye Dil. C, Fluorescence of the ma1-GFP fusion protein is 17
targeted to the cell membrane. D, Fluorescence of the Ma1-GFP is targeted to the tonoplast, which 18
surrounds the nucleus (marked by arrows). E, Fluorescence of the AtALMT12-GFP fusion protein that was 19
previously localized at the cell membrane (Meyer et al. 2010). F, Fluorescence of the AtMT9-GFP fusion 20
protein that was previously localized at the tonoplast (Zhang et al. 2013). Ma1 and ma1 represent two 21
different alleles harboring GGCC and GACC sequences, respectively, at the A/G polymorphic site 252 bp 22
upstream of the stop codon. 23
24
25
26
27
28
29
30
31
32
Page 35 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
7
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
Figure 7. Characterization of the apple collection used in this study. (a) Distribution of three genotypes (AA, 22
AG, and AG) among cultivars from different geographic regions and their wild relatives. Others include 23
United States, New Zealand, and Australia. The genotypes are classified based on the A/G SNP locus of the 24
Ma1 gene. (b) Mean values of malic acid contents in mature fruits of three genotypes (AA, AG, and AG) 25
among cultivars from different geographic regions and their wild relatives. Different lowercase letters 26
indicate significant differences among genotypes (T-test, least significant difference test at P < 0.01). 27
28
29
Page 36 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
1
1
Table 1 Candidate gene-based association analysis of Ma1 homologues and malic acid content in apple 2
mature fruits. 3
Homologue of Ma1
Molecular tag
LG
Primer (5’-3’)
P-value
Name Type Polymorphic site Location* Forward Reverse
MDP0000244249 WBG86 SSR (ATG)5 -566 bp 16 AAACCATTCACTTGATATGACA GACGATATCCCAAGTTAACAA 0.08
MDP0000252114 WBG90 SSR (TC)20 -1408 bp 16 GCCCACTAATGCTTTCTGTA CTCCAACAGATTGTCGAATC 0.0007
MDP0000768640 WBG106 SSR (GT)10 +1473 bp 13 CTGCTGGTGCCTACATGTGGT CAGGTAACTATCCGTGACGAG 0.21
MDP0000290997 WBG112 SSR (GGTTTT)3 +1740 bp 6 AGAAGCCAAGAGGACACTCT TCTGTATCGACCATGTGGTG 0.47
MDP0000791995 WBG113 SSR (TG)16 +665 bp U/N TTAATTTGTTTGGCGTTGGG TCATGGACGGTGATAGCTGA 0.02
MDP0000259751 WBG117 SSR (CTG)7 -252 bp 1 GTAGAGGCGGCCATGAATAA CTTCTCCGATCTCACAAGCC 0.03
MDP0000299697 WBG119 SSR (TA)9 +384 bp 14 CCTGTAACGACCCTGTATCG ACTTTTTCTTCGTCCGAATC 0.24
MDP0000312421 WBG121 CAPS GKAC +513 bp 13 AGATAACTCTGCAATCCCCAG TCCGACTTCGGTTTCAACAG 0.79
*: Negative and positive numbers indicate upstream or downstream, respectively, of start or stop codons. 4
The names with prefix ‘MDP0000’ represent accession numbers of apple genes deposited in Genome 5
Database for Rosaceae (https://www.rosaceae.org). 6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
Page 37 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
2
1
2
3
Table 2 Correlations between haplotypes in the Ma1 gene and malic acid contents in mature fruits of an 4
apple population. 5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
Haplotype SNP
(R=A/G)
WBG90 Frequency Correlation between haplotype and malic
acid content
Regression coefficient Probability
HapA G (TC)17 0.192 3.24 4.69E-18
HapB A (TC)17 0.296 -1.09 0.00676
HapC G (TC)20 0.115 1.32 0.01400
HapD A (TC)20 0.397 -3.12 1.42E-15
Page 38 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
3
1
2
3
Table 3 Functional analysis of the Ma1 gene based on expression in yeast lines. 4
YPDA liquid
medium
Construct* Malic acid content (mg/g) Mean value
1 2 3
Containing
0.02g/mL
malic acid
Ma1 3.85 3.67 3.87 3.80a ±0.11
ma1 1.45 1.55 1.41 1.47b ±0.07
Empty vector 1.38 1.29 1.40 1.36b ±0.06
Without malic
acid
Ma1 N/D N/D N/D
ma1 N/D N/D N/D
Empty vector N/D N/D N/D
* Ma1 containing a GGCC sequence at the A/G polymorphic site and has a complete coding sequence. N/D, 5
Not determined. Means with different letters within the same column are significantly different at the 6
0.01 level of probability. 7
8
9
10
11
12
Page 39 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
1
Supplemental data 1
2
Table S1. SSR markers used for assessment of the structure of an apple population 3
Marker* LG Type Motif Primer (5’-3’) Expected
size Forward Reverse
Hi20b03 8 G-SSR (ACA)7 AAACTGCAATCCACAACTGC GTTTAGTTGCTAATGGCGTGTCG 226 bp
Hi08g06 10 G-SSR (GAA)5 AATCGAACCAGCACAGGAAG GTTTAGATGGAGGTCGTGGTTACG 192 bp
Hi08h03 4 G-SSR (TTG)6 GCAATGGCGTTCTAGGATTC GGTGGTGAACCCTTAATTGG 153 bp
CN868471 9 E-SSR (TTC)6 TTCCTGTGAACCCAGTCCTC GATCGACAGGCAACATCCTT 281 bp
BACSSR20 11 G-SSR (GA)21 TGATACATGAAGTGTGCTTCCTC TGAGAGAACTGAATCGGGCT 229 bp
BACSSR51 15 G-SSR (TA)10 ACCAGCTGTTTCCAGTTTCC AAAGATTTATTGGCGGGCAT 134 bp
CN893277 5 E-SSR (AG)11 CACTGCAGGAACTGCAAAAA AGAAAGGGGTTGAATTTGGG 246 bp
CN876284 17 E-SSR (AGA)6 CAGCGAGGAGAAGGAAATTG GTTCCAGAACTTCACGCCAT 102 bp
BACSSR58 3 G-SSR (AT)10 ATGATCTGCATGGTGGTTCC GGCTTTGACTTCGTTTCAGC 245 bp
CH05h05 13 G-SSR (TC)19 ACATGTCACTCCTACGCGG GTGCAGTGATTAGCATTGCTGT 180 bp
WBGCAS84 16 G-SSR (AGG)6 CATGTTGTTTGGATATGGATATG TCTCACATCGGTTTCTCTGC 157 bp
WBGCAS130 6 G-SSR (GA)20 GGAGAGAGGATATGGCGTGA TACCAACACGCTCTCCACTC 141bp
WBGCAS131 2 G-SSR (AT)15 TGCTGGTAAGGGACATTACG TCTGGTTTGACCAAGGTTCA 274bp
WBGCAS132 14 G-SSR (CT)10 CGCGTCTTGAAACATTCCAA TGCGGTTTTTGAGATTGTGA 183bp
WBGCAS133 7 G-SSR (AT)11 CTTGGCCCCATTACGAAA GACATCCTTGATGTGCCACA 255bp
WBGCAS134 1 G-SSR (CT)30 ATGGGATACATTGGCTCCAC TGCACCACCTTCACACAGAA 220bp
WBGCAS135 12 G-SSR (CT)22 GCTGCTGCTGGAGGAAGTTT GAAGTTGGTCTCCTTGAGCA 180p
*WBGCAS is a newly developed marker in this study. E-SSR and G-SSR correspond to expressed and 4
genomic microsatellites, respectively. 5
6
7
8
9
10
11
12
13
14
Page 40 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
2
1
2
3
4
Table S2. RT-PCR primers of ALMTII genes in apple 5
Gene Primer sequence (5’-3’) Expected
size (bp) Forward Reverse
MDP0000252114 CCACAGCTTCGCCGAGAGAA CGACGACAGTGAGAATTGCC 323
MDP0000312421 ATGCCAGCCAGTATTCCATC GTCCCAGACACCAAGACGAT 304
MDP0000244249 GGCAGTTGTGACTCGACTGG GTGGTTCCCAGACTGCAAAT 308
MDP0000791995 TCCTCACTGTTGCCATCATG AGTCAGCCACCGAGTCAAAG 444
MDP0000299697 CTGATGATCGTCTCGCTGCT TCCGATACCCAGACACCATG 355
MDP0000259751 GATTTCACCTGCAAAGCCTC CCTTAAAGGGGAAGCCTTCT 196
MDP0000768640 CCCAGGAAAATCATCTTTGC ACTTCCTCCAACTTTGCAGC 257
MDP0000290997 GCTTTGGTTGTGGCAAACTT CCACCAAGTTGTGCAGATCC 313
Actin TGACCGAATGAGCAAGGAAATTACT TACTCAGCTTTGGCAATCCACATC 245
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
Page 41 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
3
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
Figure S1. Determination of the optimal value of K and inferred population structure of the apple 19
collection. The blue dot represents the ad hoc procedure described in Pritchard et al. (2000), and the 20
blue star indicates the second order of statistics (ΔK) based on Evanno et al. (2005). 21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
Page 42 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016
4
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
Figure S2. Bayesian population assignment test based on 17 microsatellite loci with the software 23
STRUCTURE 2.2. Then numbers represent the apple accessions as listed in additional data 1. 24
25
26
27
28
Page 43 of 43The Plant Genome Accepted paper, posted 06/23/2015. doi:10.3835/plantgenome2015.03.0016