The Apple Genome: A Delicious Promisenyshs.org/wp-content/...Genome-A-Delicious-Promise.pdf · the...

4
NEW YORK FRUIT QUARTERLY . VOLUME 18 . NUMBER 4 . WINTER 2010 11 e Apple Genome: A Delicious Promise Kenong Xu Department of Horticulture New York State Agricultural Experiment Station, Cornell University, Geneva, NY The recent publication of the apple genome sequence is a major advance in apple genetics which should lead to identification of genes that can be used in marker assisted breeding to more rapidly breed improved varieties with many new interesting traits.B enefiting from the rapid advancement in genome sequenc- ing technologies, a draft sequence of the apple genome has been released by an Italian-led international team of sci- entists in August 2010 (Velasco et al. 2010). e genome of ‘Golden Deli- cious’, one of the most widely grown apple varieties in the world, is the source of the pub- lished sequence. e release marks a major breakthrough in apple genetics and genomics. Several mysterious questions have been answered by the release, includ- ing the number of genes encoded by the apple genome, the origin of the 17 chromosomes of apple, and the origin of the domesti- cated apple. Apple is the 12th plant for which genome sequences have been published, representing another important milestone in plant genomics. Apple genome sequence information will aid discovery of genes of biological and economic importance, ac- celerate the pace of genetic improvement of apple, and promise more delicious fruits to the apple industry and consumers. Overview of Genome Sequencing Technology e genome is a total collection of the genetic material in a set of monoploid (x) chromosomes (See Text Box 1). In animals and plants, nucleic DNA (deoxyribonucleic acid) molecules encode genetic information, including genes and non-coding RNAs (ri- bonucleic acid) that can be passed on from one generation to the next. A DNA molecule in chemistry is comprised of four differ- ent bases of nucleotides: adenine (A), guanine (G), cytosine (C), and thymine (T). In cells, millions of these four bases are joined together and turn into ex- tremely large DNA mol- ecules. Each such large DNA mol- ecule coils and combines with various pro- teins to form a DNA-protein complex struc- ture named a chromosome. Genome sequencing determines the order of the four bases of nucleotides in all of the large chromosomal DNA molecules, which helps our understanding of complex life phenomena, such as growth, development, and diseases in humans, animals and plants. A virus genome of approx. 5,000 base pairs or bp (See Text Box 1) was the first genome that was completely sequenced in the mid 1970s. e sequencing work was manually conducted by Dr. Frederick Sanger (Sanger et al. 1977), a winner of two Nobel Prizes in chemistry. e sequencing method relied on nucleotide analogs, known as dideoxynucleotide triphosphates (ddNTPs) as DNA chain terminators. Since its invention, Dr. Sanger’s DNA sequencing technology has been termed Chain- Termination Method or the Sanger Method. rough years of improvement and automation in commercial DNA sequencers, such as the ABI Prism DNA Analyzer 370 (marketed in 1987) and 3700 (marketed in 1998) series, the Sanger Method has become the gold standard for DNA sequencing because of the high accuracy and long sequence reads (See Text Box 1) of up to 1,000 bp. In fact, the Sanger Method is responsible for sequenc- ing the human genome of 3,200 million base pairs (Mb) (Lander et al. 2001; Venter et al. 2001), and the first plant genome of Arabidopsis of 157 Mb (Kaul et al. 2000). However, the Sanger method is too high in cost and low in speed (Fig . 1) to be used for “personal genomics” (Church, 2005), a term that describes individual access to their own genome sequence information, with a cost of $1,000, i.e. “the $1,000 genome”. is had led to the development and commercialization of non-Sanger ultra- high-throughput DNA sequencing systems, called the second generation (G2) sequencers in 2005. Representatives of these G2 sequencing systems include Roche 454 Genome Sequencer, Illumina Genome Analyzer, and Life Technologies SOLiD System. Currently, these G2 sequencers are capable of sequencing DNA at 5,000 Mb/day with a cost of $0.000001/bp, compared with the best Sanger Method based system that can generate 6 Mb/day at $0.05/bp. e G2 sequencers dramatically increased throughput Figure 1. Trend of DNA sequence cost (Adopt- ed from the Economist, Aug 12, 2010, New York. http://www.economist.com/ node/16791936) Text Box 1. Relevant terms in genetics/genomics: Base pair: A DNA molecule normally has two complimentary strands held by weak hydrogen bonds between bases of nucleotides A and T, and between G and C. The complementary bases A and T, or G and C are called a base pair (bp). Chromosome: an organized structure comprised of DNA and protein in cells. Contig: a set of overlapping DNA segments (or sequence reads) derived from a single genetic source. Haploid (n): the number of chromosomes in a gamete, e.g. pollen. Monoploid (x): the number of unique chromosomes in one complete set in a cell. Ploidy: the number of sets of chromosomes in a given cell. Sequence read: A piece of DNA sequence directly generated by a sequencer at the end of sequencing process. The concepts of monoploid and haploid are necessary in describing polyploidy species, such as bread wheat, which is a hexaploid (2n = 6x =42) where a haploid (n) is equivalent to a triploid (3x).

Transcript of The Apple Genome: A Delicious Promisenyshs.org/wp-content/...Genome-A-Delicious-Promise.pdf · the...

Page 1: The Apple Genome: A Delicious Promisenyshs.org/wp-content/...Genome-A-Delicious-Promise.pdf · the world, is the source of the pub-lished sequence. The release marks a major breakthrough

NEW YORK FRUIT QUARTERLY . VOLUME 18 . NUMBER 4 . WINTER 2010 11

The Apple Genome: A Delicious PromiseKenong XuDepartmentofHorticultureNewYorkStateAgriculturalExperimentStation,CornellUniversity,Geneva,NY

“The recent publication of the apple genome sequence is a major advance in apple genetics which should lead to identification of genes that can be used in marker assisted breeding to more rapidly breed improved varieties with many new interesting traits.”

Benefiting from the rapid advancement in genome sequenc-ing technologies, a draft sequence of the apple genome has been released by an Italian-led international team of sci-

entists in August 2010 (Velasco et al. 2010). The genome of ‘Golden Deli-cious’, one of the most widely grown apple varieties in the world, is the source of the pub-lished sequence. The release marks

a major breakthrough in apple genetics and genomics. Several mysterious questions have been answered by the release, includ-ing the number of genes encoded by the apple genome, the origin of the 17 chromosomes of apple, and the origin of the domesti-cated apple. Apple is the 12th plant for which genome sequences have been published, representing another important milestone in plant genomics. Apple genome sequence information will aid discovery of genes of biological and economic importance, ac-celerate the pace of genetic improvement of apple, and promise more delicious fruits to the apple industry and consumers.

Overview of Genome Sequencing TechnologyThe genome is a total collection of the genetic material in a set of monoploid (x) chromosomes (See Text Box 1). In animals and plants, nucleic DNA (deoxyribonucleic acid) molecules encode genetic information, including genes and non-coding RNAs (ri-bonucleic acid) that can be passed on from one generation to the next. A DNA molecule in chemistry is comprised of four differ-ent bases of nucleotides: adenine (A), guanine (G), cytosine (C), and thymine (T). In cells, millions of these four bases are joined

together and turn into ex-tremely large D N A m o l -ecules . Each s u c h l a r g e D N A m o l -ecule coils and combines with various pro-teins to form a DNA-protein complex struc-ture named a chromosome. Genome sequencing determines the order of the four bases of nucleotides in all of the large chromosomal DNA molecules, which helps our understanding of complex life phenomena, such as growth, development, and diseases in humans, animals and plants. A virus genome of approx. 5,000 base pairs or bp (See Text Box 1) was the first genome that was completely sequenced in the mid 1970s. The sequencing work was manually conducted by Dr. Frederick Sanger (Sanger et al. 1977), a winner of two Nobel Prizes in chemistry. The sequencing method relied on nucleotide analogs, known as dideoxynucleotide triphosphates (ddNTPs) as DNA chain terminators. Since its invention, Dr. Sanger’s DNA sequencing technology has been termed Chain-Termination Method or the Sanger Method. Through years of improvement and automation in commercial DNA sequencers, such as the ABI Prism DNA Analyzer 370 (marketed in 1987) and 3700 (marketed in 1998) series, the Sanger Method has become the gold standard for DNA sequencing because of the high accuracy and long sequence reads (See Text Box 1) of up to 1,000 bp. In fact, the Sanger Method is responsible for sequenc-ing the human genome of 3,200 million base pairs (Mb) (Lander et al. 2001; Venter et al. 2001), and the first plant genome of Arabidopsis of 157 Mb (Kaul et al. 2000). However, the Sanger method is too high in cost and low in speed (Fig . 1) to be used for “personal genomics” (Church, 2005), a term that describes individual access to their own genome sequence information, with a cost of $1,000, i.e. “the $1,000 genome”. This had led to the development and commercialization of non-Sanger ultra-high-throughput DNA sequencing systems, called the second generation (G2) sequencers in 2005. Representatives of these G2 sequencing systems include Roche 454 Genome Sequencer, Illumina Genome Analyzer, and Life Technologies SOLiD System. Currently, these G2 sequencers are capable of sequencing DNA at 5,000 Mb/day with a cost of $0.000001/bp, compared with the best Sanger Method based system that can generate 6 Mb/day at $0.05/bp. The G2 sequencers dramatically increased throughput

Figure1. Trend of DNA sequence cost (Adopt-ed from the Economist, Aug 12, 2010,New York. http://www.economist.com/node/16791936)

TextBox1.Relevanttermsingenetics/genomics:

• Base pair: A DNA molecule normally has two complimentary strands held by weak hydrogen bonds between bases of nucleotides A and T, and between G and C. The complementary bases A and T, or G and C are called a base pair (bp).

• Chromosome: an organized structure comprised of DNA and protein in cells.

• Contig: a set of overlapping DNA segments (or sequence reads) derived from a single genetic source.

• Haploid (n): the number of chromosomes in a gamete, e.g. pollen. • Monoploid (x): the number of unique chromosomes in one complete

set in a cell. • Ploidy: the number of sets of chromosomes in a given cell. • Sequence read: A piece of DNA sequence directly generated by a

sequencer at the end of sequencing process.• The concepts of monoploid and haploid are necessary in describing

polyploidy species, such as bread wheat, which is a hexaploid (2n = 6x =42) where a haploid (n) is equivalent to a triploid (3x).

Page 2: The Apple Genome: A Delicious Promisenyshs.org/wp-content/...Genome-A-Delicious-Promise.pdf · the world, is the source of the pub-lished sequence. The release marks a major breakthrough

12 NEW YORK STATE HORTICULTURAL SOCIETY

and also exponentially reduced sequencing cost (Figure 1). But the G2 systems have a major drawback of short read length (30-450 bp), requiring extremely high computing power for data storage and analyses. To keep the advantages of the Sanger Method based sequenc-ers and the high throughput of the G2 sequencers while eliminat-ing their drawbacks, the third generation (G3) sequencers using single molecule sequencing technologies have emerged recently. The most promising G3 system is the Single Molecule Real Time (SMRT) DNA sequencing system (PacBio RS) by Pacific Biosci-ences, which can produce sequence reads of 1,000-10,000 bp at a speed of 10,000 to 20,000 times faster than the G2 systems (McCarthy 2010). The central piece of the PacBio RS is the zero-mode wave-guide (ZMW) technology (Travers et al. 2010). The ZMW concept (See Text Box 2) was initially developed in the Laboratories of Dr. Watt Webb and Dr. Harold Craighead at Cornell University (Levene et al. 2003). Dr. Stephen W. Turner, a 2000 Cornell PhD graduate in physics and a contributor to the

concept, is the founder of Pacific Biosciences. Despite the rapid advance of DNA sequencing technologies, more innovative approaches are in the research and development stage, such as the microscopic image-based direct sequence read off (Tanaka and Kawai 2009) and the nanopore sequencing ap-proach (Branton et al. 2008). Commercialization of these future DNA sequencing technologies will provide a real possibility of sequencing a human genome with 10× coverage in one day at a cost of $1,000.

Sequencing the Apple GenomeApple is a diploid species (2n = 2x = 34), and an apple somatic cell, such as a leaf cell, has 34 chromosomes. Apple haploid (n), such as pollen, and monoploid (x) have the same number of chromosomes: a complete set of 17 unique chromosomes (See Text Box 1). However, the apple genome is defined by all DNA molecules in one complete set of 17 monoploid (x) chromosomes. Therefore, two homologous (or similar) genomes co-exist in an apple somatic cell. Since apple reproduction relies on out-cross-ing or crossing with another apple variety, the two homologous genomes are also highly heterozygous (or different). As a result, considerable variation is present in DNA sequences between the two homologous genomes, creating a number of uncertainties in assembling genome sequences. The estimated size of the apple genome is approx. 800 mil-lion base pairs (Mb), larger than many plants, including peach (230 Mb), grape (480Mb) and poplar (485 Mb). These attributes (genome heterozygosity and size) posed technical challenges but also financial challenges to sequence the apple genome before the G2 sequencers became available, which led to a significant drop in sequencing cost (Figure 1). The apple genome sequencing project was initiated

in 2007 when the sequencing cost was about $0.0001/bp (Figure 1). The project involved 85 scientists from Italy, US, New Zea-land, France and Belgium, and was coordinated by Dr. Riccardo Velasco, a plant geneticist based at the E. Mach Foundation, Trento, Italy (Velasco et al. 2010). A strategy known as Whole-Genome Shotgun (WGS) was adopted in apple genome sequencing. Prior to sequencing, high molecular weight genomic DNA (DNA molecule of several Mb) was isolated and fragmented into smaller pieces. The smaller pieces of DNA were used to construct 27 genomic libraries with size ranging from 500 bp to 135 kilo (1000) base pairs (Kb) for sequencing. Both the Sanger Method based sequencers (GE Healthcare’s MegaBACE 4500 and ABI 3730 XL DNA ana-lyzer) and the G2 sequencers (Roche/454 Genome Sequencer) were used. Ultimately, a total of 39.2 million reads accounting for approx. 13 billion sequenced nucleotides were generated, among which 26% were by the Sanger Method while 74% by the Roche/454 Genome Sequencers (Velasco et al. 2010). Assembling

of the 39.2 million reads (based on identical sequences shared between individual reads) has led to 122,146 contigs (See Text Box 1) of an accumulative length of 604 Mb, covering 81.3% of the genome with a new estimate of 742.3 Mb (See Text Box 3). Using 1,643 genetic markers, 528.3 Mb representing 103,076 contigs were anchored to 17 chromosomes. The anchored 528.3 Mb sequences are gene rich regions and contain more than 90% of the apple genes assigned with chromosomes (Velasco et al. 2010). However, 138.4 Mb (18.6% of the genome) have not been determined with their corresponding chromosomal origins al-though most (98%) are repetitive elements that make up nearly two thirds of the apple genome. In addition, an extensive final touch is needed to close the 100,000+ sequence gaps between and among the contigs anchored to chromosomes to enhance the utility of apple genome sequences.

Major Findings Uniqueness of the apple genome: Analyses of the apple genome sequences find that the apple genome encodes 57,386 predicted genes (Velasco et al. 2010). This number is the highest among the sequenced plant genomes to date (Table 1). Importantly, nearly

TextBox2.

• Zero-mode wave-guide: an optical waveguide that guides light energy into a volume that is small in all dimensions compared to the wavelength of the light in biophysics.

TextBox3.Applegenomebynumbers

• 27 genomic libraries constructed for sequencing.• 13 billion of nucleotides sequenced • x = 17 the chromosome number of apple • 742.3 millions of nucleotides the genome size• 57,386 genes encoded

Table1.Uniquenessofapplegenomeinplants1.

Genes related to biosynthesis of Genes related Resistance volatiles, aromatics, to sorbitol MADS-box II No. of genes pigments and metabolism transcription genes (No.) antioxidants (No. and %) (No.) factors (No.)

Apple 57,386 992 1,246 (2.2%) 71 91Cucumber 26,682 ND (61) ND ND n.d.Soybean 46,430 392 958 (2.00%) 43 212Poplar 45,654 402 1,034 (2.2%) 14 62Arabidopsis 27,228 178 719 (2.7%) 28 45Grape 33,514 341 1,121 (3.3%) 9 32Rice 40,577 535 910 (1.7%) 27 44Maize 32,540 129 457 (1.4%) 25 142

1Modified from (Velasco et al., 2010). ND: Not determined.

Page 3: The Apple Genome: A Delicious Promisenyshs.org/wp-content/...Genome-A-Delicious-Promise.pdf · the world, is the source of the pub-lished sequence. The release marks a major breakthrough

NEW YORK FRUIT QUARTERLY . VOLUME 18 . NUMBER 4 . WINTER 2010 13

20% (11,444) of those predicted genes are apple specific as they do not show significant similarities with other plant genes (Velasco et al. 2010). Discovery of the large number of apple specific genes strongly suggests that the apple genome is noticeably unique, sup-porting the distinctive patterns of apple as a tree fruit in growth, development and adaptation to the temperate environments. Several gene families are overrepresented in apple genome, including those related to sorbitol (a type of sugar) metabolism, disease resistance, and biosynthesis of volatiles, aromatics, pig-ments and antioxidants (Table 1). In addition, apple-specific groups of genes encoding MADS-box transcription factors (See Text Box 4) which are important in regulating apple fruit growth and development are identified as well (Table 1). The Origin of Tribe Pyreae: Cultivated apple is Malus × domestica, which belongs to the family Rosaceae, tribe Pyreae and genus Malus. Most Rosaceae have 7, 8 or 9 monoploid (x) chromosomes. But apple and pear have 17 chromosomes (x = 17). To explain the high number chromosomes, a number of models have been proposed. A recent study (Evans and Campbell 2002)

suggested that apple was derived from a duplication of the ge-nome of an apple ancestor that is similar to the present Gillenia (x = 9) in North America (see Text Box 5). Analyses of the apple genome sequences revealed that many chromosomal segments are largely duplicated, strongly suggesting there was a genome duplication event that occurred around 50 million years ago. It is likely that the Pyreae ancestor genome underwent duplication and subsequent chromosomal rearrangement, leading to a change from the monoploid chromosome number of 9 in the American ancestor to the current 17 of apple (Figure 2), supporting the idea of Evans and Campbell (2002). The Origin of the Domesticated Apple: There are 25 to 47 Malus species recognized. Among these Malus species, M. sieversii, is still widespread in the wild in the Tian-Shan moun-

tain areas across Kazakhstan and China, is largely regarded as the major progenitor species of domesticated apple based on morphological and molecular studies (Harris et al. 2002; Juni-per 2007). U.S. scientists have explored this region to collect M. sieversii germplasm to preserve and expand gene pools useful for apple genetic improvement (Forsline et al. 2003; Luby et al. 2001). However, this popular view has been challenged by a broad study of Malus chloroplast DNA variations (from the maternal side), suggesting that another species M. sylvestris might be geneti-cally closer to the domesticated apple than M. sieversii (Coart et al. 2006). To resolve the mysterious origin of the domesticated apple, 23 nucleic genes across 74 Malus accessions including M. domestica (12), M. sieversii (10) and M. sylvestris (21) were sequenced and compared (Velasco et al. 2010). The comprehen-sive study supported the view that M. domestica was originated from M. sieversii. As the cultivated apple was domesticated from M. sieversii only around 4000 years ago, a relatively short time in plant evolution, the study proposes that two species merged into one (Velasco et al. 2010). These major findings significantly increased our knowledge and understanding of apple as a fruit crop. Identification of apple specific genes provides clues to tackle important needs in the apple industry, such as better fruit quality and disease resistance. Understanding of the origin of apple genome and the origin of the domesticated apple should help guide us to introduce useful genes into apple from other species of Malus or even Pyreae to adapt to changing of environment.

From Discovery of Genes to Improvement of AppleHistory of apple genetic improvement is all about enhancing the ability of apple tree to bear more and better quality fruits (crispy, juicy, balanced sugar and acid, and long storage and shelf life), to resist various diseases, and to tolerate diverse environmental stresses. Many new apple varieties have been released with improvement in one or more of these traits, highlighting signifi-cant progress in apple breeding. For example, the Cornell Apple Breeding Program led by Dr. Susan Brown has just released two managed apple varieties (New York 1 and 2) with excellent fruit quality. But apple genetic improvement will remain challenging because of the complexity of evaluating these traits of economic importance and long juvenile stages before flowering and fruiting. In addition, genetic control of most of these traits is complex as well. The wealth of information in the apple genome sequence will make discovery of genes or gene networks controlling the traits of breeding objectives less difficult. For example, the Tree Fruit Genomics Program at Cornell University has been investigating apple fruit acidity, an important components of apple fruit quality, to understand the genetics behind fruits of tart, sweet or balanced taste. Previously, genomic regions that have significant effects on fruit acid levels (measured by titratable acid) were mapped to 7 chromosomes, with chro-mosome 16 having the strongest effect (Kenis et al. 2008; King et al. 2001; King et al. 2000). In genomics, these chromosomal regions showing significant effects on traits such as acidity are called quantitative trait loci (QTL). Considering the fact that the genomic regions on chromosome 16 are co-localized with the region of the Ma gene, the malic acid content gene (Maliepaard et al. 1998), it would be interesting to study the Ma region in detail. With the apple genome sequences, the Ma gene will likely be identified in the near future. A simple and direct application

Figure2. AdoptedfromVelascoetal(2010).

TextBox4.

A transcription factor is a protein that controls the gene expression from DNA to messenger RNA (mRNA) by binding to specific DNA sequences.

TextBox5.

• Gillenia is a genus of Rosaceae, which includes common plants Bowman’s root, Indian-physic and American ipecac.

Page 4: The Apple Genome: A Delicious Promisenyshs.org/wp-content/...Genome-A-Delicious-Promise.pdf · the world, is the source of the pub-lished sequence. The release marks a major breakthrough

14 NEW YORK STATE HORTICULTURAL SOCIETY

of this study, if successful, would be to develop a Ma gene based molecular marker to assist apple breeding program to remove up to 25% of seedlings with low acid in most of the breeding populations before planting, minimizing the operational cost while increasing selection efficiency. To enhance the application of genomic information in breed-ing programs, the RosBREED project (http://www.rosbreed.org/) has been funded by USDA NIFA Specialty Crops Research Initiative since 2009. The goal of RosBREED is to facilitate the use of marker assisted-breeding (MAB) in US breeding programs of Rosaceae (including apple, peach, sweet and tart cherries, and strawberry) to deliver improved plant materials more efficiently and rapidly. Combining the recently increased understanding in apple genetics and genomics with the expected implementation of MAB will lead to an accelerated pace of apple improvement for more delicious fruits.

Literature CitedBranton, D., D. W. Deamer, A. Marziali, H. Bayley, S. A. Benner et

al., 2008 The potential and challenges of nanopore sequenc-ing. Nature Biotechnology 26: 1146-1153.

Church, G. M., 2005 The Personal Genome Project. Molecular Systems Biology 1: 3.

Coart, E., S. Van Glabeke, M. De Loose, A. S. Larsen and I. Roldan-Ruiz, 2006 Chloroplast diversity in the genus Malus: new insights into the relationship between the European wild apple (Malus sylvestris (L.) Mill.) and the domesticated apple (Malus domestica Borkh.). Molecular Ecology 15: 2171-2182.

Evans, R. C., and C. S. Campbell, 2002 The origin of the apple subfamily (Maloideae; Rosaceae) is clarified by DNA se-quence data from duplicated GBSSI genes. American Journal of Botany 89: 1478-1484.

Forsline, P. L., H. S. Aldwinckle, E. E. Dickson, J. J. Luby and S. C. Hokanson, 2003 Collection, Maintenance, Characteriza-tion and Utilization of Wild Apples of Central Asia, pp. 1-61 in Horticultural Reviews, edited by J. Janick. John Wiley & Sons, Inc., New York.

Harris, S. A., J. P. Robinson and B. E. Juniper, 2002 Genetic clues to the origin of the apple. Trends in Genetics 18: 426-430.

Juniper, B. E., 2007 The mysterious origin of the sweet apple - On its way to a grocery counter near you, this delicious fruit traversed continents and mastered coevolution. American Scientist 95: 44-51.

Kaul, S., H. L. Koo, J. Jenkins, M. Rizzo, T. Rooney et al., 2000 Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408: 796-815.

Kenis, K., J. Keulemans and M. Davey, 2008 Identification and stability of QTLs for fruit quality traits in apple. Tree Genetics & Genomes 4: 647-661.

King, G. J., J. R. Lynn, C. J. Dover, K. M. Evans and G. B. Seymour, 2001 Resolution of quantitative trait loci for mechanical measures accounting for genetic variation in fruit texture of apple (Malus pumila Mill.). Theoretical and Applied Genetics 102: 1227-1235.

King, G. J., C. Maliepaard, J. R. Lynn, F. H. Alston, C. E. Durel et al., 2000 Quantitative genetic analysis and comparison of physical and sensory descriptors relating to fruit flesh firm-ness in apple (Malus pumila Mill.). Theoretical and Applied Genetics 100: 1074-1084.

Lander, E. S., L. M. Linton, B. Birren, C. Nusbaum, M. C. Zody

et al., 2001 Initial sequencing and analysis of the human genome. Nature 409: 860-921.

Levene, M. J., J. Korlach, S. W. Turner, M. Foquet, H. G. Craig-head et al., 2003 Zero-mode waveguides for single-molecule analysis at high concentrations. Science 299: 682-686.

Luby, J., P. Forsline, H. Aldwinckle, V. Bus and M. Geibel, 2001 Silk road apples - Collection, evaluation, and utilization of Malus sieversii from Central Asia. HortScience 36: 225-231.

Maliepaard, C., F. H. Alston, G. van Arkel, L. M. Brown, E. Chevreau et al., 1998 Aligning male and female linkage maps of apple (Malus pumila Mill.) using multi-allelic markers. Theoretical and Applied Genetics 97: 60-73.

McCarthy, A., 2010 Third Generation DNA Sequencing: Pacific Biosciences’ Single Molecule Real Time Technology. Chem-istry & Biology 17: 675-676.

Sanger, F., S. Nicklen and A. R. Coulson, 1977 DNA Sequencing with Chain-Terminating Inhibitors. Proceedings of the Na-tional Academy of Sciences of the United States of America 74: 5463-5467.

Tanaka, H., and T. Kawai, 2009 Partial sequencing of a single DNA molecule with a scanning tunneling microscope. Nature Nanotechnology 4: 518-522.

Travers, K. J., C. S. Chin, D. R. Rank, J. S. Eid and S. W. Turner, 2010 A flexible and efficient template format for circular consensus sequencing and SNP detection. Nucleic Acids Research 38: e159.

Velasco, R., A. Zharkikh, J. Affourtit, A. Dhingra, A. Cestaro et al., 2010 The genome of the domesticated apple (Malus x domestica Borkh.). Nat. Genet. 42: 833-839.

Venter, J. C., M. D. Adams, E. W. Myers, P. W. Li, R. J. Mural et al., 2001 The sequence of the human genome. Science 291: 1304-1351.

Kenong Xu is a an assistant professor of plant genomics in the Dept. of Horticulture at Geneva who specializes in applied genomics of apple.