DNA Barcodes Genes, Genomics, And tics
Transcript of DNA Barcodes Genes, Genomics, And tics
8/8/2019 DNA Barcodes Genes, Genomics, And tics
http://slidepdf.com/reader/full/dna-barcodes-genes-genomics-and-tics 1/2
8/8/2019 DNA Barcodes Genes, Genomics, And tics
http://slidepdf.com/reader/full/dna-barcodes-genes-genomics-and-tics 2/2
plants, a primer set for matK that in-creases universality, trials of theirmarker on species identification and dis-covery, and the application of barcodesto important environmental issues. Thisarticle is welcome, but as with many of the other publications that have pro-posed candidate plant barcodes, theauthors omit quantitative criteria and
standards that are necessary to comparethe success and applicability of their fa- vored locus against all others.
Successful universal PCR amplifica-tion across a wide range of plants mustbe the primary criterion for selecting aDNA barcode. A challenging tradeoff exists between universal PCR amplifi-cation and high rates of sequencedivergence. This tradeoff, which is par-ticularly problematic in coding loci, isless so in noncoding regions becauseuniversal primers are normally found inthe highly conserved genes that f lankthe hypervariable intergenic spacers.
The taxonomic community has waveredon setting a level of universal amplifica-tion (e.g., all land plants or just flower-ing plants?) and the simplicity of PCRconditions (one primer set for all taxaor multiple sets across taxonomicgroups?) required for a barcode. DNAbarcoding must be practical for a widerange of practitioners and, therefore,the methodology must be accessible andeasily carried out by multiple users. Thepower of DNA barcoding is also directlyproportional to the data available in thebarcode library; building a very com-plete database will greatly increase the
power of DNA barcoding (15). Theseconsiderations require a narrow, stan-dard range of PCR conditions along
with a limited (ideally one) set of PCRprimers per locus that will provide arobust barcode marker for the widestrange of taxa and users. Lahaye et al.(2) purport to have tested their barcodeloci on the widest sample of taxa so farused in any published study. Althoughthe number of species is the largest sam-ple yet published on plants, 96% of those samples are in a single family, Or-chidaceae. The other samples are spreadacross 23 families in 18 orders, which isless than half the families and orderssampled in earlier trials (12, 13). In ad-
dition, they report employing severalprimer pairs, rather than an optimal sin-gle pair, to successfully amplify matK across the samples. For broad universal-ity and simplicity of use, matK has not
yet been demonstrated to pass the testfor a successful plant barcode.
A criterion related to PCR universal-ity is the relative success of a barcode
marker across the major lineages of landplants, including angiosperms, gymno-sperms, ferns, and mosses. Lahaye et al.(2) tested matK only on angiosperms,explicitly stating that it is not importantto select a barcode that works success-fully across all land plants. In today’secosystems in which the vast majority of plants are angiosperms, some might ar-gue that markers should be chosen that
work best for these dominant landplants. Yet given that the purpose of aDNA barcode is to facilitate identifica-tion of unknown samples, includingsmall isolated fragments of tissue, then
the selected loci should work easily onall groups of green land plants.
Conceptually, any consistent, nonzerosequence variation that distinguishes twospecies should work as a DNA barcode.Furthermore, DNA barcodes do not re-quire any demonstration of the homol-ogy of mutations as would be needed ina phylogenetic marker. In other words,low levels of divergence may be suffi-cient to distinguish among species evenif not adequate to estimate phylogeneticrelationships. Relative to CO1 in ani-mals, the mean divergence level betweenspecies in plants is usually quite low (13,
16). Curiously, Lahaye et al. (2) rejectthe gene region that showed the highestdivergence value (trnH – psbA) in favor of matK , which showed nearly 50% lessinterspecific divergence. As of yet aquantitative metric that can be used tocompare barcode candidates does notexist. The use of a simple statistic thatcould be calculated as the product of the levels of PCR universality and se-quence divergence would allow for di-rect comparisons between putative DNAbarcode markers. The proportion of taxa that are successfully amplified andsequenced across a targeted test set to-gether with the percentage of speciespairs that are differentiated by a partic-
ular locus are independent charactersthat can be combined as a product of the two values into a single metric forcomparison.
The simple comparative statistic pro-posed is relevant only when other fac-tors are considered, including the effortrequired to recover the PCR ampliconand the number of different primers andreaction conditions used for sequencingeach putative barcode locus. The suit-ability of a locus for large-scale DNAbarcoding could easily be evaluated bycomparing loci across the same set of taxa under a designated set of reactionconditions. Although not an explicitmeasure of how well a DNA barcode
will perform at identifying species within a bioinformatics context, this sta-tistic takes into account the intrinsictradeoff inherent in a DNA barcodemarker between the ability to amplifya locus and the rate of divergence of that locus across a phylogenetic rangeof t axa.
In conclusion, two final factors thatmay strongly affect how well barcodemarkers work at species identificationand discovery are database design andsequence search strategies. To date theexact method or algorithm to be used insearching the barcode database has notbeen thoroughly investigated nor debated,particularly as regards a multilocusDNA barcode (14, 17). The algorithmsused in the most commonly used data-bases, GenBank and the Barcode of LifeDatabase (BOLD), are quite different.However, a plethora of additional se-quence alignment methodologies areavailable, which can be evaluated foruse in DNA barcodes with regards tothe following: (i) the application of c on-fidence limits to species assignment, (ii)the use of partial sequences in databasesearches, and (iii) the impact on searchalgorithms of sequence length variationdue to insertion/deletion events and theinformative nature of these mutations.Clearly, DNA barcoding has great po-tential for enhancing ecological andevolutionary investigations if the rightgenetic markers are selected. The issuesraised here, if carefully considered andimplemented, will allow a rational selec-tion of a plant DNA barcode based on acomparative and quantitative analysis.
1. Savolainen V,Cowan RS,Vogler AP,Roderick GK,Lane,
R (2005) Phil Trans R Soc London Ser B 360:1805-1811.
2. LahayeR, vander BankM, Bogarin D,WarnerJ, Pupulin
F, GigotG, Maurin O, Duthoit S, Barraclough TG,Savol-
ainen V (2008) Proc Natl Acad Sci USA 105:2923–2928.
3. Kembel SW, Hubbell SP (2006) Ecology 87:S86–S99.
4. WestobyM, Wright IJ (2006) TrendsEcol Evol 21:261–268.
5. Webb CO, Ackerly DD, McPeek MA, Donoghue, MJ
(2002) Annu Rev Ecol Syst 33:475–505.
6. van Straalen NM, Roelofs D (2006) An Introduction to
Ecological Genomics (Oxford Univ Press, London).
7. Hebert PDN, Ratnasingham S, deWaard JR (2003) Proc
Roy Soc B 270(suppl):S96–SS99.
8. Cho Y, Mower JP, Qiu YL, Palmer JD (2004) Proc Natl
Acad Sci USA 101:17741–17746.
9. Pennisi E (2007) Science 318:190–191.
10. Kress WJ, Wurdack KJ, Zimmer EA, Weigt LA, Janzen
DH (2005) P r oc N a tl A c ad S c i U S A 102:8369–
8374.
11. Taberlet P, Coissac E, Pompanon F, Gielly L, Miquel C,
Valentini A, Vermat T, Corthier G, Brochmann C, Will-
erslev E (2007) Nucleic Acids Res 35(3):e14.
12. ChaseMW,Cowan RS,Hollingsworth PM,van denBerg
C, Madrinan S, Petersen G, Seberg O, Jorgsensen T,
CameronKN, Carine M, etal . (2007) Taxon 56:295–299.
13. Kress WJ, Erickson DL (2007) PLoS ONE 2(6):e508.
14. Sass C, Little DP, Stevenson DW, Specht CD (2007) PLoS
ONE 2(11):e1154.
15. Ekrem T, WillassenE, SturE (2007) MolPhylogenetEvol
43:530–542.
16. ShawJ, Lickey EB,Schilling EE,Small RL (2007) Am J Bot
94:275–288.
17. Little D, Stevenson DW (2006) Cladistics 22:1–21.
2762 www.pnas.orgcgidoi10.1073pnas.0800476105 Kress and Erickson