Cargando…

DNA Barcoding of Recently Diverged Species: Relative Performance of Matching Methods

Recently diverged species are challenging for identification, yet they are frequently of special interest scientifically as well as from a regulatory perspective. DNA barcoding has proven instrumental in species identification, especially in insects and vertebrates, but for the identification of rec...

Descripción completa

Detalles Bibliográficos
Autores principales: van Velzen, Robin, Weitschek, Emanuel, Felici, Giovanni, Bakker, Freek T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3260286/
https://www.ncbi.nlm.nih.gov/pubmed/22272356
http://dx.doi.org/10.1371/journal.pone.0030490
_version_ 1782221477465030656
author van Velzen, Robin
Weitschek, Emanuel
Felici, Giovanni
Bakker, Freek T.
author_facet van Velzen, Robin
Weitschek, Emanuel
Felici, Giovanni
Bakker, Freek T.
author_sort van Velzen, Robin
collection PubMed
description Recently diverged species are challenging for identification, yet they are frequently of special interest scientifically as well as from a regulatory perspective. DNA barcoding has proven instrumental in species identification, especially in insects and vertebrates, but for the identification of recently diverged species it has been reported to be problematic in some cases. Problems are mostly due to incomplete lineage sorting or simply lack of a ‘barcode gap’ and probably related to large effective population size and/or low mutation rate. Our objective was to compare six methods in their ability to correctly identify recently diverged species with DNA barcodes: neighbor joining and parsimony (both tree-based), nearest neighbor and BLAST (similarity-based), and the diagnostic methods DNA-BAR, and BLOG. We analyzed simulated data assuming three different effective population sizes as well as three selected empirical data sets from published studies. Results show, as expected, that success rates are significantly lower for recently diverged species (∼75%) than for older species (∼97%) (P<0.00001). Similarity-based and diagnostic methods significantly outperform tree-based methods, when applied to simulated DNA barcode data (P<0.00001). The diagnostic method BLOG had highest correct query identification rate based on simulated (86.2%) as well as empirical data (93.1%), indicating that it is a consistently better method overall. Another advantage of BLOG is that it offers species-level information that can be used outside the realm of DNA barcoding, for instance in species description or molecular detection assays. Even though we can confirm that identification success based on DNA barcoding is generally high in our data, recently diverged species remain difficult to identify. Nevertheless, our results contribute to improved solutions for their accurate identification.
format Online
Article
Text
id pubmed-3260286
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32602862012-01-23 DNA Barcoding of Recently Diverged Species: Relative Performance of Matching Methods van Velzen, Robin Weitschek, Emanuel Felici, Giovanni Bakker, Freek T. PLoS One Research Article Recently diverged species are challenging for identification, yet they are frequently of special interest scientifically as well as from a regulatory perspective. DNA barcoding has proven instrumental in species identification, especially in insects and vertebrates, but for the identification of recently diverged species it has been reported to be problematic in some cases. Problems are mostly due to incomplete lineage sorting or simply lack of a ‘barcode gap’ and probably related to large effective population size and/or low mutation rate. Our objective was to compare six methods in their ability to correctly identify recently diverged species with DNA barcodes: neighbor joining and parsimony (both tree-based), nearest neighbor and BLAST (similarity-based), and the diagnostic methods DNA-BAR, and BLOG. We analyzed simulated data assuming three different effective population sizes as well as three selected empirical data sets from published studies. Results show, as expected, that success rates are significantly lower for recently diverged species (∼75%) than for older species (∼97%) (P<0.00001). Similarity-based and diagnostic methods significantly outperform tree-based methods, when applied to simulated DNA barcode data (P<0.00001). The diagnostic method BLOG had highest correct query identification rate based on simulated (86.2%) as well as empirical data (93.1%), indicating that it is a consistently better method overall. Another advantage of BLOG is that it offers species-level information that can be used outside the realm of DNA barcoding, for instance in species description or molecular detection assays. Even though we can confirm that identification success based on DNA barcoding is generally high in our data, recently diverged species remain difficult to identify. Nevertheless, our results contribute to improved solutions for their accurate identification. Public Library of Science 2012-01-17 /pmc/articles/PMC3260286/ /pubmed/22272356 http://dx.doi.org/10.1371/journal.pone.0030490 Text en van Velzen et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
van Velzen, Robin
Weitschek, Emanuel
Felici, Giovanni
Bakker, Freek T.
DNA Barcoding of Recently Diverged Species: Relative Performance of Matching Methods
title DNA Barcoding of Recently Diverged Species: Relative Performance of Matching Methods
title_full DNA Barcoding of Recently Diverged Species: Relative Performance of Matching Methods
title_fullStr DNA Barcoding of Recently Diverged Species: Relative Performance of Matching Methods
title_full_unstemmed DNA Barcoding of Recently Diverged Species: Relative Performance of Matching Methods
title_short DNA Barcoding of Recently Diverged Species: Relative Performance of Matching Methods
title_sort dna barcoding of recently diverged species: relative performance of matching methods
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3260286/
https://www.ncbi.nlm.nih.gov/pubmed/22272356
http://dx.doi.org/10.1371/journal.pone.0030490
work_keys_str_mv AT vanvelzenrobin dnabarcodingofrecentlydivergedspeciesrelativeperformanceofmatchingmethods
AT weitschekemanuel dnabarcodingofrecentlydivergedspeciesrelativeperformanceofmatchingmethods
AT felicigiovanni dnabarcodingofrecentlydivergedspeciesrelativeperformanceofmatchingmethods
AT bakkerfreekt dnabarcodingofrecentlydivergedspeciesrelativeperformanceofmatchingmethods