Cargando…

When species matches are unavailable are DNA barcodes correctly assigned to higher taxa? An assessment using sphingid moths

BACKGROUND: When a specimen belongs to a species not yet represented in DNA barcode reference libraries there is disagreement over the effectiveness of using sequence comparisons to assign the query accurately to a higher taxon. Library completeness and the assignment criteria used have been propose...

Descripción completa

Detalles Bibliográficos
Autores principales: Wilson, John James, Rougerie, Rodolphe, Schonfeld, Justin, Janzen, Daniel H, Hallwachs, Winnie, Hajibabaei, Mehrdad, Kitching, Ian J, Haxaire, Jean, Hebert, Paul DN
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3161832/
https://www.ncbi.nlm.nih.gov/pubmed/21806794
http://dx.doi.org/10.1186/1472-6785-11-18
_version_ 1782210737187323904
author Wilson, John James
Rougerie, Rodolphe
Schonfeld, Justin
Janzen, Daniel H
Hallwachs, Winnie
Hajibabaei, Mehrdad
Kitching, Ian J
Haxaire, Jean
Hebert, Paul DN
author_facet Wilson, John James
Rougerie, Rodolphe
Schonfeld, Justin
Janzen, Daniel H
Hallwachs, Winnie
Hajibabaei, Mehrdad
Kitching, Ian J
Haxaire, Jean
Hebert, Paul DN
author_sort Wilson, John James
collection PubMed
description BACKGROUND: When a specimen belongs to a species not yet represented in DNA barcode reference libraries there is disagreement over the effectiveness of using sequence comparisons to assign the query accurately to a higher taxon. Library completeness and the assignment criteria used have been proposed as critical factors affecting the accuracy of such assignments but have not been thoroughly investigated. We explored the accuracy of assignments to genus, tribe and subfamily in the Sphingidae, using the almost complete global DNA barcode reference library (1095 species) available for this family. Costa Rican sphingids (118 species), a well-documented, diverse subset of the family, with each of the tribes and subfamilies represented were used as queries. We simulated libraries with different levels of completeness (10-100% of the available species), and recorded assignments (positive or ambiguous) and their accuracy (true or false) under six criteria. RESULTS: A liberal tree-based criterion assigned 83% of queries accurately to genus, 74% to tribe and 90% to subfamily, compared to a strict tree-based criterion, which assigned 75% of queries accurately to genus, 66% to tribe and 84% to subfamily, with a library containing 100% of available species (but excluding the species of the query). The greater number of true positives delivered by more relaxed criteria was negatively balanced by the occurrence of more false positives. This effect was most sharply observed with libraries of the lowest completeness where, for example at the genus level, 32% of assignments were false positives with the liberal criterion versus < 1% when using the strict. We observed little difference (< 8% using the liberal criterion) however, in the overall accuracy of the assignments between the lowest and highest levels of library completeness at the tribe and subfamily level. CONCLUSIONS: Our results suggest that when using a strict tree-based criterion for higher taxon assignment with DNA barcodes, the likelihood of assigning a query a genus name incorrectly is very low, if a genus name is provided it has a high likelihood of being accurate, and if no genus match is available the query can nevertheless be assigned to a subfamily with high accuracy regardless of library completeness. DNA barcoding often correctly assigned sphingid moths to higher taxa when species matches were unavailable, suggesting that barcode reference libraries can be useful for higher taxon assignments long before they achieve complete species coverage.
format Online
Article
Text
id pubmed-3161832
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31618322011-08-26 When species matches are unavailable are DNA barcodes correctly assigned to higher taxa? An assessment using sphingid moths Wilson, John James Rougerie, Rodolphe Schonfeld, Justin Janzen, Daniel H Hallwachs, Winnie Hajibabaei, Mehrdad Kitching, Ian J Haxaire, Jean Hebert, Paul DN BMC Ecol Research Article BACKGROUND: When a specimen belongs to a species not yet represented in DNA barcode reference libraries there is disagreement over the effectiveness of using sequence comparisons to assign the query accurately to a higher taxon. Library completeness and the assignment criteria used have been proposed as critical factors affecting the accuracy of such assignments but have not been thoroughly investigated. We explored the accuracy of assignments to genus, tribe and subfamily in the Sphingidae, using the almost complete global DNA barcode reference library (1095 species) available for this family. Costa Rican sphingids (118 species), a well-documented, diverse subset of the family, with each of the tribes and subfamilies represented were used as queries. We simulated libraries with different levels of completeness (10-100% of the available species), and recorded assignments (positive or ambiguous) and their accuracy (true or false) under six criteria. RESULTS: A liberal tree-based criterion assigned 83% of queries accurately to genus, 74% to tribe and 90% to subfamily, compared to a strict tree-based criterion, which assigned 75% of queries accurately to genus, 66% to tribe and 84% to subfamily, with a library containing 100% of available species (but excluding the species of the query). The greater number of true positives delivered by more relaxed criteria was negatively balanced by the occurrence of more false positives. This effect was most sharply observed with libraries of the lowest completeness where, for example at the genus level, 32% of assignments were false positives with the liberal criterion versus < 1% when using the strict. We observed little difference (< 8% using the liberal criterion) however, in the overall accuracy of the assignments between the lowest and highest levels of library completeness at the tribe and subfamily level. CONCLUSIONS: Our results suggest that when using a strict tree-based criterion for higher taxon assignment with DNA barcodes, the likelihood of assigning a query a genus name incorrectly is very low, if a genus name is provided it has a high likelihood of being accurate, and if no genus match is available the query can nevertheless be assigned to a subfamily with high accuracy regardless of library completeness. DNA barcoding often correctly assigned sphingid moths to higher taxa when species matches were unavailable, suggesting that barcode reference libraries can be useful for higher taxon assignments long before they achieve complete species coverage. BioMed Central 2011-08-01 /pmc/articles/PMC3161832/ /pubmed/21806794 http://dx.doi.org/10.1186/1472-6785-11-18 Text en Copyright ©2011 Wilson et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Wilson, John James
Rougerie, Rodolphe
Schonfeld, Justin
Janzen, Daniel H
Hallwachs, Winnie
Hajibabaei, Mehrdad
Kitching, Ian J
Haxaire, Jean
Hebert, Paul DN
When species matches are unavailable are DNA barcodes correctly assigned to higher taxa? An assessment using sphingid moths
title When species matches are unavailable are DNA barcodes correctly assigned to higher taxa? An assessment using sphingid moths
title_full When species matches are unavailable are DNA barcodes correctly assigned to higher taxa? An assessment using sphingid moths
title_fullStr When species matches are unavailable are DNA barcodes correctly assigned to higher taxa? An assessment using sphingid moths
title_full_unstemmed When species matches are unavailable are DNA barcodes correctly assigned to higher taxa? An assessment using sphingid moths
title_short When species matches are unavailable are DNA barcodes correctly assigned to higher taxa? An assessment using sphingid moths
title_sort when species matches are unavailable are dna barcodes correctly assigned to higher taxa? an assessment using sphingid moths
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3161832/
https://www.ncbi.nlm.nih.gov/pubmed/21806794
http://dx.doi.org/10.1186/1472-6785-11-18
work_keys_str_mv AT wilsonjohnjames whenspeciesmatchesareunavailablearednabarcodescorrectlyassignedtohighertaxaanassessmentusingsphingidmoths
AT rougerierodolphe whenspeciesmatchesareunavailablearednabarcodescorrectlyassignedtohighertaxaanassessmentusingsphingidmoths
AT schonfeldjustin whenspeciesmatchesareunavailablearednabarcodescorrectlyassignedtohighertaxaanassessmentusingsphingidmoths
AT janzendanielh whenspeciesmatchesareunavailablearednabarcodescorrectlyassignedtohighertaxaanassessmentusingsphingidmoths
AT hallwachswinnie whenspeciesmatchesareunavailablearednabarcodescorrectlyassignedtohighertaxaanassessmentusingsphingidmoths
AT hajibabaeimehrdad whenspeciesmatchesareunavailablearednabarcodescorrectlyassignedtohighertaxaanassessmentusingsphingidmoths
AT kitchingianj whenspeciesmatchesareunavailablearednabarcodescorrectlyassignedtohighertaxaanassessmentusingsphingidmoths
AT haxairejean whenspeciesmatchesareunavailablearednabarcodescorrectlyassignedtohighertaxaanassessmentusingsphingidmoths
AT hebertpauldn whenspeciesmatchesareunavailablearednabarcodescorrectlyassignedtohighertaxaanassessmentusingsphingidmoths