Cargando…
A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal
BACKGROUND: The increasing availability of reference libraries of DNA barcodes (RLDB) offers the opportunity to the screen the level of consistency in DNA barcode data among libraries, in order to detect possible disagreements generated from taxonomic uncertainty or operational shortcomings. We prop...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3338485/ https://www.ncbi.nlm.nih.gov/pubmed/22558244 http://dx.doi.org/10.1371/journal.pone.0035858 |
_version_ | 1782231200379699200 |
---|---|
author | Costa, Filipe O. Landi, Monica Martins, Rogelia Costa, Maria H. Costa, Maria E. Carneiro, Miguel Alves, Maria J. Steinke, Dirk Carvalho, Gary R. |
author_facet | Costa, Filipe O. Landi, Monica Martins, Rogelia Costa, Maria H. Costa, Maria E. Carneiro, Miguel Alves, Maria J. Steinke, Dirk Carvalho, Gary R. |
author_sort | Costa, Filipe O. |
collection | PubMed |
description | BACKGROUND: The increasing availability of reference libraries of DNA barcodes (RLDB) offers the opportunity to the screen the level of consistency in DNA barcode data among libraries, in order to detect possible disagreements generated from taxonomic uncertainty or operational shortcomings. We propose a ranking system to attribute a confidence level to species identifications associated with DNA barcode records from a RLDB. Here we apply the proposed ranking system to a newly generated RLDB for marine fish of Portugal. METHODOLOGY/PRINCIPAL FINDINGS: Specimens (n = 659) representing 102 marine fish species were collected along the continental shelf of Portugal, morphologically identified and archived in a museum collection. Samples were sequenced at the barcode region of the cytochrome oxidase subunit I gene (COI-5P). Resultant DNA barcodes had average intra-specific and inter-specific Kimura-2-parameter distances (0.32% and 8.84%, respectively) within the range usually observed for marine fishes. All specimens were ranked in five different levels (A–E), according to the reliability of the match between their species identification and the respective diagnostic DNA barcodes. Grades A to E were attributed upon submission of individual specimen sequences to BOLD-IDS and inspection of the clustering pattern in the NJ tree generated. Overall, our study resulted in 73.5% of unambiguous species IDs (grade A), 7.8% taxonomically congruent barcode clusters within our dataset, but awaiting external confirmation (grade B), and 18.7% of species identifications with lower levels of reliability (grades C/E). CONCLUSION/SIGNIFICANCE: We highlight the importance of implementing a system to rank barcode records in RLDB, in order to flag taxa in need of taxonomic revision, or reduce ambiguities of discordant data. With increasing DNA barcode records publicly available, this cross-validation system would provide a metric of relative accuracy of barcodes, while enabling the continuous revision and annotation required in taxonomic work. |
format | Online Article Text |
id | pubmed-3338485 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-33384852012-05-03 A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal Costa, Filipe O. Landi, Monica Martins, Rogelia Costa, Maria H. Costa, Maria E. Carneiro, Miguel Alves, Maria J. Steinke, Dirk Carvalho, Gary R. PLoS One Research Article BACKGROUND: The increasing availability of reference libraries of DNA barcodes (RLDB) offers the opportunity to the screen the level of consistency in DNA barcode data among libraries, in order to detect possible disagreements generated from taxonomic uncertainty or operational shortcomings. We propose a ranking system to attribute a confidence level to species identifications associated with DNA barcode records from a RLDB. Here we apply the proposed ranking system to a newly generated RLDB for marine fish of Portugal. METHODOLOGY/PRINCIPAL FINDINGS: Specimens (n = 659) representing 102 marine fish species were collected along the continental shelf of Portugal, morphologically identified and archived in a museum collection. Samples were sequenced at the barcode region of the cytochrome oxidase subunit I gene (COI-5P). Resultant DNA barcodes had average intra-specific and inter-specific Kimura-2-parameter distances (0.32% and 8.84%, respectively) within the range usually observed for marine fishes. All specimens were ranked in five different levels (A–E), according to the reliability of the match between their species identification and the respective diagnostic DNA barcodes. Grades A to E were attributed upon submission of individual specimen sequences to BOLD-IDS and inspection of the clustering pattern in the NJ tree generated. Overall, our study resulted in 73.5% of unambiguous species IDs (grade A), 7.8% taxonomically congruent barcode clusters within our dataset, but awaiting external confirmation (grade B), and 18.7% of species identifications with lower levels of reliability (grades C/E). CONCLUSION/SIGNIFICANCE: We highlight the importance of implementing a system to rank barcode records in RLDB, in order to flag taxa in need of taxonomic revision, or reduce ambiguities of discordant data. With increasing DNA barcode records publicly available, this cross-validation system would provide a metric of relative accuracy of barcodes, while enabling the continuous revision and annotation required in taxonomic work. Public Library of Science 2012-04-25 /pmc/articles/PMC3338485/ /pubmed/22558244 http://dx.doi.org/10.1371/journal.pone.0035858 Text en Costa et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Costa, Filipe O. Landi, Monica Martins, Rogelia Costa, Maria H. Costa, Maria E. Carneiro, Miguel Alves, Maria J. Steinke, Dirk Carvalho, Gary R. A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal |
title | A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal |
title_full | A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal |
title_fullStr | A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal |
title_full_unstemmed | A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal |
title_short | A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal |
title_sort | ranking system for reference libraries of dna barcodes: application to marine fish species from portugal |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3338485/ https://www.ncbi.nlm.nih.gov/pubmed/22558244 http://dx.doi.org/10.1371/journal.pone.0035858 |
work_keys_str_mv | AT costafilipeo arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT landimonica arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT martinsrogelia arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT costamariah arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT costamariae arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT carneiromiguel arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT alvesmariaj arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT steinkedirk arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT carvalhogaryr arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT costafilipeo rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT landimonica rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT martinsrogelia rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT costamariah rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT costamariae rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT carneiromiguel rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT alvesmariaj rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT steinkedirk rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal AT carvalhogaryr rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal |