
A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal

BACKGROUND: The increasing availability of reference libraries of DNA barcodes (RLDB) offers the opportunity to screen the level of consistency in DNA barcode data among libraries, in order to detect possible disagreements generated from taxonomic uncertainty or operational shortcomings. We prop...


Bibliographic Details
Main Authors: Costa, Filipe O., Landi, Monica, Martins, Rogelia, Costa, Maria H., Costa, Maria E., Carneiro, Miguel, Alves, Maria J., Steinke, Dirk, Carvalho, Gary R.
Format: Online Article Text
Language: English
Published: Public Library of Science 2012
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3338485/
https://www.ncbi.nlm.nih.gov/pubmed/22558244
http://dx.doi.org/10.1371/journal.pone.0035858
_version_ 1782231200379699200
author Costa, Filipe O.
Landi, Monica
Martins, Rogelia
Costa, Maria H.
Costa, Maria E.
Carneiro, Miguel
Alves, Maria J.
Steinke, Dirk
Carvalho, Gary R.
author_facet Costa, Filipe O.
Landi, Monica
Martins, Rogelia
Costa, Maria H.
Costa, Maria E.
Carneiro, Miguel
Alves, Maria J.
Steinke, Dirk
Carvalho, Gary R.
author_sort Costa, Filipe O.
collection PubMed
description BACKGROUND: The increasing availability of reference libraries of DNA barcodes (RLDB) offers the opportunity to screen the level of consistency in DNA barcode data among libraries, in order to detect possible disagreements generated from taxonomic uncertainty or operational shortcomings. We propose a ranking system to attribute a confidence level to species identifications associated with DNA barcode records from an RLDB. Here we apply the proposed ranking system to a newly generated RLDB for marine fish of Portugal. METHODOLOGY/PRINCIPAL FINDINGS: Specimens (n = 659) representing 102 marine fish species were collected along the continental shelf of Portugal, morphologically identified and archived in a museum collection. Samples were sequenced at the barcode region of the cytochrome oxidase subunit I gene (COI-5P). Resultant DNA barcodes had average intra-specific and inter-specific Kimura-2-parameter distances (0.32% and 8.84%, respectively) within the range usually observed for marine fishes. All specimens were ranked in five different levels (A–E), according to the reliability of the match between their species identification and the respective diagnostic DNA barcodes. Grades A to E were attributed upon submission of individual specimen sequences to BOLD-IDS and inspection of the clustering pattern in the NJ tree generated. Overall, our study resulted in 73.5% of unambiguous species IDs (grade A), 7.8% of taxonomically congruent barcode clusters within our dataset but awaiting external confirmation (grade B), and 18.7% of species identifications with lower levels of reliability (grades C/E). CONCLUSION/SIGNIFICANCE: We highlight the importance of implementing a system to rank barcode records in RLDB, in order to flag taxa in need of taxonomic revision, or reduce ambiguities of discordant data. With increasing DNA barcode records publicly available, this cross-validation system would provide a metric of relative accuracy of barcodes, while enabling the continuous revision and annotation required in taxonomic work.
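The description above reports average Kimura-2-parameter (K2P) distances within and between species. As a rough illustration of what that measure computes, the following is a minimal sketch of the standard K2P formula, d = -1/2 ln(1 - 2P - Q) - 1/4 ln(1 - 2Q), where P and Q are the proportions of transitional and transversional differences between two aligned sequences. The function name `k2p_distance` and the toy sequences are assumptions made for this example; the record does not say which software the authors used for their distance calculations, so this should not be read as their pipeline.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1: str, seq2: str) -> float:
    """Kimura 2-parameter distance between two aligned DNA sequences.

    Hypothetical helper for illustration only. Sites where either
    sequence has a gap or an ambiguous base are skipped.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")

    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguity codes
        compared += 1
        if a == b:
            continue
        # A<->G and C<->T are transitions; every other change is a transversion
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1

    if compared == 0:
        raise ValueError("no comparable sites")

    p = transitions / compared
    q = transversions / compared
    term1 = 1 - 2 * p - q
    term2 = 1 - 2 * q
    if term1 <= 0 or term2 <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(term1) - 0.25 * math.log(term2)


# Toy 10-base sequences (not real COI barcodes)
reference = "ACGTACGTAC"
one_transition = "GCGTACGTAC"    # A -> G at the first site
one_transversion = "CCGTACGTAC"  # A -> C at the first site

d_ts = k2p_distance(reference, one_transition)
d_tv = k2p_distance(reference, one_transversion)
```

Averaging such pairwise distances over specimen pairs within a species, and over pairs from different species, would give figures of the kind quoted in the description (0.32% and 8.84%).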
format Online
Article
Text
id pubmed-3338485
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-3338485 2012-05-03 A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal Costa, Filipe O. Landi, Monica Martins, Rogelia Costa, Maria H. Costa, Maria E. Carneiro, Miguel Alves, Maria J. Steinke, Dirk Carvalho, Gary R. PLoS One Research Article BACKGROUND: The increasing availability of reference libraries of DNA barcodes (RLDB) offers the opportunity to screen the level of consistency in DNA barcode data among libraries, in order to detect possible disagreements generated from taxonomic uncertainty or operational shortcomings. We propose a ranking system to attribute a confidence level to species identifications associated with DNA barcode records from an RLDB. Here we apply the proposed ranking system to a newly generated RLDB for marine fish of Portugal. METHODOLOGY/PRINCIPAL FINDINGS: Specimens (n = 659) representing 102 marine fish species were collected along the continental shelf of Portugal, morphologically identified and archived in a museum collection. Samples were sequenced at the barcode region of the cytochrome oxidase subunit I gene (COI-5P). Resultant DNA barcodes had average intra-specific and inter-specific Kimura-2-parameter distances (0.32% and 8.84%, respectively) within the range usually observed for marine fishes. All specimens were ranked in five different levels (A–E), according to the reliability of the match between their species identification and the respective diagnostic DNA barcodes. Grades A to E were attributed upon submission of individual specimen sequences to BOLD-IDS and inspection of the clustering pattern in the NJ tree generated. Overall, our study resulted in 73.5% of unambiguous species IDs (grade A), 7.8% of taxonomically congruent barcode clusters within our dataset but awaiting external confirmation (grade B), and 18.7% of species identifications with lower levels of reliability (grades C/E). CONCLUSION/SIGNIFICANCE: We highlight the importance of implementing a system to rank barcode records in RLDB, in order to flag taxa in need of taxonomic revision, or reduce ambiguities of discordant data. With increasing DNA barcode records publicly available, this cross-validation system would provide a metric of relative accuracy of barcodes, while enabling the continuous revision and annotation required in taxonomic work. Public Library of Science 2012-04-25 /pmc/articles/PMC3338485/ /pubmed/22558244 http://dx.doi.org/10.1371/journal.pone.0035858 Text en Costa et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Costa, Filipe O.
Landi, Monica
Martins, Rogelia
Costa, Maria H.
Costa, Maria E.
Carneiro, Miguel
Alves, Maria J.
Steinke, Dirk
Carvalho, Gary R.
A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal
title A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal
title_full A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal
title_fullStr A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal
title_full_unstemmed A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal
title_short A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal
title_sort ranking system for reference libraries of dna barcodes: application to marine fish species from portugal
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3338485/
https://www.ncbi.nlm.nih.gov/pubmed/22558244
http://dx.doi.org/10.1371/journal.pone.0035858
work_keys_str_mv AT costafilipeo arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT landimonica arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT martinsrogelia arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT costamariah arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT costamariae arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT carneiromiguel arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT alvesmariaj arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT steinkedirk arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT carvalhogaryr arankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT costafilipeo rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT landimonica rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT martinsrogelia rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT costamariah rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT costamariae rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT carneiromiguel rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT alvesmariaj rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT steinkedirk rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal
AT carvalhogaryr rankingsystemforreferencelibrariesofdnabarcodesapplicationtomarinefishspeciesfromportugal