Cargando…

Detection of Potential Problematic Cytb Gene Sequences of Fishes in GenBank

Fishes are, by far, the most diverse group of vertebrates. Their classification relies heavily on morphology. In practice, the correct morphological identification of species often depends on personal experience because many species vary in their body shape, color and other external characters. Thus...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Xiaobing, Shen, Xuejuan, Chen, Xiao, Xiang, Dan, Murphy, Robert W., Shen, Yongyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5808227/
https://www.ncbi.nlm.nih.gov/pubmed/29467794
http://dx.doi.org/10.3389/fgene.2018.00030
_version_ 1783299427353493504
author Li, Xiaobing
Shen, Xuejuan
Chen, Xiao
Xiang, Dan
Murphy, Robert W.
Shen, Yongyi
author_facet Li, Xiaobing
Shen, Xuejuan
Chen, Xiao
Xiang, Dan
Murphy, Robert W.
Shen, Yongyi
author_sort Li, Xiaobing
collection PubMed
description Fishes are, by far, the most diverse group of vertebrates. Their classification relies heavily on morphology. In practice, the correct morphological identification of species often depends on personal experience because many species vary in their body shape, color and other external characters. Thus, the identification of a species may be prone to errors. Due to the rapid development of molecular biology, the number of sequences of fishes deposited in GenBank has grown explosively. These published data likely contain errors owing to invalid or incorrectly identified species. The erroneous data can lead to downstream problems. Thus, it is critical that such errors get identified and corrected. A strategy based on DNA barcoding can detect potentially erroneous data, especially when intraspecific K2P variation exceeds interspecific K2P divergence. Analyses of the most used DNA marker for fishes (mitochondrial Cytb) discovers that intraspecific differences of fishes are generally less than 1%, while interspecific differences are generally higher than 10%. Based on this ruler, our analyses identify 1,303 potential problematic Cytb sequences of fishes in GenBank and point to taxonomic problems, errors in identification, genetic introgression and other concerns. Care must be taken to avoid the perpetuation of errors when using these available data.
format Online
Article
Text
id pubmed-5808227
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-58082272018-02-21 Detection of Potential Problematic Cytb Gene Sequences of Fishes in GenBank Li, Xiaobing Shen, Xuejuan Chen, Xiao Xiang, Dan Murphy, Robert W. Shen, Yongyi Front Genet Genetics Fishes are, by far, the most diverse group of vertebrates. Their classification relies heavily on morphology. In practice, the correct morphological identification of species often depends on personal experience because many species vary in their body shape, color and other external characters. Thus, the identification of a species may be prone to errors. Due to the rapid development of molecular biology, the number of sequences of fishes deposited in GenBank has grown explosively. These published data likely contain errors owing to invalid or incorrectly identified species. The erroneous data can lead to downstream problems. Thus, it is critical that such errors get identified and corrected. A strategy based on DNA barcoding can detect potentially erroneous data, especially when intraspecific K2P variation exceeds interspecific K2P divergence. Analyses of the most used DNA marker for fishes (mitochondrial Cytb) discovers that intraspecific differences of fishes are generally less than 1%, while interspecific differences are generally higher than 10%. Based on this ruler, our analyses identify 1,303 potential problematic Cytb sequences of fishes in GenBank and point to taxonomic problems, errors in identification, genetic introgression and other concerns. Care must be taken to avoid the perpetuation of errors when using these available data. Frontiers Media S.A. 2018-02-06 /pmc/articles/PMC5808227/ /pubmed/29467794 http://dx.doi.org/10.3389/fgene.2018.00030 Text en Copyright © 2018 Li, Shen, Chen, Xiang, Murphy and Shen. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Li, Xiaobing
Shen, Xuejuan
Chen, Xiao
Xiang, Dan
Murphy, Robert W.
Shen, Yongyi
Detection of Potential Problematic Cytb Gene Sequences of Fishes in GenBank
title Detection of Potential Problematic Cytb Gene Sequences of Fishes in GenBank
title_full Detection of Potential Problematic Cytb Gene Sequences of Fishes in GenBank
title_fullStr Detection of Potential Problematic Cytb Gene Sequences of Fishes in GenBank
title_full_unstemmed Detection of Potential Problematic Cytb Gene Sequences of Fishes in GenBank
title_short Detection of Potential Problematic Cytb Gene Sequences of Fishes in GenBank
title_sort detection of potential problematic cytb gene sequences of fishes in genbank
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5808227/
https://www.ncbi.nlm.nih.gov/pubmed/29467794
http://dx.doi.org/10.3389/fgene.2018.00030
work_keys_str_mv AT lixiaobing detectionofpotentialproblematiccytbgenesequencesoffishesingenbank
AT shenxuejuan detectionofpotentialproblematiccytbgenesequencesoffishesingenbank
AT chenxiao detectionofpotentialproblematiccytbgenesequencesoffishesingenbank
AT xiangdan detectionofpotentialproblematiccytbgenesequencesoffishesingenbank
AT murphyrobertw detectionofpotentialproblematiccytbgenesequencesoffishesingenbank
AT shenyongyi detectionofpotentialproblematiccytbgenesequencesoffishesingenbank