Cargando…

Gene annotation errors are common in the mammalian mitochondrial genomes database

BACKGROUND: Although animal mitochondrial DNA sequences are known to evolve rapidly, their gene arrangements often remain unchanged over long periods of evolutionary time. Therefore, comparisons of mitochondrial genomes may result in significant insights into the evolution both of organisms and of g...

Descripción completa

Detalles Bibliográficos
Autores principales: Prada, Carlos F., Boore, Jeffrey L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6341679/
https://www.ncbi.nlm.nih.gov/pubmed/30669991
http://dx.doi.org/10.1186/s12864-019-5447-1
_version_ 1783388991608848384
author Prada, Carlos F.
Boore, Jeffrey L.
author_facet Prada, Carlos F.
Boore, Jeffrey L.
author_sort Prada, Carlos F.
collection PubMed
description BACKGROUND: Although animal mitochondrial DNA sequences are known to evolve rapidly, their gene arrangements often remain unchanged over long periods of evolutionary time. Therefore, comparisons of mitochondrial genomes may result in significant insights into the evolution both of organisms and of genomes. Mammalian mitochondrial genomes recently published in the GenBank database of NCBI show numerous rearrangements in various regions of the genome, from which it may be inferred that the mammalian mitochondrial genome is more dynamic than expected. However, it is alternatively possible that these are errors of annotation and, if so, are misleading our interpretations. In order to verify these possible errors of annotation, we performed a comparative genomic analysis of mammalian mitochondrial genomes available in the NCBI database. RESULTS: Using a combination of bioinformatics methods to carefully examine the mitochondrial gene arrangements in 304 mammalian species, we determined that there are only two sets of gene arrangements, one that is shared by all of the marsupials and another that is shared by all of the monotremes and eutherians, with these two arrangements differing only by the positions of tRNA genes in the region commonly designated as “WANCY” for the genes it comprises. All of the 68 other cases of reported gene rearrangements are errors. We note that there are also numerous errors of impossibly short, incorrect gene annotations, cases where genomes that are reported as complete are actually missing portions of the sequence, and genes that are clearly present but were not annotated in these records. CONCLUSIONS: We judge that the application of simple bioinformatic tools in the verification of gene annotation, particularly for organelle genomes, would be a very useful enhancement for the curation of genome sequences submitted to GenBank. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5447-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6341679
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63416792019-01-24 Gene annotation errors are common in the mammalian mitochondrial genomes database Prada, Carlos F. Boore, Jeffrey L. BMC Genomics Research Article BACKGROUND: Although animal mitochondrial DNA sequences are known to evolve rapidly, their gene arrangements often remain unchanged over long periods of evolutionary time. Therefore, comparisons of mitochondrial genomes may result in significant insights into the evolution both of organisms and of genomes. Mammalian mitochondrial genomes recently published in the GenBank database of NCBI show numerous rearrangements in various regions of the genome, from which it may be inferred that the mammalian mitochondrial genome is more dynamic than expected. However, it is alternatively possible that these are errors of annotation and, if so, are misleading our interpretations. In order to verify these possible errors of annotation, we performed a comparative genomic analysis of mammalian mitochondrial genomes available in the NCBI database. RESULTS: Using a combination of bioinformatics methods to carefully examine the mitochondrial gene arrangements in 304 mammalian species, we determined that there are only two sets of gene arrangements, one that is shared by all of the marsupials and another that is shared by all of the monotremes and eutherians, with these two arrangements differing only by the positions of tRNA genes in the region commonly designated as “WANCY” for the genes it comprises. All of the 68 other cases of reported gene rearrangements are errors. We note that there are also numerous errors of impossibly short, incorrect gene annotations, cases where genomes that are reported as complete are actually missing portions of the sequence, and genes that are clearly present but were not annotated in these records. CONCLUSIONS: We judge that the application of simple bioinformatic tools in the verification of gene annotation, particularly for organelle genomes, would be a very useful enhancement for the curation of genome sequences submitted to GenBank. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5447-1) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-22 /pmc/articles/PMC6341679/ /pubmed/30669991 http://dx.doi.org/10.1186/s12864-019-5447-1 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Prada, Carlos F.
Boore, Jeffrey L.
Gene annotation errors are common in the mammalian mitochondrial genomes database
title Gene annotation errors are common in the mammalian mitochondrial genomes database
title_full Gene annotation errors are common in the mammalian mitochondrial genomes database
title_fullStr Gene annotation errors are common in the mammalian mitochondrial genomes database
title_full_unstemmed Gene annotation errors are common in the mammalian mitochondrial genomes database
title_short Gene annotation errors are common in the mammalian mitochondrial genomes database
title_sort gene annotation errors are common in the mammalian mitochondrial genomes database
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6341679/
https://www.ncbi.nlm.nih.gov/pubmed/30669991
http://dx.doi.org/10.1186/s12864-019-5447-1
work_keys_str_mv AT pradacarlosf geneannotationerrorsarecommoninthemammalianmitochondrialgenomesdatabase
AT boorejeffreyl geneannotationerrorsarecommoninthemammalianmitochondrialgenomesdatabase