Cargando…
Gene annotation errors are common in the mammalian mitochondrial genomes database
BACKGROUND: Although animal mitochondrial DNA sequences are known to evolve rapidly, their gene arrangements often remain unchanged over long periods of evolutionary time. Therefore, comparisons of mitochondrial genomes may result in significant insights into the evolution both of organisms and of g...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6341679/ https://www.ncbi.nlm.nih.gov/pubmed/30669991 http://dx.doi.org/10.1186/s12864-019-5447-1 |
_version_ | 1783388991608848384 |
---|---|
author | Prada, Carlos F. Boore, Jeffrey L. |
author_facet | Prada, Carlos F. Boore, Jeffrey L. |
author_sort | Prada, Carlos F. |
collection | PubMed |
description | BACKGROUND: Although animal mitochondrial DNA sequences are known to evolve rapidly, their gene arrangements often remain unchanged over long periods of evolutionary time. Therefore, comparisons of mitochondrial genomes may result in significant insights into the evolution both of organisms and of genomes. Mammalian mitochondrial genomes recently published in the GenBank database of NCBI show numerous rearrangements in various regions of the genome, from which it may be inferred that the mammalian mitochondrial genome is more dynamic than expected. However, it is alternatively possible that these are errors of annotation and, if so, are misleading our interpretations. In order to verify these possible errors of annotation, we performed a comparative genomic analysis of mammalian mitochondrial genomes available in the NCBI database. RESULTS: Using a combination of bioinformatics methods to carefully examine the mitochondrial gene arrangements in 304 mammalian species, we determined that there are only two sets of gene arrangements, one that is shared by all of the marsupials and another that is shared by all of the monotremes and eutherians, with these two arrangements differing only by the positions of tRNA genes in the region commonly designated as “WANCY” for the genes it comprises. All of the 68 other cases of reported gene rearrangements are errors. We note that there are also numerous errors of impossibly short, incorrect gene annotations, cases where genomes that are reported as complete are actually missing portions of the sequence, and genes that are clearly present but were not annotated in these records. CONCLUSIONS: We judge that the application of simple bioinformatic tools in the verification of gene annotation, particularly for organelle genomes, would be a very useful enhancement for the curation of genome sequences submitted to GenBank. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5447-1) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6341679 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63416792019-01-24 Gene annotation errors are common in the mammalian mitochondrial genomes database Prada, Carlos F. Boore, Jeffrey L. BMC Genomics Research Article BACKGROUND: Although animal mitochondrial DNA sequences are known to evolve rapidly, their gene arrangements often remain unchanged over long periods of evolutionary time. Therefore, comparisons of mitochondrial genomes may result in significant insights into the evolution both of organisms and of genomes. Mammalian mitochondrial genomes recently published in the GenBank database of NCBI show numerous rearrangements in various regions of the genome, from which it may be inferred that the mammalian mitochondrial genome is more dynamic than expected. However, it is alternatively possible that these are errors of annotation and, if so, are misleading our interpretations. In order to verify these possible errors of annotation, we performed a comparative genomic analysis of mammalian mitochondrial genomes available in the NCBI database. RESULTS: Using a combination of bioinformatics methods to carefully examine the mitochondrial gene arrangements in 304 mammalian species, we determined that there are only two sets of gene arrangements, one that is shared by all of the marsupials and another that is shared by all of the monotremes and eutherians, with these two arrangements differing only by the positions of tRNA genes in the region commonly designated as “WANCY” for the genes it comprises. All of the 68 other cases of reported gene rearrangements are errors. We note that there are also numerous errors of impossibly short, incorrect gene annotations, cases where genomes that are reported as complete are actually missing portions of the sequence, and genes that are clearly present but were not annotated in these records. CONCLUSIONS: We judge that the application of simple bioinformatic tools in the verification of gene annotation, particularly for organelle genomes, would be a very useful enhancement for the curation of genome sequences submitted to GenBank. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5447-1) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-22 /pmc/articles/PMC6341679/ /pubmed/30669991 http://dx.doi.org/10.1186/s12864-019-5447-1 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Prada, Carlos F. Boore, Jeffrey L. Gene annotation errors are common in the mammalian mitochondrial genomes database |
title | Gene annotation errors are common in the mammalian mitochondrial genomes database |
title_full | Gene annotation errors are common in the mammalian mitochondrial genomes database |
title_fullStr | Gene annotation errors are common in the mammalian mitochondrial genomes database |
title_full_unstemmed | Gene annotation errors are common in the mammalian mitochondrial genomes database |
title_short | Gene annotation errors are common in the mammalian mitochondrial genomes database |
title_sort | gene annotation errors are common in the mammalian mitochondrial genomes database |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6341679/ https://www.ncbi.nlm.nih.gov/pubmed/30669991 http://dx.doi.org/10.1186/s12864-019-5447-1 |
work_keys_str_mv | AT pradacarlosf geneannotationerrorsarecommoninthemammalianmitochondrialgenomesdatabase AT boorejeffreyl geneannotationerrorsarecommoninthemammalianmitochondrialgenomesdatabase |