Cargando…
Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes
With the rapid increase of sequenced metazoan mitochondrial genomes, a detailed manual annotation is becoming more and more infeasible. While it is easy to identify the approximate location of protein-coding genes within mitogenomes, the peculiar processing of mitochondrial transcripts, however, mak...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6847864/ https://www.ncbi.nlm.nih.gov/pubmed/31584075 http://dx.doi.org/10.1093/nar/gkz833 |
_version_ | 1783468993684701184 |
---|---|
author | Donath, Alexander Jühling, Frank Al-Arab, Marwa Bernhart, Stephan H Reinhardt, Franziska Stadler, Peter F Middendorf, Martin Bernt, Matthias |
author_facet | Donath, Alexander Jühling, Frank Al-Arab, Marwa Bernhart, Stephan H Reinhardt, Franziska Stadler, Peter F Middendorf, Martin Bernt, Matthias |
author_sort | Donath, Alexander |
collection | PubMed |
description | With the rapid increase of sequenced metazoan mitochondrial genomes, a detailed manual annotation is becoming more and more infeasible. While it is easy to identify the approximate location of protein-coding genes within mitogenomes, the peculiar processing of mitochondrial transcripts, however, makes the determination of precise gene boundaries a surprisingly difficult problem. We have analyzed the properties of annotated start and stop codon positions in detail, and use the inferred patterns to devise a new method for predicting gene boundaries in de novo annotations. Our method benefits from empirically observed prevalances of start/stop codons and gene lengths, and considers the dependence of these features on variations of genetic codes. Albeit not being perfect, our new approach yields a drastic improvement in the accuracy of gene boundaries and upgrades the mitochondrial genome annotation server MITOS to an even more sophisticated tool for fully automatic annotation of metazoan mitochondrial genomes. |
format | Online Article Text |
id | pubmed-6847864 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-68478642019-11-18 Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes Donath, Alexander Jühling, Frank Al-Arab, Marwa Bernhart, Stephan H Reinhardt, Franziska Stadler, Peter F Middendorf, Martin Bernt, Matthias Nucleic Acids Res Computational Biology With the rapid increase of sequenced metazoan mitochondrial genomes, a detailed manual annotation is becoming more and more infeasible. While it is easy to identify the approximate location of protein-coding genes within mitogenomes, the peculiar processing of mitochondrial transcripts, however, makes the determination of precise gene boundaries a surprisingly difficult problem. We have analyzed the properties of annotated start and stop codon positions in detail, and use the inferred patterns to devise a new method for predicting gene boundaries in de novo annotations. Our method benefits from empirically observed prevalances of start/stop codons and gene lengths, and considers the dependence of these features on variations of genetic codes. Albeit not being perfect, our new approach yields a drastic improvement in the accuracy of gene boundaries and upgrades the mitochondrial genome annotation server MITOS to an even more sophisticated tool for fully automatic annotation of metazoan mitochondrial genomes. Oxford University Press 2019-11-18 2019-10-04 /pmc/articles/PMC6847864/ /pubmed/31584075 http://dx.doi.org/10.1093/nar/gkz833 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Donath, Alexander Jühling, Frank Al-Arab, Marwa Bernhart, Stephan H Reinhardt, Franziska Stadler, Peter F Middendorf, Martin Bernt, Matthias Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes |
title | Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes |
title_full | Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes |
title_fullStr | Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes |
title_full_unstemmed | Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes |
title_short | Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes |
title_sort | improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6847864/ https://www.ncbi.nlm.nih.gov/pubmed/31584075 http://dx.doi.org/10.1093/nar/gkz833 |
work_keys_str_mv | AT donathalexander improvedannotationofproteincodinggenesboundariesinmetazoanmitochondrialgenomes AT juhlingfrank improvedannotationofproteincodinggenesboundariesinmetazoanmitochondrialgenomes AT alarabmarwa improvedannotationofproteincodinggenesboundariesinmetazoanmitochondrialgenomes AT bernhartstephanh improvedannotationofproteincodinggenesboundariesinmetazoanmitochondrialgenomes AT reinhardtfranziska improvedannotationofproteincodinggenesboundariesinmetazoanmitochondrialgenomes AT stadlerpeterf improvedannotationofproteincodinggenesboundariesinmetazoanmitochondrialgenomes AT middendorfmartin improvedannotationofproteincodinggenesboundariesinmetazoanmitochondrialgenomes AT berntmatthias improvedannotationofproteincodinggenesboundariesinmetazoanmitochondrialgenomes |