Cargando…

OMGene: mutual improvement of gene models through optimisation of evolutionary conservation

BACKGROUND: The accurate determination of the genomic coordinates for a given gene – its gene model – is of vital importance to the utility of its annotation, and the accuracy of bioinformatic analyses derived from it. Currently-available methods of computational gene prediction, while on the whole...

Descripción completa

Detalles Bibliográficos
Autores principales: Dunne, Michael P., Kelly, Steven
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5923031/
https://www.ncbi.nlm.nih.gov/pubmed/29703150
http://dx.doi.org/10.1186/s12864-018-4704-z
_version_ 1783318273861877760
author Dunne, Michael P.
Kelly, Steven
author_facet Dunne, Michael P.
Kelly, Steven
author_sort Dunne, Michael P.
collection PubMed
description BACKGROUND: The accurate determination of the genomic coordinates for a given gene – its gene model – is of vital importance to the utility of its annotation, and the accuracy of bioinformatic analyses derived from it. Currently-available methods of computational gene prediction, while on the whole successful, frequently disagree on the model for a given predicted gene, with some or all of the variant gene models often failing to match the biologically observed structure. Many prediction methods can be bolstered by using experimental data such as RNA-seq. However, these resources are not always available, and rarely give a comprehensive portrait of an organism’s transcriptome due to temporal and tissue-specific expression profiles. RESULTS: Orthology between genes provides evolutionary evidence to guide the construction of gene models. OMGene (Optimise My Gene) aims to improve gene model accuracy in the absence of experimental data by optimising the consistency of multiple sequence alignments of orthologous genes from multiple species. Using RNA-seq data sets from plants, mammals, and fungi, considering intron/exon junction representation and exon coverage, and assessing the intra-orthogroup consistency of subcellular localisation predictions, we demonstrate the utility of OMGene for improving gene models in annotated genomes. CONCLUSIONS: We show that significant improvements in the accuracy of gene model annotations can be made, both in established and in de novo annotated genomes, by leveraging information from multiple species.
format Online
Article
Text
id pubmed-5923031
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59230312018-05-07 OMGene: mutual improvement of gene models through optimisation of evolutionary conservation Dunne, Michael P. Kelly, Steven BMC Genomics Software BACKGROUND: The accurate determination of the genomic coordinates for a given gene – its gene model – is of vital importance to the utility of its annotation, and the accuracy of bioinformatic analyses derived from it. Currently-available methods of computational gene prediction, while on the whole successful, frequently disagree on the model for a given predicted gene, with some or all of the variant gene models often failing to match the biologically observed structure. Many prediction methods can be bolstered by using experimental data such as RNA-seq. However, these resources are not always available, and rarely give a comprehensive portrait of an organism’s transcriptome due to temporal and tissue-specific expression profiles. RESULTS: Orthology between genes provides evolutionary evidence to guide the construction of gene models. OMGene (Optimise My Gene) aims to improve gene model accuracy in the absence of experimental data by optimising the consistency of multiple sequence alignments of orthologous genes from multiple species. Using RNA-seq data sets from plants, mammals, and fungi, considering intron/exon junction representation and exon coverage, and assessing the intra-orthogroup consistency of subcellular localisation predictions, we demonstrate the utility of OMGene for improving gene models in annotated genomes. CONCLUSIONS: We show that significant improvements in the accuracy of gene model annotations can be made, both in established and in de novo annotated genomes, by leveraging information from multiple species. BioMed Central 2018-04-27 /pmc/articles/PMC5923031/ /pubmed/29703150 http://dx.doi.org/10.1186/s12864-018-4704-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Dunne, Michael P.
Kelly, Steven
OMGene: mutual improvement of gene models through optimisation of evolutionary conservation
title OMGene: mutual improvement of gene models through optimisation of evolutionary conservation
title_full OMGene: mutual improvement of gene models through optimisation of evolutionary conservation
title_fullStr OMGene: mutual improvement of gene models through optimisation of evolutionary conservation
title_full_unstemmed OMGene: mutual improvement of gene models through optimisation of evolutionary conservation
title_short OMGene: mutual improvement of gene models through optimisation of evolutionary conservation
title_sort omgene: mutual improvement of gene models through optimisation of evolutionary conservation
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5923031/
https://www.ncbi.nlm.nih.gov/pubmed/29703150
http://dx.doi.org/10.1186/s12864-018-4704-z
work_keys_str_mv AT dunnemichaelp omgenemutualimprovementofgenemodelsthroughoptimisationofevolutionaryconservation
AT kellysteven omgenemutualimprovementofgenemodelsthroughoptimisationofevolutionaryconservation