Cargando…

Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation

Genome alignments provide a powerful basis to transfer gene annotations from a well-annotated reference genome to many other aligned genomes. The completeness of these annotations crucially depends on the sensitivity of the underlying genome alignment. Here, we investigated the impact of the genome...

Descripción completa

Detalles Bibliográficos
Autores principales: Sharma, Virag, Hiller, Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737078/
https://www.ncbi.nlm.nih.gov/pubmed/28645144
http://dx.doi.org/10.1093/nar/gkx554
_version_ 1783287472452534272
author Sharma, Virag
Hiller, Michael
author_facet Sharma, Virag
Hiller, Michael
author_sort Sharma, Virag
collection PubMed
description Genome alignments provide a powerful basis to transfer gene annotations from a well-annotated reference genome to many other aligned genomes. The completeness of these annotations crucially depends on the sensitivity of the underlying genome alignment. Here, we investigated the impact of the genome alignment parameters and found that parameters with a higher sensitivity allow the detection of thousands of novel alignments between orthologous exons that have been missed before. In particular, comparisons between species separated by an evolutionary distance of >0.75 substitutions per neutral site, like human and other non-placental vertebrates, benefit from increased sensitivity. To systematically test if increased sensitivity improves comparative gene annotations, we built a multiple alignment of 144 vertebrate genomes and used this alignment to map human genes to the other 143 vertebrates with CESAR. We found that higher alignment sensitivity substantially improves the completeness of comparative gene annotations by adding on average 2382 and 7440 novel exons and 117 and 317 novel genes for mammalian and non-mammalian species, respectively. Our results suggest a more sensitive alignment strategy that should generally be used for genome alignments between distantly-related species. Our 144-vertebrate genome alignment and the comparative gene annotations (https://bds.mpi-cbg.de/hillerlab/144VertebrateAlignment_CESAR/) are a valuable resource for comparative genomics.
format Online
Article
Text
id pubmed-5737078
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57370782018-01-08 Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation Sharma, Virag Hiller, Michael Nucleic Acids Res Genomics Genome alignments provide a powerful basis to transfer gene annotations from a well-annotated reference genome to many other aligned genomes. The completeness of these annotations crucially depends on the sensitivity of the underlying genome alignment. Here, we investigated the impact of the genome alignment parameters and found that parameters with a higher sensitivity allow the detection of thousands of novel alignments between orthologous exons that have been missed before. In particular, comparisons between species separated by an evolutionary distance of >0.75 substitutions per neutral site, like human and other non-placental vertebrates, benefit from increased sensitivity. To systematically test if increased sensitivity improves comparative gene annotations, we built a multiple alignment of 144 vertebrate genomes and used this alignment to map human genes to the other 143 vertebrates with CESAR. We found that higher alignment sensitivity substantially improves the completeness of comparative gene annotations by adding on average 2382 and 7440 novel exons and 117 and 317 novel genes for mammalian and non-mammalian species, respectively. Our results suggest a more sensitive alignment strategy that should generally be used for genome alignments between distantly-related species. Our 144-vertebrate genome alignment and the comparative gene annotations (https://bds.mpi-cbg.de/hillerlab/144VertebrateAlignment_CESAR/) are a valuable resource for comparative genomics. Oxford University Press 2017-08-21 2017-06-21 /pmc/articles/PMC5737078/ /pubmed/28645144 http://dx.doi.org/10.1093/nar/gkx554 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Genomics
Sharma, Virag
Hiller, Michael
Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation
title Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation
title_full Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation
title_fullStr Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation
title_full_unstemmed Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation
title_short Increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation
title_sort increased alignment sensitivity improves the usage of genome alignments for comparative gene annotation
topic Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5737078/
https://www.ncbi.nlm.nih.gov/pubmed/28645144
http://dx.doi.org/10.1093/nar/gkx554
work_keys_str_mv AT sharmavirag increasedalignmentsensitivityimprovestheusageofgenomealignmentsforcomparativegeneannotation
AT hillermichael increasedalignmentsensitivityimprovestheusageofgenomealignmentsforcomparativegeneannotation