Cargando…

Efficient Gene Tree Correction Guided by Genome Evolution

MOTIVATIONS: Gene trees inferred solely from multiple alignments of homologous sequences often contain weakly supported and uncertain branches. Information for their full resolution may lie in the dependency between gene families and their genomic context. Integrative methods, using species tree inf...

Descripción completa

Detalles Bibliográficos
Autores principales: Noutahi, Emmanuel, Semeria, Magali, Lafond, Manuel, Seguin, Jonathan, Boussau, Bastien, Guéguen, Laurent, El-Mabrouk, Nadia, Tannier, Eric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4981423/
https://www.ncbi.nlm.nih.gov/pubmed/27513924
http://dx.doi.org/10.1371/journal.pone.0159559
_version_ 1782447613116678144
author Noutahi, Emmanuel
Semeria, Magali
Lafond, Manuel
Seguin, Jonathan
Boussau, Bastien
Guéguen, Laurent
El-Mabrouk, Nadia
Tannier, Eric
author_facet Noutahi, Emmanuel
Semeria, Magali
Lafond, Manuel
Seguin, Jonathan
Boussau, Bastien
Guéguen, Laurent
El-Mabrouk, Nadia
Tannier, Eric
author_sort Noutahi, Emmanuel
collection PubMed
description MOTIVATIONS: Gene trees inferred solely from multiple alignments of homologous sequences often contain weakly supported and uncertain branches. Information for their full resolution may lie in the dependency between gene families and their genomic context. Integrative methods, using species tree information in addition to sequence information, often rely on a computationally intensive tree space search which forecloses an application to large genomic databases. RESULTS: We propose a new method, called ProfileNJ, that takes a gene tree with statistical supports on its branches, and corrects its weakly supported parts by using a combination of information from a species tree and a distance matrix. Its low running time enabled us to use it on the whole Ensembl Compara database, for which we propose an alternative, arguably more plausible set of gene trees. This allowed us to perform a genome-wide analysis of duplication and loss patterns on the history of 63 eukaryote species, and predict ancestral gene content and order for all ancestors along the phylogeny. AVAILABILITY: A web interface called RefineTree, including ProfileNJ as well as a other gene tree correction methods, which we also test on the Ensembl gene families, is available at: http://www-ens.iro.umontreal.ca/~adbit/polytomysolver.html. The code of ProfileNJ as well as the set of gene trees corrected by ProfileNJ from Ensembl Compara version 73 families are also made available.
format Online
Article
Text
id pubmed-4981423
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-49814232016-08-29 Efficient Gene Tree Correction Guided by Genome Evolution Noutahi, Emmanuel Semeria, Magali Lafond, Manuel Seguin, Jonathan Boussau, Bastien Guéguen, Laurent El-Mabrouk, Nadia Tannier, Eric PLoS One Research Article MOTIVATIONS: Gene trees inferred solely from multiple alignments of homologous sequences often contain weakly supported and uncertain branches. Information for their full resolution may lie in the dependency between gene families and their genomic context. Integrative methods, using species tree information in addition to sequence information, often rely on a computationally intensive tree space search which forecloses an application to large genomic databases. RESULTS: We propose a new method, called ProfileNJ, that takes a gene tree with statistical supports on its branches, and corrects its weakly supported parts by using a combination of information from a species tree and a distance matrix. Its low running time enabled us to use it on the whole Ensembl Compara database, for which we propose an alternative, arguably more plausible set of gene trees. This allowed us to perform a genome-wide analysis of duplication and loss patterns on the history of 63 eukaryote species, and predict ancestral gene content and order for all ancestors along the phylogeny. AVAILABILITY: A web interface called RefineTree, including ProfileNJ as well as a other gene tree correction methods, which we also test on the Ensembl gene families, is available at: http://www-ens.iro.umontreal.ca/~adbit/polytomysolver.html. The code of ProfileNJ as well as the set of gene trees corrected by ProfileNJ from Ensembl Compara version 73 families are also made available. Public Library of Science 2016-08-11 /pmc/articles/PMC4981423/ /pubmed/27513924 http://dx.doi.org/10.1371/journal.pone.0159559 Text en © 2016 Noutahi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Noutahi, Emmanuel
Semeria, Magali
Lafond, Manuel
Seguin, Jonathan
Boussau, Bastien
Guéguen, Laurent
El-Mabrouk, Nadia
Tannier, Eric
Efficient Gene Tree Correction Guided by Genome Evolution
title Efficient Gene Tree Correction Guided by Genome Evolution
title_full Efficient Gene Tree Correction Guided by Genome Evolution
title_fullStr Efficient Gene Tree Correction Guided by Genome Evolution
title_full_unstemmed Efficient Gene Tree Correction Guided by Genome Evolution
title_short Efficient Gene Tree Correction Guided by Genome Evolution
title_sort efficient gene tree correction guided by genome evolution
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4981423/
https://www.ncbi.nlm.nih.gov/pubmed/27513924
http://dx.doi.org/10.1371/journal.pone.0159559
work_keys_str_mv AT noutahiemmanuel efficientgenetreecorrectionguidedbygenomeevolution
AT semeriamagali efficientgenetreecorrectionguidedbygenomeevolution
AT lafondmanuel efficientgenetreecorrectionguidedbygenomeevolution
AT seguinjonathan efficientgenetreecorrectionguidedbygenomeevolution
AT boussaubastien efficientgenetreecorrectionguidedbygenomeevolution
AT gueguenlaurent efficientgenetreecorrectionguidedbygenomeevolution
AT elmabrouknadia efficientgenetreecorrectionguidedbygenomeevolution
AT tanniereric efficientgenetreecorrectionguidedbygenomeevolution