
TreeFix: Statistically Informed Gene Tree Error Correction Using Species Trees

Accurate gene tree reconstruction is a fundamental problem in phylogenetics, with many important applications. However, sequence data alone often lack enough information to confidently support one gene tree topology over many competing alternatives. Here, we present a novel framework for combining s...

Bibliographic Details
Main Authors: Wu, Yi-Chieh; Rasmussen, Matthew D.; Bansal, Mukul S.; Kellis, Manolis
Format: Online Article Text
Language: English
Published: Oxford University Press 2013
Subjects: Regular Articles
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526801/
https://www.ncbi.nlm.nih.gov/pubmed/22949484
http://dx.doi.org/10.1093/sysbio/sys076
author Wu, Yi-Chieh
Rasmussen, Matthew D.
Bansal, Mukul S.
Kellis, Manolis
collection PubMed
description Accurate gene tree reconstruction is a fundamental problem in phylogenetics, with many important applications. However, sequence data alone often lack enough information to confidently support one gene tree topology over many competing alternatives. Here, we present a novel framework for combining sequence data and species tree information, and we describe an implementation of this framework in TreeFix, a new phylogenetic program for improving gene tree reconstructions. Given a gene tree (preferably computed using a maximum-likelihood phylogenetic program), TreeFix finds a “statistically equivalent” gene tree that minimizes a species tree-based cost function. We have applied TreeFix to 2 clades of 12 Drosophila and 16 fungal genomes, as well as to simulated phylogenies and show that it dramatically improves reconstructions compared with current state-of-the-art programs. Given its accuracy, speed, and simplicity, TreeFix should be applicable to a wide range of analyses and have many important implications for future investigations of gene evolution. The source code and a sample data set are available at http://compbio.mit.edu/treefix.
format Online
Article
Text
id pubmed-3526801
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-3526801 2012-12-20 Syst Biol Regular Articles Oxford University Press 2013-01 2012-11-19 /pmc/articles/PMC3526801/ /pubmed/22949484 http://dx.doi.org/10.1093/sysbio/sys076 Text en
© The Author(s) 2012. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
title TreeFix: Statistically Informed Gene Tree Error Correction Using Species Trees
topic Regular Articles