TreeFix: Statistically Informed Gene Tree Error Correction Using Species Trees
Accurate gene tree reconstruction is a fundamental problem in phylogenetics, with many important applications. However, sequence data alone often lack enough information to confidently support one gene tree topology over many competing alternatives. Here, we present a novel framework for combining s...
Main Authors: | Wu, Yi-Chieh; Rasmussen, Matthew D.; Bansal, Mukul S.; Kellis, Manolis |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press, 2013 |
Subjects: | Regular Articles |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526801/ https://www.ncbi.nlm.nih.gov/pubmed/22949484 http://dx.doi.org/10.1093/sysbio/sys076 |
_version_ | 1782253626193870848 |
---|---|
author | Wu, Yi-Chieh Rasmussen, Matthew D. Bansal, Mukul S. Kellis, Manolis |
author_facet | Wu, Yi-Chieh Rasmussen, Matthew D. Bansal, Mukul S. Kellis, Manolis |
author_sort | Wu, Yi-Chieh |
collection | PubMed |
description | Accurate gene tree reconstruction is a fundamental problem in phylogenetics, with many important applications. However, sequence data alone often lack enough information to confidently support one gene tree topology over many competing alternatives. Here, we present a novel framework for combining sequence data and species tree information, and we describe an implementation of this framework in TreeFix, a new phylogenetic program for improving gene tree reconstructions. Given a gene tree (preferably computed using a maximum-likelihood phylogenetic program), TreeFix finds a “statistically equivalent” gene tree that minimizes a species tree-based cost function. We have applied TreeFix to 2 clades of 12 Drosophila and 16 fungal genomes, as well as to simulated phylogenies and show that it dramatically improves reconstructions compared with current state-of-the-art programs. Given its accuracy, speed, and simplicity, TreeFix should be applicable to a wide range of analyses and have many important implications for future investigations of gene evolution. The source code and a sample data set are available at http://compbio.mit.edu/treefix. |
format | Online Article Text |
id | pubmed-3526801 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-35268012012-12-20 TreeFix: Statistically Informed Gene Tree Error Correction Using Species Trees Wu, Yi-Chieh Rasmussen, Matthew D. Bansal, Mukul S. Kellis, Manolis Syst Biol Regular Articles Accurate gene tree reconstruction is a fundamental problem in phylogenetics, with many important applications. However, sequence data alone often lack enough information to confidently support one gene tree topology over many competing alternatives. Here, we present a novel framework for combining sequence data and species tree information, and we describe an implementation of this framework in TreeFix, a new phylogenetic program for improving gene tree reconstructions. Given a gene tree (preferably computed using a maximum-likelihood phylogenetic program), TreeFix finds a “statistically equivalent” gene tree that minimizes a species tree-based cost function. We have applied TreeFix to 2 clades of 12 Drosophila and 16 fungal genomes, as well as to simulated phylogenies and show that it dramatically improves reconstructions compared with current state-of-the-art programs. Given its accuracy, speed, and simplicity, TreeFix should be applicable to a wide range of analyses and have many important implications for future investigations of gene evolution. The source code and a sample data set are available at http://compbio.mit.edu/treefix. Oxford University Press 2013-01 2012-11-19 /pmc/articles/PMC3526801/ /pubmed/22949484 http://dx.doi.org/10.1093/sysbio/sys076 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Regular Articles Wu, Yi-Chieh Rasmussen, Matthew D. Bansal, Mukul S. Kellis, Manolis TreeFix: Statistically Informed Gene Tree Error Correction Using Species Trees |
title | TreeFix: Statistically Informed Gene Tree Error Correction Using Species Trees |
title_full | TreeFix: Statistically Informed Gene Tree Error Correction Using Species Trees |
title_fullStr | TreeFix: Statistically Informed Gene Tree Error Correction Using Species Trees |
title_full_unstemmed | TreeFix: Statistically Informed Gene Tree Error Correction Using Species Trees |
title_short | TreeFix: Statistically Informed Gene Tree Error Correction Using Species Trees |
title_sort | treefix: statistically informed gene tree error correction using species trees |
topic | Regular Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526801/ https://www.ncbi.nlm.nih.gov/pubmed/22949484 http://dx.doi.org/10.1093/sysbio/sys076 |