Cargando…

Fast and accurate methods for phylogenomic analyses

BACKGROUND: Species phylogenies are not estimated directly, but rather through phylogenetic analyses of different gene datasets. However, true gene trees can differ from the true species tree (and hence from one another) due to biological processes such as horizontal gene transfer, incomplete lineag...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Jimmy, Warnow, Tandy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3283310/
https://www.ncbi.nlm.nih.gov/pubmed/22152123
http://dx.doi.org/10.1186/1471-2105-12-S9-S4
_version_ 1782224182280454144
author Yang, Jimmy
Warnow, Tandy
author_facet Yang, Jimmy
Warnow, Tandy
author_sort Yang, Jimmy
collection PubMed
description BACKGROUND: Species phylogenies are not estimated directly, but rather through phylogenetic analyses of different gene datasets. However, true gene trees can differ from the true species tree (and hence from one another) due to biological processes such as horizontal gene transfer, incomplete lineage sorting, and gene duplication and loss, so that no single gene tree is a reliable estimate of the species tree. Several methods have been developed to estimate species trees from estimated gene trees, differing according to the specific algorithmic technique used and the biological model used to explain differences between species and gene trees. Relatively little is known about the relative performance of these methods. RESULTS: We report on a study evaluating several different methods for estimating species trees from sequence datasets, simulating sequence evolution under a complex model including indels (insertions and deletions), substitutions, and incomplete lineage sorting. The most important finding of our study is that some fast and simple methods are nearly as accurate as the most accurate methods, which employ sophisticated statistical methods and are computationally quite intensive. We also observe that methods that explicitly consider errors in the estimated gene trees produce more accurate trees than methods that assume the estimated gene trees are correct. CONCLUSIONS: Our study shows that highly accurate estimations of species trees are achievable, even when gene trees differ from each other and from the species tree, and that these estimations can be obtained using fairly simple and computationally tractable methods.
format Online
Article
Text
id pubmed-3283310
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32833102012-02-22 Fast and accurate methods for phylogenomic analyses Yang, Jimmy Warnow, Tandy BMC Bioinformatics Proceedings BACKGROUND: Species phylogenies are not estimated directly, but rather through phylogenetic analyses of different gene datasets. However, true gene trees can differ from the true species tree (and hence from one another) due to biological processes such as horizontal gene transfer, incomplete lineage sorting, and gene duplication and loss, so that no single gene tree is a reliable estimate of the species tree. Several methods have been developed to estimate species trees from estimated gene trees, differing according to the specific algorithmic technique used and the biological model used to explain differences between species and gene trees. Relatively little is known about the relative performance of these methods. RESULTS: We report on a study evaluating several different methods for estimating species trees from sequence datasets, simulating sequence evolution under a complex model including indels (insertions and deletions), substitutions, and incomplete lineage sorting. The most important finding of our study is that some fast and simple methods are nearly as accurate as the most accurate methods, which employ sophisticated statistical methods and are computationally quite intensive. We also observe that methods that explicitly consider errors in the estimated gene trees produce more accurate trees than methods that assume the estimated gene trees are correct. CONCLUSIONS: Our study shows that highly accurate estimations of species trees are achievable, even when gene trees differ from each other and from the species tree, and that these estimations can be obtained using fairly simple and computationally tractable methods. BioMed Central 2011-10-05 /pmc/articles/PMC3283310/ /pubmed/22152123 http://dx.doi.org/10.1186/1471-2105-12-S9-S4 Text en Copyright ©2011 Yang and Warnow; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Yang, Jimmy
Warnow, Tandy
Fast and accurate methods for phylogenomic analyses
title Fast and accurate methods for phylogenomic analyses
title_full Fast and accurate methods for phylogenomic analyses
title_fullStr Fast and accurate methods for phylogenomic analyses
title_full_unstemmed Fast and accurate methods for phylogenomic analyses
title_short Fast and accurate methods for phylogenomic analyses
title_sort fast and accurate methods for phylogenomic analyses
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3283310/
https://www.ncbi.nlm.nih.gov/pubmed/22152123
http://dx.doi.org/10.1186/1471-2105-12-S9-S4
work_keys_str_mv AT yangjimmy fastandaccuratemethodsforphylogenomicanalyses
AT warnowtandy fastandaccuratemethodsforphylogenomicanalyses