Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies
Species tree reconstruction is complicated by effects of incomplete lineage sorting, commonly modeled by the multi-species coalescent model (MSC). While there has been substantial progress in developing methods that estimate a species tree given a collection of gene trees, less attention has been pa...
Main Authors: | Sayyari, Erfan; Mirarab, Siavash |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2016 |
Subjects: | Fast Track |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915361/ https://www.ncbi.nlm.nih.gov/pubmed/27189547 http://dx.doi.org/10.1093/molbev/msw079 |
_version_ | 1782438692976066560 |
---|---|
author | Sayyari, Erfan Mirarab, Siavash |
author_facet | Sayyari, Erfan Mirarab, Siavash |
author_sort | Sayyari, Erfan |
collection | PubMed |
description | Species tree reconstruction is complicated by effects of incomplete lineage sorting, commonly modeled by the multi-species coalescent model (MSC). While there has been substantial progress in developing methods that estimate a species tree given a collection of gene trees, less attention has been paid to fast and accurate methods of quantifying support. In this article, we propose a fast algorithm to compute quartet-based support for each branch of a given species tree with regard to a given set of gene trees. We then show how the quartet support can be used in the context of the MSC to compute (1) the local posterior probability (PP) that the branch is in the species tree and (2) the length of the branch in coalescent units. We evaluate the precision and recall of the local PP on a wide set of simulated and biological datasets, and show that it has very high precision and improved recall compared with multi-locus bootstrapping. The estimated branch lengths are highly accurate when gene tree estimation error is low, but are underestimated when gene tree estimation error increases. Computation of both the branch length and local PP is implemented as new features in ASTRAL. |
format | Online Article Text |
id | pubmed-4915361 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-49153612016-06-22 Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies Sayyari, Erfan Mirarab, Siavash Mol Biol Evol Fast Track Species tree reconstruction is complicated by effects of incomplete lineage sorting, commonly modeled by the multi-species coalescent model (MSC). While there has been substantial progress in developing methods that estimate a species tree given a collection of gene trees, less attention has been paid to fast and accurate methods of quantifying support. In this article, we propose a fast algorithm to compute quartet-based support for each branch of a given species tree with regard to a given set of gene trees. We then show how the quartet support can be used in the context of the MSC to compute (1) the local posterior probability (PP) that the branch is in the species tree and (2) the length of the branch in coalescent units. We evaluate the precision and recall of the local PP on a wide set of simulated and biological datasets, and show that it has very high precision and improved recall compared with multi-locus bootstrapping. The estimated branch lengths are highly accurate when gene tree estimation error is low, but are underestimated when gene tree estimation error increases. Computation of both the branch length and local PP is implemented as new features in ASTRAL. Oxford University Press 2016-07 2016-04-15 /pmc/articles/PMC4915361/ /pubmed/27189547 http://dx.doi.org/10.1093/molbev/msw079 Text en © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Fast Track Sayyari, Erfan Mirarab, Siavash Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies |
title | Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies |
title_full | Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies |
title_fullStr | Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies |
title_full_unstemmed | Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies |
title_short | Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies |
title_sort | fast coalescent-based computation of local branch support from quartet frequencies |
topic | Fast Track |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915361/ https://www.ncbi.nlm.nih.gov/pubmed/27189547 http://dx.doi.org/10.1093/molbev/msw079 |
work_keys_str_mv | AT sayyarierfan fastcoalescentbasedcomputationoflocalbranchsupportfromquartetfrequencies AT mirarabsiavash fastcoalescentbasedcomputationoflocalbranchsupportfromquartetfrequencies |
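
The description above says that quartet support can be turned into a branch length in coalescent units under the MSC. The following is a minimal sketch of that relationship, not the ASTRAL implementation. It assumes the standard MSC result for an unrooted quartet: a gene tree matches the species tree topology with probability 1 - (2/3)e^(-d), where d is the internal branch length, and each of the two alternative topologies has probability (1/3)e^(-d). The function names and the plain maximum-likelihood inversion are illustrative assumptions; the estimator described in the article may differ in its details.

```python
import math


def quartet_probabilities(d):
    """Return MSC probabilities of the three unrooted quartet topologies.

    d is the internal branch length in coalescent units. The first entry
    is the topology that matches the species tree; the other two are the
    alternative topologies, which are equally likely under the MSC.
    """
    if d < 0:
        raise ValueError("branch length must be non-negative")
    alt = math.exp(-d) / 3.0
    return (1.0 - 2.0 * alt, alt, alt)


def branch_length_from_counts(z_match, n):
    """Invert the MSC relation to estimate a branch length (hypothetical helper).

    z_match is the number of gene-tree quartets matching the species tree
    topology and n is the total number of gene-tree quartets observed.
    """
    if n <= 0 or not 0 <= z_match <= n:
        raise ValueError("need 0 <= z_match <= n and n > 0")
    freq = z_match / n
    if freq <= 1.0 / 3.0:
        # At or below one third there is no signal for this branch.
        return 0.0
    if freq == 1.0:
        # No discordance observed: the estimate is unbounded.
        return math.inf
    return -math.log(1.5 * (1.0 - freq))


if __name__ == "__main__":
    # 80 of 100 gene-tree quartets agree with the species tree branch.
    print(branch_length_from_counts(80, 100))
```

The estimate grows without bound as discordance vanishes, and any gene tree estimation error that inflates discordance pulls it down, which is consistent with the underestimation the description reports.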