Cargando…

Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies

Species tree reconstruction is complicated by effects of incomplete lineage sorting, commonly modeled by the multi-species coalescent model (MSC). While there has been substantial progress in developing methods that estimate a species tree given a collection of gene trees, less attention has been pa...

Descripción completa

Detalles Bibliográficos
Autores principales: Sayyari, Erfan, Mirarab, Siavash
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915361/
https://www.ncbi.nlm.nih.gov/pubmed/27189547
http://dx.doi.org/10.1093/molbev/msw079
_version_ 1782438692976066560
author Sayyari, Erfan
Mirarab, Siavash
author_facet Sayyari, Erfan
Mirarab, Siavash
author_sort Sayyari, Erfan
collection PubMed
description Species tree reconstruction is complicated by effects of incomplete lineage sorting, commonly modeled by the multi-species coalescent model (MSC). While there has been substantial progress in developing methods that estimate a species tree given a collection of gene trees, less attention has been paid to fast and accurate methods of quantifying support. In this article, we propose a fast algorithm to compute quartet-based support for each branch of a given species tree with regard to a given set of gene trees. We then show how the quartet support can be used in the context of the MSC to compute (1) the local posterior probability (PP) that the branch is in the species tree and (2) the length of the branch in coalescent units. We evaluate the precision and recall of the local PP on a wide set of simulated and biological datasets, and show that it has very high precision and improved recall compared with multi-locus bootstrapping. The estimated branch lengths are highly accurate when gene tree estimation error is low, but are underestimated when gene tree estimation error increases. Computation of both the branch length and local PP is implemented as new features in ASTRAL.
format Online
Article
Text
id pubmed-4915361
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-49153612016-06-22 Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies Sayyari, Erfan Mirarab, Siavash Mol Biol Evol Fast Track Species tree reconstruction is complicated by effects of incomplete lineage sorting, commonly modeled by the multi-species coalescent model (MSC). While there has been substantial progress in developing methods that estimate a species tree given a collection of gene trees, less attention has been paid to fast and accurate methods of quantifying support. In this article, we propose a fast algorithm to compute quartet-based support for each branch of a given species tree with regard to a given set of gene trees. We then show how the quartet support can be used in the context of the MSC to compute (1) the local posterior probability (PP) that the branch is in the species tree and (2) the length of the branch in coalescent units. We evaluate the precision and recall of the local PP on a wide set of simulated and biological datasets, and show that it has very high precision and improved recall compared with multi-locus bootstrapping. The estimated branch lengths are highly accurate when gene tree estimation error is low, but are underestimated when gene tree estimation error increases. Computation of both the branch length and local PP is implemented as new features in ASTRAL. Oxford University Press 2016-07 2016-04-15 /pmc/articles/PMC4915361/ /pubmed/27189547 http://dx.doi.org/10.1093/molbev/msw079 Text en © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Fast Track
Sayyari, Erfan
Mirarab, Siavash
Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies
title Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies
title_full Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies
title_fullStr Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies
title_full_unstemmed Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies
title_short Fast Coalescent-Based Computation of Local Branch Support from Quartet Frequencies
title_sort fast coalescent-based computation of local branch support from quartet frequencies
topic Fast Track
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915361/
https://www.ncbi.nlm.nih.gov/pubmed/27189547
http://dx.doi.org/10.1093/molbev/msw079
work_keys_str_mv AT sayyarierfan fastcoalescentbasedcomputationoflocalbranchsupportfromquartetfrequencies
AT mirarabsiavash fastcoalescentbasedcomputationoflocalbranchsupportfromquartetfrequencies