Cargando…

Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer

BACKGROUND: Species tree estimation is challenged by gene tree heterogeneity resulting from biological processes such as duplication and loss, hybridization, incomplete lineage sorting (ILS), and horizontal gene transfer (HGT). Mathematical theory about reconstructing species trees in the presence o...

Descripción completa

Detalles Bibliográficos
Autores principales: Davidson, Ruth, Vachaspati, Pranjal, Mirarab, Siavash, Warnow, Tandy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4603753/
https://www.ncbi.nlm.nih.gov/pubmed/26450506
http://dx.doi.org/10.1186/1471-2164-16-S10-S1
_version_ 1782394950260883456
author Davidson, Ruth
Vachaspati, Pranjal
Mirarab, Siavash
Warnow, Tandy
author_facet Davidson, Ruth
Vachaspati, Pranjal
Mirarab, Siavash
Warnow, Tandy
author_sort Davidson, Ruth
collection PubMed
description BACKGROUND: Species tree estimation is challenged by gene tree heterogeneity resulting from biological processes such as duplication and loss, hybridization, incomplete lineage sorting (ILS), and horizontal gene transfer (HGT). Mathematical theory about reconstructing species trees in the presence of HGT alone or ILS alone suggests that quartet-based species tree methods (known to be statistically consistent under ILS, or under bounded amounts of HGT) might be effective techniques for estimating species trees when both HGT and ILS are present. RESULTS: We evaluated several publicly available coalescent-based methods and concatenation under maximum likelihood on simulated datasets with moderate ILS and varying levels of HGT. Our study shows that two quartet-based species tree estimation methods (ASTRAL-2 and weighted Quartets MaxCut) are both highly accurate, even on datasets with high rates of HGT. In contrast, although NJst and concatenation using maximum likelihood are highly accurate under low HGT, they are less robust to high HGT rates. CONCLUSION: Our study shows that quartet-based species-tree estimation methods can be highly accurate under the presence of both HGT and ILS. The study suggests the possibility that some quartet-based methods might be statistically consistent under phylogenomic models of gene tree heterogeneity with both HGT and ILS.
format Online
Article
Text
id pubmed-4603753
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46037532015-10-14 Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer Davidson, Ruth Vachaspati, Pranjal Mirarab, Siavash Warnow, Tandy BMC Genomics Research BACKGROUND: Species tree estimation is challenged by gene tree heterogeneity resulting from biological processes such as duplication and loss, hybridization, incomplete lineage sorting (ILS), and horizontal gene transfer (HGT). Mathematical theory about reconstructing species trees in the presence of HGT alone or ILS alone suggests that quartet-based species tree methods (known to be statistically consistent under ILS, or under bounded amounts of HGT) might be effective techniques for estimating species trees when both HGT and ILS are present. RESULTS: We evaluated several publicly available coalescent-based methods and concatenation under maximum likelihood on simulated datasets with moderate ILS and varying levels of HGT. Our study shows that two quartet-based species tree estimation methods (ASTRAL-2 and weighted Quartets MaxCut) are both highly accurate, even on datasets with high rates of HGT. In contrast, although NJst and concatenation using maximum likelihood are highly accurate under low HGT, they are less robust to high HGT rates. CONCLUSION: Our study shows that quartet-based species-tree estimation methods can be highly accurate under the presence of both HGT and ILS. The study suggests the possibility that some quartet-based methods might be statistically consistent under phylogenomic models of gene tree heterogeneity with both HGT and ILS. BioMed Central 2015-10-02 /pmc/articles/PMC4603753/ /pubmed/26450506 http://dx.doi.org/10.1186/1471-2164-16-S10-S1 Text en Copyright © 2015 Davidson et al. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Davidson, Ruth
Vachaspati, Pranjal
Mirarab, Siavash
Warnow, Tandy
Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer
title Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer
title_full Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer
title_fullStr Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer
title_full_unstemmed Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer
title_short Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer
title_sort phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4603753/
https://www.ncbi.nlm.nih.gov/pubmed/26450506
http://dx.doi.org/10.1186/1471-2164-16-S10-S1
work_keys_str_mv AT davidsonruth phylogenomicspeciestreeestimationinthepresenceofincompletelineagesortingandhorizontalgenetransfer
AT vachaspatipranjal phylogenomicspeciestreeestimationinthepresenceofincompletelineagesortingandhorizontalgenetransfer
AT mirarabsiavash phylogenomicspeciestreeestimationinthepresenceofincompletelineagesortingandhorizontalgenetransfer
AT warnowtandy phylogenomicspeciestreeestimationinthepresenceofincompletelineagesortingandhorizontalgenetransfer