Cargando…

Fast algorithms and heuristics for phylogenomics under ILS and hybridization

BACKGROUND: Phylogenomic analyses involving whole-genome or multi-locus data often entail dealing with incongruent gene trees. In this paper, we consider two causes of such incongruence, namely, incomplete lineage sorting (ILS) and hybridization, and consider both parsimony and probabilistic criteri...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Yun, Ristic, Nikola, Nakhleh, Luay
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852049/
https://www.ncbi.nlm.nih.gov/pubmed/24564257
http://dx.doi.org/10.1186/1471-2105-14-S15-S6
_version_ 1782294408696168448
author Yu, Yun
Ristic, Nikola
Nakhleh, Luay
author_facet Yu, Yun
Ristic, Nikola
Nakhleh, Luay
author_sort Yu, Yun
collection PubMed
description BACKGROUND: Phylogenomic analyses involving whole-genome or multi-locus data often entail dealing with incongruent gene trees. In this paper, we consider two causes of such incongruence, namely, incomplete lineage sorting (ILS) and hybridization, and consider both parsimony and probabilistic criteria for dealing with them. RESULTS: Under the assumption of ILS, computing the probability of a gene tree given a species tree is a very hard problem. We present a heuristic for speeding up the computation, and demonstrate how it scales up computations to data sizes that are not feasible to analyze using current techniques, while achieving very good accuracy. Further, under the assumption of both ILS and hybridization, computing the probability of a gene tree and parsimoniously reconciling it with a phylogenetic network are both very hard problems. We present two exact algorithms for these two problems that speed up existing techniques significantly and enable analyses of much larger data sets than is currently feasible. CONCLUSION: Our heuristics and algorithms enable phylogenomic analyses of larger (in terms of numbers of taxa) data sets than is currently feasible. Further, our methods account for ILS and hybridization, thus allowing analyses of reticulate evolutionary histories.
format Online
Article
Text
id pubmed-3852049
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38520492013-12-20 Fast algorithms and heuristics for phylogenomics under ILS and hybridization Yu, Yun Ristic, Nikola Nakhleh, Luay BMC Bioinformatics Proceedings BACKGROUND: Phylogenomic analyses involving whole-genome or multi-locus data often entail dealing with incongruent gene trees. In this paper, we consider two causes of such incongruence, namely, incomplete lineage sorting (ILS) and hybridization, and consider both parsimony and probabilistic criteria for dealing with them. RESULTS: Under the assumption of ILS, computing the probability of a gene tree given a species tree is a very hard problem. We present a heuristic for speeding up the computation, and demonstrate how it scales up computations to data sizes that are not feasible to analyze using current techniques, while achieving very good accuracy. Further, under the assumption of both ILS and hybridization, computing the probability of a gene tree and parsimoniously reconciling it with a phylogenetic network are both very hard problems. We present two exact algorithms for these two problems that speed up existing techniques significantly and enable analyses of much larger data sets than is currently feasible. CONCLUSION: Our heuristics and algorithms enable phylogenomic analyses of larger (in terms of numbers of taxa) data sets than is currently feasible. Further, our methods account for ILS and hybridization, thus allowing analyses of reticulate evolutionary histories. BioMed Central 2013-10-15 /pmc/articles/PMC3852049/ /pubmed/24564257 http://dx.doi.org/10.1186/1471-2105-14-S15-S6 Text en Copyright © 2013 Yu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Yu, Yun
Ristic, Nikola
Nakhleh, Luay
Fast algorithms and heuristics for phylogenomics under ILS and hybridization
title Fast algorithms and heuristics for phylogenomics under ILS and hybridization
title_full Fast algorithms and heuristics for phylogenomics under ILS and hybridization
title_fullStr Fast algorithms and heuristics for phylogenomics under ILS and hybridization
title_full_unstemmed Fast algorithms and heuristics for phylogenomics under ILS and hybridization
title_short Fast algorithms and heuristics for phylogenomics under ILS and hybridization
title_sort fast algorithms and heuristics for phylogenomics under ils and hybridization
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852049/
https://www.ncbi.nlm.nih.gov/pubmed/24564257
http://dx.doi.org/10.1186/1471-2105-14-S15-S6
work_keys_str_mv AT yuyun fastalgorithmsandheuristicsforphylogenomicsunderilsandhybridization
AT risticnikola fastalgorithmsandheuristicsforphylogenomicsunderilsandhybridization
AT nakhlehluay fastalgorithmsandheuristicsforphylogenomicsunderilsandhybridization