Cargando…

Towards a practical O(nlogn) phylogeny algorithm

Recently, we have identified a randomized quartet phylogeny algorithm that has O(nlogn) runtime with high probability, which is asymptotically optimal. Our algorithm has high probability of returning the correct phylogeny when quartet errors are independent and occur with known probability, and when...

Descripción completa

Detalles Bibliográficos
Autores principales: Truszkowski, Jakub, Hao, Yanqi, Brown, Daniel G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3561654/
https://www.ncbi.nlm.nih.gov/pubmed/23181935
http://dx.doi.org/10.1186/1748-7188-7-32
_version_ 1782257996206702592
author Truszkowski, Jakub
Hao, Yanqi
Brown, Daniel G
author_facet Truszkowski, Jakub
Hao, Yanqi
Brown, Daniel G
author_sort Truszkowski, Jakub
collection PubMed
description Recently, we have identified a randomized quartet phylogeny algorithm that has O(nlogn) runtime with high probability, which is asymptotically optimal. Our algorithm has high probability of returning the correct phylogeny when quartet errors are independent and occur with known probability, and when the algorithm uses a guide tree on O(loglogn) taxa that is correct with high probability. In practice, none of these assumptions is correct: quartet errors are positively correlated and occur with unknown probability, and the guide tree is often error prone. Here, we bring our work out of the purely theoretical setting. We present a variety of extensions which, while only slowing the algorithm down by a constant factor, make its performance nearly comparable to that of Neighbour Joining , which requires Θ(n(3)) runtime in existing implementations. Our results suggest a new direction for quartet-based phylogenetic reconstruction that may yield striking speed improvements at minimal accuracy cost. An early prototype implementation of our software is available at http://www.cs.uwaterloo.ca/jmtruszk/qtree.tar.gz.
format Online
Article
Text
id pubmed-3561654
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35616542013-02-05 Towards a practical O(nlogn) phylogeny algorithm Truszkowski, Jakub Hao, Yanqi Brown, Daniel G Algorithms Mol Biol Research Recently, we have identified a randomized quartet phylogeny algorithm that has O(nlogn) runtime with high probability, which is asymptotically optimal. Our algorithm has high probability of returning the correct phylogeny when quartet errors are independent and occur with known probability, and when the algorithm uses a guide tree on O(loglogn) taxa that is correct with high probability. In practice, none of these assumptions is correct: quartet errors are positively correlated and occur with unknown probability, and the guide tree is often error prone. Here, we bring our work out of the purely theoretical setting. We present a variety of extensions which, while only slowing the algorithm down by a constant factor, make its performance nearly comparable to that of Neighbour Joining , which requires Θ(n(3)) runtime in existing implementations. Our results suggest a new direction for quartet-based phylogenetic reconstruction that may yield striking speed improvements at minimal accuracy cost. An early prototype implementation of our software is available at http://www.cs.uwaterloo.ca/jmtruszk/qtree.tar.gz. BioMed Central 2012-11-26 /pmc/articles/PMC3561654/ /pubmed/23181935 http://dx.doi.org/10.1186/1748-7188-7-32 Text en Copyright ©2012 Truszkowski et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Truszkowski, Jakub
Hao, Yanqi
Brown, Daniel G
Towards a practical O(nlogn) phylogeny algorithm
title Towards a practical O(nlogn) phylogeny algorithm
title_full Towards a practical O(nlogn) phylogeny algorithm
title_fullStr Towards a practical O(nlogn) phylogeny algorithm
title_full_unstemmed Towards a practical O(nlogn) phylogeny algorithm
title_short Towards a practical O(nlogn) phylogeny algorithm
title_sort towards a practical o(nlogn) phylogeny algorithm
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3561654/
https://www.ncbi.nlm.nih.gov/pubmed/23181935
http://dx.doi.org/10.1186/1748-7188-7-32
work_keys_str_mv AT truszkowskijakub towardsapracticalonlognphylogenyalgorithm
AT haoyanqi towardsapracticalonlognphylogenyalgorithm
AT browndanielg towardsapracticalonlognphylogenyalgorithm