Cargando…

Using tree diversity to compare phylogenetic heuristics

BACKGROUND: Evolutionary trees are family trees that represent the relationships between a group of organisms. Phylogenetic heuristics are used to search stochastically for the best-scoring trees in tree space. Given that better tree scores are believed to be better approximations of the true phylog...

Descripción completa

Detalles Bibliográficos
Autores principales: Sul, Seung-Jin, Matthews, Suzanne, Williams, Tiffani L
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2681068/
https://www.ncbi.nlm.nih.gov/pubmed/19426451
http://dx.doi.org/10.1186/1471-2105-10-S4-S3
_version_ 1782167009661812736
author Sul, Seung-Jin
Matthews, Suzanne
Williams, Tiffani L
author_facet Sul, Seung-Jin
Matthews, Suzanne
Williams, Tiffani L
author_sort Sul, Seung-Jin
collection PubMed
description BACKGROUND: Evolutionary trees are family trees that represent the relationships between a group of organisms. Phylogenetic heuristics are used to search stochastically for the best-scoring trees in tree space. Given that better tree scores are believed to be better approximations of the true phylogeny, traditional evaluation techniques have used tree scores to determine the heuristics that find the best scores in the fastest time. We develop new techniques to evaluate phylogenetic heuristics based on both tree scores and topologies to compare Pauprat and Rec-I-DCM3, two popular Maximum Parsimony search algorithms. RESULTS: Our results show that although Pauprat and Rec-I-DCM3 find the trees with the same best scores, topologically these trees are quite different. Furthermore, the Rec-I-DCM3 trees cluster distinctly from the Pauprat trees. In addition to our heatmap visualizations of using parsimony scores and the Robinson-Foulds distance to compare best-scoring trees found by the two heuristics, we also develop entropy-based methods to show the diversity of the trees found. Overall, Pauprat identifies more diverse trees than Rec-I-DCM3. CONCLUSION: Overall, our work shows that there is value to comparing heuristics beyond the parsimony scores that they find. Pauprat is a slower heuristic than Rec-I-DCM3. However, our work shows that there is tremendous value in using Pauprat to reconstruct trees—especially since it finds identical scoring but topologically distinct trees. Hence, instead of discounting Pauprat, effort should go in improving its implementation. Ultimately, improved performance measures lead to better phylogenetic heuristics and will result in better approximations of the true evolutionary history of the organisms of interest.
format Text
id pubmed-2681068
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26810682009-05-13 Using tree diversity to compare phylogenetic heuristics Sul, Seung-Jin Matthews, Suzanne Williams, Tiffani L BMC Bioinformatics Proceedings BACKGROUND: Evolutionary trees are family trees that represent the relationships between a group of organisms. Phylogenetic heuristics are used to search stochastically for the best-scoring trees in tree space. Given that better tree scores are believed to be better approximations of the true phylogeny, traditional evaluation techniques have used tree scores to determine the heuristics that find the best scores in the fastest time. We develop new techniques to evaluate phylogenetic heuristics based on both tree scores and topologies to compare Pauprat and Rec-I-DCM3, two popular Maximum Parsimony search algorithms. RESULTS: Our results show that although Pauprat and Rec-I-DCM3 find the trees with the same best scores, topologically these trees are quite different. Furthermore, the Rec-I-DCM3 trees cluster distinctly from the Pauprat trees. In addition to our heatmap visualizations of using parsimony scores and the Robinson-Foulds distance to compare best-scoring trees found by the two heuristics, we also develop entropy-based methods to show the diversity of the trees found. Overall, Pauprat identifies more diverse trees than Rec-I-DCM3. CONCLUSION: Overall, our work shows that there is value to comparing heuristics beyond the parsimony scores that they find. Pauprat is a slower heuristic than Rec-I-DCM3. However, our work shows that there is tremendous value in using Pauprat to reconstruct trees—especially since it finds identical scoring but topologically distinct trees. Hence, instead of discounting Pauprat, effort should go in improving its implementation. Ultimately, improved performance measures lead to better phylogenetic heuristics and will result in better approximations of the true evolutionary history of the organisms of interest. BioMed Central 2009-04-29 /pmc/articles/PMC2681068/ /pubmed/19426451 http://dx.doi.org/10.1186/1471-2105-10-S4-S3 Text en Copyright © 2009 Sul et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Sul, Seung-Jin
Matthews, Suzanne
Williams, Tiffani L
Using tree diversity to compare phylogenetic heuristics
title Using tree diversity to compare phylogenetic heuristics
title_full Using tree diversity to compare phylogenetic heuristics
title_fullStr Using tree diversity to compare phylogenetic heuristics
title_full_unstemmed Using tree diversity to compare phylogenetic heuristics
title_short Using tree diversity to compare phylogenetic heuristics
title_sort using tree diversity to compare phylogenetic heuristics
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2681068/
https://www.ncbi.nlm.nih.gov/pubmed/19426451
http://dx.doi.org/10.1186/1471-2105-10-S4-S3
work_keys_str_mv AT sulseungjin usingtreediversitytocomparephylogeneticheuristics
AT matthewssuzanne usingtreediversitytocomparephylogeneticheuristics
AT williamstiffanil usingtreediversitytocomparephylogeneticheuristics