Cargando…
Using tree diversity to compare phylogenetic heuristics
BACKGROUND: Evolutionary trees are family trees that represent the relationships between a group of organisms. Phylogenetic heuristics are used to search stochastically for the best-scoring trees in tree space. Given that better tree scores are believed to be better approximations of the true phylog...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2681068/ https://www.ncbi.nlm.nih.gov/pubmed/19426451 http://dx.doi.org/10.1186/1471-2105-10-S4-S3 |
_version_ | 1782167009661812736 |
---|---|
author | Sul, Seung-Jin Matthews, Suzanne Williams, Tiffani L |
author_facet | Sul, Seung-Jin Matthews, Suzanne Williams, Tiffani L |
author_sort | Sul, Seung-Jin |
collection | PubMed |
description | BACKGROUND: Evolutionary trees are family trees that represent the relationships between a group of organisms. Phylogenetic heuristics are used to search stochastically for the best-scoring trees in tree space. Given that better tree scores are believed to be better approximations of the true phylogeny, traditional evaluation techniques have used tree scores to determine the heuristics that find the best scores in the fastest time. We develop new techniques to evaluate phylogenetic heuristics based on both tree scores and topologies to compare Pauprat and Rec-I-DCM3, two popular Maximum Parsimony search algorithms. RESULTS: Our results show that although Pauprat and Rec-I-DCM3 find the trees with the same best scores, topologically these trees are quite different. Furthermore, the Rec-I-DCM3 trees cluster distinctly from the Pauprat trees. In addition to our heatmap visualizations of using parsimony scores and the Robinson-Foulds distance to compare best-scoring trees found by the two heuristics, we also develop entropy-based methods to show the diversity of the trees found. Overall, Pauprat identifies more diverse trees than Rec-I-DCM3. CONCLUSION: Overall, our work shows that there is value to comparing heuristics beyond the parsimony scores that they find. Pauprat is a slower heuristic than Rec-I-DCM3. However, our work shows that there is tremendous value in using Pauprat to reconstruct trees—especially since it finds identical scoring but topologically distinct trees. Hence, instead of discounting Pauprat, effort should go in improving its implementation. Ultimately, improved performance measures lead to better phylogenetic heuristics and will result in better approximations of the true evolutionary history of the organisms of interest. |
format | Text |
id | pubmed-2681068 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26810682009-05-13 Using tree diversity to compare phylogenetic heuristics Sul, Seung-Jin Matthews, Suzanne Williams, Tiffani L BMC Bioinformatics Proceedings BACKGROUND: Evolutionary trees are family trees that represent the relationships between a group of organisms. Phylogenetic heuristics are used to search stochastically for the best-scoring trees in tree space. Given that better tree scores are believed to be better approximations of the true phylogeny, traditional evaluation techniques have used tree scores to determine the heuristics that find the best scores in the fastest time. We develop new techniques to evaluate phylogenetic heuristics based on both tree scores and topologies to compare Pauprat and Rec-I-DCM3, two popular Maximum Parsimony search algorithms. RESULTS: Our results show that although Pauprat and Rec-I-DCM3 find the trees with the same best scores, topologically these trees are quite different. Furthermore, the Rec-I-DCM3 trees cluster distinctly from the Pauprat trees. In addition to our heatmap visualizations of using parsimony scores and the Robinson-Foulds distance to compare best-scoring trees found by the two heuristics, we also develop entropy-based methods to show the diversity of the trees found. Overall, Pauprat identifies more diverse trees than Rec-I-DCM3. CONCLUSION: Overall, our work shows that there is value to comparing heuristics beyond the parsimony scores that they find. Pauprat is a slower heuristic than Rec-I-DCM3. However, our work shows that there is tremendous value in using Pauprat to reconstruct trees—especially since it finds identical scoring but topologically distinct trees. Hence, instead of discounting Pauprat, effort should go in improving its implementation. Ultimately, improved performance measures lead to better phylogenetic heuristics and will result in better approximations of the true evolutionary history of the organisms of interest. BioMed Central 2009-04-29 /pmc/articles/PMC2681068/ /pubmed/19426451 http://dx.doi.org/10.1186/1471-2105-10-S4-S3 Text en Copyright © 2009 Sul et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Sul, Seung-Jin Matthews, Suzanne Williams, Tiffani L Using tree diversity to compare phylogenetic heuristics |
title | Using tree diversity to compare phylogenetic heuristics |
title_full | Using tree diversity to compare phylogenetic heuristics |
title_fullStr | Using tree diversity to compare phylogenetic heuristics |
title_full_unstemmed | Using tree diversity to compare phylogenetic heuristics |
title_short | Using tree diversity to compare phylogenetic heuristics |
title_sort | using tree diversity to compare phylogenetic heuristics |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2681068/ https://www.ncbi.nlm.nih.gov/pubmed/19426451 http://dx.doi.org/10.1186/1471-2105-10-S4-S3 |
work_keys_str_mv | AT sulseungjin usingtreediversitytocomparephylogeneticheuristics AT matthewssuzanne usingtreediversitytocomparephylogeneticheuristics AT williamstiffanil usingtreediversitytocomparephylogeneticheuristics |