Cargando…

SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations

BACKGROUND: Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed t...

Descripción completa

Detalles Bibliográficos
Autores principales: Martins, Wellington S, Carmo, Welton C, Longo, Humberto J, Rosa, Thierson C, Rangel, Thiago F
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4225676/
https://www.ncbi.nlm.nih.gov/pubmed/24229408
http://dx.doi.org/10.1186/1471-2105-14-324
_version_ 1782343551694143488
author Martins, Wellington S
Carmo, Welton C
Longo, Humberto J
Rosa, Thierson C
Rangel, Thiago F
author_facet Martins, Wellington S
Carmo, Welton C
Longo, Humberto J
Rosa, Thierson C
Rangel, Thiago F
author_sort Martins, Wellington S
collection PubMed
description BACKGROUND: Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. RESULTS: In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. CONCLUSION: We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets.
format Online
Article
Text
id pubmed-4225676
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42256762014-11-12 SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations Martins, Wellington S Carmo, Welton C Longo, Humberto J Rosa, Thierson C Rangel, Thiago F BMC Bioinformatics Software BACKGROUND: Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. RESULTS: In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. CONCLUSION: We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets. BioMed Central 2013-11-15 /pmc/articles/PMC4225676/ /pubmed/24229408 http://dx.doi.org/10.1186/1471-2105-14-324 Text en Copyright © 2013 Martins et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Martins, Wellington S
Carmo, Welton C
Longo, Humberto J
Rosa, Thierson C
Rangel, Thiago F
SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations
title SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations
title_full SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations
title_fullStr SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations
title_full_unstemmed SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations
title_short SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations
title_sort sunplin: simulation with uncertainty for phylogenetic investigations
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4225676/
https://www.ncbi.nlm.nih.gov/pubmed/24229408
http://dx.doi.org/10.1186/1471-2105-14-324
work_keys_str_mv AT martinswellingtons sunplinsimulationwithuncertaintyforphylogeneticinvestigations
AT carmoweltonc sunplinsimulationwithuncertaintyforphylogeneticinvestigations
AT longohumbertoj sunplinsimulationwithuncertaintyforphylogeneticinvestigations
AT rosathiersonc sunplinsimulationwithuncertaintyforphylogeneticinvestigations
AT rangelthiagof sunplinsimulationwithuncertaintyforphylogeneticinvestigations