Cargando…

Topology testing of phylogenies using least squares methods

BACKGROUND: The least squares (LS) method for constructing confidence sets of trees is closely related to LS tree building methods, in which the goodness of fit of the distances measured on the tree (patristic distances) to the observed distances between taxa is the criterion used for selecting the...

Descripción completa

Detalles Bibliográficos
Autores principales: Czarna, Aleksandra, Sanjuán, Rafael, González-Candelas, Fernando, Wróbel, Borys
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1698936/
https://www.ncbi.nlm.nih.gov/pubmed/17150093
http://dx.doi.org/10.1186/1471-2148-6-105
_version_ 1782131246612086784
author Czarna, Aleksandra
Sanjuán, Rafael
González-Candelas, Fernando
Wróbel, Borys
author_facet Czarna, Aleksandra
Sanjuán, Rafael
González-Candelas, Fernando
Wróbel, Borys
author_sort Czarna, Aleksandra
collection PubMed
description BACKGROUND: The least squares (LS) method for constructing confidence sets of trees is closely related to LS tree building methods, in which the goodness of fit of the distances measured on the tree (patristic distances) to the observed distances between taxa is the criterion used for selecting the best topology. The generalized LS (GLS) method for topology testing is often frustrated by the computational difficulties in calculating the covariance matrix and its inverse, which in practice requires approximations. The weighted LS (WLS) allows for a more efficient albeit approximate calculation of the test statistic by ignoring the covariances between the distances. RESULTS: The goal of this paper is to assess the applicability of the LS approach for constructing confidence sets of trees. We show that the approximations inherent to the WLS method did not affect negatively the accuracy and reliability of the test both in the analysis of biological sequences and DNA-DNA hybridization data (for which character-based testing methods cannot be used). On the other hand, we report several problems for the GLS method, at least for the available implementation. For many data sets of biological sequences, the GLS statistic could not be calculated. For some data sets for which it could, the GLS method included all the possible trees in the confidence set despite a strong phylogenetic signal in the data. Finally, contrary to WLS, for simulated sequences GLS showed undercoverage (frequent non-inclusion of the true tree in the confidence set). CONCLUSION: The WLS method provides a computationally efficient approximation to the GLS useful especially in exploratory analyses of confidence sets of trees, when assessing the phylogenetic signal in the data, and when other methods are not available.
format Text
id pubmed-1698936
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16989362006-12-19 Topology testing of phylogenies using least squares methods Czarna, Aleksandra Sanjuán, Rafael González-Candelas, Fernando Wróbel, Borys BMC Evol Biol Methodology Article BACKGROUND: The least squares (LS) method for constructing confidence sets of trees is closely related to LS tree building methods, in which the goodness of fit of the distances measured on the tree (patristic distances) to the observed distances between taxa is the criterion used for selecting the best topology. The generalized LS (GLS) method for topology testing is often frustrated by the computational difficulties in calculating the covariance matrix and its inverse, which in practice requires approximations. The weighted LS (WLS) allows for a more efficient albeit approximate calculation of the test statistic by ignoring the covariances between the distances. RESULTS: The goal of this paper is to assess the applicability of the LS approach for constructing confidence sets of trees. We show that the approximations inherent to the WLS method did not affect negatively the accuracy and reliability of the test both in the analysis of biological sequences and DNA-DNA hybridization data (for which character-based testing methods cannot be used). On the other hand, we report several problems for the GLS method, at least for the available implementation. For many data sets of biological sequences, the GLS statistic could not be calculated. For some data sets for which it could, the GLS method included all the possible trees in the confidence set despite a strong phylogenetic signal in the data. Finally, contrary to WLS, for simulated sequences GLS showed undercoverage (frequent non-inclusion of the true tree in the confidence set). CONCLUSION: The WLS method provides a computationally efficient approximation to the GLS useful especially in exploratory analyses of confidence sets of trees, when assessing the phylogenetic signal in the data, and when other methods are not available. BioMed Central 2006-12-06 /pmc/articles/PMC1698936/ /pubmed/17150093 http://dx.doi.org/10.1186/1471-2148-6-105 Text en Copyright © 2006 Czarna et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Czarna, Aleksandra
Sanjuán, Rafael
González-Candelas, Fernando
Wróbel, Borys
Topology testing of phylogenies using least squares methods
title Topology testing of phylogenies using least squares methods
title_full Topology testing of phylogenies using least squares methods
title_fullStr Topology testing of phylogenies using least squares methods
title_full_unstemmed Topology testing of phylogenies using least squares methods
title_short Topology testing of phylogenies using least squares methods
title_sort topology testing of phylogenies using least squares methods
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1698936/
https://www.ncbi.nlm.nih.gov/pubmed/17150093
http://dx.doi.org/10.1186/1471-2148-6-105
work_keys_str_mv AT czarnaaleksandra topologytestingofphylogeniesusingleastsquaresmethods
AT sanjuanrafael topologytestingofphylogeniesusingleastsquaresmethods
AT gonzalezcandelasfernando topologytestingofphylogeniesusingleastsquaresmethods
AT wrobelborys topologytestingofphylogeniesusingleastsquaresmethods