Cargando…

The efficiency of different search strategies in estimating parsimony jackknife, bootstrap, and Bremer support

BACKGROUND: For parsimony analyses, the most common way to estimate confidence is by resampling plans (nonparametric bootstrap, jackknife), and Bremer support (Decay indices). The recent literature reveals that parameter settings that are quite commonly employed are not those that are recommended by...

Descripción completa

Detalles Bibliográficos
Autor principal: Müller, Kai F
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1282575/
https://www.ncbi.nlm.nih.gov/pubmed/16255783
http://dx.doi.org/10.1186/1471-2148-5-58
_version_ 1782126136674746368
author Müller, Kai F
author_facet Müller, Kai F
author_sort Müller, Kai F
collection PubMed
description BACKGROUND: For parsimony analyses, the most common way to estimate confidence is by resampling plans (nonparametric bootstrap, jackknife), and Bremer support (Decay indices). The recent literature reveals that parameter settings that are quite commonly employed are not those that are recommended by theoretical considerations and by previous empirical studies. The optimal search strategy to be applied during resampling was previously addressed solely via standard search strategies available in PAUP*. The question of a compromise between search extensiveness and improved support accuracy for Bremer support received even less attention. A set of experiments was conducted on different datasets to find an empirical cut-off point at which increased search extensiveness does not significantly change Bremer support and jackknife or bootstrap proportions any more. RESULTS: For the number of replicates needed for accurate estimates of support in resampling plans, a diagram is provided that helps to address the question whether apparently different support values really differ significantly. It is shown that the use of random addition cycles and parsimony ratchet iterations during bootstrapping does not translate into higher support, nor does any extension of the search extensiveness beyond the rather moderate effort of TBR (tree bisection and reconnection branch swapping) plus saving one tree per replicate. Instead, in case of very large matrices, saving more than one shortest tree per iteration and using a strict consensus tree of these yields decreased support compared to saving only one tree. This can be interpreted as a small risk of overestimating support but should be more than compensated by other factors that counteract an enhanced type I error. With regard to Bremer support, a rule of thumb can be derived stating that not much is gained relative to the surplus computational effort when searches are extended beyond 20 ratchet iterations per constrained node, at least not for datasets that fall within the size range found in the current literature. CONCLUSION: In view of these results, calculating bootstrap or jackknife proportions with narrow confidence intervals even for very large datasets can be achieved with less expense than often thought. In particular, iterated bootstrap methods that aim at reducing statistical bias inherent to these proportions are more feasible when the individual bootstrap searches require less time.
format Text
id pubmed-1282575
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-12825752005-11-13 The efficiency of different search strategies in estimating parsimony jackknife, bootstrap, and Bremer support Müller, Kai F BMC Evol Biol Methodology Article BACKGROUND: For parsimony analyses, the most common way to estimate confidence is by resampling plans (nonparametric bootstrap, jackknife), and Bremer support (Decay indices). The recent literature reveals that parameter settings that are quite commonly employed are not those that are recommended by theoretical considerations and by previous empirical studies. The optimal search strategy to be applied during resampling was previously addressed solely via standard search strategies available in PAUP*. The question of a compromise between search extensiveness and improved support accuracy for Bremer support received even less attention. A set of experiments was conducted on different datasets to find an empirical cut-off point at which increased search extensiveness does not significantly change Bremer support and jackknife or bootstrap proportions any more. RESULTS: For the number of replicates needed for accurate estimates of support in resampling plans, a diagram is provided that helps to address the question whether apparently different support values really differ significantly. It is shown that the use of random addition cycles and parsimony ratchet iterations during bootstrapping does not translate into higher support, nor does any extension of the search extensiveness beyond the rather moderate effort of TBR (tree bisection and reconnection branch swapping) plus saving one tree per replicate. Instead, in case of very large matrices, saving more than one shortest tree per iteration and using a strict consensus tree of these yields decreased support compared to saving only one tree. This can be interpreted as a small risk of overestimating support but should be more than compensated by other factors that counteract an enhanced type I error. With regard to Bremer support, a rule of thumb can be derived stating that not much is gained relative to the surplus computational effort when searches are extended beyond 20 ratchet iterations per constrained node, at least not for datasets that fall within the size range found in the current literature. CONCLUSION: In view of these results, calculating bootstrap or jackknife proportions with narrow confidence intervals even for very large datasets can be achieved with less expense than often thought. In particular, iterated bootstrap methods that aim at reducing statistical bias inherent to these proportions are more feasible when the individual bootstrap searches require less time. BioMed Central 2005-10-29 /pmc/articles/PMC1282575/ /pubmed/16255783 http://dx.doi.org/10.1186/1471-2148-5-58 Text en Copyright © 2005 Müller; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Müller, Kai F
The efficiency of different search strategies in estimating parsimony jackknife, bootstrap, and Bremer support
title The efficiency of different search strategies in estimating parsimony jackknife, bootstrap, and Bremer support
title_full The efficiency of different search strategies in estimating parsimony jackknife, bootstrap, and Bremer support
title_fullStr The efficiency of different search strategies in estimating parsimony jackknife, bootstrap, and Bremer support
title_full_unstemmed The efficiency of different search strategies in estimating parsimony jackknife, bootstrap, and Bremer support
title_short The efficiency of different search strategies in estimating parsimony jackknife, bootstrap, and Bremer support
title_sort efficiency of different search strategies in estimating parsimony jackknife, bootstrap, and bremer support
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1282575/
https://www.ncbi.nlm.nih.gov/pubmed/16255783
http://dx.doi.org/10.1186/1471-2148-5-58
work_keys_str_mv AT mullerkaif theefficiencyofdifferentsearchstrategiesinestimatingparsimonyjackknifebootstrapandbremersupport
AT mullerkaif efficiencyofdifferentsearchstrategiesinestimatingparsimonyjackknifebootstrapandbremersupport