Cargando…

Comparison of mode estimation methods and application in molecular clock analysis

BACKGROUND: Distributions of time estimates in molecular clock studies are sometimes skewed or contain outliers. In those cases, the mode is a better estimator of the overall time of divergence than the mean or median. However, different methods are available for estimating the mode. We compared the...

Descripción completa

Detalles Bibliográficos
Autores principales: Hedges, S Blair, Shah, Prachi
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2003
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC183840/
https://www.ncbi.nlm.nih.gov/pubmed/12892571
http://dx.doi.org/10.1186/1471-2105-4-31
_version_ 1782120880185278464
author Hedges, S Blair
Shah, Prachi
author_facet Hedges, S Blair
Shah, Prachi
author_sort Hedges, S Blair
collection PubMed
description BACKGROUND: Distributions of time estimates in molecular clock studies are sometimes skewed or contain outliers. In those cases, the mode is a better estimator of the overall time of divergence than the mean or median. However, different methods are available for estimating the mode. We compared these methods in simulations to determine their strengths and weaknesses and further assessed their performance when applied to real data sets from a molecular clock study. RESULTS: We found that the half-range mode and robust parametric mode methods have a lower bias than other mode methods under a diversity of conditions. However, the half-range mode suffers from a relatively high variance and the robust parametric mode is more susceptible to bias by outliers. We determined that bootstrapping reduces the variance of both mode estimators. Application of the different methods to real data sets yielded results that were concordant with the simulations. CONCLUSION: Because the half-range mode is a simple and fast method, and produced less bias overall in our simulations, we recommend the bootstrapped version of it as a general-purpose mode estimator and suggest a bootstrap method for obtaining the standard error and 95% confidence interval of the mode.
format Text
id pubmed-183840
institution National Center for Biotechnology Information
language English
publishDate 2003
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1838402003-08-27 Comparison of mode estimation methods and application in molecular clock analysis Hedges, S Blair Shah, Prachi BMC Bioinformatics Methodology Article BACKGROUND: Distributions of time estimates in molecular clock studies are sometimes skewed or contain outliers. In those cases, the mode is a better estimator of the overall time of divergence than the mean or median. However, different methods are available for estimating the mode. We compared these methods in simulations to determine their strengths and weaknesses and further assessed their performance when applied to real data sets from a molecular clock study. RESULTS: We found that the half-range mode and robust parametric mode methods have a lower bias than other mode methods under a diversity of conditions. However, the half-range mode suffers from a relatively high variance and the robust parametric mode is more susceptible to bias by outliers. We determined that bootstrapping reduces the variance of both mode estimators. Application of the different methods to real data sets yielded results that were concordant with the simulations. CONCLUSION: Because the half-range mode is a simple and fast method, and produced less bias overall in our simulations, we recommend the bootstrapped version of it as a general-purpose mode estimator and suggest a bootstrap method for obtaining the standard error and 95% confidence interval of the mode. BioMed Central 2003-07-31 /pmc/articles/PMC183840/ /pubmed/12892571 http://dx.doi.org/10.1186/1471-2105-4-31 Text en Copyright © 2003 Hedges and Shah; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Methodology Article
Hedges, S Blair
Shah, Prachi
Comparison of mode estimation methods and application in molecular clock analysis
title Comparison of mode estimation methods and application in molecular clock analysis
title_full Comparison of mode estimation methods and application in molecular clock analysis
title_fullStr Comparison of mode estimation methods and application in molecular clock analysis
title_full_unstemmed Comparison of mode estimation methods and application in molecular clock analysis
title_short Comparison of mode estimation methods and application in molecular clock analysis
title_sort comparison of mode estimation methods and application in molecular clock analysis
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC183840/
https://www.ncbi.nlm.nih.gov/pubmed/12892571
http://dx.doi.org/10.1186/1471-2105-4-31
work_keys_str_mv AT hedgessblair comparisonofmodeestimationmethodsandapplicationinmolecularclockanalysis
AT shahprachi comparisonofmodeestimationmethodsandapplicationinmolecularclockanalysis