
ClusterBootstrap: An R package for the analysis of hierarchical data using generalized linear models with the cluster bootstrap

In the analysis of clustered or hierarchical data, a variety of statistical techniques can be applied. Most of these techniques have assumptions that are crucial to the validity of their outcome. Mixed models rely on the correct specification of the random effects structure. Generalized estimating e...


Bibliographic Details
Main authors: Deen, Mathijs; de Rooij, Mark
Format: Online Article Text
Language: English
Published: Springer US 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148287/
https://www.ncbi.nlm.nih.gov/pubmed/31089956
http://dx.doi.org/10.3758/s13428-019-01252-y
_version_ 1783520562108170240
author Deen, Mathijs
de Rooij, Mark
collection PubMed
description In the analysis of clustered or hierarchical data, a variety of statistical techniques can be applied. Most of these techniques have assumptions that are crucial to the validity of their outcome. Mixed models rely on the correct specification of the random effects structure. Generalized estimating equations are most efficient when the working correlation form is chosen correctly and are not feasible when the within-subject variable is non-factorial. Assumptions and limitations of another common approach, ANOVA for repeated measurements, are even more worrisome: listwise deletion when data are missing, the sphericity assumption, inability to model an unevenly spaced time variable and time-varying covariates, and the limitation to normally distributed dependent variables. This paper introduces ClusterBootstrap, an R package for the analysis of hierarchical data using generalized linear models with the cluster bootstrap (GLMCB). Being a bootstrap method, the technique is relatively assumption-free, and it has already been shown to be comparable, if not superior, to GEE in its performance. The paper has three goals. First, GLMCB will be introduced. Second, there will be an empirical example, using the ClusterBootstrap package for a Gaussian and a dichotomous dependent variable. Third, GLMCB will be compared to mixed models in a Monte Carlo experiment. Although GLMCB can be applied to a multitude of hierarchical data forms, this paper discusses it in the context of the analysis of repeated measurements or longitudinal data. It will become clear that the GLMCB is a promising alternative to mixed models and the ClusterBootstrap package an easy-to-use R implementation of the technique.
format Online
Article
Text
id pubmed-7148287
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-7148287 2020-04-16 ClusterBootstrap: An R package for the analysis of hierarchical data using generalized linear models with the cluster bootstrap Deen, Mathijs de Rooij, Mark Behav Res Methods Article
Springer US 2019-05-14 2020 /pmc/articles/PMC7148287/ /pubmed/31089956 http://dx.doi.org/10.3758/s13428-019-01252-y Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
title ClusterBootstrap: An R package for the analysis of hierarchical data using generalized linear models with the cluster bootstrap
topic Article