Cargando…

Can the buck always be passed to the highest level of clustering?

BACKGROUND: Clustering commonly affects the uncertainty of parameter estimates in epidemiological studies. Cluster-robust variance estimates (CRVE) are used to construct confidence intervals that account for single-level clustering, and are easily implemented in standard software. When data are clus...

Descripción completa

Detalles Bibliográficos
Autores principales: Bottomley, Christian, Kirby, Matthew J., Lindsay, Steve W., Alexander, Neal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4784323/
https://www.ncbi.nlm.nih.gov/pubmed/26956373
http://dx.doi.org/10.1186/s12874-016-0127-1
_version_ 1782420244307902464
author Bottomley, Christian
Kirby, Matthew J.
Lindsay, Steve W.
Alexander, Neal
author_facet Bottomley, Christian
Kirby, Matthew J.
Lindsay, Steve W.
Alexander, Neal
author_sort Bottomley, Christian
collection PubMed
description BACKGROUND: Clustering commonly affects the uncertainty of parameter estimates in epidemiological studies. Cluster-robust variance estimates (CRVE) are used to construct confidence intervals that account for single-level clustering, and are easily implemented in standard software. When data are clustered at more than one level (e.g. village and household) the level for the CRVE must be chosen. CRVE are consistent when used at the higher level of clustering (village), but since there are fewer clusters at the higher level, and consistency is an asymptotic property, there may be circumstances under which coverage is better from lower- rather than higher-level CRVE. Here we assess the relative importance of adjusting for clustering at the higher and lower level in a logistic regression model. METHODS: We performed a simulation study in which the coverage of 95 % confidence intervals was compared between adjustments at the higher and lower levels. RESULTS: Confidence intervals adjusted for the higher level of clustering had coverage close to 95 %, even when there were few clusters, provided that the intra-cluster correlation of the predictor was less than 0.5 for models with a single predictor and less than 0.2 for models with multiple predictors. CONCLUSIONS: When there are multiple levels of clustering it is generally preferable to use confidence intervals that account for the highest level of clustering. This only fails if there are few clusters at this level and the intra-cluster correlation of the predictor is high. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12874-016-0127-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4784323
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47843232016-03-10 Can the buck always be passed to the highest level of clustering? Bottomley, Christian Kirby, Matthew J. Lindsay, Steve W. Alexander, Neal BMC Med Res Methodol Research Article BACKGROUND: Clustering commonly affects the uncertainty of parameter estimates in epidemiological studies. Cluster-robust variance estimates (CRVE) are used to construct confidence intervals that account for single-level clustering, and are easily implemented in standard software. When data are clustered at more than one level (e.g. village and household) the level for the CRVE must be chosen. CRVE are consistent when used at the higher level of clustering (village), but since there are fewer clusters at the higher level, and consistency is an asymptotic property, there may be circumstances under which coverage is better from lower- rather than higher-level CRVE. Here we assess the relative importance of adjusting for clustering at the higher and lower level in a logistic regression model. METHODS: We performed a simulation study in which the coverage of 95 % confidence intervals was compared between adjustments at the higher and lower levels. RESULTS: Confidence intervals adjusted for the higher level of clustering had coverage close to 95 %, even when there were few clusters, provided that the intra-cluster correlation of the predictor was less than 0.5 for models with a single predictor and less than 0.2 for models with multiple predictors. CONCLUSIONS: When there are multiple levels of clustering it is generally preferable to use confidence intervals that account for the highest level of clustering. This only fails if there are few clusters at this level and the intra-cluster correlation of the predictor is high. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12874-016-0127-1) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-08 /pmc/articles/PMC4784323/ /pubmed/26956373 http://dx.doi.org/10.1186/s12874-016-0127-1 Text en © Bottomley et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Bottomley, Christian
Kirby, Matthew J.
Lindsay, Steve W.
Alexander, Neal
Can the buck always be passed to the highest level of clustering?
title Can the buck always be passed to the highest level of clustering?
title_full Can the buck always be passed to the highest level of clustering?
title_fullStr Can the buck always be passed to the highest level of clustering?
title_full_unstemmed Can the buck always be passed to the highest level of clustering?
title_short Can the buck always be passed to the highest level of clustering?
title_sort can the buck always be passed to the highest level of clustering?
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4784323/
https://www.ncbi.nlm.nih.gov/pubmed/26956373
http://dx.doi.org/10.1186/s12874-016-0127-1
work_keys_str_mv AT bottomleychristian canthebuckalwaysbepassedtothehighestlevelofclustering
AT kirbymatthewj canthebuckalwaysbepassedtothehighestlevelofclustering
AT lindsaystevew canthebuckalwaysbepassedtothehighestlevelofclustering
AT alexanderneal canthebuckalwaysbepassedtothehighestlevelofclustering