Cargando…

Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes

BACKGROUND: Logistic random effects models are a popular tool to analyze multilevel also called hierarchical data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models. METHODS: We used individual patient data from 8509 patients in 2...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Baoyue, Lingsma, Hester F, Steyerberg, Ewout W, Lesaffre, Emmanuel
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3112198/ https://www.ncbi.nlm.nih.gov/pubmed/21605357 http://dx.doi.org/10.1186/1471-2288-11-77

_version_	1782205715761332224
author	Li, Baoyue Lingsma, Hester F Steyerberg, Ewout W Lesaffre, Emmanuel
author_facet	Li, Baoyue Lingsma, Hester F Steyerberg, Ewout W Lesaffre, Emmanuel
author_sort	Li, Baoyue
collection	PubMed
description	BACKGROUND: Logistic random effects models are a popular tool to analyze multilevel also called hierarchical data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models. METHODS: We used individual patient data from 8509 patients in 231 centers with moderate and severe Traumatic Brain Injury (TBI) enrolled in eight Randomized Controlled Trials (RCTs) and three observational studies. We fitted logistic random effects regression models with the 5-point Glasgow Outcome Scale (GOS) as outcome, both dichotomized as well as ordinal, with center and/or trial as random effects, and as covariates age, motor score, pupil reactivity or trial. We then compared the implementations of frequentist and Bayesian methods to estimate the fixed and random effects. Frequentist approaches included R (lme4), Stata (GLLAMM), SAS (GLIMMIX and NLMIXED), MLwiN ([R]IGLS) and MIXOR, Bayesian approaches included WinBUGS, MLwiN (MCMC), R package MCMCglmm and SAS experimental procedure MCMC. Three data sets (the full data set and two sub-datasets) were analysed using basically two logistic random effects models with either one random effect for the center or two random effects for center and trial. For the ordinal outcome in the full data set also a proportional odds model with a random center effect was fitted. RESULTS: The packages gave similar parameter estimates for both the fixed and random effects and for the binary (and ordinal) models for the main study and when based on a relatively large number of level-1 (patient level) data compared to the number of level-2 (hospital level) data. However, when based on relatively sparse data set, i.e. when the numbers of level-1 and level-2 data units were about the same, the frequentist and Bayesian approaches showed somewhat different results. The software implementations differ considerably in flexibility, computation time, and usability. There are also differences in the availability of additional tools for model evaluation, such as diagnostic plots. The experimental SAS (version 9.2) procedure MCMC appeared to be inefficient. CONCLUSIONS: On relatively large data sets, the different software implementations of logistic random effects regression models produced similar results. Thus, for a large data set there seems to be no explicit preference (of course if there is no preference from a philosophical point of view) for either a frequentist or Bayesian approach (if based on vague priors). The choice for a particular implementation may largely depend on the desired flexibility, and the usability of the package. For small data sets the random effects variances are difficult to estimate. In the frequentist approaches the MLE of this variance was often estimated zero with a standard error that is either zero or could not be determined, while for Bayesian methods the estimates could depend on the chosen "non-informative" prior of the variance parameter. The starting value for the variance parameter may be also critical for the convergence of the Markov chain.
format	Online Article Text
id	pubmed-3112198
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-31121982011-06-11 Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes Li, Baoyue Lingsma, Hester F Steyerberg, Ewout W Lesaffre, Emmanuel BMC Med Res Methodol Research Article BACKGROUND: Logistic random effects models are a popular tool to analyze multilevel also called hierarchical data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models. METHODS: We used individual patient data from 8509 patients in 231 centers with moderate and severe Traumatic Brain Injury (TBI) enrolled in eight Randomized Controlled Trials (RCTs) and three observational studies. We fitted logistic random effects regression models with the 5-point Glasgow Outcome Scale (GOS) as outcome, both dichotomized as well as ordinal, with center and/or trial as random effects, and as covariates age, motor score, pupil reactivity or trial. We then compared the implementations of frequentist and Bayesian methods to estimate the fixed and random effects. Frequentist approaches included R (lme4), Stata (GLLAMM), SAS (GLIMMIX and NLMIXED), MLwiN ([R]IGLS) and MIXOR, Bayesian approaches included WinBUGS, MLwiN (MCMC), R package MCMCglmm and SAS experimental procedure MCMC. Three data sets (the full data set and two sub-datasets) were analysed using basically two logistic random effects models with either one random effect for the center or two random effects for center and trial. For the ordinal outcome in the full data set also a proportional odds model with a random center effect was fitted. RESULTS: The packages gave similar parameter estimates for both the fixed and random effects and for the binary (and ordinal) models for the main study and when based on a relatively large number of level-1 (patient level) data compared to the number of level-2 (hospital level) data. However, when based on relatively sparse data set, i.e. when the numbers of level-1 and level-2 data units were about the same, the frequentist and Bayesian approaches showed somewhat different results. The software implementations differ considerably in flexibility, computation time, and usability. There are also differences in the availability of additional tools for model evaluation, such as diagnostic plots. The experimental SAS (version 9.2) procedure MCMC appeared to be inefficient. CONCLUSIONS: On relatively large data sets, the different software implementations of logistic random effects regression models produced similar results. Thus, for a large data set there seems to be no explicit preference (of course if there is no preference from a philosophical point of view) for either a frequentist or Bayesian approach (if based on vague priors). The choice for a particular implementation may largely depend on the desired flexibility, and the usability of the package. For small data sets the random effects variances are difficult to estimate. In the frequentist approaches the MLE of this variance was often estimated zero with a standard error that is either zero or could not be determined, while for Bayesian methods the estimates could depend on the chosen "non-informative" prior of the variance parameter. The starting value for the variance parameter may be also critical for the convergence of the Markov chain. BioMed Central 2011-05-23 /pmc/articles/PMC3112198/ /pubmed/21605357 http://dx.doi.org/10.1186/1471-2288-11-77 Text en Copyright ©2011 Li et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Li, Baoyue Lingsma, Hester F Steyerberg, Ewout W Lesaffre, Emmanuel Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title	Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title_full	Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title_fullStr	Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title_full_unstemmed	Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title_short	Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title_sort	logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3112198/ https://www.ncbi.nlm.nih.gov/pubmed/21605357 http://dx.doi.org/10.1186/1471-2288-11-77
work_keys_str_mv	AT libaoyue logisticrandomeffectsregressionmodelsacomparisonofstatisticalpackagesforbinaryandordinaloutcomes AT lingsmahesterf logisticrandomeffectsregressionmodelsacomparisonofstatisticalpackagesforbinaryandordinaloutcomes AT steyerbergewoutw logisticrandomeffectsregressionmodelsacomparisonofstatisticalpackagesforbinaryandordinaloutcomes AT lesaffreemmanuel logisticrandomeffectsregressionmodelsacomparisonofstatisticalpackagesforbinaryandordinaloutcomes

Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes

Ejemplares similares