Cargando…

Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes

BACKGROUND: Logistic random effects models are a popular tool to analyze multilevel also called hierarchical data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models. METHODS: We used individual patient data from 8509 patients in 2...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Baoyue, Lingsma, Hester F, Steyerberg, Ewout W, Lesaffre, Emmanuel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3112198/
https://www.ncbi.nlm.nih.gov/pubmed/21605357
http://dx.doi.org/10.1186/1471-2288-11-77
_version_ 1782205715761332224
author Li, Baoyue
Lingsma, Hester F
Steyerberg, Ewout W
Lesaffre, Emmanuel
author_facet Li, Baoyue
Lingsma, Hester F
Steyerberg, Ewout W
Lesaffre, Emmanuel
author_sort Li, Baoyue
collection PubMed
description BACKGROUND: Logistic random effects models are a popular tool to analyze multilevel also called hierarchical data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models. METHODS: We used individual patient data from 8509 patients in 231 centers with moderate and severe Traumatic Brain Injury (TBI) enrolled in eight Randomized Controlled Trials (RCTs) and three observational studies. We fitted logistic random effects regression models with the 5-point Glasgow Outcome Scale (GOS) as outcome, both dichotomized as well as ordinal, with center and/or trial as random effects, and as covariates age, motor score, pupil reactivity or trial. We then compared the implementations of frequentist and Bayesian methods to estimate the fixed and random effects. Frequentist approaches included R (lme4), Stata (GLLAMM), SAS (GLIMMIX and NLMIXED), MLwiN ([R]IGLS) and MIXOR, Bayesian approaches included WinBUGS, MLwiN (MCMC), R package MCMCglmm and SAS experimental procedure MCMC. Three data sets (the full data set and two sub-datasets) were analysed using basically two logistic random effects models with either one random effect for the center or two random effects for center and trial. For the ordinal outcome in the full data set also a proportional odds model with a random center effect was fitted. RESULTS: The packages gave similar parameter estimates for both the fixed and random effects and for the binary (and ordinal) models for the main study and when based on a relatively large number of level-1 (patient level) data compared to the number of level-2 (hospital level) data. However, when based on relatively sparse data set, i.e. when the numbers of level-1 and level-2 data units were about the same, the frequentist and Bayesian approaches showed somewhat different results. The software implementations differ considerably in flexibility, computation time, and usability. There are also differences in the availability of additional tools for model evaluation, such as diagnostic plots. The experimental SAS (version 9.2) procedure MCMC appeared to be inefficient. CONCLUSIONS: On relatively large data sets, the different software implementations of logistic random effects regression models produced similar results. Thus, for a large data set there seems to be no explicit preference (of course if there is no preference from a philosophical point of view) for either a frequentist or Bayesian approach (if based on vague priors). The choice for a particular implementation may largely depend on the desired flexibility, and the usability of the package. For small data sets the random effects variances are difficult to estimate. In the frequentist approaches the MLE of this variance was often estimated zero with a standard error that is either zero or could not be determined, while for Bayesian methods the estimates could depend on the chosen "non-informative" prior of the variance parameter. The starting value for the variance parameter may be also critical for the convergence of the Markov chain.
format Online
Article
Text
id pubmed-3112198
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31121982011-06-11 Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes Li, Baoyue Lingsma, Hester F Steyerberg, Ewout W Lesaffre, Emmanuel BMC Med Res Methodol Research Article BACKGROUND: Logistic random effects models are a popular tool to analyze multilevel also called hierarchical data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models. METHODS: We used individual patient data from 8509 patients in 231 centers with moderate and severe Traumatic Brain Injury (TBI) enrolled in eight Randomized Controlled Trials (RCTs) and three observational studies. We fitted logistic random effects regression models with the 5-point Glasgow Outcome Scale (GOS) as outcome, both dichotomized as well as ordinal, with center and/or trial as random effects, and as covariates age, motor score, pupil reactivity or trial. We then compared the implementations of frequentist and Bayesian methods to estimate the fixed and random effects. Frequentist approaches included R (lme4), Stata (GLLAMM), SAS (GLIMMIX and NLMIXED), MLwiN ([R]IGLS) and MIXOR, Bayesian approaches included WinBUGS, MLwiN (MCMC), R package MCMCglmm and SAS experimental procedure MCMC. Three data sets (the full data set and two sub-datasets) were analysed using basically two logistic random effects models with either one random effect for the center or two random effects for center and trial. For the ordinal outcome in the full data set also a proportional odds model with a random center effect was fitted. RESULTS: The packages gave similar parameter estimates for both the fixed and random effects and for the binary (and ordinal) models for the main study and when based on a relatively large number of level-1 (patient level) data compared to the number of level-2 (hospital level) data. However, when based on relatively sparse data set, i.e. when the numbers of level-1 and level-2 data units were about the same, the frequentist and Bayesian approaches showed somewhat different results. The software implementations differ considerably in flexibility, computation time, and usability. There are also differences in the availability of additional tools for model evaluation, such as diagnostic plots. The experimental SAS (version 9.2) procedure MCMC appeared to be inefficient. CONCLUSIONS: On relatively large data sets, the different software implementations of logistic random effects regression models produced similar results. Thus, for a large data set there seems to be no explicit preference (of course if there is no preference from a philosophical point of view) for either a frequentist or Bayesian approach (if based on vague priors). The choice for a particular implementation may largely depend on the desired flexibility, and the usability of the package. For small data sets the random effects variances are difficult to estimate. In the frequentist approaches the MLE of this variance was often estimated zero with a standard error that is either zero or could not be determined, while for Bayesian methods the estimates could depend on the chosen "non-informative" prior of the variance parameter. The starting value for the variance parameter may be also critical for the convergence of the Markov chain. BioMed Central 2011-05-23 /pmc/articles/PMC3112198/ /pubmed/21605357 http://dx.doi.org/10.1186/1471-2288-11-77 Text en Copyright ©2011 Li et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Li, Baoyue
Lingsma, Hester F
Steyerberg, Ewout W
Lesaffre, Emmanuel
Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title_full Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title_fullStr Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title_full_unstemmed Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title_short Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
title_sort logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3112198/
https://www.ncbi.nlm.nih.gov/pubmed/21605357
http://dx.doi.org/10.1186/1471-2288-11-77
work_keys_str_mv AT libaoyue logisticrandomeffectsregressionmodelsacomparisonofstatisticalpackagesforbinaryandordinaloutcomes
AT lingsmahesterf logisticrandomeffectsregressionmodelsacomparisonofstatisticalpackagesforbinaryandordinaloutcomes
AT steyerbergewoutw logisticrandomeffectsregressionmodelsacomparisonofstatisticalpackagesforbinaryandordinaloutcomes
AT lesaffreemmanuel logisticrandomeffectsregressionmodelsacomparisonofstatisticalpackagesforbinaryandordinaloutcomes