Cargando…

Prediction models for clustered data with informative priors for the random effects: a simulation study

BACKGROUND: Random effects modelling is routinely used in clustered data, but for prediction models, random effects are commonly substituted with the mean zero after model development. In this study, we proposed a novel approach of including prior knowledge through the random effects distribution an...

Descripción completa

Detalles Bibliográficos
Autores principales: Ni, Haifang, Groenwold, Rolf H. H., Nielen, Mirjam, Klugkist, Irene
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6080562/
https://www.ncbi.nlm.nih.gov/pubmed/30081875
http://dx.doi.org/10.1186/s12874-018-0543-5
_version_ 1783345502661640192
author Ni, Haifang
Groenwold, Rolf H. H.
Nielen, Mirjam
Klugkist, Irene
author_facet Ni, Haifang
Groenwold, Rolf H. H.
Nielen, Mirjam
Klugkist, Irene
author_sort Ni, Haifang
collection PubMed
description BACKGROUND: Random effects modelling is routinely used in clustered data, but for prediction models, random effects are commonly substituted with the mean zero after model development. In this study, we proposed a novel approach of including prior knowledge through the random effects distribution and investigated to what extent this could improve the predictive performance. METHODS: Data were simulated on the basis of a random effects logistic regression model. Five prediction models were specified: a frequentist model that set the random effects to zero for all new clusters, a Bayesian model with weakly informative priors for the random effects of new clusters, Bayesian models with expert opinion incorporated into low informative, medium informative and highly informative priors for the random effects. Expert opinion at the cluster level was elicited in the form of a truncated area of the random effects distribution. The predictive performance of the five models was assessed. In addition, impact of suboptimal expert opinion that deviated from the true quantity as well as including expert opinion by means of a categorical variable in the frequentist approach were explored. The five models were further investigated in various sensitivity analyses. RESULTS: The Bayesian prediction model using weakly informative priors for the random effects showed similar results to the frequentist model. Bayesian prediction models using expert opinion as informative priors showed smaller Brier scores, better overall discrimination and calibration, as well as better within cluster calibration. Results also indicated that incorporation of more precise expert opinion led to better predictions. Predictive performance from the frequentist models with expert opinion incorporated as categorical variable showed similar patterns as the Bayesian models with informative priors. When suboptimal expert opinion was used as prior information, results indicated that prediction still improved in certain settings. CONCLUSIONS: The prediction models that incorporated cluster level information showed better performance than the models that did not. The Bayesian prediction models we proposed, with cluster specific expert opinion incorporated as priors for the random effects showed better predictive ability in new data, compared to the frequentist method that replaced random effects with zero after model development. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12874-018-0543-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6080562
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60805622018-08-09 Prediction models for clustered data with informative priors for the random effects: a simulation study Ni, Haifang Groenwold, Rolf H. H. Nielen, Mirjam Klugkist, Irene BMC Med Res Methodol Research Article BACKGROUND: Random effects modelling is routinely used in clustered data, but for prediction models, random effects are commonly substituted with the mean zero after model development. In this study, we proposed a novel approach of including prior knowledge through the random effects distribution and investigated to what extent this could improve the predictive performance. METHODS: Data were simulated on the basis of a random effects logistic regression model. Five prediction models were specified: a frequentist model that set the random effects to zero for all new clusters, a Bayesian model with weakly informative priors for the random effects of new clusters, Bayesian models with expert opinion incorporated into low informative, medium informative and highly informative priors for the random effects. Expert opinion at the cluster level was elicited in the form of a truncated area of the random effects distribution. The predictive performance of the five models was assessed. In addition, impact of suboptimal expert opinion that deviated from the true quantity as well as including expert opinion by means of a categorical variable in the frequentist approach were explored. The five models were further investigated in various sensitivity analyses. RESULTS: The Bayesian prediction model using weakly informative priors for the random effects showed similar results to the frequentist model. Bayesian prediction models using expert opinion as informative priors showed smaller Brier scores, better overall discrimination and calibration, as well as better within cluster calibration. Results also indicated that incorporation of more precise expert opinion led to better predictions. Predictive performance from the frequentist models with expert opinion incorporated as categorical variable showed similar patterns as the Bayesian models with informative priors. When suboptimal expert opinion was used as prior information, results indicated that prediction still improved in certain settings. CONCLUSIONS: The prediction models that incorporated cluster level information showed better performance than the models that did not. The Bayesian prediction models we proposed, with cluster specific expert opinion incorporated as priors for the random effects showed better predictive ability in new data, compared to the frequentist method that replaced random effects with zero after model development. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12874-018-0543-5) contains supplementary material, which is available to authorized users. BioMed Central 2018-08-06 /pmc/articles/PMC6080562/ /pubmed/30081875 http://dx.doi.org/10.1186/s12874-018-0543-5 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ni, Haifang
Groenwold, Rolf H. H.
Nielen, Mirjam
Klugkist, Irene
Prediction models for clustered data with informative priors for the random effects: a simulation study
title Prediction models for clustered data with informative priors for the random effects: a simulation study
title_full Prediction models for clustered data with informative priors for the random effects: a simulation study
title_fullStr Prediction models for clustered data with informative priors for the random effects: a simulation study
title_full_unstemmed Prediction models for clustered data with informative priors for the random effects: a simulation study
title_short Prediction models for clustered data with informative priors for the random effects: a simulation study
title_sort prediction models for clustered data with informative priors for the random effects: a simulation study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6080562/
https://www.ncbi.nlm.nih.gov/pubmed/30081875
http://dx.doi.org/10.1186/s12874-018-0543-5
work_keys_str_mv AT nihaifang predictionmodelsforclustereddatawithinformativepriorsfortherandomeffectsasimulationstudy
AT groenwoldrolfhh predictionmodelsforclustereddatawithinformativepriorsfortherandomeffectsasimulationstudy
AT nielenmirjam predictionmodelsforclustereddatawithinformativepriorsfortherandomeffectsasimulationstudy
AT klugkistirene predictionmodelsforclustereddatawithinformativepriorsfortherandomeffectsasimulationstudy