Cargando…

Applications of Bayesian shrinkage prior models in clinical research with categorical responses

BACKGROUND: Prediction and classification algorithms are commonly used in clinical research for identifying patients susceptible to clinical conditions such as diabetes, colon cancer, and Alzheimer’s disease. Developing accurate prediction and classification methods benefits personalized medicine. B...

Descripción completa

Detalles Bibliográficos
Autores principales: Bhattacharyya, Arinjita, Pal, Subhadip, Mitra, Riten, Rai, Shesh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9046716/
https://www.ncbi.nlm.nih.gov/pubmed/35484507
http://dx.doi.org/10.1186/s12874-022-01560-6
_version_ 1784695570705678336
author Bhattacharyya, Arinjita
Pal, Subhadip
Mitra, Riten
Rai, Shesh
author_facet Bhattacharyya, Arinjita
Pal, Subhadip
Mitra, Riten
Rai, Shesh
author_sort Bhattacharyya, Arinjita
collection PubMed
description BACKGROUND: Prediction and classification algorithms are commonly used in clinical research for identifying patients susceptible to clinical conditions such as diabetes, colon cancer, and Alzheimer’s disease. Developing accurate prediction and classification methods benefits personalized medicine. Building an excellent predictive model involves selecting the features that are most significantly associated with the outcome. These features can include several biological and demographic characteristics, such as genomic biomarkers and health history. Such variable selection becomes challenging when the number of potential predictors is large. Bayesian shrinkage models have emerged as popular and flexible methods of variable selection in regression settings. This work discusses variable selection with three shrinkage priors and illustrates its application to clinical data such as Pima Indians Diabetes, Colon cancer, ADNI, and OASIS Alzheimer’s real-world data. METHODS: A unified Bayesian hierarchical framework that implements and compares shrinkage priors in binary and multinomial logistic regression models is presented. The key feature is the representation of the likelihood by a Polya-Gamma data augmentation, which admits a natural integration with a family of shrinkage priors, specifically focusing on Horseshoe, Dirichlet Laplace, and Double Pareto priors. Extensive simulation studies are conducted to assess the performances under different data dimensions and parameter settings. Measures of accuracy, AUC, brier score, L1 error, cross-entropy, and ROC surface plots are used as evaluation criteria comparing the priors with frequentist methods as Lasso, Elastic-Net, and Ridge regression. RESULTS: All three priors can be used for robust prediction on significant metrics, irrespective of their categorical response model choices. Simulation studies could achieve the mean prediction accuracy of 91.6% (95% CI: 88.5, 94.7) and 76.5% (95% CI: 69.3, 83.8) for logistic regression and multinomial logistic models, respectively. The model can identify significant variables for disease risk prediction and is computationally efficient. CONCLUSIONS: The models are robust enough to conduct both variable selection and prediction because of their high shrinkage properties and applicability to a broad range of classification problems. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12874-022-01560-6).
format Online
Article
Text
id pubmed-9046716
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-90467162022-04-28 Applications of Bayesian shrinkage prior models in clinical research with categorical responses Bhattacharyya, Arinjita Pal, Subhadip Mitra, Riten Rai, Shesh BMC Med Res Methodol Research BACKGROUND: Prediction and classification algorithms are commonly used in clinical research for identifying patients susceptible to clinical conditions such as diabetes, colon cancer, and Alzheimer’s disease. Developing accurate prediction and classification methods benefits personalized medicine. Building an excellent predictive model involves selecting the features that are most significantly associated with the outcome. These features can include several biological and demographic characteristics, such as genomic biomarkers and health history. Such variable selection becomes challenging when the number of potential predictors is large. Bayesian shrinkage models have emerged as popular and flexible methods of variable selection in regression settings. This work discusses variable selection with three shrinkage priors and illustrates its application to clinical data such as Pima Indians Diabetes, Colon cancer, ADNI, and OASIS Alzheimer’s real-world data. METHODS: A unified Bayesian hierarchical framework that implements and compares shrinkage priors in binary and multinomial logistic regression models is presented. The key feature is the representation of the likelihood by a Polya-Gamma data augmentation, which admits a natural integration with a family of shrinkage priors, specifically focusing on Horseshoe, Dirichlet Laplace, and Double Pareto priors. Extensive simulation studies are conducted to assess the performances under different data dimensions and parameter settings. Measures of accuracy, AUC, brier score, L1 error, cross-entropy, and ROC surface plots are used as evaluation criteria comparing the priors with frequentist methods as Lasso, Elastic-Net, and Ridge regression. RESULTS: All three priors can be used for robust prediction on significant metrics, irrespective of their categorical response model choices. Simulation studies could achieve the mean prediction accuracy of 91.6% (95% CI: 88.5, 94.7) and 76.5% (95% CI: 69.3, 83.8) for logistic regression and multinomial logistic models, respectively. The model can identify significant variables for disease risk prediction and is computationally efficient. CONCLUSIONS: The models are robust enough to conduct both variable selection and prediction because of their high shrinkage properties and applicability to a broad range of classification problems. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12874-022-01560-6). BioMed Central 2022-04-28 /pmc/articles/PMC9046716/ /pubmed/35484507 http://dx.doi.org/10.1186/s12874-022-01560-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Bhattacharyya, Arinjita
Pal, Subhadip
Mitra, Riten
Rai, Shesh
Applications of Bayesian shrinkage prior models in clinical research with categorical responses
title Applications of Bayesian shrinkage prior models in clinical research with categorical responses
title_full Applications of Bayesian shrinkage prior models in clinical research with categorical responses
title_fullStr Applications of Bayesian shrinkage prior models in clinical research with categorical responses
title_full_unstemmed Applications of Bayesian shrinkage prior models in clinical research with categorical responses
title_short Applications of Bayesian shrinkage prior models in clinical research with categorical responses
title_sort applications of bayesian shrinkage prior models in clinical research with categorical responses
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9046716/
https://www.ncbi.nlm.nih.gov/pubmed/35484507
http://dx.doi.org/10.1186/s12874-022-01560-6
work_keys_str_mv AT bhattacharyyaarinjita applicationsofbayesianshrinkagepriormodelsinclinicalresearchwithcategoricalresponses
AT palsubhadip applicationsofbayesianshrinkagepriormodelsinclinicalresearchwithcategoricalresponses
AT mitrariten applicationsofbayesianshrinkagepriormodelsinclinicalresearchwithcategoricalresponses
AT raishesh applicationsofbayesianshrinkagepriormodelsinclinicalresearchwithcategoricalresponses