Cargando…

A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort study

Risk scores are widely used for clinical decision making and commonly generated from logistic regression models. Machine-learning-based methods may work well for identifying important predictors to create parsimonious scores, but such ‘black box’ variable selection limits interpretability, and varia...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ning, Yilin, Li, Siqi, Ong, Marcus Eng Hock, Xie, Feng, Chakraborty, Bibhas, Ting, Daniel Shu Wei, Liu, Nan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931273/ https://www.ncbi.nlm.nih.gov/pubmed/36812536 http://dx.doi.org/10.1371/journal.pdig.0000062

_version_	1784889212987768832
author	Ning, Yilin Li, Siqi Ong, Marcus Eng Hock Xie, Feng Chakraborty, Bibhas Ting, Daniel Shu Wei Liu, Nan
author_facet	Ning, Yilin Li, Siqi Ong, Marcus Eng Hock Xie, Feng Chakraborty, Bibhas Ting, Daniel Shu Wei Liu, Nan
author_sort	Ning, Yilin
collection	PubMed
description	Risk scores are widely used for clinical decision making and commonly generated from logistic regression models. Machine-learning-based methods may work well for identifying important predictors to create parsimonious scores, but such ‘black box’ variable selection limits interpretability, and variable importance evaluated from a single model can be biased. We propose a robust and interpretable variable selection approach using the recently developed Shapley variable importance cloud (ShapleyVIC) that accounts for variability in variable importance across models. Our approach evaluates and visualizes overall variable contributions for in-depth inference and transparent variable selection, and filters out non-significant contributors to simplify model building steps. We derive an ensemble variable ranking from variable contributions across models, which is easily integrated with an automated and modularized risk score generator, AutoScore, for convenient implementation. In a study of early death or unplanned readmission after hospital discharge, ShapleyVIC selected 6 variables from 41 candidates to create a well-performing risk score, which had similar performance to a 16-variable model from machine-learning-based ranking. Our work contributes to the recent emphasis on interpretability of prediction models for high-stakes decision making, providing a disciplined solution to detailed assessment of variable importance and transparent development of parsimonious clinical risk scores.
format	Online Article Text
id	pubmed-9931273
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-99312732023-02-16 A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort study Ning, Yilin Li, Siqi Ong, Marcus Eng Hock Xie, Feng Chakraborty, Bibhas Ting, Daniel Shu Wei Liu, Nan PLOS Digit Health Research Article Risk scores are widely used for clinical decision making and commonly generated from logistic regression models. Machine-learning-based methods may work well for identifying important predictors to create parsimonious scores, but such ‘black box’ variable selection limits interpretability, and variable importance evaluated from a single model can be biased. We propose a robust and interpretable variable selection approach using the recently developed Shapley variable importance cloud (ShapleyVIC) that accounts for variability in variable importance across models. Our approach evaluates and visualizes overall variable contributions for in-depth inference and transparent variable selection, and filters out non-significant contributors to simplify model building steps. We derive an ensemble variable ranking from variable contributions across models, which is easily integrated with an automated and modularized risk score generator, AutoScore, for convenient implementation. In a study of early death or unplanned readmission after hospital discharge, ShapleyVIC selected 6 variables from 41 candidates to create a well-performing risk score, which had similar performance to a 16-variable model from machine-learning-based ranking. Our work contributes to the recent emphasis on interpretability of prediction models for high-stakes decision making, providing a disciplined solution to detailed assessment of variable importance and transparent development of parsimonious clinical risk scores. Public Library of Science 2022-06-13 /pmc/articles/PMC9931273/ /pubmed/36812536 http://dx.doi.org/10.1371/journal.pdig.0000062 Text en © 2022 Ning et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Ning, Yilin Li, Siqi Ong, Marcus Eng Hock Xie, Feng Chakraborty, Bibhas Ting, Daniel Shu Wei Liu, Nan A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort study
title	A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort study
title_full	A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort study
title_fullStr	A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort study
title_full_unstemmed	A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort study
title_short	A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort study
title_sort	novel interpretable machine learning system to generate clinical risk scores: an application for predicting early mortality or unplanned readmission in a retrospective cohort study
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931273/ https://www.ncbi.nlm.nih.gov/pubmed/36812536 http://dx.doi.org/10.1371/journal.pdig.0000062
work_keys_str_mv	AT ningyilin anovelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT lisiqi anovelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT ongmarcusenghock anovelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT xiefeng anovelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT chakrabortybibhas anovelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT tingdanielshuwei anovelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT liunan anovelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT ningyilin novelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT lisiqi novelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT ongmarcusenghock novelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT xiefeng novelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT chakrabortybibhas novelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT tingdanielshuwei novelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy AT liunan novelinterpretablemachinelearningsystemtogenerateclinicalriskscoresanapplicationforpredictingearlymortalityorunplannedreadmissioninaretrospectivecohortstudy

A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort study

Ejemplares similares