Cargando…

A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization

IMPORTANCE: Machine learning could be used to predict the likelihood of diagnosis and severity of illness. Lack of COVID-19 patient data has hindered the data science community in developing models to aid in the response to the pandemic. OBJECTIVES: To describe the rapid development and evaluation o...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yan, Yao, Schaffter, Thomas, Bergquist, Timothy, Yu, Thomas, Prosser, Justin, Aydin, Zafer, Jabeer, Amhar, Brugere, Ivan, Gao, Jifan, Chen, Guanhua, Causey, Jason, Yao, Yuxin, Bryson, Kevin, Long, Dustin R., Jarvik, Jeffrey G., Lee, Christoph I., Wilcox, Adam, Guinney, Justin, Mooney, Sean
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	American Medical Association 2021
Materias:	Original Investigation
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8506231/ https://www.ncbi.nlm.nih.gov/pubmed/34633425 http://dx.doi.org/10.1001/jamanetworkopen.2021.24946

_version_	1784581694162993152
author	Yan, Yao Schaffter, Thomas Bergquist, Timothy Yu, Thomas Prosser, Justin Aydin, Zafer Jabeer, Amhar Brugere, Ivan Gao, Jifan Chen, Guanhua Causey, Jason Yao, Yuxin Bryson, Kevin Long, Dustin R. Jarvik, Jeffrey G. Lee, Christoph I. Wilcox, Adam Guinney, Justin Mooney, Sean
author_facet	Yan, Yao Schaffter, Thomas Bergquist, Timothy Yu, Thomas Prosser, Justin Aydin, Zafer Jabeer, Amhar Brugere, Ivan Gao, Jifan Chen, Guanhua Causey, Jason Yao, Yuxin Bryson, Kevin Long, Dustin R. Jarvik, Jeffrey G. Lee, Christoph I. Wilcox, Adam Guinney, Justin Mooney, Sean
author_sort	Yan, Yao
collection	PubMed
description	IMPORTANCE: Machine learning could be used to predict the likelihood of diagnosis and severity of illness. Lack of COVID-19 patient data has hindered the data science community in developing models to aid in the response to the pandemic. OBJECTIVES: To describe the rapid development and evaluation of clinical algorithms to predict COVID-19 diagnosis and hospitalization using patient data by citizen scientists, provide an unbiased assessment of model performance, and benchmark model performance on subgroups. DESIGN, SETTING, AND PARTICIPANTS: This diagnostic and prognostic study operated a continuous, crowdsourced challenge using a model-to-data approach to securely enable the use of regularly updated COVID-19 patient data from the University of Washington by participants from May 6 to December 23, 2020. A postchallenge analysis was conducted from December 24, 2020, to April 7, 2021, to assess the generalizability of models on the cumulative data set as well as subgroups stratified by age, sex, race, and time of COVID-19 test. By December 23, 2020, this challenge engaged 482 participants from 90 teams and 7 countries. MAIN OUTCOMES AND MEASURES: Machine learning algorithms used patient data and output a score that represented the probability of patients receiving a positive COVID-19 test result or being hospitalized within 21 days after receiving a positive COVID-19 test result. Algorithms were evaluated using area under the receiver operating characteristic curve (AUROC) and area under the precision recall curve (AUPRC) scores. Ensemble models aggregating models from the top challenge teams were developed and evaluated. RESULTS: In the analysis using the cumulative data set, the best performance for COVID-19 diagnosis prediction was an AUROC of 0.776 (95% CI, 0.775-0.777) and an AUPRC of 0.297, and for hospitalization prediction, an AUROC of 0.796 (95% CI, 0.794-0.798) and an AUPRC of 0.188. Analysis on top models submitting to the challenge showed consistently better model performance on the female group than the male group. Among all age groups, the best performance was obtained for the 25- to 49-year age group, and the worst performance was obtained for the group aged 17 years or younger. CONCLUSIONS AND RELEVANCE: In this diagnostic and prognostic study, models submitted by citizen scientists achieved high performance for the prediction of COVID-19 testing and hospitalization outcomes. Evaluation of challenge models on demographic subgroups and prospective data revealed performance discrepancies, providing insights into the potential bias and limitations in the models.
format	Online Article Text
id	pubmed-8506231
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	American Medical Association
record_format	MEDLINE/PubMed
spelling	pubmed-85062312021-10-27 A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization Yan, Yao Schaffter, Thomas Bergquist, Timothy Yu, Thomas Prosser, Justin Aydin, Zafer Jabeer, Amhar Brugere, Ivan Gao, Jifan Chen, Guanhua Causey, Jason Yao, Yuxin Bryson, Kevin Long, Dustin R. Jarvik, Jeffrey G. Lee, Christoph I. Wilcox, Adam Guinney, Justin Mooney, Sean JAMA Netw Open Original Investigation IMPORTANCE: Machine learning could be used to predict the likelihood of diagnosis and severity of illness. Lack of COVID-19 patient data has hindered the data science community in developing models to aid in the response to the pandemic. OBJECTIVES: To describe the rapid development and evaluation of clinical algorithms to predict COVID-19 diagnosis and hospitalization using patient data by citizen scientists, provide an unbiased assessment of model performance, and benchmark model performance on subgroups. DESIGN, SETTING, AND PARTICIPANTS: This diagnostic and prognostic study operated a continuous, crowdsourced challenge using a model-to-data approach to securely enable the use of regularly updated COVID-19 patient data from the University of Washington by participants from May 6 to December 23, 2020. A postchallenge analysis was conducted from December 24, 2020, to April 7, 2021, to assess the generalizability of models on the cumulative data set as well as subgroups stratified by age, sex, race, and time of COVID-19 test. By December 23, 2020, this challenge engaged 482 participants from 90 teams and 7 countries. MAIN OUTCOMES AND MEASURES: Machine learning algorithms used patient data and output a score that represented the probability of patients receiving a positive COVID-19 test result or being hospitalized within 21 days after receiving a positive COVID-19 test result. Algorithms were evaluated using area under the receiver operating characteristic curve (AUROC) and area under the precision recall curve (AUPRC) scores. Ensemble models aggregating models from the top challenge teams were developed and evaluated. RESULTS: In the analysis using the cumulative data set, the best performance for COVID-19 diagnosis prediction was an AUROC of 0.776 (95% CI, 0.775-0.777) and an AUPRC of 0.297, and for hospitalization prediction, an AUROC of 0.796 (95% CI, 0.794-0.798) and an AUPRC of 0.188. Analysis on top models submitting to the challenge showed consistently better model performance on the female group than the male group. Among all age groups, the best performance was obtained for the 25- to 49-year age group, and the worst performance was obtained for the group aged 17 years or younger. CONCLUSIONS AND RELEVANCE: In this diagnostic and prognostic study, models submitted by citizen scientists achieved high performance for the prediction of COVID-19 testing and hospitalization outcomes. Evaluation of challenge models on demographic subgroups and prospective data revealed performance discrepancies, providing insights into the potential bias and limitations in the models. American Medical Association 2021-10-11 /pmc/articles/PMC8506231/ /pubmed/34633425 http://dx.doi.org/10.1001/jamanetworkopen.2021.24946 Text en Copyright 2021 Yan Y et al. JAMA Network Open. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the CC-BY License.
spellingShingle	Original Investigation Yan, Yao Schaffter, Thomas Bergquist, Timothy Yu, Thomas Prosser, Justin Aydin, Zafer Jabeer, Amhar Brugere, Ivan Gao, Jifan Chen, Guanhua Causey, Jason Yao, Yuxin Bryson, Kevin Long, Dustin R. Jarvik, Jeffrey G. Lee, Christoph I. Wilcox, Adam Guinney, Justin Mooney, Sean A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization
title	A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization
title_full	A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization
title_fullStr	A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization
title_full_unstemmed	A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization
title_short	A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization
title_sort	continuously benchmarked and crowdsourced challenge for rapid development and evaluation of models to predict covid-19 diagnosis and hospitalization
topic	Original Investigation
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8506231/ https://www.ncbi.nlm.nih.gov/pubmed/34633425 http://dx.doi.org/10.1001/jamanetworkopen.2021.24946
work_keys_str_mv	AT yanyao acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT schaffterthomas acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT bergquisttimothy acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT yuthomas acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT prosserjustin acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT aydinzafer acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT jabeeramhar acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT brugereivan acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT gaojifan acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT chenguanhua acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT causeyjason acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT yaoyuxin acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT brysonkevin acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT longdustinr acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT jarvikjeffreyg acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT leechristophi acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT wilcoxadam acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT guinneyjustin acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT mooneysean acontinuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT yanyao continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT schaffterthomas continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT bergquisttimothy continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT yuthomas continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT prosserjustin continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT aydinzafer continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT jabeeramhar continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT brugereivan continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT gaojifan continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT chenguanhua continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT causeyjason continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT yaoyuxin continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT brysonkevin continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT longdustinr continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT jarvikjeffreyg continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT leechristophi continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT wilcoxadam continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT guinneyjustin continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization AT mooneysean continuouslybenchmarkedandcrowdsourcedchallengeforrapiddevelopmentandevaluationofmodelstopredictcovid19diagnosisandhospitalization

A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization

Ejemplares similares