Cargando…

Ensemble learning for poor prognosis predictions: A case study on SARS-CoV-2

OBJECTIVE: Risk prediction models are widely used to inform evidence-based clinical decision making. However, few models developed from single cohorts can perform consistently well at population level where diverse prognoses exist (such as the SARS-CoV-2 [severe acute respiratory syndrome coronaviru...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Honghan, Zhang, Huayu, Karwath, Andreas, Ibrahim, Zina, Shi, Ting, Zhang, Xin, Wang, Kun, Sun, Jiaxing, Dhaliwal, Kevin, Bean, Daniel, Cardoso, Victor Roth, Li, Kezhi, Teo, James T, Banerjee, Amitava, Gao-Smith, Fang, Whitehouse, Tony, Veenith, Tonny, Gkoutos, Georgios V, Wu, Xiaodong, Dobson, Richard, Guthrie, Bruce
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7717299/
https://www.ncbi.nlm.nih.gov/pubmed/33185672
http://dx.doi.org/10.1093/jamia/ocaa295
_version_ 1783619302016942080
author Wu, Honghan
Zhang, Huayu
Karwath, Andreas
Ibrahim, Zina
Shi, Ting
Zhang, Xin
Wang, Kun
Sun, Jiaxing
Dhaliwal, Kevin
Bean, Daniel
Cardoso, Victor Roth
Li, Kezhi
Teo, James T
Banerjee, Amitava
Gao-Smith, Fang
Whitehouse, Tony
Veenith, Tonny
Gkoutos, Georgios V
Wu, Xiaodong
Dobson, Richard
Guthrie, Bruce
author_facet Wu, Honghan
Zhang, Huayu
Karwath, Andreas
Ibrahim, Zina
Shi, Ting
Zhang, Xin
Wang, Kun
Sun, Jiaxing
Dhaliwal, Kevin
Bean, Daniel
Cardoso, Victor Roth
Li, Kezhi
Teo, James T
Banerjee, Amitava
Gao-Smith, Fang
Whitehouse, Tony
Veenith, Tonny
Gkoutos, Georgios V
Wu, Xiaodong
Dobson, Richard
Guthrie, Bruce
author_sort Wu, Honghan
collection PubMed
description OBJECTIVE: Risk prediction models are widely used to inform evidence-based clinical decision making. However, few models developed from single cohorts can perform consistently well at population level where diverse prognoses exist (such as the SARS-CoV-2 [severe acute respiratory syndrome coronavirus 2] pandemic). This study aims at tackling this challenge by synergizing prediction models from the literature using ensemble learning. MATERIALS AND METHODS: In this study, we selected and reimplemented 7 prediction models for COVID-19 (coronavirus disease 2019) that were derived from diverse cohorts and used different implementation techniques. A novel ensemble learning framework was proposed to synergize them for realizing personalized predictions for individual patients. Four diverse international cohorts (2 from the United Kingdom and 2 from China; N = 5394) were used to validate all 8 models on discrimination, calibration, and clinical usefulness. RESULTS: Results showed that individual prediction models could perform well on some cohorts while poorly on others. Conversely, the ensemble model achieved the best performances consistently on all metrics quantifying discrimination, calibration, and clinical usefulness. Performance disparities were observed in cohorts from the 2 countries: all models achieved better performances on the China cohorts. DISCUSSION: When individual models were learned from complementary cohorts, the synergized model had the potential to achieve better performances than any individual model. Results indicate that blood parameters and physiological measurements might have better predictive powers when collected early, which remains to be confirmed by further studies. CONCLUSIONS: Combining a diverse set of individual prediction models, the ensemble method can synergize a robust and well-performing model by choosing the most competent ones for individual patients.
format Online
Article
Text
id pubmed-7717299
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77172992020-12-09 Ensemble learning for poor prognosis predictions: A case study on SARS-CoV-2 Wu, Honghan Zhang, Huayu Karwath, Andreas Ibrahim, Zina Shi, Ting Zhang, Xin Wang, Kun Sun, Jiaxing Dhaliwal, Kevin Bean, Daniel Cardoso, Victor Roth Li, Kezhi Teo, James T Banerjee, Amitava Gao-Smith, Fang Whitehouse, Tony Veenith, Tonny Gkoutos, Georgios V Wu, Xiaodong Dobson, Richard Guthrie, Bruce J Am Med Inform Assoc Research and Applications OBJECTIVE: Risk prediction models are widely used to inform evidence-based clinical decision making. However, few models developed from single cohorts can perform consistently well at population level where diverse prognoses exist (such as the SARS-CoV-2 [severe acute respiratory syndrome coronavirus 2] pandemic). This study aims at tackling this challenge by synergizing prediction models from the literature using ensemble learning. MATERIALS AND METHODS: In this study, we selected and reimplemented 7 prediction models for COVID-19 (coronavirus disease 2019) that were derived from diverse cohorts and used different implementation techniques. A novel ensemble learning framework was proposed to synergize them for realizing personalized predictions for individual patients. Four diverse international cohorts (2 from the United Kingdom and 2 from China; N = 5394) were used to validate all 8 models on discrimination, calibration, and clinical usefulness. RESULTS: Results showed that individual prediction models could perform well on some cohorts while poorly on others. Conversely, the ensemble model achieved the best performances consistently on all metrics quantifying discrimination, calibration, and clinical usefulness. Performance disparities were observed in cohorts from the 2 countries: all models achieved better performances on the China cohorts. DISCUSSION: When individual models were learned from complementary cohorts, the synergized model had the potential to achieve better performances than any individual model. Results indicate that blood parameters and physiological measurements might have better predictive powers when collected early, which remains to be confirmed by further studies. CONCLUSIONS: Combining a diverse set of individual prediction models, the ensemble method can synergize a robust and well-performing model by choosing the most competent ones for individual patients. Oxford University Press 2020-11-13 /pmc/articles/PMC7717299/ /pubmed/33185672 http://dx.doi.org/10.1093/jamia/ocaa295 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research and Applications
Wu, Honghan
Zhang, Huayu
Karwath, Andreas
Ibrahim, Zina
Shi, Ting
Zhang, Xin
Wang, Kun
Sun, Jiaxing
Dhaliwal, Kevin
Bean, Daniel
Cardoso, Victor Roth
Li, Kezhi
Teo, James T
Banerjee, Amitava
Gao-Smith, Fang
Whitehouse, Tony
Veenith, Tonny
Gkoutos, Georgios V
Wu, Xiaodong
Dobson, Richard
Guthrie, Bruce
Ensemble learning for poor prognosis predictions: A case study on SARS-CoV-2
title Ensemble learning for poor prognosis predictions: A case study on SARS-CoV-2
title_full Ensemble learning for poor prognosis predictions: A case study on SARS-CoV-2
title_fullStr Ensemble learning for poor prognosis predictions: A case study on SARS-CoV-2
title_full_unstemmed Ensemble learning for poor prognosis predictions: A case study on SARS-CoV-2
title_short Ensemble learning for poor prognosis predictions: A case study on SARS-CoV-2
title_sort ensemble learning for poor prognosis predictions: a case study on sars-cov-2
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7717299/
https://www.ncbi.nlm.nih.gov/pubmed/33185672
http://dx.doi.org/10.1093/jamia/ocaa295
work_keys_str_mv AT wuhonghan ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT zhanghuayu ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT karwathandreas ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT ibrahimzina ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT shiting ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT zhangxin ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT wangkun ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT sunjiaxing ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT dhaliwalkevin ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT beandaniel ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT cardosovictorroth ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT likezhi ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT teojamest ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT banerjeeamitava ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT gaosmithfang ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT whitehousetony ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT veenithtonny ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT gkoutosgeorgiosv ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT wuxiaodong ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT dobsonrichard ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2
AT guthriebruce ensemblelearningforpoorprognosispredictionsacasestudyonsarscov2