Cargando…

Using machine learning of clinical data to diagnose COVID-19: a systematic review and meta-analysis

BACKGROUND: The recent Coronavirus Disease 2019 (COVID-19) pandemic has placed severe stress on healthcare systems worldwide, which is amplified by the critical shortage of COVID-19 tests. METHODS: In this study, we propose to generate a more accurate diagnosis model of COVID-19 based on patient sym...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Wei Tse, Ma, Jiayan, Shende, Neil, Castaneda, Grant, Chakladar, Jaideep, Tsai, Joseph C., Apostol, Lauren, Honda, Christine O., Xu, Jingyue, Wong, Lindsay M., Zhang, Tianyi, Lee, Abby, Gnanasekar, Aditi, Honda, Thomas K., Kuo, Selena Z., Yu, Michael Andrew, Chang, Eric Y., Rajasekaran, Mahadevan “ Raj”, Ongkeko, Weg M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7522928/
https://www.ncbi.nlm.nih.gov/pubmed/32993652
http://dx.doi.org/10.1186/s12911-020-01266-z
_version_ 1783588287885082624
author Li, Wei Tse
Ma, Jiayan
Shende, Neil
Castaneda, Grant
Chakladar, Jaideep
Tsai, Joseph C.
Apostol, Lauren
Honda, Christine O.
Xu, Jingyue
Wong, Lindsay M.
Zhang, Tianyi
Lee, Abby
Gnanasekar, Aditi
Honda, Thomas K.
Kuo, Selena Z.
Yu, Michael Andrew
Chang, Eric Y.
Rajasekaran, Mahadevan “ Raj”
Ongkeko, Weg M.
author_facet Li, Wei Tse
Ma, Jiayan
Shende, Neil
Castaneda, Grant
Chakladar, Jaideep
Tsai, Joseph C.
Apostol, Lauren
Honda, Christine O.
Xu, Jingyue
Wong, Lindsay M.
Zhang, Tianyi
Lee, Abby
Gnanasekar, Aditi
Honda, Thomas K.
Kuo, Selena Z.
Yu, Michael Andrew
Chang, Eric Y.
Rajasekaran, Mahadevan “ Raj”
Ongkeko, Weg M.
author_sort Li, Wei Tse
collection PubMed
description BACKGROUND: The recent Coronavirus Disease 2019 (COVID-19) pandemic has placed severe stress on healthcare systems worldwide, which is amplified by the critical shortage of COVID-19 tests. METHODS: In this study, we propose to generate a more accurate diagnosis model of COVID-19 based on patient symptoms and routine test results by applying machine learning to reanalyzing COVID-19 data from 151 published studies. We aim to investigate correlations between clinical variables, cluster COVID-19 patients into subtypes, and generate a computational classification model for discriminating between COVID-19 patients and influenza patients based on clinical variables alone. RESULTS: We discovered several novel associations between clinical variables, including correlations between being male and having higher levels of serum lymphocytes and neutrophils. We found that COVID-19 patients could be clustered into subtypes based on serum levels of immune cells, gender, and reported symptoms. Finally, we trained an XGBoost model to achieve a sensitivity of 92.5% and a specificity of 97.9% in discriminating COVID-19 patients from influenza patients. CONCLUSIONS: We demonstrated that computational methods trained on large clinical datasets could yield ever more accurate COVID-19 diagnostic models to mitigate the impact of lack of testing. We also presented previously unknown COVID-19 clinical variable correlations and clinical subgroups.
format Online
Article
Text
id pubmed-7522928
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-75229282020-09-29 Using machine learning of clinical data to diagnose COVID-19: a systematic review and meta-analysis Li, Wei Tse Ma, Jiayan Shende, Neil Castaneda, Grant Chakladar, Jaideep Tsai, Joseph C. Apostol, Lauren Honda, Christine O. Xu, Jingyue Wong, Lindsay M. Zhang, Tianyi Lee, Abby Gnanasekar, Aditi Honda, Thomas K. Kuo, Selena Z. Yu, Michael Andrew Chang, Eric Y. Rajasekaran, Mahadevan “ Raj” Ongkeko, Weg M. BMC Med Inform Decis Mak Research Article BACKGROUND: The recent Coronavirus Disease 2019 (COVID-19) pandemic has placed severe stress on healthcare systems worldwide, which is amplified by the critical shortage of COVID-19 tests. METHODS: In this study, we propose to generate a more accurate diagnosis model of COVID-19 based on patient symptoms and routine test results by applying machine learning to reanalyzing COVID-19 data from 151 published studies. We aim to investigate correlations between clinical variables, cluster COVID-19 patients into subtypes, and generate a computational classification model for discriminating between COVID-19 patients and influenza patients based on clinical variables alone. RESULTS: We discovered several novel associations between clinical variables, including correlations between being male and having higher levels of serum lymphocytes and neutrophils. We found that COVID-19 patients could be clustered into subtypes based on serum levels of immune cells, gender, and reported symptoms. Finally, we trained an XGBoost model to achieve a sensitivity of 92.5% and a specificity of 97.9% in discriminating COVID-19 patients from influenza patients. CONCLUSIONS: We demonstrated that computational methods trained on large clinical datasets could yield ever more accurate COVID-19 diagnostic models to mitigate the impact of lack of testing. We also presented previously unknown COVID-19 clinical variable correlations and clinical subgroups. BioMed Central 2020-09-29 /pmc/articles/PMC7522928/ /pubmed/32993652 http://dx.doi.org/10.1186/s12911-020-01266-z Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Li, Wei Tse
Ma, Jiayan
Shende, Neil
Castaneda, Grant
Chakladar, Jaideep
Tsai, Joseph C.
Apostol, Lauren
Honda, Christine O.
Xu, Jingyue
Wong, Lindsay M.
Zhang, Tianyi
Lee, Abby
Gnanasekar, Aditi
Honda, Thomas K.
Kuo, Selena Z.
Yu, Michael Andrew
Chang, Eric Y.
Rajasekaran, Mahadevan “ Raj”
Ongkeko, Weg M.
Using machine learning of clinical data to diagnose COVID-19: a systematic review and meta-analysis
title Using machine learning of clinical data to diagnose COVID-19: a systematic review and meta-analysis
title_full Using machine learning of clinical data to diagnose COVID-19: a systematic review and meta-analysis
title_fullStr Using machine learning of clinical data to diagnose COVID-19: a systematic review and meta-analysis
title_full_unstemmed Using machine learning of clinical data to diagnose COVID-19: a systematic review and meta-analysis
title_short Using machine learning of clinical data to diagnose COVID-19: a systematic review and meta-analysis
title_sort using machine learning of clinical data to diagnose covid-19: a systematic review and meta-analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7522928/
https://www.ncbi.nlm.nih.gov/pubmed/32993652
http://dx.doi.org/10.1186/s12911-020-01266-z
work_keys_str_mv AT liweitse usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT majiayan usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT shendeneil usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT castanedagrant usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT chakladarjaideep usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT tsaijosephc usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT apostollauren usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT hondachristineo usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT xujingyue usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT wonglindsaym usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT zhangtianyi usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT leeabby usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT gnanasekaraditi usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT hondathomask usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT kuoselenaz usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT yumichaelandrew usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT changericy usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT rajasekaranmahadevanraj usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis
AT ongkekowegm usingmachinelearningofclinicaldatatodiagnosecovid19asystematicreviewandmetaanalysis