Cargando…
Federated Learning of Electronic Health Records to Improve Mortality Prediction in Hospitalized Patients With COVID-19: Machine Learning Approach
BACKGROUND: Machine learning models require large datasets that may be siloed across different health care institutions. Machine learning studies that focus on COVID-19 have been limited to single-hospital data, which limits model generalizability. OBJECTIVE: We aimed to use federated learning, a ma...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7842859/ https://www.ncbi.nlm.nih.gov/pubmed/33400679 http://dx.doi.org/10.2196/24207 |
_version_ | 1783644025105940480 |
---|---|
author | Vaid, Akhil Jaladanki, Suraj K Xu, Jie Teng, Shelly Kumar, Arvind Lee, Samuel Somani, Sulaiman Paranjpe, Ishan De Freitas, Jessica K Wanyan, Tingyi Johnson, Kipp W Bicak, Mesude Klang, Eyal Kwon, Young Joon Costa, Anthony Zhao, Shan Miotto, Riccardo Charney, Alexander W Böttinger, Erwin Fayad, Zahi A Nadkarni, Girish N Wang, Fei Glicksberg, Benjamin S |
author_facet | Vaid, Akhil Jaladanki, Suraj K Xu, Jie Teng, Shelly Kumar, Arvind Lee, Samuel Somani, Sulaiman Paranjpe, Ishan De Freitas, Jessica K Wanyan, Tingyi Johnson, Kipp W Bicak, Mesude Klang, Eyal Kwon, Young Joon Costa, Anthony Zhao, Shan Miotto, Riccardo Charney, Alexander W Böttinger, Erwin Fayad, Zahi A Nadkarni, Girish N Wang, Fei Glicksberg, Benjamin S |
author_sort | Vaid, Akhil |
collection | PubMed |
description | BACKGROUND: Machine learning models require large datasets that may be siloed across different health care institutions. Machine learning studies that focus on COVID-19 have been limited to single-hospital data, which limits model generalizability. OBJECTIVE: We aimed to use federated learning, a machine learning technique that avoids locally aggregating raw clinical data across multiple institutions, to predict mortality in hospitalized patients with COVID-19 within 7 days. METHODS: Patient data were collected from the electronic health records of 5 hospitals within the Mount Sinai Health System. Logistic regression with L1 regularization/least absolute shrinkage and selection operator (LASSO) and multilayer perceptron (MLP) models were trained by using local data at each site. We developed a pooled model with combined data from all 5 sites, and a federated model that only shared parameters with a central aggregator. RESULTS: The LASSO(federated) model outperformed the LASSO(local) model at 3 hospitals, and the MLP(federated) model performed better than the MLP(local) model at all 5 hospitals, as determined by the area under the receiver operating characteristic curve. The LASSO(pooled) model outperformed the LASSO(federated) model at all hospitals, and the MLP(federated) model outperformed the MLP(pooled) model at 2 hospitals. CONCLUSIONS: The federated learning of COVID-19 electronic health record data shows promise in developing robust predictive models without compromising patient privacy. |
format | Online Article Text |
id | pubmed-7842859 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-78428592021-01-29 Federated Learning of Electronic Health Records to Improve Mortality Prediction in Hospitalized Patients With COVID-19: Machine Learning Approach Vaid, Akhil Jaladanki, Suraj K Xu, Jie Teng, Shelly Kumar, Arvind Lee, Samuel Somani, Sulaiman Paranjpe, Ishan De Freitas, Jessica K Wanyan, Tingyi Johnson, Kipp W Bicak, Mesude Klang, Eyal Kwon, Young Joon Costa, Anthony Zhao, Shan Miotto, Riccardo Charney, Alexander W Böttinger, Erwin Fayad, Zahi A Nadkarni, Girish N Wang, Fei Glicksberg, Benjamin S JMIR Med Inform Original Paper BACKGROUND: Machine learning models require large datasets that may be siloed across different health care institutions. Machine learning studies that focus on COVID-19 have been limited to single-hospital data, which limits model generalizability. OBJECTIVE: We aimed to use federated learning, a machine learning technique that avoids locally aggregating raw clinical data across multiple institutions, to predict mortality in hospitalized patients with COVID-19 within 7 days. METHODS: Patient data were collected from the electronic health records of 5 hospitals within the Mount Sinai Health System. Logistic regression with L1 regularization/least absolute shrinkage and selection operator (LASSO) and multilayer perceptron (MLP) models were trained by using local data at each site. We developed a pooled model with combined data from all 5 sites, and a federated model that only shared parameters with a central aggregator. RESULTS: The LASSO(federated) model outperformed the LASSO(local) model at 3 hospitals, and the MLP(federated) model performed better than the MLP(local) model at all 5 hospitals, as determined by the area under the receiver operating characteristic curve. The LASSO(pooled) model outperformed the LASSO(federated) model at all hospitals, and the MLP(federated) model outperformed the MLP(pooled) model at 2 hospitals. CONCLUSIONS: The federated learning of COVID-19 electronic health record data shows promise in developing robust predictive models without compromising patient privacy. JMIR Publications 2021-01-27 /pmc/articles/PMC7842859/ /pubmed/33400679 http://dx.doi.org/10.2196/24207 Text en ©Akhil Vaid, Suraj K Jaladanki, Jie Xu, Shelly Teng, Arvind Kumar, Samuel Lee, Sulaiman Somani, Ishan Paranjpe, Jessica K De Freitas, Tingyi Wanyan, Kipp W Johnson, Mesude Bicak, Eyal Klang, Young Joon Kwon, Anthony Costa, Shan Zhao, Riccardo Miotto, Alexander W Charney, Erwin Böttinger, Zahi A Fayad, Girish N Nadkarni, Fei Wang, Benjamin S Glicksberg. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 27.01.2021. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Vaid, Akhil Jaladanki, Suraj K Xu, Jie Teng, Shelly Kumar, Arvind Lee, Samuel Somani, Sulaiman Paranjpe, Ishan De Freitas, Jessica K Wanyan, Tingyi Johnson, Kipp W Bicak, Mesude Klang, Eyal Kwon, Young Joon Costa, Anthony Zhao, Shan Miotto, Riccardo Charney, Alexander W Böttinger, Erwin Fayad, Zahi A Nadkarni, Girish N Wang, Fei Glicksberg, Benjamin S Federated Learning of Electronic Health Records to Improve Mortality Prediction in Hospitalized Patients With COVID-19: Machine Learning Approach |
title | Federated Learning of Electronic Health Records to Improve Mortality Prediction in Hospitalized Patients With COVID-19: Machine Learning Approach |
title_full | Federated Learning of Electronic Health Records to Improve Mortality Prediction in Hospitalized Patients With COVID-19: Machine Learning Approach |
title_fullStr | Federated Learning of Electronic Health Records to Improve Mortality Prediction in Hospitalized Patients With COVID-19: Machine Learning Approach |
title_full_unstemmed | Federated Learning of Electronic Health Records to Improve Mortality Prediction in Hospitalized Patients With COVID-19: Machine Learning Approach |
title_short | Federated Learning of Electronic Health Records to Improve Mortality Prediction in Hospitalized Patients With COVID-19: Machine Learning Approach |
title_sort | federated learning of electronic health records to improve mortality prediction in hospitalized patients with covid-19: machine learning approach |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7842859/ https://www.ncbi.nlm.nih.gov/pubmed/33400679 http://dx.doi.org/10.2196/24207 |
work_keys_str_mv | AT vaidakhil federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT jaladankisurajk federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT xujie federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT tengshelly federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT kumararvind federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT leesamuel federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT somanisulaiman federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT paranjpeishan federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT defreitasjessicak federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT wanyantingyi federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT johnsonkippw federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT bicakmesude federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT klangeyal federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT kwonyoungjoon federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT costaanthony federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT zhaoshan federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT miottoriccardo federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT charneyalexanderw federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT bottingererwin federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT fayadzahia federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT nadkarnigirishn federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT wangfei federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach AT glicksbergbenjamins federatedlearningofelectronichealthrecordstoimprovemortalitypredictioninhospitalizedpatientswithcovid19machinelearningapproach |