Cargando…
Scalable and accurate deep learning with electronic health records
Predictive modeling with electronic health record (EHR) data is anticipated to drive personalized medicine and improve healthcare quality. Constructing predictive statistical models typically requires extraction of curated predictor variables from normalized EHR data, a labor-intensive process that...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6550175/ https://www.ncbi.nlm.nih.gov/pubmed/31304302 http://dx.doi.org/10.1038/s41746-018-0029-1 |
_version_ | 1783424143810625536 |
---|---|
author | Rajkomar, Alvin Oren, Eyal Chen, Kai Dai, Andrew M. Hajaj, Nissan Hardt, Michaela Liu, Peter J. Liu, Xiaobing Marcus, Jake Sun, Mimi Sundberg, Patrik Yee, Hector Zhang, Kun Zhang, Yi Flores, Gerardo Duggan, Gavin E. Irvine, Jamie Le, Quoc Litsch, Kurt Mossin, Alexander Tansuwan, Justin Wang, De Wexler, James Wilson, Jimbo Ludwig, Dana Volchenboum, Samuel L. Chou, Katherine Pearson, Michael Madabushi, Srinivasan Shah, Nigam H. Butte, Atul J. Howell, Michael D. Cui, Claire Corrado, Greg S. Dean, Jeffrey |
author_facet | Rajkomar, Alvin Oren, Eyal Chen, Kai Dai, Andrew M. Hajaj, Nissan Hardt, Michaela Liu, Peter J. Liu, Xiaobing Marcus, Jake Sun, Mimi Sundberg, Patrik Yee, Hector Zhang, Kun Zhang, Yi Flores, Gerardo Duggan, Gavin E. Irvine, Jamie Le, Quoc Litsch, Kurt Mossin, Alexander Tansuwan, Justin Wang, De Wexler, James Wilson, Jimbo Ludwig, Dana Volchenboum, Samuel L. Chou, Katherine Pearson, Michael Madabushi, Srinivasan Shah, Nigam H. Butte, Atul J. Howell, Michael D. Cui, Claire Corrado, Greg S. Dean, Jeffrey |
author_sort | Rajkomar, Alvin |
collection | PubMed |
description | Predictive modeling with electronic health record (EHR) data is anticipated to drive personalized medicine and improve healthcare quality. Constructing predictive statistical models typically requires extraction of curated predictor variables from normalized EHR data, a labor-intensive process that discards the vast majority of information in each patient’s record. We propose a representation of patients’ entire raw EHR records based on the Fast Healthcare Interoperability Resources (FHIR) format. We demonstrate that deep learning methods using this representation are capable of accurately predicting multiple medical events from multiple centers without site-specific data harmonization. We validated our approach using de-identified EHR data from two US academic medical centers with 216,221 adult patients hospitalized for at least 24 h. In the sequential format we propose, this volume of EHR data unrolled into a total of 46,864,534,945 data points, including clinical notes. Deep learning models achieved high accuracy for tasks such as predicting: in-hospital mortality (area under the receiver operator curve [AUROC] across sites 0.93–0.94), 30-day unplanned readmission (AUROC 0.75–0.76), prolonged length of stay (AUROC 0.85–0.86), and all of a patient’s final discharge diagnoses (frequency-weighted AUROC 0.90). These models outperformed traditional, clinically-used predictive models in all cases. We believe that this approach can be used to create accurate and scalable predictions for a variety of clinical scenarios. In a case study of a particular prediction, we demonstrate that neural networks can be used to identify relevant information from the patient’s chart. |
format | Online Article Text |
id | pubmed-6550175 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-65501752019-07-12 Scalable and accurate deep learning with electronic health records Rajkomar, Alvin Oren, Eyal Chen, Kai Dai, Andrew M. Hajaj, Nissan Hardt, Michaela Liu, Peter J. Liu, Xiaobing Marcus, Jake Sun, Mimi Sundberg, Patrik Yee, Hector Zhang, Kun Zhang, Yi Flores, Gerardo Duggan, Gavin E. Irvine, Jamie Le, Quoc Litsch, Kurt Mossin, Alexander Tansuwan, Justin Wang, De Wexler, James Wilson, Jimbo Ludwig, Dana Volchenboum, Samuel L. Chou, Katherine Pearson, Michael Madabushi, Srinivasan Shah, Nigam H. Butte, Atul J. Howell, Michael D. Cui, Claire Corrado, Greg S. Dean, Jeffrey NPJ Digit Med Article Predictive modeling with electronic health record (EHR) data is anticipated to drive personalized medicine and improve healthcare quality. Constructing predictive statistical models typically requires extraction of curated predictor variables from normalized EHR data, a labor-intensive process that discards the vast majority of information in each patient’s record. We propose a representation of patients’ entire raw EHR records based on the Fast Healthcare Interoperability Resources (FHIR) format. We demonstrate that deep learning methods using this representation are capable of accurately predicting multiple medical events from multiple centers without site-specific data harmonization. We validated our approach using de-identified EHR data from two US academic medical centers with 216,221 adult patients hospitalized for at least 24 h. In the sequential format we propose, this volume of EHR data unrolled into a total of 46,864,534,945 data points, including clinical notes. Deep learning models achieved high accuracy for tasks such as predicting: in-hospital mortality (area under the receiver operator curve [AUROC] across sites 0.93–0.94), 30-day unplanned readmission (AUROC 0.75–0.76), prolonged length of stay (AUROC 0.85–0.86), and all of a patient’s final discharge diagnoses (frequency-weighted AUROC 0.90). These models outperformed traditional, clinically-used predictive models in all cases. We believe that this approach can be used to create accurate and scalable predictions for a variety of clinical scenarios. In a case study of a particular prediction, we demonstrate that neural networks can be used to identify relevant information from the patient’s chart. Nature Publishing Group UK 2018-05-08 /pmc/articles/PMC6550175/ /pubmed/31304302 http://dx.doi.org/10.1038/s41746-018-0029-1 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Rajkomar, Alvin Oren, Eyal Chen, Kai Dai, Andrew M. Hajaj, Nissan Hardt, Michaela Liu, Peter J. Liu, Xiaobing Marcus, Jake Sun, Mimi Sundberg, Patrik Yee, Hector Zhang, Kun Zhang, Yi Flores, Gerardo Duggan, Gavin E. Irvine, Jamie Le, Quoc Litsch, Kurt Mossin, Alexander Tansuwan, Justin Wang, De Wexler, James Wilson, Jimbo Ludwig, Dana Volchenboum, Samuel L. Chou, Katherine Pearson, Michael Madabushi, Srinivasan Shah, Nigam H. Butte, Atul J. Howell, Michael D. Cui, Claire Corrado, Greg S. Dean, Jeffrey Scalable and accurate deep learning with electronic health records |
title | Scalable and accurate deep learning with electronic health records |
title_full | Scalable and accurate deep learning with electronic health records |
title_fullStr | Scalable and accurate deep learning with electronic health records |
title_full_unstemmed | Scalable and accurate deep learning with electronic health records |
title_short | Scalable and accurate deep learning with electronic health records |
title_sort | scalable and accurate deep learning with electronic health records |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6550175/ https://www.ncbi.nlm.nih.gov/pubmed/31304302 http://dx.doi.org/10.1038/s41746-018-0029-1 |
work_keys_str_mv | AT rajkomaralvin scalableandaccuratedeeplearningwithelectronichealthrecords AT oreneyal scalableandaccuratedeeplearningwithelectronichealthrecords AT chenkai scalableandaccuratedeeplearningwithelectronichealthrecords AT daiandrewm scalableandaccuratedeeplearningwithelectronichealthrecords AT hajajnissan scalableandaccuratedeeplearningwithelectronichealthrecords AT hardtmichaela scalableandaccuratedeeplearningwithelectronichealthrecords AT liupeterj scalableandaccuratedeeplearningwithelectronichealthrecords AT liuxiaobing scalableandaccuratedeeplearningwithelectronichealthrecords AT marcusjake scalableandaccuratedeeplearningwithelectronichealthrecords AT sunmimi scalableandaccuratedeeplearningwithelectronichealthrecords AT sundbergpatrik scalableandaccuratedeeplearningwithelectronichealthrecords AT yeehector scalableandaccuratedeeplearningwithelectronichealthrecords AT zhangkun scalableandaccuratedeeplearningwithelectronichealthrecords AT zhangyi scalableandaccuratedeeplearningwithelectronichealthrecords AT floresgerardo scalableandaccuratedeeplearningwithelectronichealthrecords AT duggangavine scalableandaccuratedeeplearningwithelectronichealthrecords AT irvinejamie scalableandaccuratedeeplearningwithelectronichealthrecords AT lequoc scalableandaccuratedeeplearningwithelectronichealthrecords AT litschkurt scalableandaccuratedeeplearningwithelectronichealthrecords AT mossinalexander scalableandaccuratedeeplearningwithelectronichealthrecords AT tansuwanjustin scalableandaccuratedeeplearningwithelectronichealthrecords AT wangde scalableandaccuratedeeplearningwithelectronichealthrecords AT wexlerjames scalableandaccuratedeeplearningwithelectronichealthrecords AT wilsonjimbo scalableandaccuratedeeplearningwithelectronichealthrecords AT ludwigdana scalableandaccuratedeeplearningwithelectronichealthrecords AT volchenboumsamuell scalableandaccuratedeeplearningwithelectronichealthrecords AT choukatherine scalableandaccuratedeeplearningwithelectronichealthrecords AT pearsonmichael scalableandaccuratedeeplearningwithelectronichealthrecords AT madabushisrinivasan scalableandaccuratedeeplearningwithelectronichealthrecords AT shahnigamh scalableandaccuratedeeplearningwithelectronichealthrecords AT butteatulj scalableandaccuratedeeplearningwithelectronichealthrecords AT howellmichaeld scalableandaccuratedeeplearningwithelectronichealthrecords AT cuiclaire scalableandaccuratedeeplearningwithelectronichealthrecords AT corradogregs scalableandaccuratedeeplearningwithelectronichealthrecords AT deanjeffrey scalableandaccuratedeeplearningwithelectronichealthrecords |