Cargando…

Scalable and accurate deep learning with electronic health records

Predictive modeling with electronic health record (EHR) data is anticipated to drive personalized medicine and improve healthcare quality. Constructing predictive statistical models typically requires extraction of curated predictor variables from normalized EHR data, a labor-intensive process that...

Descripción completa

Detalles Bibliográficos
Autores principales: Rajkomar, Alvin, Oren, Eyal, Chen, Kai, Dai, Andrew M., Hajaj, Nissan, Hardt, Michaela, Liu, Peter J., Liu, Xiaobing, Marcus, Jake, Sun, Mimi, Sundberg, Patrik, Yee, Hector, Zhang, Kun, Zhang, Yi, Flores, Gerardo, Duggan, Gavin E., Irvine, Jamie, Le, Quoc, Litsch, Kurt, Mossin, Alexander, Tansuwan, Justin, Wang, De, Wexler, James, Wilson, Jimbo, Ludwig, Dana, Volchenboum, Samuel L., Chou, Katherine, Pearson, Michael, Madabushi, Srinivasan, Shah, Nigam H., Butte, Atul J., Howell, Michael D., Cui, Claire, Corrado, Greg S., Dean, Jeffrey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6550175/
https://www.ncbi.nlm.nih.gov/pubmed/31304302
http://dx.doi.org/10.1038/s41746-018-0029-1
_version_ 1783424143810625536
author Rajkomar, Alvin
Oren, Eyal
Chen, Kai
Dai, Andrew M.
Hajaj, Nissan
Hardt, Michaela
Liu, Peter J.
Liu, Xiaobing
Marcus, Jake
Sun, Mimi
Sundberg, Patrik
Yee, Hector
Zhang, Kun
Zhang, Yi
Flores, Gerardo
Duggan, Gavin E.
Irvine, Jamie
Le, Quoc
Litsch, Kurt
Mossin, Alexander
Tansuwan, Justin
Wang, De
Wexler, James
Wilson, Jimbo
Ludwig, Dana
Volchenboum, Samuel L.
Chou, Katherine
Pearson, Michael
Madabushi, Srinivasan
Shah, Nigam H.
Butte, Atul J.
Howell, Michael D.
Cui, Claire
Corrado, Greg S.
Dean, Jeffrey
author_facet Rajkomar, Alvin
Oren, Eyal
Chen, Kai
Dai, Andrew M.
Hajaj, Nissan
Hardt, Michaela
Liu, Peter J.
Liu, Xiaobing
Marcus, Jake
Sun, Mimi
Sundberg, Patrik
Yee, Hector
Zhang, Kun
Zhang, Yi
Flores, Gerardo
Duggan, Gavin E.
Irvine, Jamie
Le, Quoc
Litsch, Kurt
Mossin, Alexander
Tansuwan, Justin
Wang, De
Wexler, James
Wilson, Jimbo
Ludwig, Dana
Volchenboum, Samuel L.
Chou, Katherine
Pearson, Michael
Madabushi, Srinivasan
Shah, Nigam H.
Butte, Atul J.
Howell, Michael D.
Cui, Claire
Corrado, Greg S.
Dean, Jeffrey
author_sort Rajkomar, Alvin
collection PubMed
description Predictive modeling with electronic health record (EHR) data is anticipated to drive personalized medicine and improve healthcare quality. Constructing predictive statistical models typically requires extraction of curated predictor variables from normalized EHR data, a labor-intensive process that discards the vast majority of information in each patient’s record. We propose a representation of patients’ entire raw EHR records based on the Fast Healthcare Interoperability Resources (FHIR) format. We demonstrate that deep learning methods using this representation are capable of accurately predicting multiple medical events from multiple centers without site-specific data harmonization. We validated our approach using de-identified EHR data from two US academic medical centers with 216,221 adult patients hospitalized for at least 24 h. In the sequential format we propose, this volume of EHR data unrolled into a total of 46,864,534,945 data points, including clinical notes. Deep learning models achieved high accuracy for tasks such as predicting: in-hospital mortality (area under the receiver operator curve [AUROC] across sites 0.93–0.94), 30-day unplanned readmission (AUROC 0.75–0.76), prolonged length of stay (AUROC 0.85–0.86), and all of a patient’s final discharge diagnoses (frequency-weighted AUROC 0.90). These models outperformed traditional, clinically-used predictive models in all cases. We believe that this approach can be used to create accurate and scalable predictions for a variety of clinical scenarios. In a case study of a particular prediction, we demonstrate that neural networks can be used to identify relevant information from the patient’s chart.
format Online
Article
Text
id pubmed-6550175
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-65501752019-07-12 Scalable and accurate deep learning with electronic health records Rajkomar, Alvin Oren, Eyal Chen, Kai Dai, Andrew M. Hajaj, Nissan Hardt, Michaela Liu, Peter J. Liu, Xiaobing Marcus, Jake Sun, Mimi Sundberg, Patrik Yee, Hector Zhang, Kun Zhang, Yi Flores, Gerardo Duggan, Gavin E. Irvine, Jamie Le, Quoc Litsch, Kurt Mossin, Alexander Tansuwan, Justin Wang, De Wexler, James Wilson, Jimbo Ludwig, Dana Volchenboum, Samuel L. Chou, Katherine Pearson, Michael Madabushi, Srinivasan Shah, Nigam H. Butte, Atul J. Howell, Michael D. Cui, Claire Corrado, Greg S. Dean, Jeffrey NPJ Digit Med Article Predictive modeling with electronic health record (EHR) data is anticipated to drive personalized medicine and improve healthcare quality. Constructing predictive statistical models typically requires extraction of curated predictor variables from normalized EHR data, a labor-intensive process that discards the vast majority of information in each patient’s record. We propose a representation of patients’ entire raw EHR records based on the Fast Healthcare Interoperability Resources (FHIR) format. We demonstrate that deep learning methods using this representation are capable of accurately predicting multiple medical events from multiple centers without site-specific data harmonization. We validated our approach using de-identified EHR data from two US academic medical centers with 216,221 adult patients hospitalized for at least 24 h. In the sequential format we propose, this volume of EHR data unrolled into a total of 46,864,534,945 data points, including clinical notes. Deep learning models achieved high accuracy for tasks such as predicting: in-hospital mortality (area under the receiver operator curve [AUROC] across sites 0.93–0.94), 30-day unplanned readmission (AUROC 0.75–0.76), prolonged length of stay (AUROC 0.85–0.86), and all of a patient’s final discharge diagnoses (frequency-weighted AUROC 0.90). These models outperformed traditional, clinically-used predictive models in all cases. We believe that this approach can be used to create accurate and scalable predictions for a variety of clinical scenarios. In a case study of a particular prediction, we demonstrate that neural networks can be used to identify relevant information from the patient’s chart. Nature Publishing Group UK 2018-05-08 /pmc/articles/PMC6550175/ /pubmed/31304302 http://dx.doi.org/10.1038/s41746-018-0029-1 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Rajkomar, Alvin
Oren, Eyal
Chen, Kai
Dai, Andrew M.
Hajaj, Nissan
Hardt, Michaela
Liu, Peter J.
Liu, Xiaobing
Marcus, Jake
Sun, Mimi
Sundberg, Patrik
Yee, Hector
Zhang, Kun
Zhang, Yi
Flores, Gerardo
Duggan, Gavin E.
Irvine, Jamie
Le, Quoc
Litsch, Kurt
Mossin, Alexander
Tansuwan, Justin
Wang, De
Wexler, James
Wilson, Jimbo
Ludwig, Dana
Volchenboum, Samuel L.
Chou, Katherine
Pearson, Michael
Madabushi, Srinivasan
Shah, Nigam H.
Butte, Atul J.
Howell, Michael D.
Cui, Claire
Corrado, Greg S.
Dean, Jeffrey
Scalable and accurate deep learning with electronic health records
title Scalable and accurate deep learning with electronic health records
title_full Scalable and accurate deep learning with electronic health records
title_fullStr Scalable and accurate deep learning with electronic health records
title_full_unstemmed Scalable and accurate deep learning with electronic health records
title_short Scalable and accurate deep learning with electronic health records
title_sort scalable and accurate deep learning with electronic health records
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6550175/
https://www.ncbi.nlm.nih.gov/pubmed/31304302
http://dx.doi.org/10.1038/s41746-018-0029-1
work_keys_str_mv AT rajkomaralvin scalableandaccuratedeeplearningwithelectronichealthrecords
AT oreneyal scalableandaccuratedeeplearningwithelectronichealthrecords
AT chenkai scalableandaccuratedeeplearningwithelectronichealthrecords
AT daiandrewm scalableandaccuratedeeplearningwithelectronichealthrecords
AT hajajnissan scalableandaccuratedeeplearningwithelectronichealthrecords
AT hardtmichaela scalableandaccuratedeeplearningwithelectronichealthrecords
AT liupeterj scalableandaccuratedeeplearningwithelectronichealthrecords
AT liuxiaobing scalableandaccuratedeeplearningwithelectronichealthrecords
AT marcusjake scalableandaccuratedeeplearningwithelectronichealthrecords
AT sunmimi scalableandaccuratedeeplearningwithelectronichealthrecords
AT sundbergpatrik scalableandaccuratedeeplearningwithelectronichealthrecords
AT yeehector scalableandaccuratedeeplearningwithelectronichealthrecords
AT zhangkun scalableandaccuratedeeplearningwithelectronichealthrecords
AT zhangyi scalableandaccuratedeeplearningwithelectronichealthrecords
AT floresgerardo scalableandaccuratedeeplearningwithelectronichealthrecords
AT duggangavine scalableandaccuratedeeplearningwithelectronichealthrecords
AT irvinejamie scalableandaccuratedeeplearningwithelectronichealthrecords
AT lequoc scalableandaccuratedeeplearningwithelectronichealthrecords
AT litschkurt scalableandaccuratedeeplearningwithelectronichealthrecords
AT mossinalexander scalableandaccuratedeeplearningwithelectronichealthrecords
AT tansuwanjustin scalableandaccuratedeeplearningwithelectronichealthrecords
AT wangde scalableandaccuratedeeplearningwithelectronichealthrecords
AT wexlerjames scalableandaccuratedeeplearningwithelectronichealthrecords
AT wilsonjimbo scalableandaccuratedeeplearningwithelectronichealthrecords
AT ludwigdana scalableandaccuratedeeplearningwithelectronichealthrecords
AT volchenboumsamuell scalableandaccuratedeeplearningwithelectronichealthrecords
AT choukatherine scalableandaccuratedeeplearningwithelectronichealthrecords
AT pearsonmichael scalableandaccuratedeeplearningwithelectronichealthrecords
AT madabushisrinivasan scalableandaccuratedeeplearningwithelectronichealthrecords
AT shahnigamh scalableandaccuratedeeplearningwithelectronichealthrecords
AT butteatulj scalableandaccuratedeeplearningwithelectronichealthrecords
AT howellmichaeld scalableandaccuratedeeplearningwithelectronichealthrecords
AT cuiclaire scalableandaccuratedeeplearningwithelectronichealthrecords
AT corradogregs scalableandaccuratedeeplearningwithelectronichealthrecords
AT deanjeffrey scalableandaccuratedeeplearningwithelectronichealthrecords