Cargando…
Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records
Secondary use of electronic health records (EHRs) promises to advance clinical research and better inform clinical decision making. Challenges in summarizing and representing patient data prevent widespread practice of predictive modeling using EHRs. Here we present a novel unsupervised deep feature...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4869115/ https://www.ncbi.nlm.nih.gov/pubmed/27185194 http://dx.doi.org/10.1038/srep26094 |
_version_ | 1782432260428922880 |
---|---|
author | Miotto, Riccardo Li, Li Kidd, Brian A. Dudley, Joel T. |
author_facet | Miotto, Riccardo Li, Li Kidd, Brian A. Dudley, Joel T. |
author_sort | Miotto, Riccardo |
collection | PubMed |
description | Secondary use of electronic health records (EHRs) promises to advance clinical research and better inform clinical decision making. Challenges in summarizing and representing patient data prevent widespread practice of predictive modeling using EHRs. Here we present a novel unsupervised deep feature learning method to derive a general-purpose patient representation from EHR data that facilitates clinical predictive modeling. In particular, a three-layer stack of denoising autoencoders was used to capture hierarchical regularities and dependencies in the aggregated EHRs of about 700,000 patients from the Mount Sinai data warehouse. The result is a representation we name “deep patient”. We evaluated this representation as broadly predictive of health states by assessing the probability of patients to develop various diseases. We performed evaluation using 76,214 test patients comprising 78 diseases from diverse clinical domains and temporal windows. Our results significantly outperformed those achieved using representations based on raw EHR data and alternative feature learning strategies. Prediction performance for severe diabetes, schizophrenia, and various cancers were among the top performing. These findings indicate that deep learning applied to EHRs can derive patient representations that offer improved clinical predictions, and could provide a machine learning framework for augmenting clinical decision systems. |
format | Online Article Text |
id | pubmed-4869115 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-48691152016-06-01 Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records Miotto, Riccardo Li, Li Kidd, Brian A. Dudley, Joel T. Sci Rep Article Secondary use of electronic health records (EHRs) promises to advance clinical research and better inform clinical decision making. Challenges in summarizing and representing patient data prevent widespread practice of predictive modeling using EHRs. Here we present a novel unsupervised deep feature learning method to derive a general-purpose patient representation from EHR data that facilitates clinical predictive modeling. In particular, a three-layer stack of denoising autoencoders was used to capture hierarchical regularities and dependencies in the aggregated EHRs of about 700,000 patients from the Mount Sinai data warehouse. The result is a representation we name “deep patient”. We evaluated this representation as broadly predictive of health states by assessing the probability of patients to develop various diseases. We performed evaluation using 76,214 test patients comprising 78 diseases from diverse clinical domains and temporal windows. Our results significantly outperformed those achieved using representations based on raw EHR data and alternative feature learning strategies. Prediction performance for severe diabetes, schizophrenia, and various cancers were among the top performing. These findings indicate that deep learning applied to EHRs can derive patient representations that offer improved clinical predictions, and could provide a machine learning framework for augmenting clinical decision systems. Nature Publishing Group 2016-05-17 /pmc/articles/PMC4869115/ /pubmed/27185194 http://dx.doi.org/10.1038/srep26094 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Miotto, Riccardo Li, Li Kidd, Brian A. Dudley, Joel T. Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records |
title | Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records |
title_full | Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records |
title_fullStr | Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records |
title_full_unstemmed | Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records |
title_short | Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records |
title_sort | deep patient: an unsupervised representation to predict the future of patients from the electronic health records |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4869115/ https://www.ncbi.nlm.nih.gov/pubmed/27185194 http://dx.doi.org/10.1038/srep26094 |
work_keys_str_mv | AT miottoriccardo deeppatientanunsupervisedrepresentationtopredictthefutureofpatientsfromtheelectronichealthrecords AT lili deeppatientanunsupervisedrepresentationtopredictthefutureofpatientsfromtheelectronichealthrecords AT kiddbriana deeppatientanunsupervisedrepresentationtopredictthefutureofpatientsfromtheelectronichealthrecords AT dudleyjoelt deeppatientanunsupervisedrepresentationtopredictthefutureofpatientsfromtheelectronichealthrecords |