Cargando…
TransformEHR: transformer-based encoder-decoder generative model to enhance prediction of disease outcomes using electronic health records
Deep learning transformer-based models using longitudinal electronic health records (EHRs) have shown a great success in prediction of clinical diseases or outcomes. Pretraining on a large dataset can help such models map the input space better and boost their performance on relevant tasks through f...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10687211/ https://www.ncbi.nlm.nih.gov/pubmed/38030638 http://dx.doi.org/10.1038/s41467-023-43715-z |
_version_ | 1785151934718541824 |
---|---|
author | Yang, Zhichao Mitra, Avijit Liu, Weisong Berlowitz, Dan Yu, Hong |
author_facet | Yang, Zhichao Mitra, Avijit Liu, Weisong Berlowitz, Dan Yu, Hong |
author_sort | Yang, Zhichao |
collection | PubMed |
description | Deep learning transformer-based models using longitudinal electronic health records (EHRs) have shown a great success in prediction of clinical diseases or outcomes. Pretraining on a large dataset can help such models map the input space better and boost their performance on relevant tasks through finetuning with limited data. In this study, we present TransformEHR, a generative encoder-decoder model with transformer that is pretrained using a new pretraining objective—predicting all diseases and outcomes of a patient at a future visit from previous visits. TransformEHR’s encoder-decoder framework, paired with the novel pretraining objective, helps it achieve the new state-of-the-art performance on multiple clinical prediction tasks. Comparing with the previous model, TransformEHR improves area under the precision–recall curve by 2% (p < 0.001) for pancreatic cancer onset and by 24% (p = 0.007) for intentional self-harm in patients with post-traumatic stress disorder. The high performance in predicting intentional self-harm shows the potential of TransformEHR in building effective clinical intervention systems. TransformEHR is also generalizable and can be easily finetuned for clinical prediction tasks with limited data. |
format | Online Article Text |
id | pubmed-10687211 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-106872112023-11-30 TransformEHR: transformer-based encoder-decoder generative model to enhance prediction of disease outcomes using electronic health records Yang, Zhichao Mitra, Avijit Liu, Weisong Berlowitz, Dan Yu, Hong Nat Commun Article Deep learning transformer-based models using longitudinal electronic health records (EHRs) have shown a great success in prediction of clinical diseases or outcomes. Pretraining on a large dataset can help such models map the input space better and boost their performance on relevant tasks through finetuning with limited data. In this study, we present TransformEHR, a generative encoder-decoder model with transformer that is pretrained using a new pretraining objective—predicting all diseases and outcomes of a patient at a future visit from previous visits. TransformEHR’s encoder-decoder framework, paired with the novel pretraining objective, helps it achieve the new state-of-the-art performance on multiple clinical prediction tasks. Comparing with the previous model, TransformEHR improves area under the precision–recall curve by 2% (p < 0.001) for pancreatic cancer onset and by 24% (p = 0.007) for intentional self-harm in patients with post-traumatic stress disorder. The high performance in predicting intentional self-harm shows the potential of TransformEHR in building effective clinical intervention systems. TransformEHR is also generalizable and can be easily finetuned for clinical prediction tasks with limited data. Nature Publishing Group UK 2023-11-29 /pmc/articles/PMC10687211/ /pubmed/38030638 http://dx.doi.org/10.1038/s41467-023-43715-z Text en © This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Yang, Zhichao Mitra, Avijit Liu, Weisong Berlowitz, Dan Yu, Hong TransformEHR: transformer-based encoder-decoder generative model to enhance prediction of disease outcomes using electronic health records |
title | TransformEHR: transformer-based encoder-decoder generative model to enhance prediction of disease outcomes using electronic health records |
title_full | TransformEHR: transformer-based encoder-decoder generative model to enhance prediction of disease outcomes using electronic health records |
title_fullStr | TransformEHR: transformer-based encoder-decoder generative model to enhance prediction of disease outcomes using electronic health records |
title_full_unstemmed | TransformEHR: transformer-based encoder-decoder generative model to enhance prediction of disease outcomes using electronic health records |
title_short | TransformEHR: transformer-based encoder-decoder generative model to enhance prediction of disease outcomes using electronic health records |
title_sort | transformehr: transformer-based encoder-decoder generative model to enhance prediction of disease outcomes using electronic health records |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10687211/ https://www.ncbi.nlm.nih.gov/pubmed/38030638 http://dx.doi.org/10.1038/s41467-023-43715-z |
work_keys_str_mv | AT yangzhichao transformehrtransformerbasedencoderdecodergenerativemodeltoenhancepredictionofdiseaseoutcomesusingelectronichealthrecords AT mitraavijit transformehrtransformerbasedencoderdecodergenerativemodeltoenhancepredictionofdiseaseoutcomesusingelectronichealthrecords AT liuweisong transformehrtransformerbasedencoderdecodergenerativemodeltoenhancepredictionofdiseaseoutcomesusingelectronichealthrecords AT berlowitzdan transformehrtransformerbasedencoderdecodergenerativemodeltoenhancepredictionofdiseaseoutcomesusingelectronichealthrecords AT yuhong transformehrtransformerbasedencoderdecodergenerativemodeltoenhancepredictionofdiseaseoutcomesusingelectronichealthrecords |