Cargando…
Leveraging graph-based hierarchical medical entity embedding for healthcare applications
Automatic representation learning of key entities in electronic health record (EHR) data is a critical step for healthcare data mining that turns heterogeneous medical records into structured and actionable information. Here we propose ME2Vec, an algorithmic framework for learning continuous low-dim...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7955058/ https://www.ncbi.nlm.nih.gov/pubmed/33712670 http://dx.doi.org/10.1038/s41598-021-85255-w |
_version_ | 1783664182683500544 |
---|---|
author | Wu, Tong Wang, Yunlong Wang, Yue Zhao, Emily Yuan, Yilian |
author_facet | Wu, Tong Wang, Yunlong Wang, Yue Zhao, Emily Yuan, Yilian |
author_sort | Wu, Tong |
collection | PubMed |
description | Automatic representation learning of key entities in electronic health record (EHR) data is a critical step for healthcare data mining that turns heterogeneous medical records into structured and actionable information. Here we propose ME2Vec, an algorithmic framework for learning continuous low-dimensional embedding vectors of the most common entities in EHR: medical services, doctors, and patients. ME2Vec features a hierarchical structure that encapsulates different node embedding schemes to cater for the unique characteristic of each medical entity. To embed medical services, we employ a biased-random-walk-based node embedding that leverages the irregular time intervals of medical services in EHR to embody their relative importance. To embed doctors and patients, we adhere to the principle “it’s what you do that defines you” and derive their embeddings based on their interactions with other types of entities through graph neural network and proximity-preserving network embedding, respectively. Using real-world clinical data, we demonstrate the efficacy of ME2Vec over competitive baselines on diagnosis prediction, readmission prediction, as well as recommending doctors to patients based on their medical conditions. In addition, medical service embeddings pretrained using ME2Vec can substantially improve the performance of sequential models in predicting patients clinical outcomes. Overall, ME2Vec can serve as a general-purpose representation learning algorithm for EHR data and benefit various downstream tasks in terms of both performance and interpretability. |
format | Online Article Text |
id | pubmed-7955058 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-79550582021-03-15 Leveraging graph-based hierarchical medical entity embedding for healthcare applications Wu, Tong Wang, Yunlong Wang, Yue Zhao, Emily Yuan, Yilian Sci Rep Article Automatic representation learning of key entities in electronic health record (EHR) data is a critical step for healthcare data mining that turns heterogeneous medical records into structured and actionable information. Here we propose ME2Vec, an algorithmic framework for learning continuous low-dimensional embedding vectors of the most common entities in EHR: medical services, doctors, and patients. ME2Vec features a hierarchical structure that encapsulates different node embedding schemes to cater for the unique characteristic of each medical entity. To embed medical services, we employ a biased-random-walk-based node embedding that leverages the irregular time intervals of medical services in EHR to embody their relative importance. To embed doctors and patients, we adhere to the principle “it’s what you do that defines you” and derive their embeddings based on their interactions with other types of entities through graph neural network and proximity-preserving network embedding, respectively. Using real-world clinical data, we demonstrate the efficacy of ME2Vec over competitive baselines on diagnosis prediction, readmission prediction, as well as recommending doctors to patients based on their medical conditions. In addition, medical service embeddings pretrained using ME2Vec can substantially improve the performance of sequential models in predicting patients clinical outcomes. Overall, ME2Vec can serve as a general-purpose representation learning algorithm for EHR data and benefit various downstream tasks in terms of both performance and interpretability. Nature Publishing Group UK 2021-03-12 /pmc/articles/PMC7955058/ /pubmed/33712670 http://dx.doi.org/10.1038/s41598-021-85255-w Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Wu, Tong Wang, Yunlong Wang, Yue Zhao, Emily Yuan, Yilian Leveraging graph-based hierarchical medical entity embedding for healthcare applications |
title | Leveraging graph-based hierarchical medical entity embedding for healthcare applications |
title_full | Leveraging graph-based hierarchical medical entity embedding for healthcare applications |
title_fullStr | Leveraging graph-based hierarchical medical entity embedding for healthcare applications |
title_full_unstemmed | Leveraging graph-based hierarchical medical entity embedding for healthcare applications |
title_short | Leveraging graph-based hierarchical medical entity embedding for healthcare applications |
title_sort | leveraging graph-based hierarchical medical entity embedding for healthcare applications |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7955058/ https://www.ncbi.nlm.nih.gov/pubmed/33712670 http://dx.doi.org/10.1038/s41598-021-85255-w |
work_keys_str_mv | AT wutong leveraginggraphbasedhierarchicalmedicalentityembeddingforhealthcareapplications AT wangyunlong leveraginggraphbasedhierarchicalmedicalentityembeddingforhealthcareapplications AT wangyue leveraginggraphbasedhierarchicalmedicalentityembeddingforhealthcareapplications AT zhaoemily leveraginggraphbasedhierarchicalmedicalentityembeddingforhealthcareapplications AT yuanyilian leveraginggraphbasedhierarchicalmedicalentityembeddingforhealthcareapplications |