Cargando…

Leveraging graph-based hierarchical medical entity embedding for healthcare applications

Automatic representation learning of key entities in electronic health record (EHR) data is a critical step for healthcare data mining that turns heterogeneous medical records into structured and actionable information. Here we propose ME2Vec, an algorithmic framework for learning continuous low-dim...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Tong, Wang, Yunlong, Wang, Yue, Zhao, Emily, Yuan, Yilian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7955058/
https://www.ncbi.nlm.nih.gov/pubmed/33712670
http://dx.doi.org/10.1038/s41598-021-85255-w
_version_ 1783664182683500544
author Wu, Tong
Wang, Yunlong
Wang, Yue
Zhao, Emily
Yuan, Yilian
author_facet Wu, Tong
Wang, Yunlong
Wang, Yue
Zhao, Emily
Yuan, Yilian
author_sort Wu, Tong
collection PubMed
description Automatic representation learning of key entities in electronic health record (EHR) data is a critical step for healthcare data mining that turns heterogeneous medical records into structured and actionable information. Here we propose ME2Vec, an algorithmic framework for learning continuous low-dimensional embedding vectors of the most common entities in EHR: medical services, doctors, and patients. ME2Vec features a hierarchical structure that encapsulates different node embedding schemes to cater for the unique characteristic of each medical entity. To embed medical services, we employ a biased-random-walk-based node embedding that leverages the irregular time intervals of medical services in EHR to embody their relative importance. To embed doctors and patients, we adhere to the principle “it’s what you do that defines you” and derive their embeddings based on their interactions with other types of entities through graph neural network and proximity-preserving network embedding, respectively. Using real-world clinical data, we demonstrate the efficacy of ME2Vec over competitive baselines on diagnosis prediction, readmission prediction, as well as recommending doctors to patients based on their medical conditions. In addition, medical service embeddings pretrained using ME2Vec can substantially improve the performance of sequential models in predicting patients clinical outcomes. Overall, ME2Vec can serve as a general-purpose representation learning algorithm for EHR data and benefit various downstream tasks in terms of both performance and interpretability.
format Online
Article
Text
id pubmed-7955058
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-79550582021-03-15 Leveraging graph-based hierarchical medical entity embedding for healthcare applications Wu, Tong Wang, Yunlong Wang, Yue Zhao, Emily Yuan, Yilian Sci Rep Article Automatic representation learning of key entities in electronic health record (EHR) data is a critical step for healthcare data mining that turns heterogeneous medical records into structured and actionable information. Here we propose ME2Vec, an algorithmic framework for learning continuous low-dimensional embedding vectors of the most common entities in EHR: medical services, doctors, and patients. ME2Vec features a hierarchical structure that encapsulates different node embedding schemes to cater for the unique characteristic of each medical entity. To embed medical services, we employ a biased-random-walk-based node embedding that leverages the irregular time intervals of medical services in EHR to embody their relative importance. To embed doctors and patients, we adhere to the principle “it’s what you do that defines you” and derive their embeddings based on their interactions with other types of entities through graph neural network and proximity-preserving network embedding, respectively. Using real-world clinical data, we demonstrate the efficacy of ME2Vec over competitive baselines on diagnosis prediction, readmission prediction, as well as recommending doctors to patients based on their medical conditions. In addition, medical service embeddings pretrained using ME2Vec can substantially improve the performance of sequential models in predicting patients clinical outcomes. Overall, ME2Vec can serve as a general-purpose representation learning algorithm for EHR data and benefit various downstream tasks in terms of both performance and interpretability. Nature Publishing Group UK 2021-03-12 /pmc/articles/PMC7955058/ /pubmed/33712670 http://dx.doi.org/10.1038/s41598-021-85255-w Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Wu, Tong
Wang, Yunlong
Wang, Yue
Zhao, Emily
Yuan, Yilian
Leveraging graph-based hierarchical medical entity embedding for healthcare applications
title Leveraging graph-based hierarchical medical entity embedding for healthcare applications
title_full Leveraging graph-based hierarchical medical entity embedding for healthcare applications
title_fullStr Leveraging graph-based hierarchical medical entity embedding for healthcare applications
title_full_unstemmed Leveraging graph-based hierarchical medical entity embedding for healthcare applications
title_short Leveraging graph-based hierarchical medical entity embedding for healthcare applications
title_sort leveraging graph-based hierarchical medical entity embedding for healthcare applications
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7955058/
https://www.ncbi.nlm.nih.gov/pubmed/33712670
http://dx.doi.org/10.1038/s41598-021-85255-w
work_keys_str_mv AT wutong leveraginggraphbasedhierarchicalmedicalentityembeddingforhealthcareapplications
AT wangyunlong leveraginggraphbasedhierarchicalmedicalentityembeddingforhealthcareapplications
AT wangyue leveraginggraphbasedhierarchicalmedicalentityembeddingforhealthcareapplications
AT zhaoemily leveraginggraphbasedhierarchicalmedicalentityembeddingforhealthcareapplications
AT yuanyilian leveraginggraphbasedhierarchicalmedicalentityembeddingforhealthcareapplications