Cargando…
EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism
Efficiently learning representations of clinical concepts (i. e., symptoms, lab test, etc.) from unstructured clinical notes of electronic health record (EHR) data remain significant challenges, since each patient may have multiple visits at different times and each visit may contain different seque...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7344186/ https://www.ncbi.nlm.nih.gov/pubmed/32714371 http://dx.doi.org/10.3389/fgene.2020.00630 |
_version_ | 1783555892158922752 |
---|---|
author | Wang, Li Wang, Qinghua Bai, Heming Liu, Cong Liu, Wei Zhang, Yuanpeng Jiang, Lei Xu, Huji Wang, Kai Zhou, Yunyun |
author_facet | Wang, Li Wang, Qinghua Bai, Heming Liu, Cong Liu, Wei Zhang, Yuanpeng Jiang, Lei Xu, Huji Wang, Kai Zhou, Yunyun |
author_sort | Wang, Li |
collection | PubMed |
description | Efficiently learning representations of clinical concepts (i. e., symptoms, lab test, etc.) from unstructured clinical notes of electronic health record (EHR) data remain significant challenges, since each patient may have multiple visits at different times and each visit may contain different sequential concepts. Therefore, learning distributed representations from temporal patterns of clinical notes is an essential step for downstream applications on EHR data. However, existing methods for EHR representation learning can not adequately capture either contextual information per-visit or temporal information at multiple visits. In this study, we developed a new vector embedding method called EHR2Vec that can learn semantically-meaningful representations of clinical concepts. EHR2Vec incorporated the self-attention structure and showed its utility in accurately identifying relevant clinical concept entities considering time sequence information from multiple visits. Using EHR data from systemic lupus erythematosus (SLE) patients as a case study, we showed EHR2Vec outperforms in identifying interpretable representations compared to other well-known methods including Word2Vec and Med2Vec, according to clinical experts' evaluations. |
format | Online Article Text |
id | pubmed-7344186 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-73441862020-07-25 EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism Wang, Li Wang, Qinghua Bai, Heming Liu, Cong Liu, Wei Zhang, Yuanpeng Jiang, Lei Xu, Huji Wang, Kai Zhou, Yunyun Front Genet Genetics Efficiently learning representations of clinical concepts (i. e., symptoms, lab test, etc.) from unstructured clinical notes of electronic health record (EHR) data remain significant challenges, since each patient may have multiple visits at different times and each visit may contain different sequential concepts. Therefore, learning distributed representations from temporal patterns of clinical notes is an essential step for downstream applications on EHR data. However, existing methods for EHR representation learning can not adequately capture either contextual information per-visit or temporal information at multiple visits. In this study, we developed a new vector embedding method called EHR2Vec that can learn semantically-meaningful representations of clinical concepts. EHR2Vec incorporated the self-attention structure and showed its utility in accurately identifying relevant clinical concept entities considering time sequence information from multiple visits. Using EHR data from systemic lupus erythematosus (SLE) patients as a case study, we showed EHR2Vec outperforms in identifying interpretable representations compared to other well-known methods including Word2Vec and Med2Vec, according to clinical experts' evaluations. Frontiers Media S.A. 2020-06-29 /pmc/articles/PMC7344186/ /pubmed/32714371 http://dx.doi.org/10.3389/fgene.2020.00630 Text en Copyright © 2020 Wang, Wang, Bai, Liu, Liu, Zhang, Jiang, Xu, Wang and Zhou. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Wang, Li Wang, Qinghua Bai, Heming Liu, Cong Liu, Wei Zhang, Yuanpeng Jiang, Lei Xu, Huji Wang, Kai Zhou, Yunyun EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism |
title | EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism |
title_full | EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism |
title_fullStr | EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism |
title_full_unstemmed | EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism |
title_short | EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism |
title_sort | ehr2vec: representation learning of medical concepts from temporal patterns of clinical notes based on self-attention mechanism |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7344186/ https://www.ncbi.nlm.nih.gov/pubmed/32714371 http://dx.doi.org/10.3389/fgene.2020.00630 |
work_keys_str_mv | AT wangli ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism AT wangqinghua ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism AT baiheming ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism AT liucong ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism AT liuwei ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism AT zhangyuanpeng ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism AT jianglei ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism AT xuhuji ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism AT wangkai ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism AT zhouyunyun ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism |