Cargando…

EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism

Efficiently learning representations of clinical concepts (i. e., symptoms, lab test, etc.) from unstructured clinical notes of electronic health record (EHR) data remain significant challenges, since each patient may have multiple visits at different times and each visit may contain different seque...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Li, Wang, Qinghua, Bai, Heming, Liu, Cong, Liu, Wei, Zhang, Yuanpeng, Jiang, Lei, Xu, Huji, Wang, Kai, Zhou, Yunyun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7344186/
https://www.ncbi.nlm.nih.gov/pubmed/32714371
http://dx.doi.org/10.3389/fgene.2020.00630
_version_ 1783555892158922752
author Wang, Li
Wang, Qinghua
Bai, Heming
Liu, Cong
Liu, Wei
Zhang, Yuanpeng
Jiang, Lei
Xu, Huji
Wang, Kai
Zhou, Yunyun
author_facet Wang, Li
Wang, Qinghua
Bai, Heming
Liu, Cong
Liu, Wei
Zhang, Yuanpeng
Jiang, Lei
Xu, Huji
Wang, Kai
Zhou, Yunyun
author_sort Wang, Li
collection PubMed
description Efficiently learning representations of clinical concepts (i. e., symptoms, lab test, etc.) from unstructured clinical notes of electronic health record (EHR) data remain significant challenges, since each patient may have multiple visits at different times and each visit may contain different sequential concepts. Therefore, learning distributed representations from temporal patterns of clinical notes is an essential step for downstream applications on EHR data. However, existing methods for EHR representation learning can not adequately capture either contextual information per-visit or temporal information at multiple visits. In this study, we developed a new vector embedding method called EHR2Vec that can learn semantically-meaningful representations of clinical concepts. EHR2Vec incorporated the self-attention structure and showed its utility in accurately identifying relevant clinical concept entities considering time sequence information from multiple visits. Using EHR data from systemic lupus erythematosus (SLE) patients as a case study, we showed EHR2Vec outperforms in identifying interpretable representations compared to other well-known methods including Word2Vec and Med2Vec, according to clinical experts' evaluations.
format Online
Article
Text
id pubmed-7344186
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-73441862020-07-25 EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism Wang, Li Wang, Qinghua Bai, Heming Liu, Cong Liu, Wei Zhang, Yuanpeng Jiang, Lei Xu, Huji Wang, Kai Zhou, Yunyun Front Genet Genetics Efficiently learning representations of clinical concepts (i. e., symptoms, lab test, etc.) from unstructured clinical notes of electronic health record (EHR) data remain significant challenges, since each patient may have multiple visits at different times and each visit may contain different sequential concepts. Therefore, learning distributed representations from temporal patterns of clinical notes is an essential step for downstream applications on EHR data. However, existing methods for EHR representation learning can not adequately capture either contextual information per-visit or temporal information at multiple visits. In this study, we developed a new vector embedding method called EHR2Vec that can learn semantically-meaningful representations of clinical concepts. EHR2Vec incorporated the self-attention structure and showed its utility in accurately identifying relevant clinical concept entities considering time sequence information from multiple visits. Using EHR data from systemic lupus erythematosus (SLE) patients as a case study, we showed EHR2Vec outperforms in identifying interpretable representations compared to other well-known methods including Word2Vec and Med2Vec, according to clinical experts' evaluations. Frontiers Media S.A. 2020-06-29 /pmc/articles/PMC7344186/ /pubmed/32714371 http://dx.doi.org/10.3389/fgene.2020.00630 Text en Copyright © 2020 Wang, Wang, Bai, Liu, Liu, Zhang, Jiang, Xu, Wang and Zhou. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Wang, Li
Wang, Qinghua
Bai, Heming
Liu, Cong
Liu, Wei
Zhang, Yuanpeng
Jiang, Lei
Xu, Huji
Wang, Kai
Zhou, Yunyun
EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism
title EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism
title_full EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism
title_fullStr EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism
title_full_unstemmed EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism
title_short EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism
title_sort ehr2vec: representation learning of medical concepts from temporal patterns of clinical notes based on self-attention mechanism
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7344186/
https://www.ncbi.nlm.nih.gov/pubmed/32714371
http://dx.doi.org/10.3389/fgene.2020.00630
work_keys_str_mv AT wangli ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism
AT wangqinghua ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism
AT baiheming ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism
AT liucong ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism
AT liuwei ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism
AT zhangyuanpeng ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism
AT jianglei ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism
AT xuhuji ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism
AT wangkai ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism
AT zhouyunyun ehr2vecrepresentationlearningofmedicalconceptsfromtemporalpatternsofclinicalnotesbasedonselfattentionmechanism