Cargando…

Leveraging medical context to recommend semantically similar terms for chart reviews

BACKGROUND: Information retrieval (IR) help clinicians answer questions posed to large collections of electronic medical records (EMRs), such as how best to identify a patient’s cancer stage. One of the more promising approaches to IR for EMRs is to expand a keyword query with similar terms (e.g., a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ye, Cheng, Malin, Bradley A., Fabbri, Daniel
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2021
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8684266/ https://www.ncbi.nlm.nih.gov/pubmed/34922536 http://dx.doi.org/10.1186/s12911-021-01724-2

_version_	1784617584255041536
author	Ye, Cheng Malin, Bradley A. Fabbri, Daniel
author_facet	Ye, Cheng Malin, Bradley A. Fabbri, Daniel
author_sort	Ye, Cheng
collection	PubMed
description	BACKGROUND: Information retrieval (IR) help clinicians answer questions posed to large collections of electronic medical records (EMRs), such as how best to identify a patient’s cancer stage. One of the more promising approaches to IR for EMRs is to expand a keyword query with similar terms (e.g., augmenting cancer with mets). However, there is a large range of clinical chart review tasks, such that fixed sets of similar terms is insufficient. Current language models, such as Bidirectional Encoder Representations from Transformers (BERT) embeddings, do not capture the full non-textual context of a task. In this study, we present new methods that provide similar terms dynamically by adjusting with the context of the chart review task. METHODS: We introduce a vector space for medical-context in which each word is represented by a vector that captures the word’s usage in different medical contexts (e.g., how frequently cancer is used when ordering a prescription versus describing family history) beyond the context learned from the surrounding text. These vectors are transformed into a vector space for customizing the set of similar terms selected for different chart review tasks. We evaluate the vector space model with multiple chart review tasks, in which supervised machine learning models learn to predict the preferred terms of clinically knowledgeable reviewers. To quantify the usefulness of the predicted similar terms to a baseline of standard word2vec embeddings, we measure (1) the prediction performance of the medical-context vector space model using the area under the receiver operating characteristic curve (AUROC) and (2) the labeling effort required to train the models. RESULTS: The vector space outperformed the baseline word2vec embeddings in all three chart review tasks with an average AUROC of 0.80 versus 0.66, respectively. Additionally, the medical-context vector space significantly reduced the number of labels required to learn and predict the preferred similar terms of reviewers. Specifically, the labeling effort was reduced to 10% of the entire dataset in all three tasks. CONCLUSIONS: The set of preferred similar terms that are relevant to a chart review task can be learned by leveraging the medical context of the task. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-021-01724-2.
format	Online Article Text
id	pubmed-8684266
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-86842662021-12-20 Leveraging medical context to recommend semantically similar terms for chart reviews Ye, Cheng Malin, Bradley A. Fabbri, Daniel BMC Med Inform Decis Mak Research BACKGROUND: Information retrieval (IR) help clinicians answer questions posed to large collections of electronic medical records (EMRs), such as how best to identify a patient’s cancer stage. One of the more promising approaches to IR for EMRs is to expand a keyword query with similar terms (e.g., augmenting cancer with mets). However, there is a large range of clinical chart review tasks, such that fixed sets of similar terms is insufficient. Current language models, such as Bidirectional Encoder Representations from Transformers (BERT) embeddings, do not capture the full non-textual context of a task. In this study, we present new methods that provide similar terms dynamically by adjusting with the context of the chart review task. METHODS: We introduce a vector space for medical-context in which each word is represented by a vector that captures the word’s usage in different medical contexts (e.g., how frequently cancer is used when ordering a prescription versus describing family history) beyond the context learned from the surrounding text. These vectors are transformed into a vector space for customizing the set of similar terms selected for different chart review tasks. We evaluate the vector space model with multiple chart review tasks, in which supervised machine learning models learn to predict the preferred terms of clinically knowledgeable reviewers. To quantify the usefulness of the predicted similar terms to a baseline of standard word2vec embeddings, we measure (1) the prediction performance of the medical-context vector space model using the area under the receiver operating characteristic curve (AUROC) and (2) the labeling effort required to train the models. RESULTS: The vector space outperformed the baseline word2vec embeddings in all three chart review tasks with an average AUROC of 0.80 versus 0.66, respectively. Additionally, the medical-context vector space significantly reduced the number of labels required to learn and predict the preferred similar terms of reviewers. Specifically, the labeling effort was reduced to 10% of the entire dataset in all three tasks. CONCLUSIONS: The set of preferred similar terms that are relevant to a chart review task can be learned by leveraging the medical context of the task. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-021-01724-2. BioMed Central 2021-12-18 /pmc/articles/PMC8684266/ /pubmed/34922536 http://dx.doi.org/10.1186/s12911-021-01724-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Ye, Cheng Malin, Bradley A. Fabbri, Daniel Leveraging medical context to recommend semantically similar terms for chart reviews
title	Leveraging medical context to recommend semantically similar terms for chart reviews
title_full	Leveraging medical context to recommend semantically similar terms for chart reviews
title_fullStr	Leveraging medical context to recommend semantically similar terms for chart reviews
title_full_unstemmed	Leveraging medical context to recommend semantically similar terms for chart reviews
title_short	Leveraging medical context to recommend semantically similar terms for chart reviews
title_sort	leveraging medical context to recommend semantically similar terms for chart reviews
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8684266/ https://www.ncbi.nlm.nih.gov/pubmed/34922536 http://dx.doi.org/10.1186/s12911-021-01724-2
work_keys_str_mv	AT yecheng leveragingmedicalcontexttorecommendsemanticallysimilartermsforchartreviews AT malinbradleya leveragingmedicalcontexttorecommendsemanticallysimilartermsforchartreviews AT fabbridaniel leveragingmedicalcontexttorecommendsemanticallysimilartermsforchartreviews

Leveraging medical context to recommend semantically similar terms for chart reviews

Ejemplares similares