Cargando…

Automatically disambiguating medical acronyms with ontology-aware deep learning

Modern machine learning (ML) technologies have great promise for automating diverse clinical and research workflows; however, training them requires extensive hand-labelled datasets. Disambiguating abbreviations is important for automated clinical note processing; however, broad deployment of ML for...

Descripción completa

Detalles Bibliográficos
Autores principales: Skreta, Marta, Arbabi, Aryan, Wang, Jixuan, Drysdale, Erik, Kelly, Jacob, Singh, Devin, Brudno, Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8423722/
https://www.ncbi.nlm.nih.gov/pubmed/34493718
http://dx.doi.org/10.1038/s41467-021-25578-4
_version_ 1783749525909798912
author Skreta, Marta
Arbabi, Aryan
Wang, Jixuan
Drysdale, Erik
Kelly, Jacob
Singh, Devin
Brudno, Michael
author_facet Skreta, Marta
Arbabi, Aryan
Wang, Jixuan
Drysdale, Erik
Kelly, Jacob
Singh, Devin
Brudno, Michael
author_sort Skreta, Marta
collection PubMed
description Modern machine learning (ML) technologies have great promise for automating diverse clinical and research workflows; however, training them requires extensive hand-labelled datasets. Disambiguating abbreviations is important for automated clinical note processing; however, broad deployment of ML for this task is restricted by the scarcity and imbalance of labeled training data. In this work we present a method that improves a model’s ability to generalize through novel data augmentation techniques that utilizes information from biomedical ontologies in the form of related medical concepts, as well as global context information within the medical note. We train our model on a public dataset (MIMIC III) and test its performance on automatically generated and hand-labelled datasets from different sources (MIMIC III, CASI, i2b2). Together, these techniques boost the accuracy of abbreviation disambiguation by up to 17% on hand-labeled data, without sacrificing performance on a held-out test set from MIMIC III.
format Online
Article
Text
id pubmed-8423722
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-84237222021-09-22 Automatically disambiguating medical acronyms with ontology-aware deep learning Skreta, Marta Arbabi, Aryan Wang, Jixuan Drysdale, Erik Kelly, Jacob Singh, Devin Brudno, Michael Nat Commun Article Modern machine learning (ML) technologies have great promise for automating diverse clinical and research workflows; however, training them requires extensive hand-labelled datasets. Disambiguating abbreviations is important for automated clinical note processing; however, broad deployment of ML for this task is restricted by the scarcity and imbalance of labeled training data. In this work we present a method that improves a model’s ability to generalize through novel data augmentation techniques that utilizes information from biomedical ontologies in the form of related medical concepts, as well as global context information within the medical note. We train our model on a public dataset (MIMIC III) and test its performance on automatically generated and hand-labelled datasets from different sources (MIMIC III, CASI, i2b2). Together, these techniques boost the accuracy of abbreviation disambiguation by up to 17% on hand-labeled data, without sacrificing performance on a held-out test set from MIMIC III. Nature Publishing Group UK 2021-09-07 /pmc/articles/PMC8423722/ /pubmed/34493718 http://dx.doi.org/10.1038/s41467-021-25578-4 Text en © Crown 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Skreta, Marta
Arbabi, Aryan
Wang, Jixuan
Drysdale, Erik
Kelly, Jacob
Singh, Devin
Brudno, Michael
Automatically disambiguating medical acronyms with ontology-aware deep learning
title Automatically disambiguating medical acronyms with ontology-aware deep learning
title_full Automatically disambiguating medical acronyms with ontology-aware deep learning
title_fullStr Automatically disambiguating medical acronyms with ontology-aware deep learning
title_full_unstemmed Automatically disambiguating medical acronyms with ontology-aware deep learning
title_short Automatically disambiguating medical acronyms with ontology-aware deep learning
title_sort automatically disambiguating medical acronyms with ontology-aware deep learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8423722/
https://www.ncbi.nlm.nih.gov/pubmed/34493718
http://dx.doi.org/10.1038/s41467-021-25578-4
work_keys_str_mv AT skretamarta automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning
AT arbabiaryan automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning
AT wangjixuan automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning
AT drysdaleerik automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning
AT kellyjacob automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning
AT singhdevin automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning
AT brudnomichael automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning