Cargando…
Automatically disambiguating medical acronyms with ontology-aware deep learning
Modern machine learning (ML) technologies have great promise for automating diverse clinical and research workflows; however, training them requires extensive hand-labelled datasets. Disambiguating abbreviations is important for automated clinical note processing; however, broad deployment of ML for...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8423722/ https://www.ncbi.nlm.nih.gov/pubmed/34493718 http://dx.doi.org/10.1038/s41467-021-25578-4 |
_version_ | 1783749525909798912 |
---|---|
author | Skreta, Marta Arbabi, Aryan Wang, Jixuan Drysdale, Erik Kelly, Jacob Singh, Devin Brudno, Michael |
author_facet | Skreta, Marta Arbabi, Aryan Wang, Jixuan Drysdale, Erik Kelly, Jacob Singh, Devin Brudno, Michael |
author_sort | Skreta, Marta |
collection | PubMed |
description | Modern machine learning (ML) technologies have great promise for automating diverse clinical and research workflows; however, training them requires extensive hand-labelled datasets. Disambiguating abbreviations is important for automated clinical note processing; however, broad deployment of ML for this task is restricted by the scarcity and imbalance of labeled training data. In this work we present a method that improves a model’s ability to generalize through novel data augmentation techniques that utilizes information from biomedical ontologies in the form of related medical concepts, as well as global context information within the medical note. We train our model on a public dataset (MIMIC III) and test its performance on automatically generated and hand-labelled datasets from different sources (MIMIC III, CASI, i2b2). Together, these techniques boost the accuracy of abbreviation disambiguation by up to 17% on hand-labeled data, without sacrificing performance on a held-out test set from MIMIC III. |
format | Online Article Text |
id | pubmed-8423722 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-84237222021-09-22 Automatically disambiguating medical acronyms with ontology-aware deep learning Skreta, Marta Arbabi, Aryan Wang, Jixuan Drysdale, Erik Kelly, Jacob Singh, Devin Brudno, Michael Nat Commun Article Modern machine learning (ML) technologies have great promise for automating diverse clinical and research workflows; however, training them requires extensive hand-labelled datasets. Disambiguating abbreviations is important for automated clinical note processing; however, broad deployment of ML for this task is restricted by the scarcity and imbalance of labeled training data. In this work we present a method that improves a model’s ability to generalize through novel data augmentation techniques that utilizes information from biomedical ontologies in the form of related medical concepts, as well as global context information within the medical note. We train our model on a public dataset (MIMIC III) and test its performance on automatically generated and hand-labelled datasets from different sources (MIMIC III, CASI, i2b2). Together, these techniques boost the accuracy of abbreviation disambiguation by up to 17% on hand-labeled data, without sacrificing performance on a held-out test set from MIMIC III. Nature Publishing Group UK 2021-09-07 /pmc/articles/PMC8423722/ /pubmed/34493718 http://dx.doi.org/10.1038/s41467-021-25578-4 Text en © Crown 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Skreta, Marta Arbabi, Aryan Wang, Jixuan Drysdale, Erik Kelly, Jacob Singh, Devin Brudno, Michael Automatically disambiguating medical acronyms with ontology-aware deep learning |
title | Automatically disambiguating medical acronyms with ontology-aware deep learning |
title_full | Automatically disambiguating medical acronyms with ontology-aware deep learning |
title_fullStr | Automatically disambiguating medical acronyms with ontology-aware deep learning |
title_full_unstemmed | Automatically disambiguating medical acronyms with ontology-aware deep learning |
title_short | Automatically disambiguating medical acronyms with ontology-aware deep learning |
title_sort | automatically disambiguating medical acronyms with ontology-aware deep learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8423722/ https://www.ncbi.nlm.nih.gov/pubmed/34493718 http://dx.doi.org/10.1038/s41467-021-25578-4 |
work_keys_str_mv | AT skretamarta automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning AT arbabiaryan automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning AT wangjixuan automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning AT drysdaleerik automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning AT kellyjacob automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning AT singhdevin automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning AT brudnomichael automaticallydisambiguatingmedicalacronymswithontologyawaredeeplearning |