Cargando…

Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical Domain

OBJECTIVES: Although acronyms and abbreviations in clinical text are used widely on a daily basis, relatively little research has focused upon word sense disambiguation (WSD) of acronyms and abbreviations in the healthcare domain. Since clinical notes have distinctive characteristics, it is unclear...

Descripción completa

Detalles Bibliográficos
Autores principales: Moon, Sungrim, McInnes, Bridget, Melton, Genevieve B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Society of Medical Informatics 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4330198/
https://www.ncbi.nlm.nih.gov/pubmed/25705556
http://dx.doi.org/10.4258/hir.2015.21.1.35
_version_ 1782357547544477696
author Moon, Sungrim
McInnes, Bridget
Melton, Genevieve B.
author_facet Moon, Sungrim
McInnes, Bridget
Melton, Genevieve B.
author_sort Moon, Sungrim
collection PubMed
description OBJECTIVES: Although acronyms and abbreviations in clinical text are used widely on a daily basis, relatively little research has focused upon word sense disambiguation (WSD) of acronyms and abbreviations in the healthcare domain. Since clinical notes have distinctive characteristics, it is unclear whether techniques effective for acronym and abbreviation WSD from biomedical literature are sufficient. METHODS: The authors discuss feature selection for automated techniques and challenges with WSD of acronyms and abbreviations in the clinical domain. RESULTS: There are significant challenges associated with the informal nature of clinical text, such as typographical errors and incomplete sentences; difficulty with insufficient clinical resources, such as clinical sense inventories; and obstacles with privacy and security for conducting research with clinical text. Although we anticipated that using sophisticated techniques, such as biomedical terminologies, semantic types, part-of-speech, and language modeling, would be needed for feature selection with automated machine learning approaches, we found instead that simple techniques, such as bag-of-words, were quite effective in many cases. Factors, such as majority sense prevalence and the degree of separateness between sense meanings, were also important considerations. CONCLUSIONS: The first lesson is that a comprehensive understanding of the unique characteristics of clinical text is important for automatic acronym and abbreviation WSD. The second lesson learned is that investigators may find that using simple approaches is an effective starting point for these tasks. Finally, similar to other WSD tasks, an understanding of baseline majority sense rates and separateness between senses is important. Further studies and practical solutions are needed to better address these issues.
format Online
Article
Text
id pubmed-4330198
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Korean Society of Medical Informatics
record_format MEDLINE/PubMed
spelling pubmed-43301982015-02-22 Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical Domain Moon, Sungrim McInnes, Bridget Melton, Genevieve B. Healthc Inform Res Original Article OBJECTIVES: Although acronyms and abbreviations in clinical text are used widely on a daily basis, relatively little research has focused upon word sense disambiguation (WSD) of acronyms and abbreviations in the healthcare domain. Since clinical notes have distinctive characteristics, it is unclear whether techniques effective for acronym and abbreviation WSD from biomedical literature are sufficient. METHODS: The authors discuss feature selection for automated techniques and challenges with WSD of acronyms and abbreviations in the clinical domain. RESULTS: There are significant challenges associated with the informal nature of clinical text, such as typographical errors and incomplete sentences; difficulty with insufficient clinical resources, such as clinical sense inventories; and obstacles with privacy and security for conducting research with clinical text. Although we anticipated that using sophisticated techniques, such as biomedical terminologies, semantic types, part-of-speech, and language modeling, would be needed for feature selection with automated machine learning approaches, we found instead that simple techniques, such as bag-of-words, were quite effective in many cases. Factors, such as majority sense prevalence and the degree of separateness between sense meanings, were also important considerations. CONCLUSIONS: The first lesson is that a comprehensive understanding of the unique characteristics of clinical text is important for automatic acronym and abbreviation WSD. The second lesson learned is that investigators may find that using simple approaches is an effective starting point for these tasks. Finally, similar to other WSD tasks, an understanding of baseline majority sense rates and separateness between senses is important. Further studies and practical solutions are needed to better address these issues. Korean Society of Medical Informatics 2015-01 2015-01-31 /pmc/articles/PMC4330198/ /pubmed/25705556 http://dx.doi.org/10.4258/hir.2015.21.1.35 Text en © 2015 The Korean Society of Medical Informatics http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Moon, Sungrim
McInnes, Bridget
Melton, Genevieve B.
Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical Domain
title Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical Domain
title_full Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical Domain
title_fullStr Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical Domain
title_full_unstemmed Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical Domain
title_short Challenges and Practical Approaches with Word Sense Disambiguation of Acronyms and Abbreviations in the Clinical Domain
title_sort challenges and practical approaches with word sense disambiguation of acronyms and abbreviations in the clinical domain
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4330198/
https://www.ncbi.nlm.nih.gov/pubmed/25705556
http://dx.doi.org/10.4258/hir.2015.21.1.35
work_keys_str_mv AT moonsungrim challengesandpracticalapproacheswithwordsensedisambiguationofacronymsandabbreviationsintheclinicaldomain
AT mcinnesbridget challengesandpracticalapproacheswithwordsensedisambiguationofacronymsandabbreviationsintheclinicaldomain
AT meltongenevieveb challengesandpracticalapproacheswithwordsensedisambiguationofacronymsandabbreviationsintheclinicaldomain