Cargando…

Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2

BACKGROUND: The ShARe/CLEF eHealth challenge lab aims to stimulate development of natural language processing and information retrieval technologies to aid patients in understanding their clinical reports. In clinical text, acronyms and abbreviations, also referenced as short forms, can be difficult...

Descripción completa

Detalles Bibliográficos
Autores principales: Mowery, Danielle L., South, Brett R., Christensen, Lee, Leng, Jianwei, Peltonen, Laura-Maria, Salanterä, Sanna, Suominen, Hanna, Martinez, David, Velupillai, Sumithra, Elhadad, Noémie, Savova, Guergana, Pradhan, Sameer, Chapman, Wendy W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4930590/
https://www.ncbi.nlm.nih.gov/pubmed/27370271
http://dx.doi.org/10.1186/s13326-016-0084-y
_version_ 1782440766374674432
author Mowery, Danielle L.
South, Brett R.
Christensen, Lee
Leng, Jianwei
Peltonen, Laura-Maria
Salanterä, Sanna
Suominen, Hanna
Martinez, David
Velupillai, Sumithra
Elhadad, Noémie
Savova, Guergana
Pradhan, Sameer
Chapman, Wendy W.
author_facet Mowery, Danielle L.
South, Brett R.
Christensen, Lee
Leng, Jianwei
Peltonen, Laura-Maria
Salanterä, Sanna
Suominen, Hanna
Martinez, David
Velupillai, Sumithra
Elhadad, Noémie
Savova, Guergana
Pradhan, Sameer
Chapman, Wendy W.
author_sort Mowery, Danielle L.
collection PubMed
description BACKGROUND: The ShARe/CLEF eHealth challenge lab aims to stimulate development of natural language processing and information retrieval technologies to aid patients in understanding their clinical reports. In clinical text, acronyms and abbreviations, also referenced as short forms, can be difficult for patients to understand. For one of three shared tasks in 2013 (Task 2), we generated a reference standard of clinical short forms normalized to the Unified Medical Language System. This reference standard can be used to improve patient understanding by linking to web sources with lay descriptions of annotated short forms or by substituting short forms with a more simplified, lay term. METHODS: In this study, we evaluate 1) accuracy of participating systems’ normalizing short forms compared to a majority sense baseline approach, 2) performance of participants’ systems for short forms with variable majority sense distributions, and 3) report the accuracy of participating systems’ normalizing shared normalized concepts between the test set and the Consumer Health Vocabulary, a vocabulary of lay medical terms. RESULTS: The best systems submitted by the five participating teams performed with accuracies ranging from 43 to 72 %. A majority sense baseline approach achieved the second best performance. The performance of participating systems for normalizing short forms with two or more senses with low ambiguity (majority sense greater than 80 %) ranged from 52 to 78 % accuracy, with two or more senses with moderate ambiguity (majority sense between 50 and 80 %) ranged from 23 to 57 % accuracy, and with two or more senses with high ambiguity (majority sense less than 50 %) ranged from 2 to 45 % accuracy. With respect to the ShARe test set, 69 % of short form annotations contained common concept unique identifiers with the Consumer Health Vocabulary. For these 2594 possible annotations, the performance of participating systems ranged from 50 to 75 % accuracy. CONCLUSION: Short form normalization continues to be a challenging problem. Short form normalization systems perform with moderate to reasonable accuracies. The Consumer Health Vocabulary could enrich its knowledge base with missed concept unique identifiers from the ShARe test set to further support patient understanding of unfamiliar medical terms.
format Online
Article
Text
id pubmed-4930590
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49305902016-07-03 Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2 Mowery, Danielle L. South, Brett R. Christensen, Lee Leng, Jianwei Peltonen, Laura-Maria Salanterä, Sanna Suominen, Hanna Martinez, David Velupillai, Sumithra Elhadad, Noémie Savova, Guergana Pradhan, Sameer Chapman, Wendy W. J Biomed Semantics Research BACKGROUND: The ShARe/CLEF eHealth challenge lab aims to stimulate development of natural language processing and information retrieval technologies to aid patients in understanding their clinical reports. In clinical text, acronyms and abbreviations, also referenced as short forms, can be difficult for patients to understand. For one of three shared tasks in 2013 (Task 2), we generated a reference standard of clinical short forms normalized to the Unified Medical Language System. This reference standard can be used to improve patient understanding by linking to web sources with lay descriptions of annotated short forms or by substituting short forms with a more simplified, lay term. METHODS: In this study, we evaluate 1) accuracy of participating systems’ normalizing short forms compared to a majority sense baseline approach, 2) performance of participants’ systems for short forms with variable majority sense distributions, and 3) report the accuracy of participating systems’ normalizing shared normalized concepts between the test set and the Consumer Health Vocabulary, a vocabulary of lay medical terms. RESULTS: The best systems submitted by the five participating teams performed with accuracies ranging from 43 to 72 %. A majority sense baseline approach achieved the second best performance. The performance of participating systems for normalizing short forms with two or more senses with low ambiguity (majority sense greater than 80 %) ranged from 52 to 78 % accuracy, with two or more senses with moderate ambiguity (majority sense between 50 and 80 %) ranged from 23 to 57 % accuracy, and with two or more senses with high ambiguity (majority sense less than 50 %) ranged from 2 to 45 % accuracy. With respect to the ShARe test set, 69 % of short form annotations contained common concept unique identifiers with the Consumer Health Vocabulary. For these 2594 possible annotations, the performance of participating systems ranged from 50 to 75 % accuracy. CONCLUSION: Short form normalization continues to be a challenging problem. Short form normalization systems perform with moderate to reasonable accuracies. The Consumer Health Vocabulary could enrich its knowledge base with missed concept unique identifiers from the ShARe test set to further support patient understanding of unfamiliar medical terms. BioMed Central 2016-07-01 /pmc/articles/PMC4930590/ /pubmed/27370271 http://dx.doi.org/10.1186/s13326-016-0084-y Text en © Mowery et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Mowery, Danielle L.
South, Brett R.
Christensen, Lee
Leng, Jianwei
Peltonen, Laura-Maria
Salanterä, Sanna
Suominen, Hanna
Martinez, David
Velupillai, Sumithra
Elhadad, Noémie
Savova, Guergana
Pradhan, Sameer
Chapman, Wendy W.
Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2
title Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2
title_full Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2
title_fullStr Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2
title_full_unstemmed Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2
title_short Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2
title_sort normalizing acronyms and abbreviations to aid patient understanding of clinical texts: share/clef ehealth challenge 2013, task 2
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4930590/
https://www.ncbi.nlm.nih.gov/pubmed/27370271
http://dx.doi.org/10.1186/s13326-016-0084-y
work_keys_str_mv AT mowerydaniellel normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT southbrettr normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT christensenlee normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT lengjianwei normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT peltonenlauramaria normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT salanterasanna normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT suominenhanna normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT martinezdavid normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT velupillaisumithra normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT elhadadnoemie normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT savovaguergana normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT pradhansameer normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2
AT chapmanwendyw normalizingacronymsandabbreviationstoaidpatientunderstandingofclinicaltextsshareclefehealthchallenge2013task2