Cargando…

Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach

Detecting negation and uncertainty is crucial for medical text mining applications; otherwise, extracted information can be incorrectly identified as real or factual events. Although several approaches have been proposed to detect negation and uncertainty in clinical texts, most efforts have focused...

Descripción completa

Detalles Bibliográficos
Autores principales: Solarte Pabón, Oswaldo, Montenegro, Orlando, Torrente, Maria, Rodríguez González, Alejandro, Provencio, Mariano, Menasalvas, Ernestina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044225/
https://www.ncbi.nlm.nih.gov/pubmed/35494817
http://dx.doi.org/10.7717/peerj-cs.913
_version_ 1784695058839109632
author Solarte Pabón, Oswaldo
Montenegro, Orlando
Torrente, Maria
Rodríguez González, Alejandro
Provencio, Mariano
Menasalvas, Ernestina
author_facet Solarte Pabón, Oswaldo
Montenegro, Orlando
Torrente, Maria
Rodríguez González, Alejandro
Provencio, Mariano
Menasalvas, Ernestina
author_sort Solarte Pabón, Oswaldo
collection PubMed
description Detecting negation and uncertainty is crucial for medical text mining applications; otherwise, extracted information can be incorrectly identified as real or factual events. Although several approaches have been proposed to detect negation and uncertainty in clinical texts, most efforts have focused on the English language. Most proposals developed for Spanish have focused mainly on negation detection and do not deal with uncertainty. In this paper, we propose a deep learning-based approach for both negation and uncertainty detection in clinical texts written in Spanish. The proposed approach explores two deep learning methods to achieve this goal: (i) Bidirectional Long-Short Term Memory with a Conditional Random Field layer (BiLSTM-CRF) and (ii) Bidirectional Encoder Representation for Transformers (BERT). The approach was evaluated using NUBES and IULA, two public corpora for the Spanish language. The results obtained showed an F-score of 92% and 80% in the scope recognition task for negation and uncertainty, respectively. We also present the results of a validation process conducted using a real-life annotated dataset from clinical notes belonging to cancer patients. The proposed approach shows the feasibility of deep learning-based methods to detect negation and uncertainty in Spanish clinical texts. Experiments also highlighted that this approach improves performance in the scope recognition task compared to other proposals in the biomedical domain.
format Online
Article
Text
id pubmed-9044225
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-90442252022-04-28 Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach Solarte Pabón, Oswaldo Montenegro, Orlando Torrente, Maria Rodríguez González, Alejandro Provencio, Mariano Menasalvas, Ernestina PeerJ Comput Sci Data Mining and Machine Learning Detecting negation and uncertainty is crucial for medical text mining applications; otherwise, extracted information can be incorrectly identified as real or factual events. Although several approaches have been proposed to detect negation and uncertainty in clinical texts, most efforts have focused on the English language. Most proposals developed for Spanish have focused mainly on negation detection and do not deal with uncertainty. In this paper, we propose a deep learning-based approach for both negation and uncertainty detection in clinical texts written in Spanish. The proposed approach explores two deep learning methods to achieve this goal: (i) Bidirectional Long-Short Term Memory with a Conditional Random Field layer (BiLSTM-CRF) and (ii) Bidirectional Encoder Representation for Transformers (BERT). The approach was evaluated using NUBES and IULA, two public corpora for the Spanish language. The results obtained showed an F-score of 92% and 80% in the scope recognition task for negation and uncertainty, respectively. We also present the results of a validation process conducted using a real-life annotated dataset from clinical notes belonging to cancer patients. The proposed approach shows the feasibility of deep learning-based methods to detect negation and uncertainty in Spanish clinical texts. Experiments also highlighted that this approach improves performance in the scope recognition task compared to other proposals in the biomedical domain. PeerJ Inc. 2022-03-07 /pmc/articles/PMC9044225/ /pubmed/35494817 http://dx.doi.org/10.7717/peerj-cs.913 Text en ©2022 Solarte Pabón et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Data Mining and Machine Learning
Solarte Pabón, Oswaldo
Montenegro, Orlando
Torrente, Maria
Rodríguez González, Alejandro
Provencio, Mariano
Menasalvas, Ernestina
Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach
title Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach
title_full Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach
title_fullStr Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach
title_full_unstemmed Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach
title_short Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach
title_sort negation and uncertainty detection in clinical texts written in spanish: a deep learning-based approach
topic Data Mining and Machine Learning
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044225/
https://www.ncbi.nlm.nih.gov/pubmed/35494817
http://dx.doi.org/10.7717/peerj-cs.913
work_keys_str_mv AT solartepabonoswaldo negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach
AT montenegroorlando negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach
AT torrentemaria negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach
AT rodriguezgonzalezalejandro negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach
AT provenciomariano negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach
AT menasalvasernestina negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach