Cargando…
Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach
Detecting negation and uncertainty is crucial for medical text mining applications; otherwise, extracted information can be incorrectly identified as real or factual events. Although several approaches have been proposed to detect negation and uncertainty in clinical texts, most efforts have focused...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044225/ https://www.ncbi.nlm.nih.gov/pubmed/35494817 http://dx.doi.org/10.7717/peerj-cs.913 |
_version_ | 1784695058839109632 |
---|---|
author | Solarte Pabón, Oswaldo Montenegro, Orlando Torrente, Maria Rodríguez González, Alejandro Provencio, Mariano Menasalvas, Ernestina |
author_facet | Solarte Pabón, Oswaldo Montenegro, Orlando Torrente, Maria Rodríguez González, Alejandro Provencio, Mariano Menasalvas, Ernestina |
author_sort | Solarte Pabón, Oswaldo |
collection | PubMed |
description | Detecting negation and uncertainty is crucial for medical text mining applications; otherwise, extracted information can be incorrectly identified as real or factual events. Although several approaches have been proposed to detect negation and uncertainty in clinical texts, most efforts have focused on the English language. Most proposals developed for Spanish have focused mainly on negation detection and do not deal with uncertainty. In this paper, we propose a deep learning-based approach for both negation and uncertainty detection in clinical texts written in Spanish. The proposed approach explores two deep learning methods to achieve this goal: (i) Bidirectional Long-Short Term Memory with a Conditional Random Field layer (BiLSTM-CRF) and (ii) Bidirectional Encoder Representation for Transformers (BERT). The approach was evaluated using NUBES and IULA, two public corpora for the Spanish language. The results obtained showed an F-score of 92% and 80% in the scope recognition task for negation and uncertainty, respectively. We also present the results of a validation process conducted using a real-life annotated dataset from clinical notes belonging to cancer patients. The proposed approach shows the feasibility of deep learning-based methods to detect negation and uncertainty in Spanish clinical texts. Experiments also highlighted that this approach improves performance in the scope recognition task compared to other proposals in the biomedical domain. |
format | Online Article Text |
id | pubmed-9044225 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90442252022-04-28 Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach Solarte Pabón, Oswaldo Montenegro, Orlando Torrente, Maria Rodríguez González, Alejandro Provencio, Mariano Menasalvas, Ernestina PeerJ Comput Sci Data Mining and Machine Learning Detecting negation and uncertainty is crucial for medical text mining applications; otherwise, extracted information can be incorrectly identified as real or factual events. Although several approaches have been proposed to detect negation and uncertainty in clinical texts, most efforts have focused on the English language. Most proposals developed for Spanish have focused mainly on negation detection and do not deal with uncertainty. In this paper, we propose a deep learning-based approach for both negation and uncertainty detection in clinical texts written in Spanish. The proposed approach explores two deep learning methods to achieve this goal: (i) Bidirectional Long-Short Term Memory with a Conditional Random Field layer (BiLSTM-CRF) and (ii) Bidirectional Encoder Representation for Transformers (BERT). The approach was evaluated using NUBES and IULA, two public corpora for the Spanish language. The results obtained showed an F-score of 92% and 80% in the scope recognition task for negation and uncertainty, respectively. We also present the results of a validation process conducted using a real-life annotated dataset from clinical notes belonging to cancer patients. The proposed approach shows the feasibility of deep learning-based methods to detect negation and uncertainty in Spanish clinical texts. Experiments also highlighted that this approach improves performance in the scope recognition task compared to other proposals in the biomedical domain. PeerJ Inc. 2022-03-07 /pmc/articles/PMC9044225/ /pubmed/35494817 http://dx.doi.org/10.7717/peerj-cs.913 Text en ©2022 Solarte Pabón et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Data Mining and Machine Learning Solarte Pabón, Oswaldo Montenegro, Orlando Torrente, Maria Rodríguez González, Alejandro Provencio, Mariano Menasalvas, Ernestina Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach |
title | Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach |
title_full | Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach |
title_fullStr | Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach |
title_full_unstemmed | Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach |
title_short | Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach |
title_sort | negation and uncertainty detection in clinical texts written in spanish: a deep learning-based approach |
topic | Data Mining and Machine Learning |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044225/ https://www.ncbi.nlm.nih.gov/pubmed/35494817 http://dx.doi.org/10.7717/peerj-cs.913 |
work_keys_str_mv | AT solartepabonoswaldo negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach AT montenegroorlando negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach AT torrentemaria negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach AT rodriguezgonzalezalejandro negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach AT provenciomariano negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach AT menasalvasernestina negationanduncertaintydetectioninclinicaltextswritteninspanishadeeplearningbasedapproach |