Cargando…

A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text

Negation detection is an important task in biomedical text mining. Particularly in clinical settings, it is of critical importance to determine whether findings mentioned in text are present or absent. Rule-based negation detection algorithms are a common approach to the task, and more recent invest...

Descripción completa

Detalles Bibliográficos
Autores principales: Slater, Luke T., Bradlow, William, Motti, Dino FA., Hoehndorf, Robert, Ball, Simon, Gkoutos, Georgios V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7910278/
https://www.ncbi.nlm.nih.gov/pubmed/33484944
http://dx.doi.org/10.1016/j.compbiomed.2021.104216
_version_ 1783656097058390016
author Slater, Luke T.
Bradlow, William
Motti, Dino FA.
Hoehndorf, Robert
Ball, Simon
Gkoutos, Georgios V.
author_facet Slater, Luke T.
Bradlow, William
Motti, Dino FA.
Hoehndorf, Robert
Ball, Simon
Gkoutos, Georgios V.
author_sort Slater, Luke T.
collection PubMed
description Negation detection is an important task in biomedical text mining. Particularly in clinical settings, it is of critical importance to determine whether findings mentioned in text are present or absent. Rule-based negation detection algorithms are a common approach to the task, and more recent investigations have resulted in the development of rule-based systems utilising the rich grammatical information afforded by typed dependency graphs. However, interacting with these complex representations inevitably necessitates complex rules, which are time-consuming to develop and do not generalise well. We hypothesise that a heuristic approach to determining negation via dependency graphs could offer a powerful alternative. We describe and implement an algorithm for negation detection based on grammatical distance from a negatory construct in a typed dependency graph. To evaluate the algorithm, we develop two testing corpora comprised of sentences of clinical text extracted from the MIMIC-III database and documents related to hypertrophic cardiomyopathy patients routinely collected at University Hospitals Birmingham NHS trust. Gold-standard validation datasets were built by a combination of human annotation and examination of algorithm error. Finally, we compare the performance of our approach with four other rule-based algorithms on both gold-standard corpora. The presented algorithm exhibits the best performance by f-measure over the MIMIC-III dataset, and a similar performance to the syntactic negation detection systems over the HCM dataset. It is also the fastest of the dependency-based negation systems explored in this study. Our results show that while a single heuristic approach to dependency-based negation detection is ignorant to certain advanced cases, it nevertheless forms a powerful and stable method, requiring minimal training and adaptation between datasets. As such, it could present a drop-in replacement or augmentation for many-rule negation approaches in clinical text-mining pipelines, particularly for cases where adaptation and rule development is not required or possible.
format Online
Article
Text
id pubmed-7910278
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-79102782021-03-04 A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text Slater, Luke T. Bradlow, William Motti, Dino FA. Hoehndorf, Robert Ball, Simon Gkoutos, Georgios V. Comput Biol Med Article Negation detection is an important task in biomedical text mining. Particularly in clinical settings, it is of critical importance to determine whether findings mentioned in text are present or absent. Rule-based negation detection algorithms are a common approach to the task, and more recent investigations have resulted in the development of rule-based systems utilising the rich grammatical information afforded by typed dependency graphs. However, interacting with these complex representations inevitably necessitates complex rules, which are time-consuming to develop and do not generalise well. We hypothesise that a heuristic approach to determining negation via dependency graphs could offer a powerful alternative. We describe and implement an algorithm for negation detection based on grammatical distance from a negatory construct in a typed dependency graph. To evaluate the algorithm, we develop two testing corpora comprised of sentences of clinical text extracted from the MIMIC-III database and documents related to hypertrophic cardiomyopathy patients routinely collected at University Hospitals Birmingham NHS trust. Gold-standard validation datasets were built by a combination of human annotation and examination of algorithm error. Finally, we compare the performance of our approach with four other rule-based algorithms on both gold-standard corpora. The presented algorithm exhibits the best performance by f-measure over the MIMIC-III dataset, and a similar performance to the syntactic negation detection systems over the HCM dataset. It is also the fastest of the dependency-based negation systems explored in this study. Our results show that while a single heuristic approach to dependency-based negation detection is ignorant to certain advanced cases, it nevertheless forms a powerful and stable method, requiring minimal training and adaptation between datasets. As such, it could present a drop-in replacement or augmentation for many-rule negation approaches in clinical text-mining pipelines, particularly for cases where adaptation and rule development is not required or possible. Elsevier 2021-03 /pmc/articles/PMC7910278/ /pubmed/33484944 http://dx.doi.org/10.1016/j.compbiomed.2021.104216 Text en © 2021 The Author(s) http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Slater, Luke T.
Bradlow, William
Motti, Dino FA.
Hoehndorf, Robert
Ball, Simon
Gkoutos, Georgios V.
A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text
title A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text
title_full A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text
title_fullStr A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text
title_full_unstemmed A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text
title_short A fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text
title_sort fast, accurate, and generalisable heuristic-based negation detection algorithm for clinical text
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7910278/
https://www.ncbi.nlm.nih.gov/pubmed/33484944
http://dx.doi.org/10.1016/j.compbiomed.2021.104216
work_keys_str_mv AT slaterluket afastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext
AT bradlowwilliam afastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext
AT mottidinofa afastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext
AT hoehndorfrobert afastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext
AT ballsimon afastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext
AT gkoutosgeorgiosv afastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext
AT slaterluket fastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext
AT bradlowwilliam fastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext
AT mottidinofa fastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext
AT hoehndorfrobert fastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext
AT ballsimon fastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext
AT gkoutosgeorgiosv fastaccurateandgeneralisableheuristicbasednegationdetectionalgorithmforclinicaltext