Cargando…

Effects of Negation and Uncertainty Stratification on Text-Derived Patient Profile Similarity

Semantic similarity is a useful approach for comparing patient phenotypes, and holds the potential of an effective method for exploiting text-derived phenotypes for differential diagnosis, text and document classification, and outcome prediction. While approaches for context disambiguation are commo...

Descripción completa

Detalles Bibliográficos
Autores principales: Slater, Luke T., Karwath, Andreas, Hoehndorf, Robert, Gkoutos, Georgios V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8685209/
https://www.ncbi.nlm.nih.gov/pubmed/34939069
http://dx.doi.org/10.3389/fdgth.2021.781227
_version_ 1784617783984652288
author Slater, Luke T.
Karwath, Andreas
Hoehndorf, Robert
Gkoutos, Georgios V.
author_facet Slater, Luke T.
Karwath, Andreas
Hoehndorf, Robert
Gkoutos, Georgios V.
author_sort Slater, Luke T.
collection PubMed
description Semantic similarity is a useful approach for comparing patient phenotypes, and holds the potential of an effective method for exploiting text-derived phenotypes for differential diagnosis, text and document classification, and outcome prediction. While approaches for context disambiguation are commonly used in text mining applications, forming a standard component of information extraction pipelines, their effects on semantic similarity calculations have not been widely explored. In this work, we evaluate how inclusion and disclusion of negated and uncertain mentions of concepts from text-derived phenotypes affects similarity of patients, and the use of those profiles to predict diagnosis. We report on the effectiveness of these approaches and report a very small, yet significant, improvement in performance when classifying primary diagnosis over MIMIC-III patient visits.
format Online
Article
Text
id pubmed-8685209
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86852092021-12-21 Effects of Negation and Uncertainty Stratification on Text-Derived Patient Profile Similarity Slater, Luke T. Karwath, Andreas Hoehndorf, Robert Gkoutos, Georgios V. Front Digit Health Digital Health Semantic similarity is a useful approach for comparing patient phenotypes, and holds the potential of an effective method for exploiting text-derived phenotypes for differential diagnosis, text and document classification, and outcome prediction. While approaches for context disambiguation are commonly used in text mining applications, forming a standard component of information extraction pipelines, their effects on semantic similarity calculations have not been widely explored. In this work, we evaluate how inclusion and disclusion of negated and uncertain mentions of concepts from text-derived phenotypes affects similarity of patients, and the use of those profiles to predict diagnosis. We report on the effectiveness of these approaches and report a very small, yet significant, improvement in performance when classifying primary diagnosis over MIMIC-III patient visits. Frontiers Media S.A. 2021-12-06 /pmc/articles/PMC8685209/ /pubmed/34939069 http://dx.doi.org/10.3389/fdgth.2021.781227 Text en Copyright © 2021 Slater, Karwath, Hoehndorf and Gkoutos. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Digital Health
Slater, Luke T.
Karwath, Andreas
Hoehndorf, Robert
Gkoutos, Georgios V.
Effects of Negation and Uncertainty Stratification on Text-Derived Patient Profile Similarity
title Effects of Negation and Uncertainty Stratification on Text-Derived Patient Profile Similarity
title_full Effects of Negation and Uncertainty Stratification on Text-Derived Patient Profile Similarity
title_fullStr Effects of Negation and Uncertainty Stratification on Text-Derived Patient Profile Similarity
title_full_unstemmed Effects of Negation and Uncertainty Stratification on Text-Derived Patient Profile Similarity
title_short Effects of Negation and Uncertainty Stratification on Text-Derived Patient Profile Similarity
title_sort effects of negation and uncertainty stratification on text-derived patient profile similarity
topic Digital Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8685209/
https://www.ncbi.nlm.nih.gov/pubmed/34939069
http://dx.doi.org/10.3389/fdgth.2021.781227
work_keys_str_mv AT slaterluket effectsofnegationanduncertaintystratificationontextderivedpatientprofilesimilarity
AT karwathandreas effectsofnegationanduncertaintystratificationontextderivedpatientprofilesimilarity
AT hoehndorfrobert effectsofnegationanduncertaintystratificationontextderivedpatientprofilesimilarity
AT gkoutosgeorgiosv effectsofnegationanduncertaintystratificationontextderivedpatientprofilesimilarity