Cargando…

Discovering body site and severity modifiers in clinical texts

OBJECTIVE: To research computational methods for discovering body site and severity modifiers in clinical texts. METHODS: We cast the task of discovering body site and severity modifiers as a relation extraction problem in the context of a supervised machine learning framework. We utilize rich lingu...

Descripción completa

Detalles Bibliográficos
Autores principales: Dligach, Dmitriy, Bethard, Steven, Becker, Lee, Miller, Timothy, Savova, Guergana K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3994852/
https://www.ncbi.nlm.nih.gov/pubmed/24091648
http://dx.doi.org/10.1136/amiajnl-2013-001766
_version_ 1782312791622811648
author Dligach, Dmitriy
Bethard, Steven
Becker, Lee
Miller, Timothy
Savova, Guergana K
author_facet Dligach, Dmitriy
Bethard, Steven
Becker, Lee
Miller, Timothy
Savova, Guergana K
author_sort Dligach, Dmitriy
collection PubMed
description OBJECTIVE: To research computational methods for discovering body site and severity modifiers in clinical texts. METHODS: We cast the task of discovering body site and severity modifiers as a relation extraction problem in the context of a supervised machine learning framework. We utilize rich linguistic features to represent the pairs of relation arguments and delegate the decision about the nature of the relationship between them to a support vector machine model. We evaluate our models using two corpora that annotate body site and severity modifiers. We also compare the model performance to a number of rule-based baselines. We conduct cross-domain portability experiments. In addition, we carry out feature ablation experiments to determine the contribution of various feature groups. Finally, we perform error analysis and report the sources of errors. RESULTS: The performance of our method for discovering body site modifiers achieves F1 of 0.740–0.908 and our method for discovering severity modifiers achieves F1 of 0.905–0.929. DISCUSSION: Results indicate that both methods perform well on both in-domain and out-domain data, approaching the performance of human annotators. The most salient features are token and named entity features, although syntactic dependency features also contribute to the overall performance. The dominant sources of errors are infrequent patterns in the data and inability of the system to discern deeper semantic structures. CONCLUSIONS: We investigated computational methods for discovering body site and severity modifiers in clinical texts. Our best system is released open source as part of the clinical Text Analysis and Knowledge Extraction System (cTAKES).
format Online
Article
Text
id pubmed-3994852
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-39948522014-04-22 Discovering body site and severity modifiers in clinical texts Dligach, Dmitriy Bethard, Steven Becker, Lee Miller, Timothy Savova, Guergana K J Am Med Inform Assoc Research and Applications OBJECTIVE: To research computational methods for discovering body site and severity modifiers in clinical texts. METHODS: We cast the task of discovering body site and severity modifiers as a relation extraction problem in the context of a supervised machine learning framework. We utilize rich linguistic features to represent the pairs of relation arguments and delegate the decision about the nature of the relationship between them to a support vector machine model. We evaluate our models using two corpora that annotate body site and severity modifiers. We also compare the model performance to a number of rule-based baselines. We conduct cross-domain portability experiments. In addition, we carry out feature ablation experiments to determine the contribution of various feature groups. Finally, we perform error analysis and report the sources of errors. RESULTS: The performance of our method for discovering body site modifiers achieves F1 of 0.740–0.908 and our method for discovering severity modifiers achieves F1 of 0.905–0.929. DISCUSSION: Results indicate that both methods perform well on both in-domain and out-domain data, approaching the performance of human annotators. The most salient features are token and named entity features, although syntactic dependency features also contribute to the overall performance. The dominant sources of errors are infrequent patterns in the data and inability of the system to discern deeper semantic structures. CONCLUSIONS: We investigated computational methods for discovering body site and severity modifiers in clinical texts. Our best system is released open source as part of the clinical Text Analysis and Knowledge Extraction System (cTAKES). BMJ Publishing Group 2014-05 2013-10-03 /pmc/articles/PMC3994852/ /pubmed/24091648 http://dx.doi.org/10.1136/amiajnl-2013-001766 Text en Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 3.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Research and Applications
Dligach, Dmitriy
Bethard, Steven
Becker, Lee
Miller, Timothy
Savova, Guergana K
Discovering body site and severity modifiers in clinical texts
title Discovering body site and severity modifiers in clinical texts
title_full Discovering body site and severity modifiers in clinical texts
title_fullStr Discovering body site and severity modifiers in clinical texts
title_full_unstemmed Discovering body site and severity modifiers in clinical texts
title_short Discovering body site and severity modifiers in clinical texts
title_sort discovering body site and severity modifiers in clinical texts
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3994852/
https://www.ncbi.nlm.nih.gov/pubmed/24091648
http://dx.doi.org/10.1136/amiajnl-2013-001766
work_keys_str_mv AT dligachdmitriy discoveringbodysiteandseveritymodifiersinclinicaltexts
AT bethardsteven discoveringbodysiteandseveritymodifiersinclinicaltexts
AT beckerlee discoveringbodysiteandseveritymodifiersinclinicaltexts
AT millertimothy discoveringbodysiteandseveritymodifiersinclinicaltexts
AT savovaguerganak discoveringbodysiteandseveritymodifiersinclinicaltexts