Cargando…

Classifying Characteristics of Opioid Use Disorder From Hospital Discharge Summaries Using Natural Language Processing

BACKGROUND: Opioid use disorder (OUD) is underdiagnosed in health system settings, limiting research on OUD using electronic health records (EHRs). Medical encounter notes can enrich structured EHR data with documented signs and symptoms of OUD and social risks and behaviors. To capture this informa...

Descripción completa

Detalles Bibliográficos
Autores principales:	Poulsen, Melissa N., Freda, Philip J., Troiani, Vanessa, Davoudi, Anahita, Mowery, Danielle L.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Public Health
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9124945/ https://www.ncbi.nlm.nih.gov/pubmed/35615042 http://dx.doi.org/10.3389/fpubh.2022.850619

_version_	1784711838275993600
author	Poulsen, Melissa N. Freda, Philip J. Troiani, Vanessa Davoudi, Anahita Mowery, Danielle L.
author_facet	Poulsen, Melissa N. Freda, Philip J. Troiani, Vanessa Davoudi, Anahita Mowery, Danielle L.
author_sort	Poulsen, Melissa N.
collection	PubMed
description	BACKGROUND: Opioid use disorder (OUD) is underdiagnosed in health system settings, limiting research on OUD using electronic health records (EHRs). Medical encounter notes can enrich structured EHR data with documented signs and symptoms of OUD and social risks and behaviors. To capture this information at scale, natural language processing (NLP) tools must be developed and evaluated. We developed and applied an annotation schema to deeply characterize OUD and related clinical, behavioral, and environmental factors, and automated the annotation schema using machine learning and deep learning-based approaches. METHODS: Using the MIMIC-III Critical Care Database, we queried hospital discharge summaries of patients with International Classification of Diseases (ICD-9) OUD diagnostic codes. We developed an annotation schema to characterize problematic opioid use, identify individuals with potential OUD, and provide psychosocial context. Two annotators reviewed discharge summaries from 100 patients. We randomly sampled patients with their associated annotated sentences and divided them into training (66 patients; 2,127 annotated sentences) and testing (29 patients; 1,149 annotated sentences) sets. We used the training set to generate features, employing three NLP algorithms/knowledge sources. We trained and tested prediction models for classification with a traditional machine learner (logistic regression) and deep learning approach (Autogluon based on ELECTRA's replaced token detection model). We applied a five-fold cross-validation approach to reduce bias in performance estimates. RESULTS: The resulting annotation schema contained 32 classes. We achieved moderate inter-annotator agreement, with F(1)-scores across all classes increasing from 48 to 66%. Five classes had a sufficient number of annotations for automation; of these, we observed consistently high performance (F(1)-scores) across training and testing sets for drug screening (training: 91–96; testing: 91–94) and opioid type (training: 86–96; testing: 86–99). Performance dropped from training and to testing sets for other drug use (training: 52–65; testing: 40–48), pain management (training: 72–78; testing: 61–78) and psychiatric (training: 73–80; testing: 72). Autogluon achieved the highest performance. CONCLUSION: This pilot study demonstrated that rich information regarding problematic opioid use can be manually identified by annotators. However, more training samples and features would improve our ability to reliably identify less common classes from clinical text, including text from outpatient settings.
format	Online Article Text
id	pubmed-9124945
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-91249452022-05-24 Classifying Characteristics of Opioid Use Disorder From Hospital Discharge Summaries Using Natural Language Processing Poulsen, Melissa N. Freda, Philip J. Troiani, Vanessa Davoudi, Anahita Mowery, Danielle L. Front Public Health Public Health BACKGROUND: Opioid use disorder (OUD) is underdiagnosed in health system settings, limiting research on OUD using electronic health records (EHRs). Medical encounter notes can enrich structured EHR data with documented signs and symptoms of OUD and social risks and behaviors. To capture this information at scale, natural language processing (NLP) tools must be developed and evaluated. We developed and applied an annotation schema to deeply characterize OUD and related clinical, behavioral, and environmental factors, and automated the annotation schema using machine learning and deep learning-based approaches. METHODS: Using the MIMIC-III Critical Care Database, we queried hospital discharge summaries of patients with International Classification of Diseases (ICD-9) OUD diagnostic codes. We developed an annotation schema to characterize problematic opioid use, identify individuals with potential OUD, and provide psychosocial context. Two annotators reviewed discharge summaries from 100 patients. We randomly sampled patients with their associated annotated sentences and divided them into training (66 patients; 2,127 annotated sentences) and testing (29 patients; 1,149 annotated sentences) sets. We used the training set to generate features, employing three NLP algorithms/knowledge sources. We trained and tested prediction models for classification with a traditional machine learner (logistic regression) and deep learning approach (Autogluon based on ELECTRA's replaced token detection model). We applied a five-fold cross-validation approach to reduce bias in performance estimates. RESULTS: The resulting annotation schema contained 32 classes. We achieved moderate inter-annotator agreement, with F(1)-scores across all classes increasing from 48 to 66%. Five classes had a sufficient number of annotations for automation; of these, we observed consistently high performance (F(1)-scores) across training and testing sets for drug screening (training: 91–96; testing: 91–94) and opioid type (training: 86–96; testing: 86–99). Performance dropped from training and to testing sets for other drug use (training: 52–65; testing: 40–48), pain management (training: 72–78; testing: 61–78) and psychiatric (training: 73–80; testing: 72). Autogluon achieved the highest performance. CONCLUSION: This pilot study demonstrated that rich information regarding problematic opioid use can be manually identified by annotators. However, more training samples and features would improve our ability to reliably identify less common classes from clinical text, including text from outpatient settings. Frontiers Media S.A. 2022-05-09 /pmc/articles/PMC9124945/ /pubmed/35615042 http://dx.doi.org/10.3389/fpubh.2022.850619 Text en Copyright © 2022 Poulsen, Freda, Troiani, Davoudi and Mowery. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Public Health Poulsen, Melissa N. Freda, Philip J. Troiani, Vanessa Davoudi, Anahita Mowery, Danielle L. Classifying Characteristics of Opioid Use Disorder From Hospital Discharge Summaries Using Natural Language Processing
title	Classifying Characteristics of Opioid Use Disorder From Hospital Discharge Summaries Using Natural Language Processing
title_full	Classifying Characteristics of Opioid Use Disorder From Hospital Discharge Summaries Using Natural Language Processing
title_fullStr	Classifying Characteristics of Opioid Use Disorder From Hospital Discharge Summaries Using Natural Language Processing
title_full_unstemmed	Classifying Characteristics of Opioid Use Disorder From Hospital Discharge Summaries Using Natural Language Processing
title_short	Classifying Characteristics of Opioid Use Disorder From Hospital Discharge Summaries Using Natural Language Processing
title_sort	classifying characteristics of opioid use disorder from hospital discharge summaries using natural language processing
topic	Public Health
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9124945/ https://www.ncbi.nlm.nih.gov/pubmed/35615042 http://dx.doi.org/10.3389/fpubh.2022.850619
work_keys_str_mv	AT poulsenmelissan classifyingcharacteristicsofopioidusedisorderfromhospitaldischargesummariesusingnaturallanguageprocessing AT fredaphilipj classifyingcharacteristicsofopioidusedisorderfromhospitaldischargesummariesusingnaturallanguageprocessing AT troianivanessa classifyingcharacteristicsofopioidusedisorderfromhospitaldischargesummariesusingnaturallanguageprocessing AT davoudianahita classifyingcharacteristicsofopioidusedisorderfromhospitaldischargesummariesusingnaturallanguageprocessing AT mowerydaniellel classifyingcharacteristicsofopioidusedisorderfromhospitaldischargesummariesusingnaturallanguageprocessing

Classifying Characteristics of Opioid Use Disorder From Hospital Discharge Summaries Using Natural Language Processing

Ejemplares similares