Cargando…

A natural language processing approach for identifying temporal disease onset information from mental healthcare text

Receiving timely and appropriate treatment is crucial for better health outcomes, and research on the contribution of specific variables is essential. In the mental health domain, an important research variable is the date of psychosis symptom onset, as longer delays in treatment are associated with...

Descripción completa

Detalles Bibliográficos
Autores principales: Viani, Natalia, Botelle, Riley, Kerwin, Jack, Yin, Lucia, Patel, Rashmi, Stewart, Robert, Velupillai, Sumithra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7804184/
https://www.ncbi.nlm.nih.gov/pubmed/33436814
http://dx.doi.org/10.1038/s41598-020-80457-0
_version_ 1783636105695854592
author Viani, Natalia
Botelle, Riley
Kerwin, Jack
Yin, Lucia
Patel, Rashmi
Stewart, Robert
Velupillai, Sumithra
author_facet Viani, Natalia
Botelle, Riley
Kerwin, Jack
Yin, Lucia
Patel, Rashmi
Stewart, Robert
Velupillai, Sumithra
author_sort Viani, Natalia
collection PubMed
description Receiving timely and appropriate treatment is crucial for better health outcomes, and research on the contribution of specific variables is essential. In the mental health domain, an important research variable is the date of psychosis symptom onset, as longer delays in treatment are associated with worse intervention outcomes. The growing adoption of electronic health records (EHRs) within mental health services provides an invaluable opportunity to study this problem at scale retrospectively. However, disease onset information is often only available in open text fields, requiring natural language processing (NLP) techniques for automated analyses. Since this variable can be documented at different points during a patient’s care, NLP methods that model clinical and temporal associations are needed. We address the identification of psychosis onset by: 1) manually annotating a corpus of mental health EHRs with disease onset mentions, 2) modelling the underlying NLP problem as a paragraph classification approach, and 3) combining multiple onset paragraphs at the patient level to generate a ranked list of likely disease onset dates. For 22/31 test patients (71%) the correct onset date was found among the top-3 NLP predictions. The proposed approach was also applied at scale, allowing an onset date to be estimated for 2483 patients.
format Online
Article
Text
id pubmed-7804184
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-78041842021-01-13 A natural language processing approach for identifying temporal disease onset information from mental healthcare text Viani, Natalia Botelle, Riley Kerwin, Jack Yin, Lucia Patel, Rashmi Stewart, Robert Velupillai, Sumithra Sci Rep Article Receiving timely and appropriate treatment is crucial for better health outcomes, and research on the contribution of specific variables is essential. In the mental health domain, an important research variable is the date of psychosis symptom onset, as longer delays in treatment are associated with worse intervention outcomes. The growing adoption of electronic health records (EHRs) within mental health services provides an invaluable opportunity to study this problem at scale retrospectively. However, disease onset information is often only available in open text fields, requiring natural language processing (NLP) techniques for automated analyses. Since this variable can be documented at different points during a patient’s care, NLP methods that model clinical and temporal associations are needed. We address the identification of psychosis onset by: 1) manually annotating a corpus of mental health EHRs with disease onset mentions, 2) modelling the underlying NLP problem as a paragraph classification approach, and 3) combining multiple onset paragraphs at the patient level to generate a ranked list of likely disease onset dates. For 22/31 test patients (71%) the correct onset date was found among the top-3 NLP predictions. The proposed approach was also applied at scale, allowing an onset date to be estimated for 2483 patients. Nature Publishing Group UK 2021-01-12 /pmc/articles/PMC7804184/ /pubmed/33436814 http://dx.doi.org/10.1038/s41598-020-80457-0 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Viani, Natalia
Botelle, Riley
Kerwin, Jack
Yin, Lucia
Patel, Rashmi
Stewart, Robert
Velupillai, Sumithra
A natural language processing approach for identifying temporal disease onset information from mental healthcare text
title A natural language processing approach for identifying temporal disease onset information from mental healthcare text
title_full A natural language processing approach for identifying temporal disease onset information from mental healthcare text
title_fullStr A natural language processing approach for identifying temporal disease onset information from mental healthcare text
title_full_unstemmed A natural language processing approach for identifying temporal disease onset information from mental healthcare text
title_short A natural language processing approach for identifying temporal disease onset information from mental healthcare text
title_sort natural language processing approach for identifying temporal disease onset information from mental healthcare text
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7804184/
https://www.ncbi.nlm.nih.gov/pubmed/33436814
http://dx.doi.org/10.1038/s41598-020-80457-0
work_keys_str_mv AT vianinatalia anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT botelleriley anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT kerwinjack anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT yinlucia anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT patelrashmi anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT stewartrobert anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT velupillaisumithra anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT vianinatalia naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT botelleriley naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT kerwinjack naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT yinlucia naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT patelrashmi naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT stewartrobert naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext
AT velupillaisumithra naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext