Cargando…
A natural language processing approach for identifying temporal disease onset information from mental healthcare text
Receiving timely and appropriate treatment is crucial for better health outcomes, and research on the contribution of specific variables is essential. In the mental health domain, an important research variable is the date of psychosis symptom onset, as longer delays in treatment are associated with...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7804184/ https://www.ncbi.nlm.nih.gov/pubmed/33436814 http://dx.doi.org/10.1038/s41598-020-80457-0 |
_version_ | 1783636105695854592 |
---|---|
author | Viani, Natalia Botelle, Riley Kerwin, Jack Yin, Lucia Patel, Rashmi Stewart, Robert Velupillai, Sumithra |
author_facet | Viani, Natalia Botelle, Riley Kerwin, Jack Yin, Lucia Patel, Rashmi Stewart, Robert Velupillai, Sumithra |
author_sort | Viani, Natalia |
collection | PubMed |
description | Receiving timely and appropriate treatment is crucial for better health outcomes, and research on the contribution of specific variables is essential. In the mental health domain, an important research variable is the date of psychosis symptom onset, as longer delays in treatment are associated with worse intervention outcomes. The growing adoption of electronic health records (EHRs) within mental health services provides an invaluable opportunity to study this problem at scale retrospectively. However, disease onset information is often only available in open text fields, requiring natural language processing (NLP) techniques for automated analyses. Since this variable can be documented at different points during a patient’s care, NLP methods that model clinical and temporal associations are needed. We address the identification of psychosis onset by: 1) manually annotating a corpus of mental health EHRs with disease onset mentions, 2) modelling the underlying NLP problem as a paragraph classification approach, and 3) combining multiple onset paragraphs at the patient level to generate a ranked list of likely disease onset dates. For 22/31 test patients (71%) the correct onset date was found among the top-3 NLP predictions. The proposed approach was also applied at scale, allowing an onset date to be estimated for 2483 patients. |
format | Online Article Text |
id | pubmed-7804184 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-78041842021-01-13 A natural language processing approach for identifying temporal disease onset information from mental healthcare text Viani, Natalia Botelle, Riley Kerwin, Jack Yin, Lucia Patel, Rashmi Stewart, Robert Velupillai, Sumithra Sci Rep Article Receiving timely and appropriate treatment is crucial for better health outcomes, and research on the contribution of specific variables is essential. In the mental health domain, an important research variable is the date of psychosis symptom onset, as longer delays in treatment are associated with worse intervention outcomes. The growing adoption of electronic health records (EHRs) within mental health services provides an invaluable opportunity to study this problem at scale retrospectively. However, disease onset information is often only available in open text fields, requiring natural language processing (NLP) techniques for automated analyses. Since this variable can be documented at different points during a patient’s care, NLP methods that model clinical and temporal associations are needed. We address the identification of psychosis onset by: 1) manually annotating a corpus of mental health EHRs with disease onset mentions, 2) modelling the underlying NLP problem as a paragraph classification approach, and 3) combining multiple onset paragraphs at the patient level to generate a ranked list of likely disease onset dates. For 22/31 test patients (71%) the correct onset date was found among the top-3 NLP predictions. The proposed approach was also applied at scale, allowing an onset date to be estimated for 2483 patients. Nature Publishing Group UK 2021-01-12 /pmc/articles/PMC7804184/ /pubmed/33436814 http://dx.doi.org/10.1038/s41598-020-80457-0 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Viani, Natalia Botelle, Riley Kerwin, Jack Yin, Lucia Patel, Rashmi Stewart, Robert Velupillai, Sumithra A natural language processing approach for identifying temporal disease onset information from mental healthcare text |
title | A natural language processing approach for identifying temporal disease onset information from mental healthcare text |
title_full | A natural language processing approach for identifying temporal disease onset information from mental healthcare text |
title_fullStr | A natural language processing approach for identifying temporal disease onset information from mental healthcare text |
title_full_unstemmed | A natural language processing approach for identifying temporal disease onset information from mental healthcare text |
title_short | A natural language processing approach for identifying temporal disease onset information from mental healthcare text |
title_sort | natural language processing approach for identifying temporal disease onset information from mental healthcare text |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7804184/ https://www.ncbi.nlm.nih.gov/pubmed/33436814 http://dx.doi.org/10.1038/s41598-020-80457-0 |
work_keys_str_mv | AT vianinatalia anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT botelleriley anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT kerwinjack anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT yinlucia anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT patelrashmi anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT stewartrobert anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT velupillaisumithra anaturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT vianinatalia naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT botelleriley naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT kerwinjack naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT yinlucia naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT patelrashmi naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT stewartrobert naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext AT velupillaisumithra naturallanguageprocessingapproachforidentifyingtemporaldiseaseonsetinformationfrommentalhealthcaretext |