Cargando…

Using natural language processing to automatically classify written self-reported narratives by patients with migraine or cluster headache

BACKGROUND: Headache medicine is largely based on detailed history taking by physicians analysing patients’ descriptions of headache. Natural language processing (NLP) structures and processes linguistic data into quantifiable units. In this study, we apply these digital techniques on self-reported...

Descripción completa

Detalles Bibliográficos
Autores principales: Vandenbussche, Nicolas, Van Hee, Cynthia, Hoste, Véronique, Paemeleire, Koen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Milan 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9524092/
https://www.ncbi.nlm.nih.gov/pubmed/36180844
http://dx.doi.org/10.1186/s10194-022-01490-0
_version_ 1784800430997372928
author Vandenbussche, Nicolas
Van Hee, Cynthia
Hoste, Véronique
Paemeleire, Koen
author_facet Vandenbussche, Nicolas
Van Hee, Cynthia
Hoste, Véronique
Paemeleire, Koen
author_sort Vandenbussche, Nicolas
collection PubMed
description BACKGROUND: Headache medicine is largely based on detailed history taking by physicians analysing patients’ descriptions of headache. Natural language processing (NLP) structures and processes linguistic data into quantifiable units. In this study, we apply these digital techniques on self-reported narratives by patients with headache disorders to research the potential of analysing and automatically classifying human-generated text and information extraction in clinical contexts. METHODS: A prospective cross-sectional clinical trial collected self-reported narratives on headache disorders from participants with either migraine or cluster headache. NLP was applied for the analysis of lexical, semantic and thematic properties of the texts. Machine learning (ML) algorithms were applied to classify the descriptions of headache attacks from individual participants into their correct group (migraine versus cluster headache). RESULTS: One-hundred and twenty-one patients (81 participants with migraine and 40 participants with cluster headache) provided a self-reported narrative on their headache disorder. Lexical analysis of this text corpus resulted in several specific key words per diagnostic group (cluster headache: Dutch (nl): “oog” | English (en): “eye”, nl: “pijn” | en: “pain” and nl: “terug” | en: “back/to come back”; migraine: nl: “hoofdpijn” | en: “headache”, nl: “stress” | en: “stress” and nl: “misselijkheid” | en: “nausea”). Thematic and sentiment analysis of text revealed largely negative sentiment in texts by both patients with migraine and cluster headache. Logistic regression and support vector machine algorithms with different feature groups performed best for the classification of attack descriptions (with F1-scores for detecting cluster headache varying between 0.82 and 0.86) compared to naïve Bayes classifiers. CONCLUSIONS: Differences in lexical choices between patients with migraine and cluster headache are detected with NLP and are congruent with domain expert knowledge of the disorders. Our research shows that ML algorithms have potential to classify patients’ self-reported narratives of migraine or cluster headache with good performance. NLP shows its capability to discern relevant linguistic aspects in narratives from patients with different headache disorders and demonstrates relevance in clinical information extraction. The potential benefits on the classification performance of larger datasets and neural NLP methods can be investigated in the future. TRIAL REGISTRATION: This study was registered with clinicaltrials.gov with ID NCT05377437. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s10194-022-01490-0.
format Online
Article
Text
id pubmed-9524092
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Milan
record_format MEDLINE/PubMed
spelling pubmed-95240922022-10-01 Using natural language processing to automatically classify written self-reported narratives by patients with migraine or cluster headache Vandenbussche, Nicolas Van Hee, Cynthia Hoste, Véronique Paemeleire, Koen J Headache Pain Research BACKGROUND: Headache medicine is largely based on detailed history taking by physicians analysing patients’ descriptions of headache. Natural language processing (NLP) structures and processes linguistic data into quantifiable units. In this study, we apply these digital techniques on self-reported narratives by patients with headache disorders to research the potential of analysing and automatically classifying human-generated text and information extraction in clinical contexts. METHODS: A prospective cross-sectional clinical trial collected self-reported narratives on headache disorders from participants with either migraine or cluster headache. NLP was applied for the analysis of lexical, semantic and thematic properties of the texts. Machine learning (ML) algorithms were applied to classify the descriptions of headache attacks from individual participants into their correct group (migraine versus cluster headache). RESULTS: One-hundred and twenty-one patients (81 participants with migraine and 40 participants with cluster headache) provided a self-reported narrative on their headache disorder. Lexical analysis of this text corpus resulted in several specific key words per diagnostic group (cluster headache: Dutch (nl): “oog” | English (en): “eye”, nl: “pijn” | en: “pain” and nl: “terug” | en: “back/to come back”; migraine: nl: “hoofdpijn” | en: “headache”, nl: “stress” | en: “stress” and nl: “misselijkheid” | en: “nausea”). Thematic and sentiment analysis of text revealed largely negative sentiment in texts by both patients with migraine and cluster headache. Logistic regression and support vector machine algorithms with different feature groups performed best for the classification of attack descriptions (with F1-scores for detecting cluster headache varying between 0.82 and 0.86) compared to naïve Bayes classifiers. CONCLUSIONS: Differences in lexical choices between patients with migraine and cluster headache are detected with NLP and are congruent with domain expert knowledge of the disorders. Our research shows that ML algorithms have potential to classify patients’ self-reported narratives of migraine or cluster headache with good performance. NLP shows its capability to discern relevant linguistic aspects in narratives from patients with different headache disorders and demonstrates relevance in clinical information extraction. The potential benefits on the classification performance of larger datasets and neural NLP methods can be investigated in the future. TRIAL REGISTRATION: This study was registered with clinicaltrials.gov with ID NCT05377437. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s10194-022-01490-0. Springer Milan 2022-09-30 /pmc/articles/PMC9524092/ /pubmed/36180844 http://dx.doi.org/10.1186/s10194-022-01490-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Vandenbussche, Nicolas
Van Hee, Cynthia
Hoste, Véronique
Paemeleire, Koen
Using natural language processing to automatically classify written self-reported narratives by patients with migraine or cluster headache
title Using natural language processing to automatically classify written self-reported narratives by patients with migraine or cluster headache
title_full Using natural language processing to automatically classify written self-reported narratives by patients with migraine or cluster headache
title_fullStr Using natural language processing to automatically classify written self-reported narratives by patients with migraine or cluster headache
title_full_unstemmed Using natural language processing to automatically classify written self-reported narratives by patients with migraine or cluster headache
title_short Using natural language processing to automatically classify written self-reported narratives by patients with migraine or cluster headache
title_sort using natural language processing to automatically classify written self-reported narratives by patients with migraine or cluster headache
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9524092/
https://www.ncbi.nlm.nih.gov/pubmed/36180844
http://dx.doi.org/10.1186/s10194-022-01490-0
work_keys_str_mv AT vandenbusschenicolas usingnaturallanguageprocessingtoautomaticallyclassifywrittenselfreportednarrativesbypatientswithmigraineorclusterheadache
AT vanheecynthia usingnaturallanguageprocessingtoautomaticallyclassifywrittenselfreportednarrativesbypatientswithmigraineorclusterheadache
AT hosteveronique usingnaturallanguageprocessingtoautomaticallyclassifywrittenselfreportednarrativesbypatientswithmigraineorclusterheadache
AT paemeleirekoen usingnaturallanguageprocessingtoautomaticallyclassifywrittenselfreportednarrativesbypatientswithmigraineorclusterheadache