Cargando…

Development of a Corpus Annotated With Mentions of Pain in Mental Health Records: Natural Language Processing Approach

BACKGROUND: Pain is a widespread issue, with 20% of adults (1 in 5) experiencing it globally. A strong association has been demonstrated between pain and mental health conditions, and this association is known to exacerbate disability and impairment. Pain is also known to be strongly related to emot...

Descripción completa

Detalles Bibliográficos
Autores principales: Chaturvedi, Jaya, Chance, Natalia, Mirza, Luwaiza, Vernugopan, Veshalee, Velupillai, Sumithra, Stewart, Robert, Roberts, Angus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10337440/
https://www.ncbi.nlm.nih.gov/pubmed/37358897
http://dx.doi.org/10.2196/45849
_version_ 1785071425126662144
author Chaturvedi, Jaya
Chance, Natalia
Mirza, Luwaiza
Vernugopan, Veshalee
Velupillai, Sumithra
Stewart, Robert
Roberts, Angus
author_facet Chaturvedi, Jaya
Chance, Natalia
Mirza, Luwaiza
Vernugopan, Veshalee
Velupillai, Sumithra
Stewart, Robert
Roberts, Angus
author_sort Chaturvedi, Jaya
collection PubMed
description BACKGROUND: Pain is a widespread issue, with 20% of adults (1 in 5) experiencing it globally. A strong association has been demonstrated between pain and mental health conditions, and this association is known to exacerbate disability and impairment. Pain is also known to be strongly related to emotions, which can lead to damaging consequences. As pain is a common reason for people to access health care facilities, electronic health records (EHRs) are a potential source of information on this pain. Mental health EHRs could be particularly beneficial since they can show the overlap of pain with mental health. Most mental health EHRs contain the majority of their information within the free-text sections of the records. However, it is challenging to extract information from free text. Natural language processing (NLP) methods are therefore required to extract this information from the text. OBJECTIVE: This research describes the development of a corpus of manually labeled mentions of pain and pain-related entities from the documents of a mental health EHR database, for use in the development and evaluation of future NLP methods. METHODS: The EHR database used, Clinical Record Interactive Search, consists of anonymized patient records from The South London and Maudsley National Health Service Foundation Trust in the United Kingdom. The corpus was developed through a process of manual annotation where pain mentions were marked as relevant (ie, referring to physical pain afflicting the patient), negated (ie, indicating absence of pain), or not relevant (ie, referring to pain affecting someone other than the patient, or metaphorical and hypothetical mentions). Relevant mentions were also annotated with additional attributes such as anatomical location affected by pain, pain character, and pain management measures, if mentioned. RESULTS: A total of 5644 annotations were collected from 1985 documents (723 patients). Over 70% (n=4028) of the mentions found within the documents were annotated as relevant, and about half of these mentions also included the anatomical location affected by the pain. The most common pain character was chronic pain, and the most commonly mentioned anatomical location was the chest. Most annotations (n=1857, 33%) were from patients who had a primary diagnosis of mood disorders (International Classification of Diseases—10th edition, chapter F30-39). CONCLUSIONS: This research has helped better understand how pain is mentioned within the context of mental health EHRs and provided insight into the kind of information that is typically mentioned around pain in such a data source. In future work, the extracted information will be used to develop and evaluate a machine learning–based NLP application to automatically extract relevant pain information from EHR databases.
format Online
Article
Text
id pubmed-10337440
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-103374402023-07-13 Development of a Corpus Annotated With Mentions of Pain in Mental Health Records: Natural Language Processing Approach Chaturvedi, Jaya Chance, Natalia Mirza, Luwaiza Vernugopan, Veshalee Velupillai, Sumithra Stewart, Robert Roberts, Angus JMIR Form Res Original Paper BACKGROUND: Pain is a widespread issue, with 20% of adults (1 in 5) experiencing it globally. A strong association has been demonstrated between pain and mental health conditions, and this association is known to exacerbate disability and impairment. Pain is also known to be strongly related to emotions, which can lead to damaging consequences. As pain is a common reason for people to access health care facilities, electronic health records (EHRs) are a potential source of information on this pain. Mental health EHRs could be particularly beneficial since they can show the overlap of pain with mental health. Most mental health EHRs contain the majority of their information within the free-text sections of the records. However, it is challenging to extract information from free text. Natural language processing (NLP) methods are therefore required to extract this information from the text. OBJECTIVE: This research describes the development of a corpus of manually labeled mentions of pain and pain-related entities from the documents of a mental health EHR database, for use in the development and evaluation of future NLP methods. METHODS: The EHR database used, Clinical Record Interactive Search, consists of anonymized patient records from The South London and Maudsley National Health Service Foundation Trust in the United Kingdom. The corpus was developed through a process of manual annotation where pain mentions were marked as relevant (ie, referring to physical pain afflicting the patient), negated (ie, indicating absence of pain), or not relevant (ie, referring to pain affecting someone other than the patient, or metaphorical and hypothetical mentions). Relevant mentions were also annotated with additional attributes such as anatomical location affected by pain, pain character, and pain management measures, if mentioned. RESULTS: A total of 5644 annotations were collected from 1985 documents (723 patients). Over 70% (n=4028) of the mentions found within the documents were annotated as relevant, and about half of these mentions also included the anatomical location affected by the pain. The most common pain character was chronic pain, and the most commonly mentioned anatomical location was the chest. Most annotations (n=1857, 33%) were from patients who had a primary diagnosis of mood disorders (International Classification of Diseases—10th edition, chapter F30-39). CONCLUSIONS: This research has helped better understand how pain is mentioned within the context of mental health EHRs and provided insight into the kind of information that is typically mentioned around pain in such a data source. In future work, the extracted information will be used to develop and evaluate a machine learning–based NLP application to automatically extract relevant pain information from EHR databases. JMIR Publications 2023-06-26 /pmc/articles/PMC10337440/ /pubmed/37358897 http://dx.doi.org/10.2196/45849 Text en ©Jaya Chaturvedi, Natalia Chance, Luwaiza Mirza, Veshalee Vernugopan, Sumithra Velupillai, Robert Stewart, Angus Roberts. Originally published in JMIR Formative Research (https://formative.jmir.org), 26.06.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Chaturvedi, Jaya
Chance, Natalia
Mirza, Luwaiza
Vernugopan, Veshalee
Velupillai, Sumithra
Stewart, Robert
Roberts, Angus
Development of a Corpus Annotated With Mentions of Pain in Mental Health Records: Natural Language Processing Approach
title Development of a Corpus Annotated With Mentions of Pain in Mental Health Records: Natural Language Processing Approach
title_full Development of a Corpus Annotated With Mentions of Pain in Mental Health Records: Natural Language Processing Approach
title_fullStr Development of a Corpus Annotated With Mentions of Pain in Mental Health Records: Natural Language Processing Approach
title_full_unstemmed Development of a Corpus Annotated With Mentions of Pain in Mental Health Records: Natural Language Processing Approach
title_short Development of a Corpus Annotated With Mentions of Pain in Mental Health Records: Natural Language Processing Approach
title_sort development of a corpus annotated with mentions of pain in mental health records: natural language processing approach
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10337440/
https://www.ncbi.nlm.nih.gov/pubmed/37358897
http://dx.doi.org/10.2196/45849
work_keys_str_mv AT chaturvedijaya developmentofacorpusannotatedwithmentionsofpaininmentalhealthrecordsnaturallanguageprocessingapproach
AT chancenatalia developmentofacorpusannotatedwithmentionsofpaininmentalhealthrecordsnaturallanguageprocessingapproach
AT mirzaluwaiza developmentofacorpusannotatedwithmentionsofpaininmentalhealthrecordsnaturallanguageprocessingapproach
AT vernugopanveshalee developmentofacorpusannotatedwithmentionsofpaininmentalhealthrecordsnaturallanguageprocessingapproach
AT velupillaisumithra developmentofacorpusannotatedwithmentionsofpaininmentalhealthrecordsnaturallanguageprocessingapproach
AT stewartrobert developmentofacorpusannotatedwithmentionsofpaininmentalhealthrecordsnaturallanguageprocessingapproach
AT robertsangus developmentofacorpusannotatedwithmentionsofpaininmentalhealthrecordsnaturallanguageprocessingapproach