Cargando…
Incorporating Unstructured Patient Narratives and Health Insurance Claims Data in Pharmacovigilance: Natural Language Processing Analysis of Patient-Generated Texts About Systemic Lupus Erythematosus
BACKGROUND: Gaining insights that cannot be obtained from health care databases from patients has become an important topic in pharmacovigilance. OBJECTIVE: Our objective was to demonstrate a use case, in which patient-generated data were incorporated in pharmacovigilance, to understand the epidemio...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8278300/ https://www.ncbi.nlm.nih.gov/pubmed/34255719 http://dx.doi.org/10.2196/29238 |
_version_ | 1783722236458303488 |
---|---|
author | Matsuda, Shinichi Ohtomo, Takumi Tomizawa, Shiho Miyano, Yuki Mogi, Miwako Kuriki, Hiroshi Nakayama, Terumi Watanabe, Shinichi |
author_facet | Matsuda, Shinichi Ohtomo, Takumi Tomizawa, Shiho Miyano, Yuki Mogi, Miwako Kuriki, Hiroshi Nakayama, Terumi Watanabe, Shinichi |
author_sort | Matsuda, Shinichi |
collection | PubMed |
description | BACKGROUND: Gaining insights that cannot be obtained from health care databases from patients has become an important topic in pharmacovigilance. OBJECTIVE: Our objective was to demonstrate a use case, in which patient-generated data were incorporated in pharmacovigilance, to understand the epidemiology and burden of illness in Japanese patients with systemic lupus erythematosus. METHODS: We used data on systemic lupus erythematosus, an autoimmune disease that substantially impairs quality of life, from 2 independent data sets. To understand the disease’s epidemiology, we analyzed a Japanese health insurance claims database. To understand the disease’s burden, we analyzed text data collected from Japanese disease blogs (tōbyōki) written by patients with systemic lupus erythematosus. Natural language processing was applied to these texts to identify frequent patient-level complaints, and term frequency–inverse document frequency was used to explore patient burden during treatment. We explored health-related quality of life based on patient descriptions. RESULTS: We analyzed data from 4694 and 635 patients with systemic lupus erythematosus in the health insurance claims database and tōbyōki blogs, respectively. Based on health insurance claims data, the prevalence of systemic lupus erythematosus is 107.70 per 100,000 persons. Tōbyōki text data analysis showed that pain-related words (eg, pain, severe pain, arthralgia) became more important after starting treatment. We also found an increase in patients’ references to mobility and self-care over time, which indicated increased attention to physical disability due to disease progression. CONCLUSIONS: A classical medical database represents only a part of a patient's entire treatment experience, and analysis using solely such a database cannot represent patient-level symptoms or patient concerns about treatments. This study showed that analysis of tōbyōki blogs can provide added information on patient-level details, advancing patient-centric pharmacovigilance. |
format | Online Article Text |
id | pubmed-8278300 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-82783002021-07-26 Incorporating Unstructured Patient Narratives and Health Insurance Claims Data in Pharmacovigilance: Natural Language Processing Analysis of Patient-Generated Texts About Systemic Lupus Erythematosus Matsuda, Shinichi Ohtomo, Takumi Tomizawa, Shiho Miyano, Yuki Mogi, Miwako Kuriki, Hiroshi Nakayama, Terumi Watanabe, Shinichi JMIR Public Health Surveill Original Paper BACKGROUND: Gaining insights that cannot be obtained from health care databases from patients has become an important topic in pharmacovigilance. OBJECTIVE: Our objective was to demonstrate a use case, in which patient-generated data were incorporated in pharmacovigilance, to understand the epidemiology and burden of illness in Japanese patients with systemic lupus erythematosus. METHODS: We used data on systemic lupus erythematosus, an autoimmune disease that substantially impairs quality of life, from 2 independent data sets. To understand the disease’s epidemiology, we analyzed a Japanese health insurance claims database. To understand the disease’s burden, we analyzed text data collected from Japanese disease blogs (tōbyōki) written by patients with systemic lupus erythematosus. Natural language processing was applied to these texts to identify frequent patient-level complaints, and term frequency–inverse document frequency was used to explore patient burden during treatment. We explored health-related quality of life based on patient descriptions. RESULTS: We analyzed data from 4694 and 635 patients with systemic lupus erythematosus in the health insurance claims database and tōbyōki blogs, respectively. Based on health insurance claims data, the prevalence of systemic lupus erythematosus is 107.70 per 100,000 persons. Tōbyōki text data analysis showed that pain-related words (eg, pain, severe pain, arthralgia) became more important after starting treatment. We also found an increase in patients’ references to mobility and self-care over time, which indicated increased attention to physical disability due to disease progression. CONCLUSIONS: A classical medical database represents only a part of a patient's entire treatment experience, and analysis using solely such a database cannot represent patient-level symptoms or patient concerns about treatments. This study showed that analysis of tōbyōki blogs can provide added information on patient-level details, advancing patient-centric pharmacovigilance. JMIR Publications 2021-06-29 /pmc/articles/PMC8278300/ /pubmed/34255719 http://dx.doi.org/10.2196/29238 Text en ©Shinichi Matsuda, Takumi Ohtomo, Shiho Tomizawa, Yuki Miyano, Miwako Mogi, Hiroshi Kuriki, Terumi Nakayama, Shinichi Watanabe. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 29.06.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Matsuda, Shinichi Ohtomo, Takumi Tomizawa, Shiho Miyano, Yuki Mogi, Miwako Kuriki, Hiroshi Nakayama, Terumi Watanabe, Shinichi Incorporating Unstructured Patient Narratives and Health Insurance Claims Data in Pharmacovigilance: Natural Language Processing Analysis of Patient-Generated Texts About Systemic Lupus Erythematosus |
title | Incorporating Unstructured Patient Narratives and Health Insurance Claims Data in Pharmacovigilance: Natural Language Processing Analysis of Patient-Generated Texts About Systemic Lupus Erythematosus |
title_full | Incorporating Unstructured Patient Narratives and Health Insurance Claims Data in Pharmacovigilance: Natural Language Processing Analysis of Patient-Generated Texts About Systemic Lupus Erythematosus |
title_fullStr | Incorporating Unstructured Patient Narratives and Health Insurance Claims Data in Pharmacovigilance: Natural Language Processing Analysis of Patient-Generated Texts About Systemic Lupus Erythematosus |
title_full_unstemmed | Incorporating Unstructured Patient Narratives and Health Insurance Claims Data in Pharmacovigilance: Natural Language Processing Analysis of Patient-Generated Texts About Systemic Lupus Erythematosus |
title_short | Incorporating Unstructured Patient Narratives and Health Insurance Claims Data in Pharmacovigilance: Natural Language Processing Analysis of Patient-Generated Texts About Systemic Lupus Erythematosus |
title_sort | incorporating unstructured patient narratives and health insurance claims data in pharmacovigilance: natural language processing analysis of patient-generated texts about systemic lupus erythematosus |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8278300/ https://www.ncbi.nlm.nih.gov/pubmed/34255719 http://dx.doi.org/10.2196/29238 |
work_keys_str_mv | AT matsudashinichi incorporatingunstructuredpatientnarrativesandhealthinsuranceclaimsdatainpharmacovigilancenaturallanguageprocessinganalysisofpatientgeneratedtextsaboutsystemiclupuserythematosus AT ohtomotakumi incorporatingunstructuredpatientnarrativesandhealthinsuranceclaimsdatainpharmacovigilancenaturallanguageprocessinganalysisofpatientgeneratedtextsaboutsystemiclupuserythematosus AT tomizawashiho incorporatingunstructuredpatientnarrativesandhealthinsuranceclaimsdatainpharmacovigilancenaturallanguageprocessinganalysisofpatientgeneratedtextsaboutsystemiclupuserythematosus AT miyanoyuki incorporatingunstructuredpatientnarrativesandhealthinsuranceclaimsdatainpharmacovigilancenaturallanguageprocessinganalysisofpatientgeneratedtextsaboutsystemiclupuserythematosus AT mogimiwako incorporatingunstructuredpatientnarrativesandhealthinsuranceclaimsdatainpharmacovigilancenaturallanguageprocessinganalysisofpatientgeneratedtextsaboutsystemiclupuserythematosus AT kurikihiroshi incorporatingunstructuredpatientnarrativesandhealthinsuranceclaimsdatainpharmacovigilancenaturallanguageprocessinganalysisofpatientgeneratedtextsaboutsystemiclupuserythematosus AT nakayamaterumi incorporatingunstructuredpatientnarrativesandhealthinsuranceclaimsdatainpharmacovigilancenaturallanguageprocessinganalysisofpatientgeneratedtextsaboutsystemiclupuserythematosus AT watanabeshinichi incorporatingunstructuredpatientnarrativesandhealthinsuranceclaimsdatainpharmacovigilancenaturallanguageprocessinganalysisofpatientgeneratedtextsaboutsystemiclupuserythematosus |