Cargando…

Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus

OBJECTIVE: Electronic health records (EHR) are increasingly being recognized as a major source of data reusable for medical research and quality monitoring, although patient identification and assessment of symptoms (characterization) remain challenging, especially in complex diseases such as system...

Descripción completa

Detalles Bibliográficos
Autores principales: Brunekreef, Tammo E., Otten, Henny G., van den Bosch, Suzanne C., Hoefer, Imo E., van Laar, Jacob M., Limper, Maarten, Haitjema, Saskia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7882527/
https://www.ncbi.nlm.nih.gov/pubmed/33434395
http://dx.doi.org/10.1002/acr2.11211
_version_ 1783651067219673088
author Brunekreef, Tammo E.
Otten, Henny G.
van den Bosch, Suzanne C.
Hoefer, Imo E.
van Laar, Jacob M.
Limper, Maarten
Haitjema, Saskia
author_facet Brunekreef, Tammo E.
Otten, Henny G.
van den Bosch, Suzanne C.
Hoefer, Imo E.
van Laar, Jacob M.
Limper, Maarten
Haitjema, Saskia
author_sort Brunekreef, Tammo E.
collection PubMed
description OBJECTIVE: Electronic health records (EHR) are increasingly being recognized as a major source of data reusable for medical research and quality monitoring, although patient identification and assessment of symptoms (characterization) remain challenging, especially in complex diseases such as systemic lupus erythematosus (SLE). Current coding systems are unable to assess information recorded in the physician’s free‐text notes. This study shows that text mining can be used as a reliable alternative. METHODS: In a multidisciplinary research team of data scientists and medical experts, a text mining algorithm on 4607 patient records was developed to assess the diagnosis of 14 different immune‐mediated inflammatory diseases and the presence of 18 different symptoms in the EHR. The text mining algorithm included key words in the EHR, while mining the context for exclusion phrases. The accuracy of the text mining algorithm was assessed by manually checking the EHR of 100 random patients suspected of having SLE for diagnoses and symptoms and comparing the outcome with the outcome of the text mining algorithm. RESULTS: After evaluation of 100 patient records, the text mining algorithm had a sensitivity of 96.4% and a specificity of 93.3% in assessing the presence of SLE. The algorithm detected potentially life‐threatening symptoms (nephritis, pleuritis) with good sensitivity (80%‐82%) and high specificity (97%‐97%). CONCLUSION: We present a text mining algorithm that can accurately identify and characterize patients with SLE using routinely collected data from the EHR. Our study shows that using text mining, data from the EHR can be reused in research and quality control.
format Online
Article
Text
id pubmed-7882527
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-78825272021-02-19 Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus Brunekreef, Tammo E. Otten, Henny G. van den Bosch, Suzanne C. Hoefer, Imo E. van Laar, Jacob M. Limper, Maarten Haitjema, Saskia ACR Open Rheumatol Original Articles OBJECTIVE: Electronic health records (EHR) are increasingly being recognized as a major source of data reusable for medical research and quality monitoring, although patient identification and assessment of symptoms (characterization) remain challenging, especially in complex diseases such as systemic lupus erythematosus (SLE). Current coding systems are unable to assess information recorded in the physician’s free‐text notes. This study shows that text mining can be used as a reliable alternative. METHODS: In a multidisciplinary research team of data scientists and medical experts, a text mining algorithm on 4607 patient records was developed to assess the diagnosis of 14 different immune‐mediated inflammatory diseases and the presence of 18 different symptoms in the EHR. The text mining algorithm included key words in the EHR, while mining the context for exclusion phrases. The accuracy of the text mining algorithm was assessed by manually checking the EHR of 100 random patients suspected of having SLE for diagnoses and symptoms and comparing the outcome with the outcome of the text mining algorithm. RESULTS: After evaluation of 100 patient records, the text mining algorithm had a sensitivity of 96.4% and a specificity of 93.3% in assessing the presence of SLE. The algorithm detected potentially life‐threatening symptoms (nephritis, pleuritis) with good sensitivity (80%‐82%) and high specificity (97%‐97%). CONCLUSION: We present a text mining algorithm that can accurately identify and characterize patients with SLE using routinely collected data from the EHR. Our study shows that using text mining, data from the EHR can be reused in research and quality control. John Wiley and Sons Inc. 2021-01-12 /pmc/articles/PMC7882527/ /pubmed/33434395 http://dx.doi.org/10.1002/acr2.11211 Text en © 2021 The Authors. ACR Open Rheumatology published by Wiley Periodicals LLC on behalf of American College of Rheumatology. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Original Articles
Brunekreef, Tammo E.
Otten, Henny G.
van den Bosch, Suzanne C.
Hoefer, Imo E.
van Laar, Jacob M.
Limper, Maarten
Haitjema, Saskia
Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus
title Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus
title_full Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus
title_fullStr Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus
title_full_unstemmed Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus
title_short Text Mining of Electronic Health Records Can Accurately Identify and Characterize Patients With Systemic Lupus Erythematosus
title_sort text mining of electronic health records can accurately identify and characterize patients with systemic lupus erythematosus
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7882527/
https://www.ncbi.nlm.nih.gov/pubmed/33434395
http://dx.doi.org/10.1002/acr2.11211
work_keys_str_mv AT brunekreeftammoe textminingofelectronichealthrecordscanaccuratelyidentifyandcharacterizepatientswithsystemiclupuserythematosus
AT ottenhennyg textminingofelectronichealthrecordscanaccuratelyidentifyandcharacterizepatientswithsystemiclupuserythematosus
AT vandenboschsuzannec textminingofelectronichealthrecordscanaccuratelyidentifyandcharacterizepatientswithsystemiclupuserythematosus
AT hoeferimoe textminingofelectronichealthrecordscanaccuratelyidentifyandcharacterizepatientswithsystemiclupuserythematosus
AT vanlaarjacobm textminingofelectronichealthrecordscanaccuratelyidentifyandcharacterizepatientswithsystemiclupuserythematosus
AT limpermaarten textminingofelectronichealthrecordscanaccuratelyidentifyandcharacterizepatientswithsystemiclupuserythematosus
AT haitjemasaskia textminingofelectronichealthrecordscanaccuratelyidentifyandcharacterizepatientswithsystemiclupuserythematosus