Cargando…

Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study

Introduction: Despite the growing efforts to standardize coding for social determinants of health (SDOH), they are infrequently captured in electronic health records (EHRs). Most SDOH variables are still captured in the unstructured fields (i.e., free-text) of EHRs. In this study we attempt to evalu...

Descripción completa

Detalles Bibliográficos
Autores principales: Hatef, Elham, Singh Deol, Gurmehar, Rouhizadeh, Masoud, Li, Ashley, Eibensteiner, Katyusha, Monsen, Craig B., Bratslaver, Roman, Senese, Margaret, Kharrazi, Hadi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8429931/
https://www.ncbi.nlm.nih.gov/pubmed/34513783
http://dx.doi.org/10.3389/fpubh.2021.697501
_version_ 1783750638310522880
author Hatef, Elham
Singh Deol, Gurmehar
Rouhizadeh, Masoud
Li, Ashley
Eibensteiner, Katyusha
Monsen, Craig B.
Bratslaver, Roman
Senese, Margaret
Kharrazi, Hadi
author_facet Hatef, Elham
Singh Deol, Gurmehar
Rouhizadeh, Masoud
Li, Ashley
Eibensteiner, Katyusha
Monsen, Craig B.
Bratslaver, Roman
Senese, Margaret
Kharrazi, Hadi
author_sort Hatef, Elham
collection PubMed
description Introduction: Despite the growing efforts to standardize coding for social determinants of health (SDOH), they are infrequently captured in electronic health records (EHRs). Most SDOH variables are still captured in the unstructured fields (i.e., free-text) of EHRs. In this study we attempt to evaluate a practical text mining approach (i.e., advanced pattern matching techniques) in identifying phrases referring to housing issues, an important SDOH domain affecting value-based healthcare providers, using EHR of a large multispecialty medical group in the New England region, United States. To present how this approach would help the health systems to address the SDOH challenges of their patients we assess the demographic and clinical characteristics of patients with and without housing issues and briefly look into the patterns of healthcare utilization among the study population and for those with and without housing challenges. Methods: We identified five categories of housing issues [i.e., homelessness current (HC), homelessness history (HH), homelessness addressed (HA), housing instability (HI), and building quality (BQ)] and developed several phrases addressing each one through collaboration with SDOH experts, consulting the literature, and reviewing existing coding standards. We developed pattern-matching algorithms (i.e., advanced regular expressions), and then applied them in the selected EHR. We assessed the text mining approach for recall (sensitivity) and precision (positive predictive value) after comparing the identified phrases with manually annotated free-text for different housing issues. Results: The study dataset included EHR structured data for a total of 20,342 patients and 2,564,344 free-text clinical notes. The mean (SD) age in the study population was 75.96 (7.51). Additionally, 58.78% of the cohort were female. BQ and HI were the most frequent housing issues documented in EHR free-text notes and HH was the least frequent one. The regular expression methodology, when compared to manual annotation, had a high level of precision (positive predictive value) at phrase, note, and patient levels (96.36, 95.00, and 94.44%, respectively) across different categories of housing issues, but the recall (sensitivity) rate was relatively low (30.11, 32.20, and 41.46%, respectively). Conclusion: Results of this study can be used to advance the research in this domain, to assess the potential value of EHR's free-text in identifying patients with a high risk of housing issues, to improve patient care and outcomes, and to eventually mitigate socioeconomic disparities across individuals and communities.
format Online
Article
Text
id pubmed-8429931
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84299312021-09-11 Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study Hatef, Elham Singh Deol, Gurmehar Rouhizadeh, Masoud Li, Ashley Eibensteiner, Katyusha Monsen, Craig B. Bratslaver, Roman Senese, Margaret Kharrazi, Hadi Front Public Health Public Health Introduction: Despite the growing efforts to standardize coding for social determinants of health (SDOH), they are infrequently captured in electronic health records (EHRs). Most SDOH variables are still captured in the unstructured fields (i.e., free-text) of EHRs. In this study we attempt to evaluate a practical text mining approach (i.e., advanced pattern matching techniques) in identifying phrases referring to housing issues, an important SDOH domain affecting value-based healthcare providers, using EHR of a large multispecialty medical group in the New England region, United States. To present how this approach would help the health systems to address the SDOH challenges of their patients we assess the demographic and clinical characteristics of patients with and without housing issues and briefly look into the patterns of healthcare utilization among the study population and for those with and without housing challenges. Methods: We identified five categories of housing issues [i.e., homelessness current (HC), homelessness history (HH), homelessness addressed (HA), housing instability (HI), and building quality (BQ)] and developed several phrases addressing each one through collaboration with SDOH experts, consulting the literature, and reviewing existing coding standards. We developed pattern-matching algorithms (i.e., advanced regular expressions), and then applied them in the selected EHR. We assessed the text mining approach for recall (sensitivity) and precision (positive predictive value) after comparing the identified phrases with manually annotated free-text for different housing issues. Results: The study dataset included EHR structured data for a total of 20,342 patients and 2,564,344 free-text clinical notes. The mean (SD) age in the study population was 75.96 (7.51). Additionally, 58.78% of the cohort were female. BQ and HI were the most frequent housing issues documented in EHR free-text notes and HH was the least frequent one. The regular expression methodology, when compared to manual annotation, had a high level of precision (positive predictive value) at phrase, note, and patient levels (96.36, 95.00, and 94.44%, respectively) across different categories of housing issues, but the recall (sensitivity) rate was relatively low (30.11, 32.20, and 41.46%, respectively). Conclusion: Results of this study can be used to advance the research in this domain, to assess the potential value of EHR's free-text in identifying patients with a high risk of housing issues, to improve patient care and outcomes, and to eventually mitigate socioeconomic disparities across individuals and communities. Frontiers Media S.A. 2021-08-27 /pmc/articles/PMC8429931/ /pubmed/34513783 http://dx.doi.org/10.3389/fpubh.2021.697501 Text en Copyright © 2021 Hatef, Singh Deol, Rouhizadeh, Li, Eibensteiner, Monsen, Bratslaver, Senese and Kharrazi. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Hatef, Elham
Singh Deol, Gurmehar
Rouhizadeh, Masoud
Li, Ashley
Eibensteiner, Katyusha
Monsen, Craig B.
Bratslaver, Roman
Senese, Margaret
Kharrazi, Hadi
Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study
title Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study
title_full Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study
title_fullStr Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study
title_full_unstemmed Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study
title_short Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study
title_sort measuring the value of a practical text mining approach to identify patients with housing issues in the free-text notes in electronic health record: findings of a retrospective cohort study
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8429931/
https://www.ncbi.nlm.nih.gov/pubmed/34513783
http://dx.doi.org/10.3389/fpubh.2021.697501
work_keys_str_mv AT hatefelham measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy
AT singhdeolgurmehar measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy
AT rouhizadehmasoud measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy
AT liashley measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy
AT eibensteinerkatyusha measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy
AT monsencraigb measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy
AT bratslaverroman measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy
AT senesemargaret measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy
AT kharrazihadi measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy