Cargando…
Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study
Introduction: Despite the growing efforts to standardize coding for social determinants of health (SDOH), they are infrequently captured in electronic health records (EHRs). Most SDOH variables are still captured in the unstructured fields (i.e., free-text) of EHRs. In this study we attempt to evalu...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8429931/ https://www.ncbi.nlm.nih.gov/pubmed/34513783 http://dx.doi.org/10.3389/fpubh.2021.697501 |
_version_ | 1783750638310522880 |
---|---|
author | Hatef, Elham Singh Deol, Gurmehar Rouhizadeh, Masoud Li, Ashley Eibensteiner, Katyusha Monsen, Craig B. Bratslaver, Roman Senese, Margaret Kharrazi, Hadi |
author_facet | Hatef, Elham Singh Deol, Gurmehar Rouhizadeh, Masoud Li, Ashley Eibensteiner, Katyusha Monsen, Craig B. Bratslaver, Roman Senese, Margaret Kharrazi, Hadi |
author_sort | Hatef, Elham |
collection | PubMed |
description | Introduction: Despite the growing efforts to standardize coding for social determinants of health (SDOH), they are infrequently captured in electronic health records (EHRs). Most SDOH variables are still captured in the unstructured fields (i.e., free-text) of EHRs. In this study we attempt to evaluate a practical text mining approach (i.e., advanced pattern matching techniques) in identifying phrases referring to housing issues, an important SDOH domain affecting value-based healthcare providers, using EHR of a large multispecialty medical group in the New England region, United States. To present how this approach would help the health systems to address the SDOH challenges of their patients we assess the demographic and clinical characteristics of patients with and without housing issues and briefly look into the patterns of healthcare utilization among the study population and for those with and without housing challenges. Methods: We identified five categories of housing issues [i.e., homelessness current (HC), homelessness history (HH), homelessness addressed (HA), housing instability (HI), and building quality (BQ)] and developed several phrases addressing each one through collaboration with SDOH experts, consulting the literature, and reviewing existing coding standards. We developed pattern-matching algorithms (i.e., advanced regular expressions), and then applied them in the selected EHR. We assessed the text mining approach for recall (sensitivity) and precision (positive predictive value) after comparing the identified phrases with manually annotated free-text for different housing issues. Results: The study dataset included EHR structured data for a total of 20,342 patients and 2,564,344 free-text clinical notes. The mean (SD) age in the study population was 75.96 (7.51). Additionally, 58.78% of the cohort were female. BQ and HI were the most frequent housing issues documented in EHR free-text notes and HH was the least frequent one. The regular expression methodology, when compared to manual annotation, had a high level of precision (positive predictive value) at phrase, note, and patient levels (96.36, 95.00, and 94.44%, respectively) across different categories of housing issues, but the recall (sensitivity) rate was relatively low (30.11, 32.20, and 41.46%, respectively). Conclusion: Results of this study can be used to advance the research in this domain, to assess the potential value of EHR's free-text in identifying patients with a high risk of housing issues, to improve patient care and outcomes, and to eventually mitigate socioeconomic disparities across individuals and communities. |
format | Online Article Text |
id | pubmed-8429931 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-84299312021-09-11 Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study Hatef, Elham Singh Deol, Gurmehar Rouhizadeh, Masoud Li, Ashley Eibensteiner, Katyusha Monsen, Craig B. Bratslaver, Roman Senese, Margaret Kharrazi, Hadi Front Public Health Public Health Introduction: Despite the growing efforts to standardize coding for social determinants of health (SDOH), they are infrequently captured in electronic health records (EHRs). Most SDOH variables are still captured in the unstructured fields (i.e., free-text) of EHRs. In this study we attempt to evaluate a practical text mining approach (i.e., advanced pattern matching techniques) in identifying phrases referring to housing issues, an important SDOH domain affecting value-based healthcare providers, using EHR of a large multispecialty medical group in the New England region, United States. To present how this approach would help the health systems to address the SDOH challenges of their patients we assess the demographic and clinical characteristics of patients with and without housing issues and briefly look into the patterns of healthcare utilization among the study population and for those with and without housing challenges. Methods: We identified five categories of housing issues [i.e., homelessness current (HC), homelessness history (HH), homelessness addressed (HA), housing instability (HI), and building quality (BQ)] and developed several phrases addressing each one through collaboration with SDOH experts, consulting the literature, and reviewing existing coding standards. We developed pattern-matching algorithms (i.e., advanced regular expressions), and then applied them in the selected EHR. We assessed the text mining approach for recall (sensitivity) and precision (positive predictive value) after comparing the identified phrases with manually annotated free-text for different housing issues. Results: The study dataset included EHR structured data for a total of 20,342 patients and 2,564,344 free-text clinical notes. The mean (SD) age in the study population was 75.96 (7.51). Additionally, 58.78% of the cohort were female. BQ and HI were the most frequent housing issues documented in EHR free-text notes and HH was the least frequent one. The regular expression methodology, when compared to manual annotation, had a high level of precision (positive predictive value) at phrase, note, and patient levels (96.36, 95.00, and 94.44%, respectively) across different categories of housing issues, but the recall (sensitivity) rate was relatively low (30.11, 32.20, and 41.46%, respectively). Conclusion: Results of this study can be used to advance the research in this domain, to assess the potential value of EHR's free-text in identifying patients with a high risk of housing issues, to improve patient care and outcomes, and to eventually mitigate socioeconomic disparities across individuals and communities. Frontiers Media S.A. 2021-08-27 /pmc/articles/PMC8429931/ /pubmed/34513783 http://dx.doi.org/10.3389/fpubh.2021.697501 Text en Copyright © 2021 Hatef, Singh Deol, Rouhizadeh, Li, Eibensteiner, Monsen, Bratslaver, Senese and Kharrazi. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Public Health Hatef, Elham Singh Deol, Gurmehar Rouhizadeh, Masoud Li, Ashley Eibensteiner, Katyusha Monsen, Craig B. Bratslaver, Roman Senese, Margaret Kharrazi, Hadi Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study |
title | Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study |
title_full | Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study |
title_fullStr | Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study |
title_full_unstemmed | Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study |
title_short | Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study |
title_sort | measuring the value of a practical text mining approach to identify patients with housing issues in the free-text notes in electronic health record: findings of a retrospective cohort study |
topic | Public Health |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8429931/ https://www.ncbi.nlm.nih.gov/pubmed/34513783 http://dx.doi.org/10.3389/fpubh.2021.697501 |
work_keys_str_mv | AT hatefelham measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy AT singhdeolgurmehar measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy AT rouhizadehmasoud measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy AT liashley measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy AT eibensteinerkatyusha measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy AT monsencraigb measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy AT bratslaverroman measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy AT senesemargaret measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy AT kharrazihadi measuringthevalueofapracticaltextminingapproachtoidentifypatientswithhousingissuesinthefreetextnotesinelectronichealthrecordfindingsofaretrospectivecohortstudy |