Cargando…

Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records

OBJECTIVE: To develop a natural language processing (NLP) algorithm that identifies social determinants of health (SDoH), including housing, transportation, food, and medication insecurities, social isolation, abuse, neglect, or exploitation, and financial difficulties for patients with Alzheimer�...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Wenbo, Holkeboer, Kaes J., Kolawole, Temidun O., Carbone, Lorrie, Mahmoudi, Elham
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Blackwell Publishing Ltd 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10622277/
https://www.ncbi.nlm.nih.gov/pubmed/37534741
http://dx.doi.org/10.1111/1475-6773.14210
_version_ 1785130501753798656
author Wu, Wenbo
Holkeboer, Kaes J.
Kolawole, Temidun O.
Carbone, Lorrie
Mahmoudi, Elham
author_facet Wu, Wenbo
Holkeboer, Kaes J.
Kolawole, Temidun O.
Carbone, Lorrie
Mahmoudi, Elham
author_sort Wu, Wenbo
collection PubMed
description OBJECTIVE: To develop a natural language processing (NLP) algorithm that identifies social determinants of health (SDoH), including housing, transportation, food, and medication insecurities, social isolation, abuse, neglect, or exploitation, and financial difficulties for patients with Alzheimer's disease and related dementias (ADRD) from unstructured electronic health records (EHRs). DATA SOURCES AND STUDY SETTING: We leveraged 1000 medical notes randomly selected from 7401 emergency department and inpatient social worker notes generated between 2015 and 2019 for 231 unique patients diagnosed with ADRD at Michigan Medicine. STUDY DESIGN: We developed a rule‐based NLP algorithm for the identification of seven domains of SDoH noted above. We also compared the rule‐based algorithm with deep learning and regularized logistic regression approaches. These models were compared using accuracy, sensitivity, specificity, F1 score, and the area under the receiver operating characteristic curve (AUC). All notes were split into 700 notes for training NLP algorithms, and 300 notes for validation. DATA COLLECTION/EXTRACTION METHODS: Social worker notes used in this study were extracted from the Michigan Medicine EHR database. PRINCIPAL FINDINGS: Of the 700 notes for training, F1 and AUC for the rule‐based algorithm were at least 0.94 and 0.95, respectively, for all SDoH categories. Of the 300 notes for validation, F1 and AUC were at least 0.80 and 0.97, respectively, for all SDoH except housing and medication insecurities. The deep learning and regularized logistic regression algorithms had unsatisfactory performance. CONCLUSIONS: The rule‐based algorithm can accurately extract SDoH information in all seven domains of SDoH except housing and medication insecurities. Findings from the algorithm can be used by clinicians and social workers to proactively address social needs of patients with ADRD and other vulnerable patient populations.
format Online
Article
Text
id pubmed-10622277
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Blackwell Publishing Ltd
record_format MEDLINE/PubMed
spelling pubmed-106222772023-11-04 Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records Wu, Wenbo Holkeboer, Kaes J. Kolawole, Temidun O. Carbone, Lorrie Mahmoudi, Elham Health Serv Res Methods Corner OBJECTIVE: To develop a natural language processing (NLP) algorithm that identifies social determinants of health (SDoH), including housing, transportation, food, and medication insecurities, social isolation, abuse, neglect, or exploitation, and financial difficulties for patients with Alzheimer's disease and related dementias (ADRD) from unstructured electronic health records (EHRs). DATA SOURCES AND STUDY SETTING: We leveraged 1000 medical notes randomly selected from 7401 emergency department and inpatient social worker notes generated between 2015 and 2019 for 231 unique patients diagnosed with ADRD at Michigan Medicine. STUDY DESIGN: We developed a rule‐based NLP algorithm for the identification of seven domains of SDoH noted above. We also compared the rule‐based algorithm with deep learning and regularized logistic regression approaches. These models were compared using accuracy, sensitivity, specificity, F1 score, and the area under the receiver operating characteristic curve (AUC). All notes were split into 700 notes for training NLP algorithms, and 300 notes for validation. DATA COLLECTION/EXTRACTION METHODS: Social worker notes used in this study were extracted from the Michigan Medicine EHR database. PRINCIPAL FINDINGS: Of the 700 notes for training, F1 and AUC for the rule‐based algorithm were at least 0.94 and 0.95, respectively, for all SDoH categories. Of the 300 notes for validation, F1 and AUC were at least 0.80 and 0.97, respectively, for all SDoH except housing and medication insecurities. The deep learning and regularized logistic regression algorithms had unsatisfactory performance. CONCLUSIONS: The rule‐based algorithm can accurately extract SDoH information in all seven domains of SDoH except housing and medication insecurities. Findings from the algorithm can be used by clinicians and social workers to proactively address social needs of patients with ADRD and other vulnerable patient populations. Blackwell Publishing Ltd 2023-08-03 2023-12 /pmc/articles/PMC10622277/ /pubmed/37534741 http://dx.doi.org/10.1111/1475-6773.14210 Text en © 2023 The Authors. Health Services Research published by Wiley Periodicals LLC on behalf of Health Research and Educational Trust. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Methods Corner
Wu, Wenbo
Holkeboer, Kaes J.
Kolawole, Temidun O.
Carbone, Lorrie
Mahmoudi, Elham
Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records
title Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records
title_full Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records
title_fullStr Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records
title_full_unstemmed Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records
title_short Natural language processing to identify social determinants of health in Alzheimer's disease and related dementia from electronic health records
title_sort natural language processing to identify social determinants of health in alzheimer's disease and related dementia from electronic health records
topic Methods Corner
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10622277/
https://www.ncbi.nlm.nih.gov/pubmed/37534741
http://dx.doi.org/10.1111/1475-6773.14210
work_keys_str_mv AT wuwenbo naturallanguageprocessingtoidentifysocialdeterminantsofhealthinalzheimersdiseaseandrelateddementiafromelectronichealthrecords
AT holkeboerkaesj naturallanguageprocessingtoidentifysocialdeterminantsofhealthinalzheimersdiseaseandrelateddementiafromelectronichealthrecords
AT kolawoletemiduno naturallanguageprocessingtoidentifysocialdeterminantsofhealthinalzheimersdiseaseandrelateddementiafromelectronichealthrecords
AT carbonelorrie naturallanguageprocessingtoidentifysocialdeterminantsofhealthinalzheimersdiseaseandrelateddementiafromelectronichealthrecords
AT mahmoudielham naturallanguageprocessingtoidentifysocialdeterminantsofhealthinalzheimersdiseaseandrelateddementiafromelectronichealthrecords