Cargando…
Extracting social determinants of health from electronic health records using natural language processing: a systematic review
OBJECTIVE: Social determinants of health (SDoH) are nonclinical dispositions that impact patient health risks and clinical outcomes. Leveraging SDoH in clinical decision-making can potentially improve diagnosis, treatment planning, and patient outcomes. Despite increased interest in capturing SDoH i...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633615/ https://www.ncbi.nlm.nih.gov/pubmed/34613399 http://dx.doi.org/10.1093/jamia/ocab170 |
_version_ | 1784607966935121920 |
---|---|
author | Patra, Braja G Sharma, Mohit M Vekaria, Veer Adekkanattu, Prakash Patterson, Olga V Glicksberg, Benjamin Lepow, Lauren A Ryu, Euijung Biernacka, Joanna M Furmanchuk, Al’ona George, Thomas J Hogan, William Wu, Yonghui Yang, Xi Bian, Jiang Weissman, Myrna Wickramaratne, Priya Mann, J John Olfson, Mark Campion, Thomas R Weiner, Mark Pathak, Jyotishman |
author_facet | Patra, Braja G Sharma, Mohit M Vekaria, Veer Adekkanattu, Prakash Patterson, Olga V Glicksberg, Benjamin Lepow, Lauren A Ryu, Euijung Biernacka, Joanna M Furmanchuk, Al’ona George, Thomas J Hogan, William Wu, Yonghui Yang, Xi Bian, Jiang Weissman, Myrna Wickramaratne, Priya Mann, J John Olfson, Mark Campion, Thomas R Weiner, Mark Pathak, Jyotishman |
author_sort | Patra, Braja G |
collection | PubMed |
description | OBJECTIVE: Social determinants of health (SDoH) are nonclinical dispositions that impact patient health risks and clinical outcomes. Leveraging SDoH in clinical decision-making can potentially improve diagnosis, treatment planning, and patient outcomes. Despite increased interest in capturing SDoH in electronic health records (EHRs), such information is typically locked in unstructured clinical notes. Natural language processing (NLP) is the key technology to extract SDoH information from clinical text and expand its utility in patient care and research. This article presents a systematic review of the state-of-the-art NLP approaches and tools that focus on identifying and extracting SDoH data from unstructured clinical text in EHRs. MATERIALS AND METHODS: A broad literature search was conducted in February 2021 using 3 scholarly databases (ACL Anthology, PubMed, and Scopus) following Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. A total of 6402 publications were initially identified, and after applying the study inclusion criteria, 82 publications were selected for the final review. RESULTS: Smoking status (n = 27), substance use (n = 21), homelessness (n = 20), and alcohol use (n = 15) are the most frequently studied SDoH categories. Homelessness (n = 7) and other less-studied SDoH (eg, education, financial problems, social isolation and support, family problems) are mostly identified using rule-based approaches. In contrast, machine learning approaches are popular for identifying smoking status (n = 13), substance use (n = 9), and alcohol use (n = 9). CONCLUSION: NLP offers significant potential to extract SDoH data from narrative clinical notes, which in turn can aid in the development of screening tools, risk prediction models, and clinical decision support systems. |
format | Online Article Text |
id | pubmed-8633615 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-86336152021-12-01 Extracting social determinants of health from electronic health records using natural language processing: a systematic review Patra, Braja G Sharma, Mohit M Vekaria, Veer Adekkanattu, Prakash Patterson, Olga V Glicksberg, Benjamin Lepow, Lauren A Ryu, Euijung Biernacka, Joanna M Furmanchuk, Al’ona George, Thomas J Hogan, William Wu, Yonghui Yang, Xi Bian, Jiang Weissman, Myrna Wickramaratne, Priya Mann, J John Olfson, Mark Campion, Thomas R Weiner, Mark Pathak, Jyotishman J Am Med Inform Assoc Reviews OBJECTIVE: Social determinants of health (SDoH) are nonclinical dispositions that impact patient health risks and clinical outcomes. Leveraging SDoH in clinical decision-making can potentially improve diagnosis, treatment planning, and patient outcomes. Despite increased interest in capturing SDoH in electronic health records (EHRs), such information is typically locked in unstructured clinical notes. Natural language processing (NLP) is the key technology to extract SDoH information from clinical text and expand its utility in patient care and research. This article presents a systematic review of the state-of-the-art NLP approaches and tools that focus on identifying and extracting SDoH data from unstructured clinical text in EHRs. MATERIALS AND METHODS: A broad literature search was conducted in February 2021 using 3 scholarly databases (ACL Anthology, PubMed, and Scopus) following Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. A total of 6402 publications were initially identified, and after applying the study inclusion criteria, 82 publications were selected for the final review. RESULTS: Smoking status (n = 27), substance use (n = 21), homelessness (n = 20), and alcohol use (n = 15) are the most frequently studied SDoH categories. Homelessness (n = 7) and other less-studied SDoH (eg, education, financial problems, social isolation and support, family problems) are mostly identified using rule-based approaches. In contrast, machine learning approaches are popular for identifying smoking status (n = 13), substance use (n = 9), and alcohol use (n = 9). CONCLUSION: NLP offers significant potential to extract SDoH data from narrative clinical notes, which in turn can aid in the development of screening tools, risk prediction models, and clinical decision support systems. Oxford University Press 2021-10-06 /pmc/articles/PMC8633615/ /pubmed/34613399 http://dx.doi.org/10.1093/jamia/ocab170 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Reviews Patra, Braja G Sharma, Mohit M Vekaria, Veer Adekkanattu, Prakash Patterson, Olga V Glicksberg, Benjamin Lepow, Lauren A Ryu, Euijung Biernacka, Joanna M Furmanchuk, Al’ona George, Thomas J Hogan, William Wu, Yonghui Yang, Xi Bian, Jiang Weissman, Myrna Wickramaratne, Priya Mann, J John Olfson, Mark Campion, Thomas R Weiner, Mark Pathak, Jyotishman Extracting social determinants of health from electronic health records using natural language processing: a systematic review |
title | Extracting social determinants of health from electronic health records using natural language processing: a systematic review |
title_full | Extracting social determinants of health from electronic health records using natural language processing: a systematic review |
title_fullStr | Extracting social determinants of health from electronic health records using natural language processing: a systematic review |
title_full_unstemmed | Extracting social determinants of health from electronic health records using natural language processing: a systematic review |
title_short | Extracting social determinants of health from electronic health records using natural language processing: a systematic review |
title_sort | extracting social determinants of health from electronic health records using natural language processing: a systematic review |
topic | Reviews |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633615/ https://www.ncbi.nlm.nih.gov/pubmed/34613399 http://dx.doi.org/10.1093/jamia/ocab170 |
work_keys_str_mv | AT patrabrajag extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT sharmamohitm extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT vekariaveer extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT adekkanattuprakash extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT pattersonolgav extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT glicksbergbenjamin extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT lepowlaurena extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT ryueuijung extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT biernackajoannam extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT furmanchukalona extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT georgethomasj extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT hoganwilliam extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT wuyonghui extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT yangxi extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT bianjiang extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT weissmanmyrna extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT wickramaratnepriya extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT mannjjohn extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT olfsonmark extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT campionthomasr extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT weinermark extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview AT pathakjyotishman extractingsocialdeterminantsofhealthfromelectronichealthrecordsusingnaturallanguageprocessingasystematicreview |