Cargando…

Ascertaining Framingham heart failure phenotype from inpatient electronic health record data using natural language processing: a multicentre Atherosclerosis Risk in Communities (ARIC) validation study

OBJECTIVES: Using free-text clinical notes and reports from hospitalised patients, determine the performance of natural language processing (NLP) ascertainment of Framingham heart failure (HF) criteria and phenotype. STUDY DESIGN: A retrospective observational study design of patients hospitalised i...

Descripción completa

Detalles Bibliográficos
Autores principales: Moore, Carlton R, Jain, Saumya, Haas, Stephanie, Yadav, Harish, Whitsel, Eric, Rosamand, Wayne, Heiss, Gerardo, Kucharska-Newton, Anna M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8204176/
https://www.ncbi.nlm.nih.gov/pubmed/34127492
http://dx.doi.org/10.1136/bmjopen-2020-047356
_version_ 1783708302581956608
author Moore, Carlton R
Jain, Saumya
Haas, Stephanie
Yadav, Harish
Whitsel, Eric
Rosamand, Wayne
Heiss, Gerardo
Kucharska-Newton, Anna M
author_facet Moore, Carlton R
Jain, Saumya
Haas, Stephanie
Yadav, Harish
Whitsel, Eric
Rosamand, Wayne
Heiss, Gerardo
Kucharska-Newton, Anna M
author_sort Moore, Carlton R
collection PubMed
description OBJECTIVES: Using free-text clinical notes and reports from hospitalised patients, determine the performance of natural language processing (NLP) ascertainment of Framingham heart failure (HF) criteria and phenotype. STUDY DESIGN: A retrospective observational study design of patients hospitalised in 2015 from four hospitals participating in the Atherosclerosis Risk in Communities (ARIC) study was used to determine NLP performance in the ascertainment of Framingham HF criteria and phenotype. SETTING: Four ARIC study hospitals, each representing an ARIC study region in the USA. PARTICIPANTS: A stratified random sample of hospitalisations identified using a broad range of International Classification of Disease, ninth revision, diagnostic codes indicative of an HF event and occurring during 2015 was drawn for this study. A randomly selected set of 394 hospitalisations was used as the derivation dataset and 406 hospitalisations was used as the validation dataset. INTERVENTION: Use of NLP on free-text clinical notes and reports to ascertain Framingham HF criteria and phenotype. PRIMARY AND SECONDARY OUTCOME MEASURES: NLP performance as measured by sensitivity, specificity, positive-predictive value (PPV) and agreement in ascertainment of Framingham HF criteria and phenotype. Manual medical record review by trained ARIC abstractors was used as the reference standard. RESULTS: Overall, performance of NLP ascertainment of Framingham HF phenotype in the validation dataset was good, with 78.8%, 81.7%, 84.4% and 80.0% for sensitivity, specificity, PPV and agreement, respectively. CONCLUSIONS: By decreasing the need for manual chart review, our results on the use of NLP to ascertain Framingham HF phenotype from free-text electronic health record data suggest that validated NLP technology holds the potential for significantly improving the feasibility and efficiency of conducting large-scale epidemiologic surveillance of HF prevalence and incidence.
format Online
Article
Text
id pubmed-8204176
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-82041762021-06-28 Ascertaining Framingham heart failure phenotype from inpatient electronic health record data using natural language processing: a multicentre Atherosclerosis Risk in Communities (ARIC) validation study Moore, Carlton R Jain, Saumya Haas, Stephanie Yadav, Harish Whitsel, Eric Rosamand, Wayne Heiss, Gerardo Kucharska-Newton, Anna M BMJ Open Cardiovascular Medicine OBJECTIVES: Using free-text clinical notes and reports from hospitalised patients, determine the performance of natural language processing (NLP) ascertainment of Framingham heart failure (HF) criteria and phenotype. STUDY DESIGN: A retrospective observational study design of patients hospitalised in 2015 from four hospitals participating in the Atherosclerosis Risk in Communities (ARIC) study was used to determine NLP performance in the ascertainment of Framingham HF criteria and phenotype. SETTING: Four ARIC study hospitals, each representing an ARIC study region in the USA. PARTICIPANTS: A stratified random sample of hospitalisations identified using a broad range of International Classification of Disease, ninth revision, diagnostic codes indicative of an HF event and occurring during 2015 was drawn for this study. A randomly selected set of 394 hospitalisations was used as the derivation dataset and 406 hospitalisations was used as the validation dataset. INTERVENTION: Use of NLP on free-text clinical notes and reports to ascertain Framingham HF criteria and phenotype. PRIMARY AND SECONDARY OUTCOME MEASURES: NLP performance as measured by sensitivity, specificity, positive-predictive value (PPV) and agreement in ascertainment of Framingham HF criteria and phenotype. Manual medical record review by trained ARIC abstractors was used as the reference standard. RESULTS: Overall, performance of NLP ascertainment of Framingham HF phenotype in the validation dataset was good, with 78.8%, 81.7%, 84.4% and 80.0% for sensitivity, specificity, PPV and agreement, respectively. CONCLUSIONS: By decreasing the need for manual chart review, our results on the use of NLP to ascertain Framingham HF phenotype from free-text electronic health record data suggest that validated NLP technology holds the potential for significantly improving the feasibility and efficiency of conducting large-scale epidemiologic surveillance of HF prevalence and incidence. BMJ Publishing Group 2021-06-14 /pmc/articles/PMC8204176/ /pubmed/34127492 http://dx.doi.org/10.1136/bmjopen-2020-047356 Text en © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Cardiovascular Medicine
Moore, Carlton R
Jain, Saumya
Haas, Stephanie
Yadav, Harish
Whitsel, Eric
Rosamand, Wayne
Heiss, Gerardo
Kucharska-Newton, Anna M
Ascertaining Framingham heart failure phenotype from inpatient electronic health record data using natural language processing: a multicentre Atherosclerosis Risk in Communities (ARIC) validation study
title Ascertaining Framingham heart failure phenotype from inpatient electronic health record data using natural language processing: a multicentre Atherosclerosis Risk in Communities (ARIC) validation study
title_full Ascertaining Framingham heart failure phenotype from inpatient electronic health record data using natural language processing: a multicentre Atherosclerosis Risk in Communities (ARIC) validation study
title_fullStr Ascertaining Framingham heart failure phenotype from inpatient electronic health record data using natural language processing: a multicentre Atherosclerosis Risk in Communities (ARIC) validation study
title_full_unstemmed Ascertaining Framingham heart failure phenotype from inpatient electronic health record data using natural language processing: a multicentre Atherosclerosis Risk in Communities (ARIC) validation study
title_short Ascertaining Framingham heart failure phenotype from inpatient electronic health record data using natural language processing: a multicentre Atherosclerosis Risk in Communities (ARIC) validation study
title_sort ascertaining framingham heart failure phenotype from inpatient electronic health record data using natural language processing: a multicentre atherosclerosis risk in communities (aric) validation study
topic Cardiovascular Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8204176/
https://www.ncbi.nlm.nih.gov/pubmed/34127492
http://dx.doi.org/10.1136/bmjopen-2020-047356
work_keys_str_mv AT moorecarltonr ascertainingframinghamheartfailurephenotypefrominpatientelectronichealthrecorddatausingnaturallanguageprocessingamulticentreatherosclerosisriskincommunitiesaricvalidationstudy
AT jainsaumya ascertainingframinghamheartfailurephenotypefrominpatientelectronichealthrecorddatausingnaturallanguageprocessingamulticentreatherosclerosisriskincommunitiesaricvalidationstudy
AT haasstephanie ascertainingframinghamheartfailurephenotypefrominpatientelectronichealthrecorddatausingnaturallanguageprocessingamulticentreatherosclerosisriskincommunitiesaricvalidationstudy
AT yadavharish ascertainingframinghamheartfailurephenotypefrominpatientelectronichealthrecorddatausingnaturallanguageprocessingamulticentreatherosclerosisriskincommunitiesaricvalidationstudy
AT whitseleric ascertainingframinghamheartfailurephenotypefrominpatientelectronichealthrecorddatausingnaturallanguageprocessingamulticentreatherosclerosisriskincommunitiesaricvalidationstudy
AT rosamandwayne ascertainingframinghamheartfailurephenotypefrominpatientelectronichealthrecorddatausingnaturallanguageprocessingamulticentreatherosclerosisriskincommunitiesaricvalidationstudy
AT heissgerardo ascertainingframinghamheartfailurephenotypefrominpatientelectronichealthrecorddatausingnaturallanguageprocessingamulticentreatherosclerosisriskincommunitiesaricvalidationstudy
AT kucharskanewtonannam ascertainingframinghamheartfailurephenotypefrominpatientelectronichealthrecorddatausingnaturallanguageprocessingamulticentreatherosclerosisriskincommunitiesaricvalidationstudy