Cargando…

Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source

BACKGROUND: Approximately 20% of deaths in the US each year are attributable to smoking, yet current practices in the recording of this health risk in electronic health records (EHRs) have not led to discernable changes in health outcomes. Several groups have developed algorithms for extracting smok...

Descripción completa

Detalles Bibliográficos
Autores principales: Palmer, Ellen L., Higgins, John, Hassanpour, Saeed, Sargent, James, Robinson, Christina M., Doherty, Jennifer A., Onega, Tracy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6657182/
https://www.ncbi.nlm.nih.gov/pubmed/31345210
http://dx.doi.org/10.1186/s12911-019-0864-2
_version_ 1783438762536075264
author Palmer, Ellen L.
Higgins, John
Hassanpour, Saeed
Sargent, James
Robinson, Christina M.
Doherty, Jennifer A.
Onega, Tracy
author_facet Palmer, Ellen L.
Higgins, John
Hassanpour, Saeed
Sargent, James
Robinson, Christina M.
Doherty, Jennifer A.
Onega, Tracy
author_sort Palmer, Ellen L.
collection PubMed
description BACKGROUND: Approximately 20% of deaths in the US each year are attributable to smoking, yet current practices in the recording of this health risk in electronic health records (EHRs) have not led to discernable changes in health outcomes. Several groups have developed algorithms for extracting smoking behaviors from clinical notes, but none of these approaches were assessed with external data to report on anticipated clinical performance. METHODS: Previously, we developed an informatics pipeline that extracts smoking status, pack year history, and cessation date from clinical notes. Here we report on the clinical implementation performance of our pipeline using 1,504 clinical notes matched to an external questionnaire. RESULTS: We found that 73% of available notes contained no smoking behavior information. The weighted Cohen’s kappa between the external questionnaire and EHR smoking status was 0.62 (95% CI 0.56–0.69) for the clinical notes we were able to extract information from. The correlation between pack years reported by our pipeline and the external questionnaire was 0.39 on the 81 notes for which this information was present in both. We also assessed for lung cancer screening eligibility using notes from individuals identified as never smokers or smokers with pack year history extracted by our pipeline (n = 196). We found a positive predictive value of 85.4%, a negative predictive value of 83.8%, sensitivity of 63.1%, and specificity of 94.7%. CONCLUSIONS: We have demonstrated that our pipeline can extract smoking behaviors from unannotated EHR notes when the information is present. This information is reliable enough to identify patients most likely to be eligible for smoking related services. Ensuring capture of smoking information during clinical encounters should continue to be a high priority.
format Online
Article
Text
id pubmed-6657182
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66571822019-07-31 Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source Palmer, Ellen L. Higgins, John Hassanpour, Saeed Sargent, James Robinson, Christina M. Doherty, Jennifer A. Onega, Tracy BMC Med Inform Decis Mak Research Article BACKGROUND: Approximately 20% of deaths in the US each year are attributable to smoking, yet current practices in the recording of this health risk in electronic health records (EHRs) have not led to discernable changes in health outcomes. Several groups have developed algorithms for extracting smoking behaviors from clinical notes, but none of these approaches were assessed with external data to report on anticipated clinical performance. METHODS: Previously, we developed an informatics pipeline that extracts smoking status, pack year history, and cessation date from clinical notes. Here we report on the clinical implementation performance of our pipeline using 1,504 clinical notes matched to an external questionnaire. RESULTS: We found that 73% of available notes contained no smoking behavior information. The weighted Cohen’s kappa between the external questionnaire and EHR smoking status was 0.62 (95% CI 0.56–0.69) for the clinical notes we were able to extract information from. The correlation between pack years reported by our pipeline and the external questionnaire was 0.39 on the 81 notes for which this information was present in both. We also assessed for lung cancer screening eligibility using notes from individuals identified as never smokers or smokers with pack year history extracted by our pipeline (n = 196). We found a positive predictive value of 85.4%, a negative predictive value of 83.8%, sensitivity of 63.1%, and specificity of 94.7%. CONCLUSIONS: We have demonstrated that our pipeline can extract smoking behaviors from unannotated EHR notes when the information is present. This information is reliable enough to identify patients most likely to be eligible for smoking related services. Ensuring capture of smoking information during clinical encounters should continue to be a high priority. BioMed Central 2019-07-25 /pmc/articles/PMC6657182/ /pubmed/31345210 http://dx.doi.org/10.1186/s12911-019-0864-2 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Palmer, Ellen L.
Higgins, John
Hassanpour, Saeed
Sargent, James
Robinson, Christina M.
Doherty, Jennifer A.
Onega, Tracy
Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source
title Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source
title_full Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source
title_fullStr Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source
title_full_unstemmed Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source
title_short Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source
title_sort assessing data availability and quality within an electronic health record system through external validation against an external clinical data source
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6657182/
https://www.ncbi.nlm.nih.gov/pubmed/31345210
http://dx.doi.org/10.1186/s12911-019-0864-2
work_keys_str_mv AT palmerellenl assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource
AT higginsjohn assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource
AT hassanpoursaeed assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource
AT sargentjames assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource
AT robinsonchristinam assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource
AT dohertyjennifera assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource
AT onegatracy assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource