Cargando…
Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source
BACKGROUND: Approximately 20% of deaths in the US each year are attributable to smoking, yet current practices in the recording of this health risk in electronic health records (EHRs) have not led to discernable changes in health outcomes. Several groups have developed algorithms for extracting smok...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6657182/ https://www.ncbi.nlm.nih.gov/pubmed/31345210 http://dx.doi.org/10.1186/s12911-019-0864-2 |
_version_ | 1783438762536075264 |
---|---|
author | Palmer, Ellen L. Higgins, John Hassanpour, Saeed Sargent, James Robinson, Christina M. Doherty, Jennifer A. Onega, Tracy |
author_facet | Palmer, Ellen L. Higgins, John Hassanpour, Saeed Sargent, James Robinson, Christina M. Doherty, Jennifer A. Onega, Tracy |
author_sort | Palmer, Ellen L. |
collection | PubMed |
description | BACKGROUND: Approximately 20% of deaths in the US each year are attributable to smoking, yet current practices in the recording of this health risk in electronic health records (EHRs) have not led to discernable changes in health outcomes. Several groups have developed algorithms for extracting smoking behaviors from clinical notes, but none of these approaches were assessed with external data to report on anticipated clinical performance. METHODS: Previously, we developed an informatics pipeline that extracts smoking status, pack year history, and cessation date from clinical notes. Here we report on the clinical implementation performance of our pipeline using 1,504 clinical notes matched to an external questionnaire. RESULTS: We found that 73% of available notes contained no smoking behavior information. The weighted Cohen’s kappa between the external questionnaire and EHR smoking status was 0.62 (95% CI 0.56–0.69) for the clinical notes we were able to extract information from. The correlation between pack years reported by our pipeline and the external questionnaire was 0.39 on the 81 notes for which this information was present in both. We also assessed for lung cancer screening eligibility using notes from individuals identified as never smokers or smokers with pack year history extracted by our pipeline (n = 196). We found a positive predictive value of 85.4%, a negative predictive value of 83.8%, sensitivity of 63.1%, and specificity of 94.7%. CONCLUSIONS: We have demonstrated that our pipeline can extract smoking behaviors from unannotated EHR notes when the information is present. This information is reliable enough to identify patients most likely to be eligible for smoking related services. Ensuring capture of smoking information during clinical encounters should continue to be a high priority. |
format | Online Article Text |
id | pubmed-6657182 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-66571822019-07-31 Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source Palmer, Ellen L. Higgins, John Hassanpour, Saeed Sargent, James Robinson, Christina M. Doherty, Jennifer A. Onega, Tracy BMC Med Inform Decis Mak Research Article BACKGROUND: Approximately 20% of deaths in the US each year are attributable to smoking, yet current practices in the recording of this health risk in electronic health records (EHRs) have not led to discernable changes in health outcomes. Several groups have developed algorithms for extracting smoking behaviors from clinical notes, but none of these approaches were assessed with external data to report on anticipated clinical performance. METHODS: Previously, we developed an informatics pipeline that extracts smoking status, pack year history, and cessation date from clinical notes. Here we report on the clinical implementation performance of our pipeline using 1,504 clinical notes matched to an external questionnaire. RESULTS: We found that 73% of available notes contained no smoking behavior information. The weighted Cohen’s kappa between the external questionnaire and EHR smoking status was 0.62 (95% CI 0.56–0.69) for the clinical notes we were able to extract information from. The correlation between pack years reported by our pipeline and the external questionnaire was 0.39 on the 81 notes for which this information was present in both. We also assessed for lung cancer screening eligibility using notes from individuals identified as never smokers or smokers with pack year history extracted by our pipeline (n = 196). We found a positive predictive value of 85.4%, a negative predictive value of 83.8%, sensitivity of 63.1%, and specificity of 94.7%. CONCLUSIONS: We have demonstrated that our pipeline can extract smoking behaviors from unannotated EHR notes when the information is present. This information is reliable enough to identify patients most likely to be eligible for smoking related services. Ensuring capture of smoking information during clinical encounters should continue to be a high priority. BioMed Central 2019-07-25 /pmc/articles/PMC6657182/ /pubmed/31345210 http://dx.doi.org/10.1186/s12911-019-0864-2 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Palmer, Ellen L. Higgins, John Hassanpour, Saeed Sargent, James Robinson, Christina M. Doherty, Jennifer A. Onega, Tracy Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source |
title | Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source |
title_full | Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source |
title_fullStr | Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source |
title_full_unstemmed | Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source |
title_short | Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source |
title_sort | assessing data availability and quality within an electronic health record system through external validation against an external clinical data source |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6657182/ https://www.ncbi.nlm.nih.gov/pubmed/31345210 http://dx.doi.org/10.1186/s12911-019-0864-2 |
work_keys_str_mv | AT palmerellenl assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource AT higginsjohn assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource AT hassanpoursaeed assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource AT sargentjames assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource AT robinsonchristinam assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource AT dohertyjennifera assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource AT onegatracy assessingdataavailabilityandqualitywithinanelectronichealthrecordsystemthroughexternalvalidationagainstanexternalclinicaldatasource |