Cargando…
Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease
OBJECTIVE: To determine whether natural language processing (NLP) of unstructured medical text can improve identification of ASCVD patients not using high-intensity statin therapy (HIST) due to statin-associated side effects (SASEs) and other reasons. METHODS: Reviewers annotated reasons for not pre...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8671496/ https://www.ncbi.nlm.nih.gov/pubmed/34950914 http://dx.doi.org/10.1016/j.ajpc.2021.100300 |
_version_ | 1784615150105395200 |
---|---|
author | Gobbel, Glenn T. Matheny, Michael E. Reeves, Ruth R. Akeroyd, Julia M. Turchin, Alexander Ballantyne, Christie M. Petersen, Laura A. Virani, Salim S. |
author_facet | Gobbel, Glenn T. Matheny, Michael E. Reeves, Ruth R. Akeroyd, Julia M. Turchin, Alexander Ballantyne, Christie M. Petersen, Laura A. Virani, Salim S. |
author_sort | Gobbel, Glenn T. |
collection | PubMed |
description | OBJECTIVE: To determine whether natural language processing (NLP) of unstructured medical text can improve identification of ASCVD patients not using high-intensity statin therapy (HIST) due to statin-associated side effects (SASEs) and other reasons. METHODS: Reviewers annotated reasons for not prescribing HIST in notes of 1152 randomly selected patients from across the VA healthcare system treated for ASCVD but not receiving HIST. Developers used reviewer annotations to train the Canary NLP tool to detect and extract notes containing one or more of these reasons. Negative predictive value (NPV), sensitivity, specificity and Area Under the Curve (AUC) were used to assess accuracy at detecting documents containing reasons when using structured data, NLP-extracted unstructured data, or both data sources combined. RESULTS: At least one documented reason for not prescribing HIST occurred in 47% of notes. The most frequent reasons were SASEs (41%) and general intolerance (20%). When identifying notes containing any documented reason for not using HIST, adding NLP-extracted, unstructured data significantly (p<0.05) increased sensitivity (0.69 (95% confidence interval [CI] 0.60–0.76) to 0.89 (95% CI 0.81–0.93)), NPV (0.90 (95% CI 0.87 to 0.93) to 0.96 (95% CI 0.93–0.98)), and AUC (0.84 (95% confidence interval [CI] 0.81–0.88) to 0.91 (95% CI 0.90–0.93)) compared to structured data alone. CONCLUSIONS: NLP extraction of data from unstructured text can improve identification of reasons for patients not being on HIST over structured data alone. The additional information provided through NLP of unstructured free text should help in tailoring and implementing system-level interventions to improve HIST use in patients with ASCVD. |
format | Online Article Text |
id | pubmed-8671496 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-86714962021-12-22 Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease Gobbel, Glenn T. Matheny, Michael E. Reeves, Ruth R. Akeroyd, Julia M. Turchin, Alexander Ballantyne, Christie M. Petersen, Laura A. Virani, Salim S. Am J Prev Cardiol Original Research Contribution OBJECTIVE: To determine whether natural language processing (NLP) of unstructured medical text can improve identification of ASCVD patients not using high-intensity statin therapy (HIST) due to statin-associated side effects (SASEs) and other reasons. METHODS: Reviewers annotated reasons for not prescribing HIST in notes of 1152 randomly selected patients from across the VA healthcare system treated for ASCVD but not receiving HIST. Developers used reviewer annotations to train the Canary NLP tool to detect and extract notes containing one or more of these reasons. Negative predictive value (NPV), sensitivity, specificity and Area Under the Curve (AUC) were used to assess accuracy at detecting documents containing reasons when using structured data, NLP-extracted unstructured data, or both data sources combined. RESULTS: At least one documented reason for not prescribing HIST occurred in 47% of notes. The most frequent reasons were SASEs (41%) and general intolerance (20%). When identifying notes containing any documented reason for not using HIST, adding NLP-extracted, unstructured data significantly (p<0.05) increased sensitivity (0.69 (95% confidence interval [CI] 0.60–0.76) to 0.89 (95% CI 0.81–0.93)), NPV (0.90 (95% CI 0.87 to 0.93) to 0.96 (95% CI 0.93–0.98)), and AUC (0.84 (95% confidence interval [CI] 0.81–0.88) to 0.91 (95% CI 0.90–0.93)) compared to structured data alone. CONCLUSIONS: NLP extraction of data from unstructured text can improve identification of reasons for patients not being on HIST over structured data alone. The additional information provided through NLP of unstructured free text should help in tailoring and implementing system-level interventions to improve HIST use in patients with ASCVD. Elsevier 2021-12-03 /pmc/articles/PMC8671496/ /pubmed/34950914 http://dx.doi.org/10.1016/j.ajpc.2021.100300 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Original Research Contribution Gobbel, Glenn T. Matheny, Michael E. Reeves, Ruth R. Akeroyd, Julia M. Turchin, Alexander Ballantyne, Christie M. Petersen, Laura A. Virani, Salim S. Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease |
title | Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease |
title_full | Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease |
title_fullStr | Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease |
title_full_unstemmed | Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease |
title_short | Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease |
title_sort | leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease |
topic | Original Research Contribution |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8671496/ https://www.ncbi.nlm.nih.gov/pubmed/34950914 http://dx.doi.org/10.1016/j.ajpc.2021.100300 |
work_keys_str_mv | AT gobbelglennt leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease AT mathenymichaele leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease AT reevesruthr leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease AT akeroydjuliam leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease AT turchinalexander leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease AT ballantynechristiem leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease AT petersenlauraa leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease AT viranisalims leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease |