Cargando…

Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease

OBJECTIVE: To determine whether natural language processing (NLP) of unstructured medical text can improve identification of ASCVD patients not using high-intensity statin therapy (HIST) due to statin-associated side effects (SASEs) and other reasons. METHODS: Reviewers annotated reasons for not pre...

Descripción completa

Detalles Bibliográficos
Autores principales: Gobbel, Glenn T., Matheny, Michael E., Reeves, Ruth R., Akeroyd, Julia M., Turchin, Alexander, Ballantyne, Christie M., Petersen, Laura A., Virani, Salim S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8671496/
https://www.ncbi.nlm.nih.gov/pubmed/34950914
http://dx.doi.org/10.1016/j.ajpc.2021.100300
_version_ 1784615150105395200
author Gobbel, Glenn T.
Matheny, Michael E.
Reeves, Ruth R.
Akeroyd, Julia M.
Turchin, Alexander
Ballantyne, Christie M.
Petersen, Laura A.
Virani, Salim S.
author_facet Gobbel, Glenn T.
Matheny, Michael E.
Reeves, Ruth R.
Akeroyd, Julia M.
Turchin, Alexander
Ballantyne, Christie M.
Petersen, Laura A.
Virani, Salim S.
author_sort Gobbel, Glenn T.
collection PubMed
description OBJECTIVE: To determine whether natural language processing (NLP) of unstructured medical text can improve identification of ASCVD patients not using high-intensity statin therapy (HIST) due to statin-associated side effects (SASEs) and other reasons. METHODS: Reviewers annotated reasons for not prescribing HIST in notes of 1152 randomly selected patients from across the VA healthcare system treated for ASCVD but not receiving HIST. Developers used reviewer annotations to train the Canary NLP tool to detect and extract notes containing one or more of these reasons. Negative predictive value (NPV), sensitivity, specificity and Area Under the Curve (AUC) were used to assess accuracy at detecting documents containing reasons when using structured data, NLP-extracted unstructured data, or both data sources combined. RESULTS: At least one documented reason for not prescribing HIST occurred in 47% of notes. The most frequent reasons were SASEs (41%) and general intolerance (20%). When identifying notes containing any documented reason for not using HIST, adding NLP-extracted, unstructured data significantly (p<0.05) increased sensitivity (0.69 (95% confidence interval [CI] 0.60–0.76) to 0.89 (95% CI 0.81–0.93)), NPV (0.90 (95% CI 0.87 to 0.93) to 0.96 (95% CI 0.93–0.98)), and AUC (0.84 (95% confidence interval [CI] 0.81–0.88) to 0.91 (95% CI 0.90–0.93)) compared to structured data alone. CONCLUSIONS: NLP extraction of data from unstructured text can improve identification of reasons for patients not being on HIST over structured data alone. The additional information provided through NLP of unstructured free text should help in tailoring and implementing system-level interventions to improve HIST use in patients with ASCVD.
format Online
Article
Text
id pubmed-8671496
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-86714962021-12-22 Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease Gobbel, Glenn T. Matheny, Michael E. Reeves, Ruth R. Akeroyd, Julia M. Turchin, Alexander Ballantyne, Christie M. Petersen, Laura A. Virani, Salim S. Am J Prev Cardiol Original Research Contribution OBJECTIVE: To determine whether natural language processing (NLP) of unstructured medical text can improve identification of ASCVD patients not using high-intensity statin therapy (HIST) due to statin-associated side effects (SASEs) and other reasons. METHODS: Reviewers annotated reasons for not prescribing HIST in notes of 1152 randomly selected patients from across the VA healthcare system treated for ASCVD but not receiving HIST. Developers used reviewer annotations to train the Canary NLP tool to detect and extract notes containing one or more of these reasons. Negative predictive value (NPV), sensitivity, specificity and Area Under the Curve (AUC) were used to assess accuracy at detecting documents containing reasons when using structured data, NLP-extracted unstructured data, or both data sources combined. RESULTS: At least one documented reason for not prescribing HIST occurred in 47% of notes. The most frequent reasons were SASEs (41%) and general intolerance (20%). When identifying notes containing any documented reason for not using HIST, adding NLP-extracted, unstructured data significantly (p<0.05) increased sensitivity (0.69 (95% confidence interval [CI] 0.60–0.76) to 0.89 (95% CI 0.81–0.93)), NPV (0.90 (95% CI 0.87 to 0.93) to 0.96 (95% CI 0.93–0.98)), and AUC (0.84 (95% confidence interval [CI] 0.81–0.88) to 0.91 (95% CI 0.90–0.93)) compared to structured data alone. CONCLUSIONS: NLP extraction of data from unstructured text can improve identification of reasons for patients not being on HIST over structured data alone. The additional information provided through NLP of unstructured free text should help in tailoring and implementing system-level interventions to improve HIST use in patients with ASCVD. Elsevier 2021-12-03 /pmc/articles/PMC8671496/ /pubmed/34950914 http://dx.doi.org/10.1016/j.ajpc.2021.100300 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Research Contribution
Gobbel, Glenn T.
Matheny, Michael E.
Reeves, Ruth R.
Akeroyd, Julia M.
Turchin, Alexander
Ballantyne, Christie M.
Petersen, Laura A.
Virani, Salim S.
Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease
title Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease
title_full Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease
title_fullStr Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease
title_full_unstemmed Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease
title_short Leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease
title_sort leveraging structured and unstructured electronic health record data to detect reasons for suboptimal statin therapy use in patients with atherosclerotic cardiovascular disease
topic Original Research Contribution
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8671496/
https://www.ncbi.nlm.nih.gov/pubmed/34950914
http://dx.doi.org/10.1016/j.ajpc.2021.100300
work_keys_str_mv AT gobbelglennt leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease
AT mathenymichaele leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease
AT reevesruthr leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease
AT akeroydjuliam leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease
AT turchinalexander leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease
AT ballantynechristiem leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease
AT petersenlauraa leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease
AT viranisalims leveragingstructuredandunstructuredelectronichealthrecorddatatodetectreasonsforsuboptimalstatintherapyuseinpatientswithatheroscleroticcardiovasculardisease