Cargando…

Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer

BACKGROUND: Research on venous thromboembolism (VTE) that relies only on the International Classification of Diseases (ICD) can misclassify outcomes. Our study aims to discover and validate an improved VTE computable phenotype for people with cancer. METHODS: We used a cancer registry electronic hea...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Ang, da Costa, Wilson L., Guffey, Danielle, Milner, Emily M., Allam, Anthony K., Kurian, Karen M., Novoa, Francisco J., Poche, Marguerite D., Bandyo, Raka, Granada, Carolina, Wallace, Courtney D., Zakai, Neil A., Amos, Christopher I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9130880/
https://www.ncbi.nlm.nih.gov/pubmed/35647478
http://dx.doi.org/10.1002/rth2.12733
_version_ 1784713067981963264
author Li, Ang
da Costa, Wilson L.
Guffey, Danielle
Milner, Emily M.
Allam, Anthony K.
Kurian, Karen M.
Novoa, Francisco J.
Poche, Marguerite D.
Bandyo, Raka
Granada, Carolina
Wallace, Courtney D.
Zakai, Neil A.
Amos, Christopher I.
author_facet Li, Ang
da Costa, Wilson L.
Guffey, Danielle
Milner, Emily M.
Allam, Anthony K.
Kurian, Karen M.
Novoa, Francisco J.
Poche, Marguerite D.
Bandyo, Raka
Granada, Carolina
Wallace, Courtney D.
Zakai, Neil A.
Amos, Christopher I.
author_sort Li, Ang
collection PubMed
description BACKGROUND: Research on venous thromboembolism (VTE) that relies only on the International Classification of Diseases (ICD) can misclassify outcomes. Our study aims to discover and validate an improved VTE computable phenotype for people with cancer. METHODS: We used a cancer registry electronic health record (EHR)–linked longitudinal database. We derived three algorithms that were ICD/medication based, natural language processing (NLP) based, or all combined. We then randomly sampled 400 patients from patients with VTE codes (n = 1111) and 400 from those without VTE codes (n = 7396). Weighted sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated on the entire sample using inverse probability weighting, followed by bootstrapped receiver operating curve analysis to calculate the concordance statistic (c statistic). RESULTS: Among 800 patients sampled, 280 had a confirmed acute VTE during the first year after cancer diagnosis. The ICD/medication algorithm had a weighted PPV of 95% and a weighted sensitivity of 81%, with a c statistic of 0.90 (95% confidence interval [CI], 0.89–0.91). Adding Current Procedural Terminology codes for inferior vena cava filter removal or early death did not improve the performance. The NLP algorithm had a weighted PPV of 80% and a weighted sensitivity of 90%, with a c statistic of 0.93 (95% CI, 0.92–0.94). The combined algorithm had a weighted PPV of 98% at the higher cutoff and a weighted sensitivity of 96% at the lower cutoff, with a c statistic of 0.98 (95% CI, 0.97–0.98). CONCLUSIONS: Our ICD/medication‐based algorithm can accurately identify VTE phenotype among patients with cancer with a high PPV of 95%. The combined algorithm should be considered in EHR databases that have access to such capabilities.
format Online
Article
Text
id pubmed-9130880
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-91308802022-05-26 Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer Li, Ang da Costa, Wilson L. Guffey, Danielle Milner, Emily M. Allam, Anthony K. Kurian, Karen M. Novoa, Francisco J. Poche, Marguerite D. Bandyo, Raka Granada, Carolina Wallace, Courtney D. Zakai, Neil A. Amos, Christopher I. Res Pract Thromb Haemost Original Articles BACKGROUND: Research on venous thromboembolism (VTE) that relies only on the International Classification of Diseases (ICD) can misclassify outcomes. Our study aims to discover and validate an improved VTE computable phenotype for people with cancer. METHODS: We used a cancer registry electronic health record (EHR)–linked longitudinal database. We derived three algorithms that were ICD/medication based, natural language processing (NLP) based, or all combined. We then randomly sampled 400 patients from patients with VTE codes (n = 1111) and 400 from those without VTE codes (n = 7396). Weighted sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated on the entire sample using inverse probability weighting, followed by bootstrapped receiver operating curve analysis to calculate the concordance statistic (c statistic). RESULTS: Among 800 patients sampled, 280 had a confirmed acute VTE during the first year after cancer diagnosis. The ICD/medication algorithm had a weighted PPV of 95% and a weighted sensitivity of 81%, with a c statistic of 0.90 (95% confidence interval [CI], 0.89–0.91). Adding Current Procedural Terminology codes for inferior vena cava filter removal or early death did not improve the performance. The NLP algorithm had a weighted PPV of 80% and a weighted sensitivity of 90%, with a c statistic of 0.93 (95% CI, 0.92–0.94). The combined algorithm had a weighted PPV of 98% at the higher cutoff and a weighted sensitivity of 96% at the lower cutoff, with a c statistic of 0.98 (95% CI, 0.97–0.98). CONCLUSIONS: Our ICD/medication‐based algorithm can accurately identify VTE phenotype among patients with cancer with a high PPV of 95%. The combined algorithm should be considered in EHR databases that have access to such capabilities. John Wiley and Sons Inc. 2022-05-25 /pmc/articles/PMC9130880/ /pubmed/35647478 http://dx.doi.org/10.1002/rth2.12733 Text en © 2022 The Authors. Research and Practice in Thrombosis and Haemostasis published by Wiley Periodicals LLC on behalf of International Society on Thrombosis and Haemostasis (ISTH). https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Original Articles
Li, Ang
da Costa, Wilson L.
Guffey, Danielle
Milner, Emily M.
Allam, Anthony K.
Kurian, Karen M.
Novoa, Francisco J.
Poche, Marguerite D.
Bandyo, Raka
Granada, Carolina
Wallace, Courtney D.
Zakai, Neil A.
Amos, Christopher I.
Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer
title Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer
title_full Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer
title_fullStr Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer
title_full_unstemmed Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer
title_short Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer
title_sort developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9130880/
https://www.ncbi.nlm.nih.gov/pubmed/35647478
http://dx.doi.org/10.1002/rth2.12733
work_keys_str_mv AT liang developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT dacostawilsonl developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT guffeydanielle developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT milneremilym developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT allamanthonyk developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT kuriankarenm developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT novoafranciscoj developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT pochemarguerited developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT bandyoraka developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT granadacarolina developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT wallacecourtneyd developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT zakaineila developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer
AT amoschristopheri developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer