Cargando…
Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer
BACKGROUND: Research on venous thromboembolism (VTE) that relies only on the International Classification of Diseases (ICD) can misclassify outcomes. Our study aims to discover and validate an improved VTE computable phenotype for people with cancer. METHODS: We used a cancer registry electronic hea...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9130880/ https://www.ncbi.nlm.nih.gov/pubmed/35647478 http://dx.doi.org/10.1002/rth2.12733 |
_version_ | 1784713067981963264 |
---|---|
author | Li, Ang da Costa, Wilson L. Guffey, Danielle Milner, Emily M. Allam, Anthony K. Kurian, Karen M. Novoa, Francisco J. Poche, Marguerite D. Bandyo, Raka Granada, Carolina Wallace, Courtney D. Zakai, Neil A. Amos, Christopher I. |
author_facet | Li, Ang da Costa, Wilson L. Guffey, Danielle Milner, Emily M. Allam, Anthony K. Kurian, Karen M. Novoa, Francisco J. Poche, Marguerite D. Bandyo, Raka Granada, Carolina Wallace, Courtney D. Zakai, Neil A. Amos, Christopher I. |
author_sort | Li, Ang |
collection | PubMed |
description | BACKGROUND: Research on venous thromboembolism (VTE) that relies only on the International Classification of Diseases (ICD) can misclassify outcomes. Our study aims to discover and validate an improved VTE computable phenotype for people with cancer. METHODS: We used a cancer registry electronic health record (EHR)–linked longitudinal database. We derived three algorithms that were ICD/medication based, natural language processing (NLP) based, or all combined. We then randomly sampled 400 patients from patients with VTE codes (n = 1111) and 400 from those without VTE codes (n = 7396). Weighted sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated on the entire sample using inverse probability weighting, followed by bootstrapped receiver operating curve analysis to calculate the concordance statistic (c statistic). RESULTS: Among 800 patients sampled, 280 had a confirmed acute VTE during the first year after cancer diagnosis. The ICD/medication algorithm had a weighted PPV of 95% and a weighted sensitivity of 81%, with a c statistic of 0.90 (95% confidence interval [CI], 0.89–0.91). Adding Current Procedural Terminology codes for inferior vena cava filter removal or early death did not improve the performance. The NLP algorithm had a weighted PPV of 80% and a weighted sensitivity of 90%, with a c statistic of 0.93 (95% CI, 0.92–0.94). The combined algorithm had a weighted PPV of 98% at the higher cutoff and a weighted sensitivity of 96% at the lower cutoff, with a c statistic of 0.98 (95% CI, 0.97–0.98). CONCLUSIONS: Our ICD/medication‐based algorithm can accurately identify VTE phenotype among patients with cancer with a high PPV of 95%. The combined algorithm should be considered in EHR databases that have access to such capabilities. |
format | Online Article Text |
id | pubmed-9130880 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-91308802022-05-26 Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer Li, Ang da Costa, Wilson L. Guffey, Danielle Milner, Emily M. Allam, Anthony K. Kurian, Karen M. Novoa, Francisco J. Poche, Marguerite D. Bandyo, Raka Granada, Carolina Wallace, Courtney D. Zakai, Neil A. Amos, Christopher I. Res Pract Thromb Haemost Original Articles BACKGROUND: Research on venous thromboembolism (VTE) that relies only on the International Classification of Diseases (ICD) can misclassify outcomes. Our study aims to discover and validate an improved VTE computable phenotype for people with cancer. METHODS: We used a cancer registry electronic health record (EHR)–linked longitudinal database. We derived three algorithms that were ICD/medication based, natural language processing (NLP) based, or all combined. We then randomly sampled 400 patients from patients with VTE codes (n = 1111) and 400 from those without VTE codes (n = 7396). Weighted sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated on the entire sample using inverse probability weighting, followed by bootstrapped receiver operating curve analysis to calculate the concordance statistic (c statistic). RESULTS: Among 800 patients sampled, 280 had a confirmed acute VTE during the first year after cancer diagnosis. The ICD/medication algorithm had a weighted PPV of 95% and a weighted sensitivity of 81%, with a c statistic of 0.90 (95% confidence interval [CI], 0.89–0.91). Adding Current Procedural Terminology codes for inferior vena cava filter removal or early death did not improve the performance. The NLP algorithm had a weighted PPV of 80% and a weighted sensitivity of 90%, with a c statistic of 0.93 (95% CI, 0.92–0.94). The combined algorithm had a weighted PPV of 98% at the higher cutoff and a weighted sensitivity of 96% at the lower cutoff, with a c statistic of 0.98 (95% CI, 0.97–0.98). CONCLUSIONS: Our ICD/medication‐based algorithm can accurately identify VTE phenotype among patients with cancer with a high PPV of 95%. The combined algorithm should be considered in EHR databases that have access to such capabilities. John Wiley and Sons Inc. 2022-05-25 /pmc/articles/PMC9130880/ /pubmed/35647478 http://dx.doi.org/10.1002/rth2.12733 Text en © 2022 The Authors. Research and Practice in Thrombosis and Haemostasis published by Wiley Periodicals LLC on behalf of International Society on Thrombosis and Haemostasis (ISTH). https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Original Articles Li, Ang da Costa, Wilson L. Guffey, Danielle Milner, Emily M. Allam, Anthony K. Kurian, Karen M. Novoa, Francisco J. Poche, Marguerite D. Bandyo, Raka Granada, Carolina Wallace, Courtney D. Zakai, Neil A. Amos, Christopher I. Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer |
title | Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer |
title_full | Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer |
title_fullStr | Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer |
title_full_unstemmed | Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer |
title_short | Developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer |
title_sort | developing and optimizing a computable phenotype for incident venous thromboembolism in a longitudinal cohort of patients with cancer |
topic | Original Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9130880/ https://www.ncbi.nlm.nih.gov/pubmed/35647478 http://dx.doi.org/10.1002/rth2.12733 |
work_keys_str_mv | AT liang developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT dacostawilsonl developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT guffeydanielle developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT milneremilym developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT allamanthonyk developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT kuriankarenm developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT novoafranciscoj developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT pochemarguerited developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT bandyoraka developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT granadacarolina developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT wallacecourtneyd developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT zakaineila developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer AT amoschristopheri developingandoptimizingacomputablephenotypeforincidentvenousthromboembolisminalongitudinalcohortofpatientswithcancer |