Cargando…

Evaluation of the accuracy of algorithms to identify soft tissue sarcoma (STS) in administrative claims

BACKGROUND: Lack of using a validated algorithm to select patients is a source of selection bias in oncology studies using administrative claims. The objective of this study to evaluate published algorithms to identify patients with soft tissue sarcoma (STS) in administrative claims and to evaluate...

Descripción completa

Detalles Bibliográficos
Autores principales: Princic, Nicole, McMorrow, Donna, Chan, Philip, Hess, Lisa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7199315/
https://www.ncbi.nlm.nih.gov/pubmed/32391140
http://dx.doi.org/10.1186/s13569-020-00130-y
_version_ 1783529133339312128
author Princic, Nicole
McMorrow, Donna
Chan, Philip
Hess, Lisa
author_facet Princic, Nicole
McMorrow, Donna
Chan, Philip
Hess, Lisa
author_sort Princic, Nicole
collection PubMed
description BACKGROUND: Lack of using a validated algorithm to select patients is a source of selection bias in oncology studies using administrative claims. The objective of this study to evaluate published algorithms to identify patients with soft tissue sarcoma (STS) in administrative claims and to evaluate new algorithms to improved performance. METHODS: Two cancer populations including STS cases and non-STS controls were selected from the MarketScan Explorys Linked Claims-Electronic Medical Record (EMR) Database between January 1, 2000 and July 31, 2018. Eligible cases had a diagnosis on a clinical record for STS in the EMR while controls had no evidence of STS on any EMR records. Both cases and controls were enrolled in administrative claims during a period of observation and were aged ≥ 18 years. A split sample was used to test and validate algorithms using data from administrative claims. Values for sensitivity, specificity, and positive predictive value (PPV) were calculated for 14 algorithms. Prior literature validating algorithms in administrative claims across other cancer types report both sensitivity and specificity ranging from as low as 73% to as high as 95%. This was used as a benchmark for defining algorithm success. RESULTS: There were 784 STS cases and 249,062 non-STS cancer controls eligible for analysis. Requiring at least two claims with an ICD-CM diagnosis code for STS achieved a sensitivity of 67% but had a specificity of 72%. Algorithms that required NCCN-recommended systemic treatment for STS improved the specificity to over 90% but dropped the sensitivity to below 20%. Other combinations of diagnostic tests, symptoms, and procedures did not improve performance. CONCLUSIONS: The algorithms tested in this study sample did not achieve sufficient performance and suggest the ability to accurately identify the STS population in administrative data is problematic. Difficulties are likely due to the origin of STS in a variety of locations, the non-specific symptoms of STS, and the common diagnostic tests recommended to diagnose the disease. Future research applying machine learning to examine timing and patterns of variables that comprise the diagnostic process may further investigate the ability to accurately identify STS cases in claims databases.
format Online
Article
Text
id pubmed-7199315
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-71993152020-05-08 Evaluation of the accuracy of algorithms to identify soft tissue sarcoma (STS) in administrative claims Princic, Nicole McMorrow, Donna Chan, Philip Hess, Lisa Clin Sarcoma Res Research BACKGROUND: Lack of using a validated algorithm to select patients is a source of selection bias in oncology studies using administrative claims. The objective of this study to evaluate published algorithms to identify patients with soft tissue sarcoma (STS) in administrative claims and to evaluate new algorithms to improved performance. METHODS: Two cancer populations including STS cases and non-STS controls were selected from the MarketScan Explorys Linked Claims-Electronic Medical Record (EMR) Database between January 1, 2000 and July 31, 2018. Eligible cases had a diagnosis on a clinical record for STS in the EMR while controls had no evidence of STS on any EMR records. Both cases and controls were enrolled in administrative claims during a period of observation and were aged ≥ 18 years. A split sample was used to test and validate algorithms using data from administrative claims. Values for sensitivity, specificity, and positive predictive value (PPV) were calculated for 14 algorithms. Prior literature validating algorithms in administrative claims across other cancer types report both sensitivity and specificity ranging from as low as 73% to as high as 95%. This was used as a benchmark for defining algorithm success. RESULTS: There were 784 STS cases and 249,062 non-STS cancer controls eligible for analysis. Requiring at least two claims with an ICD-CM diagnosis code for STS achieved a sensitivity of 67% but had a specificity of 72%. Algorithms that required NCCN-recommended systemic treatment for STS improved the specificity to over 90% but dropped the sensitivity to below 20%. Other combinations of diagnostic tests, symptoms, and procedures did not improve performance. CONCLUSIONS: The algorithms tested in this study sample did not achieve sufficient performance and suggest the ability to accurately identify the STS population in administrative data is problematic. Difficulties are likely due to the origin of STS in a variety of locations, the non-specific symptoms of STS, and the common diagnostic tests recommended to diagnose the disease. Future research applying machine learning to examine timing and patterns of variables that comprise the diagnostic process may further investigate the ability to accurately identify STS cases in claims databases. BioMed Central 2020-05-05 /pmc/articles/PMC7199315/ /pubmed/32391140 http://dx.doi.org/10.1186/s13569-020-00130-y Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Princic, Nicole
McMorrow, Donna
Chan, Philip
Hess, Lisa
Evaluation of the accuracy of algorithms to identify soft tissue sarcoma (STS) in administrative claims
title Evaluation of the accuracy of algorithms to identify soft tissue sarcoma (STS) in administrative claims
title_full Evaluation of the accuracy of algorithms to identify soft tissue sarcoma (STS) in administrative claims
title_fullStr Evaluation of the accuracy of algorithms to identify soft tissue sarcoma (STS) in administrative claims
title_full_unstemmed Evaluation of the accuracy of algorithms to identify soft tissue sarcoma (STS) in administrative claims
title_short Evaluation of the accuracy of algorithms to identify soft tissue sarcoma (STS) in administrative claims
title_sort evaluation of the accuracy of algorithms to identify soft tissue sarcoma (sts) in administrative claims
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7199315/
https://www.ncbi.nlm.nih.gov/pubmed/32391140
http://dx.doi.org/10.1186/s13569-020-00130-y
work_keys_str_mv AT princicnicole evaluationoftheaccuracyofalgorithmstoidentifysofttissuesarcomastsinadministrativeclaims
AT mcmorrowdonna evaluationoftheaccuracyofalgorithmstoidentifysofttissuesarcomastsinadministrativeclaims
AT chanphilip evaluationoftheaccuracyofalgorithmstoidentifysofttissuesarcomastsinadministrativeclaims
AT hesslisa evaluationoftheaccuracyofalgorithmstoidentifysofttissuesarcomastsinadministrativeclaims