Cargando…
Reclassification calibration test for censored survival data: performance and comparison to goodness-of-fit criteria
BACKGROUND: The risk reclassification table assesses clinical performance of a biomarker in terms of movements across relevant risk categories. The Reclassification- Calibration (RC) statistic has been developed for binary outcomes, but its performance for survival data with moderate to high censori...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6456068/ https://www.ncbi.nlm.nih.gov/pubmed/30984876 http://dx.doi.org/10.1186/s41512-018-0034-5 |
_version_ | 1783409701067685888 |
---|---|
author | Demler, Olga V. Paynter, Nina P. Cook, Nancy R. |
author_facet | Demler, Olga V. Paynter, Nina P. Cook, Nancy R. |
author_sort | Demler, Olga V. |
collection | PubMed |
description | BACKGROUND: The risk reclassification table assesses clinical performance of a biomarker in terms of movements across relevant risk categories. The Reclassification- Calibration (RC) statistic has been developed for binary outcomes, but its performance for survival data with moderate to high censoring rates has not been evaluated. METHODS: We develop an RC statistic for survival data with higher censoring rates using the Greenwood-Nam-D’Agostino approach (RC-GND). We examine its performance characteristics and compare its performance and utility to the Hosmer-Lemeshow goodness-of-fit test under various assumptions about the censoring rate and the shape of the baseline hazard. RESULTS: The RC-GND test was robust to high (up to 50%) censoring rates and did not exceed the targeted 5% Type I error in a variety of simulated scenarios. It achieved 80% power to detect better calibration with respect to clinical categories when an important predictor with a hazard ratio of at least 1.7 to 2.2 was added to the model, while the Hosmer-Lemeshow goodness-of-fit (gof) test had power of 5% in this scenario. CONCLUSIONS: The RC-GND test should be used to test the improvement in calibration with respect to clinically relevant risk strata. When an important predictor is omitted, the Hosmer-Lemeshow goodness-of-fit test is usually not significant, while the RC-GND test is sensitive to such an omission. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s41512-018-0034-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6456068 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-64560682019-05-15 Reclassification calibration test for censored survival data: performance and comparison to goodness-of-fit criteria Demler, Olga V. Paynter, Nina P. Cook, Nancy R. Diagn Progn Res Methodology BACKGROUND: The risk reclassification table assesses clinical performance of a biomarker in terms of movements across relevant risk categories. The Reclassification- Calibration (RC) statistic has been developed for binary outcomes, but its performance for survival data with moderate to high censoring rates has not been evaluated. METHODS: We develop an RC statistic for survival data with higher censoring rates using the Greenwood-Nam-D’Agostino approach (RC-GND). We examine its performance characteristics and compare its performance and utility to the Hosmer-Lemeshow goodness-of-fit test under various assumptions about the censoring rate and the shape of the baseline hazard. RESULTS: The RC-GND test was robust to high (up to 50%) censoring rates and did not exceed the targeted 5% Type I error in a variety of simulated scenarios. It achieved 80% power to detect better calibration with respect to clinical categories when an important predictor with a hazard ratio of at least 1.7 to 2.2 was added to the model, while the Hosmer-Lemeshow goodness-of-fit (gof) test had power of 5% in this scenario. CONCLUSIONS: The RC-GND test should be used to test the improvement in calibration with respect to clinically relevant risk strata. When an important predictor is omitted, the Hosmer-Lemeshow goodness-of-fit test is usually not significant, while the RC-GND test is sensitive to such an omission. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s41512-018-0034-5) contains supplementary material, which is available to authorized users. BioMed Central 2018-07-26 /pmc/articles/PMC6456068/ /pubmed/30984876 http://dx.doi.org/10.1186/s41512-018-0034-5 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Demler, Olga V. Paynter, Nina P. Cook, Nancy R. Reclassification calibration test for censored survival data: performance and comparison to goodness-of-fit criteria |
title | Reclassification calibration test for censored survival data: performance and comparison to goodness-of-fit criteria |
title_full | Reclassification calibration test for censored survival data: performance and comparison to goodness-of-fit criteria |
title_fullStr | Reclassification calibration test for censored survival data: performance and comparison to goodness-of-fit criteria |
title_full_unstemmed | Reclassification calibration test for censored survival data: performance and comparison to goodness-of-fit criteria |
title_short | Reclassification calibration test for censored survival data: performance and comparison to goodness-of-fit criteria |
title_sort | reclassification calibration test for censored survival data: performance and comparison to goodness-of-fit criteria |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6456068/ https://www.ncbi.nlm.nih.gov/pubmed/30984876 http://dx.doi.org/10.1186/s41512-018-0034-5 |
work_keys_str_mv | AT demlerolgav reclassificationcalibrationtestforcensoredsurvivaldataperformanceandcomparisontogoodnessoffitcriteria AT paynterninap reclassificationcalibrationtestforcensoredsurvivaldataperformanceandcomparisontogoodnessoffitcriteria AT cooknancyr reclassificationcalibrationtestforcensoredsurvivaldataperformanceandcomparisontogoodnessoffitcriteria |