Cargando…

Testing Calibration of Cox Survival Models at Extremes of Event Risk

Risk prediction models can translate genetic association findings for clinical decision-making. Most models are evaluated on their ability to discriminate, and the calibration of risk-prediction models is largely overlooked in applications. Models that demonstrate good discrimination in training dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Soave, David M., Strug, Lisa J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5972303/
https://www.ncbi.nlm.nih.gov/pubmed/29872446
http://dx.doi.org/10.3389/fgene.2018.00177
_version_ 1783326415206219776
author Soave, David M.
Strug, Lisa J.
author_facet Soave, David M.
Strug, Lisa J.
author_sort Soave, David M.
collection PubMed
description Risk prediction models can translate genetic association findings for clinical decision-making. Most models are evaluated on their ability to discriminate, and the calibration of risk-prediction models is largely overlooked in applications. Models that demonstrate good discrimination in training datasets, if not properly calibrated to produce unbiased estimates of risk, can perform poorly in new patient populations. Poorly calibrated models arise due to missing covariates, such as genetic interactions that may be unknown or not measured. We demonstrate that models omitting interactions can lead to increased bias in predicted risk for patients at the tails of the risk distribution; i.e., those patients who are most likely to be affected by clinical decision making. We propose a new calibration test for Cox risk-prediction models that aggregates martingale residuals for subjects from extreme high and low risk groups with a test statistic maximum chosen by varying which risk groups are included in the extremes. To estimate the empirical significance of our test statistic, we simulate from a Gaussian distribution using the covariance matrix for the grouped sums of martingale residuals. Simulation shows the new test maintains control of type 1 error with improved power over a conventional goodness-of-fit test when risk prediction deviates at the tails of the risk distribution. We apply our method in the development of a prediction model for risk of cystic fibrosis-related diabetes. Our study highlights the importance of assessing calibration and discrimination in predictive modeling, and provides a complementary tool in the assessment of risk model calibration.
format Online
Article
Text
id pubmed-5972303
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-59723032018-06-05 Testing Calibration of Cox Survival Models at Extremes of Event Risk Soave, David M. Strug, Lisa J. Front Genet Genetics Risk prediction models can translate genetic association findings for clinical decision-making. Most models are evaluated on their ability to discriminate, and the calibration of risk-prediction models is largely overlooked in applications. Models that demonstrate good discrimination in training datasets, if not properly calibrated to produce unbiased estimates of risk, can perform poorly in new patient populations. Poorly calibrated models arise due to missing covariates, such as genetic interactions that may be unknown or not measured. We demonstrate that models omitting interactions can lead to increased bias in predicted risk for patients at the tails of the risk distribution; i.e., those patients who are most likely to be affected by clinical decision making. We propose a new calibration test for Cox risk-prediction models that aggregates martingale residuals for subjects from extreme high and low risk groups with a test statistic maximum chosen by varying which risk groups are included in the extremes. To estimate the empirical significance of our test statistic, we simulate from a Gaussian distribution using the covariance matrix for the grouped sums of martingale residuals. Simulation shows the new test maintains control of type 1 error with improved power over a conventional goodness-of-fit test when risk prediction deviates at the tails of the risk distribution. We apply our method in the development of a prediction model for risk of cystic fibrosis-related diabetes. Our study highlights the importance of assessing calibration and discrimination in predictive modeling, and provides a complementary tool in the assessment of risk model calibration. Frontiers Media S.A. 2018-05-22 /pmc/articles/PMC5972303/ /pubmed/29872446 http://dx.doi.org/10.3389/fgene.2018.00177 Text en Copyright © 2018 Soave and Strug. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Soave, David M.
Strug, Lisa J.
Testing Calibration of Cox Survival Models at Extremes of Event Risk
title Testing Calibration of Cox Survival Models at Extremes of Event Risk
title_full Testing Calibration of Cox Survival Models at Extremes of Event Risk
title_fullStr Testing Calibration of Cox Survival Models at Extremes of Event Risk
title_full_unstemmed Testing Calibration of Cox Survival Models at Extremes of Event Risk
title_short Testing Calibration of Cox Survival Models at Extremes of Event Risk
title_sort testing calibration of cox survival models at extremes of event risk
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5972303/
https://www.ncbi.nlm.nih.gov/pubmed/29872446
http://dx.doi.org/10.3389/fgene.2018.00177
work_keys_str_mv AT soavedavidm testingcalibrationofcoxsurvivalmodelsatextremesofeventrisk
AT struglisaj testingcalibrationofcoxsurvivalmodelsatextremesofeventrisk