Cargando…
Validation and updating of risk models based on multinomial logistic regression
BACKGROUND: Risk models often perform poorly at external validation in terms of discrimination or calibration. Updating methods are needed to improve performance of multinomial logistic regression models for risk prediction. METHODS: We consider simple and more refined updating approaches to extend...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6457140/ https://www.ncbi.nlm.nih.gov/pubmed/31093534 http://dx.doi.org/10.1186/s41512-016-0002-x |
_version_ | 1783409862557827072 |
---|---|
author | Van Calster, Ben Van Hoorde, Kirsten Vergouwe, Yvonne Bobdiwala, Shabnam Condous, George Kirk, Emma Bourne, Tom Steyerberg, Ewout W. |
author_facet | Van Calster, Ben Van Hoorde, Kirsten Vergouwe, Yvonne Bobdiwala, Shabnam Condous, George Kirk, Emma Bourne, Tom Steyerberg, Ewout W. |
author_sort | Van Calster, Ben |
collection | PubMed |
description | BACKGROUND: Risk models often perform poorly at external validation in terms of discrimination or calibration. Updating methods are needed to improve performance of multinomial logistic regression models for risk prediction. METHODS: We consider simple and more refined updating approaches to extend previously proposed methods for dichotomous outcomes. These include model recalibration (adjustment of intercept and/or slope), revision (re-estimation of individual model coefficients), and extension (revision with additional markers). We suggest a closed testing procedure to assist in deciding on the updating complexity. These methods are demonstrated on a case study of women with pregnancies of unknown location (PUL). A previously developed risk model predicts the probability that a PUL is a failed, intra-uterine, or ectopic pregnancy. We validated and updated this model on more recent patients from the development setting (temporal updating; n = 1422) and on patients from a different hospital (geographical updating; n = 873). Internal validation of updated models was performed through bootstrap resampling. RESULTS: Contrary to dichotomous models, we noted that recalibration can also affect discrimination for multinomial risk models. If the number of outcome categories is higher than the number of variables, logistic recalibration is obsolete because straightforward model refitting does not require the estimation of more parameters. Although recalibration strongly improved performance in the case study, the closed testing procedure selected model revision. Further, revision of functional form of continuous predictors had a positive effect on discrimination, whereas penalized estimation of changes in model coefficients was beneficial for calibration. CONCLUSIONS: Methods for updating of multinomial risk models are now available to improve predictions in new settings. A closed testing procedure is helpful to decide whether revision is preferred over recalibration. Because multicategory outcomes increase the number of parameters to be estimated, we recommend full model revision only when the sample size for each outcome category is large. |
format | Online Article Text |
id | pubmed-6457140 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-64571402019-05-15 Validation and updating of risk models based on multinomial logistic regression Van Calster, Ben Van Hoorde, Kirsten Vergouwe, Yvonne Bobdiwala, Shabnam Condous, George Kirk, Emma Bourne, Tom Steyerberg, Ewout W. Diagn Progn Res Methodology BACKGROUND: Risk models often perform poorly at external validation in terms of discrimination or calibration. Updating methods are needed to improve performance of multinomial logistic regression models for risk prediction. METHODS: We consider simple and more refined updating approaches to extend previously proposed methods for dichotomous outcomes. These include model recalibration (adjustment of intercept and/or slope), revision (re-estimation of individual model coefficients), and extension (revision with additional markers). We suggest a closed testing procedure to assist in deciding on the updating complexity. These methods are demonstrated on a case study of women with pregnancies of unknown location (PUL). A previously developed risk model predicts the probability that a PUL is a failed, intra-uterine, or ectopic pregnancy. We validated and updated this model on more recent patients from the development setting (temporal updating; n = 1422) and on patients from a different hospital (geographical updating; n = 873). Internal validation of updated models was performed through bootstrap resampling. RESULTS: Contrary to dichotomous models, we noted that recalibration can also affect discrimination for multinomial risk models. If the number of outcome categories is higher than the number of variables, logistic recalibration is obsolete because straightforward model refitting does not require the estimation of more parameters. Although recalibration strongly improved performance in the case study, the closed testing procedure selected model revision. Further, revision of functional form of continuous predictors had a positive effect on discrimination, whereas penalized estimation of changes in model coefficients was beneficial for calibration. CONCLUSIONS: Methods for updating of multinomial risk models are now available to improve predictions in new settings. A closed testing procedure is helpful to decide whether revision is preferred over recalibration. Because multicategory outcomes increase the number of parameters to be estimated, we recommend full model revision only when the sample size for each outcome category is large. BioMed Central 2017-02-08 /pmc/articles/PMC6457140/ /pubmed/31093534 http://dx.doi.org/10.1186/s41512-016-0002-x Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Van Calster, Ben Van Hoorde, Kirsten Vergouwe, Yvonne Bobdiwala, Shabnam Condous, George Kirk, Emma Bourne, Tom Steyerberg, Ewout W. Validation and updating of risk models based on multinomial logistic regression |
title | Validation and updating of risk models based on multinomial logistic regression |
title_full | Validation and updating of risk models based on multinomial logistic regression |
title_fullStr | Validation and updating of risk models based on multinomial logistic regression |
title_full_unstemmed | Validation and updating of risk models based on multinomial logistic regression |
title_short | Validation and updating of risk models based on multinomial logistic regression |
title_sort | validation and updating of risk models based on multinomial logistic regression |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6457140/ https://www.ncbi.nlm.nih.gov/pubmed/31093534 http://dx.doi.org/10.1186/s41512-016-0002-x |
work_keys_str_mv | AT vancalsterben validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT vanhoordekirsten validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT vergouweyvonne validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT bobdiwalashabnam validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT condousgeorge validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT kirkemma validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT bournetom validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT steyerbergewoutw validationandupdatingofriskmodelsbasedonmultinomiallogisticregression |