Cargando…

Validation and updating of risk models based on multinomial logistic regression

BACKGROUND: Risk models often perform poorly at external validation in terms of discrimination or calibration. Updating methods are needed to improve performance of multinomial logistic regression models for risk prediction. METHODS: We consider simple and more refined updating approaches to extend...

Descripción completa

Detalles Bibliográficos
Autores principales: Van Calster, Ben, Van Hoorde, Kirsten, Vergouwe, Yvonne, Bobdiwala, Shabnam, Condous, George, Kirk, Emma, Bourne, Tom, Steyerberg, Ewout W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6457140/
https://www.ncbi.nlm.nih.gov/pubmed/31093534
http://dx.doi.org/10.1186/s41512-016-0002-x
_version_ 1783409862557827072
author Van Calster, Ben
Van Hoorde, Kirsten
Vergouwe, Yvonne
Bobdiwala, Shabnam
Condous, George
Kirk, Emma
Bourne, Tom
Steyerberg, Ewout W.
author_facet Van Calster, Ben
Van Hoorde, Kirsten
Vergouwe, Yvonne
Bobdiwala, Shabnam
Condous, George
Kirk, Emma
Bourne, Tom
Steyerberg, Ewout W.
author_sort Van Calster, Ben
collection PubMed
description BACKGROUND: Risk models often perform poorly at external validation in terms of discrimination or calibration. Updating methods are needed to improve performance of multinomial logistic regression models for risk prediction. METHODS: We consider simple and more refined updating approaches to extend previously proposed methods for dichotomous outcomes. These include model recalibration (adjustment of intercept and/or slope), revision (re-estimation of individual model coefficients), and extension (revision with additional markers). We suggest a closed testing procedure to assist in deciding on the updating complexity. These methods are demonstrated on a case study of women with pregnancies of unknown location (PUL). A previously developed risk model predicts the probability that a PUL is a failed, intra-uterine, or ectopic pregnancy. We validated and updated this model on more recent patients from the development setting (temporal updating; n = 1422) and on patients from a different hospital (geographical updating; n = 873). Internal validation of updated models was performed through bootstrap resampling. RESULTS: Contrary to dichotomous models, we noted that recalibration can also affect discrimination for multinomial risk models. If the number of outcome categories is higher than the number of variables, logistic recalibration is obsolete because straightforward model refitting does not require the estimation of more parameters. Although recalibration strongly improved performance in the case study, the closed testing procedure selected model revision. Further, revision of functional form of continuous predictors had a positive effect on discrimination, whereas penalized estimation of changes in model coefficients was beneficial for calibration. CONCLUSIONS: Methods for updating of multinomial risk models are now available to improve predictions in new settings. A closed testing procedure is helpful to decide whether revision is preferred over recalibration. Because multicategory outcomes increase the number of parameters to be estimated, we recommend full model revision only when the sample size for each outcome category is large.
format Online
Article
Text
id pubmed-6457140
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64571402019-05-15 Validation and updating of risk models based on multinomial logistic regression Van Calster, Ben Van Hoorde, Kirsten Vergouwe, Yvonne Bobdiwala, Shabnam Condous, George Kirk, Emma Bourne, Tom Steyerberg, Ewout W. Diagn Progn Res Methodology BACKGROUND: Risk models often perform poorly at external validation in terms of discrimination or calibration. Updating methods are needed to improve performance of multinomial logistic regression models for risk prediction. METHODS: We consider simple and more refined updating approaches to extend previously proposed methods for dichotomous outcomes. These include model recalibration (adjustment of intercept and/or slope), revision (re-estimation of individual model coefficients), and extension (revision with additional markers). We suggest a closed testing procedure to assist in deciding on the updating complexity. These methods are demonstrated on a case study of women with pregnancies of unknown location (PUL). A previously developed risk model predicts the probability that a PUL is a failed, intra-uterine, or ectopic pregnancy. We validated and updated this model on more recent patients from the development setting (temporal updating; n = 1422) and on patients from a different hospital (geographical updating; n = 873). Internal validation of updated models was performed through bootstrap resampling. RESULTS: Contrary to dichotomous models, we noted that recalibration can also affect discrimination for multinomial risk models. If the number of outcome categories is higher than the number of variables, logistic recalibration is obsolete because straightforward model refitting does not require the estimation of more parameters. Although recalibration strongly improved performance in the case study, the closed testing procedure selected model revision. Further, revision of functional form of continuous predictors had a positive effect on discrimination, whereas penalized estimation of changes in model coefficients was beneficial for calibration. CONCLUSIONS: Methods for updating of multinomial risk models are now available to improve predictions in new settings. A closed testing procedure is helpful to decide whether revision is preferred over recalibration. Because multicategory outcomes increase the number of parameters to be estimated, we recommend full model revision only when the sample size for each outcome category is large. BioMed Central 2017-02-08 /pmc/articles/PMC6457140/ /pubmed/31093534 http://dx.doi.org/10.1186/s41512-016-0002-x Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Van Calster, Ben
Van Hoorde, Kirsten
Vergouwe, Yvonne
Bobdiwala, Shabnam
Condous, George
Kirk, Emma
Bourne, Tom
Steyerberg, Ewout W.
Validation and updating of risk models based on multinomial logistic regression
title Validation and updating of risk models based on multinomial logistic regression
title_full Validation and updating of risk models based on multinomial logistic regression
title_fullStr Validation and updating of risk models based on multinomial logistic regression
title_full_unstemmed Validation and updating of risk models based on multinomial logistic regression
title_short Validation and updating of risk models based on multinomial logistic regression
title_sort validation and updating of risk models based on multinomial logistic regression
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6457140/
https://www.ncbi.nlm.nih.gov/pubmed/31093534
http://dx.doi.org/10.1186/s41512-016-0002-x
work_keys_str_mv AT vancalsterben validationandupdatingofriskmodelsbasedonmultinomiallogisticregression
AT vanhoordekirsten validationandupdatingofriskmodelsbasedonmultinomiallogisticregression
AT vergouweyvonne validationandupdatingofriskmodelsbasedonmultinomiallogisticregression
AT bobdiwalashabnam validationandupdatingofriskmodelsbasedonmultinomiallogisticregression
AT condousgeorge validationandupdatingofriskmodelsbasedonmultinomiallogisticregression
AT kirkemma validationandupdatingofriskmodelsbasedonmultinomiallogisticregression
AT bournetom validationandupdatingofriskmodelsbasedonmultinomiallogisticregression
AT steyerbergewoutw validationandupdatingofriskmodelsbasedonmultinomiallogisticregression