Cargando…

Validation and updating of risk models based on multinomial logistic regression

BACKGROUND: Risk models often perform poorly at external validation in terms of discrimination or calibration. Updating methods are needed to improve performance of multinomial logistic regression models for risk prediction. METHODS: We consider simple and more refined updating approaches to extend...

Descripción completa

Detalles Bibliográficos
Autores principales:	Van Calster, Ben, Van Hoorde, Kirsten, Vergouwe, Yvonne, Bobdiwala, Shabnam, Condous, George, Kirk, Emma, Bourne, Tom, Steyerberg, Ewout W.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2017
Materias:	Methodology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6457140/ https://www.ncbi.nlm.nih.gov/pubmed/31093534 http://dx.doi.org/10.1186/s41512-016-0002-x

_version_	1783409862557827072
author	Van Calster, Ben Van Hoorde, Kirsten Vergouwe, Yvonne Bobdiwala, Shabnam Condous, George Kirk, Emma Bourne, Tom Steyerberg, Ewout W.
author_facet	Van Calster, Ben Van Hoorde, Kirsten Vergouwe, Yvonne Bobdiwala, Shabnam Condous, George Kirk, Emma Bourne, Tom Steyerberg, Ewout W.
author_sort	Van Calster, Ben
collection	PubMed
description	BACKGROUND: Risk models often perform poorly at external validation in terms of discrimination or calibration. Updating methods are needed to improve performance of multinomial logistic regression models for risk prediction. METHODS: We consider simple and more refined updating approaches to extend previously proposed methods for dichotomous outcomes. These include model recalibration (adjustment of intercept and/or slope), revision (re-estimation of individual model coefficients), and extension (revision with additional markers). We suggest a closed testing procedure to assist in deciding on the updating complexity. These methods are demonstrated on a case study of women with pregnancies of unknown location (PUL). A previously developed risk model predicts the probability that a PUL is a failed, intra-uterine, or ectopic pregnancy. We validated and updated this model on more recent patients from the development setting (temporal updating; n = 1422) and on patients from a different hospital (geographical updating; n = 873). Internal validation of updated models was performed through bootstrap resampling. RESULTS: Contrary to dichotomous models, we noted that recalibration can also affect discrimination for multinomial risk models. If the number of outcome categories is higher than the number of variables, logistic recalibration is obsolete because straightforward model refitting does not require the estimation of more parameters. Although recalibration strongly improved performance in the case study, the closed testing procedure selected model revision. Further, revision of functional form of continuous predictors had a positive effect on discrimination, whereas penalized estimation of changes in model coefficients was beneficial for calibration. CONCLUSIONS: Methods for updating of multinomial risk models are now available to improve predictions in new settings. A closed testing procedure is helpful to decide whether revision is preferred over recalibration. Because multicategory outcomes increase the number of parameters to be estimated, we recommend full model revision only when the sample size for each outcome category is large.
format	Online Article Text
id	pubmed-6457140
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-64571402019-05-15 Validation and updating of risk models based on multinomial logistic regression Van Calster, Ben Van Hoorde, Kirsten Vergouwe, Yvonne Bobdiwala, Shabnam Condous, George Kirk, Emma Bourne, Tom Steyerberg, Ewout W. Diagn Progn Res Methodology BACKGROUND: Risk models often perform poorly at external validation in terms of discrimination or calibration. Updating methods are needed to improve performance of multinomial logistic regression models for risk prediction. METHODS: We consider simple and more refined updating approaches to extend previously proposed methods for dichotomous outcomes. These include model recalibration (adjustment of intercept and/or slope), revision (re-estimation of individual model coefficients), and extension (revision with additional markers). We suggest a closed testing procedure to assist in deciding on the updating complexity. These methods are demonstrated on a case study of women with pregnancies of unknown location (PUL). A previously developed risk model predicts the probability that a PUL is a failed, intra-uterine, or ectopic pregnancy. We validated and updated this model on more recent patients from the development setting (temporal updating; n = 1422) and on patients from a different hospital (geographical updating; n = 873). Internal validation of updated models was performed through bootstrap resampling. RESULTS: Contrary to dichotomous models, we noted that recalibration can also affect discrimination for multinomial risk models. If the number of outcome categories is higher than the number of variables, logistic recalibration is obsolete because straightforward model refitting does not require the estimation of more parameters. Although recalibration strongly improved performance in the case study, the closed testing procedure selected model revision. Further, revision of functional form of continuous predictors had a positive effect on discrimination, whereas penalized estimation of changes in model coefficients was beneficial for calibration. CONCLUSIONS: Methods for updating of multinomial risk models are now available to improve predictions in new settings. A closed testing procedure is helpful to decide whether revision is preferred over recalibration. Because multicategory outcomes increase the number of parameters to be estimated, we recommend full model revision only when the sample size for each outcome category is large. BioMed Central 2017-02-08 /pmc/articles/PMC6457140/ /pubmed/31093534 http://dx.doi.org/10.1186/s41512-016-0002-x Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Methodology Van Calster, Ben Van Hoorde, Kirsten Vergouwe, Yvonne Bobdiwala, Shabnam Condous, George Kirk, Emma Bourne, Tom Steyerberg, Ewout W. Validation and updating of risk models based on multinomial logistic regression
title	Validation and updating of risk models based on multinomial logistic regression
title_full	Validation and updating of risk models based on multinomial logistic regression
title_fullStr	Validation and updating of risk models based on multinomial logistic regression
title_full_unstemmed	Validation and updating of risk models based on multinomial logistic regression
title_short	Validation and updating of risk models based on multinomial logistic regression
title_sort	validation and updating of risk models based on multinomial logistic regression
topic	Methodology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6457140/ https://www.ncbi.nlm.nih.gov/pubmed/31093534 http://dx.doi.org/10.1186/s41512-016-0002-x
work_keys_str_mv	AT vancalsterben validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT vanhoordekirsten validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT vergouweyvonne validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT bobdiwalashabnam validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT condousgeorge validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT kirkemma validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT bournetom validationandupdatingofriskmodelsbasedonmultinomiallogisticregression AT steyerbergewoutw validationandupdatingofriskmodelsbasedonmultinomiallogisticregression

Validation and updating of risk models based on multinomial logistic regression

Ejemplares similares