Cargando…

Comparison of standard and penalized logistic regression in risk model development

OBJECTIVE: Regression models are ubiquitous in thoracic surgical research. We aimed to compare the value of standard logistic regression with the more complex but increasingly used penalized regression models using a recently published risk model as an example. METHODS: Using a standardized data set...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Yan, Yang, Zhizhou, Semenkovich, Tara R., Kozower, Benjamin D., Meyers, Bryan F., Nava, Ruben G., Kreisel, Daniel, Puri, Varun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9390725/
https://www.ncbi.nlm.nih.gov/pubmed/36003440
http://dx.doi.org/10.1016/j.xjon.2022.01.016
_version_ 1784770718373773312
author Yan, Yan
Yang, Zhizhou
Semenkovich, Tara R.
Kozower, Benjamin D.
Meyers, Bryan F.
Nava, Ruben G.
Kreisel, Daniel
Puri, Varun
author_facet Yan, Yan
Yang, Zhizhou
Semenkovich, Tara R.
Kozower, Benjamin D.
Meyers, Bryan F.
Nava, Ruben G.
Kreisel, Daniel
Puri, Varun
author_sort Yan, Yan
collection PubMed
description OBJECTIVE: Regression models are ubiquitous in thoracic surgical research. We aimed to compare the value of standard logistic regression with the more complex but increasingly used penalized regression models using a recently published risk model as an example. METHODS: Using a standardized data set of clinical T1-3N0 esophageal cancer patients, we created models to predict the likelihood of unexpected pathologic nodal disease after surgical resection. Models were fitted using standard logistic regression or penalized regression (ridge, lasso, elastic net, and adaptive lasso). We compared the model performance (Brier score, calibration slope, C statistic, and overfitting) of standard regression with penalized regression models. RESULTS: Among 3206 patients with clinical T1-3N0 esophageal cancer, 668 (22%) had unexpected pathologic nodal disease. Of the 15 candidate variables considered in the models, the key predictors of nodal disease included clinical tumor stage, tumor size, grade, and presence of lymphovascular invasion. The standard regression model and all 4 penalized logistic regression models had virtually identical performance with Brier score ranging from 0.138 to 0.141, concordance index ranging from 0.775 to 0.788, and calibration slope from 0.965 to 1.05. CONCLUSIONS: For predictive modeling in surgical outcomes research, when the data set is large and the outcome of interest is relatively frequent, standard regression models and the more complicated penalized models are very likely to have similar predictive performance. The choice of statistical methods for risk model development should be on the basis of the nature of the data at hand and good statistical practice, rather than the novelty or complexity of statistical models.
format Online
Article
Text
id pubmed-9390725
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-93907252022-08-23 Comparison of standard and penalized logistic regression in risk model development Yan, Yan Yang, Zhizhou Semenkovich, Tara R. Kozower, Benjamin D. Meyers, Bryan F. Nava, Ruben G. Kreisel, Daniel Puri, Varun JTCVS Open Thoracic: Esophageal Cancer OBJECTIVE: Regression models are ubiquitous in thoracic surgical research. We aimed to compare the value of standard logistic regression with the more complex but increasingly used penalized regression models using a recently published risk model as an example. METHODS: Using a standardized data set of clinical T1-3N0 esophageal cancer patients, we created models to predict the likelihood of unexpected pathologic nodal disease after surgical resection. Models were fitted using standard logistic regression or penalized regression (ridge, lasso, elastic net, and adaptive lasso). We compared the model performance (Brier score, calibration slope, C statistic, and overfitting) of standard regression with penalized regression models. RESULTS: Among 3206 patients with clinical T1-3N0 esophageal cancer, 668 (22%) had unexpected pathologic nodal disease. Of the 15 candidate variables considered in the models, the key predictors of nodal disease included clinical tumor stage, tumor size, grade, and presence of lymphovascular invasion. The standard regression model and all 4 penalized logistic regression models had virtually identical performance with Brier score ranging from 0.138 to 0.141, concordance index ranging from 0.775 to 0.788, and calibration slope from 0.965 to 1.05. CONCLUSIONS: For predictive modeling in surgical outcomes research, when the data set is large and the outcome of interest is relatively frequent, standard regression models and the more complicated penalized models are very likely to have similar predictive performance. The choice of statistical methods for risk model development should be on the basis of the nature of the data at hand and good statistical practice, rather than the novelty or complexity of statistical models. Elsevier 2022-01-22 /pmc/articles/PMC9390725/ /pubmed/36003440 http://dx.doi.org/10.1016/j.xjon.2022.01.016 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Thoracic: Esophageal Cancer
Yan, Yan
Yang, Zhizhou
Semenkovich, Tara R.
Kozower, Benjamin D.
Meyers, Bryan F.
Nava, Ruben G.
Kreisel, Daniel
Puri, Varun
Comparison of standard and penalized logistic regression in risk model development
title Comparison of standard and penalized logistic regression in risk model development
title_full Comparison of standard and penalized logistic regression in risk model development
title_fullStr Comparison of standard and penalized logistic regression in risk model development
title_full_unstemmed Comparison of standard and penalized logistic regression in risk model development
title_short Comparison of standard and penalized logistic regression in risk model development
title_sort comparison of standard and penalized logistic regression in risk model development
topic Thoracic: Esophageal Cancer
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9390725/
https://www.ncbi.nlm.nih.gov/pubmed/36003440
http://dx.doi.org/10.1016/j.xjon.2022.01.016
work_keys_str_mv AT yanyan comparisonofstandardandpenalizedlogisticregressioninriskmodeldevelopment
AT yangzhizhou comparisonofstandardandpenalizedlogisticregressioninriskmodeldevelopment
AT semenkovichtarar comparisonofstandardandpenalizedlogisticregressioninriskmodeldevelopment
AT kozowerbenjamind comparisonofstandardandpenalizedlogisticregressioninriskmodeldevelopment
AT meyersbryanf comparisonofstandardandpenalizedlogisticregressioninriskmodeldevelopment
AT navarubeng comparisonofstandardandpenalizedlogisticregressioninriskmodeldevelopment
AT kreiseldaniel comparisonofstandardandpenalizedlogisticregressioninriskmodeldevelopment
AT purivarun comparisonofstandardandpenalizedlogisticregressioninriskmodeldevelopment