Cargando…

Use of generalised additive models to categorise continuous variables in clinical prediction

BACKGROUND: In medical practice many, essentially continuous, clinical parameters tend to be categorised by physicians for ease of decision-making. Indeed, categorisation is a common practice both in medical research and in the development of clinical prediction rules, particularly where the ensuing...

Descripción completa

Detalles Bibliográficos
Autores principales: Barrio, Irantzu, Arostegui, Inmaculada, Quintana, José M, Group, IRYSS-COPD
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716996/
https://www.ncbi.nlm.nih.gov/pubmed/23802742
http://dx.doi.org/10.1186/1471-2288-13-83
_version_ 1782277636414767104
author Barrio, Irantzu
Arostegui, Inmaculada
Quintana, José M
Group, IRYSS-COPD
author_facet Barrio, Irantzu
Arostegui, Inmaculada
Quintana, José M
Group, IRYSS-COPD
author_sort Barrio, Irantzu
collection PubMed
description BACKGROUND: In medical practice many, essentially continuous, clinical parameters tend to be categorised by physicians for ease of decision-making. Indeed, categorisation is a common practice both in medical research and in the development of clinical prediction rules, particularly where the ensuing models are to be applied in daily clinical practice to support clinicians in the decision-making process. Since the number of categories into which a continuous predictor must be categorised depends partly on the relationship between the predictor and the outcome, the need for more than two categories must be borne in mind. METHODS: We propose a categorisation methodology for clinical-prediction models, using Generalised Additive Models (GAMs) with P-spline smoothers to determine the relationship between the continuous predictor and the outcome. The proposed method consists of creating at least one average-risk category along with high- and low-risk categories based on the GAM smooth function. We applied this methodology to a prospective cohort of patients with exacerbated chronic obstructive pulmonary disease. The predictors selected were respiratory rate and partial pressure of carbon dioxide in the blood (PCO2), and the response variable was poor evolution. An additive logistic regression model was used to show the relationship between the covariates and the dichotomous response variable. The proposed categorisation was compared to the continuous predictor as the best option, using the AIC and AUC evaluation parameters. The sample was divided into a derivation (60%) and validation (40%) samples. The first was used to obtain the cut points while the second was used to validate the proposed methodology. RESULTS: The three-category proposal for the respiratory rate was ≤ 20;(20,24];> 24, for which the following values were obtained: AIC=314.5 and AUC=0.638. The respective values for the continuous predictor were AIC=317.1 and AUC=0.634, with no statistically significant differences being found between the two AUCs (p =0.079). The four-category proposal for PCO2 was ≤ 43;(43,52];(52,65];> 65, for which the following values were obtained: AIC=258.1 and AUC=0.81. No statistically significant differences were found between the AUC of the four-category option and that of the continuous predictor, which yielded an AIC of 250.3 and an AUC of 0.825 (p =0.115). CONCLUSIONS: Our proposed method provides clinicians with the number and location of cut points for categorising variables, and performs as successfully as the original continuous predictor when it comes to developing clinical prediction rules.
format Online
Article
Text
id pubmed-3716996
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37169962013-07-23 Use of generalised additive models to categorise continuous variables in clinical prediction Barrio, Irantzu Arostegui, Inmaculada Quintana, José M Group, IRYSS-COPD BMC Med Res Methodol Research Article BACKGROUND: In medical practice many, essentially continuous, clinical parameters tend to be categorised by physicians for ease of decision-making. Indeed, categorisation is a common practice both in medical research and in the development of clinical prediction rules, particularly where the ensuing models are to be applied in daily clinical practice to support clinicians in the decision-making process. Since the number of categories into which a continuous predictor must be categorised depends partly on the relationship between the predictor and the outcome, the need for more than two categories must be borne in mind. METHODS: We propose a categorisation methodology for clinical-prediction models, using Generalised Additive Models (GAMs) with P-spline smoothers to determine the relationship between the continuous predictor and the outcome. The proposed method consists of creating at least one average-risk category along with high- and low-risk categories based on the GAM smooth function. We applied this methodology to a prospective cohort of patients with exacerbated chronic obstructive pulmonary disease. The predictors selected were respiratory rate and partial pressure of carbon dioxide in the blood (PCO2), and the response variable was poor evolution. An additive logistic regression model was used to show the relationship between the covariates and the dichotomous response variable. The proposed categorisation was compared to the continuous predictor as the best option, using the AIC and AUC evaluation parameters. The sample was divided into a derivation (60%) and validation (40%) samples. The first was used to obtain the cut points while the second was used to validate the proposed methodology. RESULTS: The three-category proposal for the respiratory rate was ≤ 20;(20,24];> 24, for which the following values were obtained: AIC=314.5 and AUC=0.638. The respective values for the continuous predictor were AIC=317.1 and AUC=0.634, with no statistically significant differences being found between the two AUCs (p =0.079). The four-category proposal for PCO2 was ≤ 43;(43,52];(52,65];> 65, for which the following values were obtained: AIC=258.1 and AUC=0.81. No statistically significant differences were found between the AUC of the four-category option and that of the continuous predictor, which yielded an AIC of 250.3 and an AUC of 0.825 (p =0.115). CONCLUSIONS: Our proposed method provides clinicians with the number and location of cut points for categorising variables, and performs as successfully as the original continuous predictor when it comes to developing clinical prediction rules. BioMed Central 2013-06-26 /pmc/articles/PMC3716996/ /pubmed/23802742 http://dx.doi.org/10.1186/1471-2288-13-83 Text en Copyright © 2013 Barrio et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Barrio, Irantzu
Arostegui, Inmaculada
Quintana, José M
Group, IRYSS-COPD
Use of generalised additive models to categorise continuous variables in clinical prediction
title Use of generalised additive models to categorise continuous variables in clinical prediction
title_full Use of generalised additive models to categorise continuous variables in clinical prediction
title_fullStr Use of generalised additive models to categorise continuous variables in clinical prediction
title_full_unstemmed Use of generalised additive models to categorise continuous variables in clinical prediction
title_short Use of generalised additive models to categorise continuous variables in clinical prediction
title_sort use of generalised additive models to categorise continuous variables in clinical prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3716996/
https://www.ncbi.nlm.nih.gov/pubmed/23802742
http://dx.doi.org/10.1186/1471-2288-13-83
work_keys_str_mv AT barrioirantzu useofgeneralisedadditivemodelstocategorisecontinuousvariablesinclinicalprediction
AT arosteguiinmaculada useofgeneralisedadditivemodelstocategorisecontinuousvariablesinclinicalprediction
AT quintanajosem useofgeneralisedadditivemodelstocategorisecontinuousvariablesinclinicalprediction
AT groupirysscopd useofgeneralisedadditivemodelstocategorisecontinuousvariablesinclinicalprediction