Cargando…

Correction of the significance level when attempting multiple transformations of an explanatory variable in generalized linear models

BACKGROUND: In statistical modeling, finding the most favorable coding for an exploratory quantitative variable involves many tests. This process involves multiple testing problems and requires the correction of the significance level. METHODS: For each coding, a test on the nullity of the coefficie...

Descripción completa

Detalles Bibliográficos
Autores principales: Liquet, Benoit, Riou, Jérémie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3699399/
https://www.ncbi.nlm.nih.gov/pubmed/23758852
http://dx.doi.org/10.1186/1471-2288-13-75
_version_ 1782275378627215360
author Liquet, Benoit
Riou, Jérémie
author_facet Liquet, Benoit
Riou, Jérémie
author_sort Liquet, Benoit
collection PubMed
description BACKGROUND: In statistical modeling, finding the most favorable coding for an exploratory quantitative variable involves many tests. This process involves multiple testing problems and requires the correction of the significance level. METHODS: For each coding, a test on the nullity of the coefficient associated with the new coded variable is computed. The selected coding corresponds to that associated with the largest statistical test (or equivalently the smallest p(value)). In the context of the Generalized Linear Model, Liquet and Commenges (Stat Probability Lett,71:33–38,2005) proposed an asymptotic correction of the significance level. This procedure, based on the score test, has been developed for dichotomous and Box-Cox transformations. In this paper, we suggest the use of resampling methods to estimate the significance level for categorical transformations with more than two levels and, by definition those that involve more than one parameter in the model. The categorical transformation is a more flexible way to explore the unknown shape of the effect between an explanatory and a dependent variable. RESULTS: The simulations we ran in this study showed good performances of the proposed methods. These methods were illustrated using the data from a study of the relationship between cholesterol and dementia. CONCLUSION: The algorithms were implemented using R, and the associated CPMCGLM R package is available on the CRAN.
format Online
Article
Text
id pubmed-3699399
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36993992013-07-03 Correction of the significance level when attempting multiple transformations of an explanatory variable in generalized linear models Liquet, Benoit Riou, Jérémie BMC Med Res Methodol Research Article BACKGROUND: In statistical modeling, finding the most favorable coding for an exploratory quantitative variable involves many tests. This process involves multiple testing problems and requires the correction of the significance level. METHODS: For each coding, a test on the nullity of the coefficient associated with the new coded variable is computed. The selected coding corresponds to that associated with the largest statistical test (or equivalently the smallest p(value)). In the context of the Generalized Linear Model, Liquet and Commenges (Stat Probability Lett,71:33–38,2005) proposed an asymptotic correction of the significance level. This procedure, based on the score test, has been developed for dichotomous and Box-Cox transformations. In this paper, we suggest the use of resampling methods to estimate the significance level for categorical transformations with more than two levels and, by definition those that involve more than one parameter in the model. The categorical transformation is a more flexible way to explore the unknown shape of the effect between an explanatory and a dependent variable. RESULTS: The simulations we ran in this study showed good performances of the proposed methods. These methods were illustrated using the data from a study of the relationship between cholesterol and dementia. CONCLUSION: The algorithms were implemented using R, and the associated CPMCGLM R package is available on the CRAN. BioMed Central 2013-06-08 /pmc/articles/PMC3699399/ /pubmed/23758852 http://dx.doi.org/10.1186/1471-2288-13-75 Text en Copyright © 2013 Liquet and Riou; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Liquet, Benoit
Riou, Jérémie
Correction of the significance level when attempting multiple transformations of an explanatory variable in generalized linear models
title Correction of the significance level when attempting multiple transformations of an explanatory variable in generalized linear models
title_full Correction of the significance level when attempting multiple transformations of an explanatory variable in generalized linear models
title_fullStr Correction of the significance level when attempting multiple transformations of an explanatory variable in generalized linear models
title_full_unstemmed Correction of the significance level when attempting multiple transformations of an explanatory variable in generalized linear models
title_short Correction of the significance level when attempting multiple transformations of an explanatory variable in generalized linear models
title_sort correction of the significance level when attempting multiple transformations of an explanatory variable in generalized linear models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3699399/
https://www.ncbi.nlm.nih.gov/pubmed/23758852
http://dx.doi.org/10.1186/1471-2288-13-75
work_keys_str_mv AT liquetbenoit correctionofthesignificancelevelwhenattemptingmultipletransformationsofanexplanatoryvariableingeneralizedlinearmodels
AT rioujeremie correctionofthesignificancelevelwhenattemptingmultipletransformationsofanexplanatoryvariableingeneralizedlinearmodels