Cargando…

Are your covariates under control? How normalization can re-introduce covariate effects

Many statistical tests rely on the assumption that the residuals of a model are normally distributed. Rank-based inverse normal transformation (INT) of the dependent variable is one of the most popular approaches to satisfy the normality assumption. When covariates are included in the analysis, a co...

Descripción completa

Detalles Bibliográficos
Autores principales: Pain, Oliver, Dudbridge, Frank, Ronald, Angelica
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6057994/
https://www.ncbi.nlm.nih.gov/pubmed/29706643
http://dx.doi.org/10.1038/s41431-018-0159-6
_version_ 1783341612760301568
author Pain, Oliver
Dudbridge, Frank
Ronald, Angelica
author_facet Pain, Oliver
Dudbridge, Frank
Ronald, Angelica
author_sort Pain, Oliver
collection PubMed
description Many statistical tests rely on the assumption that the residuals of a model are normally distributed. Rank-based inverse normal transformation (INT) of the dependent variable is one of the most popular approaches to satisfy the normality assumption. When covariates are included in the analysis, a common approach is to first adjust for the covariates and then normalize the residuals. This study investigated the effect of regressing covariates against the dependent variable and then applying rank-based INT to the residuals. The correlation between the dependent variable and covariates at each stage of processing was assessed. An alternative approach was tested in which rank-based INT was applied to the dependent variable before regressing covariates. Analyses based on both simulated and real data examples demonstrated that applying rank-based INT to the dependent variable residuals after regressing out covariates re-introduces a linear correlation between the dependent variable and covariates, increasing type-I errors and reducing power. On the other hand, when rank-based INT was applied prior to controlling for covariate effects, residuals were normally distributed and linearly uncorrelated with covariates. This latter approach is therefore recommended in situations were normality of the dependent variable is required.
format Online
Article
Text
id pubmed-6057994
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-60579942018-07-27 Are your covariates under control? How normalization can re-introduce covariate effects Pain, Oliver Dudbridge, Frank Ronald, Angelica Eur J Hum Genet Article Many statistical tests rely on the assumption that the residuals of a model are normally distributed. Rank-based inverse normal transformation (INT) of the dependent variable is one of the most popular approaches to satisfy the normality assumption. When covariates are included in the analysis, a common approach is to first adjust for the covariates and then normalize the residuals. This study investigated the effect of regressing covariates against the dependent variable and then applying rank-based INT to the residuals. The correlation between the dependent variable and covariates at each stage of processing was assessed. An alternative approach was tested in which rank-based INT was applied to the dependent variable before regressing covariates. Analyses based on both simulated and real data examples demonstrated that applying rank-based INT to the dependent variable residuals after regressing out covariates re-introduces a linear correlation between the dependent variable and covariates, increasing type-I errors and reducing power. On the other hand, when rank-based INT was applied prior to controlling for covariate effects, residuals were normally distributed and linearly uncorrelated with covariates. This latter approach is therefore recommended in situations were normality of the dependent variable is required. Springer International Publishing 2018-04-30 2018-08 /pmc/articles/PMC6057994/ /pubmed/29706643 http://dx.doi.org/10.1038/s41431-018-0159-6 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Pain, Oliver
Dudbridge, Frank
Ronald, Angelica
Are your covariates under control? How normalization can re-introduce covariate effects
title Are your covariates under control? How normalization can re-introduce covariate effects
title_full Are your covariates under control? How normalization can re-introduce covariate effects
title_fullStr Are your covariates under control? How normalization can re-introduce covariate effects
title_full_unstemmed Are your covariates under control? How normalization can re-introduce covariate effects
title_short Are your covariates under control? How normalization can re-introduce covariate effects
title_sort are your covariates under control? how normalization can re-introduce covariate effects
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6057994/
https://www.ncbi.nlm.nih.gov/pubmed/29706643
http://dx.doi.org/10.1038/s41431-018-0159-6
work_keys_str_mv AT painoliver areyourcovariatesundercontrolhownormalizationcanreintroducecovariateeffects
AT dudbridgefrank areyourcovariatesundercontrolhownormalizationcanreintroducecovariateeffects
AT ronaldangelica areyourcovariatesundercontrolhownormalizationcanreintroducecovariateeffects