Cargando…

A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection

BACKGROUND: Applying a statistical method implies identifying underlying (model) assumptions and checking their validity in the particular context. One of these contexts is association modeling for epistasis detection. Here, depending on the technique used, violation of model assumptions may result...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahachie John, Jestinah M, Van Lishout, François, Gusareva, Elena S, Van Steen, Kristel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668290/
https://www.ncbi.nlm.nih.gov/pubmed/23618370
http://dx.doi.org/10.1186/1756-0381-6-9
_version_ 1782271610433044480
author Mahachie John, Jestinah M
Van Lishout, François
Gusareva, Elena S
Van Steen, Kristel
author_facet Mahachie John, Jestinah M
Van Lishout, François
Gusareva, Elena S
Van Steen, Kristel
author_sort Mahachie John, Jestinah M
collection PubMed
description BACKGROUND: Applying a statistical method implies identifying underlying (model) assumptions and checking their validity in the particular context. One of these contexts is association modeling for epistasis detection. Here, depending on the technique used, violation of model assumptions may result in increased type I error, power loss, or biased parameter estimates. Remedial measures for violated underlying conditions or assumptions include data transformation or selecting a more relaxed modeling or testing strategy. Model-Based Multifactor Dimensionality Reduction (MB-MDR) for epistasis detection relies on association testing between a trait and a factor consisting of multilocus genotype information. For quantitative traits, the framework is essentially Analysis of Variance (ANOVA) that decomposes the variability in the trait amongst the different factors. In this study, we assess through simulations, the cumulative effect of deviations from normality and homoscedasticity on the overall performance of quantitative Model-Based Multifactor Dimensionality Reduction (MB-MDR) to detect 2-locus epistasis signals in the absence of main effects. METHODOLOGY: Our simulation study focuses on pure epistasis models with varying degrees of genetic influence on a quantitative trait. Conditional on a multilocus genotype, we consider quantitative trait distributions that are normal, chi-square or Student’s t with constant or non-constant phenotypic variances. All data are analyzed with MB-MDR using the built-in Student’s t-test for association, as well as a novel MB-MDR implementation based on Welch’s t-test. Traits are either left untransformed or are transformed into new traits via logarithmic, standardization or rank-based transformations, prior to MB-MDR modeling. RESULTS: Our simulation results show that MB-MDR controls type I error and false positive rates irrespective of the association test considered. Empirically-based MB-MDR power estimates for MB-MDR with Welch’s t-tests are generally lower than those for MB-MDR with Student’s t-tests. Trait transformations involving ranks tend to lead to increased power compared to the other considered data transformations. CONCLUSIONS: When performing MB-MDR screening for gene-gene interactions with quantitative traits, we recommend to first rank-transform traits to normality and then to apply MB-MDR modeling with Student’s t-tests as internal tests for association.
format Online
Article
Text
id pubmed-3668290
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36682902013-06-03 A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection Mahachie John, Jestinah M Van Lishout, François Gusareva, Elena S Van Steen, Kristel BioData Min Methodology BACKGROUND: Applying a statistical method implies identifying underlying (model) assumptions and checking their validity in the particular context. One of these contexts is association modeling for epistasis detection. Here, depending on the technique used, violation of model assumptions may result in increased type I error, power loss, or biased parameter estimates. Remedial measures for violated underlying conditions or assumptions include data transformation or selecting a more relaxed modeling or testing strategy. Model-Based Multifactor Dimensionality Reduction (MB-MDR) for epistasis detection relies on association testing between a trait and a factor consisting of multilocus genotype information. For quantitative traits, the framework is essentially Analysis of Variance (ANOVA) that decomposes the variability in the trait amongst the different factors. In this study, we assess through simulations, the cumulative effect of deviations from normality and homoscedasticity on the overall performance of quantitative Model-Based Multifactor Dimensionality Reduction (MB-MDR) to detect 2-locus epistasis signals in the absence of main effects. METHODOLOGY: Our simulation study focuses on pure epistasis models with varying degrees of genetic influence on a quantitative trait. Conditional on a multilocus genotype, we consider quantitative trait distributions that are normal, chi-square or Student’s t with constant or non-constant phenotypic variances. All data are analyzed with MB-MDR using the built-in Student’s t-test for association, as well as a novel MB-MDR implementation based on Welch’s t-test. Traits are either left untransformed or are transformed into new traits via logarithmic, standardization or rank-based transformations, prior to MB-MDR modeling. RESULTS: Our simulation results show that MB-MDR controls type I error and false positive rates irrespective of the association test considered. Empirically-based MB-MDR power estimates for MB-MDR with Welch’s t-tests are generally lower than those for MB-MDR with Student’s t-tests. Trait transformations involving ranks tend to lead to increased power compared to the other considered data transformations. CONCLUSIONS: When performing MB-MDR screening for gene-gene interactions with quantitative traits, we recommend to first rank-transform traits to normality and then to apply MB-MDR modeling with Student’s t-tests as internal tests for association. BioMed Central 2013-04-25 /pmc/articles/PMC3668290/ /pubmed/23618370 http://dx.doi.org/10.1186/1756-0381-6-9 Text en Copyright © 2013 Mahachie John et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Mahachie John, Jestinah M
Van Lishout, François
Gusareva, Elena S
Van Steen, Kristel
A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection
title A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection
title_full A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection
title_fullStr A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection
title_full_unstemmed A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection
title_short A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection
title_sort robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668290/
https://www.ncbi.nlm.nih.gov/pubmed/23618370
http://dx.doi.org/10.1186/1756-0381-6-9
work_keys_str_mv AT mahachiejohnjestinahm arobustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection
AT vanlishoutfrancois arobustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection
AT gusarevaelenas arobustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection
AT vansteenkristel arobustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection
AT mahachiejohnjestinahm robustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection
AT vanlishoutfrancois robustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection
AT gusarevaelenas robustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection
AT vansteenkristel robustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection