Cargando…
A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection
BACKGROUND: Applying a statistical method implies identifying underlying (model) assumptions and checking their validity in the particular context. One of these contexts is association modeling for epistasis detection. Here, depending on the technique used, violation of model assumptions may result...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668290/ https://www.ncbi.nlm.nih.gov/pubmed/23618370 http://dx.doi.org/10.1186/1756-0381-6-9 |
_version_ | 1782271610433044480 |
---|---|
author | Mahachie John, Jestinah M Van Lishout, François Gusareva, Elena S Van Steen, Kristel |
author_facet | Mahachie John, Jestinah M Van Lishout, François Gusareva, Elena S Van Steen, Kristel |
author_sort | Mahachie John, Jestinah M |
collection | PubMed |
description | BACKGROUND: Applying a statistical method implies identifying underlying (model) assumptions and checking their validity in the particular context. One of these contexts is association modeling for epistasis detection. Here, depending on the technique used, violation of model assumptions may result in increased type I error, power loss, or biased parameter estimates. Remedial measures for violated underlying conditions or assumptions include data transformation or selecting a more relaxed modeling or testing strategy. Model-Based Multifactor Dimensionality Reduction (MB-MDR) for epistasis detection relies on association testing between a trait and a factor consisting of multilocus genotype information. For quantitative traits, the framework is essentially Analysis of Variance (ANOVA) that decomposes the variability in the trait amongst the different factors. In this study, we assess through simulations, the cumulative effect of deviations from normality and homoscedasticity on the overall performance of quantitative Model-Based Multifactor Dimensionality Reduction (MB-MDR) to detect 2-locus epistasis signals in the absence of main effects. METHODOLOGY: Our simulation study focuses on pure epistasis models with varying degrees of genetic influence on a quantitative trait. Conditional on a multilocus genotype, we consider quantitative trait distributions that are normal, chi-square or Student’s t with constant or non-constant phenotypic variances. All data are analyzed with MB-MDR using the built-in Student’s t-test for association, as well as a novel MB-MDR implementation based on Welch’s t-test. Traits are either left untransformed or are transformed into new traits via logarithmic, standardization or rank-based transformations, prior to MB-MDR modeling. RESULTS: Our simulation results show that MB-MDR controls type I error and false positive rates irrespective of the association test considered. Empirically-based MB-MDR power estimates for MB-MDR with Welch’s t-tests are generally lower than those for MB-MDR with Student’s t-tests. Trait transformations involving ranks tend to lead to increased power compared to the other considered data transformations. CONCLUSIONS: When performing MB-MDR screening for gene-gene interactions with quantitative traits, we recommend to first rank-transform traits to normality and then to apply MB-MDR modeling with Student’s t-tests as internal tests for association. |
format | Online Article Text |
id | pubmed-3668290 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-36682902013-06-03 A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection Mahachie John, Jestinah M Van Lishout, François Gusareva, Elena S Van Steen, Kristel BioData Min Methodology BACKGROUND: Applying a statistical method implies identifying underlying (model) assumptions and checking their validity in the particular context. One of these contexts is association modeling for epistasis detection. Here, depending on the technique used, violation of model assumptions may result in increased type I error, power loss, or biased parameter estimates. Remedial measures for violated underlying conditions or assumptions include data transformation or selecting a more relaxed modeling or testing strategy. Model-Based Multifactor Dimensionality Reduction (MB-MDR) for epistasis detection relies on association testing between a trait and a factor consisting of multilocus genotype information. For quantitative traits, the framework is essentially Analysis of Variance (ANOVA) that decomposes the variability in the trait amongst the different factors. In this study, we assess through simulations, the cumulative effect of deviations from normality and homoscedasticity on the overall performance of quantitative Model-Based Multifactor Dimensionality Reduction (MB-MDR) to detect 2-locus epistasis signals in the absence of main effects. METHODOLOGY: Our simulation study focuses on pure epistasis models with varying degrees of genetic influence on a quantitative trait. Conditional on a multilocus genotype, we consider quantitative trait distributions that are normal, chi-square or Student’s t with constant or non-constant phenotypic variances. All data are analyzed with MB-MDR using the built-in Student’s t-test for association, as well as a novel MB-MDR implementation based on Welch’s t-test. Traits are either left untransformed or are transformed into new traits via logarithmic, standardization or rank-based transformations, prior to MB-MDR modeling. RESULTS: Our simulation results show that MB-MDR controls type I error and false positive rates irrespective of the association test considered. Empirically-based MB-MDR power estimates for MB-MDR with Welch’s t-tests are generally lower than those for MB-MDR with Student’s t-tests. Trait transformations involving ranks tend to lead to increased power compared to the other considered data transformations. CONCLUSIONS: When performing MB-MDR screening for gene-gene interactions with quantitative traits, we recommend to first rank-transform traits to normality and then to apply MB-MDR modeling with Student’s t-tests as internal tests for association. BioMed Central 2013-04-25 /pmc/articles/PMC3668290/ /pubmed/23618370 http://dx.doi.org/10.1186/1756-0381-6-9 Text en Copyright © 2013 Mahachie John et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Mahachie John, Jestinah M Van Lishout, François Gusareva, Elena S Van Steen, Kristel A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection |
title | A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection |
title_full | A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection |
title_fullStr | A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection |
title_full_unstemmed | A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection |
title_short | A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection |
title_sort | robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3668290/ https://www.ncbi.nlm.nih.gov/pubmed/23618370 http://dx.doi.org/10.1186/1756-0381-6-9 |
work_keys_str_mv | AT mahachiejohnjestinahm arobustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection AT vanlishoutfrancois arobustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection AT gusarevaelenas arobustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection AT vansteenkristel arobustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection AT mahachiejohnjestinahm robustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection AT vanlishoutfrancois robustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection AT gusarevaelenas robustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection AT vansteenkristel robustnessstudyofparametricandnonparametrictestsinmodelbasedmultifactordimensionalityreductionforepistasisdetection |