Cargando…
An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions
BACKGROUND: Detection of gene-gene interaction (GGI) is a key challenge towards solving the problem of missing heritability in genetics. The multifactor dimensionality reduction (MDR) method has been widely studied for detecting GGIs. MDR reduces the dimensionality of multi-factor by means of binary...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374597/ https://www.ncbi.nlm.nih.gov/pubmed/28361694 http://dx.doi.org/10.1186/s12864-017-3496-x |
_version_ | 1782518921755099136 |
---|---|
author | Leem, Sangseob Park, Taesung |
author_facet | Leem, Sangseob Park, Taesung |
author_sort | Leem, Sangseob |
collection | PubMed |
description | BACKGROUND: Detection of gene-gene interaction (GGI) is a key challenge towards solving the problem of missing heritability in genetics. The multifactor dimensionality reduction (MDR) method has been widely studied for detecting GGIs. MDR reduces the dimensionality of multi-factor by means of binary classification into high-risk (H) or low-risk (L) groups. Unfortunately, this simple binary classification does not reflect the uncertainty of H/L classification. Thus, we proposed Fuzzy MDR to overcome limitations of binary classification by introducing the degree of membership of two fuzzy sets H/L. While Fuzzy MDR demonstrated higher power than that of MDR, its performance is highly dependent on the several tuning parameters. In real applications, it is not easy to choose appropriate tuning parameter values. RESULT: In this work, we propose an empirical fuzzy MDR (EF-MDR) which does not require specifying tuning parameters values. Here, we propose an empirical approach to estimating the membership degree that can be directly estimated from the data. In EF-MDR, the membership degree is estimated by the maximum likelihood estimator of the proportion of cases(controls) in each genotype combination. We also show that the balanced accuracy measure derived from this new membership function is a linear function of the standard chi-square statistics. This relationship allows us to perform the standard significance test using p-values in the MDR framework without permutation. Through two simulation studies, the power of the proposed EF-MDR is shown to be higher than those of MDR and Fuzzy MDR. We illustrate the proposed EF-MDR by analyzing Crohn’s disease (CD) and bipolar disorder (BD) in the Wellcome Trust Case Control Consortium (WTCCC) dataset. CONCLUSION: We propose an empirical Fuzzy MDR for detecting GGI using the maximum likelihood of the proportion of cases(controls) as the membership degree of the genotype combination. The program written in R for EF-MDR is available at http://statgen.snu.ac.kr/software/EF-MDR. |
format | Online Article Text |
id | pubmed-5374597 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-53745972017-03-31 An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions Leem, Sangseob Park, Taesung BMC Genomics Research BACKGROUND: Detection of gene-gene interaction (GGI) is a key challenge towards solving the problem of missing heritability in genetics. The multifactor dimensionality reduction (MDR) method has been widely studied for detecting GGIs. MDR reduces the dimensionality of multi-factor by means of binary classification into high-risk (H) or low-risk (L) groups. Unfortunately, this simple binary classification does not reflect the uncertainty of H/L classification. Thus, we proposed Fuzzy MDR to overcome limitations of binary classification by introducing the degree of membership of two fuzzy sets H/L. While Fuzzy MDR demonstrated higher power than that of MDR, its performance is highly dependent on the several tuning parameters. In real applications, it is not easy to choose appropriate tuning parameter values. RESULT: In this work, we propose an empirical fuzzy MDR (EF-MDR) which does not require specifying tuning parameters values. Here, we propose an empirical approach to estimating the membership degree that can be directly estimated from the data. In EF-MDR, the membership degree is estimated by the maximum likelihood estimator of the proportion of cases(controls) in each genotype combination. We also show that the balanced accuracy measure derived from this new membership function is a linear function of the standard chi-square statistics. This relationship allows us to perform the standard significance test using p-values in the MDR framework without permutation. Through two simulation studies, the power of the proposed EF-MDR is shown to be higher than those of MDR and Fuzzy MDR. We illustrate the proposed EF-MDR by analyzing Crohn’s disease (CD) and bipolar disorder (BD) in the Wellcome Trust Case Control Consortium (WTCCC) dataset. CONCLUSION: We propose an empirical Fuzzy MDR for detecting GGI using the maximum likelihood of the proportion of cases(controls) as the membership degree of the genotype combination. The program written in R for EF-MDR is available at http://statgen.snu.ac.kr/software/EF-MDR. BioMed Central 2017-03-14 /pmc/articles/PMC5374597/ /pubmed/28361694 http://dx.doi.org/10.1186/s12864-017-3496-x Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Leem, Sangseob Park, Taesung An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions |
title | An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions |
title_full | An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions |
title_fullStr | An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions |
title_full_unstemmed | An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions |
title_short | An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions |
title_sort | empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374597/ https://www.ncbi.nlm.nih.gov/pubmed/28361694 http://dx.doi.org/10.1186/s12864-017-3496-x |
work_keys_str_mv | AT leemsangseob anempiricalfuzzymultifactordimensionalityreductionmethodfordetectinggenegeneinteractions AT parktaesung anempiricalfuzzymultifactordimensionalityreductionmethodfordetectinggenegeneinteractions AT leemsangseob empiricalfuzzymultifactordimensionalityreductionmethodfordetectinggenegeneinteractions AT parktaesung empiricalfuzzymultifactordimensionalityreductionmethodfordetectinggenegeneinteractions |