Cargando…

An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions

BACKGROUND: Detection of gene-gene interaction (GGI) is a key challenge towards solving the problem of missing heritability in genetics. The multifactor dimensionality reduction (MDR) method has been widely studied for detecting GGIs. MDR reduces the dimensionality of multi-factor by means of binary...

Descripción completa

Detalles Bibliográficos
Autores principales: Leem, Sangseob, Park, Taesung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374597/
https://www.ncbi.nlm.nih.gov/pubmed/28361694
http://dx.doi.org/10.1186/s12864-017-3496-x
_version_ 1782518921755099136
author Leem, Sangseob
Park, Taesung
author_facet Leem, Sangseob
Park, Taesung
author_sort Leem, Sangseob
collection PubMed
description BACKGROUND: Detection of gene-gene interaction (GGI) is a key challenge towards solving the problem of missing heritability in genetics. The multifactor dimensionality reduction (MDR) method has been widely studied for detecting GGIs. MDR reduces the dimensionality of multi-factor by means of binary classification into high-risk (H) or low-risk (L) groups. Unfortunately, this simple binary classification does not reflect the uncertainty of H/L classification. Thus, we proposed Fuzzy MDR to overcome limitations of binary classification by introducing the degree of membership of two fuzzy sets H/L. While Fuzzy MDR demonstrated higher power than that of MDR, its performance is highly dependent on the several tuning parameters. In real applications, it is not easy to choose appropriate tuning parameter values. RESULT: In this work, we propose an empirical fuzzy MDR (EF-MDR) which does not require specifying tuning parameters values. Here, we propose an empirical approach to estimating the membership degree that can be directly estimated from the data. In EF-MDR, the membership degree is estimated by the maximum likelihood estimator of the proportion of cases(controls) in each genotype combination. We also show that the balanced accuracy measure derived from this new membership function is a linear function of the standard chi-square statistics. This relationship allows us to perform the standard significance test using p-values in the MDR framework without permutation. Through two simulation studies, the power of the proposed EF-MDR is shown to be higher than those of MDR and Fuzzy MDR. We illustrate the proposed EF-MDR by analyzing Crohn’s disease (CD) and bipolar disorder (BD) in the Wellcome Trust Case Control Consortium (WTCCC) dataset. CONCLUSION: We propose an empirical Fuzzy MDR for detecting GGI using the maximum likelihood of the proportion of cases(controls) as the membership degree of the genotype combination. The program written in R for EF-MDR is available at http://statgen.snu.ac.kr/software/EF-MDR.
format Online
Article
Text
id pubmed-5374597
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53745972017-03-31 An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions Leem, Sangseob Park, Taesung BMC Genomics Research BACKGROUND: Detection of gene-gene interaction (GGI) is a key challenge towards solving the problem of missing heritability in genetics. The multifactor dimensionality reduction (MDR) method has been widely studied for detecting GGIs. MDR reduces the dimensionality of multi-factor by means of binary classification into high-risk (H) or low-risk (L) groups. Unfortunately, this simple binary classification does not reflect the uncertainty of H/L classification. Thus, we proposed Fuzzy MDR to overcome limitations of binary classification by introducing the degree of membership of two fuzzy sets H/L. While Fuzzy MDR demonstrated higher power than that of MDR, its performance is highly dependent on the several tuning parameters. In real applications, it is not easy to choose appropriate tuning parameter values. RESULT: In this work, we propose an empirical fuzzy MDR (EF-MDR) which does not require specifying tuning parameters values. Here, we propose an empirical approach to estimating the membership degree that can be directly estimated from the data. In EF-MDR, the membership degree is estimated by the maximum likelihood estimator of the proportion of cases(controls) in each genotype combination. We also show that the balanced accuracy measure derived from this new membership function is a linear function of the standard chi-square statistics. This relationship allows us to perform the standard significance test using p-values in the MDR framework without permutation. Through two simulation studies, the power of the proposed EF-MDR is shown to be higher than those of MDR and Fuzzy MDR. We illustrate the proposed EF-MDR by analyzing Crohn’s disease (CD) and bipolar disorder (BD) in the Wellcome Trust Case Control Consortium (WTCCC) dataset. CONCLUSION: We propose an empirical Fuzzy MDR for detecting GGI using the maximum likelihood of the proportion of cases(controls) as the membership degree of the genotype combination. The program written in R for EF-MDR is available at http://statgen.snu.ac.kr/software/EF-MDR. BioMed Central 2017-03-14 /pmc/articles/PMC5374597/ /pubmed/28361694 http://dx.doi.org/10.1186/s12864-017-3496-x Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Leem, Sangseob
Park, Taesung
An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions
title An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions
title_full An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions
title_fullStr An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions
title_full_unstemmed An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions
title_short An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions
title_sort empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374597/
https://www.ncbi.nlm.nih.gov/pubmed/28361694
http://dx.doi.org/10.1186/s12864-017-3496-x
work_keys_str_mv AT leemsangseob anempiricalfuzzymultifactordimensionalityreductionmethodfordetectinggenegeneinteractions
AT parktaesung anempiricalfuzzymultifactordimensionalityreductionmethodfordetectinggenegeneinteractions
AT leemsangseob empiricalfuzzymultifactordimensionalityreductionmethodfordetectinggenegeneinteractions
AT parktaesung empiricalfuzzymultifactordimensionalityreductionmethodfordetectinggenegeneinteractions