Cargando…

Grammatical evolution decision trees for detecting gene-gene interactions

BACKGROUND: A fundamental goal of human genetics is the discovery of polymorphisms that predict common, complex diseases. It is hypothesized that complex diseases are due to a myriad of factors including environmental exposures and complex genetic risk models, including gene-gene interactions. Such...

Descripción completa

Detalles Bibliográficos
Autores principales: Motsinger-Reif, Alison A, Deodhar, Sushamna, Winham, Stacey J, Hardison, Nicholas E
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3000379/
https://www.ncbi.nlm.nih.gov/pubmed/21087514
http://dx.doi.org/10.1186/1756-0381-3-8
_version_ 1782193535154388992
author Motsinger-Reif, Alison A
Deodhar, Sushamna
Winham, Stacey J
Hardison, Nicholas E
author_facet Motsinger-Reif, Alison A
Deodhar, Sushamna
Winham, Stacey J
Hardison, Nicholas E
author_sort Motsinger-Reif, Alison A
collection PubMed
description BACKGROUND: A fundamental goal of human genetics is the discovery of polymorphisms that predict common, complex diseases. It is hypothesized that complex diseases are due to a myriad of factors including environmental exposures and complex genetic risk models, including gene-gene interactions. Such epistatic models present an important analytical challenge, requiring that methods perform not only statistical modeling, but also variable selection to generate testable genetic model hypotheses. This challenge is amplified by recent advances in genotyping technology, as the number of potential predictor variables is rapidly increasing. METHODS: Decision trees are a highly successful, easily interpretable data-mining method that are typically optimized with a hierarchical model building approach, which limits their potential to identify interacting effects. To overcome this limitation, we utilize evolutionary computation, specifically grammatical evolution, to build decision trees to detect and model gene-gene interactions. In the current study, we introduce the Grammatical Evolution Decision Trees (GEDT) method and software and evaluate this approach on simulated data representing gene-gene interaction models of a range of effect sizes. We compare the performance of the method to a traditional decision tree algorithm and a random search approach and demonstrate the improved performance of the method to detect purely epistatic interactions. RESULTS: The results of our simulations demonstrate that GEDT has high power to detect even very moderate genetic risk models. GEDT has high power to detect interactions with and without main effects. CONCLUSIONS: GEDT, while still in its initial stages of development, is a promising new approach for identifying gene-gene interactions in genetic association studies.
format Text
id pubmed-3000379
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30003792010-12-15 Grammatical evolution decision trees for detecting gene-gene interactions Motsinger-Reif, Alison A Deodhar, Sushamna Winham, Stacey J Hardison, Nicholas E BioData Min Methodology BACKGROUND: A fundamental goal of human genetics is the discovery of polymorphisms that predict common, complex diseases. It is hypothesized that complex diseases are due to a myriad of factors including environmental exposures and complex genetic risk models, including gene-gene interactions. Such epistatic models present an important analytical challenge, requiring that methods perform not only statistical modeling, but also variable selection to generate testable genetic model hypotheses. This challenge is amplified by recent advances in genotyping technology, as the number of potential predictor variables is rapidly increasing. METHODS: Decision trees are a highly successful, easily interpretable data-mining method that are typically optimized with a hierarchical model building approach, which limits their potential to identify interacting effects. To overcome this limitation, we utilize evolutionary computation, specifically grammatical evolution, to build decision trees to detect and model gene-gene interactions. In the current study, we introduce the Grammatical Evolution Decision Trees (GEDT) method and software and evaluate this approach on simulated data representing gene-gene interaction models of a range of effect sizes. We compare the performance of the method to a traditional decision tree algorithm and a random search approach and demonstrate the improved performance of the method to detect purely epistatic interactions. RESULTS: The results of our simulations demonstrate that GEDT has high power to detect even very moderate genetic risk models. GEDT has high power to detect interactions with and without main effects. CONCLUSIONS: GEDT, while still in its initial stages of development, is a promising new approach for identifying gene-gene interactions in genetic association studies. BioMed Central 2010-11-18 /pmc/articles/PMC3000379/ /pubmed/21087514 http://dx.doi.org/10.1186/1756-0381-3-8 Text en Copyright ©2010 Motsinger-Reif et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Motsinger-Reif, Alison A
Deodhar, Sushamna
Winham, Stacey J
Hardison, Nicholas E
Grammatical evolution decision trees for detecting gene-gene interactions
title Grammatical evolution decision trees for detecting gene-gene interactions
title_full Grammatical evolution decision trees for detecting gene-gene interactions
title_fullStr Grammatical evolution decision trees for detecting gene-gene interactions
title_full_unstemmed Grammatical evolution decision trees for detecting gene-gene interactions
title_short Grammatical evolution decision trees for detecting gene-gene interactions
title_sort grammatical evolution decision trees for detecting gene-gene interactions
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3000379/
https://www.ncbi.nlm.nih.gov/pubmed/21087514
http://dx.doi.org/10.1186/1756-0381-3-8
work_keys_str_mv AT motsingerreifalisona grammaticalevolutiondecisiontreesfordetectinggenegeneinteractions
AT deodharsushamna grammaticalevolutiondecisiontreesfordetectinggenegeneinteractions
AT winhamstaceyj grammaticalevolutiondecisiontreesfordetectinggenegeneinteractions
AT hardisonnicholase grammaticalevolutiondecisiontreesfordetectinggenegeneinteractions