Cargando…

Grid-based stochastic search for hierarchical gene-gene interactions in population-based genetic studies of common human diseases

BACKGROUND: Large-scale genetic studies of common human diseases have focused almost exclusively on the independent main effects of single-nucleotide polymorphisms (SNPs) on disease susceptibility. These studies have had some success, but much of the genetic architecture of common disease remains un...

Descripción completa

Detalles Bibliográficos
Autores principales: Moore, Jason H., Andrews, Peter C., Olson, Randal S., Carlson, Sarah E., Larock, Curt R., Bulhoes, Mario J., O’Connor, James P., Greytak, Ellen M., Armentrout, Steven L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5450417/
https://www.ncbi.nlm.nih.gov/pubmed/28572842
http://dx.doi.org/10.1186/s13040-017-0139-3
_version_ 1783239969026867200
author Moore, Jason H.
Andrews, Peter C.
Olson, Randal S.
Carlson, Sarah E.
Larock, Curt R.
Bulhoes, Mario J.
O’Connor, James P.
Greytak, Ellen M.
Armentrout, Steven L.
author_facet Moore, Jason H.
Andrews, Peter C.
Olson, Randal S.
Carlson, Sarah E.
Larock, Curt R.
Bulhoes, Mario J.
O’Connor, James P.
Greytak, Ellen M.
Armentrout, Steven L.
author_sort Moore, Jason H.
collection PubMed
description BACKGROUND: Large-scale genetic studies of common human diseases have focused almost exclusively on the independent main effects of single-nucleotide polymorphisms (SNPs) on disease susceptibility. These studies have had some success, but much of the genetic architecture of common disease remains unexplained. Attention is now turning to detecting SNPs that impact disease susceptibility in the context of other genetic factors and environmental exposures. These context-dependent genetic effects can manifest themselves as non-additive interactions, which are more challenging to model using parametric statistical approaches. The dimensionality that results from a multitude of genotype combinations, which results from considering many SNPs simultaneously, renders these approaches underpowered. We previously developed the multifactor dimensionality reduction (MDR) approach as a nonparametric and genetic model-free machine learning alternative. Approaches such as MDR can improve the power to detect gene-gene interactions but are limited in their ability to exhaustively consider SNP combinations in genome-wide association studies (GWAS), due to the combinatorial explosion of the search space. We introduce here a stochastic search algorithm called Crush for the application of MDR to modeling high-order gene-gene interactions in genome-wide data. The Crush-MDR approach uses expert knowledge to guide probabilistic searches within a framework that capitalizes on the use of biological knowledge to filter gene sets prior to analysis. Here we evaluated the ability of Crush-MDR to detect hierarchical sets of interacting SNPs using a biology-based simulation strategy that assumes non-additive interactions within genes and additivity in genetic effects between sets of genes within a biochemical pathway. RESULTS: We show that Crush-MDR is able to identify genetic effects at the gene or pathway level significantly better than a baseline random search with the same number of model evaluations. We then applied the same methodology to a GWAS for Alzheimer’s disease and showed base level validation that Crush-MDR was able to identify a set of interacting genes with biological ties to Alzheimer’s disease. CONCLUSIONS: We discuss the role of stochastic search and cloud computing for detecting complex genetic effects in genome-wide data.
format Online
Article
Text
id pubmed-5450417
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54504172017-06-01 Grid-based stochastic search for hierarchical gene-gene interactions in population-based genetic studies of common human diseases Moore, Jason H. Andrews, Peter C. Olson, Randal S. Carlson, Sarah E. Larock, Curt R. Bulhoes, Mario J. O’Connor, James P. Greytak, Ellen M. Armentrout, Steven L. BioData Min Methodology BACKGROUND: Large-scale genetic studies of common human diseases have focused almost exclusively on the independent main effects of single-nucleotide polymorphisms (SNPs) on disease susceptibility. These studies have had some success, but much of the genetic architecture of common disease remains unexplained. Attention is now turning to detecting SNPs that impact disease susceptibility in the context of other genetic factors and environmental exposures. These context-dependent genetic effects can manifest themselves as non-additive interactions, which are more challenging to model using parametric statistical approaches. The dimensionality that results from a multitude of genotype combinations, which results from considering many SNPs simultaneously, renders these approaches underpowered. We previously developed the multifactor dimensionality reduction (MDR) approach as a nonparametric and genetic model-free machine learning alternative. Approaches such as MDR can improve the power to detect gene-gene interactions but are limited in their ability to exhaustively consider SNP combinations in genome-wide association studies (GWAS), due to the combinatorial explosion of the search space. We introduce here a stochastic search algorithm called Crush for the application of MDR to modeling high-order gene-gene interactions in genome-wide data. The Crush-MDR approach uses expert knowledge to guide probabilistic searches within a framework that capitalizes on the use of biological knowledge to filter gene sets prior to analysis. Here we evaluated the ability of Crush-MDR to detect hierarchical sets of interacting SNPs using a biology-based simulation strategy that assumes non-additive interactions within genes and additivity in genetic effects between sets of genes within a biochemical pathway. RESULTS: We show that Crush-MDR is able to identify genetic effects at the gene or pathway level significantly better than a baseline random search with the same number of model evaluations. We then applied the same methodology to a GWAS for Alzheimer’s disease and showed base level validation that Crush-MDR was able to identify a set of interacting genes with biological ties to Alzheimer’s disease. CONCLUSIONS: We discuss the role of stochastic search and cloud computing for detecting complex genetic effects in genome-wide data. BioMed Central 2017-05-30 /pmc/articles/PMC5450417/ /pubmed/28572842 http://dx.doi.org/10.1186/s13040-017-0139-3 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Moore, Jason H.
Andrews, Peter C.
Olson, Randal S.
Carlson, Sarah E.
Larock, Curt R.
Bulhoes, Mario J.
O’Connor, James P.
Greytak, Ellen M.
Armentrout, Steven L.
Grid-based stochastic search for hierarchical gene-gene interactions in population-based genetic studies of common human diseases
title Grid-based stochastic search for hierarchical gene-gene interactions in population-based genetic studies of common human diseases
title_full Grid-based stochastic search for hierarchical gene-gene interactions in population-based genetic studies of common human diseases
title_fullStr Grid-based stochastic search for hierarchical gene-gene interactions in population-based genetic studies of common human diseases
title_full_unstemmed Grid-based stochastic search for hierarchical gene-gene interactions in population-based genetic studies of common human diseases
title_short Grid-based stochastic search for hierarchical gene-gene interactions in population-based genetic studies of common human diseases
title_sort grid-based stochastic search for hierarchical gene-gene interactions in population-based genetic studies of common human diseases
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5450417/
https://www.ncbi.nlm.nih.gov/pubmed/28572842
http://dx.doi.org/10.1186/s13040-017-0139-3
work_keys_str_mv AT moorejasonh gridbasedstochasticsearchforhierarchicalgenegeneinteractionsinpopulationbasedgeneticstudiesofcommonhumandiseases
AT andrewspeterc gridbasedstochasticsearchforhierarchicalgenegeneinteractionsinpopulationbasedgeneticstudiesofcommonhumandiseases
AT olsonrandals gridbasedstochasticsearchforhierarchicalgenegeneinteractionsinpopulationbasedgeneticstudiesofcommonhumandiseases
AT carlsonsarahe gridbasedstochasticsearchforhierarchicalgenegeneinteractionsinpopulationbasedgeneticstudiesofcommonhumandiseases
AT larockcurtr gridbasedstochasticsearchforhierarchicalgenegeneinteractionsinpopulationbasedgeneticstudiesofcommonhumandiseases
AT bulhoesmarioj gridbasedstochasticsearchforhierarchicalgenegeneinteractionsinpopulationbasedgeneticstudiesofcommonhumandiseases
AT oconnorjamesp gridbasedstochasticsearchforhierarchicalgenegeneinteractionsinpopulationbasedgeneticstudiesofcommonhumandiseases
AT greytakellenm gridbasedstochasticsearchforhierarchicalgenegeneinteractionsinpopulationbasedgeneticstudiesofcommonhumandiseases
AT armentroutstevenl gridbasedstochasticsearchforhierarchicalgenegeneinteractionsinpopulationbasedgeneticstudiesofcommonhumandiseases