Cargando…
Identification of interactions using model-based multifactor dimensionality reduction
BACKGROUND: Common complex traits may involve multiple genetic and environmental factors and their interactions. Many methods have been proposed to identify these interaction effects, among them several machine learning and data mining methods. These are attractive for identifying interactions becau...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5133504/ https://www.ncbi.nlm.nih.gov/pubmed/27980625 http://dx.doi.org/10.1186/s12919-016-0019-8 |
_version_ | 1782471276192858112 |
---|---|
author | Gola, Damian König, Inke R. |
author_facet | Gola, Damian König, Inke R. |
author_sort | Gola, Damian |
collection | PubMed |
description | BACKGROUND: Common complex traits may involve multiple genetic and environmental factors and their interactions. Many methods have been proposed to identify these interaction effects, among them several machine learning and data mining methods. These are attractive for identifying interactions because they do not rely on specific genetic model assumptions. To handle the computational burden arising from an exhaustive search, including all possible combinations of factors, filter methods try to select promising factors in advance. METHODS: Model-based multifactor dimensionality reduction (MB-MDR), a semiparametric machine learning method allowing adjustment for confounding variables and lower level effects, is applied to Genetic Analysis Workshop 19 (GAW19) data to identify interaction effects on different traits. Several filtering methods based on the nearest neighbor algorithm are assessed in terms of compatibility with MB-MDR. RESULTS: Single nucleotide polymorphism (SNP) rs859400 shows a significant interaction effect (corrected p value <0.05) with age on systolic blood pressure (SBP). We identified 23 SNP–SNP interaction effects on hypertension status (HS), 42 interaction effects on SBP, and 26 interaction effects on diastolic blood pressure (DBP). Several of these SNPs are in strong linkage disequilibrium (LD). Three of the interaction effects on HS are identified in filtered subsets. CONCLUSIONS: The considered filtering methods seem not to be appropriate to use with MB-MDR. LD pruning is further quality control to be incorporated, which can reduce the combinatorial burden by removing redundant SNPs. |
format | Online Article Text |
id | pubmed-5133504 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-51335042016-12-15 Identification of interactions using model-based multifactor dimensionality reduction Gola, Damian König, Inke R. BMC Proc Proceedings BACKGROUND: Common complex traits may involve multiple genetic and environmental factors and their interactions. Many methods have been proposed to identify these interaction effects, among them several machine learning and data mining methods. These are attractive for identifying interactions because they do not rely on specific genetic model assumptions. To handle the computational burden arising from an exhaustive search, including all possible combinations of factors, filter methods try to select promising factors in advance. METHODS: Model-based multifactor dimensionality reduction (MB-MDR), a semiparametric machine learning method allowing adjustment for confounding variables and lower level effects, is applied to Genetic Analysis Workshop 19 (GAW19) data to identify interaction effects on different traits. Several filtering methods based on the nearest neighbor algorithm are assessed in terms of compatibility with MB-MDR. RESULTS: Single nucleotide polymorphism (SNP) rs859400 shows a significant interaction effect (corrected p value <0.05) with age on systolic blood pressure (SBP). We identified 23 SNP–SNP interaction effects on hypertension status (HS), 42 interaction effects on SBP, and 26 interaction effects on diastolic blood pressure (DBP). Several of these SNPs are in strong linkage disequilibrium (LD). Three of the interaction effects on HS are identified in filtered subsets. CONCLUSIONS: The considered filtering methods seem not to be appropriate to use with MB-MDR. LD pruning is further quality control to be incorporated, which can reduce the combinatorial burden by removing redundant SNPs. BioMed Central 2016-10-18 /pmc/articles/PMC5133504/ /pubmed/27980625 http://dx.doi.org/10.1186/s12919-016-0019-8 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Gola, Damian König, Inke R. Identification of interactions using model-based multifactor dimensionality reduction |
title | Identification of interactions using model-based multifactor dimensionality reduction |
title_full | Identification of interactions using model-based multifactor dimensionality reduction |
title_fullStr | Identification of interactions using model-based multifactor dimensionality reduction |
title_full_unstemmed | Identification of interactions using model-based multifactor dimensionality reduction |
title_short | Identification of interactions using model-based multifactor dimensionality reduction |
title_sort | identification of interactions using model-based multifactor dimensionality reduction |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5133504/ https://www.ncbi.nlm.nih.gov/pubmed/27980625 http://dx.doi.org/10.1186/s12919-016-0019-8 |
work_keys_str_mv | AT goladamian identificationofinteractionsusingmodelbasedmultifactordimensionalityreduction AT koniginker identificationofinteractionsusingmodelbasedmultifactordimensionalityreduction |