Cargando…
Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data)
BACKGROUND: Despite the successful mapping of genes involved in the determinism of numerous traits, a large part of the genetic variation remains unexplained. A possible explanation is that the simple models used in many studies might not properly fit the actual underlying situations. Consequently,...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6267805/ https://www.ncbi.nlm.nih.gov/pubmed/30497383 http://dx.doi.org/10.1186/s12859-018-2447-0 |
_version_ | 1783376158098718720 |
---|---|
author | Abo Alchamlat, Sinan Farnir, Frédéric |
author_facet | Abo Alchamlat, Sinan Farnir, Frédéric |
author_sort | Abo Alchamlat, Sinan |
collection | PubMed |
description | BACKGROUND: Despite the successful mapping of genes involved in the determinism of numerous traits, a large part of the genetic variation remains unexplained. A possible explanation is that the simple models used in many studies might not properly fit the actual underlying situations. Consequently, various methods have attempted to deal with the simultaneous mapping of genomic regions, assuming that these regions might interact, leading to a complex determinism for various traits. Despite some successes, no gold standard methodology has emerged. Actually, combining several interaction mapping methods might be a better strategy, leading to positive results over a larger set of situations. Our work is a step in that direction. RESULTS: We first have demonstrated why aggregating results from several distinct methods might increase the statistical power while controlling the type I error. We have illustrated the approach using 6 existing methods (namely: MDR, Boost, BHIT, KNN-MDR, MegaSNPHunter and AntEpiSeeker) on simulated and real data sets. We have used a very simple aggregation strategy: a majority vote across the best loci combinations identified by the individual methods. In order to assess the performances of our aggregation approach in problems where most individual methods tend to fail, we have simulated difficult situations where no marginal effects of individual genes exist and where genetic heterogeneity is present. we have also demonstrated the use of the strategy on real data, using a WTCCC dataset on rheumatoid arthritis. Since we have been using simplistic assumptions to infer the expected power of the aggregation method, the actual power we estimated from our simulations has turned out to be a bit smaller than theoretically expected. Results nevertheless have shown that grouping the results of several methods is advantageous in terms of power, accuracy and type I error control. Furthermore, as more methods should become available in the future, using a grouping strategy will become more advantageous since adding more methods seems to improve the performances of the aggregated method. CONCLUSIONS: The aggregation of methods as a tool to detect genetic interactions is a potentially useful addition to the arsenal used in complex traits analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2447-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6267805 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-62678052018-12-05 Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data) Abo Alchamlat, Sinan Farnir, Frédéric BMC Bioinformatics Methodology Article BACKGROUND: Despite the successful mapping of genes involved in the determinism of numerous traits, a large part of the genetic variation remains unexplained. A possible explanation is that the simple models used in many studies might not properly fit the actual underlying situations. Consequently, various methods have attempted to deal with the simultaneous mapping of genomic regions, assuming that these regions might interact, leading to a complex determinism for various traits. Despite some successes, no gold standard methodology has emerged. Actually, combining several interaction mapping methods might be a better strategy, leading to positive results over a larger set of situations. Our work is a step in that direction. RESULTS: We first have demonstrated why aggregating results from several distinct methods might increase the statistical power while controlling the type I error. We have illustrated the approach using 6 existing methods (namely: MDR, Boost, BHIT, KNN-MDR, MegaSNPHunter and AntEpiSeeker) on simulated and real data sets. We have used a very simple aggregation strategy: a majority vote across the best loci combinations identified by the individual methods. In order to assess the performances of our aggregation approach in problems where most individual methods tend to fail, we have simulated difficult situations where no marginal effects of individual genes exist and where genetic heterogeneity is present. we have also demonstrated the use of the strategy on real data, using a WTCCC dataset on rheumatoid arthritis. Since we have been using simplistic assumptions to infer the expected power of the aggregation method, the actual power we estimated from our simulations has turned out to be a bit smaller than theoretically expected. Results nevertheless have shown that grouping the results of several methods is advantageous in terms of power, accuracy and type I error control. Furthermore, as more methods should become available in the future, using a grouping strategy will become more advantageous since adding more methods seems to improve the performances of the aggregated method. CONCLUSIONS: The aggregation of methods as a tool to detect genetic interactions is a potentially useful addition to the arsenal used in complex traits analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2447-0) contains supplementary material, which is available to authorized users. BioMed Central 2018-11-21 /pmc/articles/PMC6267805/ /pubmed/30497383 http://dx.doi.org/10.1186/s12859-018-2447-0 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Abo Alchamlat, Sinan Farnir, Frédéric Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data) |
title | Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data) |
title_full | Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data) |
title_fullStr | Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data) |
title_full_unstemmed | Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data) |
title_short | Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data) |
title_sort | aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data) |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6267805/ https://www.ncbi.nlm.nih.gov/pubmed/30497383 http://dx.doi.org/10.1186/s12859-018-2447-0 |
work_keys_str_mv | AT aboalchamlatsinan aggregationofexpertsanapplicationinthefieldofinteractomicsdetectionofinteractionsonthebasisofgenomicdata AT farnirfrederic aggregationofexpertsanapplicationinthefieldofinteractomicsdetectionofinteractionsonthebasisofgenomicdata |