Cargando…

Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data)

BACKGROUND: Despite the successful mapping of genes involved in the determinism of numerous traits, a large part of the genetic variation remains unexplained. A possible explanation is that the simple models used in many studies might not properly fit the actual underlying situations. Consequently,...

Descripción completa

Detalles Bibliográficos
Autores principales: Abo Alchamlat, Sinan, Farnir, Frédéric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6267805/
https://www.ncbi.nlm.nih.gov/pubmed/30497383
http://dx.doi.org/10.1186/s12859-018-2447-0
_version_ 1783376158098718720
author Abo Alchamlat, Sinan
Farnir, Frédéric
author_facet Abo Alchamlat, Sinan
Farnir, Frédéric
author_sort Abo Alchamlat, Sinan
collection PubMed
description BACKGROUND: Despite the successful mapping of genes involved in the determinism of numerous traits, a large part of the genetic variation remains unexplained. A possible explanation is that the simple models used in many studies might not properly fit the actual underlying situations. Consequently, various methods have attempted to deal with the simultaneous mapping of genomic regions, assuming that these regions might interact, leading to a complex determinism for various traits. Despite some successes, no gold standard methodology has emerged. Actually, combining several interaction mapping methods might be a better strategy, leading to positive results over a larger set of situations. Our work is a step in that direction. RESULTS: We first have demonstrated why aggregating results from several distinct methods might increase the statistical power while controlling the type I error. We have illustrated the approach using 6 existing methods (namely: MDR, Boost, BHIT, KNN-MDR, MegaSNPHunter and AntEpiSeeker) on simulated and real data sets. We have used a very simple aggregation strategy: a majority vote across the best loci combinations identified by the individual methods. In order to assess the performances of our aggregation approach in problems where most individual methods tend to fail, we have simulated difficult situations where no marginal effects of individual genes exist and where genetic heterogeneity is present. we have also demonstrated the use of the strategy on real data, using a WTCCC dataset on rheumatoid arthritis. Since we have been using simplistic assumptions to infer the expected power of the aggregation method, the actual power we estimated from our simulations has turned out to be a bit smaller than theoretically expected. Results nevertheless have shown that grouping the results of several methods is advantageous in terms of power, accuracy and type I error control. Furthermore, as more methods should become available in the future, using a grouping strategy will become more advantageous since adding more methods seems to improve the performances of the aggregated method. CONCLUSIONS: The aggregation of methods as a tool to detect genetic interactions is a potentially useful addition to the arsenal used in complex traits analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2447-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6267805
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62678052018-12-05 Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data) Abo Alchamlat, Sinan Farnir, Frédéric BMC Bioinformatics Methodology Article BACKGROUND: Despite the successful mapping of genes involved in the determinism of numerous traits, a large part of the genetic variation remains unexplained. A possible explanation is that the simple models used in many studies might not properly fit the actual underlying situations. Consequently, various methods have attempted to deal with the simultaneous mapping of genomic regions, assuming that these regions might interact, leading to a complex determinism for various traits. Despite some successes, no gold standard methodology has emerged. Actually, combining several interaction mapping methods might be a better strategy, leading to positive results over a larger set of situations. Our work is a step in that direction. RESULTS: We first have demonstrated why aggregating results from several distinct methods might increase the statistical power while controlling the type I error. We have illustrated the approach using 6 existing methods (namely: MDR, Boost, BHIT, KNN-MDR, MegaSNPHunter and AntEpiSeeker) on simulated and real data sets. We have used a very simple aggregation strategy: a majority vote across the best loci combinations identified by the individual methods. In order to assess the performances of our aggregation approach in problems where most individual methods tend to fail, we have simulated difficult situations where no marginal effects of individual genes exist and where genetic heterogeneity is present. we have also demonstrated the use of the strategy on real data, using a WTCCC dataset on rheumatoid arthritis. Since we have been using simplistic assumptions to infer the expected power of the aggregation method, the actual power we estimated from our simulations has turned out to be a bit smaller than theoretically expected. Results nevertheless have shown that grouping the results of several methods is advantageous in terms of power, accuracy and type I error control. Furthermore, as more methods should become available in the future, using a grouping strategy will become more advantageous since adding more methods seems to improve the performances of the aggregated method. CONCLUSIONS: The aggregation of methods as a tool to detect genetic interactions is a potentially useful addition to the arsenal used in complex traits analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2447-0) contains supplementary material, which is available to authorized users. BioMed Central 2018-11-21 /pmc/articles/PMC6267805/ /pubmed/30497383 http://dx.doi.org/10.1186/s12859-018-2447-0 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Abo Alchamlat, Sinan
Farnir, Frédéric
Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data)
title Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data)
title_full Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data)
title_fullStr Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data)
title_full_unstemmed Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data)
title_short Aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data)
title_sort aggregation of experts: an application in the field of “interactomics” (detection of interactions on the basis of genomic data)
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6267805/
https://www.ncbi.nlm.nih.gov/pubmed/30497383
http://dx.doi.org/10.1186/s12859-018-2447-0
work_keys_str_mv AT aboalchamlatsinan aggregationofexpertsanapplicationinthefieldofinteractomicsdetectionofinteractionsonthebasisofgenomicdata
AT farnirfrederic aggregationofexpertsanapplicationinthefieldofinteractomicsdetectionofinteractionsonthebasisofgenomicdata