Cargando…
KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies
BACKGROUND: Finding epistatic interactions in large association studies like genome-wide association studies (GWAS) with the nowadays-available large volume of genomic data is a challenging and largely unsolved issue. Few previous studies could handle genome-wide data due to the intractable difficul...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5361736/ https://www.ncbi.nlm.nih.gov/pubmed/28327091 http://dx.doi.org/10.1186/s12859-017-1599-7 |
_version_ | 1782516825717735424 |
---|---|
author | Abo Alchamlat, Sinan Farnir, Frédéric |
author_facet | Abo Alchamlat, Sinan Farnir, Frédéric |
author_sort | Abo Alchamlat, Sinan |
collection | PubMed |
description | BACKGROUND: Finding epistatic interactions in large association studies like genome-wide association studies (GWAS) with the nowadays-available large volume of genomic data is a challenging and largely unsolved issue. Few previous studies could handle genome-wide data due to the intractable difficulties met in searching a combinatorial explosive search space and statistically evaluating epistatic interactions given a limited number of samples. Our work is a contribution to this field. We propose a novel approach combining K-Nearest Neighbors (KNN) and Multi Dimensional Reduction (MDR) methods for detecting gene-gene interactions as a possible alternative to existing algorithms, e especially in situations where the number of involved determinants is high. After describing the approach, a comparison of our method (KNN-MDR) to a set of the other most performing methods (i.e., MDR, BOOST, BHIT, MegaSNPHunter and AntEpiSeeker) is carried on to detect interactions using simulated data as well as real genome-wide data. RESULTS: Experimental results on both simulated data and real genome-wide data show that KNN-MDR has interesting properties in terms of accuracy and power, and that, in many cases, it significantly outperforms its recent competitors. CONCLUSIONS: The presented methodology (KNN-MDR) is valuable in the context of loci and interactions mapping and can be seen as an interesting addition to the arsenal used in complex traits analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1599-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5361736 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-53617362017-03-24 KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies Abo Alchamlat, Sinan Farnir, Frédéric BMC Bioinformatics Methodology Article BACKGROUND: Finding epistatic interactions in large association studies like genome-wide association studies (GWAS) with the nowadays-available large volume of genomic data is a challenging and largely unsolved issue. Few previous studies could handle genome-wide data due to the intractable difficulties met in searching a combinatorial explosive search space and statistically evaluating epistatic interactions given a limited number of samples. Our work is a contribution to this field. We propose a novel approach combining K-Nearest Neighbors (KNN) and Multi Dimensional Reduction (MDR) methods for detecting gene-gene interactions as a possible alternative to existing algorithms, e especially in situations where the number of involved determinants is high. After describing the approach, a comparison of our method (KNN-MDR) to a set of the other most performing methods (i.e., MDR, BOOST, BHIT, MegaSNPHunter and AntEpiSeeker) is carried on to detect interactions using simulated data as well as real genome-wide data. RESULTS: Experimental results on both simulated data and real genome-wide data show that KNN-MDR has interesting properties in terms of accuracy and power, and that, in many cases, it significantly outperforms its recent competitors. CONCLUSIONS: The presented methodology (KNN-MDR) is valuable in the context of loci and interactions mapping and can be seen as an interesting addition to the arsenal used in complex traits analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1599-7) contains supplementary material, which is available to authorized users. BioMed Central 2017-03-21 /pmc/articles/PMC5361736/ /pubmed/28327091 http://dx.doi.org/10.1186/s12859-017-1599-7 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Abo Alchamlat, Sinan Farnir, Frédéric KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies |
title | KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies |
title_full | KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies |
title_fullStr | KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies |
title_full_unstemmed | KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies |
title_short | KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies |
title_sort | knn-mdr: a learning approach for improving interactions mapping performances in genome wide association studies |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5361736/ https://www.ncbi.nlm.nih.gov/pubmed/28327091 http://dx.doi.org/10.1186/s12859-017-1599-7 |
work_keys_str_mv | AT aboalchamlatsinan knnmdralearningapproachforimprovinginteractionsmappingperformancesingenomewideassociationstudies AT farnirfrederic knnmdralearningapproachforimprovinginteractionsmappingperformancesingenomewideassociationstudies |