Cargando…
Discovering epistasis in large scale genetic association studies by exploiting graphics cards
Despite the enormous investments made in collecting DNA samples and generating germline variation data across thousands of individuals in modern genome-wide association studies (GWAS), progress has been frustratingly slow in explaining much of the heritability in common disease. Today's paradig...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3848199/ https://www.ncbi.nlm.nih.gov/pubmed/24348518 http://dx.doi.org/10.3389/fgene.2013.00266 |
_version_ | 1782293729224163328 |
---|---|
author | Chen, Gary K. Guo, Yunfei |
author_facet | Chen, Gary K. Guo, Yunfei |
author_sort | Chen, Gary K. |
collection | PubMed |
description | Despite the enormous investments made in collecting DNA samples and generating germline variation data across thousands of individuals in modern genome-wide association studies (GWAS), progress has been frustratingly slow in explaining much of the heritability in common disease. Today's paradigm of testing independent hypotheses on each single nucleotide polymorphism (SNP) marker is unlikely to adequately reflect the complex biological processes in disease risk. Alternatively, modeling risk as an ensemble of SNPs that act in concert in a pathway, and/or interact non-additively on log risk for example, may be a more sensible way to approach gene mapping in modern studies. Implementing such analyzes genome-wide can quickly become intractable due to the fact that even modest size SNP panels on modern genotype arrays (500k markers) pose a combinatorial nightmare, require tens of billions of models to be tested for evidence of interaction. In this article, we provide an in-depth analysis of programs that have been developed to explicitly overcome these enormous computational barriers through the use of processors on graphics cards known as Graphics Processing Units (GPU). We include tutorials on GPU technology, which will convey why they are growing in appeal with today's numerical scientists. One obvious advantage is the impressive density of microprocessor cores that are available on only a single GPU. Whereas high end servers feature up to 24 Intel or AMD CPU cores, the latest GPU offerings from nVidia feature over 2600 cores. Each compute node may be outfitted with up to 4 GPU devices. Success on GPUs varies across problems. However, epistasis screens fare well due to the high degree of parallelism exposed in these problems. Papers that we review routinely report GPU speedups of over two orders of magnitude (>100x) over standard CPU implementations. |
format | Online Article Text |
id | pubmed-3848199 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-38481992013-12-17 Discovering epistasis in large scale genetic association studies by exploiting graphics cards Chen, Gary K. Guo, Yunfei Front Genet Genetics Despite the enormous investments made in collecting DNA samples and generating germline variation data across thousands of individuals in modern genome-wide association studies (GWAS), progress has been frustratingly slow in explaining much of the heritability in common disease. Today's paradigm of testing independent hypotheses on each single nucleotide polymorphism (SNP) marker is unlikely to adequately reflect the complex biological processes in disease risk. Alternatively, modeling risk as an ensemble of SNPs that act in concert in a pathway, and/or interact non-additively on log risk for example, may be a more sensible way to approach gene mapping in modern studies. Implementing such analyzes genome-wide can quickly become intractable due to the fact that even modest size SNP panels on modern genotype arrays (500k markers) pose a combinatorial nightmare, require tens of billions of models to be tested for evidence of interaction. In this article, we provide an in-depth analysis of programs that have been developed to explicitly overcome these enormous computational barriers through the use of processors on graphics cards known as Graphics Processing Units (GPU). We include tutorials on GPU technology, which will convey why they are growing in appeal with today's numerical scientists. One obvious advantage is the impressive density of microprocessor cores that are available on only a single GPU. Whereas high end servers feature up to 24 Intel or AMD CPU cores, the latest GPU offerings from nVidia feature over 2600 cores. Each compute node may be outfitted with up to 4 GPU devices. Success on GPUs varies across problems. However, epistasis screens fare well due to the high degree of parallelism exposed in these problems. Papers that we review routinely report GPU speedups of over two orders of magnitude (>100x) over standard CPU implementations. Frontiers Media S.A. 2013-12-03 /pmc/articles/PMC3848199/ /pubmed/24348518 http://dx.doi.org/10.3389/fgene.2013.00266 Text en Copyright © 2013 Chen and Guo. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Chen, Gary K. Guo, Yunfei Discovering epistasis in large scale genetic association studies by exploiting graphics cards |
title | Discovering epistasis in large scale genetic association studies by exploiting graphics cards |
title_full | Discovering epistasis in large scale genetic association studies by exploiting graphics cards |
title_fullStr | Discovering epistasis in large scale genetic association studies by exploiting graphics cards |
title_full_unstemmed | Discovering epistasis in large scale genetic association studies by exploiting graphics cards |
title_short | Discovering epistasis in large scale genetic association studies by exploiting graphics cards |
title_sort | discovering epistasis in large scale genetic association studies by exploiting graphics cards |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3848199/ https://www.ncbi.nlm.nih.gov/pubmed/24348518 http://dx.doi.org/10.3389/fgene.2013.00266 |
work_keys_str_mv | AT chengaryk discoveringepistasisinlargescalegeneticassociationstudiesbyexploitinggraphicscards AT guoyunfei discoveringepistasisinlargescalegeneticassociationstudiesbyexploitinggraphicscards |