Cargando…
High-throughput analysis of epistasis in genome-wide association studies with BiForce
Motivation: Gene–gene interactions (epistasis) are thought to be important in shaping complex traits, but they have been under-explored in genome-wide association studies (GWAS) due to the computational challenge of enumerating billions of single nucleotide polymorphism (SNP) combinations. Fast scre...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3400955/ https://www.ncbi.nlm.nih.gov/pubmed/22618535 http://dx.doi.org/10.1093/bioinformatics/bts304 |
_version_ | 1782238551673405440 |
---|---|
author | Gyenesei, Attila Moody, Jonathan Semple, Colin A.M. Haley, Chris S. Wei, Wen-Hua |
author_facet | Gyenesei, Attila Moody, Jonathan Semple, Colin A.M. Haley, Chris S. Wei, Wen-Hua |
author_sort | Gyenesei, Attila |
collection | PubMed |
description | Motivation: Gene–gene interactions (epistasis) are thought to be important in shaping complex traits, but they have been under-explored in genome-wide association studies (GWAS) due to the computational challenge of enumerating billions of single nucleotide polymorphism (SNP) combinations. Fast screening tools are needed to make epistasis analysis routinely available in GWAS. Results: We present BiForce to support high-throughput analysis of epistasis in GWAS for either quantitative or binary disease (case–control) traits. BiForce achieves great computational efficiency by using memory efficient data structures, Boolean bitwise operations and multithreaded parallelization. It performs a full pair-wise genome scan to detect interactions involving SNPs with or without significant marginal effects using appropriate Bonferroni-corrected significance thresholds. We show that BiForce is more powerful and significantly faster than published tools for both binary and quantitative traits in a series of performance tests on simulated and real datasets. We demonstrate BiForce in analysing eight metabolic traits in a GWAS cohort (323 697 SNPs, >4500 individuals) and two disease traits in another (>340 000 SNPs, >1750 cases and 1500 controls) on a 32-node computing cluster. BiForce completed analyses of the eight metabolic traits within 1 day, identified nine epistatic pairs of SNPs in five metabolic traits and 18 SNP pairs in two disease traits. BiForce can make the analysis of epistasis a routine exercise in GWAS and thus improve our understanding of the role of epistasis in the genetic regulation of complex traits. Availability and implementation: The software is free and can be downloaded from http://bioinfo.utu.fi/BiForce/. Contact: wenhua.wei@igmm.ed.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-3400955 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-34009552012-07-20 High-throughput analysis of epistasis in genome-wide association studies with BiForce Gyenesei, Attila Moody, Jonathan Semple, Colin A.M. Haley, Chris S. Wei, Wen-Hua Bioinformatics Original Papers Motivation: Gene–gene interactions (epistasis) are thought to be important in shaping complex traits, but they have been under-explored in genome-wide association studies (GWAS) due to the computational challenge of enumerating billions of single nucleotide polymorphism (SNP) combinations. Fast screening tools are needed to make epistasis analysis routinely available in GWAS. Results: We present BiForce to support high-throughput analysis of epistasis in GWAS for either quantitative or binary disease (case–control) traits. BiForce achieves great computational efficiency by using memory efficient data structures, Boolean bitwise operations and multithreaded parallelization. It performs a full pair-wise genome scan to detect interactions involving SNPs with or without significant marginal effects using appropriate Bonferroni-corrected significance thresholds. We show that BiForce is more powerful and significantly faster than published tools for both binary and quantitative traits in a series of performance tests on simulated and real datasets. We demonstrate BiForce in analysing eight metabolic traits in a GWAS cohort (323 697 SNPs, >4500 individuals) and two disease traits in another (>340 000 SNPs, >1750 cases and 1500 controls) on a 32-node computing cluster. BiForce completed analyses of the eight metabolic traits within 1 day, identified nine epistatic pairs of SNPs in five metabolic traits and 18 SNP pairs in two disease traits. BiForce can make the analysis of epistasis a routine exercise in GWAS and thus improve our understanding of the role of epistasis in the genetic regulation of complex traits. Availability and implementation: The software is free and can be downloaded from http://bioinfo.utu.fi/BiForce/. Contact: wenhua.wei@igmm.ed.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-08-01 2012-05-21 /pmc/articles/PMC3400955/ /pubmed/22618535 http://dx.doi.org/10.1093/bioinformatics/bts304 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Gyenesei, Attila Moody, Jonathan Semple, Colin A.M. Haley, Chris S. Wei, Wen-Hua High-throughput analysis of epistasis in genome-wide association studies with BiForce |
title | High-throughput analysis of epistasis in genome-wide association studies with BiForce |
title_full | High-throughput analysis of epistasis in genome-wide association studies with BiForce |
title_fullStr | High-throughput analysis of epistasis in genome-wide association studies with BiForce |
title_full_unstemmed | High-throughput analysis of epistasis in genome-wide association studies with BiForce |
title_short | High-throughput analysis of epistasis in genome-wide association studies with BiForce |
title_sort | high-throughput analysis of epistasis in genome-wide association studies with biforce |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3400955/ https://www.ncbi.nlm.nih.gov/pubmed/22618535 http://dx.doi.org/10.1093/bioinformatics/bts304 |
work_keys_str_mv | AT gyeneseiattila highthroughputanalysisofepistasisingenomewideassociationstudieswithbiforce AT moodyjonathan highthroughputanalysisofepistasisingenomewideassociationstudieswithbiforce AT semplecolinam highthroughputanalysisofepistasisingenomewideassociationstudieswithbiforce AT haleychriss highthroughputanalysisofepistasisingenomewideassociationstudieswithbiforce AT weiwenhua highthroughputanalysisofepistasisingenomewideassociationstudieswithbiforce |