Cargando…

High-throughput analysis of epistasis in genome-wide association studies with BiForce

Motivation: Gene–gene interactions (epistasis) are thought to be important in shaping complex traits, but they have been under-explored in genome-wide association studies (GWAS) due to the computational challenge of enumerating billions of single nucleotide polymorphism (SNP) combinations. Fast scre...

Descripción completa

Detalles Bibliográficos
Autores principales: Gyenesei, Attila, Moody, Jonathan, Semple, Colin A.M., Haley, Chris S., Wei, Wen-Hua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3400955/
https://www.ncbi.nlm.nih.gov/pubmed/22618535
http://dx.doi.org/10.1093/bioinformatics/bts304
_version_ 1782238551673405440
author Gyenesei, Attila
Moody, Jonathan
Semple, Colin A.M.
Haley, Chris S.
Wei, Wen-Hua
author_facet Gyenesei, Attila
Moody, Jonathan
Semple, Colin A.M.
Haley, Chris S.
Wei, Wen-Hua
author_sort Gyenesei, Attila
collection PubMed
description Motivation: Gene–gene interactions (epistasis) are thought to be important in shaping complex traits, but they have been under-explored in genome-wide association studies (GWAS) due to the computational challenge of enumerating billions of single nucleotide polymorphism (SNP) combinations. Fast screening tools are needed to make epistasis analysis routinely available in GWAS. Results: We present BiForce to support high-throughput analysis of epistasis in GWAS for either quantitative or binary disease (case–control) traits. BiForce achieves great computational efficiency by using memory efficient data structures, Boolean bitwise operations and multithreaded parallelization. It performs a full pair-wise genome scan to detect interactions involving SNPs with or without significant marginal effects using appropriate Bonferroni-corrected significance thresholds. We show that BiForce is more powerful and significantly faster than published tools for both binary and quantitative traits in a series of performance tests on simulated and real datasets. We demonstrate BiForce in analysing eight metabolic traits in a GWAS cohort (323 697 SNPs, >4500 individuals) and two disease traits in another (>340 000 SNPs, >1750 cases and 1500 controls) on a 32-node computing cluster. BiForce completed analyses of the eight metabolic traits within 1 day, identified nine epistatic pairs of SNPs in five metabolic traits and 18 SNP pairs in two disease traits. BiForce can make the analysis of epistasis a routine exercise in GWAS and thus improve our understanding of the role of epistasis in the genetic regulation of complex traits. Availability and implementation: The software is free and can be downloaded from http://bioinfo.utu.fi/BiForce/. Contact: wenhua.wei@igmm.ed.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3400955
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-34009552012-07-20 High-throughput analysis of epistasis in genome-wide association studies with BiForce Gyenesei, Attila Moody, Jonathan Semple, Colin A.M. Haley, Chris S. Wei, Wen-Hua Bioinformatics Original Papers Motivation: Gene–gene interactions (epistasis) are thought to be important in shaping complex traits, but they have been under-explored in genome-wide association studies (GWAS) due to the computational challenge of enumerating billions of single nucleotide polymorphism (SNP) combinations. Fast screening tools are needed to make epistasis analysis routinely available in GWAS. Results: We present BiForce to support high-throughput analysis of epistasis in GWAS for either quantitative or binary disease (case–control) traits. BiForce achieves great computational efficiency by using memory efficient data structures, Boolean bitwise operations and multithreaded parallelization. It performs a full pair-wise genome scan to detect interactions involving SNPs with or without significant marginal effects using appropriate Bonferroni-corrected significance thresholds. We show that BiForce is more powerful and significantly faster than published tools for both binary and quantitative traits in a series of performance tests on simulated and real datasets. We demonstrate BiForce in analysing eight metabolic traits in a GWAS cohort (323 697 SNPs, >4500 individuals) and two disease traits in another (>340 000 SNPs, >1750 cases and 1500 controls) on a 32-node computing cluster. BiForce completed analyses of the eight metabolic traits within 1 day, identified nine epistatic pairs of SNPs in five metabolic traits and 18 SNP pairs in two disease traits. BiForce can make the analysis of epistasis a routine exercise in GWAS and thus improve our understanding of the role of epistasis in the genetic regulation of complex traits. Availability and implementation: The software is free and can be downloaded from http://bioinfo.utu.fi/BiForce/. Contact: wenhua.wei@igmm.ed.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-08-01 2012-05-21 /pmc/articles/PMC3400955/ /pubmed/22618535 http://dx.doi.org/10.1093/bioinformatics/bts304 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Gyenesei, Attila
Moody, Jonathan
Semple, Colin A.M.
Haley, Chris S.
Wei, Wen-Hua
High-throughput analysis of epistasis in genome-wide association studies with BiForce
title High-throughput analysis of epistasis in genome-wide association studies with BiForce
title_full High-throughput analysis of epistasis in genome-wide association studies with BiForce
title_fullStr High-throughput analysis of epistasis in genome-wide association studies with BiForce
title_full_unstemmed High-throughput analysis of epistasis in genome-wide association studies with BiForce
title_short High-throughput analysis of epistasis in genome-wide association studies with BiForce
title_sort high-throughput analysis of epistasis in genome-wide association studies with biforce
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3400955/
https://www.ncbi.nlm.nih.gov/pubmed/22618535
http://dx.doi.org/10.1093/bioinformatics/bts304
work_keys_str_mv AT gyeneseiattila highthroughputanalysisofepistasisingenomewideassociationstudieswithbiforce
AT moodyjonathan highthroughputanalysisofepistasisingenomewideassociationstudieswithbiforce
AT semplecolinam highthroughputanalysisofepistasisingenomewideassociationstudieswithbiforce
AT haleychriss highthroughputanalysisofepistasisingenomewideassociationstudieswithbiforce
AT weiwenhua highthroughputanalysisofepistasisingenomewideassociationstudieswithbiforce