Cargando…

TEAM: efficient two-locus epistasis tests in human genome-wide association study

As a promising tool for identifying genetic markers underlying phenotypic differences, genome-wide association study (GWAS) has been extensively investigated in recent years. In GWAS, detecting epistasis (or gene–gene interaction) is preferable over single locus study since many diseases are known t...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xiang, Huang, Shunping, Zou, Fei, Wang, Wei
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881371/
https://www.ncbi.nlm.nih.gov/pubmed/20529910
http://dx.doi.org/10.1093/bioinformatics/btq186
_version_ 1782182108688547840
author Zhang, Xiang
Huang, Shunping
Zou, Fei
Wang, Wei
author_facet Zhang, Xiang
Huang, Shunping
Zou, Fei
Wang, Wei
author_sort Zhang, Xiang
collection PubMed
description As a promising tool for identifying genetic markers underlying phenotypic differences, genome-wide association study (GWAS) has been extensively investigated in recent years. In GWAS, detecting epistasis (or gene–gene interaction) is preferable over single locus study since many diseases are known to be complex traits. A brute force search is infeasible for epistasis detection in the genome-wide scale because of the intensive computational burden. Existing epistasis detection algorithms are designed for dataset consisting of homozygous markers and small sample size. In human study, however, the genotype may be heterozygous, and number of individuals can be up to thousands. Thus, existing methods are not readily applicable to human datasets. In this article, we propose an efficient algorithm, TEAM, which significantly speeds up epistasis detection for human GWAS. Our algorithm is exhaustive, i.e. it does not ignore any epistatic interaction. Utilizing the minimum spanning tree structure, the algorithm incrementally updates the contingency tables for epistatic tests without scanning all individuals. Our algorithm has broader applicability and is more efficient than existing methods for large sample study. It supports any statistical test that is based on contingency tables, and enables both family-wise error rate and false discovery rate controlling. Extensive experiments show that our algorithm only needs to examine a small portion of the individuals to update the contingency tables, and it achieves at least an order of magnitude speed up over the brute force approach. Contact: xiang@cs.unc.edu
format Text
id pubmed-2881371
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28813712010-06-08 TEAM: efficient two-locus epistasis tests in human genome-wide association study Zhang, Xiang Huang, Shunping Zou, Fei Wang, Wei Bioinformatics Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa As a promising tool for identifying genetic markers underlying phenotypic differences, genome-wide association study (GWAS) has been extensively investigated in recent years. In GWAS, detecting epistasis (or gene–gene interaction) is preferable over single locus study since many diseases are known to be complex traits. A brute force search is infeasible for epistasis detection in the genome-wide scale because of the intensive computational burden. Existing epistasis detection algorithms are designed for dataset consisting of homozygous markers and small sample size. In human study, however, the genotype may be heterozygous, and number of individuals can be up to thousands. Thus, existing methods are not readily applicable to human datasets. In this article, we propose an efficient algorithm, TEAM, which significantly speeds up epistasis detection for human GWAS. Our algorithm is exhaustive, i.e. it does not ignore any epistatic interaction. Utilizing the minimum spanning tree structure, the algorithm incrementally updates the contingency tables for epistatic tests without scanning all individuals. Our algorithm has broader applicability and is more efficient than existing methods for large sample study. It supports any statistical test that is based on contingency tables, and enables both family-wise error rate and false discovery rate controlling. Extensive experiments show that our algorithm only needs to examine a small portion of the individuals to update the contingency tables, and it achieves at least an order of magnitude speed up over the brute force approach. Contact: xiang@cs.unc.edu Oxford University Press 2010-06-15 2010-06-01 /pmc/articles/PMC2881371/ /pubmed/20529910 http://dx.doi.org/10.1093/bioinformatics/btq186 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
Zhang, Xiang
Huang, Shunping
Zou, Fei
Wang, Wei
TEAM: efficient two-locus epistasis tests in human genome-wide association study
title TEAM: efficient two-locus epistasis tests in human genome-wide association study
title_full TEAM: efficient two-locus epistasis tests in human genome-wide association study
title_fullStr TEAM: efficient two-locus epistasis tests in human genome-wide association study
title_full_unstemmed TEAM: efficient two-locus epistasis tests in human genome-wide association study
title_short TEAM: efficient two-locus epistasis tests in human genome-wide association study
title_sort team: efficient two-locus epistasis tests in human genome-wide association study
topic Ismb 2010 Conference Proceedings July 11 to July 13, 2010, Boston, Ma, Usa
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881371/
https://www.ncbi.nlm.nih.gov/pubmed/20529910
http://dx.doi.org/10.1093/bioinformatics/btq186
work_keys_str_mv AT zhangxiang teamefficienttwolocusepistasistestsinhumangenomewideassociationstudy
AT huangshunping teamefficienttwolocusepistasistestsinhumangenomewideassociationstudy
AT zoufei teamefficienttwolocusepistasistestsinhumangenomewideassociationstudy
AT wangwei teamefficienttwolocusepistasistestsinhumangenomewideassociationstudy