Cargando…

Comparative analysis of methods for detecting interacting loci

BACKGROUND: Interactions among genetic loci are believed to play an important role in disease risk. While many methods have been proposed for detecting such interactions, their relative performance remains largely unclear, mainly because different data sources, detection performance criteria, and ex...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Li, Yu, Guoqiang, Langefeld, Carl D, Miller, David J, Guy, Richard T, Raghuram, Jayaram, Yuan, Xiguo, Herrington, David M, Wang, Yue
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3161015/ https://www.ncbi.nlm.nih.gov/pubmed/21729295 http://dx.doi.org/10.1186/1471-2164-12-344

_version_	1782210624794656768
author	Chen, Li Yu, Guoqiang Langefeld, Carl D Miller, David J Guy, Richard T Raghuram, Jayaram Yuan, Xiguo Herrington, David M Wang, Yue
author_facet	Chen, Li Yu, Guoqiang Langefeld, Carl D Miller, David J Guy, Richard T Raghuram, Jayaram Yuan, Xiguo Herrington, David M Wang, Yue
author_sort	Chen, Li
collection	PubMed
description	BACKGROUND: Interactions among genetic loci are believed to play an important role in disease risk. While many methods have been proposed for detecting such interactions, their relative performance remains largely unclear, mainly because different data sources, detection performance criteria, and experimental protocols were used in the papers introducing these methods and in subsequent studies. Moreover, there have been very few studies strictly focused on comparison of existing methods. Given the importance of detecting gene-gene and gene-environment interactions, a rigorous, comprehensive comparison of performance and limitations of available interaction detection methods is warranted. RESULTS: We report a comparison of eight representative methods, of which seven were specifically designed to detect interactions among single nucleotide polymorphisms (SNPs), with the last a popular main-effect testing method used as a baseline for performance evaluation. The selected methods, multifactor dimensionality reduction (MDR), full interaction model (FIM), information gain (IG), Bayesian epistasis association mapping (BEAM), SNP harvester (SH), maximum entropy conditional probability modeling (MECPM), logistic regression with an interaction term (LRIT), and logistic regression (LR) were compared on a large number of simulated data sets, each, consistent with complex disease models, embedding multiple sets of interacting SNPs, under different interaction models. The assessment criteria included several relevant detection power measures, family-wise type I error rate, and computational complexity. There are several important results from this study. First, while some SNPs in interactions with strong effects are successfully detected, most of the methods miss many interacting SNPs at an acceptable rate of false positives. In this study, the best-performing method was MECPM. Second, the statistical significance assessment criteria, used by some of the methods to control the type I error rate, are quite conservative, thereby limiting their power and making it difficult to fairly compare them. Third, as expected, power varies for different models and as a function of penetrance, minor allele frequency, linkage disequilibrium and marginal effects. Fourth, the analytical relationships between power and these factors are derived, aiding in the interpretation of the study results. Fifth, for these methods the magnitude of the main effect influences the power of the tests. Sixth, most methods can detect some ground-truth SNPs but have modest power to detect the whole set of interacting SNPs. CONCLUSION: This comparison study provides new insights into the strengths and limitations of current methods for detecting interacting loci. This study, along with freely available simulation tools we provide, should help support development of improved methods. The simulation tools are available at: http://code.google.com/p/simulation-tool-bmc-ms9169818735220977/downloads/list.
format	Online Article Text
id	pubmed-3161015
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-31610152011-08-25 Comparative analysis of methods for detecting interacting loci Chen, Li Yu, Guoqiang Langefeld, Carl D Miller, David J Guy, Richard T Raghuram, Jayaram Yuan, Xiguo Herrington, David M Wang, Yue BMC Genomics Methodology Article BACKGROUND: Interactions among genetic loci are believed to play an important role in disease risk. While many methods have been proposed for detecting such interactions, their relative performance remains largely unclear, mainly because different data sources, detection performance criteria, and experimental protocols were used in the papers introducing these methods and in subsequent studies. Moreover, there have been very few studies strictly focused on comparison of existing methods. Given the importance of detecting gene-gene and gene-environment interactions, a rigorous, comprehensive comparison of performance and limitations of available interaction detection methods is warranted. RESULTS: We report a comparison of eight representative methods, of which seven were specifically designed to detect interactions among single nucleotide polymorphisms (SNPs), with the last a popular main-effect testing method used as a baseline for performance evaluation. The selected methods, multifactor dimensionality reduction (MDR), full interaction model (FIM), information gain (IG), Bayesian epistasis association mapping (BEAM), SNP harvester (SH), maximum entropy conditional probability modeling (MECPM), logistic regression with an interaction term (LRIT), and logistic regression (LR) were compared on a large number of simulated data sets, each, consistent with complex disease models, embedding multiple sets of interacting SNPs, under different interaction models. The assessment criteria included several relevant detection power measures, family-wise type I error rate, and computational complexity. There are several important results from this study. First, while some SNPs in interactions with strong effects are successfully detected, most of the methods miss many interacting SNPs at an acceptable rate of false positives. In this study, the best-performing method was MECPM. Second, the statistical significance assessment criteria, used by some of the methods to control the type I error rate, are quite conservative, thereby limiting their power and making it difficult to fairly compare them. Third, as expected, power varies for different models and as a function of penetrance, minor allele frequency, linkage disequilibrium and marginal effects. Fourth, the analytical relationships between power and these factors are derived, aiding in the interpretation of the study results. Fifth, for these methods the magnitude of the main effect influences the power of the tests. Sixth, most methods can detect some ground-truth SNPs but have modest power to detect the whole set of interacting SNPs. CONCLUSION: This comparison study provides new insights into the strengths and limitations of current methods for detecting interacting loci. This study, along with freely available simulation tools we provide, should help support development of improved methods. The simulation tools are available at: http://code.google.com/p/simulation-tool-bmc-ms9169818735220977/downloads/list. BioMed Central 2011-07-05 /pmc/articles/PMC3161015/ /pubmed/21729295 http://dx.doi.org/10.1186/1471-2164-12-344 Text en Copyright ©2011 Chen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Chen, Li Yu, Guoqiang Langefeld, Carl D Miller, David J Guy, Richard T Raghuram, Jayaram Yuan, Xiguo Herrington, David M Wang, Yue Comparative analysis of methods for detecting interacting loci
title	Comparative analysis of methods for detecting interacting loci
title_full	Comparative analysis of methods for detecting interacting loci
title_fullStr	Comparative analysis of methods for detecting interacting loci
title_full_unstemmed	Comparative analysis of methods for detecting interacting loci
title_short	Comparative analysis of methods for detecting interacting loci
title_sort	comparative analysis of methods for detecting interacting loci
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3161015/ https://www.ncbi.nlm.nih.gov/pubmed/21729295 http://dx.doi.org/10.1186/1471-2164-12-344
work_keys_str_mv	AT chenli comparativeanalysisofmethodsfordetectinginteractingloci AT yuguoqiang comparativeanalysisofmethodsfordetectinginteractingloci AT langefeldcarld comparativeanalysisofmethodsfordetectinginteractingloci AT millerdavidj comparativeanalysisofmethodsfordetectinginteractingloci AT guyrichardt comparativeanalysisofmethodsfordetectinginteractingloci AT raghuramjayaram comparativeanalysisofmethodsfordetectinginteractingloci AT yuanxiguo comparativeanalysisofmethodsfordetectinginteractingloci AT herringtondavidm comparativeanalysisofmethodsfordetectinginteractingloci AT wangyue comparativeanalysisofmethodsfordetectinginteractingloci

Comparative analysis of methods for detecting interacting loci

Ejemplares similares