Cargando…
Multifactor dimensionality reduction reveals a three-locus epistatic interaction associated with susceptibility to pulmonary tuberculosis
BACKGROUND: Identifying high-order genetics associations with non-additive (i.e. epistatic) effects in population-based studies of common human diseases is a computational challenge. Multifactor dimensionality reduction (MDR) is a machine learning method that was designed specifically for this probl...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3618340/ https://www.ncbi.nlm.nih.gov/pubmed/23418869 http://dx.doi.org/10.1186/1756-0381-6-4 |
_version_ | 1782265404618440704 |
---|---|
author | Collins, Ryan L Hu, Ting Wejse, Christian Sirugo, Giorgio Williams, Scott M Moore, Jason H |
author_facet | Collins, Ryan L Hu, Ting Wejse, Christian Sirugo, Giorgio Williams, Scott M Moore, Jason H |
author_sort | Collins, Ryan L |
collection | PubMed |
description | BACKGROUND: Identifying high-order genetics associations with non-additive (i.e. epistatic) effects in population-based studies of common human diseases is a computational challenge. Multifactor dimensionality reduction (MDR) is a machine learning method that was designed specifically for this problem. The goal of the present study was to apply MDR to mining high-order epistatic interactions in a population-based genetic study of tuberculosis (TB). RESULTS: The study used a previously published data set consisting of 19 candidate single-nucleotide polymorphisms (SNPs) in 321 pulmonary TB cases and 347 healthy controls from Guniea-Bissau in Africa. The ReliefF algorithm was applied first to generate a smaller set of the five most informative SNPs. MDR with 10-fold cross-validation was then applied to look at all possible combinations of two, three, four and five SNPs. The MDR model with the best testing accuracy (TA) consisted of SNPs rs2305619, rs187084, and rs11465421 (TA = 0.588) in PTX3, TLR9 and DC-Sign, respectively. A general 1000-fold permutation test of the null hypothesis of no association confirmed the statistical significance of the model (p = 0.008). An additional 1000-fold permutation test designed specifically to test the linear null hypothesis that the association effects are only additive confirmed the presence of non-additive (i.e. nonlinear) or epistatic effects (p = 0.013). An independent information-gain measure corroborated these results with a third-order epistatic interaction that was stronger than any lower-order associations. CONCLUSIONS: We have identified statistically significant evidence for a three-way epistatic interaction that is associated with susceptibility to TB. This interaction is stronger than any previously described one-way or two-way associations. This study highlights the importance of using machine learning methods that are designed to embrace, rather than ignore, the complexity of common diseases such as TB. We recommend future studies of the genetics of TB take into account the possibility that high-order epistatic interactions might play an important role in disease susceptibility. |
format | Online Article Text |
id | pubmed-3618340 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-36183402013-04-07 Multifactor dimensionality reduction reveals a three-locus epistatic interaction associated with susceptibility to pulmonary tuberculosis Collins, Ryan L Hu, Ting Wejse, Christian Sirugo, Giorgio Williams, Scott M Moore, Jason H BioData Min Short Report BACKGROUND: Identifying high-order genetics associations with non-additive (i.e. epistatic) effects in population-based studies of common human diseases is a computational challenge. Multifactor dimensionality reduction (MDR) is a machine learning method that was designed specifically for this problem. The goal of the present study was to apply MDR to mining high-order epistatic interactions in a population-based genetic study of tuberculosis (TB). RESULTS: The study used a previously published data set consisting of 19 candidate single-nucleotide polymorphisms (SNPs) in 321 pulmonary TB cases and 347 healthy controls from Guniea-Bissau in Africa. The ReliefF algorithm was applied first to generate a smaller set of the five most informative SNPs. MDR with 10-fold cross-validation was then applied to look at all possible combinations of two, three, four and five SNPs. The MDR model with the best testing accuracy (TA) consisted of SNPs rs2305619, rs187084, and rs11465421 (TA = 0.588) in PTX3, TLR9 and DC-Sign, respectively. A general 1000-fold permutation test of the null hypothesis of no association confirmed the statistical significance of the model (p = 0.008). An additional 1000-fold permutation test designed specifically to test the linear null hypothesis that the association effects are only additive confirmed the presence of non-additive (i.e. nonlinear) or epistatic effects (p = 0.013). An independent information-gain measure corroborated these results with a third-order epistatic interaction that was stronger than any lower-order associations. CONCLUSIONS: We have identified statistically significant evidence for a three-way epistatic interaction that is associated with susceptibility to TB. This interaction is stronger than any previously described one-way or two-way associations. This study highlights the importance of using machine learning methods that are designed to embrace, rather than ignore, the complexity of common diseases such as TB. We recommend future studies of the genetics of TB take into account the possibility that high-order epistatic interactions might play an important role in disease susceptibility. BioMed Central 2013-02-18 /pmc/articles/PMC3618340/ /pubmed/23418869 http://dx.doi.org/10.1186/1756-0381-6-4 Text en Copyright © 2013 Collins et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Short Report Collins, Ryan L Hu, Ting Wejse, Christian Sirugo, Giorgio Williams, Scott M Moore, Jason H Multifactor dimensionality reduction reveals a three-locus epistatic interaction associated with susceptibility to pulmonary tuberculosis |
title | Multifactor dimensionality reduction reveals a three-locus epistatic interaction associated with susceptibility to pulmonary tuberculosis |
title_full | Multifactor dimensionality reduction reveals a three-locus epistatic interaction associated with susceptibility to pulmonary tuberculosis |
title_fullStr | Multifactor dimensionality reduction reveals a three-locus epistatic interaction associated with susceptibility to pulmonary tuberculosis |
title_full_unstemmed | Multifactor dimensionality reduction reveals a three-locus epistatic interaction associated with susceptibility to pulmonary tuberculosis |
title_short | Multifactor dimensionality reduction reveals a three-locus epistatic interaction associated with susceptibility to pulmonary tuberculosis |
title_sort | multifactor dimensionality reduction reveals a three-locus epistatic interaction associated with susceptibility to pulmonary tuberculosis |
topic | Short Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3618340/ https://www.ncbi.nlm.nih.gov/pubmed/23418869 http://dx.doi.org/10.1186/1756-0381-6-4 |
work_keys_str_mv | AT collinsryanl multifactordimensionalityreductionrevealsathreelocusepistaticinteractionassociatedwithsusceptibilitytopulmonarytuberculosis AT huting multifactordimensionalityreductionrevealsathreelocusepistaticinteractionassociatedwithsusceptibilitytopulmonarytuberculosis AT wejsechristian multifactordimensionalityreductionrevealsathreelocusepistaticinteractionassociatedwithsusceptibilitytopulmonarytuberculosis AT sirugogiorgio multifactordimensionalityreductionrevealsathreelocusepistaticinteractionassociatedwithsusceptibilitytopulmonarytuberculosis AT williamsscottm multifactordimensionalityreductionrevealsathreelocusepistaticinteractionassociatedwithsusceptibilitytopulmonarytuberculosis AT moorejasonh multifactordimensionalityreductionrevealsathreelocusepistaticinteractionassociatedwithsusceptibilitytopulmonarytuberculosis |