Cargando…
ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information
Identifying single nucleotide polymorphism (SNP) interactions is considered as a popular and crucial way for explaining the missing heritability of complex diseases in genome-wide association studies (GWAS). Many approaches have been proposed to detect SNP interactions. However, existing approaches...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6121365/ https://www.ncbi.nlm.nih.gov/pubmed/30072632 http://dx.doi.org/10.3390/ijms19082267 |
_version_ | 1783352450460155904 |
---|---|
author | Cao, Xia Yu, Guoxian Liu, Jie Jia, Lianyin Wang, Jun |
author_facet | Cao, Xia Yu, Guoxian Liu, Jie Jia, Lianyin Wang, Jun |
author_sort | Cao, Xia |
collection | PubMed |
description | Identifying single nucleotide polymorphism (SNP) interactions is considered as a popular and crucial way for explaining the missing heritability of complex diseases in genome-wide association studies (GWAS). Many approaches have been proposed to detect SNP interactions. However, existing approaches generally suffer from the high computational complexity resulting from the explosion of candidate high-order interactions. In this paper, we propose a two-stage approach (called ClusterMI) to detect high-order genome-wide SNP interactions based on significant pairwise SNP combinations. In the screening stage, to alleviate the huge computational burden, ClusterMI firstly applies a clustering algorithm combined with mutual information to divide SNPs into different clusters. Then, ClusterMI utilizes conditional mutual information to screen significant pairwise SNP combinations in each cluster. In this way, there is a higher probability of identifying significant two-locus combinations in each group, and the computational load for the follow-up search can be greatly reduced. In the search stage, two different search strategies (exhaustive search and improved ant colony optimization search) are provided to detect high-order SNP interactions based on the cardinality of significant two-locus combinations. Extensive simulation experiments show that ClusterMI has better performance than other related and competitive approaches. Experiments on two real case-control datasets from Wellcome Trust Case Control Consortium (WTCCC) also demonstrate that ClusterMI is more capable of identifying high-order SNP interactions from genome-wide data. |
format | Online Article Text |
id | pubmed-6121365 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-61213652018-09-07 ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information Cao, Xia Yu, Guoxian Liu, Jie Jia, Lianyin Wang, Jun Int J Mol Sci Article Identifying single nucleotide polymorphism (SNP) interactions is considered as a popular and crucial way for explaining the missing heritability of complex diseases in genome-wide association studies (GWAS). Many approaches have been proposed to detect SNP interactions. However, existing approaches generally suffer from the high computational complexity resulting from the explosion of candidate high-order interactions. In this paper, we propose a two-stage approach (called ClusterMI) to detect high-order genome-wide SNP interactions based on significant pairwise SNP combinations. In the screening stage, to alleviate the huge computational burden, ClusterMI firstly applies a clustering algorithm combined with mutual information to divide SNPs into different clusters. Then, ClusterMI utilizes conditional mutual information to screen significant pairwise SNP combinations in each cluster. In this way, there is a higher probability of identifying significant two-locus combinations in each group, and the computational load for the follow-up search can be greatly reduced. In the search stage, two different search strategies (exhaustive search and improved ant colony optimization search) are provided to detect high-order SNP interactions based on the cardinality of significant two-locus combinations. Extensive simulation experiments show that ClusterMI has better performance than other related and competitive approaches. Experiments on two real case-control datasets from Wellcome Trust Case Control Consortium (WTCCC) also demonstrate that ClusterMI is more capable of identifying high-order SNP interactions from genome-wide data. MDPI 2018-08-02 /pmc/articles/PMC6121365/ /pubmed/30072632 http://dx.doi.org/10.3390/ijms19082267 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Cao, Xia Yu, Guoxian Liu, Jie Jia, Lianyin Wang, Jun ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information |
title | ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information |
title_full | ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information |
title_fullStr | ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information |
title_full_unstemmed | ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information |
title_short | ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information |
title_sort | clustermi: detecting high-order snp interactions based on clustering and mutual information |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6121365/ https://www.ncbi.nlm.nih.gov/pubmed/30072632 http://dx.doi.org/10.3390/ijms19082267 |
work_keys_str_mv | AT caoxia clustermidetectinghighordersnpinteractionsbasedonclusteringandmutualinformation AT yuguoxian clustermidetectinghighordersnpinteractionsbasedonclusteringandmutualinformation AT liujie clustermidetectinghighordersnpinteractionsbasedonclusteringandmutualinformation AT jialianyin clustermidetectinghighordersnpinteractionsbasedonclusteringandmutualinformation AT wangjun clustermidetectinghighordersnpinteractionsbasedonclusteringandmutualinformation |