Cargando…

ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information

Identifying single nucleotide polymorphism (SNP) interactions is considered as a popular and crucial way for explaining the missing heritability of complex diseases in genome-wide association studies (GWAS). Many approaches have been proposed to detect SNP interactions. However, existing approaches...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Xia, Yu, Guoxian, Liu, Jie, Jia, Lianyin, Wang, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6121365/
https://www.ncbi.nlm.nih.gov/pubmed/30072632
http://dx.doi.org/10.3390/ijms19082267
_version_ 1783352450460155904
author Cao, Xia
Yu, Guoxian
Liu, Jie
Jia, Lianyin
Wang, Jun
author_facet Cao, Xia
Yu, Guoxian
Liu, Jie
Jia, Lianyin
Wang, Jun
author_sort Cao, Xia
collection PubMed
description Identifying single nucleotide polymorphism (SNP) interactions is considered as a popular and crucial way for explaining the missing heritability of complex diseases in genome-wide association studies (GWAS). Many approaches have been proposed to detect SNP interactions. However, existing approaches generally suffer from the high computational complexity resulting from the explosion of candidate high-order interactions. In this paper, we propose a two-stage approach (called ClusterMI) to detect high-order genome-wide SNP interactions based on significant pairwise SNP combinations. In the screening stage, to alleviate the huge computational burden, ClusterMI firstly applies a clustering algorithm combined with mutual information to divide SNPs into different clusters. Then, ClusterMI utilizes conditional mutual information to screen significant pairwise SNP combinations in each cluster. In this way, there is a higher probability of identifying significant two-locus combinations in each group, and the computational load for the follow-up search can be greatly reduced. In the search stage, two different search strategies (exhaustive search and improved ant colony optimization search) are provided to detect high-order SNP interactions based on the cardinality of significant two-locus combinations. Extensive simulation experiments show that ClusterMI has better performance than other related and competitive approaches. Experiments on two real case-control datasets from Wellcome Trust Case Control Consortium (WTCCC) also demonstrate that ClusterMI is more capable of identifying high-order SNP interactions from genome-wide data.
format Online
Article
Text
id pubmed-6121365
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-61213652018-09-07 ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information Cao, Xia Yu, Guoxian Liu, Jie Jia, Lianyin Wang, Jun Int J Mol Sci Article Identifying single nucleotide polymorphism (SNP) interactions is considered as a popular and crucial way for explaining the missing heritability of complex diseases in genome-wide association studies (GWAS). Many approaches have been proposed to detect SNP interactions. However, existing approaches generally suffer from the high computational complexity resulting from the explosion of candidate high-order interactions. In this paper, we propose a two-stage approach (called ClusterMI) to detect high-order genome-wide SNP interactions based on significant pairwise SNP combinations. In the screening stage, to alleviate the huge computational burden, ClusterMI firstly applies a clustering algorithm combined with mutual information to divide SNPs into different clusters. Then, ClusterMI utilizes conditional mutual information to screen significant pairwise SNP combinations in each cluster. In this way, there is a higher probability of identifying significant two-locus combinations in each group, and the computational load for the follow-up search can be greatly reduced. In the search stage, two different search strategies (exhaustive search and improved ant colony optimization search) are provided to detect high-order SNP interactions based on the cardinality of significant two-locus combinations. Extensive simulation experiments show that ClusterMI has better performance than other related and competitive approaches. Experiments on two real case-control datasets from Wellcome Trust Case Control Consortium (WTCCC) also demonstrate that ClusterMI is more capable of identifying high-order SNP interactions from genome-wide data. MDPI 2018-08-02 /pmc/articles/PMC6121365/ /pubmed/30072632 http://dx.doi.org/10.3390/ijms19082267 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Cao, Xia
Yu, Guoxian
Liu, Jie
Jia, Lianyin
Wang, Jun
ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information
title ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information
title_full ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information
title_fullStr ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information
title_full_unstemmed ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information
title_short ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information
title_sort clustermi: detecting high-order snp interactions based on clustering and mutual information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6121365/
https://www.ncbi.nlm.nih.gov/pubmed/30072632
http://dx.doi.org/10.3390/ijms19082267
work_keys_str_mv AT caoxia clustermidetectinghighordersnpinteractionsbasedonclusteringandmutualinformation
AT yuguoxian clustermidetectinghighordersnpinteractionsbasedonclusteringandmutualinformation
AT liujie clustermidetectinghighordersnpinteractionsbasedonclusteringandmutualinformation
AT jialianyin clustermidetectinghighordersnpinteractionsbasedonclusteringandmutualinformation
AT wangjun clustermidetectinghighordersnpinteractionsbasedonclusteringandmutualinformation