Cargando…

A Markov blanket-based method for detecting causal SNPs in GWAS

BACKGROUND: Detecting epistatic interactions associated with complex and common diseases can help to improve prevention, diagnosis and treatment of these diseases. With the development of genome-wide association studies (GWAS), designing powerful and robust computational method for identifying epist...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Bing, Park, Meeyoung, Chen, Xue-wen
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2863064/
https://www.ncbi.nlm.nih.gov/pubmed/20438652
http://dx.doi.org/10.1186/1471-2105-11-S3-S5
_version_ 1782180742343688192
author Han, Bing
Park, Meeyoung
Chen, Xue-wen
author_facet Han, Bing
Park, Meeyoung
Chen, Xue-wen
author_sort Han, Bing
collection PubMed
description BACKGROUND: Detecting epistatic interactions associated with complex and common diseases can help to improve prevention, diagnosis and treatment of these diseases. With the development of genome-wide association studies (GWAS), designing powerful and robust computational method for identifying epistatic interactions associated with common diseases becomes a great challenge to bioinformatics society, because the study of epistatic interactions often deals with the large size of the genotyped data and the huge amount of combinations of all the possible genetic factors. Most existing computational detection methods are based on the classification capacity of SNP sets, which may fail to identify SNP sets that are strongly associated with the diseases and introduce a lot of false positives. In addition, most methods are not suitable for genome-wide scale studies due to their computational complexity. RESULTS: We propose a new Markov Blanket-based method, DASSO-MB (Detection of ASSOciations using Markov Blanket) to detect epistatic interactions in case-control GWAS. Markov blanket of a target variable T can completely shield T from all other variables. Thus, we can guarantee that the SNP set detected by DASSO-MB has a strong association with diseases and contains fewest false positives. Furthermore, DASSO-MB uses a heuristic search strategy by calculating the association between variables to avoid the time-consuming training process as in other machine-learning methods. We apply our algorithm to simulated datasets and a real case-control dataset. We compare DASSO-MB to other commonly-used methods and show that our method significantly outperforms other methods and is capable of finding SNPs strongly associated with diseases. CONCLUSIONS: Our study shows that DASSO-MB can identify a minimal set of causal SNPs associated with diseases, which contains less false positives compared to other existing methods. Given the huge size of genomic dataset produced by GWAS, this is critical in saving the potential costs of biological experiments and being an efficient guideline for pathogenesis research.
format Text
id pubmed-2863064
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28630642010-05-04 A Markov blanket-based method for detecting causal SNPs in GWAS Han, Bing Park, Meeyoung Chen, Xue-wen BMC Bioinformatics Proceedings BACKGROUND: Detecting epistatic interactions associated with complex and common diseases can help to improve prevention, diagnosis and treatment of these diseases. With the development of genome-wide association studies (GWAS), designing powerful and robust computational method for identifying epistatic interactions associated with common diseases becomes a great challenge to bioinformatics society, because the study of epistatic interactions often deals with the large size of the genotyped data and the huge amount of combinations of all the possible genetic factors. Most existing computational detection methods are based on the classification capacity of SNP sets, which may fail to identify SNP sets that are strongly associated with the diseases and introduce a lot of false positives. In addition, most methods are not suitable for genome-wide scale studies due to their computational complexity. RESULTS: We propose a new Markov Blanket-based method, DASSO-MB (Detection of ASSOciations using Markov Blanket) to detect epistatic interactions in case-control GWAS. Markov blanket of a target variable T can completely shield T from all other variables. Thus, we can guarantee that the SNP set detected by DASSO-MB has a strong association with diseases and contains fewest false positives. Furthermore, DASSO-MB uses a heuristic search strategy by calculating the association between variables to avoid the time-consuming training process as in other machine-learning methods. We apply our algorithm to simulated datasets and a real case-control dataset. We compare DASSO-MB to other commonly-used methods and show that our method significantly outperforms other methods and is capable of finding SNPs strongly associated with diseases. CONCLUSIONS: Our study shows that DASSO-MB can identify a minimal set of causal SNPs associated with diseases, which contains less false positives compared to other existing methods. Given the huge size of genomic dataset produced by GWAS, this is critical in saving the potential costs of biological experiments and being an efficient guideline for pathogenesis research. BioMed Central 2010-04-29 /pmc/articles/PMC2863064/ /pubmed/20438652 http://dx.doi.org/10.1186/1471-2105-11-S3-S5 Text en Copyright ©2010 Chen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Han, Bing
Park, Meeyoung
Chen, Xue-wen
A Markov blanket-based method for detecting causal SNPs in GWAS
title A Markov blanket-based method for detecting causal SNPs in GWAS
title_full A Markov blanket-based method for detecting causal SNPs in GWAS
title_fullStr A Markov blanket-based method for detecting causal SNPs in GWAS
title_full_unstemmed A Markov blanket-based method for detecting causal SNPs in GWAS
title_short A Markov blanket-based method for detecting causal SNPs in GWAS
title_sort markov blanket-based method for detecting causal snps in gwas
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2863064/
https://www.ncbi.nlm.nih.gov/pubmed/20438652
http://dx.doi.org/10.1186/1471-2105-11-S3-S5
work_keys_str_mv AT hanbing amarkovblanketbasedmethodfordetectingcausalsnpsingwas
AT parkmeeyoung amarkovblanketbasedmethodfordetectingcausalsnpsingwas
AT chenxuewen amarkovblanketbasedmethodfordetectingcausalsnpsingwas
AT hanbing markovblanketbasedmethodfordetectingcausalsnpsingwas
AT parkmeeyoung markovblanketbasedmethodfordetectingcausalsnpsingwas
AT chenxuewen markovblanketbasedmethodfordetectingcausalsnpsingwas