Cargando…
A Markov blanket-based method for detecting causal SNPs in GWAS
BACKGROUND: Detecting epistatic interactions associated with complex and common diseases can help to improve prevention, diagnosis and treatment of these diseases. With the development of genome-wide association studies (GWAS), designing powerful and robust computational method for identifying epist...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2863064/ https://www.ncbi.nlm.nih.gov/pubmed/20438652 http://dx.doi.org/10.1186/1471-2105-11-S3-S5 |
_version_ | 1782180742343688192 |
---|---|
author | Han, Bing Park, Meeyoung Chen, Xue-wen |
author_facet | Han, Bing Park, Meeyoung Chen, Xue-wen |
author_sort | Han, Bing |
collection | PubMed |
description | BACKGROUND: Detecting epistatic interactions associated with complex and common diseases can help to improve prevention, diagnosis and treatment of these diseases. With the development of genome-wide association studies (GWAS), designing powerful and robust computational method for identifying epistatic interactions associated with common diseases becomes a great challenge to bioinformatics society, because the study of epistatic interactions often deals with the large size of the genotyped data and the huge amount of combinations of all the possible genetic factors. Most existing computational detection methods are based on the classification capacity of SNP sets, which may fail to identify SNP sets that are strongly associated with the diseases and introduce a lot of false positives. In addition, most methods are not suitable for genome-wide scale studies due to their computational complexity. RESULTS: We propose a new Markov Blanket-based method, DASSO-MB (Detection of ASSOciations using Markov Blanket) to detect epistatic interactions in case-control GWAS. Markov blanket of a target variable T can completely shield T from all other variables. Thus, we can guarantee that the SNP set detected by DASSO-MB has a strong association with diseases and contains fewest false positives. Furthermore, DASSO-MB uses a heuristic search strategy by calculating the association between variables to avoid the time-consuming training process as in other machine-learning methods. We apply our algorithm to simulated datasets and a real case-control dataset. We compare DASSO-MB to other commonly-used methods and show that our method significantly outperforms other methods and is capable of finding SNPs strongly associated with diseases. CONCLUSIONS: Our study shows that DASSO-MB can identify a minimal set of causal SNPs associated with diseases, which contains less false positives compared to other existing methods. Given the huge size of genomic dataset produced by GWAS, this is critical in saving the potential costs of biological experiments and being an efficient guideline for pathogenesis research. |
format | Text |
id | pubmed-2863064 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-28630642010-05-04 A Markov blanket-based method for detecting causal SNPs in GWAS Han, Bing Park, Meeyoung Chen, Xue-wen BMC Bioinformatics Proceedings BACKGROUND: Detecting epistatic interactions associated with complex and common diseases can help to improve prevention, diagnosis and treatment of these diseases. With the development of genome-wide association studies (GWAS), designing powerful and robust computational method for identifying epistatic interactions associated with common diseases becomes a great challenge to bioinformatics society, because the study of epistatic interactions often deals with the large size of the genotyped data and the huge amount of combinations of all the possible genetic factors. Most existing computational detection methods are based on the classification capacity of SNP sets, which may fail to identify SNP sets that are strongly associated with the diseases and introduce a lot of false positives. In addition, most methods are not suitable for genome-wide scale studies due to their computational complexity. RESULTS: We propose a new Markov Blanket-based method, DASSO-MB (Detection of ASSOciations using Markov Blanket) to detect epistatic interactions in case-control GWAS. Markov blanket of a target variable T can completely shield T from all other variables. Thus, we can guarantee that the SNP set detected by DASSO-MB has a strong association with diseases and contains fewest false positives. Furthermore, DASSO-MB uses a heuristic search strategy by calculating the association between variables to avoid the time-consuming training process as in other machine-learning methods. We apply our algorithm to simulated datasets and a real case-control dataset. We compare DASSO-MB to other commonly-used methods and show that our method significantly outperforms other methods and is capable of finding SNPs strongly associated with diseases. CONCLUSIONS: Our study shows that DASSO-MB can identify a minimal set of causal SNPs associated with diseases, which contains less false positives compared to other existing methods. Given the huge size of genomic dataset produced by GWAS, this is critical in saving the potential costs of biological experiments and being an efficient guideline for pathogenesis research. BioMed Central 2010-04-29 /pmc/articles/PMC2863064/ /pubmed/20438652 http://dx.doi.org/10.1186/1471-2105-11-S3-S5 Text en Copyright ©2010 Chen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Han, Bing Park, Meeyoung Chen, Xue-wen A Markov blanket-based method for detecting causal SNPs in GWAS |
title | A Markov blanket-based method for detecting causal SNPs in GWAS |
title_full | A Markov blanket-based method for detecting causal SNPs in GWAS |
title_fullStr | A Markov blanket-based method for detecting causal SNPs in GWAS |
title_full_unstemmed | A Markov blanket-based method for detecting causal SNPs in GWAS |
title_short | A Markov blanket-based method for detecting causal SNPs in GWAS |
title_sort | markov blanket-based method for detecting causal snps in gwas |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2863064/ https://www.ncbi.nlm.nih.gov/pubmed/20438652 http://dx.doi.org/10.1186/1471-2105-11-S3-S5 |
work_keys_str_mv | AT hanbing amarkovblanketbasedmethodfordetectingcausalsnpsingwas AT parkmeeyoung amarkovblanketbasedmethodfordetectingcausalsnpsingwas AT chenxuewen amarkovblanketbasedmethodfordetectingcausalsnpsingwas AT hanbing markovblanketbasedmethodfordetectingcausalsnpsingwas AT parkmeeyoung markovblanketbasedmethodfordetectingcausalsnpsingwas AT chenxuewen markovblanketbasedmethodfordetectingcausalsnpsingwas |