Cargando…

MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study

BACKGROUND: The interactions of multiple single nucleotide polymorphisms (SNPs) are highly hypothesized to affect an individual's susceptibility to complex diseases. Although many works have been done to identify and quantify the importance of multi-SNP interactions, few of them could handle th...

Descripción completa

Detalles Bibliográficos
Autores principales: Wan, Xiang, Yang, Can, Yang, Qiang, Xue, Hong, Tang, Nelson LS, Yu, Weichuan
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2642772/
https://www.ncbi.nlm.nih.gov/pubmed/19134182
http://dx.doi.org/10.1186/1471-2105-10-13
_version_ 1782164648749957120
author Wan, Xiang
Yang, Can
Yang, Qiang
Xue, Hong
Tang, Nelson LS
Yu, Weichuan
author_facet Wan, Xiang
Yang, Can
Yang, Qiang
Xue, Hong
Tang, Nelson LS
Yu, Weichuan
author_sort Wan, Xiang
collection PubMed
description BACKGROUND: The interactions of multiple single nucleotide polymorphisms (SNPs) are highly hypothesized to affect an individual's susceptibility to complex diseases. Although many works have been done to identify and quantify the importance of multi-SNP interactions, few of them could handle the genome wide data due to the combinatorial explosive search space and the difficulty to statistically evaluate the high-order interactions given limited samples. RESULTS: Three comparative experiments are designed to evaluate the performance of MegaSNPHunter. The first experiment uses synthetic data generated on the basis of epistasis models. The second one uses a genome wide study on Parkinson disease (data acquired by using Illumina HumanHap300 SNP chips). The third one chooses the rheumatoid arthritis study from Wellcome Trust Case Control Consortium (WTCCC) using Affymetrix GeneChip 500K Mapping Array Set. MegaSNPHunter outperforms the best solution in this area and reports many potential interactions for the two real studies. CONCLUSION: The experimental results on both synthetic data and two real data sets demonstrate that our proposed approach outperforms the best solution that is currently available in handling large-scale SNP data both in terms of speed and in terms of detection of potential interactions that were not identified before. To our knowledge, MegaSNPHunter is the first approach that is capable of identifying the disease-associated SNP interactions from WTCCC studies and is promising for practical disease prognosis.
format Text
id pubmed-2642772
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26427722009-02-17 MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study Wan, Xiang Yang, Can Yang, Qiang Xue, Hong Tang, Nelson LS Yu, Weichuan BMC Bioinformatics Methodology Article BACKGROUND: The interactions of multiple single nucleotide polymorphisms (SNPs) are highly hypothesized to affect an individual's susceptibility to complex diseases. Although many works have been done to identify and quantify the importance of multi-SNP interactions, few of them could handle the genome wide data due to the combinatorial explosive search space and the difficulty to statistically evaluate the high-order interactions given limited samples. RESULTS: Three comparative experiments are designed to evaluate the performance of MegaSNPHunter. The first experiment uses synthetic data generated on the basis of epistasis models. The second one uses a genome wide study on Parkinson disease (data acquired by using Illumina HumanHap300 SNP chips). The third one chooses the rheumatoid arthritis study from Wellcome Trust Case Control Consortium (WTCCC) using Affymetrix GeneChip 500K Mapping Array Set. MegaSNPHunter outperforms the best solution in this area and reports many potential interactions for the two real studies. CONCLUSION: The experimental results on both synthetic data and two real data sets demonstrate that our proposed approach outperforms the best solution that is currently available in handling large-scale SNP data both in terms of speed and in terms of detection of potential interactions that were not identified before. To our knowledge, MegaSNPHunter is the first approach that is capable of identifying the disease-associated SNP interactions from WTCCC studies and is promising for practical disease prognosis. BioMed Central 2009-01-09 /pmc/articles/PMC2642772/ /pubmed/19134182 http://dx.doi.org/10.1186/1471-2105-10-13 Text en Copyright © 2009 Wan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Wan, Xiang
Yang, Can
Yang, Qiang
Xue, Hong
Tang, Nelson LS
Yu, Weichuan
MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study
title MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study
title_full MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study
title_fullStr MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study
title_full_unstemmed MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study
title_short MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study
title_sort megasnphunter: a learning approach to detect disease predisposition snps and high level interactions in genome wide association study
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2642772/
https://www.ncbi.nlm.nih.gov/pubmed/19134182
http://dx.doi.org/10.1186/1471-2105-10-13
work_keys_str_mv AT wanxiang megasnphunteralearningapproachtodetectdiseasepredispositionsnpsandhighlevelinteractionsingenomewideassociationstudy
AT yangcan megasnphunteralearningapproachtodetectdiseasepredispositionsnpsandhighlevelinteractionsingenomewideassociationstudy
AT yangqiang megasnphunteralearningapproachtodetectdiseasepredispositionsnpsandhighlevelinteractionsingenomewideassociationstudy
AT xuehong megasnphunteralearningapproachtodetectdiseasepredispositionsnpsandhighlevelinteractionsingenomewideassociationstudy
AT tangnelsonls megasnphunteralearningapproachtodetectdiseasepredispositionsnpsandhighlevelinteractionsingenomewideassociationstudy
AT yuweichuan megasnphunteralearningapproachtodetectdiseasepredispositionsnpsandhighlevelinteractionsingenomewideassociationstudy