Cargando…

Application of Bayesian network structure learning to identify causal variant SNPs from resequencing data

Using single-nucleotide polymorphism (SNP) genotypes from the 1000 Genomes Project pilot3 data provided for Genetic Analysis Workshop 17 (GAW17), we applied Bayesian network structure learning (BNSL) to identify potential causal SNPs associated with the Affected phenotype. We focus on the setting in...

Descripción completa

Detalles Bibliográficos
Autores principales: Schlosberg, Christopher E, Schwantes-An, Tae-Hwi, Duan, Weimin, Saccone, Nancy L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287832/
https://www.ncbi.nlm.nih.gov/pubmed/22373088
http://dx.doi.org/10.1186/1753-6561-5-S9-S109
_version_ 1782224753791074304
author Schlosberg, Christopher E
Schwantes-An, Tae-Hwi
Duan, Weimin
Saccone, Nancy L
author_facet Schlosberg, Christopher E
Schwantes-An, Tae-Hwi
Duan, Weimin
Saccone, Nancy L
author_sort Schlosberg, Christopher E
collection PubMed
description Using single-nucleotide polymorphism (SNP) genotypes from the 1000 Genomes Project pilot3 data provided for Genetic Analysis Workshop 17 (GAW17), we applied Bayesian network structure learning (BNSL) to identify potential causal SNPs associated with the Affected phenotype. We focus on the setting in which target genes that harbor causal variants have already been chosen for resequencing; the goal was to detect true causal SNPs from among the measured variants in these genes. Examining all available SNPs in the known causal genes, BNSL produced a Bayesian network from which subsets of SNPs connected to the Affected outcome were identified and measured for statistical significance using the hypergeometric distribution. The exploratory phase of analysis for pooled replicates sometimes identified a set of involved SNPs that contained more true causal SNPs than expected by chance in the Asian population. Analyses of single replicates gave inconsistent results. No nominally significant results were found in analyses of African or European populations. Overall, the method was not able to identify sets of involved SNPs that included a higher proportion of true causal SNPs than expected by chance alone. We conclude that this method, as currently applied, is not effective for identifying causal SNPs that follow the simulation model for the GAW17 data set, which includes many rare causal SNPs.
format Online
Article
Text
id pubmed-3287832
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32878322012-02-28 Application of Bayesian network structure learning to identify causal variant SNPs from resequencing data Schlosberg, Christopher E Schwantes-An, Tae-Hwi Duan, Weimin Saccone, Nancy L BMC Proc Proceedings Using single-nucleotide polymorphism (SNP) genotypes from the 1000 Genomes Project pilot3 data provided for Genetic Analysis Workshop 17 (GAW17), we applied Bayesian network structure learning (BNSL) to identify potential causal SNPs associated with the Affected phenotype. We focus on the setting in which target genes that harbor causal variants have already been chosen for resequencing; the goal was to detect true causal SNPs from among the measured variants in these genes. Examining all available SNPs in the known causal genes, BNSL produced a Bayesian network from which subsets of SNPs connected to the Affected outcome were identified and measured for statistical significance using the hypergeometric distribution. The exploratory phase of analysis for pooled replicates sometimes identified a set of involved SNPs that contained more true causal SNPs than expected by chance in the Asian population. Analyses of single replicates gave inconsistent results. No nominally significant results were found in analyses of African or European populations. Overall, the method was not able to identify sets of involved SNPs that included a higher proportion of true causal SNPs than expected by chance alone. We conclude that this method, as currently applied, is not effective for identifying causal SNPs that follow the simulation model for the GAW17 data set, which includes many rare causal SNPs. BioMed Central 2011-11-29 /pmc/articles/PMC3287832/ /pubmed/22373088 http://dx.doi.org/10.1186/1753-6561-5-S9-S109 Text en Copyright ©2011 Schlosberg et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Schlosberg, Christopher E
Schwantes-An, Tae-Hwi
Duan, Weimin
Saccone, Nancy L
Application of Bayesian network structure learning to identify causal variant SNPs from resequencing data
title Application of Bayesian network structure learning to identify causal variant SNPs from resequencing data
title_full Application of Bayesian network structure learning to identify causal variant SNPs from resequencing data
title_fullStr Application of Bayesian network structure learning to identify causal variant SNPs from resequencing data
title_full_unstemmed Application of Bayesian network structure learning to identify causal variant SNPs from resequencing data
title_short Application of Bayesian network structure learning to identify causal variant SNPs from resequencing data
title_sort application of bayesian network structure learning to identify causal variant snps from resequencing data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287832/
https://www.ncbi.nlm.nih.gov/pubmed/22373088
http://dx.doi.org/10.1186/1753-6561-5-S9-S109
work_keys_str_mv AT schlosbergchristophere applicationofbayesiannetworkstructurelearningtoidentifycausalvariantsnpsfromresequencingdata
AT schwantesantaehwi applicationofbayesiannetworkstructurelearningtoidentifycausalvariantsnpsfromresequencingdata
AT duanweimin applicationofbayesiannetworkstructurelearningtoidentifycausalvariantsnpsfromresequencingdata
AT sacconenancyl applicationofbayesiannetworkstructurelearningtoidentifycausalvariantsnpsfromresequencingdata