Cargando…

A Latent Model for Prioritization of SNPs for Functional Studies

One difficult question facing researchers is how to prioritize SNPs detected from genetic association studies for functional studies. Often a list of the top M SNPs is determined based on solely the p-value from an association analysis, where M is determined by financial/time constraints. For many s...

Descripción completa

Detalles Bibliográficos
Autores principales: Fridley, Brooke L., Iversen, Ed, Tsai, Ya-Yu, Jenkins, Gregory D., Goode, Ellen L., Sellers, Thomas A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3110798/
https://www.ncbi.nlm.nih.gov/pubmed/21687685
http://dx.doi.org/10.1371/journal.pone.0020764
_version_ 1782205554150604800
author Fridley, Brooke L.
Iversen, Ed
Tsai, Ya-Yu
Jenkins, Gregory D.
Goode, Ellen L.
Sellers, Thomas A.
author_facet Fridley, Brooke L.
Iversen, Ed
Tsai, Ya-Yu
Jenkins, Gregory D.
Goode, Ellen L.
Sellers, Thomas A.
author_sort Fridley, Brooke L.
collection PubMed
description One difficult question facing researchers is how to prioritize SNPs detected from genetic association studies for functional studies. Often a list of the top M SNPs is determined based on solely the p-value from an association analysis, where M is determined by financial/time constraints. For many studies of complex diseases, multiple analyses have been completed and integrating these multiple sets of results may be difficult. One may also wish to incorporate biological knowledge, such as whether the SNP is in the exon of a gene or a regulatory region, into the selection of markers to follow-up. In this manuscript, we propose a Bayesian latent variable model (BLVM) for incorporating “features” about a SNP to estimate a latent “quality score”, with SNPs prioritized based on the posterior probability distribution of the rankings of these quality scores. We illustrate the method using data from an ovarian cancer genome-wide association study (GWAS). In addition to the application of the BLVM to the ovarian GWAS, we applied the BLVM to simulated data which mimics the setting involving the prioritization of markers across multiple GWAS for related diseases/traits. The top ranked SNP by BLVM for the ovarian GWAS, ranked 2(nd) and 7(th) based on p-values from analyses of all invasive and invasive serous cases. The top SNP based on serous case analysis p-value (which ranked 197(th) for invasive case analysis), was ranked 8(th) based on the posterior probability of being in the top 5 markers (0.13). In summary, the application of the BLVM allows for the systematic integration of multiple SNP “features” for the prioritization of loci for fine-mapping or functional studies, taking into account the uncertainty in ranking.
format Online
Article
Text
id pubmed-3110798
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31107982011-06-16 A Latent Model for Prioritization of SNPs for Functional Studies Fridley, Brooke L. Iversen, Ed Tsai, Ya-Yu Jenkins, Gregory D. Goode, Ellen L. Sellers, Thomas A. PLoS One Research Article One difficult question facing researchers is how to prioritize SNPs detected from genetic association studies for functional studies. Often a list of the top M SNPs is determined based on solely the p-value from an association analysis, where M is determined by financial/time constraints. For many studies of complex diseases, multiple analyses have been completed and integrating these multiple sets of results may be difficult. One may also wish to incorporate biological knowledge, such as whether the SNP is in the exon of a gene or a regulatory region, into the selection of markers to follow-up. In this manuscript, we propose a Bayesian latent variable model (BLVM) for incorporating “features” about a SNP to estimate a latent “quality score”, with SNPs prioritized based on the posterior probability distribution of the rankings of these quality scores. We illustrate the method using data from an ovarian cancer genome-wide association study (GWAS). In addition to the application of the BLVM to the ovarian GWAS, we applied the BLVM to simulated data which mimics the setting involving the prioritization of markers across multiple GWAS for related diseases/traits. The top ranked SNP by BLVM for the ovarian GWAS, ranked 2(nd) and 7(th) based on p-values from analyses of all invasive and invasive serous cases. The top SNP based on serous case analysis p-value (which ranked 197(th) for invasive case analysis), was ranked 8(th) based on the posterior probability of being in the top 5 markers (0.13). In summary, the application of the BLVM allows for the systematic integration of multiple SNP “features” for the prioritization of loci for fine-mapping or functional studies, taking into account the uncertainty in ranking. Public Library of Science 2011-06-08 /pmc/articles/PMC3110798/ /pubmed/21687685 http://dx.doi.org/10.1371/journal.pone.0020764 Text en Fridley et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Fridley, Brooke L.
Iversen, Ed
Tsai, Ya-Yu
Jenkins, Gregory D.
Goode, Ellen L.
Sellers, Thomas A.
A Latent Model for Prioritization of SNPs for Functional Studies
title A Latent Model for Prioritization of SNPs for Functional Studies
title_full A Latent Model for Prioritization of SNPs for Functional Studies
title_fullStr A Latent Model for Prioritization of SNPs for Functional Studies
title_full_unstemmed A Latent Model for Prioritization of SNPs for Functional Studies
title_short A Latent Model for Prioritization of SNPs for Functional Studies
title_sort latent model for prioritization of snps for functional studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3110798/
https://www.ncbi.nlm.nih.gov/pubmed/21687685
http://dx.doi.org/10.1371/journal.pone.0020764
work_keys_str_mv AT fridleybrookel alatentmodelforprioritizationofsnpsforfunctionalstudies
AT iversened alatentmodelforprioritizationofsnpsforfunctionalstudies
AT tsaiyayu alatentmodelforprioritizationofsnpsforfunctionalstudies
AT jenkinsgregoryd alatentmodelforprioritizationofsnpsforfunctionalstudies
AT goodeellenl alatentmodelforprioritizationofsnpsforfunctionalstudies
AT sellersthomasa alatentmodelforprioritizationofsnpsforfunctionalstudies
AT fridleybrookel latentmodelforprioritizationofsnpsforfunctionalstudies
AT iversened latentmodelforprioritizationofsnpsforfunctionalstudies
AT tsaiyayu latentmodelforprioritizationofsnpsforfunctionalstudies
AT jenkinsgregoryd latentmodelforprioritizationofsnpsforfunctionalstudies
AT goodeellenl latentmodelforprioritizationofsnpsforfunctionalstudies
AT sellersthomasa latentmodelforprioritizationofsnpsforfunctionalstudies