Cargando…
A Latent Model for Prioritization of SNPs for Functional Studies
One difficult question facing researchers is how to prioritize SNPs detected from genetic association studies for functional studies. Often a list of the top M SNPs is determined based on solely the p-value from an association analysis, where M is determined by financial/time constraints. For many s...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3110798/ https://www.ncbi.nlm.nih.gov/pubmed/21687685 http://dx.doi.org/10.1371/journal.pone.0020764 |
_version_ | 1782205554150604800 |
---|---|
author | Fridley, Brooke L. Iversen, Ed Tsai, Ya-Yu Jenkins, Gregory D. Goode, Ellen L. Sellers, Thomas A. |
author_facet | Fridley, Brooke L. Iversen, Ed Tsai, Ya-Yu Jenkins, Gregory D. Goode, Ellen L. Sellers, Thomas A. |
author_sort | Fridley, Brooke L. |
collection | PubMed |
description | One difficult question facing researchers is how to prioritize SNPs detected from genetic association studies for functional studies. Often a list of the top M SNPs is determined based on solely the p-value from an association analysis, where M is determined by financial/time constraints. For many studies of complex diseases, multiple analyses have been completed and integrating these multiple sets of results may be difficult. One may also wish to incorporate biological knowledge, such as whether the SNP is in the exon of a gene or a regulatory region, into the selection of markers to follow-up. In this manuscript, we propose a Bayesian latent variable model (BLVM) for incorporating “features” about a SNP to estimate a latent “quality score”, with SNPs prioritized based on the posterior probability distribution of the rankings of these quality scores. We illustrate the method using data from an ovarian cancer genome-wide association study (GWAS). In addition to the application of the BLVM to the ovarian GWAS, we applied the BLVM to simulated data which mimics the setting involving the prioritization of markers across multiple GWAS for related diseases/traits. The top ranked SNP by BLVM for the ovarian GWAS, ranked 2(nd) and 7(th) based on p-values from analyses of all invasive and invasive serous cases. The top SNP based on serous case analysis p-value (which ranked 197(th) for invasive case analysis), was ranked 8(th) based on the posterior probability of being in the top 5 markers (0.13). In summary, the application of the BLVM allows for the systematic integration of multiple SNP “features” for the prioritization of loci for fine-mapping or functional studies, taking into account the uncertainty in ranking. |
format | Online Article Text |
id | pubmed-3110798 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-31107982011-06-16 A Latent Model for Prioritization of SNPs for Functional Studies Fridley, Brooke L. Iversen, Ed Tsai, Ya-Yu Jenkins, Gregory D. Goode, Ellen L. Sellers, Thomas A. PLoS One Research Article One difficult question facing researchers is how to prioritize SNPs detected from genetic association studies for functional studies. Often a list of the top M SNPs is determined based on solely the p-value from an association analysis, where M is determined by financial/time constraints. For many studies of complex diseases, multiple analyses have been completed and integrating these multiple sets of results may be difficult. One may also wish to incorporate biological knowledge, such as whether the SNP is in the exon of a gene or a regulatory region, into the selection of markers to follow-up. In this manuscript, we propose a Bayesian latent variable model (BLVM) for incorporating “features” about a SNP to estimate a latent “quality score”, with SNPs prioritized based on the posterior probability distribution of the rankings of these quality scores. We illustrate the method using data from an ovarian cancer genome-wide association study (GWAS). In addition to the application of the BLVM to the ovarian GWAS, we applied the BLVM to simulated data which mimics the setting involving the prioritization of markers across multiple GWAS for related diseases/traits. The top ranked SNP by BLVM for the ovarian GWAS, ranked 2(nd) and 7(th) based on p-values from analyses of all invasive and invasive serous cases. The top SNP based on serous case analysis p-value (which ranked 197(th) for invasive case analysis), was ranked 8(th) based on the posterior probability of being in the top 5 markers (0.13). In summary, the application of the BLVM allows for the systematic integration of multiple SNP “features” for the prioritization of loci for fine-mapping or functional studies, taking into account the uncertainty in ranking. Public Library of Science 2011-06-08 /pmc/articles/PMC3110798/ /pubmed/21687685 http://dx.doi.org/10.1371/journal.pone.0020764 Text en Fridley et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Fridley, Brooke L. Iversen, Ed Tsai, Ya-Yu Jenkins, Gregory D. Goode, Ellen L. Sellers, Thomas A. A Latent Model for Prioritization of SNPs for Functional Studies |
title | A Latent Model for Prioritization of SNPs for Functional Studies |
title_full | A Latent Model for Prioritization of SNPs for Functional Studies |
title_fullStr | A Latent Model for Prioritization of SNPs for Functional Studies |
title_full_unstemmed | A Latent Model for Prioritization of SNPs for Functional Studies |
title_short | A Latent Model for Prioritization of SNPs for Functional Studies |
title_sort | latent model for prioritization of snps for functional studies |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3110798/ https://www.ncbi.nlm.nih.gov/pubmed/21687685 http://dx.doi.org/10.1371/journal.pone.0020764 |
work_keys_str_mv | AT fridleybrookel alatentmodelforprioritizationofsnpsforfunctionalstudies AT iversened alatentmodelforprioritizationofsnpsforfunctionalstudies AT tsaiyayu alatentmodelforprioritizationofsnpsforfunctionalstudies AT jenkinsgregoryd alatentmodelforprioritizationofsnpsforfunctionalstudies AT goodeellenl alatentmodelforprioritizationofsnpsforfunctionalstudies AT sellersthomasa alatentmodelforprioritizationofsnpsforfunctionalstudies AT fridleybrookel latentmodelforprioritizationofsnpsforfunctionalstudies AT iversened latentmodelforprioritizationofsnpsforfunctionalstudies AT tsaiyayu latentmodelforprioritizationofsnpsforfunctionalstudies AT jenkinsgregoryd latentmodelforprioritizationofsnpsforfunctionalstudies AT goodeellenl latentmodelforprioritizationofsnpsforfunctionalstudies AT sellersthomasa latentmodelforprioritizationofsnpsforfunctionalstudies |