Cargando…
A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study
Genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants associated with various phenotypes, but together they explain only a fraction of heritability, suggesting many variants have yet to be discovered. Recently it has been recognized that incorporat...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7470748/ https://www.ncbi.nlm.nih.gov/pubmed/32833970 http://dx.doi.org/10.1371/journal.pgen.1008947 |
_version_ | 1783578640616783872 |
---|---|
author | Dong, Xinyuan Su, Yu-Ru Barfield, Richard Bien, Stephanie A. He, Qianchuan Harrison, Tabitha A. Huyghe, Jeroen R. Keku, Temitope O. Lindor, Noralane M. Schafmayer, Clemens Chan, Andrew T. Gruber, Stephen B. Jenkins, Mark A. Kooperberg, Charles Peters, Ulrike Hsu, Li |
author_facet | Dong, Xinyuan Su, Yu-Ru Barfield, Richard Bien, Stephanie A. He, Qianchuan Harrison, Tabitha A. Huyghe, Jeroen R. Keku, Temitope O. Lindor, Noralane M. Schafmayer, Clemens Chan, Andrew T. Gruber, Stephen B. Jenkins, Mark A. Kooperberg, Charles Peters, Ulrike Hsu, Li |
author_sort | Dong, Xinyuan |
collection | PubMed |
description | Genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants associated with various phenotypes, but together they explain only a fraction of heritability, suggesting many variants have yet to be discovered. Recently it has been recognized that incorporating functional information of genetic variants can improve power for identifying novel loci. For example, S-PrediXcan and TWAS tested the association of predicted gene expression with phenotypes based on GWAS summary statistics by leveraging the information on genetic regulation of gene expression and found many novel loci. However, as genetic variants may have effects on more than one gene and through different mechanisms, these methods likely only capture part of the total effects of these variants. In this paper, we propose a summary statistics-based mixed effects score test (sMiST) that tests for the total effect of both the effect of the mediator by imputing genetically predicted gene expression, like S-PrediXcan and TWAS, and the direct effects of individual variants. It allows for multiple functional annotations and multiple genetically predicted mediators. It can also perform conditional association analysis while adjusting for other genetic variants (e.g., known loci for the phenotype). Extensive simulation and real data analyses demonstrate that sMiST yields p-values that agree well with those obtained from individual level data but with substantively improved computational speed. Importantly, a broad application of sMiST to GWAS is possible, as only summary statistics of genetic variant associations are required. We apply sMiST to a large-scale GWAS of colorectal cancer using summary statistics from ∼120, 000 study participants and gene expression data from the Genotype-Tissue Expression (GTEx) project. We identify several novel and secondary independent genetic loci. |
format | Online Article Text |
id | pubmed-7470748 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-74707482020-09-14 A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study Dong, Xinyuan Su, Yu-Ru Barfield, Richard Bien, Stephanie A. He, Qianchuan Harrison, Tabitha A. Huyghe, Jeroen R. Keku, Temitope O. Lindor, Noralane M. Schafmayer, Clemens Chan, Andrew T. Gruber, Stephen B. Jenkins, Mark A. Kooperberg, Charles Peters, Ulrike Hsu, Li PLoS Genet Research Article Genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants associated with various phenotypes, but together they explain only a fraction of heritability, suggesting many variants have yet to be discovered. Recently it has been recognized that incorporating functional information of genetic variants can improve power for identifying novel loci. For example, S-PrediXcan and TWAS tested the association of predicted gene expression with phenotypes based on GWAS summary statistics by leveraging the information on genetic regulation of gene expression and found many novel loci. However, as genetic variants may have effects on more than one gene and through different mechanisms, these methods likely only capture part of the total effects of these variants. In this paper, we propose a summary statistics-based mixed effects score test (sMiST) that tests for the total effect of both the effect of the mediator by imputing genetically predicted gene expression, like S-PrediXcan and TWAS, and the direct effects of individual variants. It allows for multiple functional annotations and multiple genetically predicted mediators. It can also perform conditional association analysis while adjusting for other genetic variants (e.g., known loci for the phenotype). Extensive simulation and real data analyses demonstrate that sMiST yields p-values that agree well with those obtained from individual level data but with substantively improved computational speed. Importantly, a broad application of sMiST to GWAS is possible, as only summary statistics of genetic variant associations are required. We apply sMiST to a large-scale GWAS of colorectal cancer using summary statistics from ∼120, 000 study participants and gene expression data from the Genotype-Tissue Expression (GTEx) project. We identify several novel and secondary independent genetic loci. Public Library of Science 2020-08-24 /pmc/articles/PMC7470748/ /pubmed/32833970 http://dx.doi.org/10.1371/journal.pgen.1008947 Text en © 2020 Dong et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Dong, Xinyuan Su, Yu-Ru Barfield, Richard Bien, Stephanie A. He, Qianchuan Harrison, Tabitha A. Huyghe, Jeroen R. Keku, Temitope O. Lindor, Noralane M. Schafmayer, Clemens Chan, Andrew T. Gruber, Stephen B. Jenkins, Mark A. Kooperberg, Charles Peters, Ulrike Hsu, Li A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study |
title | A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study |
title_full | A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study |
title_fullStr | A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study |
title_full_unstemmed | A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study |
title_short | A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study |
title_sort | general framework for functionally informed set-based analysis: application to a large-scale colorectal cancer study |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7470748/ https://www.ncbi.nlm.nih.gov/pubmed/32833970 http://dx.doi.org/10.1371/journal.pgen.1008947 |
work_keys_str_mv | AT dongxinyuan ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT suyuru ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT barfieldrichard ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT bienstephaniea ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT heqianchuan ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT harrisontabithaa ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT huyghejeroenr ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT kekutemitopeo ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT lindornoralanem ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT schafmayerclemens ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT chanandrewt ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT gruberstephenb ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT jenkinsmarka ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT kooperbergcharles ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT petersulrike ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT hsuli ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT dongxinyuan generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT suyuru generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT barfieldrichard generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT bienstephaniea generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT heqianchuan generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT harrisontabithaa generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT huyghejeroenr generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT kekutemitopeo generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT lindornoralanem generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT schafmayerclemens generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT chanandrewt generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT gruberstephenb generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT jenkinsmarka generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT kooperbergcharles generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT petersulrike generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy AT hsuli generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy |