Cargando…

A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study

Genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants associated with various phenotypes, but together they explain only a fraction of heritability, suggesting many variants have yet to be discovered. Recently it has been recognized that incorporat...

Descripción completa

Detalles Bibliográficos
Autores principales: Dong, Xinyuan, Su, Yu-Ru, Barfield, Richard, Bien, Stephanie A., He, Qianchuan, Harrison, Tabitha A., Huyghe, Jeroen R., Keku, Temitope O., Lindor, Noralane M., Schafmayer, Clemens, Chan, Andrew T., Gruber, Stephen B., Jenkins, Mark A., Kooperberg, Charles, Peters, Ulrike, Hsu, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7470748/
https://www.ncbi.nlm.nih.gov/pubmed/32833970
http://dx.doi.org/10.1371/journal.pgen.1008947
_version_ 1783578640616783872
author Dong, Xinyuan
Su, Yu-Ru
Barfield, Richard
Bien, Stephanie A.
He, Qianchuan
Harrison, Tabitha A.
Huyghe, Jeroen R.
Keku, Temitope O.
Lindor, Noralane M.
Schafmayer, Clemens
Chan, Andrew T.
Gruber, Stephen B.
Jenkins, Mark A.
Kooperberg, Charles
Peters, Ulrike
Hsu, Li
author_facet Dong, Xinyuan
Su, Yu-Ru
Barfield, Richard
Bien, Stephanie A.
He, Qianchuan
Harrison, Tabitha A.
Huyghe, Jeroen R.
Keku, Temitope O.
Lindor, Noralane M.
Schafmayer, Clemens
Chan, Andrew T.
Gruber, Stephen B.
Jenkins, Mark A.
Kooperberg, Charles
Peters, Ulrike
Hsu, Li
author_sort Dong, Xinyuan
collection PubMed
description Genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants associated with various phenotypes, but together they explain only a fraction of heritability, suggesting many variants have yet to be discovered. Recently it has been recognized that incorporating functional information of genetic variants can improve power for identifying novel loci. For example, S-PrediXcan and TWAS tested the association of predicted gene expression with phenotypes based on GWAS summary statistics by leveraging the information on genetic regulation of gene expression and found many novel loci. However, as genetic variants may have effects on more than one gene and through different mechanisms, these methods likely only capture part of the total effects of these variants. In this paper, we propose a summary statistics-based mixed effects score test (sMiST) that tests for the total effect of both the effect of the mediator by imputing genetically predicted gene expression, like S-PrediXcan and TWAS, and the direct effects of individual variants. It allows for multiple functional annotations and multiple genetically predicted mediators. It can also perform conditional association analysis while adjusting for other genetic variants (e.g., known loci for the phenotype). Extensive simulation and real data analyses demonstrate that sMiST yields p-values that agree well with those obtained from individual level data but with substantively improved computational speed. Importantly, a broad application of sMiST to GWAS is possible, as only summary statistics of genetic variant associations are required. We apply sMiST to a large-scale GWAS of colorectal cancer using summary statistics from ∼120, 000 study participants and gene expression data from the Genotype-Tissue Expression (GTEx) project. We identify several novel and secondary independent genetic loci.
format Online
Article
Text
id pubmed-7470748
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-74707482020-09-14 A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study Dong, Xinyuan Su, Yu-Ru Barfield, Richard Bien, Stephanie A. He, Qianchuan Harrison, Tabitha A. Huyghe, Jeroen R. Keku, Temitope O. Lindor, Noralane M. Schafmayer, Clemens Chan, Andrew T. Gruber, Stephen B. Jenkins, Mark A. Kooperberg, Charles Peters, Ulrike Hsu, Li PLoS Genet Research Article Genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants associated with various phenotypes, but together they explain only a fraction of heritability, suggesting many variants have yet to be discovered. Recently it has been recognized that incorporating functional information of genetic variants can improve power for identifying novel loci. For example, S-PrediXcan and TWAS tested the association of predicted gene expression with phenotypes based on GWAS summary statistics by leveraging the information on genetic regulation of gene expression and found many novel loci. However, as genetic variants may have effects on more than one gene and through different mechanisms, these methods likely only capture part of the total effects of these variants. In this paper, we propose a summary statistics-based mixed effects score test (sMiST) that tests for the total effect of both the effect of the mediator by imputing genetically predicted gene expression, like S-PrediXcan and TWAS, and the direct effects of individual variants. It allows for multiple functional annotations and multiple genetically predicted mediators. It can also perform conditional association analysis while adjusting for other genetic variants (e.g., known loci for the phenotype). Extensive simulation and real data analyses demonstrate that sMiST yields p-values that agree well with those obtained from individual level data but with substantively improved computational speed. Importantly, a broad application of sMiST to GWAS is possible, as only summary statistics of genetic variant associations are required. We apply sMiST to a large-scale GWAS of colorectal cancer using summary statistics from ∼120, 000 study participants and gene expression data from the Genotype-Tissue Expression (GTEx) project. We identify several novel and secondary independent genetic loci. Public Library of Science 2020-08-24 /pmc/articles/PMC7470748/ /pubmed/32833970 http://dx.doi.org/10.1371/journal.pgen.1008947 Text en © 2020 Dong et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Dong, Xinyuan
Su, Yu-Ru
Barfield, Richard
Bien, Stephanie A.
He, Qianchuan
Harrison, Tabitha A.
Huyghe, Jeroen R.
Keku, Temitope O.
Lindor, Noralane M.
Schafmayer, Clemens
Chan, Andrew T.
Gruber, Stephen B.
Jenkins, Mark A.
Kooperberg, Charles
Peters, Ulrike
Hsu, Li
A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study
title A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study
title_full A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study
title_fullStr A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study
title_full_unstemmed A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study
title_short A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study
title_sort general framework for functionally informed set-based analysis: application to a large-scale colorectal cancer study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7470748/
https://www.ncbi.nlm.nih.gov/pubmed/32833970
http://dx.doi.org/10.1371/journal.pgen.1008947
work_keys_str_mv AT dongxinyuan ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT suyuru ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT barfieldrichard ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT bienstephaniea ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT heqianchuan ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT harrisontabithaa ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT huyghejeroenr ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT kekutemitopeo ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT lindornoralanem ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT schafmayerclemens ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT chanandrewt ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT gruberstephenb ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT jenkinsmarka ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT kooperbergcharles ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT petersulrike ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT hsuli ageneralframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT dongxinyuan generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT suyuru generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT barfieldrichard generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT bienstephaniea generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT heqianchuan generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT harrisontabithaa generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT huyghejeroenr generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT kekutemitopeo generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT lindornoralanem generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT schafmayerclemens generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT chanandrewt generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT gruberstephenb generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT jenkinsmarka generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT kooperbergcharles generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT petersulrike generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy
AT hsuli generalframeworkforfunctionallyinformedsetbasedanalysisapplicationtoalargescalecolorectalcancerstudy