Power analysis of transcriptome-wide association study: Implications for practical protocol choice

The transcriptome-wide association study (TWAS) has emerged as one of several promising techniques for integrating multi-scale ‘omics’ data into traditional genome-wide association studies (GWAS). Unlike GWAS, which associates phenotypic variance directly with genetic variants, TWAS uses a reference...

Bibliographic Details
Main authors: Cao, Chen; Ding, Bowei; Li, Qing; Kwok, Devin; Wu, Jingjing; Long, Quan
Format: Online Article Text
Language: English
Published: Public Library of Science 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946362/
https://www.ncbi.nlm.nih.gov/pubmed/33635859
http://dx.doi.org/10.1371/journal.pgen.1009405
author Cao, Chen
Ding, Bowei
Li, Qing
Kwok, Devin
Wu, Jingjing
Long, Quan
author_sort Cao, Chen
collection PubMed
description The transcriptome-wide association study (TWAS) has emerged as one of several promising techniques for integrating multi-scale ‘omics’ data into traditional genome-wide association studies (GWAS). Unlike GWAS, which associates phenotypic variance directly with genetic variants, TWAS uses a reference dataset to train a predictive model for gene expressions, which allows it to associate phenotype with variants through the mediating effect of expressions. Although effective, this core innovation of TWAS is poorly understood, since the predictive accuracy of the genotype-expression model is generally low and further bounded by expression heritability. This raises the question: to what degree does the accuracy of the expression model affect the power of TWAS? Furthermore, would replacing predictions with actual, experimentally determined expressions improve power? To answer these questions, we compared the power of GWAS, TWAS, and a hypothetical protocol utilizing real expression data. We derived non-centrality parameters (NCPs) for linear mixed models (LMMs) to enable closed-form calculations of statistical power that do not rely on specific protocol implementations. We examined two representative scenarios: causality (genotype contributes to phenotype through expression) and pleiotropy (genotype contributes directly to both phenotype and expression), and also tested the effects of various properties including expression heritability. Our analysis reveals two main outcomes: (1) Under pleiotropy, the use of predicted expressions in TWAS is superior to actual expressions. This explains why TWAS can function with weak expression models, and shows that TWAS remains relevant even when real expressions are available. (2) GWAS outperforms TWAS when expression heritability is below a threshold of 0.04 under causality, or 0.06 under pleiotropy. Analysis of existing publications suggests that TWAS has been misapplied in place of GWAS, in situations where expression heritability is low.
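The description mentions using non-centrality parameters (NCPs) for closed-form power calculations. As a minimal sketch of that general idea only, the following computes the power of a generic 1-degree-of-freedom test from a given NCP. It is not the authors' LMM derivation: the function name `power_from_ncp`, the default significance level, and the restriction to one degree of freedom are all assumptions made for illustration. It relies on the fact that a noncentral chi-square variable with 1 df and NCP λ is the square of a normal variable with mean √λ and unit variance.

```python
from statistics import NormalDist


def power_from_ncp(ncp: float, alpha: float = 0.05) -> float:
    """Power of a 1-df chi-square test given its non-centrality parameter.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if ncp < 0:
        raise ValueError("ncp must be non-negative")
    if not 0 < alpha < 1:
        raise ValueError("alpha must lie strictly between 0 and 1")

    std_normal = NormalDist()
    # Two-sided critical value of the equivalent z-test.
    z_crit = std_normal.inv_cdf(1 - alpha / 2)
    # Under the alternative, the z-statistic is centred at sqrt(ncp).
    shift = ncp ** 0.5
    # Probability that |Z| exceeds the critical value.
    return std_normal.cdf(shift - z_crit) + std_normal.cdf(-shift - z_crit)


if __name__ == "__main__":
    # Compare power at a few illustrative NCP values (arbitrary choices).
    for ncp in (0.0, 5.0, 10.0, 30.0):
        print(ncp, power_from_ncp(ncp, alpha=0.05))
```

With an NCP of zero the function returns the significance level itself, and power rises monotonically as the NCP grows; comparing protocols then reduces to comparing their NCPs at a fixed threshold.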
format Online
Article
Text
id pubmed-7946362
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-7946362 2021-03-22 Power analysis of transcriptome-wide association study: Implications for practical protocol choice Cao, Chen Ding, Bowei Li, Qing Kwok, Devin Wu, Jingjing Long, Quan PLoS Genet Research Article Public Library of Science 2021-02-26 /pmc/articles/PMC7946362/ /pubmed/33635859 http://dx.doi.org/10.1371/journal.pgen.1009405 Text en © 2021 Cao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title Power analysis of transcriptome-wide association study: Implications for practical protocol choice
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7946362/
https://www.ncbi.nlm.nih.gov/pubmed/33635859
http://dx.doi.org/10.1371/journal.pgen.1009405