Cargando…

Evaluation of PrediXcan for prioritizing GWAS associations and predicting gene expression()

Genome-wide association studies (GWAS) have been successful in facilitating the understanding of genetic architecture behind human diseases, but this approach faces many challenges. To identify disease-related loci with modest to weak effect size, GWAS requires very large sample sizes, which can be...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Binglan, Verma, Shefali S., Veturi, Yogasudha C., Verma, Anurag, Bradford, Yuki, Haas, David W., Ritchie, Marylyn D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5749400/
https://www.ncbi.nlm.nih.gov/pubmed/29218904
_version_ 1783289583692152832
author Li, Binglan
Verma, Shefali S.
Veturi, Yogasudha C.
Verma, Anurag
Bradford, Yuki
Haas, David W.
Ritchie, Marylyn D.
author_facet Li, Binglan
Verma, Shefali S.
Veturi, Yogasudha C.
Verma, Anurag
Bradford, Yuki
Haas, David W.
Ritchie, Marylyn D.
author_sort Li, Binglan
collection PubMed
description Genome-wide association studies (GWAS) have been successful in facilitating the understanding of genetic architecture behind human diseases, but this approach faces many challenges. To identify disease-related loci with modest to weak effect size, GWAS requires very large sample sizes, which can be computational burdensome. In addition, the interpretation of discovered associations remains difficult. PrediXcan was developed to help address these issues. With built in SNP-expression models, PrediXcan is able to predict the expression of genes that are regulated by putative expression quantitative trait loci (eQTLs), and these predicted expression levels can then be used to perform gene-based association studies. This approach reduces the multiple testing burden from millions of variants down to several thousand genes. But most importantly, the identified associations can reveal the genes that are under regulation of eQTLs and consequently involved in disease pathogenesis. In this study, two of the most practical functions of PrediXcan were tested: 1) predicting gene expression, and 2) prioritizing GWAS results. We tested the prediction accuracy of PrediXcan by comparing the predicted and observed gene expression levels, and also looked into some potential influential factors and a filter criterion with the aim of improving PrediXcan performance. As for GWAS prioritization, predicted gene expression levels were used to obtain gene-trait associations, and background regions of significant associations were examined to decrease the likelihood of false positives. Our results showed that 1) PrediXcan predicted gene expression levels accurately for some but not all genes; 2) including more putative eQTLs into prediction did not improve the prediction accuracy; and 3) integrating predicted gene expression levels from the two PrediXcan whole blood models did not eliminate false positives. Still, PrediXcan was able to prioritize GWAS associations that were below the genome-wide significance threshold in GWAS, while retaining GWAS significant results. This study suggests several ways to consider PrediXcan’s performance that will be of value to eQTL and complex human disease research.
format Online
Article
Text
id pubmed-5749400
institution National Center for Biotechnology Information
language English
publishDate 2018
record_format MEDLINE/PubMed
spelling pubmed-57494002018-01-02 Evaluation of PrediXcan for prioritizing GWAS associations and predicting gene expression() Li, Binglan Verma, Shefali S. Veturi, Yogasudha C. Verma, Anurag Bradford, Yuki Haas, David W. Ritchie, Marylyn D. Pac Symp Biocomput Article Genome-wide association studies (GWAS) have been successful in facilitating the understanding of genetic architecture behind human diseases, but this approach faces many challenges. To identify disease-related loci with modest to weak effect size, GWAS requires very large sample sizes, which can be computational burdensome. In addition, the interpretation of discovered associations remains difficult. PrediXcan was developed to help address these issues. With built in SNP-expression models, PrediXcan is able to predict the expression of genes that are regulated by putative expression quantitative trait loci (eQTLs), and these predicted expression levels can then be used to perform gene-based association studies. This approach reduces the multiple testing burden from millions of variants down to several thousand genes. But most importantly, the identified associations can reveal the genes that are under regulation of eQTLs and consequently involved in disease pathogenesis. In this study, two of the most practical functions of PrediXcan were tested: 1) predicting gene expression, and 2) prioritizing GWAS results. We tested the prediction accuracy of PrediXcan by comparing the predicted and observed gene expression levels, and also looked into some potential influential factors and a filter criterion with the aim of improving PrediXcan performance. As for GWAS prioritization, predicted gene expression levels were used to obtain gene-trait associations, and background regions of significant associations were examined to decrease the likelihood of false positives. Our results showed that 1) PrediXcan predicted gene expression levels accurately for some but not all genes; 2) including more putative eQTLs into prediction did not improve the prediction accuracy; and 3) integrating predicted gene expression levels from the two PrediXcan whole blood models did not eliminate false positives. Still, PrediXcan was able to prioritize GWAS associations that were below the genome-wide significance threshold in GWAS, while retaining GWAS significant results. This study suggests several ways to consider PrediXcan’s performance that will be of value to eQTL and complex human disease research. 2018 /pmc/articles/PMC5749400/ /pubmed/29218904 Text en http://creativecommons.org/licenses/by-nc/4.0/ Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License.
spellingShingle Article
Li, Binglan
Verma, Shefali S.
Veturi, Yogasudha C.
Verma, Anurag
Bradford, Yuki
Haas, David W.
Ritchie, Marylyn D.
Evaluation of PrediXcan for prioritizing GWAS associations and predicting gene expression()
title Evaluation of PrediXcan for prioritizing GWAS associations and predicting gene expression()
title_full Evaluation of PrediXcan for prioritizing GWAS associations and predicting gene expression()
title_fullStr Evaluation of PrediXcan for prioritizing GWAS associations and predicting gene expression()
title_full_unstemmed Evaluation of PrediXcan for prioritizing GWAS associations and predicting gene expression()
title_short Evaluation of PrediXcan for prioritizing GWAS associations and predicting gene expression()
title_sort evaluation of predixcan for prioritizing gwas associations and predicting gene expression()
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5749400/
https://www.ncbi.nlm.nih.gov/pubmed/29218904
work_keys_str_mv AT libinglan evaluationofpredixcanforprioritizinggwasassociationsandpredictinggeneexpression
AT vermashefalis evaluationofpredixcanforprioritizinggwasassociationsandpredictinggeneexpression
AT veturiyogasudhac evaluationofpredixcanforprioritizinggwasassociationsandpredictinggeneexpression
AT vermaanurag evaluationofpredixcanforprioritizinggwasassociationsandpredictinggeneexpression
AT bradfordyuki evaluationofpredixcanforprioritizinggwasassociationsandpredictinggeneexpression
AT haasdavidw evaluationofpredixcanforprioritizinggwasassociationsandpredictinggeneexpression
AT ritchiemarylynd evaluationofpredixcanforprioritizinggwasassociationsandpredictinggeneexpression