Cargando…

Multivariate adaptive shrinkage improves cross-population transcriptome prediction for transcriptome-wide association studies in underrepresented populations

Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized methods that leverage shared regulatory effects across different...

Descripción completa

Detalles Bibliográficos
Autores principales: Araujo, Daniel S., Nguyen, Chris, Hu, Xiaowei, Mikhaylova, Anna V., Gignoux, Chris, Ardlie, Kristin, Taylor, Kent D., Durda, Peter, Liu, Yongmei, Papanicolaou, George, Cho, Michael H., Rich, Stephen S., Rotter, Jerome I., Im, Hae Kyung, Manichaikul, Ani, Wheeler, Heather E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9934635/
https://www.ncbi.nlm.nih.gov/pubmed/36798214
http://dx.doi.org/10.1101/2023.02.09.527747
_version_ 1784889922123988992
author Araujo, Daniel S.
Nguyen, Chris
Hu, Xiaowei
Mikhaylova, Anna V.
Gignoux, Chris
Ardlie, Kristin
Taylor, Kent D.
Durda, Peter
Liu, Yongmei
Papanicolaou, George
Cho, Michael H.
Rich, Stephen S.
Rotter, Jerome I.
Im, Hae Kyung
Manichaikul, Ani
Wheeler, Heather E.
author_facet Araujo, Daniel S.
Nguyen, Chris
Hu, Xiaowei
Mikhaylova, Anna V.
Gignoux, Chris
Ardlie, Kristin
Taylor, Kent D.
Durda, Peter
Liu, Yongmei
Papanicolaou, George
Cho, Michael H.
Rich, Stephen S.
Rotter, Jerome I.
Im, Hae Kyung
Manichaikul, Ani
Wheeler, Heather E.
author_sort Araujo, Daniel S.
collection PubMed
description Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized methods that leverage shared regulatory effects across different conditions, in this case, across different populations may improve cross-population transcriptome prediction. To test this hypothesis, we made transcriptome prediction models for use in transcriptome-wide association studies (TWAS) using different methods (Elastic Net, Joint-Tissue Imputation (JTI), Matrix eQTL, Multivariate Adaptive Shrinkage in R (MASHR), and Transcriptome-Integrated Genetic Association Resource (TIGAR)) and tested their out-of-sample transcriptome prediction accuracy in population-matched and cross-population scenarios. Additionally, to evaluate model applicability in TWAS, we integrated publicly available multi-ethnic genome-wide association study (GWAS) summary statistics from the Population Architecture using Genomics and Epidemiology Study (PAGE) and Pan-UK Biobank with our developed transcriptome prediction models. In regard to transcriptome prediction accuracy, MASHR models performed better or the same as other methods in both population-matched and cross-population transcriptome predictions. Furthermore, in multi-ethnic TWAS, MASHR models yielded more discoveries that replicate in both PAGE and PanUKBB across all methods analyzed, including loci previously mapped in GWAS and new loci previously not found in GWAS. Overall, our study demonstrates the importance of using methods that benefit from different populations’ effect size estimates in order to improve TWAS for multi-ethnic or underrepresented populations.
format Online
Article
Text
id pubmed-9934635
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-99346352023-02-17 Multivariate adaptive shrinkage improves cross-population transcriptome prediction for transcriptome-wide association studies in underrepresented populations Araujo, Daniel S. Nguyen, Chris Hu, Xiaowei Mikhaylova, Anna V. Gignoux, Chris Ardlie, Kristin Taylor, Kent D. Durda, Peter Liu, Yongmei Papanicolaou, George Cho, Michael H. Rich, Stephen S. Rotter, Jerome I. Im, Hae Kyung Manichaikul, Ani Wheeler, Heather E. bioRxiv Article Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized methods that leverage shared regulatory effects across different conditions, in this case, across different populations may improve cross-population transcriptome prediction. To test this hypothesis, we made transcriptome prediction models for use in transcriptome-wide association studies (TWAS) using different methods (Elastic Net, Joint-Tissue Imputation (JTI), Matrix eQTL, Multivariate Adaptive Shrinkage in R (MASHR), and Transcriptome-Integrated Genetic Association Resource (TIGAR)) and tested their out-of-sample transcriptome prediction accuracy in population-matched and cross-population scenarios. Additionally, to evaluate model applicability in TWAS, we integrated publicly available multi-ethnic genome-wide association study (GWAS) summary statistics from the Population Architecture using Genomics and Epidemiology Study (PAGE) and Pan-UK Biobank with our developed transcriptome prediction models. In regard to transcriptome prediction accuracy, MASHR models performed better or the same as other methods in both population-matched and cross-population transcriptome predictions. Furthermore, in multi-ethnic TWAS, MASHR models yielded more discoveries that replicate in both PAGE and PanUKBB across all methods analyzed, including loci previously mapped in GWAS and new loci previously not found in GWAS. Overall, our study demonstrates the importance of using methods that benefit from different populations’ effect size estimates in order to improve TWAS for multi-ethnic or underrepresented populations. Cold Spring Harbor Laboratory 2023-05-20 /pmc/articles/PMC9934635/ /pubmed/36798214 http://dx.doi.org/10.1101/2023.02.09.527747 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Araujo, Daniel S.
Nguyen, Chris
Hu, Xiaowei
Mikhaylova, Anna V.
Gignoux, Chris
Ardlie, Kristin
Taylor, Kent D.
Durda, Peter
Liu, Yongmei
Papanicolaou, George
Cho, Michael H.
Rich, Stephen S.
Rotter, Jerome I.
Im, Hae Kyung
Manichaikul, Ani
Wheeler, Heather E.
Multivariate adaptive shrinkage improves cross-population transcriptome prediction for transcriptome-wide association studies in underrepresented populations
title Multivariate adaptive shrinkage improves cross-population transcriptome prediction for transcriptome-wide association studies in underrepresented populations
title_full Multivariate adaptive shrinkage improves cross-population transcriptome prediction for transcriptome-wide association studies in underrepresented populations
title_fullStr Multivariate adaptive shrinkage improves cross-population transcriptome prediction for transcriptome-wide association studies in underrepresented populations
title_full_unstemmed Multivariate adaptive shrinkage improves cross-population transcriptome prediction for transcriptome-wide association studies in underrepresented populations
title_short Multivariate adaptive shrinkage improves cross-population transcriptome prediction for transcriptome-wide association studies in underrepresented populations
title_sort multivariate adaptive shrinkage improves cross-population transcriptome prediction for transcriptome-wide association studies in underrepresented populations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9934635/
https://www.ncbi.nlm.nih.gov/pubmed/36798214
http://dx.doi.org/10.1101/2023.02.09.527747
work_keys_str_mv AT araujodaniels multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT nguyenchris multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT huxiaowei multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT mikhaylovaannav multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT gignouxchris multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT ardliekristin multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT taylorkentd multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT durdapeter multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT liuyongmei multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT papanicolaougeorge multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT chomichaelh multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT richstephens multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT rotterjeromei multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT imhaekyung multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT manichaikulani multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations
AT wheelerheathere multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionfortranscriptomewideassociationstudiesinunderrepresentedpopulations