Cargando…

Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations

Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized that methods that leverage shared regulatory effects across diff...

Descripción completa

Detalles Bibliográficos
Autores principales: Araujo, Daniel S., Nguyen, Chris, Hu, Xiaowei, Mikhaylova, Anna V., Gignoux, Chris, Ardlie, Kristin, Taylor, Kent D., Durda, Peter, Liu, Yongmei, Papanicolaou, George, Cho, Michael H., Rich, Stephen S., Rotter, Jerome I., Im, Hae Kyung, Manichaikul, Ani, Wheeler, Heather E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10589725/
https://www.ncbi.nlm.nih.gov/pubmed/37869564
http://dx.doi.org/10.1016/j.xhgg.2023.100216
_version_ 1785123844936171520
author Araujo, Daniel S.
Nguyen, Chris
Hu, Xiaowei
Mikhaylova, Anna V.
Gignoux, Chris
Ardlie, Kristin
Taylor, Kent D.
Durda, Peter
Liu, Yongmei
Papanicolaou, George
Cho, Michael H.
Rich, Stephen S.
Rotter, Jerome I.
Im, Hae Kyung
Manichaikul, Ani
Wheeler, Heather E.
author_facet Araujo, Daniel S.
Nguyen, Chris
Hu, Xiaowei
Mikhaylova, Anna V.
Gignoux, Chris
Ardlie, Kristin
Taylor, Kent D.
Durda, Peter
Liu, Yongmei
Papanicolaou, George
Cho, Michael H.
Rich, Stephen S.
Rotter, Jerome I.
Im, Hae Kyung
Manichaikul, Ani
Wheeler, Heather E.
author_sort Araujo, Daniel S.
collection PubMed
description Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized that methods that leverage shared regulatory effects across different conditions, in this case, across different populations, may improve cross-population transcriptome prediction. To test this hypothesis, we made transcriptome prediction models for use in transcriptome-wide association studies (TWASs) using different methods (elastic net, joint-tissue imputation [JTI], matrix expression quantitative trait loci [Matrix eQTL], multivariate adaptive shrinkage in R [MASHR], and transcriptome-integrated genetic association resource [TIGAR]) and tested their out-of-sample transcriptome prediction accuracy in population-matched and cross-population scenarios. Additionally, to evaluate model applicability in TWASs, we integrated publicly available multiethnic genome-wide association study (GWAS) summary statistics from the Population Architecture using Genomics and Epidemiology (PAGE) study and Pan-ancestry genetic analysis of the UK Biobank (PanUKBB) with our developed transcriptome prediction models. In regard to transcriptome prediction accuracy, MASHR models performed better or the same as other methods in both population-matched and cross-population transcriptome predictions. Furthermore, in multiethnic TWASs, MASHR models yielded more discoveries that replicate in both PAGE and PanUKBB across all methods analyzed, including loci previously mapped in GWASs and loci previously not found in GWASs. Overall, our study demonstrates the importance of using methods that benefit from different populations’ effect size estimates in order to improve TWASs for multiethnic or underrepresented populations.
format Online
Article
Text
id pubmed-10589725
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-105897252023-10-22 Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations Araujo, Daniel S. Nguyen, Chris Hu, Xiaowei Mikhaylova, Anna V. Gignoux, Chris Ardlie, Kristin Taylor, Kent D. Durda, Peter Liu, Yongmei Papanicolaou, George Cho, Michael H. Rich, Stephen S. Rotter, Jerome I. Im, Hae Kyung Manichaikul, Ani Wheeler, Heather E. HGG Adv Article Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized that methods that leverage shared regulatory effects across different conditions, in this case, across different populations, may improve cross-population transcriptome prediction. To test this hypothesis, we made transcriptome prediction models for use in transcriptome-wide association studies (TWASs) using different methods (elastic net, joint-tissue imputation [JTI], matrix expression quantitative trait loci [Matrix eQTL], multivariate adaptive shrinkage in R [MASHR], and transcriptome-integrated genetic association resource [TIGAR]) and tested their out-of-sample transcriptome prediction accuracy in population-matched and cross-population scenarios. Additionally, to evaluate model applicability in TWASs, we integrated publicly available multiethnic genome-wide association study (GWAS) summary statistics from the Population Architecture using Genomics and Epidemiology (PAGE) study and Pan-ancestry genetic analysis of the UK Biobank (PanUKBB) with our developed transcriptome prediction models. In regard to transcriptome prediction accuracy, MASHR models performed better or the same as other methods in both population-matched and cross-population transcriptome predictions. Furthermore, in multiethnic TWASs, MASHR models yielded more discoveries that replicate in both PAGE and PanUKBB across all methods analyzed, including loci previously mapped in GWASs and loci previously not found in GWASs. Overall, our study demonstrates the importance of using methods that benefit from different populations’ effect size estimates in order to improve TWASs for multiethnic or underrepresented populations. Elsevier 2023-07-01 /pmc/articles/PMC10589725/ /pubmed/37869564 http://dx.doi.org/10.1016/j.xhgg.2023.100216 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Araujo, Daniel S.
Nguyen, Chris
Hu, Xiaowei
Mikhaylova, Anna V.
Gignoux, Chris
Ardlie, Kristin
Taylor, Kent D.
Durda, Peter
Liu, Yongmei
Papanicolaou, George
Cho, Michael H.
Rich, Stephen S.
Rotter, Jerome I.
Im, Hae Kyung
Manichaikul, Ani
Wheeler, Heather E.
Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations
title Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations
title_full Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations
title_fullStr Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations
title_full_unstemmed Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations
title_short Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations
title_sort multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10589725/
https://www.ncbi.nlm.nih.gov/pubmed/37869564
http://dx.doi.org/10.1016/j.xhgg.2023.100216
work_keys_str_mv AT araujodaniels multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT nguyenchris multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT huxiaowei multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT mikhaylovaannav multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT gignouxchris multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT ardliekristin multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT taylorkentd multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT durdapeter multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT liuyongmei multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT papanicolaougeorge multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT chomichaelh multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT richstephens multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT rotterjeromei multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT imhaekyung multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT manichaikulani multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations
AT wheelerheathere multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations