Cargando…
Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations
Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized that methods that leverage shared regulatory effects across diff...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10589725/ https://www.ncbi.nlm.nih.gov/pubmed/37869564 http://dx.doi.org/10.1016/j.xhgg.2023.100216 |
_version_ | 1785123844936171520 |
---|---|
author | Araujo, Daniel S. Nguyen, Chris Hu, Xiaowei Mikhaylova, Anna V. Gignoux, Chris Ardlie, Kristin Taylor, Kent D. Durda, Peter Liu, Yongmei Papanicolaou, George Cho, Michael H. Rich, Stephen S. Rotter, Jerome I. Im, Hae Kyung Manichaikul, Ani Wheeler, Heather E. |
author_facet | Araujo, Daniel S. Nguyen, Chris Hu, Xiaowei Mikhaylova, Anna V. Gignoux, Chris Ardlie, Kristin Taylor, Kent D. Durda, Peter Liu, Yongmei Papanicolaou, George Cho, Michael H. Rich, Stephen S. Rotter, Jerome I. Im, Hae Kyung Manichaikul, Ani Wheeler, Heather E. |
author_sort | Araujo, Daniel S. |
collection | PubMed |
description | Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized that methods that leverage shared regulatory effects across different conditions, in this case, across different populations, may improve cross-population transcriptome prediction. To test this hypothesis, we made transcriptome prediction models for use in transcriptome-wide association studies (TWASs) using different methods (elastic net, joint-tissue imputation [JTI], matrix expression quantitative trait loci [Matrix eQTL], multivariate adaptive shrinkage in R [MASHR], and transcriptome-integrated genetic association resource [TIGAR]) and tested their out-of-sample transcriptome prediction accuracy in population-matched and cross-population scenarios. Additionally, to evaluate model applicability in TWASs, we integrated publicly available multiethnic genome-wide association study (GWAS) summary statistics from the Population Architecture using Genomics and Epidemiology (PAGE) study and Pan-ancestry genetic analysis of the UK Biobank (PanUKBB) with our developed transcriptome prediction models. In regard to transcriptome prediction accuracy, MASHR models performed better or the same as other methods in both population-matched and cross-population transcriptome predictions. Furthermore, in multiethnic TWASs, MASHR models yielded more discoveries that replicate in both PAGE and PanUKBB across all methods analyzed, including loci previously mapped in GWASs and loci previously not found in GWASs. Overall, our study demonstrates the importance of using methods that benefit from different populations’ effect size estimates in order to improve TWASs for multiethnic or underrepresented populations. |
format | Online Article Text |
id | pubmed-10589725 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-105897252023-10-22 Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations Araujo, Daniel S. Nguyen, Chris Hu, Xiaowei Mikhaylova, Anna V. Gignoux, Chris Ardlie, Kristin Taylor, Kent D. Durda, Peter Liu, Yongmei Papanicolaou, George Cho, Michael H. Rich, Stephen S. Rotter, Jerome I. Im, Hae Kyung Manichaikul, Ani Wheeler, Heather E. HGG Adv Article Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized that methods that leverage shared regulatory effects across different conditions, in this case, across different populations, may improve cross-population transcriptome prediction. To test this hypothesis, we made transcriptome prediction models for use in transcriptome-wide association studies (TWASs) using different methods (elastic net, joint-tissue imputation [JTI], matrix expression quantitative trait loci [Matrix eQTL], multivariate adaptive shrinkage in R [MASHR], and transcriptome-integrated genetic association resource [TIGAR]) and tested their out-of-sample transcriptome prediction accuracy in population-matched and cross-population scenarios. Additionally, to evaluate model applicability in TWASs, we integrated publicly available multiethnic genome-wide association study (GWAS) summary statistics from the Population Architecture using Genomics and Epidemiology (PAGE) study and Pan-ancestry genetic analysis of the UK Biobank (PanUKBB) with our developed transcriptome prediction models. In regard to transcriptome prediction accuracy, MASHR models performed better or the same as other methods in both population-matched and cross-population transcriptome predictions. Furthermore, in multiethnic TWASs, MASHR models yielded more discoveries that replicate in both PAGE and PanUKBB across all methods analyzed, including loci previously mapped in GWASs and loci previously not found in GWASs. Overall, our study demonstrates the importance of using methods that benefit from different populations’ effect size estimates in order to improve TWASs for multiethnic or underrepresented populations. Elsevier 2023-07-01 /pmc/articles/PMC10589725/ /pubmed/37869564 http://dx.doi.org/10.1016/j.xhgg.2023.100216 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Araujo, Daniel S. Nguyen, Chris Hu, Xiaowei Mikhaylova, Anna V. Gignoux, Chris Ardlie, Kristin Taylor, Kent D. Durda, Peter Liu, Yongmei Papanicolaou, George Cho, Michael H. Rich, Stephen S. Rotter, Jerome I. Im, Hae Kyung Manichaikul, Ani Wheeler, Heather E. Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations |
title | Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations |
title_full | Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations |
title_fullStr | Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations |
title_full_unstemmed | Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations |
title_short | Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations |
title_sort | multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10589725/ https://www.ncbi.nlm.nih.gov/pubmed/37869564 http://dx.doi.org/10.1016/j.xhgg.2023.100216 |
work_keys_str_mv | AT araujodaniels multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT nguyenchris multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT huxiaowei multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT mikhaylovaannav multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT gignouxchris multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT ardliekristin multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT taylorkentd multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT durdapeter multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT liuyongmei multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT papanicolaougeorge multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT chomichaelh multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT richstephens multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT rotterjeromei multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT imhaekyung multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT manichaikulani multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations AT wheelerheathere multivariateadaptiveshrinkageimprovescrosspopulationtranscriptomepredictionandassociationstudiesinunderrepresentedpopulations |