Cargando…

Bridging the diversity gap: Analytical and study design considerations for improving the accuracy of trans-ancestry genetic prediction

Genetic prediction of common complex disease risk is an essential component of precision medicine. Currently, genome-wide association studies (GWASs) are mostly composed of European-ancestry samples and resulting polygenic scores (PGSs) have been shown to poorly transfer to other ancestries partly d...

Descripción completa

Detalles Bibliográficos
Autores principales: Bocher, Ozvan, Gilly, Arthur, Park, Young-Chan, Zeggini, Eleftheria, Morris, Andrew P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10336686/
https://www.ncbi.nlm.nih.gov/pubmed/37448981
http://dx.doi.org/10.1016/j.xhgg.2023.100214
_version_ 1785071262126571520
author Bocher, Ozvan
Gilly, Arthur
Park, Young-Chan
Zeggini, Eleftheria
Morris, Andrew P.
author_facet Bocher, Ozvan
Gilly, Arthur
Park, Young-Chan
Zeggini, Eleftheria
Morris, Andrew P.
author_sort Bocher, Ozvan
collection PubMed
description Genetic prediction of common complex disease risk is an essential component of precision medicine. Currently, genome-wide association studies (GWASs) are mostly composed of European-ancestry samples and resulting polygenic scores (PGSs) have been shown to poorly transfer to other ancestries partly due to heterogeneity of allelic effects between populations. Fixed-effects (FETA) and random-effects (RETA) trans-ancestry meta-analyses do not model such ancestry-related heterogeneity, while ancestry-specific (AS) scores may suffer from low power due to low sample sizes. In contrast, trans-ancestry meta-regression (TAMR) builds ancestry-aware PGS that account for more complex trans-ancestry architectures. Here, we examine the predictive performance of these four PGSs under multiple genetic architectures and ancestry configurations. We show that the predictive performance of FETA and RETA is strongly affected by cross-ancestry genetic heterogeneity, while AS PGS performance decreases in under-represented target populations. TAMR PGS is also impacted by heterogeneity but maintains good prediction performance in most situations, especially in ancestry-diverse scenarios. In simulations of human complex traits, TAMR scores currently explain 25% more phenotypic variance than AS in triglyceride levels and 33% more phenotypic variance than FETA in type 2 diabetes in most non-European populations. Importantly, a high proportion of non-European-ancestry individuals is needed to reach prediction levels that are comparable in those populations to the one observed in European-ancestry studies. Our results highlight the need to rebalance the ancestral composition of GWAS to enable accurate prediction in non-European-ancestry groups, and demonstrate the relevance of meta-regression approaches for compensating some of the current population biases in GWAS.
format Online
Article
Text
id pubmed-10336686
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-103366862023-07-13 Bridging the diversity gap: Analytical and study design considerations for improving the accuracy of trans-ancestry genetic prediction Bocher, Ozvan Gilly, Arthur Park, Young-Chan Zeggini, Eleftheria Morris, Andrew P. HGG Adv Report Genetic prediction of common complex disease risk is an essential component of precision medicine. Currently, genome-wide association studies (GWASs) are mostly composed of European-ancestry samples and resulting polygenic scores (PGSs) have been shown to poorly transfer to other ancestries partly due to heterogeneity of allelic effects between populations. Fixed-effects (FETA) and random-effects (RETA) trans-ancestry meta-analyses do not model such ancestry-related heterogeneity, while ancestry-specific (AS) scores may suffer from low power due to low sample sizes. In contrast, trans-ancestry meta-regression (TAMR) builds ancestry-aware PGS that account for more complex trans-ancestry architectures. Here, we examine the predictive performance of these four PGSs under multiple genetic architectures and ancestry configurations. We show that the predictive performance of FETA and RETA is strongly affected by cross-ancestry genetic heterogeneity, while AS PGS performance decreases in under-represented target populations. TAMR PGS is also impacted by heterogeneity but maintains good prediction performance in most situations, especially in ancestry-diverse scenarios. In simulations of human complex traits, TAMR scores currently explain 25% more phenotypic variance than AS in triglyceride levels and 33% more phenotypic variance than FETA in type 2 diabetes in most non-European populations. Importantly, a high proportion of non-European-ancestry individuals is needed to reach prediction levels that are comparable in those populations to the one observed in European-ancestry studies. Our results highlight the need to rebalance the ancestral composition of GWAS to enable accurate prediction in non-European-ancestry groups, and demonstrate the relevance of meta-regression approaches for compensating some of the current population biases in GWAS. Elsevier 2023-06-15 /pmc/articles/PMC10336686/ /pubmed/37448981 http://dx.doi.org/10.1016/j.xhgg.2023.100214 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Report
Bocher, Ozvan
Gilly, Arthur
Park, Young-Chan
Zeggini, Eleftheria
Morris, Andrew P.
Bridging the diversity gap: Analytical and study design considerations for improving the accuracy of trans-ancestry genetic prediction
title Bridging the diversity gap: Analytical and study design considerations for improving the accuracy of trans-ancestry genetic prediction
title_full Bridging the diversity gap: Analytical and study design considerations for improving the accuracy of trans-ancestry genetic prediction
title_fullStr Bridging the diversity gap: Analytical and study design considerations for improving the accuracy of trans-ancestry genetic prediction
title_full_unstemmed Bridging the diversity gap: Analytical and study design considerations for improving the accuracy of trans-ancestry genetic prediction
title_short Bridging the diversity gap: Analytical and study design considerations for improving the accuracy of trans-ancestry genetic prediction
title_sort bridging the diversity gap: analytical and study design considerations for improving the accuracy of trans-ancestry genetic prediction
topic Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10336686/
https://www.ncbi.nlm.nih.gov/pubmed/37448981
http://dx.doi.org/10.1016/j.xhgg.2023.100214
work_keys_str_mv AT bocherozvan bridgingthediversitygapanalyticalandstudydesignconsiderationsforimprovingtheaccuracyoftransancestrygeneticprediction
AT gillyarthur bridgingthediversitygapanalyticalandstudydesignconsiderationsforimprovingtheaccuracyoftransancestrygeneticprediction
AT parkyoungchan bridgingthediversitygapanalyticalandstudydesignconsiderationsforimprovingtheaccuracyoftransancestrygeneticprediction
AT zegginieleftheria bridgingthediversitygapanalyticalandstudydesignconsiderationsforimprovingtheaccuracyoftransancestrygeneticprediction
AT morrisandrewp bridgingthediversitygapanalyticalandstudydesignconsiderationsforimprovingtheaccuracyoftransancestrygeneticprediction