Cargando…

Optimal strategies for learning multi-ancestry polygenic scores vary across traits

Polygenic scores (PGSs) are individual-level measures that aggregate the genome-wide genetic predisposition to a given trait. As PGS have predominantly been developed using European-ancestry samples, trait prediction using such European ancestry-derived PGS is less accurate in non-European ancestry...

Descripción completa

Detalles Bibliográficos
Autores principales: Lehmann, Brieuc, Mackintosh, Maxine, McVean, Gil, Holmes, Chris
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10328935/
https://www.ncbi.nlm.nih.gov/pubmed/37419925
http://dx.doi.org/10.1038/s41467-023-38930-7
_version_ 1785069913314951168
author Lehmann, Brieuc
Mackintosh, Maxine
McVean, Gil
Holmes, Chris
author_facet Lehmann, Brieuc
Mackintosh, Maxine
McVean, Gil
Holmes, Chris
author_sort Lehmann, Brieuc
collection PubMed
description Polygenic scores (PGSs) are individual-level measures that aggregate the genome-wide genetic predisposition to a given trait. As PGS have predominantly been developed using European-ancestry samples, trait prediction using such European ancestry-derived PGS is less accurate in non-European ancestry individuals. Although there has been recent progress in combining multiple PGS trained on distinct populations, the problem of how to maximize performance given a multiple-ancestry cohort is largely unexplored. Here, we investigate the effect of sample size and ancestry composition on PGS performance for fifteen traits in UK Biobank. For some traits, PGS estimated using a relatively small African-ancestry training set outperformed, on an African-ancestry test set, PGS estimated using a much larger European-ancestry only training set. We observe similar, but not identical, results when considering other minority-ancestry groups within UK Biobank. Our results emphasise the importance of targeted data collection from underrepresented groups in order to address existing disparities in PGS performance.
format Online
Article
Text
id pubmed-10328935
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-103289352023-07-09 Optimal strategies for learning multi-ancestry polygenic scores vary across traits Lehmann, Brieuc Mackintosh, Maxine McVean, Gil Holmes, Chris Nat Commun Article Polygenic scores (PGSs) are individual-level measures that aggregate the genome-wide genetic predisposition to a given trait. As PGS have predominantly been developed using European-ancestry samples, trait prediction using such European ancestry-derived PGS is less accurate in non-European ancestry individuals. Although there has been recent progress in combining multiple PGS trained on distinct populations, the problem of how to maximize performance given a multiple-ancestry cohort is largely unexplored. Here, we investigate the effect of sample size and ancestry composition on PGS performance for fifteen traits in UK Biobank. For some traits, PGS estimated using a relatively small African-ancestry training set outperformed, on an African-ancestry test set, PGS estimated using a much larger European-ancestry only training set. We observe similar, but not identical, results when considering other minority-ancestry groups within UK Biobank. Our results emphasise the importance of targeted data collection from underrepresented groups in order to address existing disparities in PGS performance. Nature Publishing Group UK 2023-07-07 /pmc/articles/PMC10328935/ /pubmed/37419925 http://dx.doi.org/10.1038/s41467-023-38930-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Lehmann, Brieuc
Mackintosh, Maxine
McVean, Gil
Holmes, Chris
Optimal strategies for learning multi-ancestry polygenic scores vary across traits
title Optimal strategies for learning multi-ancestry polygenic scores vary across traits
title_full Optimal strategies for learning multi-ancestry polygenic scores vary across traits
title_fullStr Optimal strategies for learning multi-ancestry polygenic scores vary across traits
title_full_unstemmed Optimal strategies for learning multi-ancestry polygenic scores vary across traits
title_short Optimal strategies for learning multi-ancestry polygenic scores vary across traits
title_sort optimal strategies for learning multi-ancestry polygenic scores vary across traits
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10328935/
https://www.ncbi.nlm.nih.gov/pubmed/37419925
http://dx.doi.org/10.1038/s41467-023-38930-7
work_keys_str_mv AT lehmannbrieuc optimalstrategiesforlearningmultiancestrypolygenicscoresvaryacrosstraits
AT mackintoshmaxine optimalstrategiesforlearningmultiancestrypolygenicscoresvaryacrosstraits
AT mcveangil optimalstrategiesforlearningmultiancestrypolygenicscoresvaryacrosstraits
AT holmeschris optimalstrategiesforlearningmultiancestrypolygenicscoresvaryacrosstraits