Cargando…

The construction of cross-population polygenic risk scores using transfer learning

As most existing genome-wide association studies (GWASs) were conducted in European-ancestry cohorts, and as the existing polygenic risk score (PRS) models have limited transferability across ancestry groups, PRS research on non-European-ancestry groups needs to make efficient use of available data...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Zhangchen, Fritsche, Lars G., Smith, Jennifer A., Mukherjee, Bhramar, Lee, Seunggeun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9674947/
https://www.ncbi.nlm.nih.gov/pubmed/36240765
http://dx.doi.org/10.1016/j.ajhg.2022.09.010
_version_ 1784833262889205760
author Zhao, Zhangchen
Fritsche, Lars G.
Smith, Jennifer A.
Mukherjee, Bhramar
Lee, Seunggeun
author_facet Zhao, Zhangchen
Fritsche, Lars G.
Smith, Jennifer A.
Mukherjee, Bhramar
Lee, Seunggeun
author_sort Zhao, Zhangchen
collection PubMed
description As most existing genome-wide association studies (GWASs) were conducted in European-ancestry cohorts, and as the existing polygenic risk score (PRS) models have limited transferability across ancestry groups, PRS research on non-European-ancestry groups needs to make efficient use of available data until we attain large sample sizes across all ancestry groups. Here we propose a PRS method using transfer learning techniques. Our approach, TL-PRS, uses gradient descent to fine-tune the baseline PRS model from an ancestry group with large sample GWASs to the dataset of target ancestry. In our application of constructing PRS for seven quantitative and two dichotomous traits for 10,285 individuals of South Asian ancestry and 8,168 individuals of African ancestry in UK Biobank, TL-PRS using PRS-CS as a baseline method obtained 25% average relative improvement for South Asian samples and 29% for African samples compared to the standard PRS-CS method in terms of predicted R(2). Our approach increases the transferability of PRSs across ancestries and thereby helps reduce existing inequities in genetics research.
format Online
Article
Text
id pubmed-9674947
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-96749472022-11-20 The construction of cross-population polygenic risk scores using transfer learning Zhao, Zhangchen Fritsche, Lars G. Smith, Jennifer A. Mukherjee, Bhramar Lee, Seunggeun Am J Hum Genet Article As most existing genome-wide association studies (GWASs) were conducted in European-ancestry cohorts, and as the existing polygenic risk score (PRS) models have limited transferability across ancestry groups, PRS research on non-European-ancestry groups needs to make efficient use of available data until we attain large sample sizes across all ancestry groups. Here we propose a PRS method using transfer learning techniques. Our approach, TL-PRS, uses gradient descent to fine-tune the baseline PRS model from an ancestry group with large sample GWASs to the dataset of target ancestry. In our application of constructing PRS for seven quantitative and two dichotomous traits for 10,285 individuals of South Asian ancestry and 8,168 individuals of African ancestry in UK Biobank, TL-PRS using PRS-CS as a baseline method obtained 25% average relative improvement for South Asian samples and 29% for African samples compared to the standard PRS-CS method in terms of predicted R(2). Our approach increases the transferability of PRSs across ancestries and thereby helps reduce existing inequities in genetics research. Elsevier 2022-11-03 2022-10-13 /pmc/articles/PMC9674947/ /pubmed/36240765 http://dx.doi.org/10.1016/j.ajhg.2022.09.010 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Zhao, Zhangchen
Fritsche, Lars G.
Smith, Jennifer A.
Mukherjee, Bhramar
Lee, Seunggeun
The construction of cross-population polygenic risk scores using transfer learning
title The construction of cross-population polygenic risk scores using transfer learning
title_full The construction of cross-population polygenic risk scores using transfer learning
title_fullStr The construction of cross-population polygenic risk scores using transfer learning
title_full_unstemmed The construction of cross-population polygenic risk scores using transfer learning
title_short The construction of cross-population polygenic risk scores using transfer learning
title_sort construction of cross-population polygenic risk scores using transfer learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9674947/
https://www.ncbi.nlm.nih.gov/pubmed/36240765
http://dx.doi.org/10.1016/j.ajhg.2022.09.010
work_keys_str_mv AT zhaozhangchen theconstructionofcrosspopulationpolygenicriskscoresusingtransferlearning
AT fritschelarsg theconstructionofcrosspopulationpolygenicriskscoresusingtransferlearning
AT smithjennifera theconstructionofcrosspopulationpolygenicriskscoresusingtransferlearning
AT mukherjeebhramar theconstructionofcrosspopulationpolygenicriskscoresusingtransferlearning
AT leeseunggeun theconstructionofcrosspopulationpolygenicriskscoresusingtransferlearning
AT zhaozhangchen constructionofcrosspopulationpolygenicriskscoresusingtransferlearning
AT fritschelarsg constructionofcrosspopulationpolygenicriskscoresusingtransferlearning
AT smithjennifera constructionofcrosspopulationpolygenicriskscoresusingtransferlearning
AT mukherjeebhramar constructionofcrosspopulationpolygenicriskscoresusingtransferlearning
AT leeseunggeun constructionofcrosspopulationpolygenicriskscoresusingtransferlearning