Cargando…

A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts

One important confounder in genome-wide association studies (GWASs) is population genetic structure, which may generate spurious associations if not properly accounted for. This may ultimately result in a biased polygenic risk score (PRS) prediction, especially when applied to another population. To...

Descripción completa

Detalles Bibliográficos
Autores principales: Pärna, Katri, Nolte, Ilja M., Snieder, Harold, Fischer, Krista, Marnetto, Davide, Pagani, Luca
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9340200/
https://www.ncbi.nlm.nih.gov/pubmed/35923706
http://dx.doi.org/10.3389/fgene.2022.899523
_version_ 1784760348965863424
author Pärna, Katri
Nolte, Ilja M.
Snieder, Harold
Fischer, Krista
Marnetto, Davide
Pagani, Luca
author_facet Pärna, Katri
Nolte, Ilja M.
Snieder, Harold
Fischer, Krista
Marnetto, Davide
Pagani, Luca
author_sort Pärna, Katri
collection PubMed
description One important confounder in genome-wide association studies (GWASs) is population genetic structure, which may generate spurious associations if not properly accounted for. This may ultimately result in a biased polygenic risk score (PRS) prediction, especially when applied to another population. To explore this matter, we focused on principal component analysis (PCA) and asked whether a population genetics informed strategy focused on PCs derived from an external reference population helps in mitigating this PRS transferability issue. Throughout the study, we used two complex model traits, height and body mass index, and samples from UK and Estonian Biobanks. We aimed to investigate 1) whether using a reference population (1000G) for computation of the PCs adjusted for in the discovery cohort improves the resulting PRS performance in a target set from another population and 2) whether adjusting the validation model for PCs is required at all. Our results showed that any other set of PCs performed worse than the one computed on samples from the same population as the discovery dataset. Furthermore, we show that PC correction in GWAS cannot prevent residual population structure information in the PRS, also for non-structured traits. Therefore, we confirm the utility of PC correction in the validation model when the investigated trait shows an actual correlation with population genetic structure, to account for the residual confounding effect when evaluating the predictive value of PRS.
format Online
Article
Text
id pubmed-9340200
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93402002022-08-02 A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts Pärna, Katri Nolte, Ilja M. Snieder, Harold Fischer, Krista Marnetto, Davide Pagani, Luca Front Genet Genetics One important confounder in genome-wide association studies (GWASs) is population genetic structure, which may generate spurious associations if not properly accounted for. This may ultimately result in a biased polygenic risk score (PRS) prediction, especially when applied to another population. To explore this matter, we focused on principal component analysis (PCA) and asked whether a population genetics informed strategy focused on PCs derived from an external reference population helps in mitigating this PRS transferability issue. Throughout the study, we used two complex model traits, height and body mass index, and samples from UK and Estonian Biobanks. We aimed to investigate 1) whether using a reference population (1000G) for computation of the PCs adjusted for in the discovery cohort improves the resulting PRS performance in a target set from another population and 2) whether adjusting the validation model for PCs is required at all. Our results showed that any other set of PCs performed worse than the one computed on samples from the same population as the discovery dataset. Furthermore, we show that PC correction in GWAS cannot prevent residual population structure information in the PRS, also for non-structured traits. Therefore, we confirm the utility of PC correction in the validation model when the investigated trait shows an actual correlation with population genetic structure, to account for the residual confounding effect when evaluating the predictive value of PRS. Frontiers Media S.A. 2022-07-18 /pmc/articles/PMC9340200/ /pubmed/35923706 http://dx.doi.org/10.3389/fgene.2022.899523 Text en Copyright © 2022 Pärna, Nolte, Snieder, Fischer, Marnetto and Pagani. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Pärna, Katri
Nolte, Ilja M.
Snieder, Harold
Fischer, Krista
Marnetto, Davide
Pagani, Luca
A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts
title A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts
title_full A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts
title_fullStr A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts
title_full_unstemmed A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts
title_short A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts
title_sort principal component informed approach to address polygenic risk score transferability across european cohorts
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9340200/
https://www.ncbi.nlm.nih.gov/pubmed/35923706
http://dx.doi.org/10.3389/fgene.2022.899523
work_keys_str_mv AT parnakatri aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT nolteiljam aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT sniederharold aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT fischerkrista aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT marnettodavide aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT paganiluca aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT parnakatri principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT nolteiljam principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT sniederharold principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT fischerkrista principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT marnettodavide principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts
AT paganiluca principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts