Cargando…
A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts
One important confounder in genome-wide association studies (GWASs) is population genetic structure, which may generate spurious associations if not properly accounted for. This may ultimately result in a biased polygenic risk score (PRS) prediction, especially when applied to another population. To...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9340200/ https://www.ncbi.nlm.nih.gov/pubmed/35923706 http://dx.doi.org/10.3389/fgene.2022.899523 |
_version_ | 1784760348965863424 |
---|---|
author | Pärna, Katri Nolte, Ilja M. Snieder, Harold Fischer, Krista Marnetto, Davide Pagani, Luca |
author_facet | Pärna, Katri Nolte, Ilja M. Snieder, Harold Fischer, Krista Marnetto, Davide Pagani, Luca |
author_sort | Pärna, Katri |
collection | PubMed |
description | One important confounder in genome-wide association studies (GWASs) is population genetic structure, which may generate spurious associations if not properly accounted for. This may ultimately result in a biased polygenic risk score (PRS) prediction, especially when applied to another population. To explore this matter, we focused on principal component analysis (PCA) and asked whether a population genetics informed strategy focused on PCs derived from an external reference population helps in mitigating this PRS transferability issue. Throughout the study, we used two complex model traits, height and body mass index, and samples from UK and Estonian Biobanks. We aimed to investigate 1) whether using a reference population (1000G) for computation of the PCs adjusted for in the discovery cohort improves the resulting PRS performance in a target set from another population and 2) whether adjusting the validation model for PCs is required at all. Our results showed that any other set of PCs performed worse than the one computed on samples from the same population as the discovery dataset. Furthermore, we show that PC correction in GWAS cannot prevent residual population structure information in the PRS, also for non-structured traits. Therefore, we confirm the utility of PC correction in the validation model when the investigated trait shows an actual correlation with population genetic structure, to account for the residual confounding effect when evaluating the predictive value of PRS. |
format | Online Article Text |
id | pubmed-9340200 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93402002022-08-02 A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts Pärna, Katri Nolte, Ilja M. Snieder, Harold Fischer, Krista Marnetto, Davide Pagani, Luca Front Genet Genetics One important confounder in genome-wide association studies (GWASs) is population genetic structure, which may generate spurious associations if not properly accounted for. This may ultimately result in a biased polygenic risk score (PRS) prediction, especially when applied to another population. To explore this matter, we focused on principal component analysis (PCA) and asked whether a population genetics informed strategy focused on PCs derived from an external reference population helps in mitigating this PRS transferability issue. Throughout the study, we used two complex model traits, height and body mass index, and samples from UK and Estonian Biobanks. We aimed to investigate 1) whether using a reference population (1000G) for computation of the PCs adjusted for in the discovery cohort improves the resulting PRS performance in a target set from another population and 2) whether adjusting the validation model for PCs is required at all. Our results showed that any other set of PCs performed worse than the one computed on samples from the same population as the discovery dataset. Furthermore, we show that PC correction in GWAS cannot prevent residual population structure information in the PRS, also for non-structured traits. Therefore, we confirm the utility of PC correction in the validation model when the investigated trait shows an actual correlation with population genetic structure, to account for the residual confounding effect when evaluating the predictive value of PRS. Frontiers Media S.A. 2022-07-18 /pmc/articles/PMC9340200/ /pubmed/35923706 http://dx.doi.org/10.3389/fgene.2022.899523 Text en Copyright © 2022 Pärna, Nolte, Snieder, Fischer, Marnetto and Pagani. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Pärna, Katri Nolte, Ilja M. Snieder, Harold Fischer, Krista Marnetto, Davide Pagani, Luca A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts |
title | A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts |
title_full | A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts |
title_fullStr | A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts |
title_full_unstemmed | A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts |
title_short | A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts |
title_sort | principal component informed approach to address polygenic risk score transferability across european cohorts |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9340200/ https://www.ncbi.nlm.nih.gov/pubmed/35923706 http://dx.doi.org/10.3389/fgene.2022.899523 |
work_keys_str_mv | AT parnakatri aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT nolteiljam aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT sniederharold aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT fischerkrista aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT marnettodavide aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT paganiluca aprincipalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT parnakatri principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT nolteiljam principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT sniederharold principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT fischerkrista principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT marnettodavide principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts AT paganiluca principalcomponentinformedapproachtoaddresspolygenicriskscoretransferabilityacrosseuropeancohorts |